Vous êtes sur la page 1sur 1393

THE

ENCYCLOPEDIA
OF PHYSICS
THE
ENCYCLOPEDIA
OF PHYSICS
THIRD EDITION

EDITED BY

Robert M. Besancon

~ SPRINGER SCIENCE+BUSINESS MEDIA, LLC


Copyright e 1990 by Springer Science+Business Media New York
Originally publishcd by Van Nostrand Reinhold in 1990

Ubrary of congress Catllog Card. Number 84-13045


ISBN 978-0-442-00522-1

All righu reserved . Certain ponions ofthiJ work mpyright


e 1985, 1974 and 1966 by Van Nostrand Reinhold.
No part of this work cove red b)' the cop)'ri8ht hereon may
be rep rodu ced or used in any form o r by an )' means-graphic ,
electronic, or mecha nical, includin g ph otoco pying, recording,
tapin g, or information stouge and retrieva l systems- with ou t
permission of the publisher.

15 14 13 12 11 10 9 8 7 6 5 4 3 2

Ubrary of COngresa Cataloging in PubUcatlon Dltl

Main en tr y under title:

Th e Encyclope dia of ph ysics.

1. Physics -Dic ti onaries. 1. Besa n~on , Robert M.


(Ro bert Manln)
OC5.E546 1984 530' ,03' 21 84'13045
ISBN 978-0-442-00522-1 ISBN 978-1-4615-6902-2 (eBook)
DOI 10.1007/978-1-4615-6902-2
CONTRIBUTORS
JEROME L. ACKERMAN, Department of Chem- JOSEPH J. BECKER, General Electric Research
istry, University of Cincinnati, Ohio. Magnetic and Development Center, Schenectady, New
Resonance. York. Magnetism.
KEITSIRO AIZU , Hitachi Central Research ALBERT C. BEER, Battelle-Columbus Laborato-
Laboratory, Tokyo. Ferroicity, Ferroelectric- ries, Ohio. Hall Effect and Related Phenomena.
ity, and Ferroelasticity. BARRY A. BELL, Electrosystems Division, Na-
CHRISTOPHER W. ALLEN, Department of tional Bureau of Standards. Electrical Mea-
Chemistry, University of Vermont. Chemistry. surements (with Forest K. Harris).
DOUGLAS L. ALLEN, Emery Industries, Cincin- DA VID A. BELL, Emeritus, University of Hull,
nati, Ohio. Vapor Pressure and Evaporation. England. Cybernetics.
CHARLES L. ALLEY, Department of Electrical H. E. BENNETT, Michelson Laboratory, Naval
Engineering, University of Utah. Modulation. Weapons Center, China Lake, California.
ROBERT C. AMME, University of Denver, Colo- Reflection.
rado. Ionization. REUBEN BENUMOF, Department of Applied
DA VID L. ANDERSON, Department of Physics, Sciences, College of Staten Island, New York.
Oberlin College, Ohio. Electron. Alternating Currents.
C. L. ANDREWS, Emeritus, Department of M. J. BERAN, School of Engineering, Tel Aviv
Physics, State University of New York at University, Israel. Coherence.
Albany. Doppler Effect. LEO L. BERANEK, Acoustical Consultant, Win-
ROBERT E. APFEL, Applied Mechanics, Yale chester, Massachusetts. Architectural Acous-
University. Cavitation. tics.
H. L. ARMSTRONG, Department of Physics, ERIK BERGSTRAND, Velocity of Light.
Queen's University, Kingston, Ontario. States ARTHUR I. BERMAN, Ris¢> Library, Denmark.
of Matter. Astronautics.
ATAM P. ARYA, Department of Physics, West ROBERT M. BESAN<;ON, Editor, The Encyclo-
Virginia University. Simple Machines. pedia of Physics, 515 Grand Avenue, Dayton,
P. W. ATKINS, Physical Chemistry Laboratory, Ohio. Physics.
University of Oxford, England. Physical GEORGE L. BEYER, Retired, Eastman Kodak
Chemistry. Company. Molecular Weight.
GEORGE E. BACON, University of Sheffield, P. J. BILLING ,Induction Heating.
England. Neutron Diffraction. EDWARD A. BIRGE, Department of Botany
D. C. BAIRD, Royal Military College of Canada, and Microbiology, Arizona State University.
Kingston, Ontario. Measurements, Principles Molecular Biology.
of. CHARLES A. BITTMANN, Solid State Labora-
RADU BALESCU, Faculty of Sciences, Univer- tory, Hewlett-Packard Company, Palo Alto,
site Libre de Bruxelles, Belgium. Statistical California. Transistors.
Mechanics. ALFRED K. BLACKADAR, Department of Mete-
WILLIAM BAND, Department of Physics, Wash- orology, Pennsylvania State University.
ington State University. Mathematical Physics Meteorology.
and Quantum Theory. JOHN P. BLEWETT, Brookhaven National Lab-
L. E. BARBROW, National Bureau of Standards,
Washington, D.C. Photometry. oratory. Accelerators, Linear.
JAMES A. BARNES, National Bureau of Stan- N. BLOEMBERGEN, Harvard University. Light.
dards, Boulder, Colorado. Atomic Clocks. WARREN B. BOAST, Emeritus, Department of
C. J. BARTLESON, Research Laboratories, East- Electrical Engineering, Iowa State University.
man Kodak Company. Color. Potential.
ROBERT P. BAUMAN, Department of Physics, ANDREW H. BOBECK, AT&T Bell Laboratories.
University of Alabama in Birmingham. Absorp- Ferrimagnetism (with W. H. von Aulock).
tion Spectra. H. V. BOHM, Department of Physics, Wayne
WILLIAM L. BAUN, The Materials Laboratory, State University, Detroit, Michigan. Fermi
Wright Patterson Air Force Base, Ohio. Surface (with Norman Tepley and George
X-rays. Crabtree).
v
CONTRIBUTORS vi

BRUCE BOLT, Department of Geology and Geo- E. RICHARD COHEN, Rockwell International
physics, University of California, Berkeley. Science Center, Thousand Oaks, California.
Seismology. Constants, Fundamental.
JILL C. BONNER, Department of Physics, Uni- C. SHARP COOK, Department of Physics, Uni-
versity of Rhode Island. Antiferromagnetism. versity of Texas at EI Paso. Fallout.
ROBERT M. BOYNTON, Department of Psy- JOHN C. CORBIN, U.S. Air Force Aeronautical
chology, University of California, San Diego. Systems Division, Wright-Patterson Air Force
Vision and the Eye. Base, Ohio. Skin Effect.
G. E. BRIGGS, Emeritus, University of Cam- H. COTTON, University of Nottingham, England.
bridge, England. Osmosis. Optics, Geometrical.
FREDERICK C. BROCKHURST, Electrical Engi- HERMAN V. COTTONY, H. V. Cottony Consult-
neering Department, Virginia Polytechnic ing Service, Bethesda, Maryland. Antennas.
Institute and State University. Motors, Elec- GEORGE CRABTREE, Argonne National Labora-
tric and Electric Power Generation. tory, Illinois. Fermi Surface (with H. V. Bohm
STANLEY J. BRODSKY, Stanford Linear Accel- and Norman Tepley).
erator Center, Stanford University. Quantum ROBERT G. CUNNINGHAM, Manufacturing
Electrodynamics (with Toichiro Kinoshita). Technology Division, Eastman Kodak Com-
JAMES J. BROPHY, University of Utah. Elec- pany. Static Electricity (with D. J. Montgom-
tronics. ery).
LAURIE M. BROWN, Department of Physics R. H. DAVIS, Department of Physics, Florida
and Astronomy, Northwestern University, State University. Proton.
Evanston, Illinois. Gauge Theories. L. WALLACE DEAN, III, Pratt and Whitney Air-
STEPHEN G. BRUSH, Department of History craft Division of United Aircraft, East Hart-
and Institute for Physical Science and Tech- ford, Connecticut. Physical Acoustics.
nology, University of Maryland. Kinetic PETER G. DEBR UNNER , Department of Physics,
Theory and Irreversibility. University of Illinois, Urbana. Mossbauer
H. A. BUCHDAHL, Australian National Univer- Effect (with Robert L. Ingalls).
sity, Canberra. Thermodynamics. BARBARA DECKER, Volcanology (with Robert
DONALD G. BURKHARD, Department of Physics W. Decker).
and Astronomy, University of Georgia. Irradi- ROBERT W. DECKER, U.S. Geological Survey.
ance Calculations, Microwave Spectroscopy, Volcanology (with Barbara Decker).
and Solar Concentrator Design, Optics of JOHN DE SANTO, Department of Mathematics,
E. R. CAIANIELLO, Laboratorio di Cibernetica Colorado School of Mines. Ocean Acoustics.
del Consiglio Nationale delle Ricerche, Naples, N. G. DESHPANDE, Department of Physics and
Italy. Field Theory. Institute of Theoretical Science, University of
ELTON J. CAIRNS, Lawrence Berkeley Labora- Oregon. Current Algebra and Electroweak
tory and University of California, Berkeley. Theory.
Energy Storage, Electrochemical. R. E. DE WAMES, Advanced Technology Energy
EARL CALLEN, Department of Physics, Ameri- Systems Group, Canoga Park, California. Spin
can University, Washington, D.C. Magneto- Waves (with T. Wolfram).
striction. A. DINSDALE, Retired, British Ceramic Re-
G. S. CARGILL, III, IBM Thomas J. Watson search Association, Stoke-on-Trent, England.
Research Center, Yorktown Heights, New Viscosity.
York. Amorphous Metals. RICHARD H. DITTMAN, Department of Physics,
THEODORE G. CASTNER, Department of Phys- University of Wisconsin-Milwaukee. Heat.
ics and Astronomy, University of Rochester, ROBERT H. DOREMUS, Materials Engineering
New York. Electron Spin. Department, Rensselaer Polytechnic Institute,
NICHOLAS CHAKO, Retired, Department of Troy, New York. Crystallization.
Mathematics, Queens College, City University GLENN L. DOWNEY, Department of Mechani-
of New York. Aberrations. cal Engineering and Engineering Mechanics,
B. S. CHANDRASEKHAR, Department of Physics, University of Nebraska-Lincoln. Dynamics.
Case Western Reserve University, Cleveland, G. DRESSELHAUS, Lincoln Laboratory, Massa-
Ohio. Superconductivity. chusetts Institute of Technology. Cyclotron
Resonance (Diamagnetic Resonance).
FRANK CHORLTON, Department of Mathe- MILDRED S. DRESSELHAUS, Department of
matics, University of Aston in Birmingham, Electrical Engineering and Physics, Massachu-
England. Differential Equations in Physics. setts Institute of Technology. Semiconductors.
BRUCE P. CLAYMAN, Department of Physics, H. G. DRICKAMER, School of Chemical Sci-
Simon Fraser University, Vancouver, British ences, University of Illinois at Urbana-Cham-
Columbia. Interference and Interferometry. paign. Pressure, Very High.
KENNETH J. CLOSE, Department of Physics, ROBERT H. EATHER, Physics Department,
The Polytechnic of Central London, England. Boston College, Chestnut Hill, Massachusetts.
Vacuum Techniques (with John Yarwood). Aurora.
IRA COCHIN, Department of Mechanical Engi- ERNST R. G. ECKERT, Emeritus, Department
neering, New Jersey Institute of Technology, of Mechanical Engineering, University of
Newark. Gyroscope and Inertial Guidance. Minnesota. Heat Transfer.
vii CONTRIBUTORS

D. EDELSON, AT&T Bell Laboratories, Murray S. J. GREGG (retired), Department of Chemistry,


Hill, New Jersey. Polar Molecules. University of Exeter, England. Adsorption
JOHN A. EISELE, Space Systems Division, Na- and Absorption.
val Research Laboratory, Washington, D.C. CLARK B. GROSECLOSE, Lawrence Livermore
Tensors and Tensor Analysis (with Robert Laboratory, University of California. Positron
M. Mason). (with William W. Walker).
LEONARD EISNER, Norwalk State Technical JOHN B. GRUBER, Departments of Physics and
College, Norwalk, Connecticut. Radiation, Chemistry, Portland State University, Oregon .
Thermal. Rare Earths (with Richard P. Leavitt).
RA YMOND J. EMRICH, Department of Physics, VINCENT P. GWINN, Department of Chemistry,
Lehigh University. Fluid Dynamics and Fluid University of California, Irvine. Neutron Acti-
Statics. vation Analysis.
DUANE D. ERWAY, Xerox Medical Systems, Y. M. GUPTA, Shock Dynamics Laboratory,
Pasadena, California. Solar Energy Utilization Department of Physics, Washington State
(with Abe Zarem). University. Shock Waves.
HOWARD T. EVANS, JR., U.S. Geological Sur- CECIL W. GUINN, Physicist-Consultant, U.S.
vey, Reston, Virginia. Crystallography and Air Force Department of Defense (retired).
Crystal Structure Analysis. Bionics.
A. G. FISCHER, Department of Electrical Engi- WALTER J. HAMER, Washington, D.C., Electro-
neering, University of Dortmund, West Ger- chemistry.
many. Electroluminescence. A. O. HANSON, Department of Physics, Univer-
GRANT R. FOWLES, Physics Department, Uni- sity of Illinois, Urbana-Champaign . Compton
versity of Utah. Schrodinger Equation. Effect.
MARTIN M. FREUNDLICH. Electron Micro- W. HAPPER, Department of Physics, Princeton
scope. University. Optical Pumping.
SUSUMU FUKUDA, Department of Electronics, AKIRA HARASIMA, Emeritus, Tokyo Institute
Kyoto University, Japan. Photoelasticity. of Technology, Japan. Surface Tension.
RICHARD M. FULLER, Department of Physics, FOREST K. HARRIS (retired), Electricity Divi-
Gustavus Adolphus College, St. Peter, Minne- sion, National Bureau of Standards. Electrical
sota. Density and Specific Gravity (with Measurements (with Barry A. Bell).
Robert G. Fuller). ROLAND H. HARRISON, National Institute for
ROBER T G. FULLER, Department of Physics, Petroleum and Energy Research, Bartlesville,
University of Nebraska-Lincoln. Density and Oklahoma. Gases: Thermodynamic Properties.
Specific Gravity (with Richard M. Fuller). R. W. HART, Applied Physics Laboratory, Johns
HAROLD P. FURTH, Department of Astrophys- Hopkins University. Light Scattering.
ical Science, Princeton University . Magnetic RYUKITI R . HASIGUTI, Faculty of Engineering,
Field. University of Tokyo, Japan. Lattice Defects.
T. H. GEBALLE, Department of Applied Phys- SHERWOOD K. HAYNES, Emeritus, Department
ics, Stanford University. Calorimetry (with of Physics, Michigan State University. Auger
Frances Hellman). Effect.
BARRY A. GEORGE. Electron Optics. G. E. HAYTON, Department of Electronic Engi-
H. A. GERSCH, School of Physics, Georgia Insti- neering, University of Hull, England. Feedback
tute of Technology, Atlanta. Boltzmann's (with P. M. Taylor).
Distribution. RAYMOND W. HAYWARD, National Bureau of
ANTHONY B. GIORDANO, Polytechnic Insti- Standards, Washington, D.C. Lorentz Trans-
tute of Brooklyn. Microwave Transmission. formations, Parity.
JOSHUA N. GOLDBERG, Syracuse University, JOHN HEADING, Department of Applied Mathe-
Syracuse, New York, Gravitation. matics, University College of Wales, A beryst-
wyth. Matrices.
PAUL GOLDHAMMER, Department of Physics, EUGENE HECHT, Department of Physics and
University of Kansas. Nuclear Structure. Astronomy, Adelphi University, Garden City,
R. H. GOOD, JR., Department of Physics, Penn- New York. Mechanics.
sylvania State University. Photon. FRANCES HELLMAN, Department of Applied
JOHN B. GOODENOUGH, Inorganic Chemistry Physics, Stanford University. Calorimetry
Laboratory, University of Oxford, England. (with T. H. Geballe).
Diamagnetism. ANTONY HEWISH, Cavendish Laboratory, Cam-
CHARLES D. GOODMAN, Department of Phys- bridge, England. Pulsars.
ics, Indiana University, Bloomington. Critical FRED A. HINCHEY, Department of Mathemat-
Mass and Isospin. ics, Northeastern University, Boston. Vectors
CLARK GOODMAN, 95 Antigua Court, Cora- in Physics.
nado, California. Cross Section and Stopping RUSSELL K. HOBBIE, Space Science Center,
Power. University of Minnesota. Biomedical Instru-
JOSEPH W. GOODMAN, Department of Electri- mentation.
cal Engineering, Stanford University. Fourier JOHN F. HOGERTON , S.M. Stoller Corporation,
Analysis. New York. Atomic Energy.
CONTRIBUTORS viii

CHARLES A. HOLT, Department of Electrical RUDOLPH KINGSLAKE (retired), Institute of


Engineering, Virginia Polytechnic Institute Optics, University of Rochester, New York.
and State University. Induced Electromotive Refraction.
Force. R. H. KINGSTON, Lincoln Laboratory, Massa-
ROBERT E. HOPKINS, Optizon Corporation, chusetts Institute of Technology. Laser.
Rochester, New York. Lens. TOICHIRO KINOSHITA, Laboratory of Nuclear
ROLF HOSEMANN, Gruppe Parakristallfor- Studies, Cornell University. Quantum Electro-
schung, % Bundesanstalt fur Materialorufung, dynamics (with Stanley Brodsky).
Berlin. Colloids, Thermodynamics of; Dif- RANDALL D. KNIGHT, Department of Physics,
fraction by Matter and Diffraction Gratings; Ohio State University. Molecular Spectros-
Microparacrystals; Microparacrystals, Equilib- copy.
rium State of; Paracrystals. HENRY J. KOSTKOWSKI, Spectroradiometry
KAREL HUJER, Emeritus, Department of Phys- Consulting, Fairfax, Virginia. Pyrometry,
ics and Astronomy, University of Tennessee Optical.
at Chattanooga. History of Physics. ALLAN D. KRAUS, U.S. Naval Postgraduate
McALLISTER H. HULL, JR., Department of School. Circuitry.
Physics, University of New Mexico. Calculus REINOUT P. KROON (retired), University of
of Physics. Pennsylvania. Dimensions.
T. S. HUTCHISON, Royal Military College, Kings- H. G. KUHN, Emeritus, Clarendon Laboratory,
ton, Ontario. Ultrasonics (with S. L. McBride). Oxford University. Atomic Spectra.
ROBERT L. INGALLS, Department of Physics, KAILASH KUMAR, Research School of Physi-
University of Washington. Mossbauer Effect cal Sciences, Australian National University,
(with Peter Debrunner). Canberra. Many-Body Problem.
MAX JAMMER, Bar-Han University and Hebrew A. BARRY KUNZ, Department of Physics and
University, Israel. Statics. Materials Research Laboratory, University of
J. V. JELLEY, Nuclear Physics Division, Atomic Illinois at Urbana-Champaign. Solid State
Energy.Research Establishment, Harwell, En- Theory.
gland. Cerenkov Radiation. C. G. KUPER, Department of Physics, Technion-
R. J. JOENK, IBM Information Products Divi- Israel Institute of Technology, Israel. Polaron.
sion, Boulder, Colorado. Ferromagnetism DONALD W. KUPKE, Department of Biochemis-
(with T. R. McGuire). try, School of Medicine, University of Virginia.
RUSSELL H. JOHNSEN, Department of Chem- Centrifuge (with Ralph A. Lowry and Houston
istry, Florida State University. Elements, G. Wood, III).
Chemical. K. O. KUTSCHKE, Division of Chemistry, Na-
FRANCIS S. JOHNSON, University of Texas at tional Research Council of Canada, Ottawa.
Dallas. Space Physics. Photochemistry.
ROBERT A. JOHNSON, Materials Science Depart- ROBERT T. LAGEMANN, Emeritus, Depart-
ment, University of Virginia. Irradiation, Dis- ment of Physics, Vanderbilt University. Wave
placed Atoms. Motion.
JESS J. JOSEPHS, Department of Physics, Smith HELMUT E. LANDSBERG, Institute for Physical
College, Northampton, Massachusetts. Musi- Science and Technology, University of Mary-
cal Sound. land. Geophysics.
P. K. KABIR, Department of Physics, Univer- C. T. LANE. SuperjZuidity.
sity of Virginia. Weak Interactions. KENNETH R. LANG, Department of Physics,
G. MICHAEL KALVIUS, Physik-Department Tufts University. Cosmology.
Technische Universitiit, Munchen, Federal PAUL LANGACKER, Department of Physics,
Republic of Germany. Magnetometry (with University of Pennsylvania. Grand Unified
H. J. Litterst). Theories.
WILLIAM M. KAULA, Department of Earth and D. F. LAW DEN , Emeritus, University of Aston
Space Sciences, University of California, Los in Birmingham, England. Mathematical Princi-
Angeles. Geodesy. ples of Quantum Mechanics.
ROBERT W. KENNEY, Lawrence Berkeley Lab- RICHARD P. LEAVITT, Applied Physics Branch,
oratory, University of California. Bremsstrah- Harry Diamond Laboratories, Adelphi, Mary-
lung and Photon Beams. land. Rare Earths (with John B. Gruber).
D. W. KERST, Department of Physics, University REUBEN LEE, Consulting Engineer. Trans-
of Wisconsin, Madison. Betatron. former.
WILLIAM F. KIEFFER (retired), Department of R. J. W. LEFEVRE, Emeritus, School of Chem-
Chemistry, College of Wooster, Ohio. Mole istry, Macquarie University, North Ryde,
Concept. Australia. Dipole Moments (Electrical and
ALLEN L. KING, Emeritus, Department of Magnetic).
Physics and Astronomy, Dartmouth College. MARC D. LEVENSON, IBM Research Division,
Refrigeration. San Jose, California. Kerr Effects.
GERALD W. KING, Department of Chemistry, SUMNER LEVINE, Department of Materials
McMaster University, Hamilton, Ontario. Mol- Science and Engineering, State University of
ecules and Molecular Structure. New York at Stony Brook. Thermionics.
ix CONTRIBUTORS

DAVID R. LIDE, JR., Standard Reference Data, and Science of Materials, University of Oxford,
National Bureau of Standards. Chemical England. Metallurgy .
Physics. ROBERT M. MASON, Consultant, CLEF, Peter-
INGOLF LINDAU, Electronics Laboratory, Stan- borough, New Hampshire. Tensors and Ten-
ford University. Photoelectricity. sor Analysis (with John A. Eisele).
ROBERT .LINDSAY, Department of Physics, P. T. MA TTHEWS, Department of Applied Mathe-
Trinity College, Hartford, Connecticut. Reso- matics and Theoretical Physics, University of
nance. Cam bridge, England. Strong Interactions.
RAPHAEL M. LITTAUER, Laboratory of Nuclear R. D. MATTUCK, Physics Laboratory I, H. C.
Studies, Cornell University. Pulse Generation. Cj)rsted Institute, University of Copenhagen.
F. JOCHEN LITTERST, Physik-Department Tech- Feynman Diagrams.
nische, Universitat Miinchen, Federal Republic JOSEPH E. MAYER, University of California,
of Germany. Magnetometry (with G. Michael San Diego. Liquid State.
Kalvius). S. L. Mc BRIDE, Royal Military College of Can-
MICHAEL G. LITTMAN, Department of Mechan- ada, Kingston, Ontario. Ultrasonics (with T.
ical and Aerospace Engineering, Princeton S. Hutchison).
University. Zeeman and Stark Effects. DA VID E. Mc CULLOUGH, Tektronix, Inc.,
A. L. LOEB, Department of Visual and Environ- Beaverton, Oregon. Oscilloscopes.
mental Studies, Harvard University. Heisen- C. B. A. MCCUSKER, School of Physics, Univer-
berg Uncertainty Principle. sity of Sydney, Australia. Quarks.
JOSEPH J. LOFERSKI, Division of Engineering, D. K. McDANIELS, Physics Department, Uni-
Brown University. Photovoltaic Effect. versity of Oregon. Solar Energy Sources.
EDWARD J. LOFGREN, Lawrence Berkeley Lab- RAYMOND H. MCFEE, Consultant, Laguna
oratory, University of California. Cyclotron Hills, California. Infrared Radiation.
and Accelerators, Particle. T. R. MCGUIRE, IBM Thomas J. Watson Re-
RALPH R . LOWRY, School of Engineering and search Center, Yorktown Heights, New York.
Applied Science, University of Virginia. Cen- Ferromagnetism (with R. J. Joenk).
trifuge (with Donald W. Kupke and Houston A. E. E. McKENZIE (deceased). Optical Instru-
G. Wood, III). ments (with Nigel C. McKenzie and J. D.
ROBERT A. LUFBURROW, Department of Phys- Walker).
ics, St. Lawrence University. Carnot Cycles NIGEL C. MCKENZIE. Optical Instruments
and Carnot Engines. (with A. E. E. McKenzie and J. D. Walker).
H. R. LUKENS, IRT Corporation, San Diego, G. T. MEADEN, The Journal of Meteorology,
California. Radioactive Tracers. Tornado and Storm Research Organization,
PAUL S. LYKOUDlS, School of Nuclear Engi- Bradford-on-A von, England. Conductivity,
neering, Purdue University. Magneto-Fluid- Electrical.
Mechanics. HOWARD C. MEL, Donner Laboratory, Univer-
DA VI D N. LYON, Department of Chemical Engi- sity of California, Berkeley. Radiation, Ion-
neering, University of California, Berkeley. izing, Basic Interactions (with Paul Todd).
Liquefaction of Gases. MAEL A. MELVIN, Emeritus, Department of
WILLIAM J. MACKNIGHT, Polymer Science Physics, Temple University. Antiparticles.
and Engineering, University of Massachusetts. HAROLD METCALF, Physics Department, State
Polymer Physics (with Lawrence E. Nielson). University of New York, Stony Brook. Bio-
J. D. MACKENZIE, School of Engineering and physics.
Applied Science, University of California, Los DIETRICH MEYERHOFER, RCA Laboratories,
Angeles. Vitreous State. Princeton, New Jersey. Tunneling.
ALFRED U. MAC RAE ,AT&T Bell Laboratories, WOLFGANG E. MOECKEL (retired), NASA-
Holmdel, New Jersey. Electron Diffraction. Lewis Research Center, Cleveland. Electric
S. M. MAHAJAN, Institute for Fusion Studies, Propulsion.
University of Texas at Austin. Plasmas. ORREN C. MOHLER, Emeritus, Department of
Astronomy, University of Michigan. Solar
FRED C. MAIENSCHEIN, Oak Ridge National Physics.
Laboratory. Nuclear Radiation Shielding. D. J. MONTGOMERY, College of Engineering,
B. W. MANGUM, Temperature and Pressure Divi- Michigan State University. Static Electricity
sion, Center for Basic Standards, National (with Robert G. Cunningham).
Bureau of Standards. Temperature and Ther- KARL Z. MORGAN, School of Nuclear Engi-
mometry. neering, Georgia Institute of Technology.
STEPHEN P. MARAN, Laboratory for Astronomy Health Physics.
and Solar Physics, NASA-Goddard Space A. H. MORRISH, Department of Physics, Uni-
Flight Center. Radio Astronomy. versity of Manitoba, Canada. Paramagnetism.
D. MARCUSE, AT&T Bell Laboratories, Holm- J. MORT, Webster Research Center, Xerox Cor-
del, New Jersey. Fiber Optics. poration. Photoconductivity.
HUMPHREY J. MARIS, Physics Department, ERWIN W. MULLER, Emeritus (deceased), De-
Brown University. Phonons. partment of Physics, Pennsylvania State Uni-
JOHN W. MARTIN, Department of Metallurgy versity. Field Emission (with Tien T. Tsong).
CONTRIBUTORS
"
RAYMOND L. MURRAY, Emeritus, Nuclear A. EDWARD PROFIO, Department of Chemical
Engineering, North Carolina State University. and Nuclear Engineering, University of Cali-
Nuclear Reactors. fornia, Santa Barbara. Nuclear Instruments.
RA YMOND R. MYERS, Department of Chemis- JOHN E. PRUSSING, Department of Aeronauti-
try, Kent State University (Ohio). Rheology. cal and Astronautical Engineering, University
NORMAN H. NACHTRIEB, Department of of Illinois at Urbana-Champaign. Kepler's
Chemistry, University of Chicago. Diffusion Laws of Planetary Motion.
in Liquids. ERNEST RABINOWICZ, Department of Mechan-
GERARD NADEAU, Department de Physique, ical Engineering, Massachusetts Institute of
Universite Laval, Quebec. Elasticity. Technology. Friction.
PAUL NELSON, Department of Mathematics, S. RAIMES, late of Imperial College, London.
Texas Tech University. Transport Theory Wave Mechanics.
(with G. Milton Wing). LAWRENCE L. RAUCH, Jet Propulsion Labo-
JACOB NEUBERGER, Department of Physics, ratory, California Institute of Technology.
Queens College, City University of New York. Telemetry.
Expansion, Thermal. FREDERICK REINES, Department of Physics,
LAWRENCE E. NIELSON, Redmond, Oregon. University of California, Irvine. Neutrino.
Polymer Physics (with William J. MacKnight). JOHN R. REITZ, Research Staff, Ford Motor
JOHN S. NISBET, Ionosphere Research Labora- Company. Propagation of Electromagnetic
tory, University of Pennsylvania. Ionosphere. Waves.
JOHN F. NOXON, Fritz Peak Observatory, Aer- D. C. REYNOLDS, Air Force Wright Aeronauti-
onomy Laboratory, NOAA, Boulder, Colo- cal Laboratories, Avionics Laboratory, Wright-
rado. Airglow. Patterson Air Force Base, Ohio. Excitons.
ALLEN NUSSBAUM, Department of Electrical J. A. REYNOLDS, Culham Laboratory, United
Engineering, University of Minnesota. Fourier Kingdom Atomic Energy Authority, Abing-
Optics and Optics, Geometrical, Advanced. ton, Oxfordshire. Fusion Power and Laser
R. F. O'CONNELL, Department of Physics and Fusion.
Astronomy, Louisiana State University. Fara- JAMES A. RICHARDS, JR., Agricultural and
day Effects. Technical College, State University of New
HARRY F. OLSON, late of RCA Laboratories. York, Delhi. Brownian Motion and Fermi
Noise, Acoustical. Dirac Statistics and Fermions.
JOHN M. OLSON, Institute of Biochemistry, WOLFGANG RINDLER, Department of Physics
Odense University, Denmark. Photosynthesis. and Mathematics, University of Texas at Dal-
STEPHEN J. O'NEIL. Servomechanisms. las. Relativity.
D. D. OSHEROFF, AT&T Bell Laboratories. PETER H. ROGERS, School of Mechanical Engi-
Cryogenics. neering, Georgia Institute of Technology.
RALPH T. OVERMAN. Radioactivity. Electroacoustics.
THORNTON PAGE, NASA Johnson Space Cen- JACQUES E. ROMAIN, Centre de Recherches
ter, Houston, Texas. Astrophysics. Routieres, Belgium. Time.
WILLIAM E. PARKINS, Consultant, Woodland DONALD M. Ross, Propulsion Consulting Engi-
Hills, California. Energy Levels, Atomic and neer. Flight Propulsion Fundamentals (with
Work, Power and Energy. G. P. Sutton).
ROBERT PETERS, Speech and Hearing Sciences, THOMAS D. ROSSING, Department of Physics,
University of North Carolina. Hearing. Northern Illinois University. Acoustics.
NORMAN E. PHILLIPS, Department of Chemis- MILTON A. ROTHMAN (retired), Department of
try, University of California, Berkeley. Heat Physics, Trenton State College (New Jersey).
Capacity. Conservation Laws and Symmetry.
RICHARD A. PHILLIPS, Foster Grant Corpora- ROGERS D. RUSK, Emeritus, Department of
tion. Optics, Physical. Physics, Mount Holyoke College. Nuclear
JULIAN M. PIKE, National Center for Atmo- Radiation.
spheric Research, Boulder, Colorado. Coriolis
Effect. HAJIME SAKAI, Department of Physics and
J. J. PINAJIAN, Oak Ridge National Laboratory. Astronomy, University of Massachusetts.
Isotopes. Spectroscopy.
MAR TIN A. POMERANTZ, Bartol Research R. T. SANDERSON, Emeritus, Department of
Foundation of The Franklin Institute, Univer- Chemistry, Arizona State University. Bond,
sity of Delaware. International Solar-Terres- Chemical and Periodic Law and Periodic
trial Physics Programs. Table.
ALAN Y. POPE, Albuquerque, New Mexico. ROBERT B. SCHAINKER, Electric Power Re-
Aerodynamics. search Institute, Palo Alto, California. Energy
G. M. POUND, Materials Science Department, Storage, Thermal Mechanical.
Stanford University. Condensation. J. E. SCHIRBER, Sandia National Laboratories.
R. D. PRESENT, Department of Physics and DeHaas-van Alphen Effect.
Astronomy, University of Tennessee. Gas H. M. SCHLICKE, Consulting Engineer, Milwau-
Laws and Intermolecular Forces. kee, Wisconsin. Capacitance.
xi CONTRIBUTORS

PETER A. SCHROEDER, Physics Department, Queen Elizabeth College, University of Lon-


Michigan State University . Thermoelectricity. don. Mechanical Properties of Solids.
WILLIAM T. SCOTT, Department of Physics, J. C. SPROTT, Department of Physics, Univer-
University of Nevada. Electricity. sity of Wisconsin-Madison. Electron Tubes.
GLEN T. SEABORG, Lawrence Berkeley Labora- H. EUGENE STANLEY, Center for Polymer
tory, University of California. Transuranium Studies and Department of Physics, Boston
Elements. University. Critical Phenomena.
ARTHUR H. SEIDMAN, Department of Electrical ROBERT L. STEARNS, Department of Physics,
Engineering, Pratt Institute. Diode (Semicon- Vassar College. Bose Einstein Statistics and
ductor). Bosons.
J. M. H. LEVELT SENGERS, Institute for Basic SAMUEL STEIN, Time and Frequency Division,
Standards, National Bureau of Standards. National Bureau of Standards. Frequency
Compressibility, Gas. Standards.
R. S. SHANKLAND, late of Case Western Re- K. AA. STRAND, (formerly) U.S . Naval Obser-
serve University. Michelson-Morley Experi- vatory, Washington, D.C. Astrometry.
ment. E. C. G. SUDARSHAN , Center for Particle Theory
A. G. SHARKEY, JR., Pittsburgh Energy Tech- and Department of Physics, University of
nology Center. Mass Spectrometry. Texas at Austin. Elementary Particles.
WILLIAM F. SHEEHAN, Department of Chem- G. P. SUTTON, Consulting Engineer, Danville,
istry, University of Santa Clara (California). California. Flight Propulsion Fundamentals
Chemical Kinetics. (with Donald M. Ross).
ERIC SHELDON, Department of Physics, Uni- J. D. SWIFT, School of Physics, University of
versity of Lowell (Massachusetts). Neutron. Bath, England. Electrical Discharges in Gases.
HOWARD A. SHUGART, Department of Physics, S. M. SZE, AT&T Bell Laboratories, Murray
University of California, Berkeley. Atomic Hill, New Jersey . Semiconductor Devices.
and Molecular Beams. P. M. TAYLOR, Department of Electronic Engi-
WILLIAM A. SHURCLIFF, Harvard University, neering, University of Hull, England. Feed-
Polarized Light. back.
R. P. SHUTT, Brookhaven National Laboratory. R. E. TAYLOR, Department of Anthropology,
Spark and Bubble Chambers. University of California, Riverside. Radiocar-
W. A. SIBLEY, Oklahoma State University . bon Dating.
Color Centers. NORMAN TEPLEY, Department of Physics,
MIRIAM SIDRAN, Department of Natural Sci- Oakland University, Rochester, Michigan.
ences, Baruch College/City University of New Fermi Surface (with H. V. Bohm and George
York. Photography. Crabtree).
LESTER S. SKAGGS , Cancer Therapy Institute, JAMES TERRELL, Los Alamos National Labora-
King Faisal Specialist Hospital, Riyadh, Saudi tory. Fission.
Arabia. Medical Physics. RUDOLPH E. THUN, Raytheon Company, Lex-
MERRILL I. SKOLNIK, Radar Division, Naval ington, Massachusetts. Thin Films.
Research Laboratory. Radar. PAUL TODD, Program in Molecular and Cell
L. SLIFKIN, Department of Physics, University Biology, Pennsylvania State University. Radi-
of North Carolina. Diffusion in Solids. ation, Ionizing , Basic Interactions (with
J. SMIDT, Laboratorium voor Technische Na- Howard C. Mel) .
tuurkunde, Technische Hogeschool Delft, The RICHARD TOUSEY , Consultant, Space Science
Netherlands. Relaxation. Division, Naval Research Laboratory, Washing-
HOWARD M. SMITH, Eastman Kodak Company, ton. Ultraviolet Radiation.
Rochester, New York . Holography. LAWRENCE M. TRAFTON, Astronomy Depart-
M. G. SMITH, Faculty of Computing, Manage- ment and McDonald Observatory, University
ment Science, Mathematics and Statistics, of Texas at Austin . Planetary Atmospheres.
City of London Polytechnic. Laplace Trans- L. E. H. TRAINOR, Department of Physics,
forms. University of Toronto, Canada. Theoretical
CHARLES P. SMYTH, Emeritus, Department of Physics.
Chemistry, Princeton University. Dielectric MYRON TRIBUS, Center for Advanced Engi-
Theory. neering Study, Massachusetts Institute of
B. A. SOLDANO, Physics Department, Furman Technology. Entropy.
University . Mass and Inertia. G. J. F .TROUP, Physics Department, Monash
S. L. Soo , Department of Mechanical and Indus- University, Victoria, Australia. Maser.
trial Engineering, University of Illinois at JOHN G. TRUMP, Department of Electrical
Urbana-Champaign. Equilibrium. Engineering, Massachusetts Institute of Tech-
DAVISON E. SOPER, Institute of Theoretical nology. Accelerator, Van de Graaff and High
Science, University of Oregon, Quantum Voltage Research.
Chromodynamics. N. W. TSCHOEGL, Department of Chemical Engi-
J. W. T. SPINKS, University of Saskatchewan. neering, California Institute of Technology.
Radiation Chemistry (with R. J. Woods) . Viscoelasticity.
M. T. SPRACKLING, Department of Physics, TIEN T. TSONG, Department of Physics, Penn-
CONTRIBUTORS xii

sylvania State University. Field Emission (with MILTON G. WHITE, late of Department ofPhys-
Erwin W. Miiller). ics, Princeton University. Synchrotrons.
W. N. UNERTL, Laboratory of Surface Science DONALD J. WILLIAMS, Applied Physics Labo-
and Technology, University of Maine at Orono. ratory, Johns Hopkins University. Magneto-
Surface Physics. spheric Radiation Belts.
EDGAR VILCHUR, Foundation for Hearing Aid FERD WILLIAMS, Physics Department, Univer-
Research, Woodstock, New York. Reproduc- sity of Delaware. Luminescence.
tion of Sound. JOHN H. WILLS, Kennett Square, Pennsylvania.
WILHELM H. VON AULOCK, AT&T Bell Labo- Phase Rule.
ratories, Whippany, New Jersey . Ferrimagne- RICHARD C. WILLSON, Jet Propulsion Labora-
tism (with Andrew H. Bobeck). tory, California Institute of Technology. Solar
JEARL WALKER, Department of Physics, Cleve- Total Irradiance and its Spectral Distribution.
land State University, Ohio. Optical Instru- A. J. C. WILSON, Crystallographic Data Centre,
ments (with A. E. E. McKenzie and N. C. University Chemical Laboratory, Cambridge,
McKenzie). England. X-ray Diffraction.
WILLIAM W. WALKER, Department of Physics, G. MILTON WING, Los Alamos National Labo-
University of Alabama at Tuskaloosa. Positron ratory. Transport Theory (with Paul Nelson) .
(with B. Clark Groseclose). DENNIS E. WISNOSKY, Industrial Systems
FRANKLIN F. Y. WANG, Department of Mate- Group, GCA Corporation, Naperville, Illinois.
rials Science and Engineering, State Univer- Computers.
sity of New York at Stony Brook. Rectifiers. FRANK L. WOLF, Department of Mathematics,
KENNETH M. WATSON, Marine Physical Labo- Carleton College. Statistics.
ratory of the Scripts Institution of Oceanog- HUGH C. WOLFE, American Institute of Physics.
raphy, University of California, San Diego. Symbols, Units, and Nomenclature.
Collisions of Particles. T. WOLFRAM, Physics Department, University
A. H. WEBER, Emeritus, Department of Physics, of Missouri. Spin Waves (with R. E. DeWames).
Saint Louis University. Rotation-Curvilinear HOUSTON G. WOOD, III, Department of Chem-
Motion. ical and Aerospace Engineering, University of
WALTER L. WEEKS, School of Electrical Engi- Viriginia. Centrifuge (with Ralph Lowry and
neering, Purdue University. Electromagnetic Donald Kupke).
Theory. G. K. WOODGATE, St. Peter's College, Univer-
VERNON G. WELSBY, late of the Department sity of Oxford. Atomic Physics.
. of Electronic and Electrical Engineering, Uni- R. J. WOODS, Department of Chemistry, Uni-
versity of Birmingham, England. Inductance. versity of Saskatchewan. Radiation Chemistry
H. L. WELSH, Emeritus, Department of Physics, (with J. W. T. Spinks).
University of Toronto, Canada. Raman Effect JOHN YARWOOD, (retired), Department of
and Raman Spectroscopy. Physics, The Polytechnic of Central London.
CHARLES WERT, Department of Metallurgy and Vacuum Techniques (with Kenneth J. Close).
Mining Engineering, University of Illinois, HSUAN YEH, Towne School of Civil and Me-
Urbana. Solid State Physics. chanical Engineering, University of Pennsyl-
N. REY WHETTEN, General Electric R&D Cen- vania. Impulse and Momentum.
ter, Signal Electronics Laboratory. Secondary A. M. ZAREM, Beverly Hills, California. Solar
Emission. Energy Utilization (with Duane D. Erway).
FREDERICK E. WHITE, Emeritus, Boston Col- ALEXANDER ZUCKER, Oak Ridge National
lege, Chestnut Hill, Massachusetts. Vibration. Laboratory. Nuclear Reactions.
PREFACE TO THE
THIRD EDITION
Welcome to physics, the science of relativity and gravitation, matter and energy, quarks
and quanta, lasers and masers, and many other intriguing scientific areas.
Since the appearance of the second edition many new developments have become impor-
tant, and this edition has been expanded to include as many as possible of these new
ideas without neglecting older, but still valid, concepts.
In a major effort to strengthen coverage of the rapidly moving and vitally significant
field of particle physics, four new articles were added: Gauge Theories, Quantum Chromo-
dynamics, Electroweak Theory, and Grand Unified Theories. In addition, many revisions
were made in second edition articles in this field, including Elementary Particles, Weak
Interactions, Strong Interactions, Parity, Current Algebra, and Quantum Theory.
Similarly, to increase emphasis on the important role of physics in medicine and biol-
ogy, articles were added on Molecular Biology and Biomedical Instrumentation, and
second edition entries on Biophysics, Medical Physics, Health Physics, and Bionics were
brought up-to-date. Many other fields of physics, such as optics and energy storage, have
received additional emphasis.
The general plan of this edition follows that of the first two editions and is discussed
in their prefaces which follow. A reader with a limited background in physics should find
it worth while to start by reading the article on "Physics," and then reviewing the articles
on major areas of physics mentioned in that article and listed as cross-references at the
end of the article.
The major credit for this book must go to the contributors who made it possible. Thanks
are also due to the staff at the Van Nostrand Reinhold Company, who suggested the third
edition, did a large amount of editing, and made a book out of a collection of manuscripts.
To my wife, Leigh, goes my deep appreciation for accomplishing the vast amount of
record keeping and secretarial work needed, and for doing much of the checking of manu-
scripts and proof-reading.
ROBERT M. BESAN<;:ON

Dayton, Ohio

xiii
PREFACE TO SECOND
EDITION
This second edition of the Encyclopedia of Physics follows the same
general plan as was used for the first edition; that is, each article is
written so as to be of primary value to the type of reader who is most
apt to look for the particular topic. There are articles on major areas of
physics which are ~t a low technical level, so as to be of maximum value
to the reader with little prior knowledge of physics. There are also
articles on major divisions and subdivisions of these areas. In general,
these latter start with an introduction intended to define the topic and
describe the concepts involved. This is followed by more detailed and
advanced treatment for the reader with a stronger background in physics.
To cover more of physics, the book has been considerably expanded,
both by adding new articles and by including new material on topics in
the first edition. Many of the articles have been completely rewritten,
others received major changes, while others, particularly those on major
areas of physics, required little or no change.
As in the first edition, the major credit for any success the book may
achieve belongs to the authors, many of whom not only contributed a
tremendous amount of time and effort in preparing articles, but made
valuable suggestions for other parts of the book.
The editors at the Van Nostrand Reinhold Company contributed a
great deal to the readability and accuracy of the book, and to my wife,
Leigh, goes credit for much careful proofreading, for the preparation of
the extensive index, and for the typing and detailed record-keeping
required in assembling a book of this magnitude.

To all of these workers my heartfelt thanks are due.

ROBERT M. BESAN~ON
Dayton, Ohio
April 10, 1974

xv
PREFACE TO FIRST EDITION
THE AIM of this book is to provide in one volume concise and accurate information about
physics. It should be of use to physicists who need information outside of their own
special areas of interest, to teachers and librarians who must answer inquiries, to students
who wish to add to their funds of knowledge, and to engineers and scientists who en-
counter physical concepts in pursuit of their professions. The book has been made pos-
sible by the thoughtful and generous cooperation of more than 300 authors, both in this
country and abroad, who have unstintingly contributed their time, skill and knowledge.
Their names and affiliations are shown immediately before this preface.
The most challenging problem for the editor was deciding which topics to include and
which to leave out, since the space available was very limited compared with the vast
amount of knowledge that could have been included. The approach used was to provide
short introductory articles on physics, on the history of physics, on measurements, and
on symbols, units and nomenclature, plus general articles on the major areas of physics:
heat, light, mechanics, acoustics, etc. To these were added entries on divisions and sub-
divisions of the major areas; these are more detailed and pitched at somewhat higher tech-
nical levels than the broader, more general articles. Other topics lie on the interfaces be-
tween major areas of physics or are on subjects that include both physics and other
disciplines. These include, among others, astrophysics, geophysics, biophysics, and mathe-
matical biophysics. Finally, a few articles cover sciences that are so closely related to
physics that the differences are frequently merely matters of emphasis.
Each article attempts to provide not just a definition of a term but an explanation of an
area of physics. No attempt was made to hold all articles at the same technical level; on
the contrary, the level for each entry was aimed at those readers who would be most apt
to look for information on that specific topic. The contents of each article was left to the
discretion of the author as the one most capable of making the proper selection. Some of
the authors found it necessary to use mathematics, as is done in many books on physics.
However, the reader with a limited mathematical background will find many articles with
no mathematics at all, and others with very little, while the reader who is so inclined can
sink his teeth into the more mathematical paragraphs.
Most of the authors have provided references to summary articles and books, and in
addition, cross-references to other articles in this book have been added wherever it was
felt that they might be of particular help to the reader. A few cross-references are shown
by the use of small capitals in the body of the text (thUS, MECHANICS); others are listed
at the end of the article. The index should serve to locate particular topics that might not
be subjects of complete articles.
I should like to extend my heartfelt thanks to the authors who contributed so much and
to Mr. G. G. Hawley and Mr. H. Simonds of the Reinhold Publishing Corporation who in-
vited me to compile and edit this book as one of the series of scientific and technical en-
cyclopedias published by that company. I also owe a very great deal to Mrs. Alberta
Gordon and her staff, who did much of the editing and proofreading, and to my wife,
Leigh, who contributed the bulk of the tremendous amount of clerical work involved as
well as adding a great deal of enthusiasm and inspiration.

ROBERT M. BESANCSON
Dayton, Ohio
November 1, 1965

xvii
THE
ENCYCLOPEDIA
OF PHYSICS
A
ABERRATION Principle of Least Action for charged or un-
charged particle motion, namely,
1. Introduction* When light is considered as
an electromagnetic process or disturbance, it is
characterized as a vibration phenomenon taking
place in time and space. It propagates from one
V(A,B) = fA
B
/l(x,Y,z)ds, (l.la)

region to another in the form of waves. Mathe-


matically, such a vibratory motion is governed SeA, B) = JB .Jh - tp(x, y, z) ds, (1.1 b)
by certain second-order partial differential
A
equations-wave equations-which are derived
from Maxwell's equations of the electromag- where /l(x,Y,z) is the refractive index of the
netic field. The solutions of these equations are medium and ~ is the action function of
distinguished by the presence of certain quanti- mechanics, h being the constant of energy and
ties accompanying periodic motion in space and tp the potential energy function. In both prob-
time, namely wavelength A and frequency II. lems, the first variation along the paths from A
They depend on the medium in which propaga- to B must vanish.t
tion of waves takes place. However, in the clas- The differential equations satisfying the varia-
sical description of light propagation it was tional conditions are known as the Euler-
assumed that the process acts like a stream of Lagrange equations of light rays and particles,
particles; consequently such light phenomena respectively.
are described as rays-the old corpuscular hy- The equations of ray paths in a medium of
pothesis. In this description, the periodicity of refractive index /l(x,Y, z) and of relativistic
waves is lost, i.e., the wavelength A does not (fast) electrons in an electromagnetic field E,
have any direct effect on the paths of the light H are: 3 ,4

Light Optics Electron Optics

~ (.Jtp(l + etp) dY ) = -.!. I + 2etp Ex + AO(B dz _ Bz dY )


ds ds 2 .Jtp(l + etp) Y ds ds

Y 1 .JI + 2etp Ey + ho (Bz


- d (.Jtp(l + etp) -d ) = - - dX- - Bx -dZ) (II)
ds ds 2 tp(1 + etp) ds ds

~ (.Jtp(1 + etp) dZ) = _.!. 1+ 2etp Ez + A/Bx dy _ B dX)


ds ds 2 .Jtp(1 + etp) ~ ds y ds
e e
AO= -2-' e=---
mo 2moc2

rays through a medium. In analogy to the mo-


tion of particles, the propagation of light as tThe differential ds is the element of arc of the
rays must be governed by differential equations path of the raysJ.0rt;articies. Fermat's Principle is of
similar to the equations of particles. Indeed, the
equations of light rays and of particles are de- least time, i.e., edt =0, from which (l.la) is
rived from variational principles, the Fermat to
Principle for light propagation as rays and the derived by writing dt =(ltv) ds. The corresponding
*This article gives a theoretical treatment of op- stlate~ent in mechanics is Hamilton's Principle, i.e.,
tical aberration. For an introductory discussion, see
the articles entitled LENS and OPTICS, GEOMET- o L dt = 0, from which (l.Ib) is derived, pro-
RICAL. For astronomic aberrations, see VELOCITY to
OF LIGHT. vided t does not enter explicitly into the Lagrangian L.
ABBERATION 2

where € is the relativistic correction factor, 'P is The expression V(B) - V(A) gives the optical
the electric potential, and E = -grad 'P. If € = 0, distance of the path of a ray from a point A(x,
the electron optics equations reduce to the equa- y, z) of the wavefront ~o, V(A) = co, to the cor-
tion for electrons of nonrelativistic velocity. responding point B(x', y', z') on ~', V(B) = cl.
A heuristic procedure for deriving the equa- Since the rays are normal (normal congruence)
tions of geometric optics is to assume a wave- to the wavefronts, the latter are optically equi-
form solution of the scalar wave equation distant from each other-a two-parameter of
~u - (1 /u 2 ) Utt = 0, i.e., transversal surfaces. They may be regarded as
an optically parallel (transversal) family of sur-
u(x, y, z, t) = A(x, y, z) faces, the rays constituting the geodesics of the
optical medium. Now each point P(x, y, z) on
·exp(ik(V(x,y,z)- ct)) (1.2) the wavefront V(A) = Co may be regarded as

°
Substituting (1.2) into the wave equation and giving rise to a source, or disturbance of a wave
afterwards letting A -+ or the propagation con- at an instant t', with amplitude A(x, y, z). At
stant k -+ 00, the remaining terms involve only another instant t, the wavelet arrives at a point
B(x', y', z'). The totality of all wavelets origi-
the phase function V:
nating on ~o will now form a surface wave
~', which is the envelope of all the wavelets.
The distance from ~o to ~, is expressed by
the characteristic function (point) V(x, y, z, x',
where /1 is the refractive index of the medium. y', z') = u'(t' - t), where u is the velocity of the
This equation is called the characteristic equa- wave in the medium. The transformation of
tion of Hamilton or the Eiconal Equation. The ~o to ~', or the rays issuing from points on ~o
solution of this equation will give all the infor- to the corresponding rays intersecting ~', de-
mation about the imaging properties of a me- scribes a finite contact transformation, and the
dium with refractive index /1. Unfortunately, directions (optical) (Po, qo) on ~o and (p', q')
complete (closed) solutions of 0.3) are known on ~, are given by the equations
for only a few cases.
The phase function V = constant represents p =--
au , au
surfaces which propagate through the medium. o axo' p = - ax'
At each point of the surface or wavefront a
light ray intersects it normally. The amplitude q =--
au ,
q =--
au
on the surface ~ is given by A (x, y, z), so each ° ayo' ay'
intersection point of a light ray with the phase
function surface ~ becomes a source or dis- au , au
turbance which propagates from it in the ro = - az o ' r = - az'
form of a surface or wavefront, the wavelets
of Huygens. where r0 2 = /10 2 - P02 - q02, etc., and r =
Equation (1.2) is equivalent to the propaga- Y/12 - p2 - q2 is the so-called Geometrical Op-
tion of light as rays, since grad V is proportional tics Hamiltonian.
to a vector SO which is normal to the wavefront. 2. Aberrations A light ray originating at a
To account for the optical properties of the me- point Po = (xo, Yo, zo) on the object plane
dium, the direction of the rays is s = (p, q, rO), Z = zo,\ is imaged by the optical system at a point
derived from SO as follows. If Q, {3, r are the Pi = (Xl, Y1. Zl), which we assume to be on the
geometric directions, so that SO = (Q, (3, r), then Gaussian plane Z 1 = Zg of the system, and let
we have s = /1so, with the point Pg = (Xg, yg, Zg) be the Gaussian image
of Po. The two planes, Z = Zo and Z = Zg are con-
p = /1 Q , q = /1{3, s . SO = 1. jugate planes.
Therefore, we have grad V =so, with We select in the image space a wave surface
(front) ~' which crosses the axis of the system
at Oa = (0, 0, Oa). Let the ray intersect the sur-
face ~' at P' = (x', y', z') as shown in Fig. 1.
Now draw a spherical reference surface S of
Furthermore, multiplying by the element of radius R equal to the distance P19a,. (Fig. 1).
arc ds (along the rays), we obtain The displacement vector PgP l measures the
deviation of the actual image of Po from the

IB
A (grad V· SO) ds =
JB as
A
dV
ds
corresponding (aberration free) Gaussian image.
This deviation is called the geometrical aberra-
tion of the ray, or the ray aberration. On the
other hand, the optic)al distance measured along
the ray, namely, P'Pg is called the aberration of
= V(B) - V(A) = JB
A
/1 ds (l.S) the wave front, or simply the wave aberration,
which is positive if the reference surface is be-
tween the wave surface ~, and the image plane
which is the Fermat formulation (1.1 a). Z = Z 1, negative otherwise.
3 ABBE RATION

Ob ject pi ane Entrance pupil plane Exit or aperture plane

Po_+------ p---+-----

0_

--fjP' 00 §JP'
0_
~;; 0,
• Z

L' S
e
P,(x"y"Zg)
P'

Pg(xg , Yg,zg)

Z
00 0,

-
PgP, = ray aberration -P'P,

FIG, 1.
= wave aberration

This wave aberration is expressed by the Let Po = (xo, Yo, zo) and PI = (xt. Y I, ZI) be
Hamilton characteristic function V(xo, Yo, Zo; the coordinates of the object ray and the image
x I, Y I, ZI) or the difference of the characteris- ray, respectively, We write this change of the
tics at Ps and P', namely, object ray into the image ray in the form of a
transformation or mapping given by the relations
V(xo,Yo, Zo; XS,YS, zs)
- V(xo,Yo, Zo; x',y', z'), (2,1) XI = XI (xo, Yo, Po, qo, zo, ZI)
(2,1 a)
Y I = Y I (xo, Yo, Po, qo, zo, ZI)
Consequently, we can say that any ray, or
bundle of rays emerging from the object plane PI = PI (xo, Yo, Po, qo, zo, ZI)
(2.1b)
(space) will reappear as another (unique) ray ql =ql(xo,Yo,Po,qo,zo,zl)
or bundle crossing the image* plane (space).
The latter are characterized by the coordinates Eliminating Po, qo from the first of set equa-
and directions in the image space, which will tions (2.1 a) with the aid of the second set (2.1 b)
depend on the corresponding quantities of the (assuming the lacobians do not vanish), say
object ray or bundle and the nature of the opti-
cal medium. Mathematically, this is called a Po = Po(xo, Yo, PI, qt, zo, zt>
mapping of the object space into the image (2,2)
space by the optical system; the intervening qo = qo(xo, Yo, PI, ql, zo, zd
medium thus determines the type of mapping
or transformation function.t we rewrite equations (2.la, b) as follows:
*This is true except at singular points of the map-
ping function, i.e., at focal points, caustics, etc. XI =XIj(xO,YO,PI,qt. ZO,ZI)
tThe type of transformation determining the image
YI = YI (xo, Yo, PI, ql, Zo, ZI)
ray from the object ray in terms of the characteristic
(2.3)
function V is called contact transformation, which Po = POi(xO, Yo, PI, ql, Zo, zt>
changes any surface in object space into any new
surface in the image space. qo = qo(xo, Yo, PI, ql, zo, ZI)
ABBERATION 4

By separating the linear terms in the argu- and, in a number of practical cases; the charac-
ments, (2.la) becomes teristic functions are overall better suited for
classifying aberrations. 3 ,4,5,6
xli = LaijXOj + L bijPoj + ii(XOj. POj) 3. General Theory of Aberrations In the de-
j sign and construction of image forming systems,
one tries to minimize the effects of aberrations
Pij = L eijxOj + L dijPoj + gi(xOj, POj) on the image quality, * or at least remove some
j of the harmful aberrations by compensating dif-
ferent aberrations of various orders. 4 ,5
(ij=1,2) (2.4) In general, the balancing of aberrations is a
difficult problem. However, a general (formal)
where ii(XOj, Pli), gi(XOi, POj) are the deviations analysis of aberrations without specifying the
from an aberration-free image point. The latter characteristic properties of the optical system-
is represented by the linear forms. The four except for geometric configurations (symmetry
functions h gi are called the ray aberrations of etc.) based on the wavefront concept of light
geometrical optics. Knowledge of them will de- propagation-will facilitate the discussion of
termine the deviation of the image from the ob- characterizing and classifying the aberrations.
ject produced by the optical system. The fundamental functions needed to make
The constants aij etc. will depend also on the a formal analysis of the image forming proper-
positions of the object and image planes. How- ties of an optical system are the characteristic
ever, by properly choosing their positions and functions of Hamilton: the point characteristic,
making the principal ray coincide with the or eiconal, V(xOi,-xi); two mixed characteris-
optic axis of the instrument, the linear forms tics,t W(XOi' Pi), W(X1i, POi);:!: and the angular
may be reduced to characteristic T(Poi, Pli). These functions are
Xli =MXOi
related to each other through a so-caUed Le-
(i= 1,2) (2.4') gendre transformation. For instance, if the
Pi = NXOi + LPOi point characteristic V(XOi, xi) is known, then
the mixed characteristic W(XOi, Pi) is obtained
where M is the magnification factor, N is the by subtracting a term X 1P1 + Y 1 q I from V, i.e.,
reciprocal of the focal distance, and L = M-1.
If ii = gi = 0 (no aberration), (2.4) defines an W(XOi, Pi) = V(XOi, Xi) - (XIPI + Ylqd (3.1)
ideal optical system.
Consequently, the main task in any optical where Xl, Y1 are expressed in terms of (xo,Yo;
imaging problem is to calculate the position PI, qd. § We then obtain the image coordinates
Xi and the directions (optical) Pi in image space, on the image plane Z = Z 1 = 0 and the directions
usually on a fixed plane Z = Zi and most often Po, qo on the object plane Z = Zo = 0 from
on a Gaussian image plane, from the knowledge oW
of the functions fi, gi. In practice, one deter- Po=--
mines Xi, Pi by tracing individual rays through axo
the system, applying the laws of refraction or (3.2)
oW oW
reflection (mirrors) or, in the case of isotropic Y --- qo=-oyo'
media, one employs numerical integration. The 1 - Oql'
labor of obtaining fairly good approximate
values of fi, gi is long, but with modern com- The above relations, together with the rela-
puters one is able to reduce the time by several tions Xo = av/oPo, Xl = oV/oPt. given by the
orders of magnitude, so that now it is possible point characteristic, and the expression V - W =
to calculate the image position function to a X I P I + Y I q I, constitute a Legendre transfor-
high degree of accuracy, at least for moderate mation; the inverse of this transformation,
orders of aberration. i.e., W,is obtained from Vbysubjectingx1PI +
Theoretically, the image functions can also be Ylql (the image coordinatl;1s) to a Legendre
derived if one of the characteristic functions of transformation.
the optical system is known, for instance, the The other mixed £haracteristic (object char-
point characteristic V or the mixed characteris- acteristic) is simply W(XI,YI, Zl, Po, qo, zo).
tic W. They are found from the relations The angular characteristic, often used in tele-
scopic reflecting systems, etc., is related to V
av oW
Pi=+-o ' Xi=-- (i = 1,2) (2.5) *In electrical terminology, "noise."
Xi OPi
tThe image and the object characteristics, respec-
where W= V- (XIP1 +Y1ql), exception being tively.
made for telescopic systems, where the angular :j:We have assumed the object and image planes to
characteristic applies. be conjugate planes in the Gaussian sense. In general,
We have already mentioned the difficulties W will depend on the six parameters, and for the case
encountered in finding Vor W in explicit form, where the image plane is in a homogeneous medium it
except in a few optical systems. However, for will depend on five parameters: xo, Yo, PI, ql, Zl·
theoretical discussions an analysis of aberrations §W = v+ YoPo + yoqo·
5 ABBERATION

as follows. The coordinates of the object and We do not write the directions (Po, qo) of the
image planes are given by the expressions ray at the object point (xo, Yo). Equations (3.8)
are the same linear terms we discussed in sec-
aT XI
aT
= --- tion 2. Therefore, W2 represents what is known
Xo = oPo' OPI as the paraxial approximation of the charac-
(3.4) teristic function and of the image functions
aT aT
Yo = oqo' YI = ---aq;. (x I, yd. In certain arrangements (special optical
systems) mentioned in section 2, the coeffi-
cients a23, a34, and a14, and even a33, a44, may
The relations above, together with those cor- be eliminated so that one obtains an ideal sys-
responding to V, also constitute a Legendre tem. Consequently, one does not consider this
transformation between T and V, and T = approximation to Was an aberration.
V+ xoPo + YoPo - XIPI + YIPI· The next approximation is the homogeneous
In the analysis of wave or ray aberrations, it polynomial of degree 3, W3 • Its expansion is
is more convenient to deal with the mixed char- given by
acteristic W(xo, Yo, PI> ql; Zo, zd, especially in 4
the diffraction theory of optical systems, since
it enters in the phase function of the solution W3 = L aijkUiUjUk. (3.9)
i,j, k=1
of the wave equation discussed in section 1.
Now we see that apart from the linear terms, the Further, W4 is expanded as
ray aberration functions fi(xOi, Pi), gi(XOi, Pi)
are derived by simple differentiation of W or 4
Vor T, whichever is appropriate to the problem.
We have already mentioned the difficulties of
L aijklUiUjUkUI. (3.10)
i,j, k, 1=1
obtaining explicit (closed form) expressions for
V or W, therefore we express W in a series of There are altogether 20 terms in W3 • If we fix
homogeneous polynomials of various degrees the object plane, Z = Zo, then the homogeneous
(orders) in the four variables, say (xo, Yo, PI, q I) terms in XoYo (alone) are omitted, since they
for fixed object (z=zo) and image (Z=ZI) do not account for any displacement in the
planes: * image plane. There are four such terms in W3 ,
W= Wo + WI + W2 + W3 + W 4 + ... + WN + ... X03, x02 Yo, xoY02, Y03

(3.5) so the number of independent wavefront aber-


rations reduces to 16.
where WN is a homogeneous polynomial of de- In W4 there are altogether 35 terms, but by
gree N in the arguments. The leading term Wo discarding the terms in Xo, Yo alone, the num-
does not play any role, and W I represents free ber of independent wave aberrations diminishes
space. The quadratic polynomial W2 may be to 30.t On the other hand, if the system has a
written as plane of symmetry, say, either the x-z or y-z
4 plane, i.e., rays lie on a meridian plane, then the
W2 = L aijUiUj (3.6) number reduces further to 16 terms. Identify-
ing UI = Xo, U2 =Yo, u3 = PI, U4 = qlo the terms
i,j=1
with the following coefficients aijkl vanish::j:
where UI = Xo, U2 = Yo, U3 = PI, U4 = ql. We
write in full a0031 = a0013 = al003 = a0130 = al021
W2 = allx02 + al2xoYo + a22Y02 + al3 x OPI = a2011=a1l20 =a1l02 = a3001
+a23YoPI +al4 x oql +a24Yoql =a2110 =a0310 =aI201'
+a33P1 2 +a34Plql +a44qI2. (3.7) The number of wave aberration terms in WN
increases considerably with N. Therefore we
Therefore, from (3.2) the image coordinates are shall limit the analysis to W4, i.e., to third-order
(ray) aberrations for optical systems possessing
oW 2 rotational symmetry, since in most if not all
XI = -~- = al3 x O + a23YO + 2a33PI + a34ql
UPI
tThe number of individual terms in WN is given by
oW 2
YI = -~- = al4 XO + a24YO + a34PI + 2a44ql' (n + l)(n + 2)(n + 5)
uql N= .
3!
(3.8)
;The numbers appearing in the subscripts indicate
*The Wk depend on zo, zi. However, being fixed the exponents of the variables associated with the in-
quantities, they are absorbed by the constant coef- dices. For example, a1030 corresponds to the term
ficients: Aij, etc. alO30xOp3,allll to a lll1xOYOPq, etc.
ABBE RATION 6

practical cases the instruments possess geomet- pansion of the wave aberration functions for
rical symmetry. The wave aberration function rotational symmetric systems is usually given
W then depends only on three parameters, in terms of three new parameters a, p, I{),
which are related to the ray functions Xo, yo,
PI, q, in object and image space (planes) as
follows:
(3.11)
known as the three optical invariants of the
system. In the expansion of W, the odd poly- ap cos I{) = U3 =2(xop, + yoqd. (3.15)
nomials disappear, and we have
The wave aberration function is written in the
W=W O +W Z +W 4 +···. form
The image functions are W( X 0, y 0, P " q " z)
, -- ~ bl nm a ZI +m cos m In
L.... .,..
I,n,m
awZ
XI =-a- =2(a3 x O + azpd, (3.16)
PI

YI =
oWz
-a-
ql
= 2(a3YO +azqd.
where /, m = 0, I, ... , n =0, 1,2, and n - m >
(3.12)
and even. This expansion is the standard or
°
classical expansion used by earlier investigators
These expressions give an image free of before the Zernike-Nijboer papers on diffraction
aberration. theory of aberrations. The degree of the poly-
The W4 term contribution is given by nomial WK is equal to the sum of the exponents
of a and p in the term bl nm a 2/ +m pn cos m I{).
z 2
W4 =allul +aZ2uZ2 +a33 u3 +alZ u lu2 For instance, if we let 21 + n + m = 2, i.e., Wz ,
then the three terms of Wz are obtained by let-
+a13ulu3 +a23u2u3. (3.13) ting m=O, 1= 1, n=O; m=O, 1=0, n=2;
The image functions (deviations from the m = 1, 1= 0, n = 1.
ideal or Gaussian image) are derived from the A more convenient expansion of W is a
expressions Fourier-like expansion*

l:X ._ aW4 _ oW 4 au;


W= L blnma21+m pn cos mI{). (3.17)
U II - - l,n,m
apli au; OPIi'
Here, a single term bl nm a 2/ +m pn cos mi{> repre-
l:p . __ oW4 __ OW4 aUj sents any of the homogeneous polynomials
U 01- - discussed above. For example, Wz is obtained
OXOi aUj aXOi
by letting I = I, n = m = 0; I = m = 0, n = 2; and
U= 1, 2;j= 1,2,3) (3.14) 1= 0, m = n = 1. The Wz polynomial is given by
the expressions
In (3.13) one notices the presence of only six
aberration coefficients, all, a22, ... , a23. Of W2 = b,ooa z + bo zop2 + bOll up cos I{) (3.18)
these all is associated with the object coordi-
nates XOi = (xo~Yo). It is not considered an which is the same as in the standard expansion.
aberration per se. To obtain the five types of However, the Fourier form has the advantage of
third-order aberrations from (3.13) we assume being an orthogonal function in I{), so it facili-
without loss of generality rays restricted to tates the integration over i{> in the diffraction
points (xo,O) on the object plane and, after integrals.
differentiation, letting PI = P cos I{), q I = P sin I{), On the other hand these expansions are super-
we arrange the right-hand side of (3.l3) in de- seded in their application to the diffraction
scending powers of p. The Xli will now depend theory of aberrations by the Zernike-Nijboer
on five aberration coefficients, excluding all, form of expansion, i.e.,
whereas POi will be free of a22. This arrange-
ment for OX Ii will determine the five geomet-
W= L blnm a 21 + m Zn m (p) cos ml{) (3.19)
l,n,m
rical "Seidel" aberrations of third order. The
terms in (ox!, oyd depending on p3 alone give where I, n, m have the same meaning as in the
the spherical or aperture aberration; the terms earlier expansions, but here Z nm (p) is a poly-
in p2 give the primary coma; the linear terms in nominal of degree n in p and not a single term
p are known as astigmatism and curvature of pn as in the other expansions. Furthermore, the
field; and the term independent of p produces Zernike polynomials, Zn m are orthogonal over
distortion of the pincushion or barrel type. a circular plane region. In the next sections we
For a detailed discussion, we refer the reader shall discuss the advantage of this expansion
to the treatises in Refs. 3 and 5, and for elec-
tron optical systems to those listed in Ref. 4. *For nonsymmetric systems one introduces another
However, in diffraction problems 2 ,s the ex- sum in sin m'P, i.e., a complete Fourier-series expansion.
7 ABBERATION

over the other two, the standard and the TABLE 2.


Fourier developments.
In (3.20) below and in Tables I and 2 are ZoO =1 ZSl = lOpS - 12p3 + 2p
listed the expansions of W2, W4, and W6 in Zll =P Z s3 =5ps - 4p3
the three expansion forms s and the Zernike Zss =pS
polynomials Zn m up to n = 6, m = 6. Z20 = 2p2 - 1
In each of them the order for a single aberra- Zl=p2 zl = 20p3 - 30p4 + 12p2-1
tion member is N=21+n+m-l, and the
zi =3p3- 2p Z62 = 15p6 - 20p4 + 6p2
number of types in a single aberration is M =
Zl=p3 Z6 4 =6p6 - 5p4
(l/'Y)(N+ I)(N+ 8). The degree of the poly- Zl=p6
nomial WK representing the wavefront function Z 40 =6p4 - 6p2 + 1
is K =N + 1 = 21 + n + m. Z4 2 =4p4 - 3p2
W2 =alu 2 + a2p 2 +a3up cos I{) Z 44 = p4

W4 =al1u4 +a12u2p2 + a22p 4


+ al3u3 P cos I{) + a23up3 cos I{) Zernike-Nijboer development (new)
+ a33 u 2 p2 cos 1{)2 blnmu21+mZnm(p) cos ml{) (4.1 a)
W6 =al1lu6 +a112u 4 p2 +a122u 2 p4 Fourier development
+ a222p6 + a113 us p cos I{) blnmu21+mpn cos ml{) (4.1 b)
+a123u3p3 cos I{) + a133 U4 1{)2 cos 2 1{) Standard (classical) development
+ a223 ups cos I{) + a233 u 2 p4 cos 2 I{) alnmu21+mpn cos 2 ml{) (4.1 c)
(3.20) The order N of a single aberration (geo-
metrical or ray) is N = 21 + n + m - 1, and the
4. Classifications of Aberration for Rotational num ber of terms or types of aberration is
Symmetric Systems 2 ,4,S To classify the ordi- M = 1(N + I)(N + 7). The degree L of the poly-
nary (ray) aberrations, we shall examine the dif- nomial Wn giving the order of the wave aberra-
ferent terms of the Wn of the wavefront aberra- tions is L =N + 1 = 21 + n + m, where 1 = 0, 1,
tion function W given by the three expansion 2, ... , n = 1,2, ... ,m = 0, 1,2, ... ,and
forms UI, U2, U3' Each Wn depends on the three n - m is even, making L even and equal to the
optical invariants U1 = X02 + Y02, U2 = xoP + sum of the powers of u and p which appear in
yoq, U3 = p2 + q2, or in u, p, and I{) given by the each individual member of the wave aberration
relations u 2 = U1, p2 = u3, and up cos I{) = U2. function.
The first three terms of W, i.e., W2, W4 , W6 Consider first-order aberrations, i.e., W2 . Since
are given in Tables 1 and 2, from which we can N = 1, there are actually three terms in W2.
obtain the ordinary aberrations of first, third From Tables I and II they are
and fifth orders. A typical term of Wn is Zernike-Nijboer

TABLE 1.
nm 0 2 3

N= 1
1 ap cos '1'(011)
2 p2(020)

N= 3
1 a 3p cos 'I' (111)
2 a2 p2(120)
3 ap3 cos '1'(031)
4 p4(040)

N= 5
1 as p cos '1'(211)
2 a 4 p2(220)
3 a 3p3 cos '1'(131)
4 a 2p4(140) a 4 p2 cos 2 '1'(042)
5 aps cos '1'(051)
6 p6(060)
ABBERATION 8

bIOOa2Z00(p}, bOllaZtl(p)COSIP, From the tables we can write the individual


members of W6 from which we could obtain
bOZOZ20(P) (4.2a) the individual fifth-order aberrations by dif-
ferentiation. *
Fourier We can classify various types of aberrations
from the subscripts appearing in the coefficients
bOil ap cos IP, b020 P2 (4.2b) blnm . The aberrations associated with the coef-
ficients bo zk o(k = 1,2, ... ) depend only on p.
Standard Such membe;s produce spherical aberrations of
order 2k - 1; thus in W4 the term of b040 is
aOll ap cos IP, called spherical aberrations of third order.
Terms whose coefficients are bl,2k+l,1 produce
We shall ignore the term or terms which de- coma of order k, where 1 =0, 1,2, ... , k = 1,2,
pend solely on a. Therefore we have only two For 1 = 0, k = 0 we have a simple distor-
members, whose coefficients are bOil and tion of the image and it is not considered an
bozo, etc. However, these two terms are not aberration. The term with coefficient b 031 is
aberrations in the sense of representing a de- the first-order coma or primary coma, with
formation of the image. They simply shift the bos I secondary or second-order coma, with
focal point, the b 020 term along the principal b 071 third-order coma, etc. However, terms
axis and the bOll in the image plane at (xt. 0, 0) with coefficients bl,,,k+l,l are also coma aber-
from the origin (0, 0, O)-the intersection of rations, where 1= 1,2, ... , k = 1,2, .... To
the principal ray with the image plane. Conse- distinguish those terms with 1 = 0 from the
quently, one does not consider the first-order others with 1 = 1, 2, . .. we call the aberra-
terms as aberrations. tions with I = 0 pure, so all terms with coeffi-
The next order aberrations come from W4. cients bO•2k+l,1 are pure coma aberrations of
These are called third-order aberrations, five in order k, the others mixed comas. Terms of the
all according to the value of N (see above). wave aberration function Wn having for coef-
Again we ignore terms solely in a, since they ficients m=2, 1=0,1,2'00" n=2k, k=l,
do not influence the image pattern. The terms 2, . .. , produce astigmatism of order k. For
in W4 are given below. From the tables, we 1= 0, k = 1, m = 2, the aberration term b022
have: produces first-order or primary astigmatism;
for m = 2, k = 2 secondary astigmatism; etc.
Zernike-Nijboer type When we take m = 3, etc., no particular name
is given this type of aberration. The special
1 = 0, n + m = 4 terms of the type b022 a" p" cos" <p is called
wing aberration, and for m = 3, n = 1, 1= 0, 1,
b040Z40(P) = b 04 o(6IP 4 + 6p2 + I) etc., arrow-type aberrations. For m = 0 and
m = 0, n = 4, 1> 0, the aberration wave function produces
a curvature of the field, thus all the terms
b 031 aZ31 (p) cos IP = b031 a(3p3 - 2IP) cos IP associated with b/,"k,o produce a curvature of
field of order k, producing a change of focus
m=l,n=3, depending on the field given by a 2/•
In comparing the three expansions of W4, the
bozza2Z22(p) cos 2IP= b ozz a 2p2 cos 2IP Z-N development contains additional terms of
m = 2, n = 2, (4.3a) lower power in p. The additional terms displace
the point (focal) to maximum illumination that
1=1,n+m=2 is compensating (balancing) higher-order aberra-
tions with the lower orders. This is an impor-
b I20 a ZZ"O(p) = b I20 a"(2p 2 - 1)
tant advance over the standard expansion of
the wave aberration functions, especially when
m = 0, n = 2, the Z-N expansion is introduced in the phase
function of the diffraction integral, not to men-
b 111 a3 Z II (p) cos IP =bill a 3 p cos IP tion the advantage of employing orthogonal
m=l,n=l,
functions over the domain of integration for
evaluating the diffraction integrals; in most
optical situations, the domain is circular.
Fourier type 5. Diffraction Theory The starting point of
b 040 p4, b 031 ap3 cos IP, b022 a 2IP" cos 2IP,
b lZ oa Zp2, bl1 1a 3pcosIP (4.3b) *SXt=a[wpcosip-Wip s~<p]l _
w - W(xOi, p, '1').
Standard type 6YI =0(
~Wp sm",+
. cos 'I']
W<p -p-
a040P\ a031 ap3 cos IP, a022 a 2pZ cos 2 IP,
O(depends on the distance from the reference surface
a120a Zp2, a111 a 3p cos IP. (4.3c) to (xt. Yl) and the maximum value of p.
9 ABBERATION

the modern theory of diffraction of optical named holography. 7 * In holography, the theory
imaging systems may be traced to the famous of diffraction of aberrations plays an important
paper on the diffraction theory of the phase role in the design of optical and other image
contrast method by Zernike. 1 He and his stu- forming systems to obtain faithful images of
dents made significant advances in the theory objects emitting or illuminated by different
of diffraction. Nijboer2 was able to calculate kinds of radiation-coherent, incoherent or
to an unprecedented degree of accuracy the mixed. The vast usefulness of the diffra;tion
intensity distribution of the field (contours) theory of aberrations is accounted for in the
produced by an optical system affected by similarity of the mathematical differential
several types of aberration of various orders. equations governing wave phenomena and the
Moreover, experiments were performed in analogous solutions, which, subject to certain
Zernike's laboratory by another of his pupils, appropriate conditions, resemble each other in
Nienhuis, to verify the theory. The laboratory mathematical form.
results agreed to a high degree with Nijboer's We would like to refer also to some of the
calculations. Likewise, experiments with micro- earlier significant contributions to the theory
waves were carried out at McGill University by of diffraction of aberrations by Ignatowski,
Bachynski and Bekefi 2 using lenses made of Fischer, Steward, Picht and more recently by
dielectric materials; the results were compared Luneburg and others. 3
with calculations based on Nijboer's formula- The basis of the diffraction theory of aberra-
tion of diffraction of aberrations. Again, com- tions of optical systems is founded on Kirch-
hoff's integral or a modified form of it, namely,

Ii
parison of calculations with experimental ob-
servations surprisingly shows a great likeness
to the contours of the intensity distribution on
the Gaussian plane. Further investigations have U(P) = - i~; ...;Kuo(Q)
shown the usefulness and the advantages of
the Zernike-Nijboer formulation of the dif-
fraction of aberrations over the earlier formu- . exp ik[W + (r' s)] dS (5.1)
lations in other fields than optics. where Uo(Q) is the value of the field on the
The Zernike-Nijboer formulation of diffrac- wave surface (front) S after passing through the
tion of aberrations rests on the expansions of optical system, usually located in the image
the wave aberrations function in a series of space; n, K, and r are, respectively, the refrac-
orthogonal functions, known as Zernike poly- tive index of the medium in image space, the
nomials, which are orthogonal over a circular Gaussian curvature of the wavefront, and the
region. Since the geometrical shapes of aper- distance vector from the point Q, located on S,
tures or stops in lens systems, microscopes, to the image point (observation point) P. The
cameras, or telescopes, are circular, this form function W is Hamilton's mixed characteristic
of expansion is the most appropriate one for of the optical system which depends on the
treating diffraction, scattering and other prob- coordinate of the object and the optical direc-
lems including problems of vibrating elastic tion of the ray at Q, namely the vector s. Here
systems. The advantage of introducing this the field represented by Uo(Q) is the geomet-
kind of expansion in the diffraction integrals rical optics wave solution of the harmonic scalar
lies in the simplification achieved not only in equation or a component of the vector wave
the evaluation of the integrals, but also on equation derived from Maxwell equations. Here
account of the orthogonality of the polynomials U 0 is considered a scalar quantity, or a compo-
over the domain of integration; the coefficients nent of the vector field.
of the expansion do not mix with each other A more convenient form of (5.1) used fre-
when the aberrations are small. The extension quently in actual problem is 3,4t
to large aberration was carried out by van
Kampen. 2
Since then, great advances have been made,
both in theory and experimental observations
leading to important applications in the im-
ik
u(P) = - -
21T II g(p . q)

provement of optical instruments. This success . eik[W+(r's») dp dq, (5.2)


has been paralleled in other fields of science where
such as in electron optical image formation, in
microwave optics, and in the design of image g(p, q) = ~ Uo(p, q). (5.3)
formation apparatus employing high- or low-
frequency waves (radio, infrared, ultraviolet, The symbol A denotes the discriminant of the
X-rays, and even acoustic waves). Significant second differential form of S, i.e., A =LN - M2,
contributions have been made in the design with L, M, N given by the expressions
of microwave lenses (dielectric), microwave p2 + r02
imaging instruments such as satellite cameras, nL = Wpp - n2r02 Z
mirrors for collecting (exploring) information
on sources of electromagnetic radiation of dif-
ferent frequencies from distant objects such *Recently acoustic waves have been used for wavt7
as planets, stars, and galaxies, as well as in front reconstruction imaging. 9
the development of the new branch of optics t(r ' s) =xp + yq + z.Jr'n"2-_-p"2-_-q""'2.
ABBERATION 10

pq torus, which is circular. Since the fields are not


nM = Wpq - 2 3 Z (Z = pWp + qWq - W), only functions of space but of time as well
n ro
(periodic in time), the electron-optical analysis
q2 + r02 of the system will involve aberrations both in
nN= Wqq - 2 2 Z. (5.4) space (geometrical) and in time (stability). In
n ro
addition, as the charged particles are accelerated
where s = (p, q, ro) are the optical direction co- the energy varies, and this variation introduces
sines of a ray on ~ with r02 = .Jn 2 - p2 - q2. A a continuous variation in the de Broglie wave-
point P(x, y, z) of ~ is given by the equations length of the particles, which introduces an ad-
ditional kind of aberration known in optics as
x=-Wp+"A{p , q)p, y=-Wq+"A(p,q)q, chromatic aberration. 7 Therefore, the problem
here involves not only geometrical aberrations
z=-"A(p,q)ro (5.5) (stability), but also chromatic aberrations, which
interact with each other, making the analysis
where "A(p, q) is the distance from P to the very complex.
image point. In microwave focusing 8 (spectroscopy, mi-
In practice one takes as the surface of inte- croscopy) and especially in the construction of
gration the aperture of the exit pupil. The in- giant mirrors and telescopes for exploring the
tegrals (5.1) and (5.2) form the basis of the sources of radio, ultraviolet, and X-rays in outer
diffraction theory of aberrations. They give all space, care is taken in the design to minimize
the information about the image field or the the aberrations, in particular spherical aberra-
intensity distribution at any point in image tions and astigmatism, caused by temperature
space. They are known as Debye-Picht-Lune- variations on the surface of these instruments.
burg Integrals. Recently, sound and ultrasound waves 9 have
Evaluation of the integrals (5 .1) or (5.2) is been used in studying structural defects in solid
difficult even for simple imaging systems, on materials, and in the exploration and surveying
account of the presence of W, which is not of the ocean floor and land masses. In addition,
known explicitly except for simple optical in- the recent development of apparatus used for
struments. In addition, the standard expansion imaging biological matter, in particular the
of W makes the analysis of the field distribution internal organs of the human body,IO is a strik-
or equal illumination contours on the image ing example of the advances which have been
plane difficult. This is simplified when W is made in extending the optical ideas of imaging
expanded according to the Zernike-Nijboer 2 or systems to the whole wave spectrum, including
Fourier procedures. elastic and sound waves on the one hand and
The problem of spherical aberrations of any charged particles on the other, thanks to the
order has been treated exhaustively by Boivin, dual nature of particles and waves. I I
not only for circular apertures, but for annular
apertures as well. The calculation of the dif- NICHOLAS CHAKO
fracted field in the presence of higher-order
aberrations, including the more general prob-
lem of nonsymmetric optical systems, is given References
in reference 3.
All the methods discussed above are valid 1. Zernike, F.,Physica, 1,689 (1934).
only for small aberrations. For large or moder- 2. Nijboer, B. R. A., "The Diffraction Theory of
ately large aberrations, one must resort to Aberrations," Groningen thesis, 1942. For the
asymptotic methods, which at present are suf- experimental part, see the thesis by Nienhuis, K.,
ficiently developed to include most of the inter- Groningen, 1948. For microwave experiments see:
esting cases occurring in the theory of diffrac- Bachynski, M. P. and Bekefi, G.,IRE Trans. , AP-4,
tion of optical systems. When these analytical No.3 , 412 (1955). "Studies in Microwave Optics,"
methods are combined with the present prog- McGill Univ. Tech. Rept., 38 (1957).
ress in computational methods, the intensity 3. Steward, G. C., "The Symmetrical Optical Sys-
distribution produced by an optical system can tem," Cambridge, Cambridge Univ. Press, 1928.
be calculated to any desired degree of accuracy. Picht, Johannes, "Optische Abbildung," Braun-
There are many branches of science in which schweig, 1931.
aberrations play an essential, though "negative," Luneburg, R. K., "Mathematical Theory of Op-
role, wherever image forming systems are in- tics," Providence, R.I., Brown University, 1944.
volved or stability of dynamic systems is studied. Reproduced by the University of California Press,
In the theory of reconstruction of wavefronts,6 Berkeley and Los Angeles, 1964.
spherical aberration and astigmatism have to Linfoot, E. H., "Recent Advances in Optics,"
be controlled in order to reconstruct the object London, Oxford Univ. Press, 1955.
structure accurately. In high-energy accelera- Born, M., and Wolf, E., "Principles of Optics,"
tors (linear, circular), especially in toroidal ac- New York, Pergamon Press, 1959.
celerators for achieving thermonuclear fusion Boivin, A., "Theorie et Calcul des Figures de
(deuterium-tritium plasma ignition), both the Diffraction de Revolution," Paris, Les Presses de
accelerating field and the magnetic field are l'Universite Laval, Quebec and Gauthier-Villars,
used to confine the beam of particles within a 1964.
tubular region close to the axis (optical) of the Marechal, A., and Franc;on, M., Diffraction Struc-
11 ABSORPTION SPECTRA

ture des Images, Paris, Masson et Cie, Editeurs, Tomography," Academic Press, New York, 1980.
1970. Gruning, J. R. Medical Physics, Proc. Intern.
Fran~on, M., in "Handbuch der Physik," Vol. 24, School of Physics "Enrico Fermi," 1981, Vol. 76,
Berlin, Springer, 1956. Varenna, Italy.
Chako, Nicholas, "Contribution ala Theorie de la II. Synge, J. 1. "Geometrical Mechanics and de
Diffraction," Centre d'Etudes Nucleaires de Broglie Waves," Cambridge Univ. Press, 1954.
Saclay, CEA-R-3151, Saclay, France, 1969.
"Etudes sur les Developpements Asymptotiques Cross-references: DIFFRACTION BY MATTER AND
des Integrales Multiples de la Physique Mathema- DIFFRACTION GRATINGS; LENS; OPTICAL IN·
tique" (large aberration diffraction), CEA-R-3263, STRUMENTS; OPTICS, GEOMETRICAL; OPTICS,
PHYSICAL; VELOCITY OF LIGHT.
Saclay, France, 1968.
4. de Broglie, 1. "Optique Electronique et Corpus-
culaire," Paris, 1950. ABSORPTION SPECTRA
Glasser, W. "Grundlagen der Elektronenoptik,"
Berlin, 1955. Spectroscopy is the measurement of the amount
Sturrock, P. "Static and Dynamic Electron Op- of light, or other radiant energy, transmitted,
tics," Cambridge, 1955. absorbed, or emitted by a sample of matter, as
5. Buchdahl, H. "An Introduction to Hamiltonian a function of the frequency, or wavelength, of
Optics," Cambridge, 1970. the radiation. Absorption spectra provide
Herzberger, M., "Geometrical Optics," Academic information on individual atomic or molecular
Press, New York, 1958. units and their immediate environment. Thus
Pegis, R. J., "The Modem Development of Hamil- it is possible to determine what species are
tonian Optics," Progress in Optics, I, Ed. E. Wolf, present in a sample and in what form they are
Amsterdam, North Holland Publ. Co. (1961). present, including the sizes and shapes of
Focke, J., "High Order Aberration Theory," individual molecules and often the environ-
Progress in Optics, IV (1965). ments of the molecules. The information
"Handbuch der Physik," Vol. XXIX, Ed. S. obtained directly is the amount of absorption
Flugge, Berlin, Springer Verlag, 1967. Articles by or emission, related to the quantity present
Welford, Walter T., "Optical Calculations and Op- and the probability of the radiative transition,
tical Instruments, An Introduction"; Marechal, and the frequencies absorbed or emitted, which
Andre, "Methode de Calcul des Systemes Op- are proportional to the separations of the energy
tiques"; Helmut, Max, "Theorie der Geomel- levels in the sample. Spacings of energy levels
risch-Optischen Bildfehler." are equal to Planck's constant times the fre-
Hawkes, P. W., "Quadrupole Optics, Electron quency of the radiation, AE = hll, and the
Optical Properties of Orthogonal Systems," frequency is related to the vacuum wavelength
Springer Series, "Tracts in Modem Physics," Vol. A by All =e, where c is the speed of light in
42, Berlin, 1966. vacuum. In practice, energy information is often
See also the Luneburg, Chako, Linfoot, and Born expressed as a vacuum wavenumber, a(or v),
and Wolf titles listed under Ref. 3. which is the number of waves per unit oflength,
6. Gabor, D. Microscopy by reconstructed wave- and hence the reciprocal of the wavelength; a =
fronts, I, Proc. Roy. Soc. A197, 454 (1949); II, l/A = Il/e. Wavelengths are measured in meters
Proc. Phys. Soc. 64, 449 (1951). or fractions of meters (e.g. nanometers, nm);
Stroke, G. S. "An Introduction to Coherent Op- wavenumbers are measured in waves per meter
tics and Holography," Academic Press, New York, or, more often, in waves per centimeter, cm- 1 .
1966. Spectroscopic studies extend across the elec-
Okoshi, T. "Three-Dimensional Imaging, Tech- tromagnetic spectrum. Gamma rays and x-rays
niques," Academic Press, New York, 1976. provide information about particles inside the
7. Septier, A. "Focusing of Charged Particles," Vols. nucleus and about inner-shell electrons of
I and II, Academic Press, New York, 1967-68. atoms. Ultraviolet and visible radiation interact
Baber, R. K., and Cosslett, V. E. (Eds.), "Ad- primarily with electrons in the outer shells of
vances in Optical and Electron Microscopy" atoms and molecules. Infrared and microwave
(serial), Vol. I, Academic Press, New York, 1966. frequencies produce molecular vibrations and
8. Cornbleet, S. "Microwave Optics," Academic rotations. Radio-frequency spectroscopy mea-
Press, New York, 1976. sures energies of reorientation of nuclear spin
Proc. Can! Electron Microscopy, Kyoto, 1966. angular momenta in magnetic fields. The present
9. Greguss, D. "Ultrasonic Imaging," Focal, New article will consider only absorption of infrared,
York, 1981. visible, and ultraviolet radiation by molecules,
Bergmann, P., et al., "Physics of Sound of the omitting Mossbauer and atomic absorption,
Sea," Parts I-IV, Gordon and Breach, New York, fluorescence, and spirt resonance techniques, as
1967-68. well as microwave and Raman spectroscopy.
Wade, Glen (Ed.), "Acoustic Holography" (serial), Experimental Methods Conventional spec-
Vol. IV, Plenum Press, New York, 1972. troscopy relies upon dispersion of radiation, or
10. Coulon, M., et al., "The Physical Basis of Medical separation according to frequency, by a prism
Imaging," Appleton-Century-Crofts, New York, or grating monochromator. Most current instru-
1981. ments follow similar principles of design. In a
Herman, G. T., Image reconstruction from pro- Littrow mount, radiation passing through the
jections, in "Fundamental of Computerized entrance slit is collimated, or converted to
ABSORPTION SPECTRA 12

approximately parallel rays, by a spherical or on the beam that is different for each frequency.
paraboloidal mirror. The collimated beam is Taking the Fourier transform of the signal,
passed through a prism and returned by a with a computer, generates the spectrum as
reflecting surface through the same prism, or intensity vs. frequency. When applied to infrared
is reflected by a plane grating, returning to spectroscopy the technique is commonly called
the collimating mirror and from there to the exit FTIR. Interferometers have a substantial energy
slit, from which it is focused by an ellipsoidal advantage over dispersive instruments. They can
mirror onto a small detector. The Czerny-Turner accept radiation from a larger aperture, and
mount employs two spherical mirrors in place because the entire spectrum is examined at
of the single collimating mirror. These are side once, rather than measuring small frequency
by side, with the grating lying along the bisector increments sequentially, there is an energy gain
of the line between them so that, except for for thermal detectors that increases with the
the angular position of the grating, the arrange- number of resolved frequency intervals in the
ment resembles a Littrow system and its spectrum. This second factor, often called
mirror image, sharing a common grating. An Fellget's advantage, disappears for photon
Ebert monochromator substitutes a single detectors because the noise increases with total
large spherical mirror for the two mirrors of radiation striking the detector surface.
the Czerny-Turner design. If a concave grating The newest experimental approach takes
is substituted for the plane grating, no other advantage of tunable sources, eliminating the
lenses or mirrors are required. The slit, grating, need for a monochromator or interferometer.
and image point will lie on a large circle called First applied in microwave spectroscopy, the
the Rowland circle. Some instruments incorpo- technique has now been extended to the infrared
rate a large part of the circle; others include and visible regions with tunable dye lasers and
little more than the three critical points inside diode lasers. Photoacoustic spectroscopy is a
the case, shifting the circle with respect to the variation in which the absorption of radiation
instrument, by rotating the grating, instead of produces expansion of the absorber, and thus
moving components around the circle. an acoustic signal.
Prisms offer good dispersion properties for Conventional sources for the ultraviolet
the visible and ultraviolet regions, but are region are hydrogen, deuterium, or xenon
generally inferior to gratings for the infrared discharge lamps. For the visible and near
region and for highest resolution spectra. infrared regions, tungsten filament lamps are
Most gratings have been made by ruling an satisfactory. Common infrared sources include
aluminum surface with a diamond tip, then a hollow cylinder of refractory oxides (Nernst
casting replicas of the ruled surface in plastic. glower), a rod of silicon carbide (Globar),
Gratings with lower levels of scattered light alumina tubes, Nichrome wires, and, for the far
may now be produced holographically, both infrared, silica jacketed mercury arcs.
plane and concave. For very high resolution, The standard detector for the visible and
a coarse grating can be employed in a very ultraviolet region is the photomultiplier, which
high order (an echelle grating) with cross has almost entirely replaced photographic
dispersion by another element to separate emulsions. For the near and medium infrared,
the orders. Low resolution infrared spectra may solid state photoconductors (PbS, PbSe, Au-
be obtained with a wedge interference filter. doped Ge, etc.) provide high sensitivity, espe-
Some research instruments employ only a cially if cooled. Triglycine sulfate, a pyroelectric
single beam passing through the sample and detector, and the Golay pneumatic cell, a ther-
monochromator to the detector, but nearly all mal detector, have advantages for the far
commercial instruments are now of double-beam infrared. Thermal detectors, including thermo-
design, with a reference beam serving to correct couples and bolometers, are less sensitive than
for solvent and atmospheric absorption and, photon detectors, such as cooled photocon-
especially, for variations in source intensity ductors, for high frequencies but are usable
with frequency. These may be compared by an over the entire spectrum.
optical null system that automatically attenuates Solid samples and strongly absorbing liquids
the reference beam to match the sample beam, or are usually dissolved in a solvent. Water is
by an electronic comparison of beam intensities. transparent in the ultraviolet (to beyond 200
This is increasingly being accomplished with nm, or 50,000 cm -1). Other suitable solvents,
digital electronics under microcomputer control. in order of decreasing range, include ethanol
To avoid sample decomposition, ultraviolet and ether, saturated hydrocarbons (e.g. isooc-
spectrometers disperse the beam before the tane and cyclohexane), methylene chloride,
sample, whereas infrared spectrometers usually chloroform, and carbon tetrachloride. Absorp-
place the sample before the monochromator tion cells usually have a path length of I cm.
to avoid displacements of the image of the exit There are no good transparent solvents in the
slit on the small detector surface. infrared, but carbon tetrachloride is clear over
A very different design concept employs a large region and carbon disulfide is transparent
interferometry to provide the coding by which in most regions where carbon tetrachloride is
frequency information is extracted. The beam not, for cells less than I mm thick. The thin
is not dispersed, but the movement of a mirror spacings required, coupled with lack of dura-
with time superimposes an interference pattern bility of most infrared cell windows, makes
13 ABSORPTION SPECTRA

maintenance of infrared cells a significant predict which transitions may be active in


problem. absorption for various possible structures.
Sample cells for the visible region may be of Comparison of observed with predicted spectra
glass or any other transparent material, but then usually allows a determination of the
special glasses or, especially, quartz or fused actual structure of the molecule.
silica are required for the ultraviolet. (CaF 2 , Prediction of intensities is much more difficult,
NaCI, and other materials are also transparent depending more strongly on detailed models
but are not competitive because of price, work- for the absorber, but successful correlation of
ability, and durability disadvantages.) Infrared intensities and structure can therefore provide
cells are typically made with NaC! windows significant additional information.
(transparent to about 625 cm- 1 ) or other alkali Analysis of low-frequency spectra of crystals
halides (KBr, CsBr, CsI). For selected regions, relies on the symmetry of the unit cell. Vibra-
quartz or silica, calcium fluoride, magnesium tional motions, called phonons, travel through
fluoride, and certain special glasses (e.g., arsenic the crystal and may be analyzed according to
compounds) are advantageous. Polyethylene and the theory of transmission lines with discrete
silica are transparent in the very far infrared. elements. The shapes of absorption bands are
Rolled AgCl and KRS-5, a mixed thallium determined primarily by the densities of states.
bromide-iodide, are impervious to water and If phonon energy is plotted against phonon
transparent over a large part of the infrared. wavenumber in the crystal, the curves are flat
Solid samples that cannot be dissolved or for certain wavenumber values (often near the
prepared in thin enough sections for transmission boundaries of the Brillouin zones), so the
measurements may be immersed in a medium probability of absorption at these energy values
of similar refractive index (e.g., mulled in is significantly increased. The energy determines
mineral oil or pressed in KBr or other soft mate- the frequency of the external radiation that
rial into a pellet) or examined by attenuated will be absorbed. Symmetry arguments in
total internal reflectance. A half-rod or prism crystals are modifications of those for isolated
of transparent material with a refractive index molecules, involving lattice groups as well as
greater than that of the sample is placed against point groups.
the sample and the beam is totally internally Selection rules may also be derived on the
reflected at the interface. As the sample index basis of conservation of angular momentum. A
changes with frequency, because of absorption photon carries one unit (h) of angular momen-
bands, the total reflectance is modified, giving a tum, and thus adds this unit to the molecule
spectrum that resembles the absorption spec- upon absorption. Depending upon the nature of
trum. The method shows high sensitivity, the molecular transition, this angular momentum
especially for thin films where the prism acts will change the rotational state, or a degenerate
as a light pipe to give multiple internal reflec- vibrational state, or the orbital momentum of
tions from the surface. an electron. Spin angular momentum is seldom
Structure Determinations To a good approx- affected, except insofar as it is coupled with
imation, the electronic energy of a molecule other motions. Because angular momentum
may be found by ignoring the motions of nuclei; follows vector addition rules, the angular
the energy obtained by solving the Schroedinger momentum of the molecule can change in
equation then contains an error, as a function magnitude by ± 1 or 0 units, unless it is initially
of positions of nuclei, that is just the vibrational zero. Group theory provides additional insight
potential function. Similarly, the vibrational into the axis along which the angular momen-
energy may be evaluated by ignoring rotations. tum can change.
This separation of the wave function is called Models of the electronic structures of mole-
the Born-Oppenheimer approximation. It allows cules are usually based upon one-electron wave
first-order interpretation of spectra in different functions, called orbitals. Some examples
regions as if they were solely electronic, vibra- include the united-atom and separated-atom
tional, or rotational in origin. models for diatomic molecules and their
The primary theoretical tool of the molecular extension to polyatomics, the free-electron
spectroscopist is group theory, applied to the model for unsaturated and aromatic structures,
symmetry properties of the system understudy. and Huckel calculations for aromatics. In addi-
Absorption of radiation normally requires a tion, theoretically based rules have been given
change in dipole moment of the absorber by Walsh and by Woodward and Hoffman that
during the transition. The intensity of absorp- predict which orbitals will be occupied and the
tion is determined by the transition moment, effects on molecular shapes and reactivities.
Ab initio calculations, employing self-consistent
'I1l M'I1j dr, the integral over initial and field approximations, have been carried out for
final state wave functions and the dipole small molecules, providing a theoretical basis
moment operator. From symmetry arguments for extrapolations to larger systems.
it can be shown that certain transitions must Qualitative Analysis Molecular vibrations
have zero transition moment (at least in the can be described as superpositions of normal,
usual approximations, such as isolated harmonic or independent, modes of vibration. Each
oscillators or one-electron wave functions.) normal mode involves the entire molecule and
These conditions provide selection rules, that has the symmetry of one of the symmetry
ABSORPTION SPECTRA 14

species, or irreducible representations, of the Despite advances in theory underlying absorp-


point group that describes the ground-state tion spectra, most qualitative identifications
symmetry of the molecule. Nevertheless, certain rely on empirical correlation of observed spectra
normal modes are affected primarily by specific with the spectra of known compounds. In the
functional groups within the molecule. For ultraviolet and visible, relatively few bands are
example, C-H stretching mo.des appear at a observed and these are often broad and some-
wavenumber near 3000 cm- I -, somewhat higher times featureless. Nevertheless, the positions
if the hydrogen is near an electron-rich group offer significant information on possible
such as an aromatic ring, an oxygen, or a absorbers, which can be supplemented by
chlorine, and generally a little lower for aliphatic changing sample conditions. For example, a
C-H. Stiffer bonds absorb at higher frequencies. nonpolar molecule such as benzene will show
N-H and O-H are near 3300-3700 and S-H little frequency shift with change of solvents;
around 2500 cm- 1 ; C=O is near 1650-1750, a polar compound such as a ketone will show
compared with C-O, closer to 1000-1200 an appreciable shift; and addition of acid to
cm- I . Larger mass moves the vibration to lower an amine will produce the amine hydrochloride,
frequencies; C-Cl stretching bands appear near which has a totally different absorption spec-
600-900 cm- 1 and C-I lower yet. Vibrational trum. Extensive compilations of electronic
absorptions that always tend to appear in the spectra for comparison are now available.
same place are called characteristic frequencies. Quantitative Measurements Except for beams
By contrast, low-frequency vibrations, including of unusually great intensity, the absorption is
modes involving primarily bending motions, determined by the Lambert-Bouguer law or
and stretches that may couple with adjacent Beer's law. The fractional intensity lost in any
bonds, such as C-C in aliphatic chains, are small increment of sample thickness is the
much more variable in position, depending on same, and is proportional to the thickness incre-
the particular molecule in which they occur. ment and to the concentration of the absorbing
Thus the low-frequency end of the spectrum is species. This leads to the idealized law, for
called the fingerprint region. Often the nature monochromatic radiation,
of a' compound can be determined from the
characteristic frequencies and the exact com- 1 = 10 lO-abc
pound then identified by comparing the finger-
print region with spectra of known compounds. where a is the absorptivity of the absorbing
One must be particularly carefulin identifying species, present at concentration c, in a cell of
spectra of gases and solids. Rotational structure length b. In practice, however, deviations are
can appear in the gas phase, especially for small often observed. If the absorptivity varies over
molecules, that depends to some extent on the the frequency spread of the beam, the more
gas pressure. Spectacular misinterpretations of strongly absorbed frequencies will be depleted,
spectra have been reported when different and the absorptivity will therefore decrease
rotational branches of a single absorption band with sample path length. Chemical and physical
have been assigned to different molecular interactions in the sample may modify the
absorbers. Often solids will give spectra very nature of the absorbing species, causing the
similar to solution spectra, but the spectrum absorptivity to vary with concentration. Thus
may also vary markedly depending upon it is necessary to check each sample to determine
crystal structure and orientation of crystallites whether Beer's law is obeyed before attempting
with respect to the beam. The halide pellet tech- a quantitative analysis. So long as the absor-
nique tends to introduce spurious hydroxyl bance, A =abc = -log (transmittance), is addi-
bands and the high pressure or chemical reactions tive for all absorbing species present, non-
sometimes alter the spectrum significantly. linearities in the calibration curve of measured
A first approximation to explanation of elec- absorbance vs. concentration can be accommo-
tronic absorption spectra, in the visible and dated by graphical techniques or successive
ultraviolet, was provided by the model of approximations.
chromophores and auxochromes. Certain struc- Any number of components may be deter-
tural fragments (chromophores) were recognized mined simultaneously by making measurements
that cause absorption, For example, a single at the same number of frequencies, but accuracy
aromatic ring (benzene) absorbs weakly around will suffer unless several conditions can be
250 nm (40,000 cm- I ), as does a carbonyl or satisfied. There should be significant differences
nitro group. Substituents (auxochromes) on the of absorptivity of the various species at the
molecule shift the absorption, most often to frequencies selected. No absorptivity should
lower frequencies. For example, methyl, chioro, change rapidly with frequency at any of the
hydroxy, methoxy, amine, and acetyl sub- analytical frequencies. If an absolute background
stituents on the benzene ring produce shifts, (10, or A = 0) line is not available, the reference
increasing in the order given, of the strong background line must be drawn between fixed
absorption band initially near 200 nm (49,000 frequencies, rather than between similar features
cm- 1 ). Although the chromophore-auxochrome (e.g., transmittance maxima) of the curves.
model was initially empirical, theoretical justifi- Greater accuracy in a multicomponent
cations of many of the absorption bands and analysis is achievable by making measure-
frequency shifts are now possible. ments at more frequencies than the number of
15 ACCELERATOR, PARTICLE

unknowns. The analytical equations may be Vol. 3, "Electronic Spectra of Polyatomic Mole-
expressed in matrix form, A = KC, where A cules," 1967.
is a column matrix of measured absorbance J. W. Robinson, "Handbook of Spectroscopy," Vol.
values at the several analytical frequencies, C II, CRC Press, Inc., Cleveland, Ohio, 1974.
is a column matrix of the (unknown) concen- R. M. Silverstein, G. C. Bassler, and T. C. Morrill,
trations of the species present, and K is a "Spectrometric Identification of Organic Com-
rectangular matrix of calibration values, one pounds," 4th Ed., John Wiley & Sons, Inc., New
measured value (ab) at each analytical frequency York, 1981.
for each species. Multiplication of each side on B. P. Straughan and S. Walker (Eds.), "Spectroscopy,"
the left by the transpose of K gives KA =A' = Vols. 2and 3, Chapman & Hall, London, 1976.
KKC = K'C, where A' has the same dimension
as C and K' is a square matrix. Solution of A' = Cross-references: ATOMIC SPECTRA; COLOR;
K'C gives the best values of the concentrations MOLECULAR SPECTROSCOPY; OPTICS, PHYSI-
to a least squares approximation. CAL; POLARIZED LIGHT; RADIATION, THER-
Quantitative analysis is usually much more MAL: RAMAN EFFECT AND RAMAN SPECTRO-
difficult for solid and gaseous samples than for SCOPY; REFRACTION; SPECTROSCOPY; X-RAYS.
solutions. Solid samples are generally not
spatially homogeneous, so Beer's law cannot
ACCELERATOR, PARTICLE
be expected to apply. However, if the sample
has been very finely divided before mulling in Particle accelerators are electromagnetic de-
oil or pressing in a transparent matrix, the vices used to generate energetic beams of charged
deviations may be small enough. It is often particles-electrons, protons or other ions. They
necessary to resort to some form of internal are widely used in research in many fields of
standard, a compound of known absorptivity physical science and they have many practical
added in known concentration to the sample, applications in medicine, manufacturing, and
so that the amount of sample in the beam can engineering.
be determined. The earliest forerunners of particle accelera-
Gases are spatially homogeneous but because tors were the gas discharge tubes and x-ray tubes
of rotational fine structure may appear inhomo- of the late 1800s. They provided some of the
geneous with respect to frequency. The beam early technological base for accelerator devel-
typically averages transmittance over several opment, but they were special in purpose, very
narrow absorption bands and the gaps between, limited in voltage, and did not provide the moti-
so the observed absorbance depends strongly vation for accelerator development.
on the widths of the absorption bands, which The need for particle accelerators became
depend, in turn, on the pressures of the gases apparent in the 1920s after Rutherford had
in the sample. The apparent absorptivity demonstrated the existence (1911) and the
increases as the gas pressure increases (i.e., disintegration (1919) of the atomic nucleus
absorbance increases faster than pressure, for using alpha particles from a radioactive sub-
given cell length) because pressure broadening stance to probe the structure of the atom.
makes the sample more nearly homogeneous Clearly these experiments gave promise of a
with respect to frequency over the spectrometer radical new understanding of the nature of
bandwidth. matter. But just as clearly , particles from radio-
Stray radiation , scattered within the mono- active substances were not adequate for the
chromator and reaching the detector mixed task of exploring atomic and nuclear structure.
with the signal at another frequency, also may Beams from such sources were very limited in
cause quantitative errors. The problems are int ensity, poor in collimation, lacked control
most severe near the ends of the spectrometer of energy, and were limited to i3-rays (electrons)
range, where the signal intensity is low. and a-particles (helium nuclei) . Reviewing the
ROBERT P. BAUMAN need for particle accelerators and the techno-
logical base for developing them , Rutherford
stated in a famous address in 1927 :
References It would be of great scientific interest if it were
L. J. Bellamy, "The Infra-red Spectra of Complex possible in laboratory experiments to have a supply
Molecules," Chapman and Hall, London, 1975. of electrons and atoms of matter in general, of
N. B. Colthup, L. H. Daly, and S. E. WiberJey, "Intro- which the individual energy of motion is greater
even than that of the or-particle. This would open
duction to Infrared and Raman Spectroscopy," 2nd up an extraordinarily interesting field of investiga-
Ed., Academic Press, New York, 1975. tion which could not fail to give us information of
J. G. Grasselli and W. M. Ritchey (Eds.), "CRC Atlas great value, not only on the constitution and sta-
of Spectral Data and Physical Constants for Organic bility of atomic nuclei but in many other direc-
Compounds," 2nd Ed., 6 volumes, CRC Press, tions . . ... [Blut it is obvious that many experi-
Cleveland, Ohio, 1975 . mental difficulties will have to be surmounted
G. Herzberg, "Molecular Spectra and Molecular before this can be realised.
Structure," Van Nostrand Reinhold, New York, Inventors and experimenters were already at
Vol. I , "Spectra of Diatomic Molecules," 2nd Ed., work to overcome the difficulties with a variety
1950; Vol. 2, "Infrared and Raman Spectra," 1945; of approaches: electrostatic devices, Tesla coils,
ACCELERATOR, PARTICLE 16

~ ,,~.4'//1
100 TeV tered in the realization of this concept are con-
nected with generating the high voltage and
with electrical breakdown both internally and
10 TeV
externally in the accelerating tube.
J. D. Cockcroft and E. T. S. Walton, who were
1 TeV
"0'" "O.. the first to achieve nuclear disintegration by
(equivalen1 en~rgy)
~/ electrically accelerated particles in 1932, used
>- a voltage-multiplying rectifier circuit of four
"ffi 100 GeV stages which effectively charged capacitors
z in parallel and discharged them in series to reach
w Proton
::!
;:; 10 GeV synchrotron f Electron linac about 400 kV. They used a glass accelerating
III tube which was divided into two segments with
a:
o
Electron / Synchrocyclotron an intermediate electrode at mid-potential. The
!< 1 GeV Proton Ijnae
ion source was a low-voltage discharge in hydro-
a:
w Betatron / gen yielding a supply of protons. This type of
-'
w accelerator has been continuously improved by
.
8100 MeV increasing the number of power supply stages
and the segments of the accelerating tube, in-
10 MeV creasing the frequency of the charging circuit,
generator immersing the voltage supply and the acceler-
ating tube in insulating fluid or pressurized gas
1 MeV
Rectifier generator
to reduce breakdown and other variations and
refinements. The Cockroft-Walton accelerator
100keVL----L----L----L--~----~~
remains in use today in many applications up
1930 1940 1950 1960 1970 1980 to a few Me V and as a preaccelerator typically
operating at 750 keY to inject a beam into a
FIG. 1. The maximum energy of each type of accelera- higher-energyaccelerator.
tor is plotted against the year it was achieved. For The other application of the simple concept
colliding beams, the energy of an equivalent fIxed tar- of direct application of voltage to an acceler-
get accelerator is plotted. Maximum energy has in- ating tube that has had enduring success is the
creased by about 8 orders of magnitude in 5 decades. Van de Graaff or electrostatic accelerator. In
this accelerator the potential is supplied elec-
trostatically by an insulating belt transporting
transformers, voltage multipliers, radio-fre- charges between ground and a large, usually
quency resonance acceleration, even atmos- spherical, high-voltage terminal. R. J. Van de
pheric electricity. By the early 1930s several Graaff demonstrated an electrostatic generator
types of particle accelerators had been invented of this type in 1931 and the first application to
and successfully used in nuclear experiments. nuclear studies was made by a group headed
The electric force is the only macroscopic by M. A. Tuve at the Carnegie Institution in
force strong enough to accelerate particles. The Washington in 1933 with a beam of 600 kV
magnetic force, while also strong, is exerted protons. This type of accelerator has also been
perpendicular to a line of motion, and thus may subject to many improvements and variations.
be used to change the direction of, but not to The charging belt and the accelerating tube may
accelerate, particles. Applying these forces, be vertical or horizontal. After the earliest ex-
many kinds of accelerators have been developed, amples these accelerators were invariably housed
but all are based on a few physical concepts. in pressure tanks to exploit the superior voltage
The emergence of these concepts will be used holding properties of various gases at high pres-
as a framework to discuss the principal types sures. Refinements in the design of segmented
of accelerators. accelerating tubes and the use of shells at inter-
Direct-Potential-Drop Accelerators The sim- mediate potentials between the grounded pres-
plest particle accelerator concept entails a source sure tank and the high voltage terminal have
of charged particles at one end of an insulating been the main factors leading to reliable opera-
evacuated tube and a source of high voltage tion of Van de Graaff accelerators at over 10
placed across the ends of the tube. The parti- MeV. More than twice that voltage is achieved
cles are accelerated from the source end of the in the tandem design in which negative ions are
tube to the target end by the electric field and accelerated from ground potential to a positive
gain kinetic energy. The energy of accelerated terminal where they are stripped of electrons to
particles is universally expressed in electron- form positive ions and then further accelerated
volts, eV, the energy gained by a particle bear- to ground potential.
ing a charge equal to that of an electron acceler- Both the Cockcroft-Walton and the Van de
ated through a potential difference of one vo1;t. Graaff accelerators are characterized by good
The energies that are of interest in nuclear regulation, easy control of voltage, and excel-
studies are several hundred kiloelectron-volts, lent beam collimation. They also work with
ke V, and upwards without a presently preceived either sign of charged particle and a compara-
limit. The technological difficulties encoun- tively simple change of ion source may permit
17 ACCELERATOR, PARTICLE

acceleration of partially or completely stripped to provide a changing flux linking the orbit to
nuclei of any element. accelerate the ions were demanding, and success
Resonance Acceleration The concept of was not achieved until 1941 when D. Kerst
resonance acceleration, that is, repeated acceler- demonstrated electron acceleration in his beta-
ation by radio-frequency power at relatively tron. The betatron was immediately recognized
low voltages to produce high-energy particles, as being very well adapted to accelerate electrons
made it possible to avoid the severe technolog- for high-energy x-ray production, and it was
ical problem of electrical breakdown at high commercially developed and extensively used
voltage. R. Wideroe in 1928 demonstrated the for that purpose. A typical energy was about 25
principle with a single tubular electrode sup- MeV. It was also developed as a research tool to
plied with radio-frequency power mounted an energy of about 300 MeV but was soon
between two grounded electrodes. Sodium ions, rendered obsolete by other developments.
Na +, were accelerated into the electrode while More recently, special purpose linear induc-
it was negative, passed through, and further tion accelerators have been developed to pro-
accelerated at the other end when the potential vide very intense, short bursts of electrons at
of the tube became positive, thus attaining an energies of a few MeV.
energy corresponding to twice the applied ra- The Principle of Phase Stability In 1944 and
dio-frequency voltage. In 1929 E. O. Lawrence 1945 V. Veksler and E. McMillan, seeking a
elaborated the concept to include the effect means of circumventing the energy limitation
of a magnetic field and invented the cyclotron. of the cyclotron, independently formulated the
If, in a uniform magnetic field, ions are ac- principle of phase stability. They pointed out
celerated perpendicular to the field by a radio- that in the acceleration of charged particles by
frequency voltage applied to a reentrant elec- a radio-frequency field, particles in a certain
trode, then for appropriate values of the field phase band were stable; that is, if they had
and frequency the ions will pursue a circular small errors of phase with respect to the ac-
path and return to the opening of the electrode celerating field or of energy, the acceleration
when the voltage has reversed and the ions will itself automatically tended to correct the error.
again be accelerated. Continuing, the ions will This principle had far-reaching consequences;
describe a spiral path increasing in radius and it effectively removed the energy limitation
energy as they enter and leave the accelerating (except the economic one) of accelerators and
electrode in resonance with the applied radio- it led to the development of several new designs.
frequency voltage, making it possible to reach In the synchrocyclotron, as the ions are ac-
energies corresponding to hundreds of times the celerated and their mass increases, the frequency
applied voltage. An additional requirement is of the accelerating voltage is slowly decreased.
that the ions remain in the mid-plane of the The ions automatically remain in the proper
cyclotron; that requirement, focusing, is met by phase and increase in energy to the limit im-
a slight decrease of the magnetic field with ra- posed by the size of the magnet. Synchrocy-
dius. The great advantage of achieving high clotrons were rapidly developed for protons,
energies without the necessity of generating deuterons, and alpha particles to energies of
high voltages led to the rapid development of hundreds of MeV. The magnets weighed several
the cyclotron as an accelerator of protons, thousand tons and had pole diameters up to
-deuterons, and alpha particles of energies of about 5 meters. The size and cost of the magnet
several tens of Me V. It was the leading type of became the limiting factor as the need for higher
accelerator until the end of World War II and energies in nuclear research continued.
over a hundred were in operation in laboratories The principle of phase stability coupled with
all over the world. As the requirements of re- the cyclotron resonance condition provided the
search called for increasing energy the limitation means around the limitation of the synchrocy-
of the classical cyclotron became apparent. The clotron also. In the proton synchrotron, low-
cyclotron resonance condition specifies a fre- energy ions are injected into a ring-shaped mag-
quency of the accelerating voltage equal to the net. Both the frequency of the accelerating
rotational frequency and proportional to the voltage and the magnetic field are increased,
magnetic field and the charge to mass ratio of holding the ions in a nearly constant radius orbit
the ions, but as the energy of the ion increases. as they are accelerated. The weight and cost of
the mass of the ion increases (relativistic effect), a magnet for constant-radius orbits are very
decreasing the rotational frequency and vio- much less than those of one spiral orbits. As in
lating the resonance condition. This limits the the cyclotron, the ions are focused by a small
number of turns that the ions will stay in phase decrease of field with radius. Proton synchro-
with the accelerating voltage and limits the trons of this type extended the practical energy
energy of the cyclotron to some tens of MeV. limit to over 10 GeV (10,000 MeV).
The Induction Accelerator There were many Electron synchrotrons or just synchrotrons
attempts in the 1920s and 30s to devise an are based on the same principles, with one sim-
accelerator using the electric field induced by plifying feature but a new limitation. The sim-
a changing magnetic flux (transformer action) plification is that since an electron attains
to accelerate ions. The requirements of a mag- nearly the constant velocity, c, at an energy of
netic field to hold ions in a circular orbit and a few MeV, only the magnetic field need be
ACCELERATOR, PARTICLE 18

varied if the electrons are introduced into the Linear accelerators similar in principle to pro-
synchrotron with a few MeV energy. The limi- ton linacs may also be used to accelerate nuclei
tation compared with the proton synchrotron of any atom, including Uranium. Such heavy-
is an energy limitation and is due to the radia- ion linear accelerators (hi/acs) are more compli-
tive loss of energy of charged particles in circular cated because they must provide for acceleration
orbits. This is called synchrotron radiation and of particles of various charge-to-mass ratios
is a special case of bremsstrahlung. This effect, corresponding to different charge states and
which placed a practical energy limit of about different nuclei.
I Ge V on the first generation of synchrotrons, Sector-Focused Cyclotrons We have seen
is not significant in proton accelerators because that the energy of the classical cyclotron is
the energy loss varies inversely as the square of limited by violation of the resonance condition
the mass of the particle. as the mass of the particle increases with energy.
Linear Accelerators The concept of the The synchrocyclotron provided a way around
linear accelerator, that is the acceleration of this difficulty but only at the expense of in-
charged particles along a linear path by radio- tensity because the magnets are pulsed. If the
frequency fields, goes back to the 1920s, and magnetic field of a cyclotron increased with
early development was carried out in the 1930s. radius, the resonance condition could be
However, the necessary rf power technology matched to the increasing particle mass; how-
was not available at that time and the cyclotron ever, an increasing field defocuses and all the
and the Van de Graaff accelerators were so particles would be lost. L. H. Thomas in 1938
successful that linear accelerator development pointed out that a focusing force could be re-
languished. Radar and communications devel- stored if the magnet poles were sectored, pro-
opments during World War II resulted in great ducing alternate regions of high field and low
advances in rf technology and specifically in field even if the average field increased to match
the availability of high-power high-frequency the resonance condition. This idea was not
tubes of several kinds. This led to the develop- immediately exploited, but developments be-
ment of two kinds of linear accelerator in the ginning in 1949 resulted in numerous variations
immediate post-war years. of cyclotrons characterized by azimuthally
The first was the electron linear accelerator varying magnetic fields and constant rotational
or electron Unac by W. Hansen in 1947. In this frequency. Accelerators of this type have been
accelerator a traveling rf wave is introduced built for protons to energies of about 600 MeV
into a waveguide which has been loaded with a with currents of 150 /.lA, a factor of 100 greater
series of washer-shaped irises to reduce the than can be achieved with a synchrocyclotron.
phase velocity of the wave to c. Electrons pre- The sector focusing idea introduced a flexibil-
accelerated to about 2 MeV, where their veloc- ity into design, making it possible also to build
ity is 0.98c, ride the crest of the advancing wave cyclotrons which could accelerate iorts of dif-
as, in analogy, a surfer rides a water wave. Typi- ferent species and with variable energy. These
cally the frequency is about 3000 MHz and the developments have rendered both the classical
diameter of the waveguide accelerating tube is cyclotron and the synchrocyclotron obsolescent.
8 cm. The maximum energy achieved with ac- Alternating-Gradient Focusing The principle
celerators of this type has been 24 GeV at the of phase stability removed the energy limitation
SLAC 2-mile accelerator. In the lower energy of the classical cyclotron and it made possible
range, about 100 MeV, hundreds of these ac- the design of accelerators using annular magnets,
celerators have been built commercially and are but the focusing requirement was still met as
used as x-ray sources both for therapy and in the case of the cyclotron by introducing a
radiography. negative gradient of the magnetic field with
The second was the proton Unac developed radius which gives a force restoring ions to the
by L. Alvarez in 1948. In this accelerator, a median plane of the magnet (vertical focusing
standing rf wave is set up in a resonant tank in in the usual orientation). This focusing force,
a mode in which the maximum electric field is which is relatively weak, determines the space
along the axis of the tank. A series of "drift required by the beam, hence the size and cost
tubes" of appropriate length, shape, and spac- of the magnet. The focusing force cannot be
ing are distributed along the axis so that charged increased simply by increasing the gradient of
particles are accelerated by the rf field when the magnetic field, because then the ions would
they are between drift tubes, but are shielded not be confined in the radial direction (hori-
from the field during the reverse half cycle. An zontal defocusing).
initial energy at injection is necessary; in the N. C. Christophilos in 1950 and E. D. Courant,
modern proton linac this is typically a Cock- M. S. Livingston, and H. S. Snyder in 1952 in-
croft-Walton accelerator operating at 750 KeV. dependently devised a new focusing scheme
Proton linacs themselves are used as injectors called alternating-gradient or strong focusing.
into proton synchrotrons. For this purpose If a magnet is divided into segments, alternating
their energy may be 50 or 200 MeV. The high- segments with vertical focusing and radial de-
est-energy proton linac is an 800 MeV accelera- focusing forces with segments having vertical
tor designed to exploit the high current capabil- defocusing forces and radial focusing forces the
ity of linacs. net effect will be focusing in both directions.
19 ACCELERATOR, PARTICLE

Strong focusing incorporated into proton beam of 30 GeV on a target would give only
synchrotron design reduced magnet aperture about 7.75 GeV. A single storage ring may also
cross sections by a factor of ten or more, mak- be used to store two counter-rotating beams of
ing it economically possible to design proton particles of opposite sign. Thus PETRA at
accelerators up to several hundred GeV. The Hamburg and PEP at Stanford are single' storage
largest of these at the Fermi National Accelera- rings designed for electrons and positrons at
tor Laboratory (1972) and at the European about 20 GeV each. At CERN the 450 GeV
Organization for Nuclear Research, CERN, proton synchrotron has been reconfigured
(1976) have annular magnet systems of 2 km with a complicated set of auxiliary rings to
major diameter and aperture cross sections accelerate, store, and collide protons and anti-
of about 5 by IS cm. The maximum proton protons at 270 GeV, giving collisions of 540
energies of these accelerators are 500 and 450 GeV. To produce this collision energy with a
GeV, respectively. The energy of electron single accelerator and a fixed target would re-
synchrotrons incorporating strong focusing is quire an energy of about IS TeV (IS X 10 12
still limited by radiative energy loss, but the eV)!
advantage of small magnet cross section has Uses of Accelerators While the demands of
made it possible to achieve energies of more nuclear and particle physics research have been
than 10 GeV. the strongest driving force in the development
Colliding Beams In a collision between a of new accelerators and the achievement of
particle and a stationary target nucleus, not all high energies, the applications of accelerators
the kinetic energy is available to induce a reac- in other sciences and in industry has been wide-
tion. Part of the energy, as required to conserve spread and the contributions very important.
momentum, goes to the motion of both parti- Usually the accelerators designed for practical
cles after the collision. For accelerated and tar- applications operate at less than the maximum
get particles of equal mass and for energies energy for their type, but they may often be
where relativistic effects are small, the available required to meet other demands at the limit of
energy is approximately one-half the energy of technology-intensity, reliability, compact size,
the accelerated particle. This is not a serious etc.
loss; however, for particles accelerated to In medicine, accelerator-produced radioiso-
higher energies, an increasing fraction goes to topes are routinely used to image internal struc-
the energy of motion. For protons at relativistic tures and to monitor functions. Thousands of
energies striking target protons, the available small compact electron linear accelerators are
center-of-mass energy is approximately V2E used in hospitals to generate penetrating x-rays
GeV, where E is the energy of the incident for cancer therapy. Accelerated particles rang-
particle in Ge V. Thus the largest proton syn- ing from protons to silicon nuclei and second-
chrotrons of about 500 GeV energy can deliver ary beams of neutrons and pions are also used
only about 30 GeV to a reaction. If two beams for cancer therapy but on an experimental
of particles of energy E traveling in opposite basis. In engineering and manufacturing, elec-
directions could be made to collide head on, an tron accelerators are used to generate pene-
energy 2E would be available for reactions. The trating x-rays to examine large structures;
possibility that this obvious, but very difficult small ion accelerators are used to implant con-
to achieve, objective might be realized was de- trolled impurities in the fabrication of semi-
rived from suggestions made independently by conductor devices. Radioisotope tracers are
D. W. Kerst, G. K. O'Neill and others in 1956. used to study and monitor chemical reactions,
Because even the most intense accelerator wear, and other processes. Small accelerators
beams are not adequate to give a useful inter- are used to log oil wells and other bore holes
action rate if two accelerator beams are simply by analysis of the characteristic radiation from
pointed at each other, it is necessary to collect, various elements when excited by neutrons.
store and recycle the accelerated particles. This Plastics with superior electrical and chemical
is done by injecting the beam from an accelera- properties are produced by curing organic
tor into an annular magnet with a constant mag- polymers with electron beams. Extremely
netic field. If the magnetic field is very precise sensitive and nondestructive analysis can be
and if the pressure in the vacuum chamber very accomplished by inducing characteristic x-ray
low « I 0 -9 Torr), the beam may be made emission by proton or alpha-particle beams
to circulate for many hours, even days. Such from cyclotrons or electrostatic accelerators.
storage rings may be constructed in intersecting Synchrotron radiation, electromagnetic radia-
pairs with provisions for loading them in oppo- tion from energetic electrons confined to orbits
site directions with particles from an accelera- by magnetic fields, a limiting factor in energy
tor. At the beam intersections a small fraction of electron accelerators, is an extremely useful
of the particles interact and the noninteracting source of intense, highly collimated radiation
ones continue around for repeated chances to extending from the infra-red to x-rays. There
interact. are many applications of this radiation in chem-
The ISR (Intersecting Storage Rings) at CERN istry, metallurgy, and biology.
provides for collisions of proton beams at 30 The Future of Accelerators The course of
GeV, giving a total energy of 60 GeV. A single accelerator development may be displayed in an
ACCELERATOR, PARTICLE 20

interesting way in a plot first due to M. S. Liv- ACCELERATORS, LINEAR


ingston. The maximum energy achieved with
each type of acclerator is plotted against the Linear accelerators (often abbreviated to
year it was achieved, see Fig. 1. It will be noted "linacs") are used for acceleration of electrons,
that, as each type of accelerator reaches or protons, and heavy ions. Electron linear acceler-
approaches a limiting energy, a new type ap- ators have yielded electrons at energies above 20
pears. A linear envelope of these curves shows Ge V; proton linear accelerators have not yet
that maximum accelerator energy has increased reached energies above 800 MeV.
by a factor of about 8 each decade for 50 years. Although the tenn "linear accelerator" is
There appears to be no letup in the demand for occasionally used to describe systems in which
higher energies for research directed towards particles are accelerated by electrostatic fields
the ultimate structure of matter; yet the sizes (Cockcroft-Walton or electrostatic accelerators),
of the largest accelerators are measured in kilo- the term is generally used to apply to systems in
meters and the cost in hundreds of millions of which particles are accelerated along a linear
dollars. It seems likely then that further ad- path by application of rf fields. Only accelera-
vances in accelerator performance will depend tors of this type will be discussed in this article.
upon the emergence of new concepts to cir- The linear accelerator has the advantage that
cumvent the limits of size and cost. Supercon- the accelerated beam is easily extracted for ex-
ductivity is already coming into use to provide perimental use. In principle it is capable of pro-
higher magnetic fields at lower power costs and ducing well-focused beams of higher intensity
will be exploited more fully. Strong electric than are available from circular machines of the
fields are associated with intense laser beams; synchrotron or synchrocyclotron type. It does,
a way may be found to apply these fields to however, require very high power levels at fre-
accelerate particles. The very strong magnetic quencies where conversion equipment is rela-
fields associated with an intense electron beam tively expensive. For a given final energy, a
may be useful to confine other particles. The linear accelerator will usually be materially
collective effects of a swarm of particles may more expensive than a synchrotron. (For a
be used to transfer energy to other particles. general discussion of accelerators see ACCELER-
Invention and development are continuing and ATORS, PARTICLE.)
there will be new concepts almost surely lead- Field Patterns Used in Linear Accelerators
ing to new types of accelerators with perfor- The rf fields used for acceleration are set up in a
mance going well beyond the present large long cylindrical cavity whose axis is to be the
accelerators. axis ofthe accelerated beam. Hence for accelera-
tion the field pattern must have a major electric
EDWARD J. LOFGREN field component parallel to the axis. This
requirement is satisfied by the TMolO waveguide
References mode in which a paraxial electric field has its
maximum strength at the axis and falls to zero
Livingood, John Jacob, "Principles of Cyclic Particle at the cavity wall. Azimuthal magnetic fields lie
Accelerators," Van Nostrand Reinhold, New York, in planes normal to the axis, have small values
1961. near the axis and increase to maximum values at
Livingston, M. Stanley, and Blewett, John P., "Particle the cavity walls. Usually the field pattern is
Accelerators," McGraw-Hill Book Co., New York, maintained by coupling to these magnetic fields
1962. by loops or apertures excited by external power
Hicks, J. W. (Ed.), Eighth International Conference on sources. Corresponding to the high rf magnetic
Cyclotrons and Their Applications, IEEE Transac- field at the wall, paraxial currents flow in the
tions, NS-26(2), (1979). walls and are responsible for a major fraction of
Livingston, M. Stanley, Early History of Particle Ac- the power loss in the system. When high electric
celerators, in "Advances in Electronics and Electron fields are required on the axis to accelerate to
Physics," Vol. 50, pp. 1-88, Academic Press, New high energy in reasonable distances, the wall cur-
York,1980. rents are correspondingly high. For acceleration
Duggan, Jerome L., and Morgan, I. L., 1980 Confer- rates of 2 MeV/m, power losses in copper walls
ence on the Application of Accelerators in Research will be of the order of 50 kW 1m.
and Industry, IEEE Transactions, NS-28(2), (1980). Both standing wave and traveling wave pat-
Newman, W. S. (Ed.), H~ 1th International Conference terns are used in linear accelerators. If traveling
on High-Energy Accelerators," Birkhauser Verlag, waves are used, as is the case in most electron
Basel, 1980. machines, the phase velocity of the waves must
Placious, R. C. (Ed.), 1981 Particle Accelerator Con- be made equal to the velocity of the particles ac-
ference, Accelerator Engineering and Technology, celerated; as the particle velocity increases, the
IEEE Transactions, NS-28(3), (1981). phase velocity also must increase. But phase
velocities in simple waveguides always are greater
Cross-references: ACCELERATORS, LINEAR; AC- than the velocity of light, and loading must be
CELERATORS, VAN DE GRAAFF; BETATRON; introduced to reduce the phase velocity to the
CYCLOTRON; SYNCHROTRON. desired value. This is accomplished by introduc-
21 ACCELERATORS, LINEAR

FIG.!. Cutaway of iris-loaded waveguide for electron linear accelerator.

tion at intervals of washer-shaped irises, as desirable to operate at as high a frequency as


shown in Fig. 1. possible. But, as wavelength is decreased the di-
Standing wave patterns are used in proton ameter of the structure and of the beam aperture
linear accelerators. Cavities many meters in decrease correspondingly. The highest frequency
length are excited in the TM oio mode in which that gives convenient beam apertures and at
the axial field is uniform from one end of the which adequate power sources are available is in
cavity to the other. Protons which enter the the 3000-MHz range. For reasons that are pri-
cavity at a low injection velocity may arrive at a marily historic, most electron linear accelerators
phase of the rf field at which they are im- in the United States are operated at a frequency
mediately accelerated, but before they have of 2856 MHz.
traveled more than a few centimeters, the field Both the phase velocity and the group velocity
will reverse and become decelerating. To protect in the guide are determined by the dimensions of
the particles from the field in its decelerating the guide and the loading irises. The group ve-
phase, "drift tubes" are introduced, as shown in locity is fixed also by the capabilities of the rf
Fig. 2. These are pipes coaxial with the cavity power sources. Klystrons with outputs of the
and of such length that the particle is protected order of 20 MW have become standard; each
from the field during its reverse phase and klystron can excite a section of waveguide 3 m
emerges only after a complete rf cycle when the long to axial fields of 10 MV 1m. The group ve-
field again is accelerating. As the particles gain locity suitable for this operation is 1 per cent of
energy, the drift tubes are increased in length. the velocity of light. The dimensions indicated in
It would appear that the rather complicated Fig. 1 result in a phase velocity of e and a group
drift-tube structure is conceptually and mechan- velocity of O.Ole when the guide is excited at
ically inferior to the rather simple iris-loaded 2856 MHz.
traveling-wave system. It is adopted at the rela- Injection is from a conventional electron gun.
tively low phase velocities required for protons In some cases a short "bunching section" pre-
in the range below about 200 MeV because the groups the electrons around the peak of the ac-
extreme loading required to reduce the phase celerating wave. In this section, the phase veloc-
velocity of the iris-loaded system to velocities ity is matched to the electron velocity by
below one-half of the velocity of light results in suitable choices of dimensions.
very high losses. From the point of view of rf The power levels required are so high as to pre-
power consumption, the drift-tube structure is clude continuous operation. Typical operation is
much superior at low phase velocities. with two-microsecond (2-J.(sec) pulses repeated
Electron Linear Accelerators Electrons very several hundred times per second. Of the 2-J.(sec
rapidly approach the velocity of light (e) as they pulse, the first half is required to build up the
are accelerated. At 1 MeV an electron already has accelerating field .
reached 94 per cent of its ultimate velocity. At Electron linear accelerators in the energy range
energies higher than this satisfactory acceleration below 100 Me V are widely used for x-ray pro-
will be achieved if all sections of the accelerator duction and are commercially available. Most
are made to have phase velocities equal to e. of the pioneer work on electron linear accelera-
This makes much easier the tasks of construction tors was done at Stanford where a machine two
and of operation. For example, rf excitation of miles long is in operation at over 20 GeV. At
a section of the accelerator may fail and the Orsay, France, a 1.3-GeV electron machine is in
whole machine will still be operative, although operation, and in Kharkov in the USSR a 2-
at a slightly lower final energy. GeV accelerator is also in operation.
Losses per unit length in waveguides generally Proton Linear Accelerators Because of the
decrease as the square root of the rf wavelength lower velocities of protons at million-electron-
for equal axial fields. Hence, where possible, it is volt energies, proton linear accelerators suffer
ACCELERATORS. LINEAR 22

from several limitations from which the electron which enter the machine at phases in the neigh-
machines are free. Injectors for protons usually borhood of the correct phase. Often pre-
are Cock croft-Walton voltage multiplier sets giv- bunchers are used to collect a large fraction of
ing energies of 500 to 750 keY. At 750 keY the the injected beam around the accelerating phase.
velocity of a proton is only 0.04c. The accelerat- These prebunchers have the same design as the
ing field component at such low phase velocities modulating gap in a klystron and function in
varies strongly with radius at a rate that is ap- the same fashion.
proximately proportional to the square of the At the stable phase the field across an acceler-
frequency. This effect sets an upper limit of ating gap is rising as the proton crosses the gap.
about 200 MHz for the frequency of the ac- As the proton enters the gap, the accelerating
celerating field, and most proton linear ac- field has a focusing component, but as it enters
celerators are operated in the neighborhood of the next drift tube it feels a larger defocusing
200 MHz. A cavity resonant in the TM o10 mode field and the net effect is a strong defocusing. In
at that frequency will be about 90 cm in early proton linacs this effect was overcome by
diameter. the introduction of rudimentary grids at the
Figure 2 is a schematic cross section through a downstream end of each gap. These grids give
50-Me V proton linac formerly used at the Brook- unsatisfactory performance because they inter-
haven National Laboratory as the injector for cept a large fraction of the beam and because
the 33-GeV synchrotron. Sections are shown at their poor optical quality results in loss of many
the injector end, at the region where the protons protons. With the advent of alternating gradient
have an energy of about 10 MeV, and at the focusing, grids in linacs were largely abandoned
high-energy end. The over-all length of the and focusing is now accomplished by quad-
machine is about 33 m. The drift tube shapes rupole magnets imbedded in the drift tubes. This
indicated have the purpose of keeping each sec- has resulted in an increase in output current by
tion of the machine resonant to give a uniform two orders of magnitude to levels of the order of
accelerating field pattern and, at the same time, 100 rnA.
of holding the resistive losses in the walls of the As in electron machines, the high rf power level
drift tubes to levels as low as possible. required forces operation at a relatively low duty
The principle of phase stability is operative in cycle. Since, at this frequency, the time required
proton linacs whereas, at the extreme relativistic to build up the field in the linac cavity is about
velocities of multi-Me V electrons, electron linacs 200 /J.sec, pulse lengths for research use are
do not enjoy phase stability and require extreme chosen to be several hundred microseconds.
precision in axial dimensions. In the proton Duty cycles are rarely larger than 1 per cent.
linac, the drift tube lengths increase at a rate The first proton linac was the 40-ft machine
corresponding to acceleration at a phase dis- at the Lawrence Radiation Laboratory in which
placed 20° or 30° from the peak of the wave. protons were accelerated to 32 MeV. This ma-
The phenomenon of phase stability (see chine was made possible by the development
ACCELERATORS, PARTICLE) results in contin- during World War II of powerful rf sources for
ual restoration to the correct phase of protons radar applications. It was a pulsed machine

DRIFT TUBES

750 KeV INJECTED


BEAM
SECTION
10 MeV
AROUND /
I 50 MeV ACCELERATED
BEAM

o ____
~' ~ ____
2
~'L-
3FT.
__--JI

FIG. 2. Cross section through proton Iinac.


23 ACCELERA TORS, LINEAR

operating at peak currents in the microampere be accelerated in a structure similar to a proton


range. linac but usually operated at a lower frequency.
In the early 1950s a daring attempt was made Multiply charged ions are injected at a few hun-
at Livermore to build a continuously operating dred electron volts and accelerated to energies
prototype for a very-high-energy linear accelera- of the order of 1 MeV per nucleon. The earliest
tor to accelerate protons to energies of hundreds heavy-ion machines resulted from a joint design
of MeV where they could be used in production study between Berkeley and Yale. Machines
of fuels for reactors and weapons. The prototype were built at both centers and proved very
was built and operated. In those days the high- valuable in research on nuclear physics and
est frequency practical for continuous operation nuclear chemistry. The Yale machine has been
was 12 MHz and as a consequence the resonant shut down but the Berkeley accelerator has
cavity was 60 ft in diameter, so big that a rail- undergone massive improvements. In 1982 the
road spur was run into the resonator to deliver Berkeley "Superhilac" had its beam transported
heavy parts. The prototype was 60 ft long. a considerable distance across the campus and
Focusing was by solenoids inside the drift then injected into the Bevatron, a large synchro-
tubes-alternating gradient focusing had not yet tron where much higher energies are achieved.
been invented. After about two years of work it Heavy-ion accelerators have been particularly
was discovered that there was considerably more valuable in production of transuranium ele-
natural reactor fuel than had been suspected ments. Their energies are continually being
and the project was abandoned. The project, pushed to higher levels in the hope of produc-
called the MT A, was highly classified at the time tion of elements in the predicted "islands of
but was declassified in 1957. stability" far beyond the end of the presently
In the late 1950s, 50-MeV pulsed linacs oper- collected periodic table of the elements.
ated at 200 MHz were built to serve as injectors A very ambitious heavy-ion accelerator was
for the proton synchrotrons at the Argonne built during the 1970s at a new laboratory in
National Laboratory (ANL), the Brookhaven Darmstadt. It combines several linear accelerator
National Laboratory (BNL) and the CERN types in sequence, each peculiarly suitable for
International Laboratory in Geneva. These were use in the energy range in question. In 1981 this
in operation by 1961. In a few years the BNL machine-the UNILAC-was undergoing major
linac was replaced by an updated 200-MeV rebuilding and modernization. When this is
machine which is still in operation in 1982. A complete the machine's output can include
similar linac was built at the newly formed heavy ions with energies of 14 MeV per nucleon.
Fermi National Laboratory in Batavia, Illinois New Developments Three new developments
to be the injector for the second stage of the during the decade of the 1970s merit attention.
200-500-GeV proton synchrotron. First is the "radio-frequency quadrupole" or
A 20-MeV proton linac is in operation at the RFQ. It was proposed in the Soviet Union
KeK Laboratory in Japan where it is the injec- during the years after 1971 that a structure
tor for the 12 GeV synchrotron. This machine's could be designed in which the rf accelerating
KeK Laboratory in Japan where it is the in- fields would not defocus the beam but could
jector for the 12 Ge V synchrotron. This ma- provide focusing as well as acceleration. The
chine's structure has some valuable innovations structure would consist mainly of four bars
that serve as protection against damage by parallel to the particle orbits, one above, one
earthquakes. below, and one on each side of the orbit. The
The first proton linac to be built by industry surfaces of these bars facing the orbits would
for industrial use (production of radiopharma- be lightly corrugated, the corrugations above
ceuticals) came into operation in 1981 at 40 and below facing each other, and the rising part
MeV. It was built by the New England Nuclear of the side corrugations facing the low part of
Corporation near Boston. the upper and lower ones. There was consider-
The highest energy reached in a proton linac able skepticism in the United States about this
by 1982 was achieved at Los Alamos where an scheme, and it was not tested in the US until
800-MeV "meson factory" has been in opera- 1978. The tests were at Los Alamos where it
tion for about a decade. This interesting machine was shown that the Russian predictions were
combines a 100-MeV drift-tube linac with a completely correct and that the RFQ system
100-800 Me V iris-loaded section. The iris-loaded has notable advantages at low energies over con-
structure is not as simple as were the early ventional drift-tube systems. These include pos-
electron linacs; coupling down the linac is via a sible injection at much lower energies than the
cavity on the side of the structure which is conventional 750 keV -the Los Alamos system
coupled to the cavities on each side of an iris. used 100-keV injection. Possible applications of
This is reported to improve stability and to the RFQ will be mentioned below.
decrease rf losses. At 100 Me V the proton Another step forward, applicable to the linac
velocity is approaching half of the velocity of as well as in many other places is the evolution
light and the iris-loaded structure becomes as of rare earth permanent magnets. Pioneering
efficient as the drift-tube system. work on these materials has been carried on in
Heavy-Ion Linear Accelerators Heavy ions many places, mainly industrial, in the United
such as C, N, 0 and Ne and higher masses can States, Japan, and elsewhere. These materials
ACCELERA TORS, LINEAR 24

would be used primarily in linacs in the form of to those just described for the SNQ except that
quadrupoles. Permanent magnet quadrupoles for it would have a 100% duty cycle. Its energy and
linac focusing were proposed and studied at current might be higher than those of the SNQ
Brookhaven in the 1950s, but the barium ferrite by, perhaps, a factor of two. The design and use
permanent magnet materials did not have suffi- of such an accelerator have been discussed for
cient remanent fields. The new materials have some time at several centers, particularly at
remanent fields a factor of ten higher-of the Brookhaven and at the Chalk River Laboratory
order of 1 Tesla. The problem of how to con- of Atomic Energy of Canada, Ltd.
struct these materials into a powerful quadrupole Medical Applications Electron linacs have
was first solved in LBL; since then contributions been used for several decades in hospitals for
have been made in a number of laboratories. radiography and for treatment of tumors. A
The first working application of rare earth favorite energy is around 50 MeV. They are
quadrupoles is in the New England Nuclear produced commercially. Medical and industrial
accelerator mentioned above. applications involve over 1000 electron linacs
Sporadic work has been in progress for in the United States alone.
twenty-five years on development of supercon- Neutrons produced by disintegration of deu-
ducting rf cavities. Early work in the High terons accelerated in cyclotrons have been used
Energy Physics Laboratory at Stanford led to for tumor irradiation since the 1930s. More
optimism that rf fields could be increased and recently protons accelerated in linacs have been
losses drastically reduced in superconducting used in neutron production. Most notable is the
niobium cavities. But difficulties intervened. neutron beam generated at the Fermilab in
There have been few applications; notable has Illinois by an extracted 67 MeV beam from the
been the superconducting cavity in the Cornell injector for the Fermilab synchrotron.
synchrotron. More recently a super conducting Proton beams from linacs and synchrotrons
heavy-ion linac is almost completed at Argonne. have proved useful in radiography; they appear,
In 1981 most of the linac was assembled and in for example, to yield more sensitive detections
operation. Eventually it will accelerate heavy of breast cancers than do the conventional X-ray
ions to energies of 25 MeV per nucleon. beams.
Design Studies and Proposals FMIT. A major Perhaps the most spectacular medical acceler-
problem in the design of fusion reactors will be ator application involves the use of negative
the choice of materials for the "first wall," the pions. These are unstable antiparticles. They
wall closest to the reaction area. This wall will can penetrate human flesh without doing much
be subjected to bombardment by about 10 14 damage; then, at the end of their orbits they
IS-MeV neutrons per second. Various projects join with a stable particle in an explosive anni-
using the D-T reaction have been undertaken hilation. Since the orbits have well defined
but generally they produce neutron fluxes too ranges this is evidently a very desirable particle
low by two orders of magnitude. Finally it for tumor irradiation. During the late 1970s a
became evident that the only way to meet the design study was done at the Stanford Univer-
parameters required was by use of a 30-MeV sity Hospital. One conclusion of the study was
deuteron linac operating continuously with that the facility would be very expensive. A
currents of the order of 100 milliamperes. With linac with four rooms for patient irradiation
a liquid metal target this device could yield as would cost some tens of millions of dollars.
much as 10 16 neutrons per second. The original Hence construction has been postponed pend-
design was done at Brookhaven; later, the effort ing results from a pion study at Los Alamos. A
was joined by several other laboratories. Event- pion beam has been derived from the Los
ually the detailed design study was awarded to Alamos "meson factory" and a number of
the Hanford Engineering organization, later to patients have been treated. In the meantime the
be joined by Los Alamos. This team has made Los Alamos group has been attempting to
important progress in the design effort. In 1982 design a facility that can be materially less
it is not clear if or when construction will be expensive. Part of the Los Alamos enthusiasm
approved. for development of the RFQ has been for in-
SNQ. At Karlsruhe, a design study is in prog- clusion in a pion irradiation facility.
ress on a linac to serve as a spallation source. A Theory Since linear currents have been
l.l-Ge V proton linac with a current of 100 mA brought to high levels-hundreds of milliam-
will yield neutron fluxes comparable with those peres-a number of mysterious phenomena have
from high-flux reactors. The Karlsruhe linac been observed. In electron linacs spurious field
would have a 5-10% duty cycle. There is some modes have deflected beams into the accelerator
optimism that this project may be approved. walls. In proton linacs the beam "emittance"-
Breeders. The basic idea behind the MT A its size and angular distribution-has increased
project mentioned above has been revived. Ad- for no immediately visible reason in the low-
vances in linac technology make it possible to energy part of the machine.
build a much more reasonable machine than In 1982 it can be said that intensive work
was the MTA. The parameters of such an accel- using modern computers has resolved these
erator, for use in production of plutonium from mysteries. This comes as a result of studies in
U238 or production of tritium, would be similar the United States, Europe, and the USSR. Con-
25 ACCELERATORS, VAN DE GRAAFF

sequently it is now possible to design linacs geneous in energy and with smooth control over
with confidence that the machine performance the voltage range of the machine. General ac-
will be as predicted by the linac theory groups. ceptance of the Van de Graaff accelerator as
the precision instrument for experimental nu-
JOHN P. BLEWETT clear research followed rapidly, and its further
development for this purpose has been continu-
References ous since that time. Greater compactness and
higher voltage were attained by insulating the
Smith, Lloyd, Linear Accelerators in "Handbuch der belt generator and tube with compressed gas;
Physik," Vol. 44 pp. 341-389, Springer-Verlag, greater beam intensity came through improved
Berlin. ion source and acceleration tube technology.
Lapostolle, P. M., and Septier, A. L. (Eds.) "Linear About 300 such accelerators were in use by
Accelerators," North-Holland, Amsterdam, 1970. 1960, producing particles and radiation with
By far the best references on linear accelerators are energies from 400keV to 10 MeV. At that time,
the Proceedings of a series of linear accelerator con- Van de Graaff accelerators for nuclear science
ferences begun in 1960 at Brookhaven. The eleventh incorporated the "tandem acceleration" princi-
of this series was held in October, 1981 at Santa Fe, ple described below. It opened the way to far
New Mexico. The Proceedings of the 1981 Linear higher particle energies by applying the tandem
Accelerator Conference can be obtained from Los principles to multiply charged heavy atoms.
Alamos. It is Los Alamos report LA-9234-C. Van de Graaff accelerators can accelerate any
electrified particle, including any of the 92
Cross-references: ACCELERATOR, PARTICLE; AC- elements, electrons, and clumps of matter
CELERATORS, VAN DE GRAAF; BETATRON; simulating micrometerorites. In addition to use
CYCLOTRON; SYNCHROTRONS. in experimental nuclear physics with high-en-
ergy positive ions, Van de Graaff electron accel-
erators designed for voltages in the 1 to 5 MeV
ACCELERATORS, VAN DE GRAAFF range are used to produce megavolt x-rays for
the treatment of malignant disease and for the
The electrostatic particle accelerator originated radiographic inspection of heavy opaque struc-
by American physicist Robert Jemison Van de tures such as metal forgings, weldments, and
Graaff is widely used for nuclear structure rocket engines. Streams of electrons from such
research. These constant -potential accelerators accelerators are also used for radiobiological
make use of the electrostatic belt generator and radiochemical research and for the treat-
invented by Van de Graaff about 1930. They ment of skin malignancies. Radiation processing
belong to the direct accelerator family in which studies for such purposes as the sterilization of
the high voltage power is applied directly across surgical materials, the cross-linking of polyethy-
the terminals of a highly evacuated multi-elec- lene and other plastics, the deinfestation of
trode tube. Electrified atoms or electrons from grains, and increased shelf life of foods have
a source within the high-voltage terminal gain often made use of Van de Graaff accelerators.
velocity and energy as they move along the tube Vande Graaff Generator Operating Principles
axis to ground under the action of the applied Although a variety of electrostatic machines had
electric field. As each particle emerges from the been developed since the first frictionally ex-
accelerator, it is moving with a kinetic energy cited generator of Otto von Guericke in the
equal to q V where q is the particle charge and middle of the seventeenth century, all have been
V the generator voltage. superseded by the Van de Graaff generator be-
While a Rhodes Scholar at Oxford during cause of its greater voltage capability and com-
1927 and 1928, Van de Graaff selected the parative simplicity. The essential components
electrostatic approach to fulfill the need, much of the generator, outlined in Fig. 1, include a
emphasized by Rutherford, for more copious well-rounded metal terminal supported by an
sources of atomic particles comparable in energy insulating column and an endless insulating belt
to those spontaneously emitted from naturally system which physically conveys electric charge
radioactive materials (see ELECTROSTATICS). from ground to the high voltage terminal.
Subsequently, at Princeton University, Van de Electric charge of the desired polarity is de-
Graaff produced over one million volts between posited on the moving belt surface by corona
the spherical terminals of two small electro- from a row of metal points at a controllable
static belt generators of a new and surprisingly voltage with respect to the lower pulley toward
simple design; in 1931, he described the electro- which they are directed. In addition to over-
static belt generator principles, and their suit- coming friction and windage, the motor-driven
ability for the bombardment of atomic nuclei, belt does work in carrying this charge from
before the American Physical Society. The ground to the terminal potential. Transfer of
method was first applied to nuclear investi- the charge from belt to terminal is accom-
gations at the Carnegie Institution of Washing- plished by again presenting a row of points to-
ton in 1932. The early machines, insulated in ward the electrified belt. This time the electric
atmospheric air, produced streams of light posi- field of the surface-bound charge produces the
tive ions such as protons and deuterons homo- gaseous ionization needed for conduction across
ACCELERATORS, VAN DE GRAAFF 26

lligh·voltagc
terminal U pp er spray points

Upper pulley I nsulating belt


( inS:Jlated from - -- +-- - -\----i-tt----
te rminal )

Collecto r _ !fotor-driven pulley

Controllable spra y
Lower apray point.
voltage

FIG. 1. Diagram of Van de Graaff electrostatic belt generator. Reproduced by permission of The Institute of
Physics and The Physical Society from the article by R. J. Van de Graaff, J. G. Trump, and W. A. Buechner,
Reports on Progress in Physics, 11,1 (1948).

the point-to-belt gap. Van de Graaff pointed includes the particle current through the accel-
out that these ionized charge-transfer processes erating tube, the current through resistors which
remain independent of the terminal voltage if divide the terminal voltage uniformly along the
they are located in the field-free space within supporting column , and any corona from the
the hollow terminal or below the ground plane. terminal itself arising from the high electric
The current of such an electrostatic generator is field at its surface. By adjusting either the belt
limited by the maximum charge density which current or the load current, the terminal voltage
can be insulated in the gaseous medium sur- may be maintained at any desired value up to
rounding the belt and by the total area per the maximum which can be insulated. This
second of charge-laden surface entering or leav- maximum voltage depends only on the physical
ing the terminal. To increase the current capa- size and geometry of the terminal and on the
bility of the system, the return run of belt may electrical strength of dielectric medium sur-
be charged within the terminal in a similar rounding it. An isolated metallic sphere would
manner but with the opposite polarity. be the ideal terminal, but modifications are
The potential, V, of the high voltage terminal necessitated by the supporting column, belt,
of a Van de Graaff generator is determined by and tube .
the amount and polarity of the accumulated The pair of generators built by Van de Graaff
charge on its insulated terminal. At any instant at Princeton in 1930 each had an aluminum
V = Q/C where Q is the net positive or negative spherical terminal 2 ft in diameter supported in
charge on the terminal and C is the capacitance air by a slender glass rod 7 ft long. A silk ribbon
of the terminal system to ground. Although the was employed as the insulating charge conveyor.
Van de Graaff generator is inherently a con- The voltage insulated in atmospheric air be-
stant-current machine, it can be maintained tween these two generators, one accumulating
steadily at the desired voltage by balancing the positive and the other negative charge, was
current arriving at the terminal against the total more than twice any previously attained con-
current delivered to the load. The load usually stant voltage.
27 ACCELERATORS, VAN DE GRAAFF

About 5.5 million volts were insulated in air to 25 atmospheres. Electronegative gases such
between two larger generators constructed by as sulfur hexafluoride (SF 6) and "Freon" (CCI 2 -
Vande Graaff in the early 1930s for nuclear F 2 ) are now increasingly used instead of mix-
research. This voltage required spherical ter- tures of nitrogen and CO 2 , since they insulate
minals 15 ft in diameter supported on insulating approximately the same voltages at one-third
tubular columns 25 ft high. This historic equip- gas pressure.
ment, shown in an early sparking demonstration Acceleration System The evacuated accelera-
in Fig. 2, was used in a modified form for pre- tion tube, the source of positive ions or elec-
cision nuclear research at Massachuesetts Insti- trons, and the target to which the energized
tute of Technology for nearly 20 years. It is particles are directed, constitute the particle
now impressively installed at Boston's Museum accelerating system of the Van de Graaff accel-
of Science for daily demonstrations of high-volt- erator. The insulating length of the evacuated
age phenomena before large audiences and ex- acceleration tube is divided into many sections
planation of the underlying electrical principles. by metal disk-like electrodes, each with an axial
The need for still higher constant voltages for opening for the passage of the particle beam.
nuclear investigations, and the desire for more Each disk is mounted between annular rings of
compact apparatus, led to the use of high- glass or porcelain to form a slender vacuum-
pressure gases for the insulation of electrostatic tight accelerating column. The tube electrodes
accelerators. Today nearly all Van de Graaff take their potential from the metallic members
accelerators operating at potentials in excess of in the generator column along which the ter-
one-half million volts are within a steel pressure minal voltage is divided by resistors. The charged
tank and insulated in gases compressed to 10 particles, acted upon by the electric field be-

FIG. 2. 5.5-million volt Van de Graaff generator in sparking demon-


stration.
ACCELERATORS, VAN DE GRAAFF 28

tween these electrodes, are progressively accel- "Emperor" tandem Van de Graaffs at the
erated and focused as they mqve through the Brookhaven National Laboratory.
electric fields between the electrodes. At the Although the light elements, hydrogen and
remote end, the beam emerges as a collimated helium, were almost exclusively used as atomic
and directed stream of energetic particles. projectiles in nuclear structure physics until
Tandem Acceleration and Multiply Charged 1960, interest in heavier nuclei developed rapid-
Ions Van de Graaff accelerators for nuclear ly as higher energies became possible. It is
science now reach higher particle energies with estimated that, by applying tandem acceleration
a given terminal voltage by switching the polar- principles, a two-stage Vande Graaff accelerator
ity of the accelerated particles. In the two-stage with a IS-MV positive terminal can produce
tandem diagramed in Fig. 3, negatively charged a beam of uranium ions with energies up to
ions are produced at ground and then acceler- 400 MeV. In large part because of the more
ated toward a high-voltage positive terminal. complete electron stripping attained at higher
Within this terminal, the swiftly moving nega- energies, three-stage acceleration could produce
tive ions are stripped of electrons by passing uranium ions with energies over 1000 Me V.
through a thin gaseous region. The resultant During the 1970s a number of two-stage Van
positive ions continue through the tube under de Graaff accelerators for national physics re-
the second accelerating action of the positive search became operational at university and
terminal. A singly charged particle, such as a national laboratories in several countries at
proton, thus arrives at the ground end of the controllable terminal voltages up to 16 MV. In
system with an energy of 2q V. England at Daresbury an SF 6 -insulated, verti-
At sufficiently high energy, atoms of higher cally mounted two-stage Van de Graaff designed
atomic number may be stripped of several or for terminal voltages up to 30 MV reached the
even of all their satellite electrons. An ion which testing stage in 1981.
lacks N electric charges during the second accel- Medical Applications of Van de Graaff Accel-
eration stage gains a total energy of (N + I) V in erators Since x-rays are the form of electro-
a two-stage tandem accelerator. Three-stage ac- magnetic energy, similar to light, produced by
celeration is secured by adding an additional in- the sudden stopping of high-energy electrons,
line two-stage accelerator with a central negative Van de Graaff accelerators are often used as
terminal and using it to produce one stage of x-ray sources for the treatment of malignant
negative ion acceleration for injection into the disease and for radiography. In this application,
second tandem. In 1967 the first three-stage the high-voltage terminal is operated at negative
tandem Van de Graaff, developed by the High polarity, the electrons are emitted from a tung-
Voltage Engineering Corporation and using ter- sten source at the terminal end of the accelera-
minals at 6 MV, was brought into use for nuclear tion tube, and they are suddenly stopped after
research at the University of Pittsburgh. This traversing the length of the tube by striking a
was shortly followed by a three-stage tandem water-cooled metal target, usually of tungsten
7.S-MV terminal at the University of Washington or gold.
in Seattle and in 1970 by two in-line 10-MV A 2 million volt x-ray generator of this type,

'OSITIVI. HleH YOUle[ ytRMINAl

/
lLlCT"ON aDOINC C""ul

'ARGlT

FIG. 3. Diagram of two-stage tandem Van de Graaff accelerator. Reproduced from the article by
R. J. Van de Graaer in Nuclear Instruments and Methods, 8,195-202 (1960), by permission of the
North-Holland Publishing Co.
29 ACOUSTICS

in which a gold target is bombarded with 300 tics, noise control, shock and vibration, under-
p.A of electrons, yields an x-ray intensity of water acoustics, speech, physiological acoustics,
100 rlmin measured 1 meter from the target in and bioacoustics.
the electron direction. The quality of this radi- Physical acoustics deals with the production,
ation is closely similar in its physical properties propagation, and detection of mechanical waves
to that of the gamma rays from radium or from in continuous media. Of particular interest are
the radioactive isotope cobalt 60. To equal this the radiation, reflection, refraction, diffraction,
x-ray intensity would require over 4000 curies attenuation, and scattering of longitudinal
of cobalt 60 or 6000 grams of radium. This Van waves. Much attention has been given to acous-
de Graaff accelerator for therapy is housed in a tic waves of high intensity (nonlinear acoustics)
steel tank 3 ft in diameter and 6 ft long and is and those of high frequency (ultrasonics).
insulated by a mixture of nitrogen and CO 2 at Two interesting acoustical effects, whose un-
300 psi. derstanding requires the use of quantum me-
JOHN G. TRUMP chanics, are the propagation of phonons (lattice
vibrations of very short wavelength) in solids
and sound propagation in liquid helium. At least
References five different kinds of sound have been identified
Van de Graaff, R. J., Trump, J. G., and Buechner, W. in liquid helium at very low temperatures, where
W., "Electrostatic Generators for the Acceleration of it becomes a superfluid. Acoustic signals gener-
Charged Particles," Rept. Progr. Phys., 11,1 (1948). ated by high-energy particles in water appear
Van de Graaff, R. J., "Tandem Electrostatic Accelera- to be promising as a means for detecting high-
tors." Nuclear Instr. and Methods, 8, 195-202 energy protons, muons, and neutrinos.
(1960). Architectural acoustics deals with sound in
Wittkower, A. B., Rose; P. H., Bastide, R. P., and buildings, and in particular the efficient distribu-
Brooks, N. B., "Injection of Intense Neutral Beams tion of desirable sound and the exclusion or
into a Tandem Accelerator," Rev. Sci. Instr. 35,1-11 reduction of undesirable sound. It is usually
(January 1964). this branch of acoustics which comes to mind
Wright, K. A., Proimos, B. S., and Trump, John G., when the layperson hears mention of the term
"Physical Aspects of Two Million Volt X-ray Ther- acoustics.
apy," Surg. Clin. North Am., 39,1-12 (June 1959). In most auditoriums, the intensity of the re-
Livingston, M. Stanley, and Blewett, J. P., "Particle flected sound exceeds the intensity of the direct
Accelerators," Ch. 3, New York, McGraw-Hill Book sound reaching most listeners. Thus the charac-
Co., 1962. ter of the perceived sound is very much depen-
Trump, J. G., "New Developments in High Voltage dent on the nature of the reflected sounds and
Technology," IEEE Trans. Nuclear Sci., NS-14, No. especially their temporal and spatial distribu-
3,11>-119 (1967). tions. The reflected sound which reaches the
Goldie, C. H., "High current Multimegavolt Ion Accel- listener within about SO milliseconds of the
erators," Proc. 3rd Intern. Conf. on Electrostatic direct sound is often called the early sound.
Technology, Oak Ridge National Laboratory, April Later-arriving reflections make up the reverber-
13-16, pp. 254-257 (1981). ant sound. One of the parameters character-
izing an auditorium is the reverberation time,
Cross-references: ACCELERATOR, PARTICLE; AC- usually defined as the time required for the
CELERATORS, LINEAR; CYCLOTRON; HIGH reverberant level to decrease 60 dB after the
VOLTAGE RESEARCH; STATIC ELECTRICITY; sound source ceases.
SYNCHROTRON. Psychoacoustics deals with the perception of
sound. Loudness, pitch, timbre, and duration
are attributes used to describe sound. These
ACOUSTICS attributes depend in a rather complex way on
measurable quantities such as sound pressure,
The word sound is used to describe two differ- ~requency, spectrum of partials, duration, and
ent things: (1) an auditory sensation in the ear; nvelope. The relationship of the subjective
(2) the disturbance in a medium, which can cause . tributes of sound to physical quantities is the
this sensation. (Making this distinction answers central problem of psychoacoustics.
the age-old question, "If a tree falls in a forest Musical acoustics considers special problems
and no one is there to hear it, does it make a connected with the production, transmission,
sound?") and perception of musical sound. Of consider-
The science of sound, which is called acous- able interest is the physics of musical instru-
tics, has become a broad interdisciplinary field ments, including studies of modes of vibration
encompassing many academic disciplines- and feedback processes which sustain oscilla-
physics, engineering, psychology, speech, audi- tions in wind instruments.
ology, music, architecture, physiology, and Musical acoustics overlaps several other
others. Among the branches of acoustics are branches of acoustics, such as architectural
physical acoustics, architectural acoustics, psy- acoustics (concert halls and music listening
choacoustics, musical acoustics, electroacous- rooms), psychoacoustics (perception of loud-
ACOUSTICS 30

ness, pitch, and timbre), speech communication to 20,000 Hz. Below this audible range is infra-
(singing), and electroacoustics (reproduction of sound and above it is ultrasound. It is possible
music). to generate sound waves in air with frequencies
Electroacoustics deals mainly with transduc- of hundreds of megahertz, and in solids fre-
ers, such as microphones and loudspeakers that quencies up to 2.5 THz (2.5 X 10 12 Hz) have
convert sound to electrical signals and vice versa. been generated.
Sometimes the amplification, recording, and re- The strength of a sound field is measured by
production of sound are included as well. The its mean square pressure p expressed as a sound
multi-billion dollar audio and entertainment pressure level Lp in decibels. Decibels are loga-
industries depend upon and apply the principles rithmic units tliat compare a sound pressure to
of electroacoustics, and so it is familiar to the a reference pressure, usually 20 micropascals or
consumer. 2 X 10- 5 N/m 2 , which is near the threshold of
Unwanted sound or noise has been receiving hearing for a healthy young person with normal
increasing recognition as one of our critical en- hearing. Prolonged exposure to sound levels
vironmental pollution problems. Like air and above 85 dB can damage the hearing mecha-
water pollution, noise pollution increases with nism, and at levels of 130 dB damage can occur
population density; in our urban areas it is a almost instantaneously.
serious threat to our quality of life. Noise-in- The loudness of a sound is a subjective quality
duced hearing loss is a serious health problem that depends mainly upon the sound pressure,
for millions of people employed in noisy en- but to a lesser extent on the frequency, spec-
vironments. Finding technical solutions to many trum, and duration of a sound as well. In an
of our environmental noise problems is central effort to obtain a quantity proportional to the
to the branch of acoustics called noise control. loudness sensation, a loudness scale was devel-
Vibration is a term that describes oscillation oped in which the unit is called the sone. One
in a mechanical system. Shock is a rather loosely sone is defined as the loudness of a IOOO-Hz
defined aspect of vibration wherein the excita- tone at a sound pressure level of 40 dB. The ear
tion is sudden, severe, and nonperiodic. The shows a marked decrease in sensitivity at fre-
branch of acoustics referred to as shock and quencies below 200 Hz, and this decrease is
vibration includes theoretical and experimental most pronounced at low levels.
studies of both deterministic and random vibra- Various methods are available for expressing
tion. Methods of measuring shock and vibration the loudness of complex sounds from their
have received considerable attention, as have sound pressure levels in octave or third-octave
methods for predicting their effects on physical bands. Environmental noise levels are usually
structures. measured with a sound level meter using the
Underwater acoustics deals with the propaga- A-weighting network, and these A-weighted
tion of sound in water, especially in sea water. levels correlate reasonably well with subjective
The special interest in this field results from loudness. For some types of noise, such as that
two important applications: underwater com- of a jet aircraft, a scale of perceived noise level
munication and sonar (SOund NAvigation and may be preferable.
Ranging). Slight changes in sound velocity due Acoustic disturbances can usually be regarded
to temperature gradients in the oceans result as small perturbations to an ambient state (which
in refraction of sound, the creation of special in a fluid may be described by the pressure, den-
channels and shadow zones, and other effects sity, and fluid velocity). The linear approxima-
of importance to nautical and naval personnel. tion (sometimes called the acoustic approxima-
Acoustical studies of speech communication tion) considers only first-order changes in these
constitute an important branch of acoustics. variables, and assumes that their time averages
The production, analysis, and synthesis of speech are zero. In a sufficiently intense sound field,
have been popular areas of research for acous- however, the time average of one or more of
ticians, and their work has been greatly facili- these variables may differ from its ambient
tated by the availability of the digital computer. value. This is an example of nonlinear behavior.
Physiological acoustics is mainly concerned Steepening of wavefronts in intense sound
with the auditory system, how it responds to waves is another.
sound, and evoked responses to sound. New
insight into the mechanics of the ear have re- THOMAS D. ROSSING
sulted from probing with laser light and the
Mossbauer effect, and this has contributed to References
some new mathematical models of how the Books
cochlea or inner ear functions.
Bioacoustics deals with the interaction of Beranek, L. L., "Acoustics," New York, McGraw-Hill
sound waves with biological tissues in humans Book Co., 1954.
and animals. Much recent research in this area Beranek, L. L., "Noise and Vibration Control," New
is concerned with the use of high-frequency York, McGraw-Hill Book Co., 1971.
ultrasound in medical diagnosis and treatment. Kinsler, E. E., A. R. Frey, A. B. Coppens, and J. V.
The human auditory system responds to Sanders, "Fundamentals of Acoustics," 3rd ed.,
sounds with frequencies ranging from about 20 New York, John Wiley & Sons, 1982.
31 ADSORPTION AND ABSORPTION

Kuttruff, H., "Room Acoustics," London, Applied tive pressures, then passes through a point of
Science, 1973. inflection (situated usually between -0.05 and
Lindsay, R. B., "Acoustics: Historical and Philosophical -0.30pO) to become convex to the p/po axis
Developments," Stroudsburg, PA, Dowden, Hutchin- at higher relative pressures; the point of inflec-
son & Ross, 1973. tion corresponds to the monolayer capacity
Morse, P. M., and K. U. Ingard, "Theoretical Acous- nm of the adsorbent, i.e., the amount of adsor-
tics," New York, McGraw-Hill, 1968. bate which can be accommodated in a com-
Pierce, A. D., "Acoustics: An Introduction to its Physi- pleted single molecular layer ("monolayer") on
cal Principles and Applications," New York, McGraw- the surface of the solid; the convex branch cor-
Hill, 1981. responds to the building up of a multimolecular
Rossing, T. D., "The Science of Sound," Reading, MA, layer ("multilayer"). From nm it is possible in
Addison-Wesley, 1982. principle to calculate the specific surfaceA(= sur-
Periodicals face area per gram) of the solid, by assigning a
value to am, the area occupied per molecule of
Journal of the Acoustical Society of America (1929- adsorbate in the monolayer. In practice, how-
). ever, very few adsorbates other than nitr.ogen
Acustica (1951- ). (at its b.p., 77 K) are found to be suitable, and
Proceedings of the International Congresses on Acous- the "BET -nitrogen" method is now the standard
tics. procedure for estimation of the specific surface
Journal of the Audio Engineering Society (1953- ). of finely divided or porous solids. (When A <
Journal of Sound and Vibration (1964- ). ~l m 2 g-t, krypton, though less reliable, has
Applied Acoustics (1968- ). to be used).
If the solid contains mesopores (pores having
Cross-references: ARCHITECTURAL ACOUSTICS: a width between tens and hundreds of A) the
ELECTROACOUSTICS; HEARING: MUSICAL isotherm is similar to Type II except that the
SOUND; NOISE, ACOUSTICAL; PHONONS: convex branch is replaced by a hysteresis loop
PHYSICAL ACOUSTICS: REPRODUCTION OF in which the curve for desorption lies above that
SOUND: SONAR; ULTRASONICS; VIBRATION: for adsorption, (Type IV isotherm); frequently,
WAVE MOTION. with xerogels for example, the isotherm bends
over to become effectively horizontal as satura-
tion pressure is approached. By application of
ADSORPTION AND ABSORPTION the Kelvin equation to points on the loop (usu-
ally the desorption branch) it is possible to cal-
When a porous solid such as charcoal is exposed, culate the pore size distribution of the adsorbent,
in a closed space, to a gas such as ammonia, the but owing to the arbitrary nature oftheassump-
pressure of the gas diminishes and the weight of tions that have to be made, including oversim-
the solid increases; this is an example of the ad- plified pore models, the detailed interpretation
sorption of a gas by a solid. I t is termed physical of the results is uncertain. Even so, the method
adsorption because the forces bringing it about (again with nitrogen as adsorbate) is valuable
are the "van der Waals" forces of attraction for comparative purposes, and is indeed virtually
which act between the molecules of the gas and the only method available for the lower end of
the atoms or ions comprising the solid. It is the meso pore range.
now known that all solids, whether porous or Since adsorption is exothermic, the amount
nonporous, will adsorb all gases physically, adsorbed at a given pressure must fall as temper-
whereas the phenomenon of chemisorption is ature increases. The differential molar heat of
specific in nature. Thus hydrogen is chemi- adsorption q-i.e., the limit of the ratio (llQ/lln),
sorbed by transition metals such as nickel or where llQ =heat evolved for an increment lln
iron but not by oxides such as alumina. in the amount adsorbed-is a function of the
In physical adsorption the amount of gas taken amount adsorbed. In general, 4 decreases gradu-
up per gram of solid depends on the temperature ally from an initial high value (e.g., -2qL) as
T, the pressure p, and the nature of both the n increases over most of the monolayer range,
gas and the solid : n =f(p, T, gas, solid). For a then rises to a low maximum (e.g., -1.3qL) as n
given gas (the "adsorbate") adsorbed on a parti- approaches nm, and finally falls to a nearly con-
cular solid (the "adsorbent") at a fixed temper- stant value close to qL in the multilayer region
ature, the amount adsorbed depends only on the (qL = molar heat of condensation). The initial
pressure of the gas, and the relationship between fall is ascribed to surface heterogeneity (adsorp-
nand p, viz., n =f(p)r,g8S,SOIid, is termed the tion occurring preferentially on the high-energy
adsorption isotherm. For vapors, the alterna- sites), and the low maximum is explained in
tive form n =f(p/pO)r,gas,solid is preferable terms of the attractive interaction between ad-
(pO =saturation vapor of the adsorbate at tem- sorbed molecules in the monolayer as they be-
perature T). come closely packed together.
Adsorption isotherms may be classified into The net heat of absorption (4 - q L) deter-
the five types of the Brunauer-Emmett-Teller mines the shape of the isotherm in the mono-
(BET) classification. The basic isotherm, the layer region: the greater the value of (4 - q L)
Type II, is concave to the p/po axis at low rela- the sharper is the "knee" of the isotherm; and
ADSORPTION AND ABSORPTION 32

if (q - qd R: 0, the isotherm actually becomes reasonable time requires a temperature much


convex to the p/po axis, giving a Type III or a higher than that at which the chemisorption
Type V isotherm. (The latter, like the Type IV, occurred. Even so the adsorbate may be released
bends over as p/po approaches unity). in a chemically changed form; thus carbon
In micropores (Le., pores of width ~l 5 A.), the monoxide chemisorbed on zinc oxide at room
attractive fields from neighboring walls overlap temperature is desorbed as carbon dioxide at
so that the net heat of adsorption, and with it the 300°C.
adsorption at a given relative pressure, is en- Chemisorption is an essential primary step in
hanced; consequently the isotherm rises steeply heterogeneous catalysis. At least one of the reac-
from the origin. Thus a solid containing micro- tants must be chemisorbed on the surface of the
pores plus mesopores will give a Type IV, and catalyst, and each of its molecules then forms,
one with micropores plus a large external surface on the surface, a "transition complex" with a
will yield a Type II isotherm, but with a much chemisorbed molecule of the second reactant
sharpened knee in each case. With a wholly B, or with a molecule of B which hits it directly
micro porous solid, where mesopores are absent from the gas phase.
and the external surface is negligible, the point Physical adsorption is an extremely wide-
of inflection vanishes and the isotherm soon spread phenomenon, frequently unwanted. The
reaches a plateau and remains horizontal for adsorption of water vapor by chemicals, by tex-
the rest of its course (Type I isotherm). The tiles, by building materials and by glass is fre-
mechanism of adsorption in micropores is still quently troublesome and can only be avoided
a matter of controversy and reliable methods by taking extreme precautions; sometimes, how-
for the estimation of micropore size distribu- ever, the adsorption of water may be beneficial,
tion are still lacking. and it plays an important role, for instance, in
The detailed course of the isotherms of all the hygiene of clothing.
types, and of the corresponding curves of q Adsorption, whether physical or chemical, also
against n, vary considerably with the mode of reduces the adhesion, and therefore the friction,
preparation and subsequent treatment of the between solids; gases can accordingly act as
solid, which influence both the porosity of the lubricants. In addition, adsorption diminishes
solid and the structure of its surface. However, the tensile strength of brittle solids; the break-
in recent years some success has been achieved ing stress of glass when exposed to nearly satu-
in preparing standardized samples of a few sub- rated water vapor is four times less than when
stances, notably silica, 'Y-alumina, and graphitized exposed to a vacuum. The mechanism is a mat-
carbon blacks. ter of controversy, but it is probably connected
Since physical adsorption results from van der with the fact that adsorption reduces the free
Waals forces, the greater the condensability of surface energy (in the thermodynamic sense) of
the gas or vapor as measured by its boiling point the solid. Adsorption also causes a small (a frac-
or its critical temperature, the greater is the tion of I per cent) expansion of the solid, but
amount of gas or vapor adsorbed at a given pres- the swelling pressure set up-Le., the pressure
sure. Thus at room temperature and atmospheric which would have to be exerted on the solid
pressure, the "permanent gases" such as hydro- to prevent expansion-is very high and may
gen or nitrogen are only slightly adsorbed even reach many atmospheres. Stresses set up in
on a good adsorbent such as charcoal, while structures made up of porous solids, such as
carbon dioxide is more adsorbed, and benzene cement and mortar, when they take up or lose
and carbon tetrachloride are strongly adsorbed. vapors, particularly water, may be so great as to
At very low temperatures the adsorption is cor- cause cracking.
respondingly greater, so that nitrogen at its Absorption is said to occur when the mole-
boiling point of -195°C has an adsorption, on a cules of the gas or vapor actually penetrate into
given solid, comparable with that of benzene at the solid phase itself, so that a solid solution is
25°C. For the adsorption to be readily measur- formed; hydrogen is absorbed by iron at elevated
able, however, the solid needs to have a relatively temperature in this way, and many synthetic
large area-a completed monolayer of nitrogen, polymers absorb water vapor; benzene vapor is
1 square meter in extent, weighs only 0.3 mg, extensively taken up by rubber and water vapor
for example-so that adsorption phenomena by gelatin. Extensive swelling occurs and if the
may escape notice unless the solid is "highly solid is mechanically weak, the absorption may
disperse," i.e., has an area exceeding several continue until the system acquires the consis-
square meters per gram. tency of a jelly, or even the fluidity of a sol. An
Chemisorption results from valency forces- absorption isotherm (analogous to the adsorp-
from the sharing of electrons between the ad- tion isotherm discussed earlier) can be deter-
sorbate molecule and the adsorbent-so that, in mined, but is generally complicated and is best
effect, a surface chemical compound is formed. handled theoretically as a branch of solution
Chemisorption is characterized by a high heat thermodynamics.
of adsorption (of the order of many tens of In adsorption from solution, when a solid
kilojoules per mole, in contrast to the 20-30 kJ having appreciable surface area (say -1 square
mor 1 of physical adsorption) and by difficulty meter per gram) is shaken up with a solution of
of reversal: to desorb a chemisorbed gas in a substance A in solvent B, both A and Bare ad-
33 AERODYNAMICS

sorbed, but to different relative extents. This to a fluid results in an equal but opposite reac-
manifests itself in a change in the composition, tion which in turn causes a rate of change of
e.g., a change ~X.A in the mole fraction of A in momentum in the fluid."
the solution. The problem is thus more compli- (2) The Equation of State: (also called the
cated than in the adsorption of gases, and the Gas Law) : "The product of pressure and vol-
measured isotherm-the curve of ~X.A against ume of a gas, divided by its absolute tempera-
XA -is not susceptible to any simple theoretical ture, is a constant."
treatment. In a dilute solution of A, however, (3) The Continuity Equation: "The mass that
A is always relatively more adsorbed than the passes a station in a duct, or in a natural tube
solvent B; and if A is colored, the resulting bounded by streamlines in a given time, must
diminution in the concentration of A in the equal that passing a second station in the same
solution will be readily detected by eye or time. "
colorimetrically. (4) The Energy Equation: "The total energy
in a mass of air remains constant unless heat or
S. J. GREGG work is added or subtracted."
The above relations, written algebraically and
References limited or combined, yield a vast array of equa-
tions used to calculate the practical problems of
Gregg, S. 1., and Sing, K. S. W., "Adsorption, Surface
aerodynamics. One of the equations thus derived
Area and Porosity," 2nd Ed., London, Academic
is due to Bernoulli and is widely used in low
Press, 1982. speed aerodynamics. It states that in free flow
Steele, W. A., "The Interaction of Gases with Solid the sum of the static and dynamic pressures is a
Surfaces," Oxford, Pergamon, 1974. constant. Static pressure is that pressure which
Flood, E. A. (Ed.), "The Gas-Solid Interface," 2 Vols.,
is equal in all directions; dynamic pressure is the
London, Arnold, 1967. pressure rise realized by bringing the fluid to
Parfitt, G. D., and Rochester, C. M. (Eds.), "Adsorption rest. Bernoulli's equation states that as air
from Solution at the Gas-Solid Interface," London, speeds up, its pressure falls, thus explaining the
Academic Press 1983 . "lifting suction" as the airstream traverses the
curved upper surface of a wing.
Cross-references: INTERMOLECULAR FORCES, The overwhelmingly important law of low
SURFACE PHYSICS, VAPOR PRESSURE AND speed aerodynamics is that due to Newton (1,
EVAPORATION. above). Thus a helicopter gets a lifting force by
giving air a downward momentum. The wing of
a flying airplane is always at an angle such that
AERODYNAMICS it deflects air downward. Birds fly by pushing
air downward. Propellers and jet engines make a
Aerodynamics is the science of the flow of air forward force ("thrust") by giving air a rear-
and/or of the motion of bodies through air. It ward momentum. For some aircraft in some
is usually directed at achieving flow or flight flight modes, both the engine and the wing are
with the maximum efficiency. Aerodynamics is used to provide downward momentum. In the
a branch of aeromechanics; the other main above statements, it is much more accurate to
branch is aerostatics (lift of balloons, etc.). In use the expression "downward (or rearward)
popular usage aerodynamics differs from Gas- momentum," rather than "downward (or rear-
dynamics in that the latter considers other ward) velocity," since less dense air at high alti-
gases and products of combustion (and com- tudes produces smaller forces for comparable
bustion); from aero physics , which implies sub- velocity changes. Forcing the air downward
stantial molecular changes in the gas; and from does not occur instantaneously; it takes place
hydrodynamiCS, which implies employing a me- over the lifting surface. Indeed, the motion
dium of density approximating that of useful reaches ahead of the airplane so that some
bodies in it , and not infrequently a sharp limit downward velocity occurs before the airplane
to its extent (i.e., a water surface). arrives. Thus, airplanes (helicopters, birds, etc.)
Aerodynamics is conveniently divided into are always "flying uphill" which is another way
low and high speed regimes. (The latter is in of saying it takes a force to fly even when the
the articles on COMPRESSIBILITY, FLUID DYNAM- air is considered frictionless. This force is used
ICS, and SHOCK WAVES; here low-speed aero- to push air forward, and is called "drag due to
dynamics is discussed.) lift.," It increases as an airplane slows down or
The many facets of aerodynamics include: flies higher, and it may be reduced (for low-
(1) aerodynamic performance, (2) aerodynamic speed aircraft) by increasing the wing span,
design, (3) aerodynamic loads, (4) aerodynamic Supersonic aircraft, operating under the same
structures, (5) aero-elasticity, (6) aerodynamic laws, but in a different manner, still have a
heating, (7) aerodynamic compressibility, and drag due to lift which increases as above, but
(8) aerodynamic research for all of the above. is reduced designwise by reducing the span;
The computation of aerodynamic effects is hence the new airplanes whose wings crank
based on four laws (given as adapted for fluids): back for high-speed flight. (The "uphill" con-
(1) Newton's Second Law : "A force applied cept has an analogy in the rolling resistance
AERODYNAMICS 34

of a wheel. The weight a wheel carries deflects nal weights (fuel, etc.) and, rarely, through
the surface on which it rests so that it sits in a strengthening the structure.
"gully." Either way it rolls, the path is uphill). At the higher speeds, the performance, loads,
A second important phenomenon (but not a structures and elasticity problems are greatly
"law" in the sense that it cannot be circum- worsened by the aerodynamic heating which oc-
vented) is the manner in which air flows over curs. This is discussed in the article on COM-
say, an airplane surface. Away from the surface PRESSmILITY.
some free-stream velocity exists. As the surface Aerodynamics is not only concerned with air-
is approached the local velocity becomes less craft. The wind loads on signs, buildings, trees;
than freestream and finally becomes zero at the the aerodynamics drag of autos, boats, trains;
surface. The zone in which the air is appreciably the air pollution from smoke stacks of factories
slower is called the boundary layer. Slowing the and ships; the evaporation of open water; the
air reduces its momentum, and by Newton's law blowing of sand and snow; and the internal
(above), a force is produced which acts in a di- losses of air-conditioning ducts-all deserve and
rection to slow the airplane. Like forward force are getting scrutiny by aerodynamicists. The
needed to produce lift, this/rictional drag force forces on all are proportional to the rate of
must have a forward force supplied from some- change of momentum they give, or is given by
where to balance it. For high-speed aircraft a the air; and all have friction drag in their bound-
heating occurs in the boundary layer which re- ary layers. Aerodynamic research scientists seek
quires additional force. Skin friction drag is de- to further understand and improve the air flow
creased by reducing the amount of surface the air involved in each.
scrapes against (i.e., making the aircraft smaller) Aerostatics Aerostatics is the science of mak-
and by flying slower or higher-essentially ing things (balloons, zeppelins, etc.) statically
the opposite of the actions which reduce the buoyant in the air.
drag due to lift. Thus the science of aerody- The basic principle of aerostatics is due to
namics seeks the most efficacious melding of Archimedes. "A body immersed in a fluid [or
the two types of losses. Factors which must be gas] is buoyed up by a force equal to the weight
added include providing space for fuel and of the fluid [or gas] displaced." Thus for buoy-
people, and enough wing to yield a reasonable ancy the weight of structure plus the weight of
landing speed . The drag due to lift and the drag the contained gas must equal the weight of the
due to friction are balanced by the forward air displaced.
thrust provided by the propeller or jet engine Wind Tunnels Wind tunnels are devices which
(which also operates by Newton's law). How- provide an airstream of known and steady con-
ever, this is not enough. The distribution of lift ditions in which models requiring aerodynamic
on wing and tail must be so located that the air- study are tested. The essential elements of a tun-
craft is aerodynamically balanced. Like a child's nel are :
swing, upon being disturbed it should tend to (1) A drive system consisting of either a com-
return to its original ("trimmed") condition. pressor for continuous operation or a tank of
In the above paragraphs the concern has been compressed air for intermittent operation;
for aerodynamic efficiency through optimum (2) a test section in which models are held
design for optimum performance. After these and their orientation is changed.
have been achieved (by studies of previous de- (3) instrumentation capable of reading force,
signs and tests in wind tunnels) the aerodynami- pressure, temperature, and optical effects pro-
cist provides aerodynamic loads to which an duced by the model;
aero-structural engineer must design . Aero- (4) an air efflux system consisting of free exit
structural design is one of the most challenging to the atmosphere or to a vacuum tank, or a
of all design problems as the loads must be car- tunnel returning the air to the compressor.
ried by minimum weight. Aeroplanes carry no There are approximately 500 wind tunnels in
"factor of safety" (sometimes called "factor of the country, ranging in test section size from 1
ignorance"). The aerodynamic-loads engineer in. X I in. to 80 X 120 ft, with speeds from 50
furnishes the maximum air loads the aircraft is to 7000 mph.
ever expected to see; the structure is designed to
withstand these loads without being permanently
bent. If it ever sees a greater load it will be bent ALAN POPE
or destroyed; there is about a 50 per cent differ- References
ence between maximum no-permanent set load
and catastrophic destruction, depending on the Perkins, Courtland, D., and Hage, Robert E., "Airplane
material of which the aircraft is constructed. Performance, Stability, and Control," New York,
In an effort to keep aerodynamic structures John Wiley & Sons, Inc., 1949.
light, they often become flexible, and, in turn, Kuethe, A. M., and Shetzer, 1. D., "Foundations of
become susceptible to flu tter, a motion similar Aerodynamics," New York, John Wiley & Sons, Inc.,
to that of a flag in a wind. The motion gets 1959.
worse with speed. It is the job of the aero- Pope, Alan, and Harper, John J., "Low-Speed Wind
elasticity engineer to assure that flutter will not Tunnel Testing," New York, John Wiley & Sons,
occur, usually through a redistribution of inter- 1960.
35 AIRGLOW

Cross-references: ASTRONAUTICS, DYNAMICS, in principle, this energy can appear in the form
INERTIAL GUIDANCE, MECHANICS, SHOCK of light emission at any wavelength longer than
WAVES. about 1800 A. In most cases the energy actually
goes into heating the atmosphere but when it
does come out as light one has a component of
AIRGLOW the airglow. In effect, what has happened is that
a small fraction of the incoming sunlight has
The aurora has been known to mankind since been trapped in the atmosphere and later re-
antiquity (see AURORA). The intensity, color, leased as longer wavelength light. One may think
and motion of an active auroral display invari- of this as a phosphorescence with a time delay
ably create a vivid and memorable impression imposed both by the need for oxygen atoms to
upon those fortunate enough to observe it. The drift downward to altitudes where they can
aurora is a luminescence of air at high altitude recombine and by the recombination lifetime
which results from the penetration of energetic of many days even at this latter altitude.
charged particles (principally electrons) into the This sort of airglow process includes not only
atmosphere. The existence of the Earth's mag- dissociation and recombination of neutral mole-
netic field restricts the entry of these particles cules but also the case in which a molecule or
to high latitudes; thus only those living in the atom is ionized by very short solar wavelength;
far North or South enjoy regular entertainment here the neutral particle is stripped of one of its
by this magnificent natural phenomenon. electrons, leaving a heavy positively charged ion
There is another way by which the upper behind. Eventually the free electron will meet
atmosphere becomes luminescent; this arises and recombine with an ion, and in this process
from the release of chemical energy stored up there also can be emission of light. It is the cloud
during the day by the absorption of sunlight. of temporarily liberated electrons which reflect
This luminescence is known simply as the air- radio waves back to Earth (see IONOSPHERE).
glow, although in earlier times it was rather Not all of the airglow comes from suchrecom-
ponderously dubbed "the light of the night sky." bination processes, although they are the prin-
The sequence of events which lead to airg!ow cipal source of the night airglow. During the
may be understood by the following example. daytime atoms and molecules in the upper
Over the range of wavelength (or colors) to atmosphere can absorb sunlight at wavelengths
which the eye is sensitive the atmosphere is too long to dissociate or ionize. This energy is
highly transparent. This is of course no accident almost instantaneously re-emitted either at the
but a natural result of evolution, as is the further same wavelength or at a longer wavelength; it
fact that the human eye is most sensitive in the can be considered as a fluorescent component
yellow-green (-5500 A) just where the sun in the day airglow. Yet another type of day air-
radiates light most strongly. But deep in the glow comes from the process in which a fast
ultraviolet (below 1800 A) oxygen molecules moving electron, newly formed by ionization,
cease to be transparent and absorb light strongly; collides with an atom or molecule and transfers
in this process the molecule is broken up into energy to it; the excited atom or molecule can
two oxygen atoms. Most of the sunlight which then emit light.
thus dissociates molecular oxygen is absorbed From the Earth's surface one can only deter-
between 100 and 150 km above the Earth's sur- mine the total airglow intensity coming from
face. At such altitudes an atom makes a collision above . To find out the altitude from which the
with another atom relatively infrequently, per- glow arises one must measure either by using
haps 100 times per second. At the surface of a rocket which passes through the emission
the Earth collisions occur at a rate of about a layer or by viewing the horizon from a satellite.
billion per second. But even if one oxygen atom Such measurements show that nearly all of the
encounters another they will not recombine to airglow comes from the altitude region from
reform a molecule unless a third particle is also about 50 km to 400 km. There is also a power-
present to satisfy laws requiring that energy and ful means by which one can, in some cases,
momentum be conserved. The frequency of such determine the emission altitude of airglow from
triple collisions is of course much less than the the ground. This is to make measurements
frequency of simple two-body collisions. Thus during the twilight period after sunset or before
it is that above 100 km a newly released oxygen sunrise when the upper atmosphere is still
atom could expect to live for months before it illuminated by the sun even though it is dark at
joins up with another in a triple collision to re- the surface. After sunset the dayglow emissions
form a molecule. But at lower altitudes the life- will persist until the Earth's shadow reaches the
time is shorter. So the oxygen atoms created by altitude from which the glow arises. The time at
sunlight above 100 km drift downward until which the glow fades away will indicate what
they arrive at the region between 90 and 100 this altitude is.
km where their lifetime is short enough that Despite all this furious chemical activity in
they there recombine to form molecules rather the upper atmosphere the net result is only a
than continue their downward drift. The re- rather feeble glow, particularly at night. All the
combination releases just the amount of energy various recombination processes together pro-
originally required to split the molecule and so, duce a night airglow which is not immediately
AIRGLOW 36

obvious even far from cities. One way to see that While much continues to be learned about our
there is such a glow is to notice that one's hand atmosphere using ground based studies of the
looks dark against the night sky in some direc- airglow, there is today even more being done
tion where there are very few stars, on a moon- from space. An example is a study of the ozone
less night far from cities. Another way is to layer using infrared airglow from the oxygen
look from a jet aircraft just above the horizon molecule. Dangerous ultraviolet light from the
at night (if the cabin lights are out). A faint sun (just below 3000 A) is blocked by the ozone
greenish glow can be seen near the horizon; this layer and when it is absorbed the ozone is disso-
is airglow from about 95 km altitude which ciated into an oxygen atom and an oxygen
appears brighter because one is looking nearly molecule. The molecule is excited and radiates
tangentially through the atmosphere. The green near 13,000 A; a measurement of this airglow
glow comes from oxygen atoms which have radiation allows one to determine the density
been excited by the energy released in the re- of the parent ozone molecules and to study
combination of two other atoms. As it happens, how and why the ozone varies in both space
it is this same radiation from oxygen atoms and time.
which gives aurora its yellow-green color, but
in that case the atoms have been excited by an JOHN F. NOXON
energetic stream of electrons injected into the
atmosphere from above (see AURORA). References
The day airglow is of course not detectable to McCormac, B. (Ed.), "Physics and Chemistry of Atmo-
the eye owing to the enormously more intense spheres," Dordrecht, Holland, D. Riedel, 1973.
visible sunlight scattered in the lower atmos-
phere. It can be seen and photographed from Cross-references: AURORA; GEOPHYSICS; ION-
above and astronauts have done just this. IZATION; IONOSPHERE; LUMINESCENCE; MAG-
Scientists have found that a great deal can be NETIC FIELD; SOLAR PHYSICS.
learned about the upper atmosphere by study-
ing the airglow using sensitive devices (spectrom-
eters) which isolate one or another of the many
different wavelengths present in the airglow AL TERNA TING CURRENTS
emission spectrum. Even the day airglow can be
detected and measured from the Earth's surface Definition of Alternating Current An alternat-
with the help of special instruments although ing current is a periodic function of the time,
this is best done using spectrometers carried the function being such that the average value is
aboard satellites. One can identify the atom or zero. A special case of an alternating current is
molecule responsible for airglow emission at a shown in Fig. I. The square wave is clearly
certain wavelength. Then one can use the result periodic, and in anyone cycle, the area under
of laboratory studies on how recombination the curve above the horizontal axis is equal to
processes lead to light emission to infer what the area below the horizontal axis. If the two
particular atoms or ions must be recombining in areas are not equal, the current may be de-
order to produce that particular emission. It is scribed as an alternating current superposed on
even possible to use the airglow as a thermom- a direct current, provided the resultant current
eter and find out the temperature of the atmo- varies in a cyclic manner.
sphere at the altitude from which a particular In general, any alternating current may be
airglow emission arises. One way this is done is considered to be the sum of a Fourier series of
to use the Doppler effect; an atomic emission sinusoidal waves. For example, the square wave
line in the airglow actually possesses a finite shown in Fig. 1 may be written as
width in wavelength (about 1/100 A) determined
by the velocity of the emitting atoms. This 4A (sin 2rrft +
rr
! sin 6rrft + ! sin 10rrft + ... )
velocity is a measure of the atmospheric tem-
perature. Sometimes the temperature is seen to
oscillate up and down over a period of an hour
or less by as much as 50°C; when this happens Mognitude
one knows that waves are moving up in the
atmosphere causing a periodic compression and
heating. These waves generally start in the lower
atmosphere, often from weather disturbances;
they grow in amplitude as they rise through the
atmosphere and so produce large oscillations in 1/2! Ilf 3/21 2/f Time
temperature when they arrive at, say, 100 km
altitude. Their importance lies in the fact that
they carry energy from the lower atmosphere
to high altitudes. At the altitude where the wave
dissipation occurs the character of the upper
atmosphere is modified by the wave energy
deposited there. FIG. 1. A square wave alternating current.
37 ALTERNATING CURRENTS

where A is the amplitude of the square wave, R


f is the frequency in hertz (cycles per second),
and t is the time in seconds. Since any alternat-
ing current may be expressed as the sum of a
series of sinusoidal terms, the remainder of this B
article will be devoted to a discussion of sinus-
oidal voltages and currents.
Root-mean-square Value The equation for an Vm sin wt L
alternating current i may be written as
A
i=/msin(wt-o) (I)

where 1m is the maximum or peak value of the


current, w is 21T times the frequency f in hertz
(Hz), and (j is a phase angle. A graph of the
current i is shown in Fig. 2. Since the positive c
and negative loops are mirror images , the aver-
age value of the current over a complete cycle is FIG. 3. An ac series circuit.
zero. The latter statement is valid for all alter-
nating currents and, hence, gives no information
about a particular alternating current. Clearly, the effective value of a sinusoidal alter-
A useful way of stating the magnitude of an nating current is 70.7 per cent of the maximum
alternating current is to give its effective or or peak value . Similarly the effective value of a
root-mean-square value. The term root-mean- sinusoidal voltage is 70.7 per cent of the maxi-
square is derived from the idea of taking the mum or peak value.
square root of an average square of the current. Alternating-current Series Circuit A simple
Thus, by definition, the effective value Ie of the alternating-current series circuit is shown in
current i given by Eq. (I) is Fig. 3. At the instant considered in the diagram,
the current is in the direction shown. The cir-

Ie = ~ J0
211 / W

1m 2 sin 2 (wt - (j) dt (2)


cuit consists of a generator connected in series
to a pure resistance R, a pure inductance L, and
a capacitance C. It is important to understand
the relationship of the current to the potential
where 21T/W is the time for one cycle. In effect, difference across each element. In each case, the
the quantity under the square root sign is the best starting point is a basic definition. Accord-
sum of the squares of the currents during one ing to Ohm's law for a pure resistance,
cycle divided by the time for one cycle. The
value of Ie may be found by performing the R = VR (4)
integration . The result is i
1m where VR is the voltage drop across the resis-
Ie = V2 = 0.707/m (3) tance. The inductance L of a coil is given by

L=-
N<J>
(5)
i
where N is the number of turns and if> is the
magnetic flux passing through one of the turns
of the coil as a result of the current i. Writing
Current
Eq.(5)as
Li=Nif>
and then differentiating both sides, we obtain

L .!!i. = N d4> (6)


Time dt dt
According to Faraday's law, the right-hand side
of Eq. (6) is the magnitude of the induced emf.
The left-hand side, L di/dt, therefore, is the
voltage drop VL across the inductance L.
Finally, the capacitance C is by definition
C =..!L (7)
FIG. 2. A sinusoidal alternating current. Vc
ALTERNATING CURRENTS 38

where q is the instantaneous charge on the posi- tance is greater than the capacitative reactance,
tive plate and Vc is the drop in potential in the phase angle 8 is positive and the current lags
going from the positive plate to the negative behind the voltage. If the inductive reactance is
plate. less than the capacitative reactance, the current
The relation between the impressed voltage leads the voltage.
Vm sin wt and the instantaneous current i fol- Vector Diagram The current and the various
lows from Kirchhoff's law that the sum of the voltages may be related in a meaningful way by
differences in potential in going around a com- means of a vector diagram. Equation (10) may
plete circuit must be zero. At the instant shown be rewritten as follows:
in Fig. 3, there is a potential rise Vm sin wt in
going from A to B and there are potential drops, Ve = VR2Ie2 + (wLIe - Ie/wC)2 (12)
VR, VL, and Vein traversing the rest of the
circuit. According to Kirchhoff's law Equation (12) implies that Ve is the resultant
of a vector Rle at right angles to a vector
V m sin wt - VR - VL - VC =0 wLle - Ie/wC. This is shown in Fig. 4. The cur-
rent Ie is drawn along the horizontal axis, and
di q the effective voltage drop across the resistance,
V sin wt - Ri - L - - - = 0
m dt C Rle , is also drawn along this axis. The effective
voltage drop across the coil is wLle , and this
Since the current is the rate of flow of charge potential difference is drawn along the positive
vertical axis. Finally, the effective potential
. dq
1=- (8) drop across the capacitor is Ie/wC, and this
dt vector is drawn along the negative vertical axis.
The resultant of the three vectors is V e , in
It is now possible to express the current i as a agreement with Eq. (12).
function of t. The result neglecting initial tran- Resonance If the capacitance C can be varied,
sient effects is the current Ie will be a function of C in accor-
. Vm sin (wt- 8) dance WIth Eq. (10). When
1 = VR2 + (wL _ 1/wC)2 (9) I
wL=- (13)
where tan 8 = (wL - l/wC)/R. The phase angle wC
8 is the angle by which the current i lags behind the effective current will be a maximum. The
the impressed voltage. circuit is then said to be in resonance. Actually,
The maximum or peak value of i is the inductance L or the angular frequency w
may be varied instead of the capacitance C. The
Vm circuit will be in resonance whenever Eq. (13)
1m = --"i::::::::=~=-==='='==:=:=­
VR2 + (wL - 1/wC)2 holds. At resonance, the impedance Z is equal
to R, and the circuit, under such circumstances,
If both sides of this equation are divided by acts as though it contains resistance only. The
V2, we obtain process of obtaining resonance is called tuning
the circuit.
1m Vm /V2 Average Power The potential difference V
V2 VR2 + (wL - 1/wC)2 across an ac generator at any instant is the
work required to transfer a unit charge from the
Ve
Ie = (10)
VR2 + (wL - l/wC)2
wLIe
Equation (10) states the relation between the
effective value of the current and the effective
value of the impressed voltage.
Impedance and Reactance The denominator
of Eq. (10) may be defined as the impedance Z WLl.-~ 1 - - - - - - - - 0 = - - -.........
of the circuit. We may therefore write

I - Ve (11)
e - Z RI,

Equation (11) is similar in form to Ohm's law,


Eq. (4). The impedance Z is the square root of
the sum of the squares of two terms. The first Ie /GJC

is the resistance R, and the second is wL - 1/ we.


The latter is called the reactance. The quantity
wL is the inductive reactance whereas l/wC is
capacitative reactance. If the inductive reac- FIG. 4. A vector diagram for an ac series circuit.
39 ALTERNATING CURRENTS

negative to the positive terminal. The work From the latter form, it may be concluded that
done in transferring a charge dq is consequently the average power is the average rate at which
V dq, and the work done per unit time is heat is developed in the circuit. Equation (18)
also shows that a direct current having a value
p= Vdq (14) Ie would produce the same heating effect as an
dt alternating current having an effective value Ie.
The Complex-number Method In the forego-
where P is, by definition, the instantaneous ing, an alternating-current series circuit was dis-
power and dt is the time interval to transfer the cussed by representing voltages as vectors in the
charge dq. Since real plane. For more complicated circuits, this
method is too clumsy. It is much more con-
. dq venient to deal with vectors analytically by
1=-
dt utilizing the i-operator. By definition,
Eq. (14) may be written i = y-:::r
P= Vi When a real number is multiplied by j, it be-
comes an imaginary number. In other words, a
The instantaneous power is the product of the point on the real axis is rotated through 90 0 so
instantaneous voltage and current. that it becomes a point on the imaginary axis.
When alternating current -.fircuits are con- The "complex" impedance of a series circuit
sidered, the average power P rather than the may thus be written
instantaneous power is of interest. By definition

p= ~
21T
L 0
21r/W
Vidt (15)
Z =R+ i (WL - wlc)
since the reactance may be considered to be at
(19)

where, as before, 21T/W is the time for a com- right angles to the resistance. When several im-
plete cycle. The average power may be evaluated pedances are connected in series, the total com-
by making the following substitutions in plex impedance is
Eq. (15): Z=Zt+Z2+Z3+··· (20)
V= Vm sin wt and, when several impedances are connected in
i=Im sin (wt- 0) parallel, the total impedance is given by
I I
The result is - = - + -I + -I + . . . (21)
- 1 Z Zt Z2 Z3
P = "2 Vm 1m cos 0 (16)
The effective voltage V across the generator may
Equation (16) may be rewritten be considered to be a vector along the real axis.
The effective current I furnished by the gen-
- Vm I erator is therefore
P = - - - - cosO
Y2Y2 V
p= Vele coso (17) 1= - (22)
Z
Eviden tly, the average power is the effective By solving Eq. (22), the magnitude of I and the
voltage times the effective current multiplied phase relation between I and V may be found.
by the cosine of the phase angle. In this connec- Although new mathematical techniques are
tion, cos 0 is called the power factor. Equation needed, the saving of time usually justifies the
(17) may be interpreted to mean that only the use of the complex number method of handling
component of Ve in phase with Ie contributes complicated ac circuits.
to the average power. The other component Transfer Function From a more advanced
may be said to be wattless. Since point of view, we may consider the voltage V in
Eq . (22) as the source and the current I as the
response. If we set jw equal to a new variable s,
called the complex frequency, then we may
and refer to I /Z(jw) = I/Z(s) as the transfer func-
tion. For a complicated network, the transfer
R function may generally be expressed as the ratio
cos 0 =- of two polynomials in s. Thus
Z
Eq. (17) may be written as follows: I N(s)
--=-- (23)
P = Ie2R (18) Z(s) D(s)
ALTERNATING CURRENTS 40

The zeros of the transfer function are those with widths of several centimeters and at speeds
values of s for which N(s) = 0, and the poles of in excess of kilometers per minute.
the network are those values of s for which Research on amorphous metals has resulted
D(s) = O. At a zero, the network does not re- from both scientific and technological interest
spond to the source. At a pole, the network in these materials. They provide opportunities
may oscillate at a natural frequency given by for investigating effects of structural disorder
the imaginary part of s. For a sinusoidal input, on the basic physical properties of metallic
the amplitude of the response may be very solids, and they offer promise for achieving new
large at a pole. Network analysis is facilitated and useful combinations of properties for tech-
through the use of the transfer function. Basi- nological ends. Many of the unique properties
cally, the transfer function expresses all the of metallic glasses are thought to result from
characteristics of the network. the remarkable isotropy and homogeneity of
most of these materials on all scales greater
REUBEN BENUMOF than a few atom <;liameters.
Formation Metallic glasses, like other amor-
References phous solids, are always unstable with respect
to some crystalline phase or phase mixture.
Benumof, Reuben, "Concepts in Electricity and Mag- However, glasses can sometimes be formed
netism," Ch. 14, New York, Holt, Rinehart, & from the liquid state because liquids become
Winston, 1961. progressively more viscous and atomic mobility
Benumof, Reuben, "Concepts in Physics," 2nd Ed., within the liquids becomes more limited as the
Ch. 14, Englewood Cliffs, NJ: Prentice-Hall, Inc., temperature is reduced. If crystallization does
1972. not intercede, many liquids undergo large con-
Kirwin, G. J., and Grodzinsky, S. E., "Basic Circuit tinuous changes in viscosity in a narrow tem-
Analysis," Ch. 5, Boston, Mass., Houghton Mifflin perature interval somewhere below their equi-
Co., 1980. librium freezing temperature T m' The large
Bobrow, Leonard S., ''Elementary Circuit Analysis," increase in viscosity on cooling through this
Ch. 8-10, New York, Holt, Rinehart & Winston, interval causes the supercooled liquid to become
1981. a rigid glass below a "glass transition" tempera-
Hayt, W. H., and Kemmerly, J. E., "Engineering Cir- ture Tg at which atomic mobility is severely re-
cuit Analysis," Ch. 8-11, New York, McGraw-Hill duced and crystallization is thereby suppressed.
Book Co., 1978. Although glasses are ultimately unstable against
crystallization, they can be retained almost
Cross-references: CAPACITANCE; CIRCUITRY; indefinitely at temperatures well below Tg .
CONDUCTIVITY, ELECTRICAL; ELECTRICAL The high fluidity of metallic liquids for tem-
MEASUREMENTS; ELECTRON; INDUCED ELEC- peratures greater than Tg , in contrast to more
TROMOTIVE FORCE; POTENTIAL.
familiar glass formers like Si0 2 , greatly facili-
tates crystallization as the liquids are cooled
below T m; but for temperatures less than T$
AMORPHOUS METALS the high viscosity and very limited atomIC
Amorphous metals are materials with good elec- mobility make formation of stable crystalline
trical and thermal conductivity, with lustrous nuclei and growth of these nuclei very unlikely.
appearance, and with other common metallic Thus the critical temperature range in which
properties, but with atomic arrangements which rapid crystallization is liable to occur at appre-
are not periodically ordered as in more familiar, ciable rates is from T m to Tg , and the less time
crystalline metallic solids. Noncrystalline, amor- spent in traversing this temperature interval, the
phous, and vitreous are equivalent terms used
to describe solids in which atoms are not peri-

JJ
STREAM OF MOLTEN ALLOY
odically arranged and which in fact lack any
sort of long-range order. The term glass has often
been reserved for amorphous solids formed by
continuous solidification of a liquid, but metallic
glasses are now commonly considered to include
amorphous metals produced in a variety of
ways, including evaporation, sputtering, and
electro- and chemical deposition, as well as
cooling from the liquid state. ROTATING
COOLING
Widespread interest in metallic glasses has DRUMS
developed since 1960, when it was first demon- QUENCHED
ALLOY
strated that metallic glasses could be formed by (a) (b)
very rapidly cooling, or quenching, some molten
metallic alloys. Liquid quenching techniques, FIG. 1. Continuous quenching devices for making
illustrated in Fig. 1, have been developed for metallic glasses: (a) melt spinning and (b) roller
producing metallic glasses as continuous ribbons quenching.
41 AMORPHOUS METALS

higher the probability of successful glass forma-


tion. The rate of crystallization Rc is expected
to peak in this interval.
Many alloy systems and alloy compositions
have been discovered which can be quenched to
form a glass with cooling rates of lOs-10 7 °K/sec.
These alloys involve compositions for which
T m is suppressed with respect to melting points
of nearby crystalline phases, and compositions
of high glass forming tendency are those with
the lowest Tm values. Two devices used for pre-
paring metallic glasses are illustrated in Fig. 1.
CO2 P _~ll.L..L_--'--""LUII_
Rapid quenching is accomplished by spreading
the molten alloy on a metal surface which serves fcc Co _~-'- _ _ ----'_---L
as a heat sink. This speeds the cooling by re-
ducing the thickness across which heat must
diffuse and by ensuring that the alloy remains
in good thermal contact with the heat sink
during the critical cooling period. Metallic glass
ribbons produced in this way generally have
thicknesses of less than 50 11m.
Several classes of metallic glasses which have
been made by liquid quenching are listed in
Table 1, and the list continues to grow. The 10 20 30 40 50
most widely studied class consists of the tran- SCATTERING ANGLE, 28 (deg)
sition metal-metalloid alloys with a bout 20
atomic percent metalloid. Electrodeposition FIG. 2. X-ray scattering patterns for amorphous and
and chemical or electroless deposition have also polyphase, crystalline Co gO P20 .
been used to produce several amorphous alloys
of the transition metal-metalloid type.
An even larger range of materials has been overlapping peaks. Part of the diffraction pattern
prepared as amorphous solids by vapor quench- for an amorphous Co-P alloy with about 20
ing methods involving deposition by evaporation atomic percent phosphorus is shown in the
or sputtering onto a cooled substrate. Here the lower section of Fig. 2. Heating this alloy
problem is to obtain metastable, noncrystalline changed the scattering pattern to that shown in
atomic arrangements at sufficiently low tem- the center of the figure. Appearance of sharp
peratures to make spontaneous crystallization maxima indicates that the alloy began to crys-
very improbable. Some nearly pure metals have tallize to the expected crystalline phases of
been prepared as amorphous thin films on sub- CO 2 P and Co. Further heating to 700°C pro-
strates cooled to 4°K. Nominally pure films of duced the sharper scattering pattern shown at
cobalt crystallize at about SOcK, and intentional the top of the figure, indicating improved per-
addition of impurities, for example a few per- fection of the crystalline materials. Diffraction
cent of silicon, greatly increases the crystalliza- patterns of amorphous alloys look more like
tion temperature of such amorphous films. those of liquid metals than like diffraction
Amorphous metallic alloys which can be retained patterns of crystalline materials. Electron micro-
to temperatures well above room temperature graphs of most amorphous alloys are featureless
have also been prepared by vapor quenching, compared with those of crystalline solids.
including alloy systems and compositions for The diffuse scattering patterns produced by
which glass formation by liquid quenching has amorphous solids can be used to calculate radial
been unsuccessful. distribution functions, RDF(r), which describe
Structure X-ray, electron, and neutron correlations among atomic positions in the
scattering experiments, together with electron materials. The methods of analysis are the same
microscopy, have provided evidence for non- as those used for liquid alloys. Examples for
periodic atomic arrangements in the amorphous two amorphous metal-metalloid alloys are
alloys discussed above. These materials produce shown in Fig. 3. Maxima in RDF(r) occur for
diffuse diffraction patterns, with a few broad, r-values corresponding to frequently occurring

TABLE 1. CLASSES AND EXAMPLES OF METALLIC GLASSES.

Transition metal-metalloid alloys FegOP 13 C 7


Late transition metal-early transition metal alloys Ni 6o Nb 4o
Transition metal-rare earth metal alloys C020GdgO
Simple metal alloys Ca6sAl3S
AMORPHOUS METALS 42

materials. Efforts to improve dense random


packing models for binary alloys have used
computers to generate dense random packings
with two sizes of spheres, chosen to represent
100 the two elements of the binary alloy. These
structures have been relaxed by allowing those
small displacements of atom sites which reduce
the energy, calculated by assuming that the
atoms interact with one another by pairwise
forces. Although such models have been fairly
-u. 50
successful in reproducing experimentally ob-
Cl served distribution functions for most metal-
a:: metalloid and rare earth metal-transition metal
alloys, the more general applicability of dense
random packing as a structural model for
metallic glasses has not yet been established.
Electrical Resistivity Resistivities of amor-
Of-----'I phous metallic alloys are typically between 100
and 200 /lU-cm, values which are higher than
resistivities of most crystalline metals and alloys
a
but are similar in magnitude to electrical resis-
tivities of liquid metals. Although the reversible
temperature dependence of electrical resistivity
2 4 6 8 10 12
r('&)
for amorphous alloys is less than for most crys-
talline metals, when the alloys crystallize, their
FIG. 3. Radial distribution function for amor- resistivities decrease abruptly and irreversibly,
phous transition metal-metalloid alloys Ni76P24 and as shown for example in Fig. 4.
Co 7S P22 , and for a dense random packing of hard The major features of the reversible resistivity
spheres (DRPHS) structural model. [From G. S. Car- behavior can be explained in terms of a model
gill III, AlP Conf Proc. 24 138 (1975).) originally developed for liquid metals. The basic
features of this model are that the electrical
interatomic separations in the amorphous alloys. resistivity arises from scattering of conduction
RDFs illustrate several differences between electrons by the amorphous structure and that
atomic arrangements in amorphous and crys- the structure-dependent scattering can be evalu-
talline alloys. Long-range structural periodicity ated from experimentally measured X-ray, elec-
is absent in the amorphous alloys. There are tron, or neutron scattering patterns of the
only weak correlations between atomic positions material.
separated by more than four or five atom diam- A number of amorphous alloys become super-
eters. There is no unique nearest neighbor dis- conductors at sufficiently low temperatures.
tance in these amorphous alloys. Widths of the Much early work on vapor quenched amorphous
nearest neighbor maximum in the RDFs are alloys was stimulated by the discovery that
typically between 0.4 A and 0.5 A full width at amorphous bismuth was metallic and became
half maximum (FWHM). Radial distribution superconducting at 6°K, although crystalline
functions for crystalline Co or Ni would have bismuth is a non-superconducting semimetal.
much narrower nearest neighbor maxima, with
widths less than 0.2 A FWHM. Crystallization
of these alloys increases their density by less 100
than one percent. The RDFs and density mea-
surements together provide statistical descrip- ] 80
AMORPHOUS ~
tions of atomic arrangements in amorphous c:
alloys and serve as critical tests for three- ~
>- 60
~ ~
dimensional structural models. f-
;;
The most extensively studied models for r::: 40 ~
atomic arrangements in metallic glasses are Cf)
~~'" ~~
iii ';:J~" ",~'v'v
based on the nonperiodic "random" packing
of spheres and are called dense random packing '"a: 20 ,<-(1.
c,~
",e;,

of hard spheres models. The RDF for a physi-


cally constructed dense random packing of
several thousand single-sized steel spheres is a 200 400 600 800 1000 1200
shown in Fig. 3, together with the experimen- TEMPERATURE (OC)
tally determined RDFs for two transition metal- FIG. 4. Electrical resistivity for a PdsoSi 20 alloy in
metalloid alloys. The agreement is impressive, amorphous, equilibrium crystalline, and liquid forms.
although the model takes no account of the [From P. Duwez, Trans. Am. Soc. Metals 60, 607
alloy nature of the experimentally studied (1967).)
43 AMORPHOUS METALS

However, subsequent research has failed to class of amorphous alloys has been of special
discover any amorphous superconductors with interest because of its potential usefulness in
transition temperatures above lOoK. magnetic memory devices, for which the mag-
Magnetic Properties Metallic glasses are as netic anisotropy and ease of magnetic domain
diverse in their magnetic behaviors as are crys- wall movement are essential.
talline metals and alloys, ranging from strong Other basic aspects of magnetic behavior of
ferromagnetism to weak diamagnetism, includ- amorphous alloys which have been studied are
ing ferrimagnetism and spin glass type magne- the overall temperature dependence of magne-
tism. The largest amount of research has been tization, the characteristic magnetic excitations,
carried out on ferromagnetic transistion metal- including spin waves, and the effects of struc-
metalloid (TM-M) alloys and on ferrimagnetic tural and chemical disorder on phase transition
transition metal-rare earth metal (TM-RE) phenomena, including sharpness of the ferro-
alloys. The magnetically ordered amorphous magnetic-paramagnetic transition and the crit-
TM-M systems are well described as aligned ical exponents associated with this transition.
ferromagnets, as illustrated schematically in Mechanical Properties Many mechanical
Fig. 5(a), where metalloid atoms are not shown. properties of metallic glasses have been investi-
Some TM-RE systems are ferrimagnets, with gated, including elastic constants, yield and
RE moments pointing opposite to the TM fracture strengths, modes of plastic deforma-
moments. For alloys with gadolinium as the tion, the dependence of these properties on
rare earth element, the moments of TM atoms thermal history and measurement temperature.
(small circles in Fig. 5(b)) are closely antiparallel Most of these studies have employed quenched-
to those of the RE atoms (large circles), but for from-the-liquid glasses.
other rare earths, notably terbium and dyspro- Metallic glasses, like most polycrystalline
sium, the RE moments are strongly coupled to solids, are mechanically isotropic, so only two
local easy directions, giving a canted ferrimag- constants, for example the bulk modulus and
netic arrangement like that shown in Fig. 5(c). the shear modulus, are needed to characterize
It is usual to think of glasses and amorphous their elastic properties. Elastic stiffness con-
solids as being macroscopically isotropic, and stants of the glasses are generally smaller than
indeed these materials are isotropic with respect those of the same materials in crystalline form,
to most of their physical properties. Magnetic which are usually mixtures of crystalline phases.
anisotropies for most amorphous transition The bulk moduli differ only by 5-10%, but the
metal-metalloid alloys are very small and their shear modulus J1 and Young's modulus E for
effects on coercivity and ease of magnetic glasses are lower by 30-50% than for corre-
saturation can be largely eliminated by suitable sponding polycrystalline solids. The lower resis-
annealing treatments, which relax internal tance to shear deformation has been explained
strains and allow some local rearrangements of in terms of nonuniform atomic displacements
atom positions. Coercivities for suitable annealed which can occur in the glasses but which are
TM-M alloys can thus be as small as 10-2 Oe. restricted by symmetry constraints in the crys-
Annealing in the presence of applied magnetic talline forms of these materials.
fields can also be used to induce macroscopic Metallic glasses are typically very strong, hav-
magnetic anisotropies in the amorphous TM-M ing observed yield strengths u y ~ E /50 as high
alloys. Low coercivity and high permeability as 370 kg/mm 2 (640 X 10 3 psi) for FesoB2o,
are very desirable properties for a variety of similar to that of polycrystalline, cold drawn,
applications, including low-loss transformer high carbon steel. However, unlike high strength
cores, presently one of the most widely pursued polycrystalline alloys, which are inherently
applications for metallic glasses. brittle, most metallic glasses are microscopically
The behavior of magnetic domains in amor- ductile and can sustain appreciable local plastic
phous Gd-Co alloys is often dominated by large deformation, particularly in compression or
perpendicular easy axis magnetic anisotropies shear. Although there have been several propo-
of 104 -10 5 erg/cm 3 induced in the sputtered sals for using metallic glass wires or ribbons to
films by appropriate deposition conditions. This form reinforced composites with plastic, cer-
amic, or metallic matrices, no commercially
t, t, t, important applications of this sort have yet
t, emerged.
~ t, t, A ~ t, t, ~ t, )JJ"'I'J
t, ~ t, t, ~ t, t, J 0.. t; Plastic deformation in metallic glasses for
A ~ t, A t, A ~ ~ "pt,<\ temperatures much lower than the glass tem-
t, ~ A t, .! A perature Tg is inhomogeneous, occurring in
t, t, A A J ~
A ~ localized shear bands along directions of maxi-
(a) (b) (c) mum resolved shear stress. In contrast to the
usual work hardening which accompanies plastic
FIG. 5. Atomic magnetic moments in amorphous deformation in crystalline solids, the localized
magnets: (a) ferromagnetic as in FesoB2o, (b) ferri- shear bands of metallic glasses are manifestations
magnetic as in Gd 30 Co 70 , and (c) canted ferrimagnetic of "work softening." The glassy material is
as in Tb33Fe67' weakened as a result of the local disruption of
AMORPHOUS METALS 44

Cross-references: CR YST ALLIZATION;LIQUID


STATE; MAGNETISM; MECHANICAL PROPERTIES
OF SOLIDS; METALLURGY; SOLID STATE PHYS-
ICS; STATES OFMATTER;THIN FILMS; VITREOUS
STATE; X-RAY DIFFRACTION.

ANTENNAS
Communication systems characteristically con-
sist of cascaded networks, each network designed
to carry out some operation on the energy
conveying the information. In radio communi-
cation, transmitting systems' antennas are the
networks serving to transfer the signal energy
from final circuit network to space. In receiving
systems, on the other hand, antennas serve to
FIG. 6. Fracture surface of a glassy Pd-Cu-Si wire. transfer the signal energy from space to the
[From L. A. Davis and S. Kavesh, J. Mat. Sci. 10,453 input circuit networks. In circuits the flow of
(1975).) energy is restricted to one of two directions.
The effectiveness of the transfer of energy
between the antenna and its adjoining circuit
is, therefore, determined solely by the terminal
short-range order or the local creation of ex- impedance of the antenna and that of the
cess free volume by the previous plastic defor- adjoining circuit network. The terminal imped-
mation. Tensile failure of metallic glass wires ance of a circuit network can generally be
designed for operation over a relatively wide
or ribbons occurs in the localized shear band, frequency band. On the other hand, the termi-
with characteristic veinlike fracture surface nal impedance of an antenna may, in some
features like those shown in Fig. 6, which are cases, vary outside acceptable limits, thus
suggestive of local melting. For temperatures
closer to the glass transition temperature, de- limiting the useful frequency bandwidth of
formation takes place by homogeneous viscous the antenna. The terminal impedance of an
creep, with the strain rate proportional to the antenna and its variation with frequency is,
applied stress. therefore, one of the important parameters
The ductility of many metallic glasses is lost describing the performance of the antenna.
through irreversible embrittlement when the The relationship between an antenna and
alloys are annealed, even though the annealing space, however, is more complex. The distribu-
temperatures are too low to cause any detectable tion of the radiated energy varies with direction
crystallization. Most metal-metalloid alloys in space, giving rise to the directive properties
show this behavior, and those containing phos- of the antenna. The energy radiated by the
phorus are particularly susceptible. Some experi- antenna is in the form of electric and magnetic
ments suggest that the embrittlement is caused fields. These are vector quantities which, at a
by fine-scale phase separation or segregation distance from the antenna, are at right angles
involving metalloid elements. to each other and to the direction of propaga-
tion. The planes in which these vectors are
G. S. CARGILL III located, and whether they are stationary or
rotate with time, determine the polarization
of the radiated field. The performance of an
References antenna can, therefore, be fully described
only by specifying several parameters, such as
P. Chaudhari, B. C. Giessen, and D. Turnbull, "Metallic radiation pattern, power gain, 1 , 2 directive
Glasses," Scientific American 242, 98-117 (April gain, and polarization. In discussing antenna
1980). properties, it is convenient to consider the
Giintherodt, H. J. and Beck, H. (Eds.), "Glassy Metals antenna as a radiating rather than as a receiving
I, Ionic Structure, Electronic Transport, and Crystal- network. The antennas, however, are passive
lization," Topics in Applied Physics, Vol. 46, and linear networks and, therefore, subject to
Springer-Verlag, New York, 1981. the law of reciprocity. 3 Therefore, the perfor-
Masumoto, T. and Suzuki, K. (Eds.), "Proceedings of mance of an antenna, in terms of radiation pat-
the Fourth International Conference on Rapidly tern, gain, or polarization, is the same whether
Quenched Metals," Japan Inst. of Metals, Sendai, the antenna radiates or absorbs radiation.
1982. Except for the region in the immediate
Gilman, J. J. and Leamy, H. J. (Eds.), "Metallic vicinity of the antenna, termed the near-field
Glasses," American Society for Metals, Metal Park, region of the antenna, the radiated electro-
Ohio, 1978. magnetic energy propagates radially away
45 ANTENNAS

from the antenna, and the radiation intensity* upon the size of the antenna, expressed in wave-
varies inversely as the square of the distance lengths. The larger the antenna, the greater is
from the antenna. This attenuation with likely to be the gain. The values of gain for dif-
distance is the propagation loss. In radio ferent antennas range from 1.5 for an electrically
communication systems it is the principal small dipole to values hundreds and thousands
component of the system's loss and must be of times greater. In practice, antenna gains are
compensated for by increasing the power of usually expressed logarithmically, in decibels.
the transmitter, the gains of the transmitting For the low-frequency end of the ratio spec-
and receiving antennas, and the sensitivity of trum (15 kHz to 3 MHz) antennas, although
the receiver. In describing antenna performance, large physically, are relatively small in terms
it is customary to disregard the propagation of wavelengths. Therefore, the directive gains
loss and to represent the distribution of the of these antennas seldom exceed 3 (4.8 dB).
radiated power as a function of the two direc- The radiation efficiencies of low-frequency
tion angles only. Such distribution is commonly antennas are usually very low. As a result, their
represented graphically in the form of a radia- power gains are significantly lower than their
tion pattern. Radiation patterns can take a directive gains and may be negative when
variety of forms. One form is a polar diagram expressed in decibels. In the high-frequency band
with radial displacement proportional to the (3-30 MHz), which is used principally for long-
field strength or to the radiation intensity in distance communication, antenna gains of 10-
that direction. Another form is a rectangular 100 (10-20 dB) are frequently encountered. At
diagram with abscissa representing one of the microwave frequencies, where the wavelengths
directional angles, and the ordinate representing are a fraction of a meter, gains of several hundred
the radiated field intensity. The intensity may and even thousand times (20 to over 30 dB),
be represented linearly as power or logarithmi- are common.
cally in decibels. 4 For representing the directive When an antenna has one or more of its dimen-
properties of an antenna in all directions, con- sions significantly larger than a wavelength, its
tours of equal radiation intensity may be radiation pattern is likely to have more than
plotted, with the two direction angles as abcis- one maximum. The radiation pattern in such
sas and ordinates, respectively. cases is said to have a lobe structure. That part
The directive properties of an antenna also of the radiation pattern which encompasses
lead to the concept of antenna gain. The the direction of the largest maximum and the
directive gain of an antenna in a specified radiation immediately to each side of it is
direction is the radiation intensity in that direc- referred to as the main lobe. The radiation about
tion compared to what it would be if the total the minor maxima is referred to as the secondary
radiated power were distributed equally in all or side lobes. One of the common goals in
directions. Besides the directive gain there is antenna design is the reduction in the levels of
also the concept of power gain. The latter secondary lobes. These may, at times, be a
differs in that the total input power rather than source of interference to other transmissions.
the radiated power is used as the reference. The In common with light, radio waves consist
power gain for an antenna is always smaller of electric and magnetic fields at right angles
than the directive gain by the factor of radiation to each other and to the direction of propaga-
efficiency. IEEE Standards Publications 145 tion. The orientation of these fields, specifically
and 149 (Refs 1 and 2) should be consulted for the electric field, determines the polarization of
exact terminology. For some applications, the wave. Thus, if the electric field vector is
such as point-to-point communication, high parallel to the ground, the radio wave is termed
values of antenna gain are desired because horizontally polarized. Although the polariza-
such antennas concentrate the available power, tion of the energy radiated by an antenna, in
thus effectively increasing it. Similarly, in general, varies with direction, an antenna is
receiving applications, such antennas are more usually designated to be horizontally (or verti-
responsive to radiation arriving from the desired cally, or circularly, etc.) polarized, depending
direction and, at the same time, reduce the on the polarization of its radiation in the direc-
response to signals arriving from other (possibly tion of the main lobe maximum.
interfering) directions. Conversely, for other The importance of polarization in radio engi-
applications such as broadcasting, broadly neering lies principally in the different reflective
directional antennas with wide coverage may properties of the ground for horizontally and
be desired. vertically polarized waves. Different radio ser-
The gain of an antenna is dependent principally vices are served best by different polarizations.
Antennas for use in the low-frequency end of
the radio spectrum, 15 kHz through about 3
* In this discussion, radiation intensity has the MHz, are almost invariably vertically polarized.
dimensions of power flow per unit area, normally, This includes the AM broadcast band (535-
watts per square meter. Electric field strength, on 1605 kHz). In the high-frequency band, 3-30
the other hand, is in volts per meter. Definitions of MHz, both horizontal and vertical polarizations
these and other terms can be found in Reference 1. are employed. For television broadcast service
ANTENNAS 46

in the United States and many other countries


(but not in the United Kingdom), horizontal
polarization is in use. In the United States
horizontal polarization is also standard for FM
broadcasting service. However, to accommodate
reception in private automobiles, FM broadcast
stations are now permitted to add a vertically
polarized component to the transmission, but
it may not exceed the horizontally polarized
component.
Recent years have witnessed a very rapid
growth in the importance of communication
using satellite relays. Since the antennas used To receiver
in this service are highly directive, and the direc-
tions to satellites are significantly above the (a)
horizon, ground reflections play no part in such
communications. For this reason various organi-
zations involved in satellite relay communication
make use of horizontal, vertical, and circular
polarizations, depending on local considerations.
One current practice is to use circular polariza-
tion in both directions: left-hand circular
polarization for the up-link (ground-to-satellite)
and right-hand circular polarization for the To receiver
down-link. At the time of this writing (Decem ber
1981) a growing practice is to reuse frequencies. (b)
By this is meant the use of the same frequency
for two channels in the same direction; one
channel uses right-hand circular polarization,
the other, left-hand circular polarization. Isola-
tion of 27 dB (power ratio of 500: 1) between
the two channels is being reported using this
practice.
The types and varieties of antennas encoun-
tered in practice are very numerous. Each type
has some advantage over the others for some
specific applications. Some of the more impor-
tant and frequently encountered requirements
are those for operating bandwidth, high radia-
tion efficiency, a specified degree of directivity,
Co·axial
To receive r or transm ission
transmitter line

RAD IATING ELEMENT


(e)
FIG. 2. Some examples of elementary magnetic radi-
TO TRANSMITTER ators: (a) single-turn loop antenna; (b) ferrite rod
OR RECE IVER antenna; (e) half-wavelength slot antenna.
~==~~~~~~=
(a)
whether high or low, polarization, etc. By no
RADIAT I NG ELEMENT
means the least of the requirements is that of
economy; it is a poor engineering practice to
overdesign the antenna system. Two fundamen-
tal types of antennas encountered in practice
are illustrated in Figs. 1 and 2. Fig. I shows two
of the electric radiators, a monopole and a
dipole. A monopole, shown in Fig. l(a), in one
form or another, is used almost exclusively
TO TRANSM I TTER
throughout the low-frequency end of the radio
OR RECE I VER (b) spectrum. It is vertically polarized. The dipole,
shown in Fig. l(b), is more versatile since it can
FIG. 1. Two types of elementary radiators: (a) be oriented to give either horizontal or vertical
Monopole over ground; (b) dipole. polarization. It is frequently used as an elemen-
47 ANTENNAS

tary radiator in large, array-type antennas. element to the rear which acts as a reflector
Basically, a monopole is a special case of a and several parasitic elements in the forward
dipole where one-half of the dipole is replaced direction, known as directors. Parasitic elements
by its electromagnetic "image" reflected by the are so termed because they derive their excita-
ground. An electric dipole may be considered tion by an electromagnetic coupling to driven
to be one of the two most fundamental types elements. The reflector element is slightly
of antenna. In its most elementary form of two longer than its resonance at the operating fre-
point charges it is used to derive the mathemati- quency, which gives it a positive reactance. The
cal expressions for the radiated fields. The other directors, on the other hand, are shorter than
fundamental type is the magnetic radiator shown their resonant length, which gives them a nega-
in Fig. 2. Fig. 2(a) illustrates the simplest form tive reactance; this causes them to enhance the
of a magnetic radiator, a single-turn loop. In radiation in the forward direction. The Yagi
the early days of broadcast radio the loop antenna is compact, with a moderately high
antenna, in a multiturn form, was sometimes directive gain. However, it has a very narrow
used with home receivers. Because of its small operating bandwidth.
size and portable nature it is still used for this An antenna res em bling the Yagi antenna physi-
purpose, especially for portable receivers, but cally, but with very different performance
in the form of a compact, ferrite-loaded coil, characteristics is the log-periodic an tenna, illus-
such that shown in Fig. 2(b). In industrial appli- trated in Fig. 4. Like the Yagi antenna, it consists
cations a loop antenna is sometimes used in a of a tandem array of dipole elements; however,
modified form for FM transmitting antennas. all the elements are driven, being connected to
For microwave applications the magnetic radia- the transmission line running axially through
tor finds use in the form of a "slot" antenna. the array. At anyone frequency, only three or
An elementary form of the slot antenna is four of the elements are radiating, those whose
shown in Fig. 2(c). It consists of a narrow, half- lengths are near resonance at that frequency. At
wavelength-long slot in a sheet of metal and is the lowest operating frequency for which the
excited at two points across the slot by a antenna is designed, the longest elements, those
transmission line. For proper impedance match at the rear end of the antenna, are in operation.
to the transmission line, the points of excitation As the frequency of operation is increased, the
are usually well off the midpoint of the slot. "active region" moves forward along the
Slot antennas usually find their application in antenna, and at the highest design frequency
large, array-type antennas; in such cases the slot only the forward elements are in operation. In
antennas are often excited by waveguides passing contrast to the Yagi antenna, the log-periodic
beneath the slots. antenna has only a moderate directivity, but it
Departing from the elementary radiators, Figs. can be designed for a very wide frequency
3,4, and 5 show three antenna types employing operating band. This last characteristic has made
wire conductors which have widespread use in the log-periodic design popular for consumer
the lower end of the useful radio spectrum, up applications for TV reception; there the VHF
to somewhere between 30 and 300 MHz. Figure band alone extends from 54 to 216 MHz, a 4 to
3 is a diagrammatic representation of a Yagi 1 spread.
antenna. It consists of a single driven element-a Another antenna employing wire conductors
half-wavelength dipole with a single parasitic but very different in design, is the rhombic

REfLECT~ENT
DRIVEN ELEMENT

\
0:::
DIRECTOR ELEMENTS

~
DIRECTION OF
MAXIMUM RADIATION~

TO TRANSMITTER
OR RECEIVER
FIG. 3. A five-element Yagi antenna.
ANTENNAS 48

/ Supporting Catenary Cables

Transmission
/r~::- Line
II ',,,,,Aperture
~ III L~ Angle
'I~<a ,
-1L ~ " ,,
~
, ,
'~

FIG. 4. Horizontally polarized log-periodic antenna.

TERM INATING RESISTOR

--
RADIATING WIRES

DIRECTION OF
MAXIMUM
RADIATION

••
II" •
: - SUPPORTING POLES
""" •
"
""
~TO TRANSMITTER
OR RECEIVER
FIG. 5. A rhombic antenna.

antenna, illustrated in Fig. 5. It consists of two this antenna type is the reflector. This consists
wires in rhombic configuration; each side of the of two rectangular reflecting surfaces joined
rhombus is, characteristically, two to five wave- at the axis, forming a corner. In operational
lengths long. Its principal use is for long-distance antennas the metal reflector surfaces are
point-to-point communication in the HF (3- usually replaced by wire screens. The width of
30 MHz) band. Its construction is simple and the corner reflector must be at least one-half
trouble-free; however, it does require a large wavelength. In practice, the lengths and the
site. Its design and construction is described widths are usually one to two wavelengths. The
in detail by Harper. 5 greatest usefulness for this antenna type is in
An antenna type which may be considered the VHF (30-300 MHz) band. The gains and
as transitory between the lower frequency, radiation patterns of the corner-reflector
wire-conductor types and the microwave, antenna, as functions of length, width, and
reflector-type antennas is the corner-reflector aperture angle, have been measured in detail at
antenna, illustrated in Fig. 6. The active element the National Bureau of Standards and reported
shown in the illustration is a half-wavelength by Cottony and Wilson. 6 ,7
dipole; however, collinear arrays of dipoles can Figure 7 shows two antenna types, a horn
be and have been employed. The key element of antenna and a paraboloid reflector antenna.
49 ANTENNAS

Width - W - - - - l

Aperture angle, IJ

---
--~~-------s----_~~---
Distance of
dipole from D r iven element

FIG. 6. Diagram of a corner-reflector antenna showing the nomenclature of its parameters.

PARABOLIC REFLECTOR

TO TRANSMITTER ... .. ..... ~"-.-J


OR RECEIVER

(a)
(b)
FIG. 7. Examples of microwave-type antennas: (a) horn antenna. (b) paraboloid-reflector antenna.
ANTENNAS 50

These are characteristic of the antennas used of the number of elements rather than of the
for microwave frequencies. The horn antenna, area over which they are arrayed . Uzkov 10
Fig. 7(a), is used generally where moderate showed that, for broadside arrays, the theoreti-
directivity suffices, but a high front-to-back cal gains are directly proportional to the num ber
ratio of directivity is desired. The paraboloid of elements and, for end-fire arrays, to the
antenna, on the other hand, is used for high-gain square of the number of elements. Bloch, Med-
applications; it is a quasi-optical device . It is hurst, and PoOlll presented a comprehensive
the type widely used for ground stations for survey of supergain and superdirectivity. To
satellite communication. For this application realize even a modest degree of supergain, one
it is beginning to some extent to enter the should be able to control the excitation currents,
consumer market. both in magnitude and phase, with a very high
The statement in the introductory paragraph order of precision. Supergain and superdirec-
regarding the importance of the terminal imped- tivity are, therefore, of theoretical rather than
ance of the antenna understates one of the practical interest.
fundamental antenna problems: the relationship For a more complete listing and discussion
between the size of the antenna, expressed in of antenna types the reader is referred to texts
wavelengths, and the properties of the antenna, on antennas such as the "Antenna Engineering
both its terminal impedance and its directivity. Handbook."12 For additional discussion of the
As the size of the antenna is reduced below principles underlying antennas, texts by Kraus 13
one-half wavelength, the resistive component and Schelkunoff and Friis14 are suggested.
of the terminal impedance rapidly diminishes
while the reactance increases. As a result, either H. V. COTTONY
the operating bandwidth or the radiation effi-
ciency, or both, are rapidly reduced . Numerous
papers have been published in an attempt to References
quantify this relationship , the most recent being
that by Hansen. 8 Hansen's paper also lists numer- 1. IEEE Standard No . 145-1983, "Definitions of
ous references to earlier work. It is appropriate Terms for Antennas," The Institute of Electrical
to note, however, that the mathematical opera- and Electronics Engineers, Inc., New York, 1983 .
tions employed by Hansen implicitly limit the 2. IEEE Standard No. 149-1979, "Test Procedures
validity of the final results to dipoles. for Antennas," The Institute of Electrical and
The relationship between the size of the Electronics Engineers, Inc., New York, 1979.
antenna and its power and directive gain is 3. Jordan, E. C. "Electromagnetic Waves and Radi-
easier to grasp. In the case of quasi-optical ating Systems," pp. 327-328, Prentice-Hall, Inc.,
antennas, such as horn and paraboloid reflector Englewood Cliffs, N.J. , 1950.
antennas, (Fig. 7), the opening of the antenna 4. Schelkunoff, S. A. "Electromagnetic Waves,"
is known as the aperture, a term borrowed from pp. 25-26, Van Nostrand Reinhold, New York,
optics. The gains of such antennas are directly 1943.
proportional to the areas of the apertures in 5. Harper, A. E. "Rhombic Antenna Design," D.
wavelengths modified by the illumination taper. Van Nostrand Co., Inc., New York, 1941.
The concept of aperture is carried directly to 6. Cottony, H. V., and Wilson, A. C., "Gains of
the broadside arrays of dipoles or other elemen- finite-size corner-reflector antennas," IRE Trans.
tary radiators. In the case of end-fire arrays and on Antennas and Propagation, AP-6(4), 366-
other antennas which depend upon length for 369 (October 1958).
directivity, such as Yagi antennas, the concept 7. Wilson, A. C., and Cottony, H. V., "Radiation
of aperture loses its applicability. Instead, equiv- patterns of finite-size corner reflector antennas,"
alent aperture is sometimes used. The gains of IRE Trans. on Antennas and Propagation, AP-8,
such antennas are roughly proportional to the 144-157 (March 1960).
square root of the antenna lengths expressed in 8. Hansen, R. C. "Fundamental limitations in
wavelengths. antennas," Proc. IEEE, 69(2), 170-182 (February
The preceding discussion of the relationship 1981).
between the size of the antenna and its gain is 9. Woodward, P. M., and Lawson, J. D., "The theo-
correct insofar as the practical applications are retical precision with which an arbitrary radiation
concerned. To be complete, however, one should pattern may be obtained with a source of finite
mention the concept of supergain or superdirec- size," Journal lEE, 95(Part III), 362-370 (Sept.
tivity . Woodward and Lawson 9 demonstrated 1948).
that , using arrays of dipoles, it is in theory 10. Uzkov, A. I., "An approach to the problem of
possible to realize much higher directivities than optimum directive antennae design," Comptes
those predicted by the aperture concept. To Rendus (Doklady) de l'Academie des Sciences
realize such directivities, special excitation of de I'URSS, 53(1),35-38 (1946).
elements must be employed . The excitation is 11. Bloch, A., Medhurst, R. G., and Pool, S. D.,
such that, while some elements radiate power, "A new approach to the design of super-directive
others absorb it, feeding it back to the elements aerial arrays," Proc. lEE, 100(Part III, No. 67),
that radiate . The gain then becomes a function 303-314 (Sept. 1953).
51 ANTIFERROMAGNETISM

12. Jasik, H. (Ed.), "Antenna Engineering Handbook," between adjacent ions as encountered in FER-
McGraw-Hill Book Co., New York, 1961. ROMAGNETISM). Superexchange results from
13. Kraus, J. D., "Antennas," McGraw-Hill Book Co., charge transfer (see BOND, CHEMICAL), and
New York, 1950. it is best illustrated by a typical example, say
14. Schelkunoff, S. A. and Friis, H. T., "Antennas: MnO.
Theory and Practice," John Wiley & Sons, New Ground and Excited States of (MnOMn)*
York,1952.
Mn*O--Mn* Mn+(+-)O-Mn*
Cross-references: ALTERNATING CURRENT; CIR- ground state excited state
CUITR Y ; PROPAGATION 0 F ELECTROMAGNETIC
WAVES; RESONANCE; WAVE MOTION. In the ground state, the purely ionic configura-
tion, there is no interaction between metal ions.
If, however, one of the two bonding electrons
of 0 -- is transferred to the Mn * at left, there
ANTIFERROMAGNETlSM
will be strong Hund's rule coupling within that
AnHferromagnetism is the most common form ion, and also the unpaired electron on O· can
of magnetic order. It is found in the majority of couple with the Mn* at right. Since the two
inorganic compounds of the transition metals, bonding electrons on 0-- have opposite spins,
rare earths and actinide elements; it is also found the overall interaction will appear as antiparal-
in Cr, Mn, Pt, Pd, and rare earth metals and al- leI exchange coupling (i.e. antiferromagnetic
loys, although the situation is rather more com- coupling) between the two Mn ions.
plicated in the case of metals, and the discussion The archetypal form of the Hamiltonian
here will focus mainly on insulators. However, function (the energy) which describes the mag-
during the last decade great interest has arisen netic properties of the exchange interaction is
in organic conductors, of which TTF-TCNQ is the Heisenberg exchange form
a typical example. These systems are strongly
anisotropic in lattice character, Le., are quasi- J(= LJijSi' Sj,
one-dimensional, and their magnetic character
is antiferromagnetic. A striking experimental where Si and Sj are the spin angular momentum
development since the mid-1960s has been the vectors (operators) of a pair of nearest-neighbor
synthesis of magnetic materials which have an ions. (Interactions between more distant neigh-
effective dimensionality D less than three. Such bors are much weaker and will largely be ignored
"low-dimensional" magnets are chemically in what follows.) Jij is the exchange integral
engineered by inserting large diamagnetic or- (exchange constant) between the spins Si and
ganic "spacer" molecules to minimize the Sj, and is a measure of the strength of the mag-
magnetic interactions in one or two directions netic interaction. For relatively simple systems,
in the crystaL Low-dimensional antiferromag- J may be calculated quantum-mechanically
nets show striking differences in behavior from from first principles. However, in general, J is
the familiar three-dimensionaiantiferromagnets, best regarded as a phenomenological constant,
such as MnF., because quantum effects (fluctu- to be inferred from experiment. The Heisenberg
ations) are particularly pronounced in such Hamiltonian can describe either ferromagnetism
systems. or antiferromagnetism. J is negative for ferro-
The principal feature of antiferromagnetism is magnets, since then parallel spin alignment is
the spontaneous antiparallel alignment of elec- energetically favored. Antiferromagnetism is
tron spins on neighboring magnetic ions, which characterized by J positive, which favors anti-
takes over from the paramagnetic state (where parallel order.
spin-spin interactions are essentially negligible) While the antiferromagnetic exchange interac-
as the temperature is lowered. The critical tem- tion produces antiparallel alignment of the spins,
perature of the antiferromagnetic phase transi- their direction with respect to the crystalline
tion is called the Nee! temperature (TN). The axes is a consequence of the magnetic anisot-
strength of the ordering interaction is character- ropy. There are three origins of anisotropy: (1)
ized by the magnitude of TN, which typically dipole-dipole interactions among the array of
ranges from below I K to above room tempera- ionic moments which gives anisotropy in all but
ture. Some antiferromagnets whose properties cubic symmetry; (2) Stark-effect interactions of
have been studied in detail are listed along with each single ion with the local crystalline electric
their Neel temperatures: (3D) NiO, 520 K; fields; and (3) anisotropic exchange, a result
Cr203, 310 K; MnF 2 , 67.4 K; (2D) K2NiF4, 97 of spin-orbit coupling and super-exchange be-
K; (lD CuCI •. 2(pyrh, (CPC) 1.1 K; [(CH3)4N] tween excited orbital states. The latter two
Mn Cl 3 (TMMC), 0.8 K. (Note: (pyr) denotes mechanisms are important for non-S-state ions,
large organic pyridine complexes.) especially in crystals of low symmetry. Crystal
The ordering interaction between neighboring field effects are represented by including a
metal ionic spins in an insulator is called super- term of the type D 1:i(S1)2 in the Hamiltonian,
exchange, since it takes place via an intervening where S1 is the z-component of the spin angular
anion, 0, F, S, CI, etc. (cf. direct exchange momentum (Si) of the ith ion. This term gives
ANTIFERROMAGNETISM 52

rise to spin-anisotropy such that when D < 0


H
easy-axis anisotropy is favored, and when D<O
easy-plane anisotropy is favored. For aniso-
tropic exchange in real systems the Hamiltonian
may effectively be written:
tt
J(= L (Ji/S/S/ + Ji/S/S/ +Ji/S/S/).
ij

If JZ > JX ':::!. JY uniaxial anisotropy is present,


and when JZ <Jx c:::.JY easy-plane anisotropy is
present. The extreme limit of uniaxial anisotropy
occurs when JX =JY =0, and is called the Ising
model of magnetism. The extreme limit of easy-
plane anisotropy is when JZ = 0, and this is
called the XY model. When JZ =JX =JY, the
isotropic Heisenberg exchange model is re-
H,
covered.
The effects of an applied magnetic field on
the antiferromagnet are represe-nted by the T
addition to the Hamiltonian of a Zeeman energy
term -g[..LBH~ S/, where g is the Landeg-factor
and [..LB is the Bohr magnet on. The phase dia-
gram of an Ising antiferromagnet as shown in
Fig. 1. In the H-T plane, the antiferromagneti-
cally ordered phase is enclosed by a second-
order phase boundary. Note that antiparallel
order is destroyed both by increasing temper-
ature and by increasing magnetic field. The
critical field, He, is reached when the Zeeman
energy tending to align a pair of spins parallel FIG. 1. Phase diagram (schematic) of an ideal classi-
balances the antiferromagnetic exchange energy. cal antiferromagnet. T is temperature, H is direct field,
At T = 0, antiferromagnetic order exists for and Hs is staggered field. The arrows suggest the pre-
H < He. while paramagnetic disorder occurs dominant pattern of spin ordering.
for H> He. He is therefore a special point. For
the pure Ising model, to which Fig. I applies,
nonphysical effects occur at He. There is a
nonzero entropy at T =0 and H = He, which to measure in the case of a single crystal sample.
can be calculated exactly in ID and 2D, and One is the susceptibility measured with the
approximately in 3D. This implies a violation field parallel to the easy-axis (easy-plane),
of the third law of thermodynamics and re- called XII, and the other, Xl, measured with the
flects the somewhat artificial, or classical, field in a direction perpendicular to the easy-axis
character of the Ising model. Inclusion of terms (easy-plane). In a truly spin-isotropic, Heisen-
such that JX or JY -=1= 0 eliminates this defect. berg, system XII and Xl are equivalent above Tn.
Also shown in Fig. I is a different kind of This is not true in general, however. When any
field, namely a staggered field, Hs , which re- type of anisotropy is present XII and Xl are
verses sign at every successive spin site. This is different. Mean-field theory is a simple, approx-
the so-called ordering field for an antiferromag- imate calculational approach applicable to both
net. For example, an antiferromagnet in a stag- ferromagnets and antiferromagnets. Further, it
gered field shows phase behavior similar to a is believed to describe phase behavior correctly
ferromagnet in a direct (ordinary) applied for magnetic systems in dimensions D ;;:. 4, Fig. 2
field. It might seem that a staggered field is an shows the susceptibilities of an antiferromagnet,
artificial concept, not realizable in the labora- in the limit of a vanishing field, in the mean-
tory, which is generally true. However, in the field approximation. Both XII and Xl vanish at
case of the complex, six-sublattice-structured high temperatures where spin-spin interactions
antiferromagnet dysprosium aluminum garnet become negligible. XII then exhibits a maxi-
(DAG), application of a direct field in general mum at the transition temperature TN and sub-
results in the appearance, in addition, of an sequently vanishes as T~ O. Xl by contrast,
"internal" staggered field. goes to a nonzero value as T~ O. There is a
The obvious magnetic parameter for experi- vanishing total magnetization as the anti-
mental measurement is the susceptibility, de- parallel alignment of the spins becomes in-
noted X, which is effectively a measure of the creasingly complete.
response of the spins to an external applied It is interesting to observe how Fig. 2 is modi-
field. In general, there are two susceptibilities fied for real antiferromagnets with D < 4. This
53 ANTIFERROMAGNETISM

x antiferromagnets, 50-70% for 2D antiferro-


magnets, and 90% or more for good ID anti-
ferromagnets. Strictly , TN = 0 for an ideal lD
antiferromagnet with short-range interactions.
However, weak residual 3D interactions allow
3D ordering to take place at appropriately low
temperatures.
Antiferromagnets show very rich and com-
plex phase behavior when competing interac-
tions are present, and/or as a function of the
degree of anisotropy. A special class of Ising-
like antiferromagnets are called meta magnets ,
and their H-T phase diagram is shown in Fig.
4, which resembles Fig. 1, except now a high-
field portion of the antiferro-paramagnetic
phase boundary has become first order. The
~~---c---L---------------------T point at which the boundary changes character,
T*, is a special kind of critical point called a
tricritical point. At a normal critical point,
FIG. 2. The susceptibility (schematic) of an antifer- or along a line of critical points, divergences
romagnet in zero applied field, as a function of tem- occur in the thermodynamic properties which
perature, in the mean-field approximation, xII and Xl are characterized by so-called critical expo-
are as explained in text. xp is the powder susceptibility nents. The critical exponents at a tricritical
which is the weighted average of XII and Xl. point are markedly different from those at
an ordinary second-order critical point. Figure
5, which is an extended version of Fig. 4
is now well known due to exact and accurate including one additional parameter, the stag-
approximate calculations in lD, 2D, and 3D, gered field Hs, illustrates the phase behavior
and is shown schematically in Fig. 3. Both sus- of a metamagnet in a more illuminating way.
ceptibilities pass through a rounded maximum The first-order boundary of Fig. 4 now appears
T max, which is not the critical temperature TN, as a line of triple points where three first order
as the temperature is lowered. In zero-field TN
is manifested as a vertical tangent in both XII
and Xl on the low-temperature side of T max'
The difference A = T max - TN is a measure of H
dimensionality, increasing as D decreases. Typi-
cally, A is about 8% of Tmax or less for 3D
tt
X

He - - .............
" "- .....
..... ,
t l

~----~~--~~------------------+T

Do ~-----------------------J--------·T
FIG. 3. The susceptibility (schematic) in zero field
as a function of temperature according to non-mean- FIG. 4. The H-T phase diagram (schematic) of an
field theories. The difference between TN and T max is Ising-like metamagnet. The hatched portion of the
a function of dimensionality. TN corresponds to a phase boundary is first order and T* denotes the tri-
vertical tangent in xii and Xl· critical point.
ANTIFERROMAGNETISM S4

H H

tt p

AF
H,
T
~----------------------~------·T
FIG. 5. The extended phase diagram of a metamag-
net in terms of parameters T, H, and staggered field H $'
The shaded areas denote fust-order two-phase coexis- FIG. 6. H-T phase diagram of a spin-flop antiferro-
tence surfaces. magnet in a field parallel to the easy axis. The low-
field phase region has antiparallel aligned spins along
the easy axis. The high-field phase region has "flopped"
phase boundary surfaces meet. The tricritical spins, aligned at an angle to the field. The boundary
point T* is the point at which three critical between the phases is fust order. B is the bicritical
phase boundary lines meet. Examples of meta- point. The dashed lines illustrate the phase boundaries
magnetic systems are ferromagnetic layers with in the neighborhood of the bicritical point in the mean-
antiferromagnetic interlayer coupling, and sys- field approximation.
tems with nearest-neighbor antiferromagnetic
coupling and weak next-nearest-neighbor ferro-
magnetic coupling. The phenomenon of meta- type have indicated that the two second-order
magnetism was first observed experimentally in lines meet the first-order line tangentially. This
FeCI 2 , which is an example of the layer-type effect is difficult to observe in 3D antiferro-
system. DAG is also a well known metamagnet. magnets, but is very prominent in quasi-ID
Another interesting class of antiferromagnets, antiferromagnets on account of the low dimen-
characterized by rather small anisotropy, con- sionality. The effect has been experimentally
sists of the spin-flop antiferromagnets. The H-T confirmed in both 3D and ID antiferromagnets.
phase diagram of a spin-flop system is shown in Examples are MnF2 and TMMC, respectively.
Fig. 6. Two distinct ordered phases appear. In An interesting recent development concerns
addition to the regular antiferromagnetic phase, phase transitions in 3D antiferromagnets of
a new phase appears at high fields. In this phase, basic lattice structure fcc (or bee). The low-
in a classicial vector-spin picture, the spins tend temperature ordered phases of these multi-
to align themselves perpendicular to the applied sublattice magnetic systems can be fairly com-
field on crossing the boundary line, which is plex in terms of spin-ordering arrangements,
first order, ehnce the name spin-flop. This sys- characterized by doublings of the magnetic
tem has two critical fields at T = 0, HI (the spin- unit cell relative to the crystal unit cell as the
flop field) and H 2 • Two lines of critical points system goes from a paramagnetic state into an
meet the first-order line at a special point ordered phase as the temperature is lowered.
B. B is a second example of a special type Examples are: Type I antiferromagnets such
of critical point (these special points are gen- as U0 2 ; Type II antiferromagnets such as
erally called multicritical points). B is called TbSb, MnO, NiO; and Type III antiferromag-
a bicritical point; from Fig. 6 it may be ob- nets such as K2 IrCI 6 . An order parameter of
served that two critical lines meet at B. When a unusual complexity is required to describe
staggered field is included, the phase structure such systems, and experimental testing con-
has several interesting features, but is too com- tinues of calculations based on group theoreti-
plicated to describe here. It should be noted cal predictions of Landau and Lifshitz com-
that early calculations of mean-field type bined with the currently very popular technique
showed the three phase boundaries meeting at of renormalization group analysis.
an angle. Modern calculations of non-mean-field Other current areas of investigation where
55 ANTIPARTICLES

antiferromagnetic exchange interactions are 5. Stryjewski, E., and Giordano, N., "Metamagnetism,"
important are Ising systems showing Lifshitz Advances in Physics, 26,487 (1977).
points, multiphase points, and a sequence of
modulated phases. This theoretical work has Cross-references: BOND, CHEMICAL; CRITICAL
been stimulated by recent magnetization and PHENOMENA; FERRIMAGNETISM; FERROMAG-
neutron-scattering experiments on rare-earth NETISM; MAGNETISM; MAGNETOMETRY; PARA-
pnictides, particularly CeSb. MAGNETISM.
Spin-glasses are under intensive investigation at
this time. They are commonly regarded as sys-
tems with random ferromagnetic and anti ferro-
magnetic interactions, resulting ultimately from ANTIPARTICLES
a modulated, long-range RKKY interaction.
Spin-glasses exhibit many curious experimental I. Overview: Particles and Quanta. Classifica-
features, but despite much theoretical effort no tion by Statistical Behavior and by Behavior
clear and definitive microscopic explanation has with Respect to the Four Fundamental Inter-
yet emerged. Systems which have features in actions One of the great discoveries of modern
common with random spin-glasses, but where physics is that for every type of elementary
enough regularities remain to render the system entity of matter and radiation-"particle"-
exactly solvable in a number of cases, are called there exists a corresponding conjugate type of
spin-frustrated systems. A typical example is entity - "antiparticle." In the antiparticle certain
the Ising model on a triangular lattice with of the particle-defining properties are identical-
nearest-neighbor interactions. The condition "conjugation-invariant" -and others-"conjuga-
for antiparal1el spin alignment cannot be tion-reversing" -are reversed in sign. The
satisfied around a triangular unit of the lattice. reversed sign in a conjugation-reversing property
Hence the origin of the term spin frustration. allows one to maintain a conservation law for
Spin dynamics is another area where new that property in the dramatic processes of pair
theoretical breakthroughs are being made, and creation and pair annihilation in which an anti-
where antiferromagnets play a major role. Ap- particle is observed to appear and disappear
proximate analytic and computer-based classical together with the particle to which it is conju-
calculations are now available in 3D. By con- gate. In those cases where all the conjugation-
trast, in 1D spin ! antiferromagnets, many reversing properties occur with zero values, the
striking quantum mechanical features have been antiparticle is identical with the particle. The
observed, enhanced by the low dimensionality. progressive recognition of the existence of anti-
These developments have led to advances in particles was initiated by Dirac's relativistic
nuclear magnetic resonance experimental tech- anti-electron theory in 1931, and by Anderson's
niques, with corresponding insight into the de- independent experimental discovery of the
tails of charge distributions around ions. In anti-electron (positron) in 1932.
addition to susceptibility and magnetic reso- A simple descriptive and inductive introduc-
nance, an important experimental technique tion to the fact of existence of antiparticles,
in antiferromagnetism is neutron scattering, and conceptualizations of it, is given in the arti-
which reveals details of the spin structure array cle entitled ANTIPARTICLES in the first edition
and the effective spin dimensionality D, and of this encyclopedia, and in the articles in the
probes the nature of the energy excitations. present edition on FIELD THEORY and ELE-
MENTARY PARTICLES. An enumeration of
JILL C. BONNER known particles and antiparticles up to 1983
will be found in the latter. Here we attempt to
convey the most general theoretical setting in
References which the existence of antiparticles, and sym-
metry of conjugation between particles and
1. Mattis, D. C., "The Theory of Magnetism I: Statics antiparticles, may be established. At first a
and Dynamics," Springer-Verlag, Berlin-Heidelberg- brief overview will be given and then a more de-
New York, 1981. tailed systematic discussion. Some of the tech-
2. Bonner, J. C., and Muller, G., "Low Dimensional nical terms used are defined in the sequel.
Magnetism," Oxford University Press (forthcom- A major classification of all particle types is
ing). according to their statistics or "social behavior."
3. de Jongh, L. J., and Miedema, A. R., "Experiments This is based on symmetry or antisymmetry of
on Simple Model Systems," Advances in Physics, the mathematical function describing an identi-
23, 1 (1974). 1. J. de Jongh, "Some Recent Ex- cal multi particle state (i.e., a state with two or
periments on Quasi 1- or 2-Dimensional Magnetic more particles of the same kind) under inter-
Systems," in "Recent Developments in Condensed change in any pair. Those particle types for
Matter Physics," J. T. Devreese (Ed.), Vol. 1, p. which there is symmetry under intra pair ex-
343, Plenum, New York, 1981. change are bosons; those for which there is
4. Nagle,1. F., and Bonner, J. C., "Phase Transitions- antisymmetry are fermions, which fall into the
Beyond the Simple Ising Model," Annual Review two subtypes: quarks and leptons.
of Physical Chemistry, 27, 291 (1976). Cutting across the boson-fermion classification
ANTIPARTICLES S6

of particle types according to their statistical composite baryons (quark triplets) with baric
behavior are other classifications based on their charge I and the mesons (quark-antiquark pairs)
interaction behavior or "charges." This refers with baric charge O. The leptons also have baric
to their participation or neutrality in the four charge O. The mesons, however, in contrast to
fundamental physical interactions found in the leptons, do interact hadronically, which is
nature: hadronic, electromagnetic, weak, and to be expected from their dipolar structure with
gravitational. Formerly these were considered respect to baric charge. The leptons are elemen-
to be distinct, as once upon a time, the magnetic tary, have no quark content and therefore no
interaction and the electrical interaction were hadronic interactions. The leptons in turn dis-
considered to be distinct only to be united later play what seem to be other types of conserved
in the one electromagnetic interaction. Similarly, charges. There appear to be three types, tau-
at the levels of energy of experiments in which leptic charge (for tauon and its neutrino),
they have been observed until now, the four mu-Ieptic charge (for muon and its neutrino) and
interactions are distinguished by markedly dif- e-Ieptic charge (for electron and its neutrino).
ferent intrinsic strengths (couplinl!: constants), Table 1 diagrams these classifications.
roughly in the ratio I: 10- 2 : 10- T4 : 10- 39 , as The most fundamental theory that we have of
well as different symmetries. Only the first source particles and their interactions is Quan-
three have hitherto played an important role in tum Field Theory (QFT). The central concept
quantum particle physics. Particles subject to in the classical ancestor of this theory is that of
hadronic interactions are called hadrons, the the force field, in which the effects of action by
elementary ones, out of which all others are a source at anyone point are propagated contin-
composed being called quarks. Composite uously to other points via an intermediary
fermionic hadrons are called baryons; all other influence, i.e., not by "action at a distance" but
hadrons are mesons. It is found that baryons by local action from point to point. The quan-
all carry a nonvanishing baric charge B which tum interaction field retains some of these
is very nearly, perhaps absolutely, conserved; features, but in modified form.
i.e., up to a very high probability-perhaps The quantum modification of the classical
certainty-the total baric charge of the compo- field concept of propagation of influence is a
nent entities after a reaction equals the total picture of emission and absorption of inter-
baric charge of the component entities before mediary entities or excitations-the interaction-
a reaction. At both ends of the reaction the quanta-which are exchanged between source-
total baric charge is computed by simple addi- particles and which can also exist free. The
tion. The baric charge of the elementary conception of these intermediary excitations
hadrons-the quarks-out of which all other first appeared explicitly and with incisive success
hadrons are constructed is ~, and the baric for phenomenological description and predic-
charge of antiquarks is - !. The only fundamen- tion in Einstein's 1905 work on light quanta or
tal hadrons directly observed in nature are the photons (for which, together with his "work in

TABLE 1. CLASSIFICATION OF PARTICLE TYPES ACCORDING TO SYMMETRY


UNDER PERMUTATION OF IDENTICAL PARTICLES (STATISTICS)

Bosons Fermions

Baric
'"
I': Charge 0 Mesons
0
.; Baryons
01
::r: Baric (nucleons, hyperons,
Charge *- 0 baryon resonances, ... )

Baric Photon
and
Leptic
Charge 0 Weak-Interaction Boson (?)

Tau-Leptons
'" (taus and tau-neutrines)
...
I':
0
$:I.
Leptic
Charge *- 0
Mu-Leptons
(muons and mu-neutrinos)
'"
....l e-Leptons
(electrons and e-neutrinos)

Graviton (?)
57 ANTIPARTICLES

theoretical physics," he was awarded the Nobel spacetime symmetry types, are mass and spin.
prize in 1921, with no mention of the relativity Besides being of a given spacetime or external
theories!). The similar intermediary quanta that symmetry type, each source-particle also is of
have been established since that time in theory an internal symmetry type-one of those which
for the other fundamental interactions, are serve as representations of an internal symmetry
interpreted as excitations of fields whose poten- group. Corresponding type attributes are charges
tial is distributed over spacetime. Thus we now of various kinds-hadronic, electric, and weak,
speak of gluon fields, and intermediate vector which play a role in the interaction of the
boson fields, along with the photon field. various types of particles with the corresponding
In QFT, source-particles are also interpreted forces. Interaction-quanta possess similar sym-
as excitations of the modes of a field-the metry attributes.
appropriate source-particle-field. Upon inter- The electromagnetic interaction is perhaps
acting, a given source field and a given inter- the simplest. Its internal symmetry group is
action-quanta field can excite or de-excite UI , the group of all phase factors, i.e., complex
each other. The excitations which make up our numbers of unit magnitude; the corresponding
world of substance "sources" are traditionally attribute is electric charge. The electromagnetic
recognized as systems of one or more "particles;" interaction operates only between electri-
this corresponds to the fact that whenever we cally charged or magnetically active particles,
have a "pure" excitation of the source field, exchanging energy and momentum. It does not,
i.e., such as occurs in one and only one field- however, transfer electric charge, i.e., it is itself
mode, it always comprises an exact whole num- neutral. Its intrinsic strength, 1/137, is labeled
ber of elemental excitations identical with each (te.
other. (Similarly for "quanta" in an interac- The situation is quite different with the
tion field.) This exact whole-number multiplic- hadronic and weak interactions. The hadronic
ity is already a sufficient basis for calling the force, and likewise the weak force-in contrast
theory "quantum." For such an elementary ex- to the long-range electromagnetic and gravita-
citation, we shall reserve the term "particle" in tional forces-both are "short-range" forces.
the case of source fields ("quantum" in the case This means that the region of appreciable
of interaction fields) or, more technically, "one- manifestation of the hadronic force is limited
particle" (one-quantum) state. The term "exci- to a very small volume around the source parti-
tation," though not in common technical use, is cles, basically a quark, from which the force
really more appropriate than "particle" or emanates. This force acts only on other quarks
"quantum" because "excitation" connotes as- or quark structures, but not on leptons.
pects of wave or vibrational patterns, as well as Both quarks and their intermediary hadronic
the wave property that all elementary excita- interactions-"gluon interactions" -are classified
tions of a given type are indistinguishable from according to the symmetry types of an exact
each other. SU3 C group, the indicated internal symmetry
In summary: The hadronic interaction-also group of the hadronic interactions. The super-
called "strong" or "nuclear" interaction-acts script c stands for "color," a label which takes
fundamentally between the members of one of on the three primary color values R (red), G
the two basic families of elementary source- (green), and B (blue), each symbolic of a par-
particles. The members of this basic hadronically ticular kind of hadronic charge. Antiquarks come
interacting family have been named "quarks." with the complementary three "anticolors" or
The other basic family of source-particles, anticharges. Because of the exact symmetry,
which have no hadronic interactions, are called the intrinsic strength of all the color charges is
"leptons." The term "source-particles" is used the same.
here for the members of these two families to Besides the color charge classification of
contrast them with other fundamental massive quarks there are two other type classifications
and radiation entities which transmit interac- found empirically that are not given by the
tions; these interaction-transmitting entities we exact SU 3 C group, but which provide an approxi-
shall designate "interaction-quanta." Among mate symmetry grouping. There are six "flavors"
these are the "photon" and "weak-interaction divided into three families." The three families,
vector bosons" and the "gluons" which transmit each comprising two flavors, are: (1) up (u),
the hadronic interaction between both quarks down (d); (2) charm (c), strange (s); (3) top (t),
and leptons. Tables 1 and 2 diagram these bottom (b).
classifications. The quark color classification corresponds to
Within the two particle families-quarks and the first nontrivial symmetry-type, triplet (3) of
leptons-there are many finer distinctions. In S U3 c. Its threefoldness is represented by R, G,
our subsequent discussion we shall see that any and B. There is a corresponding antitripleL (1)
specific type of source-particle has attributes with the three complementary anticolors R, G,
that come from its belonging to a corresponding and 11 for the antiquarks. Thus, combination
space-time symmetry type, one of the "irreduc- of three quarks having each of the three colors
ible representations" of the symmetry group of makes a "white" particle (baryon), or combina-
spacetime. Among these attributes, to which tion of a quark of a certain color and its anti-
different values are assigned in the different quark makes another type of "white" particle
TABLE 2. CLASSIFICATION OF PARTICLE-TYPE ACCORDING TO SYMMETRY UNDER PERMUTATION OF IDENTICAL PARTICLES (STATISTICS) AND THEIR ROLE AS
INTERMEDIARY QUANTA OR PARTICLES IN THE STANDARD MODEL

(SU 3 )strong X (SU 2 X Udweak,

WHERE WITHIN EACH WEAK MULTIPLET EXCESS ELECTRIC CHARGE == WEAK CHARGE, I.E . , Q - Qav == R 3 •
Particles presently considered to be composite bound states of the more elementary QUARKS and LEPTONS are written in lower-case type. The question marks in parentheses
indicate "not yet verified experimentally."

BOSONS (intermediary quanta and qtj structures) FERMIONS (matter particles)


(a ~ R , G, B)
HADRONICALL Y QUARKSq~ (r ~ U, d; c, $; t, b)
INTERACTING (baryons hyperons, baryon resonances, . .. )
Baric Charge B '" 0

Baric Charge B ~ 0 HADRONIC (STRONG) Mesons


tIo INTERMEDIAR Y GLUONS
.. QUANTA G~ (a,~~R , G,B)

ELECTROWEAKL Y QUARKS LEPTONS


INTERACTING
SV2 X VI Singlets PHOTON
Weak Charge R3 ~ 0 A
NEUTRAL WEAK-INTERACTION BOSON (?): D UP and DOWN QUARK: uD, dD D ELECTRON: eb
ZO D STRANGE and CHARM QUARK: sD, CD DMUON: p.15
ELECTRO-WEAK D TOP (?) and BOTTOM QUARK: tD, bD D TAUON: T15
INTERMEDIARY
SV 2 Doublets QUANTA CHARGED WEAK-INTERACTION BOSONS (?) : L UP-DOWN QUARKS: (~) L L NEUTRINO e- e- : (~qL
Weak Charge R3 '" 0 W· L STRANGE-CHARM QUARKS: (~) L L NEUTRINO p.- p.-: ("/1)L
W- L TOP O)-BOTTOM QUARKS: (t) L L NEUTRINO T- T-: (~)L

GRAVITATlONALL Y GRAVITON (?):


INTERACTING r
59 ANTIPARTICLES

(meson). Strong interactions between the various Both sets of type-observables are associated
quarks depend only on their colors and are with exact symmetries of the particle types and
governed by an exact unbroken 8 U 3 C gauge their interactions. What a "symmetry," or
group, implying eight massless gluons, which "symmetry operator," is will be defined more
mediate the quark interactions. All structures precisely later; here we shall designate the
are governed by the following fundamental aspects of nature to which it applies, and indi-
rule: While various flavored quarks can be cate the meaning of the symmetry concept, by
found mixed in various proportions, only the following statement: Nature lends itself to
"white" color singlets are found in independent a classification into two kinds of aspects, that
particle entities. of systems and that of natural processes. A na-
It is this rule which summarizes in a formal tural process (e.g., motion in an arbitrary elec-
manner the phenomenon of "quark confine- tromagnetic field) is generally considered to be
ment" which one requires in the current theory a more fundamental feature than a particular
of strong interactions that has become known system (e.g., the earth), since the process is taken
as "quantum chromo dynamics. " As already to correspond with a law of nature whereas a
stated, the hadronic force does not operate on particular system corresponds to the more
the members of the other set of elementary special features which we label side conditions.
source-particles found in nature, the leptons; (There is a blurring of the distinction in micro-
in other words, leptons have no hadronic (or physics where the properties-to a certain extent
gluon) charges. Also, leptons, unlike quarks, are unexplained-of isolated microphysical systems
actually observed in a state of freedom-a fact appear to be instances of laws of nature rather
which suggests that the fundamental hadronic than of side conditions; this goes, of course,
interquark (gluon) interaction is necessary for with the fact that these isolated microphysical
the "total confinement" which seems to be systems are found in vast numbers of identical
operating for quarks. The phrase "total confine- replicas: "elementary particles.") Symmetry
ment" signifies that only 3-quark and quark- operators are applicable to either side condi-
antiquark composites have been observed- tions or laws of nature or, alternatively, to
their quark content being inferred, however, either systems or natural processes: A symmetry
from compelling evidence. This suggestion operator of a system or process is a possible
has been incorporated in the quantum chromo- change which leaves certain overall relevant
dynamic field theory, where the fundamental characteristics of the system or process unaltered
hadronic force is comprised of the octet of (a possible change is one that can be actually,
gluon interactions. The mediating gluons are or in imagination, carried out on the system or
endowed with color and are therefore self- process). If we are looking at nature as a whole,
interacting, thereby producing antiscreening be- and are concerned principally about processes,
tween source particles. This property, if it the relevant characteristics to be preserved are
could be proved to increase without limit with the dynamics of transitions between arbitrary
distance, would lead to total confinement. initial conditions and the possible final condi-
Residually, the hadronic force acts also tions in all spontaneous natural processes. In
between the 3-quark composites which we classical physics, this dynamics is given by
recognize as nucleons, neutrons and protons, differential equations and it is the form of these
which constitute units out of which nuclei of and certain parameters appearing in the equa-
atoms are constructed. This force is, however, tions which are to be preserved; in the truer-to-
relatively (only relatively!) small compared to nature quantum physics the dynamics may be
the interquark force at the same distances. It expressed even more directly by an array (8-
is analogous to the Van der Waals forces between matrix) of transition probabilities each of which
electrically neutral atoms. is to be preserved individually. The additional
II. Antiparticle Existence Principle: Sym- invariant parameters here are such quantities
metry, Conjugation As we have already indi- as the mass and spin of the system (see next
cated, it is one of the great theoretica( and paragraph).
empirical discoveries of modern physics that The symmetry operators which preserve the
there is a general duality in nature in that to dynamics are, first of all, those which corre-
every particle type there exists a mutually spond to the homogeneity (invariance under
annihilating "antiparticle" type with certain translations) and isotropy (invariance under
exactly identical and certain other exactly space-space and space-time rotations) of space-
reversed defining properties. The exact duality time in regions small enough so that the effects
of particle and antiparticle serves to classify the of gravitation may be neglected, i.e., over re-
two sets, that of exactly identical and that of gions small compared with the reciprocal of the
exactly reversing properties. Among the exactly local value of the acceleration of gravity-in
identical properties are the mass and spin, and units in which the speed of light is 1 and all
among the exactly reversing properties are all speeds are dimensionless ratios to this speed.
the fundamental charges. The mem bers of both (For all moderate gravitational fields these are
sets will be called type-observables and the very large regions, e.g., for an acceleration equal
following discussion will be concerned with to the earth's surface gravity the linear dimen-
specifying them. sion of the limiting region is "'" 10 18 cm "'" 1
ANTIPARTICLES 60

light year.) This set of homogeneity and iso- cle types) may be called the "antiparticle
tropy operators characterizes the special rela- existence principle." Later we shall discuss the
tivity theory of spacetime and is called the derivation of this principle as a theorem in the
Poincare group. Thus, besides the fundamental context of other more general principles. We
conjugation-reversing interaction charges, the shall review the general basis that has been
principal symmetry-related observables which proposed for it in quantum field theory-which
define particles and antiparticles are those adds the principle of locality to general algebraic
which result from the continuous spacetime quantum theory-and inS-matrix theory-which
symmetries which express homogeneity and is a stripped-down and occasionally extrapolated
isotropy: The invariance of natural evolution version of quantum field theory.
processes under an arbitrary spacetime transla- The validity of the antiparticle principle or
tion (homogeneity) implies existence of con- theorem means that there exists an operator f)
served four-component energy-momentum f; which divides all particle type-observables into
the invariance under an arbitrary spacetime ro- two sets and thereby ranges existing particle
tation (isotropy) implies existence of conserved and antiparticle types in one-to-one correspon-
six-component angular momentum J - (M, lif). dence. We call this operator particle conjugation
The conjugation-invariant type-observables here and we may restate the antiparticle existence
are mass m (with which may be associated principle: There exists a parJicle conjugation
phenomenologically, as an imaginary part, life- operator f), and a division of symmetry-related
time T) and magnitude of spin-intrinsic angular observables ("type-observables") in two sets,
momentum-so Both mass m and spin-magnitude such that f) acting on the type-observables of
s are spacetime scalars or "invariants," which any existing elementary entity of matter and
are algebraic functions of f and}; by saying radiation- "particle" -gives the type-observables
that they are invariants, we mean that they are for an existing conjugate entity- "antiparticle"
left unchanged by any spacetime translation or -with the first set of observables identical-
rotation. (Actually, as we shall see, it is natural "conjugation-invariant"-and second set re-
to require that they be invariants-but not versed - "conjugation-reversing" -in sign. Fur-
necessarily functions-of every symmetry op- ther, f) is an "involution "; acting on the type-
eration.) Because two translations commute observables of antiparticles it restores the values
(give the same result if performed in alternate for the corresponding particles. Thus the opera-
order) there is no theoretical restriction on the tor EJ and the identity, together, make up a
possible values of the mass, though empirically two-member "particle-conjugation group, " with
such restrictions exist. Because rotations are the conjugation-invariant and conjugation-re-
noncom muting in a characteristic manner, the versing type-observables belonging respectively
theoretically allowed values of spin are integral to the even and odd symmetry types ("irreduc-
or half-integral multiples of an elementary unit, ible representations") of the group.
as observed. In short, the theory of (linear
operator) representations of the relativity sym- What "acting on" an observable means for-
metry group (Poincare group) prescribes mally will be defined later.
clearly the existence of the particle-characteriz- I t is to be emphasized that the exact doubling,
ing properties of spin and mass but only partially and no more than doubling, as between particle
explains their values: (1) The specific range of and antiparticle types, is in contrast to the
observed spin values (all half-integral or inte- approximate higher multiplicities found in
gral multiples of a fundamental unit) is fully nature. Such are the pion triplet and N*
explained; the allocations to specific types of quadruplet of electric charge values, which
particles are partly explained by the general respectively fit into the larger multiplicities of
spin-statistics theorem (see following). (2) The the meson charge-hypercharge octet and the
specific range of observed masses is not funda- baryon charge-hypercharge decuplet; these are
ascribed to representations of higher algebraic
mentally interpreted; however, a partial inter- groups of approximate symmetry, which will
pretation of the mass differences among the be discussed at a later point.
strongly interacting particles in a multiplet and The conjugation exists not only for free
supermultiplet in terms of an approximate particles and antiparticles but also for inter-
algebraiC internal symmetry group, SU3 , does acting systems of particles and antiparticles.
exist. This larger content of the antiparticle existence
The occurrence of the duplicity of particle principle may be stated as follows: The con-
types with the same conjugation-invariant prop-
erties (like mass and spin) but opposite conjuga- jugation EJ acting on any type-observable of
tion-reversing properties (like electric and baric any possible system of particles and anti-
charge) has been much discussed. In general each particles gives the observable of a corresponding
member of the double is called the "antiparticle" possible system of antiparticles and particles.
of the other, and the basic fact that all particle Again, in the following we shall discuss the
types may be ranged in doubles (allowing for derivation of this extended principle as a
cases of identity between particle and antiparti- theorem - in the usual interpretation of EJ known
61 ANTIPARTICLES

as the "CPT theorem"-in quantum field theory tive charges are all reversed. Except in those
and S-matrix theory. "totally neutral" cases where these charge
III. Superselection and the Associated values are all zero, the particle type can be
Charges; Conjugation Uniqueness Principle; Par- distinguished uniquely, under all circumstances,
tial Conjugation Operators Besides the anti- from its antiparticle type by a reversal of sign
particle existence principle there appears to be in all charge values. How is it that when more
another principle relating to the uniqueness of than one charge occurs with a nonzero value,
particle-antiparticle conjugates which, as far as they all reverse together for the antiparticle
we know, has not been emphasized in the litera- type? Thus, for example, there is a unique anti-
ture. This may be called the principle of unique proton, negative electrically (Q = -1) and with
superselective conjugation. We now describe it. negative baric number (B = -1); there does not
Among the type-observables- the basic ob- exist a negative baryon (Q = -1, B = +1) or a
servable properties used to define particle positive anti baryon with protonic mass and
types-it is an empirical fact that there are some spin. The striking general fact that there is exact
which are simultaneously sharply measurable doubling-with whatever occurring nonzero B,
with all other observables, and yet are not Q, Le or LIJ.' and LT reversing simultaneously-
trivial "constant observables" - they are not seems to be of special significance.
trivial in that they exhibit different values for In our discussion we asserted that the spin
different systems. These "superselection ob- [helicity 1, and therefore the valence, remains
servables" which we discuss in detail later are unchanged under conjugation while the super-
principally the "superseleciion charges" baric B, selective charges undergo the required reversal.
electric Q, mu-leptic LIJ., electron-leptic Le, and A priori there were two possibilities, of which
tau-leptic LT (the latter three are included with this was one. The other would have been that
some reservations in that their empirical basis is there exist mutually annihilating particle-anti-
not as firm as that for Band Q). B, Q, and (L e , particle conjugates which have the spin (helicity)
L IJ., L T ) are related respectively to the hadronic, integral for one and half-integral for the other;
electromagnetic, and weak interactions. Besides the two members of such a doublet would
these, by virtue of the Poincare-group space- exhibit the same, or equal opposite, Qaric,
time symmetry there exists another superselec- electric, or leptic charge but opposite valence.
tion observable; this is the valence index (-1 ) 28 3 Because of conservation of angular momentum,
[for massless particles (-1)n 1 defined by the upon annihilation an odd number of half-integral
observable component S3 of spin [or, for mass- spin particles would appear. Such a situation
less particles, the helicity A1 defined in the seems to be in marked contrast to what we ob-
following. The empirical evidence for the super- serve in nature where, for example, any anti-
selection nature of all these quantities is the meson and its meson (both hadrons with zero
total absence of interference terms between baric charge) have the same integral spin, where-
systems with different values of any of the as an antibaryon and its baryon have the same
charges or of the valence index. (The superselec- half-integral spin.
tion of the valence index can be correlated The principle involved may be stated as fol-
theoretically with the fact that the continuous lows: For reasons as yet unknown, there exist
mathematical function describing half-integral the three fundamental quantum-relevant inter-
spin [helicity 1 particles requires two whole actions: hadronic, electromagnetic, weak, with
rotations to repeat, in contrast with the contin- associated superselective charges: baric B, elec-
uous function describing integral spin [helicity 1 tric Q, tau-Ieptic L T , mu-Ieptic LIJ. and e-Ieptlc
particles, which is periodic with respect to a Le. The principle of unique superselective con-
single whole rotation about a point; thus no jugation-an empirically verifiable (or, in prin-
coherent linear combination of the two mathe- ciple refutable) assertion-states: The multipli-
matical functions-i.e., with a definite complex- cation of particle types by reversals of sign of
number phase on one function with respect to superselective charges is an exact doubling; i.e.,
the other-is consistent, and physically this there is a simultaneous reversal of sign of all su-
means: no interference phenomena.) perselective charges, whereas the type-observ-
Besides spin and mass, the numerical values abies associated with the Poincare group-mass,
of all the "superselectables" (superselection spin, and valence-remain unchanged. This al-
observables) are the principal quantities needed lows us to speak of a "unique" antiparticle to
to specify a particle type and to distinguish every particle. Alternatively f!escribed: The par-
antiparticle from particle. Under conjugation ticle conjugation operator e is inique and, to-
the valence index remains unaltered: The an ti- gether with the identity operator, defines a
particle of an integral (half-integral) spin parti- unique particle conjugation group such that all
cle also has integral (half-integral) spin. That superselective charges below to its odd repre-
this must be so is a clear consequence of the sentation. Only the situation we have described
fact that s3 or A remains integral or half-integral under the heading of the "principle of unique
under conjugation; even with S3 or A reversed superselective conjugation" appears to occur
in sign, (- 1)283 or (-1)n remains unchanged. in nature. The behavior of the valence can be
By contrast, under conjugation the superselec- inferred from the fact that the conjugation is a
ANTIPARTICLES 62

symmetry operation which leaves all Poincare- time-inversion T, or 8' with p, one defines the
group type-observables unchanged. operator 8PT == 8 P == C, known traditionally
The uniqueness of conjugation-the simulta- as "charge conjugation," which is a kind of
neous reversal of sign of all the superselective "purely internal and unitary conjugation oper-
charges-requires a more subtle explanation. It is ator." Except for weak-interaction systems and
customary to interpret the rigorous conjugation. processes, it is again a symmetry.
operator to be what is commonly called CPT; To understand all this in detail it is necessary
here C is the electric charge-conjugation opera- first to clarify what we mean by "particles." The
tor found to be an exact symmetry in quantum fundamental entities of matter and radiation
electrodynamics, as also are P the parity of constituting the physical world are associated
space reflection operator, and T "time-inver- with observables which are: (I) quantum-
sion" (actually "motion reversal") which unlike mechanical; (II) symmetry-governed; (III) invar-
the first two is an antilinear operator (defined iant or covariant under the governing symmetry.
formally in the following). By taking the prod- (I) By the quantum-mechanical aspect we mean
uct only of the first two, one defines a linear that a key role is played by probabilistic-more
operator 8' commonly known as C (I' (it has also explicitly: statistically deterministic-concepts
been called "coparity" by the writer). To our rather than detailed deterministic concepts as in
present knowledge 8' is an exact symmetry for classical physics. This is because of the funda-
all processes with the sole exception of the weak mental fact of nature that any two observables
decays of one single type of particle-the kaon; A and B are not necessarily compatible, i.e.,
in these decays invariance under 8' appears to starting in a given situation and measuring the
be broken at the level- 10- 3 [the number two observables in alternation, they do not each
10- 3 refers to the ratio: Rate (KOL -+ 21T)/Rate repeat their original measure values; instead, the
(KOL -+ 31T) 1. measurements give for each a distribution of
Following the survey here, and an introduc- measure-value probabilities characteristic for
tory theoretical orientation, we propose that observable when preceded by a measure-
a reason why the principle of unique super- ment of the other. (II) By the symmetry-govern-
selective conjugation must hold: If the universal ing aspect we mean that it is of central impor-
conjugation operator can be identified with 8- tance to classify observables according to their
a symmetry which though nonobservable is behavior under physically relevant groups of
linear-and we also assume the von Neumann symmetries; each individual symmetry is by
property of the algebra of all observables, then definition a transformation ("change") which
the principle follows. In the standard interpreta- (a) leaves measure-value probabilities unchanged,
tion of the conjugation with another operator (b) is evolution-independent, and (c) leaves the
not 8', but spacetime type-observables, mass, spin, and
valence, unchanged. (III) By the invariance-co-
variance aspect we mean that among all possible
E> = CPT == 8 observables-and relative to a given degree of
generality of observation procedures-there is a
division into two classes-the type-observables
all the homogeneity operators, energy-mo- which are invariant with respect to the groups of
mentum P, are conjugation-invariant and symmetries, i.e., go into themselves when trans-
all the isotropy operators, space-space and formed by any symmetry, and the state-observ-
space-time angular momentum J, are con- abies which are covariant, i.e., mix non trivially
jugation-reversing and so likewise is the (re- with other members of a set of observables when
lated) observable component S3 of the spin. transform ed.
These quantities are not type-observables but A more explicit discussion follows.
rather "state-observables" and we shall discuss IV. Observables and Their Algebras. Sym-
them in the following . Their behavior under metries (I) The modern algebraic version of the
conjugation is not so clearly indicated empir- quantum-mechanical concepts is streamlined in
ically as that of the type-observables. For ex- comparison with the earlier formulations in
ample, with 8' as a conjugation operator, the P that the concept of a given state space has re-
components, rather than the Xi components, are ceded to the background; with the greater
conjugation-reversing. - economy and generality of algebraic quantum
Following the discussion of 8' we give an ac- theory, many results may be demonstrated
count of some of the conventional formalism more elegantly.
which has been successful in deriving the anti- (la) We consider all directly or indirectly mea-
particle existence principle and the conjugation surable properties which we call observables A.
principle (8 interpreted as CPT) as theorems in The observables constitute an algebra with a
a general context of relativistic quantum theory. certain formal structure. We first remark that
There then follows a brief discussion of the spe- observables in quantum mechanics are not all
cial and more stringent form which the general mutually sharp or compatible: In any given sit-
conjugation operator takes for the basic inter- uation, keeping all other conditions constant, if
actions : By combining 8 with parity P and we measure an observable A repeatedly we get
63 ANTIPARTICLES

consistently the same measure value. The same measurements give the same result in either
is not necessarily the case if any two observables order. For brevity the operator representing
A and B are measured in alternation repeatedly, an observable will also be called an observable.
keeping all other conditions constant; we find a A general operator A is said to be bounded or
distribution of values for each rather than a sin- to have a norm if there exists a positive number
gle value. We then say: "A and B are not com- b such that
patible." Two observables are compatible if in a
given system, prepared appropriately and re- II A¢> II ,;;;; b II ¢II. all r/> in X
peatedly in the same way, each observable al-
ways exhibits one characteristic definite measure The smallest number b with this property is the
number ("quantum number," "eigenvalue") norm of A and is denoted II A II. It follows that
upon consecutive repeated measurements appro-
priate to each observable in either order. Con- II A + B II ,;;;; II A II + II B II II AB II ,;;;; II A 1111 B II
stant observables, i.e., those which exhibit only
one measure value under all circumstances with II AA II = I A III A II
all systems (multiples of the "identity"), are of
cotlrse compatible with all others. If there is We now turn to the characterization of the
also a nonconstant observable-Le., in experi- algebra of observables in X. What the algebra
ments with various systems it yields differing formalizes are (1) the property that the numer-
measure numbers-which is compatible with all ical values obtained as the result of any mea-
other observables, we call it a superselection ob- surement are (a) real and (b) bounded; (2) the
servable. We shall discuss some examples later,
property that not only is every observable or
among them electric charge Q. algebraic function of observables compatible
In the following, let the complex conjugate of with all superselectables, but that every opera-
a scalar number, e.g., A, be represented by a tor compatible with all superselectables is an
observable or algebraic function of observables;
superior bar, 1. Consider an infinite-dimensional in short compatibility with superselectables
linear complex vector space X with an inner characterizes the algebra of observables.
product (r/>, 1/1 ), i.e., The formal structure which satisfies the re-
(r/>, 1/1 + e) = (r/>, 1/1) + (r/>, e) quirements of these two sets of properties is
that of a von Neumann algebra, a type of sub-
(r/>, A1/1) = A(r/>, 1/1) (Ar/>, 1/1) =1(r/>, 1/1) algebra (1 == {A} of the algebra of all linear
bounded (or, equivalently, continuous)operators
(r/>,1/1)=(1/1,r/» in X, which sub algebra satisfies two conditions:
1. (1 IS A C* ALGEBRA. A C* algebra in-
for all vectors r/> and 1/1; X is assumed complete volves (a) an involution applied to (b) normed
in the norm II r/> II defined by I r/>, r/> 1112 == II r/> II, operators.
and has a countable basis (every Cauchy se- a. lnvolutive application: There is a * or ad-
quence of vectors has a limit vector in the space joint application of (1 into itself such that, for
and a complete orthonormal basis can be intro- any A in (1, A *-the adjoint of A -has the prop-
duced), i.e., X is a Hilbert space. The unit vec- erties
tors of physical interest are only those lying in
certain subspaces of X defined by certain values (A+B)* = A * + B*, (AA)* = 1A *,
of the superselectables. These unit vectors, up (AB)*=B*A*,
to an arbitrary phase factor, represent states. A**=A.
Each observable is represented formally by a An operator for which A * = A is said to be self-
linear self-adjoint operator or "matrix" A acting adjoint.
on the vectors of X. The eigenvalue A of A for b. Normed operators: In a C* algebra we
an eigenstate r/>, i.e., such that A r/> = Ar/>, is the have that all operators are normed and that their
measure value found for the observable corre- norms satisfy
sponding toA when measured in the state r/>. The
role of self-adjointness is to ensure that all such IIA*II = IIAII, IIA*AII=IIAII2, (all A).
eigenvalues are bounded and real. What "self-
adjoint" means formally and what further prop- An operator H which satisfies the equation be-
erties are to be expected of A is clear from the tween inner products
following discussion.
Besides their action on the vectors of X, one (1/1,Hr/»=(H1/1,r/»
also has the effect of one operator in X acting
on another to give a resultant operator, and in is called hermitian. It is easy to verify that every
the case of the operators representing two ob- eigenvalue ot a hermitian operator is real:
servables, this product of operators represents (r/>, Hr/» = (r/>, Ar/» = A(r/>, r/» = (H¢>, r/» =1(r/>, ¢».
the successive measurement of the two observ-
abies. Two compatible observables are then rep- Likewise for every operator which is self-adjoint.
resented by two commuting operators corre- Every self-adjoint operator is hermitian, but un-
sponding to the fact that the two successive bounded hermitian operators are not self-adjoint.
ANTIPARTICLES 64

(It can be proved conversely that a hermitian frequency or probability measure for the eigen-
operator which is everywhere self-adjoint is value Aof A in the state Mis idim PAA iaA where
necessarily bounded.) aA is the coefficient of PAA in the spectral-
2. (1 IS EQUAL TO ITS DOUBLE-COMMU- theoretic direct sum expansion of M-and this
TANT (1". The latter is the set of all linear probability measure is given by Trace (MP0).
operators in :J(' which commute with the first On the other hand, the eigenvalue A itself oc-
commutant which, in turn, is the set of all linear curs as the coefficient of PAA in the expansion
operators in :J(' which commute with (1. of A. Thus if-having prepared the system each
The algebra of observables is a von Neumann time in the same way-we repeatedly measure
algebra based on self-adjoint operators where the observable A, the expectation value of A
simultaneous observability of observables means over a large number of similar trials is Trace
that the self-adjoint operators, which represent (MA).
the observables, commute. Since (1 contains all (II) Symmetries. A symmetry is a mapping
observables and algebraic functions of them, of the set S of all density matrices onto S which
condition (2) means here that a linear operator (a) preserves probabilities,
which is a nonmember of the observable algebra (b) is evolution-independent,
cannot commute with all members of the com- (c) leaves mass, spin, and valence index in-
mutant (1' of (1. Since (1' contains, besides the variant.
constant operators, only the set of all super~ A symmetry may or may not also be an observ-
selectables and functions of these, the assump- able.
tion that (1 is a von Neumann algebra requires (lla) Just as in the case of complex numbers
such a linear nonmember of the observable in the complex plane, each of the operators in
algebra not to commute with one or more of state space may be viewed in an active role as
these superselectables. We shall see in the sequel operating on other objects or in a passive role as
that this is one of the key point~ in rationalizing one of the objects being operated on. A particu-
the occurrence of unique superselective conjuga- lar kind of operation on an operator A by an-
tion as described above. other operator G is called "transformation of A
The possible measure numbers of the observ- by G" or a similarity mapping of A by G:
abies are represented by the spectrum of eigen- GAG-I. (This gives the operator in the G-
values ai of A. For simplicity suppose these transformed state space which corresponds to A
spectra are discrete. By the spectral theorem of in the original space, i.e., it accomplishes the
Hilbert space theory, A is a linear combination corresponding operation upon corresponding
with real coefficients of bounded self-adjoint states.) If under transformation by G, A goes
operators called projections (or "projectors") into itself: GA G-l = A, then A is said to be in-
pA, having the idempotent property (pA)2 = variant under G; more generally, if for 'each
pA, with pA corresponding to the eigenvalue A. member of a set {Ai}, GAiG-l gives a member
In particular, for describing various specific con- Ai' =Ai of the set, Ai and all its partners in {Ai}
ditions in which a given system is prepared, one are said to be covariant under G. If the entire
is interested in the subset S of self-adjoint oper- set (t of all observables is transformed onto it-
ators with all eigenvalues ai nonnegative and self by G, then we say G performs a similarity
such that the sum of all ai-the "trace" -is equal mapping of (1 .
to unity. We call these unit-trace nonnegative Condition (lla) says that a symmetry is a
self-adjoint operators "density matrices." These probability-conserving similarity mapping of S.
are in one-to-one correspondence with the Probabilities are given by the coefficients in the
"states," both pure and mixed, of the earlier "convex" (i.e., with real positive coefficients)
less-streamlined formulation of quantum me- linear combinations into which density matrices
chanics. (It should be noted, however, that the can be decomposed according to the spectral
modern C*-algebraic quantum theory recognizes theorem. Thus the preservation of probabilities
much more general "states" than can be repre- is equivalent to the preservation of convex linear
sented by density matrices.) Even this subset, combinations or projectors. There are two kinds
consisting of nonnegative self-adjoint operators of operators which, acting as transforming oper-
of trace 1, however, is too large for physics when ators, preserve convex linear combinations of
superselection observables are present. :J(' is then projectors.
divided up into mutual totally incoherent sub- I. A unitary operator U is linear
spaces corresponding to the different eigenvalues
of the superselection observables; the operators U(o:¢ + (31/1) = o:UI/J + {3U1/I
representing physical observables act only within and isometric, meaning there is equality in abso-
each subspace but not between them. lute value of both sides of
(Ib) Probabilities. The co~cept of probability
gives physical meaning to the projection p0 cor- (U¢, U1/I) = (I/J, 1/1).
responding to an eigenvalue X. Suppose a system
is prepared repeatedly in the same manner, so 2. An antiunitary operator 8 differs only in that
that a given density matrix M describes the state it is antilinear isometric, i.e.,
of the system, and we measure the observable A
each time. On the one hand, the occurrence 8(o:¢ + (31/1) = a8IjJ
+ ~81/1; (81/J, 81/1) = (I/J, 1/1).
65 ANTIPARTICLES

The second condition implies that the antiuni- empirical fact that there exist particle types for
tary transform of an operator A is given by each case of only one charge differing from
zero-ail other charges having zero values:
A' = (8- 1 A8)*, and therefore (AB)' = B'A',
B Q Lp. Le LT
or there is a reversal of order of operator prod-
ucts if 8 is antiunitary. baryon,
otherwise neutral: AO 1 0 0 0 0
It is clear that the transformation @A@-l, electrically charged hadron,
where @ is a unitary or antiunitary operator, otherwise neutral: rr+ 0 0 0 0
preserves convex linear combinations of observ- mu-Iepton,
abIes. When we limit A to the subalgebra otherwise neutral: vp. o0 0 0
S of all density matrices in ct, the converse is e-lepton,
also true. By the general Wigner-Kadison theo- otherwise neutral: Ve o0 0 0
rem (Wigner, 1931-1959; Bargmann, 1964; tau-lepton,
Kadison, 1951) every convex combination-pre- otherwise neutral: VT o0 0 0
serving one-to-one map of S on S is either
unitary or antiunitary. Moreover this mapping Considering each of these systems in turn, we
operator is unique up to a phase factor. (Wig- draw the conclusion that each of the superselec-
ner's formulation of the theorem was in terms tive charges, when occurring singly, anticom-
of preservation of absolute scalar products of mutes with the symmetry operator 8', so that,
states rather than of convex linear combina- for consistency, when several occur together
tions of density matrices). with nonzero values, they must all reverse to-
We should qualify that, in the presence of gether. An example is the proton p (B = 1, Q = I)
superselectables, the symmetry mapping may be
from a sub algebra S, of density matrices onto
which , under conjugation of the decay A0 ~
p+ + rr- has to give the unique antiparticle p
-
some other coherent subalgebra S2 isomorphic (B=-I, Q=-l).
with S" but labeled by different values for the We note that to be able to draw the conclusion
superselective observables. This is the case when that a superselective charge is reversed by a non-
the symmetry operator 8' is not a member of observable symmetry - "conjugation" -it was
the observable algebra. If, in addition, 8' is necessary for that superselective charge to occur
linear, we can draw an important conclusion. with an eigenvalue unequal to zero, while all
As we have already emphasized, in this case- others have zero eigenvalues (this is what we
because the algebra of observables is a von mean by "otherwise neutral"). The argument
Neumann algebra-the linear nonobservable will, however, not go through if we have not es-
symmetry operator 8' cannot commute with all tablished the conjugation invariance of that one
superselective charges. As the notation has al- remaining superselective observable, which is not
ready indicated CP == 8' is such an operator; un- a charge, i.e., the valence (-1 )2S 3 [or (-1)2 A for
like 8 == CPT, 8' is linear-P is linear and C(see the massless casel. This conjugation invariance
end of article), as dictated for instance by quan- is assured, however, because the valence must
tum electrodynamics, is linear. Also, unlike 9' or always be +I or -1 depending upon the integer
C separately, 8' is a symmetry operator in all or half-odd integer value of the spin; but, by
cases, with the apparent exception of the neutral condition (c), a symmetry leaves the spin and
kaon decays. Further, except in the small sub- therefore the valence index invariant.
space of totally neutral particle states, 8' is not It is striking that the only violations of 8'
a member of the observable algebra: There is no which prevent it from being taken as a universal
preparation or measurement procedure, or com- conjugation under which interactions are invari-
bination of such procedures, corresponding to ant are in the neutral kaon decays. The neutral
turning particles into antiparticles. Thus, over- kaon case is precisely the one in which there
looking for the moment the difficulty with the exists no superseJective charge which by its re-
neutral kaon decays, 8' is the natural candidate versal defines the difference between particle
for a nonobservable linear symmetry operator and antiparticle. Only the approximately con-
representing particle conjugation, which cannot served "strangeness" or "hypercharge" (see fol-
commute with all the superselective charges. lowing) is available as a possible charge to dis-
That 8' must actually anticommute with the
superselective charges, e.g., letting Q be the criminate between K and K.
relevant charge: (IIb) We note with emphasis the importance
of condition (lIb). Though every unitary or
8'Q=-Q8' anti unitary operator preserves probabilities, it is
not necessarily a symmetry. The requirement of
follows from the fact that all probabilities, in- evolution independence is what brings the enor-
cluding the square of the expected value of the mous number of unitary and antiunitary opera-
charge, must be preserved. That this happens tors in J( down to the physically significant sub-
simultaneously with all superseJective charges set of symmetry operators. Evolution indepen-
(principle of "unique superselective conjuga- dence may be described as a "generalized time
tion") becomes evident when we recognize the independence." The evolution matrix-known as
ANTIPARTICLES 66

the "S-operator" or "S-matrix" -is an operator tion matrix invariant." It is clear that for find-
which takes anyone of a set of initial asymp- ing the S-matrix, which is central in elucidating
totic states to anyone of a set of final asymp- fundamental processes, the knowledge of its
totic states, where the states are characterized symmetries is invaluable.
by the values of certain observables and where The symmetries of any system form one or
"initial" and "final" refer to appropriate bound- more groups. In general there are discrete groups
ary conditions. The initial and final set can be of symmetries (e.g., the permutations of identi-
the same and be a complete set spanning the cal particles in a multiparticle system), and con-
entire state space. (In the case of the "collision tinuous groups of symmetries. In the latter case
states" appropriate to scattering problems, this the generators are of particular importance. Let
condition is known as asymptotic completeness, G be any symmetry operator differing infinites-
and it goes with unitarity, S* = S-i , of the S- imally from the identity operator I. Then the
matrix). The evolution matrix is a replacement "business end" of this operator defines a gener-
of a "moment-to-moment" time-displacement ator §: G = I + € §. In this case the transforma-
operator by an overall or global "before-and- tion by G of any operator A goes-to the first
after" operator. The need for such a replace- order in €-into A plus the commutator [§ , A ]
ment is evident when we reflect on the fact that of § with A:
the passage of "time" is relative to the observer,
according to the special relativity principle GAG-i ~A + e[§ ,A] [§,A]=§A-A§.
which governs nature-at least locally. In funda-
mental processes, only the before-and-after rela-
tions described by the S-matrix have an observer- The invariance of A under all members of a
invariant significance, and the formal expression group {G} requires then that the commutator
of the property of evolution independence, of A with all generators of {G} be zero. In par-
which every symmetry G must satisfy may be ticular the S-operator must commute with the
stated: When the symmetry is performed first generators of any symmetry group.
and followed by evolution, it gives the same re- The S-operator governing the evolution of
sult as when evolution is performed first and fundamental processes, is a particular though
then the symmetry. important case. Its structure, as well as the def-
For a unitary symmetry U this condition inition of particle types which undergo the fun-
takes the form damental processes, is elucidated by considering
the invariant and covariant bedfellows of the S
SU= us operator. Quite generally, in its passive role as an
operand, any operator will be found to have
which may be read either way as: SUS-i = U, certain transformation or covariance properties
"the evolution matrix leaves every unitary sym- under all the generators of a given group of state
metry invariant"; or USU-i = S, "every unitary space transformations. This characteristic trans-
symmetry leaves the evolution matrix invariant." formation behavior, as well as the relation struc-
For an antiunitary symmetry e, because initial ture of operators with each other, plays a key
and final states are interchanged under the action role in deciding whether an operator may be in-
of e, the evolution-independence condition has terpreted as representing a physical quantity
to be stated as having that transformation behavior. The gener-
ators themselves are usually merely covariants
se= es* of the group, but certain polynomials or func-
tions of them may be invariant; a function of
where S*, the adjoint of S, takes final to initial generators which is invariant-i.e., commutes
states; this equation is easily checked by letting with all generators-is called a "Casimir opera-
the right and left hand sides operate respectively tor" or "Casimir invariant" of the group.
on a final state f. The results of the sequence of (IIc) This condition is roughly equivalent to
operations on the right and left can be symbol- the one otherwise expressed by the statement
ized as follows (primes indicate e-transformed that a symmetry operation should leave the sub-
states and f indicates the result of S operating space of one-particle states invariant. We have
on 0: already seen its usefulness in our argument con-
([' +- i', i' +- f) = ([' +- i, i +- f).
cerning the invariance of the valence index un-
der conjugation.
(III) As we have already indicated, by the
The relation for Sand e may then be rewritten "invariance-covariance" aspect of the funda-
S* = e-ise mental physical entities we mean that among
all possible observables there is a (heuristic)
and, referring back to the expression given earlier division in two classes:
for the antiunitary transform of an operator, we
see that, even though it is not a simple similarity I. A subset which we call "invariant" or
invariance, this relation can be expressed by a "particle type-<iefining";
statement parallel to that for the unitary case: II. all other observables which we call "co-
"every antiunitary symmetry leaves the evolu- variant" or "state-defining."
67 ANTIPARTICLES

In our formulation both subsets will be char- in matched particle-antiparticle pairs by ~, but
acterized by labels associated with symmetry also that ~ is represented in JC by an anti-
groups. The first set of observables-used to unitary symmetry 8. We call this the conjuga-
define fundamental particles-are certain ex-
ternal and internal Casimir invariants, so called tion principle. That E> should be represented
because they are associated with external and ("implemented") by an operator 8, unitary or
internal symmetry groups governing the funda- anti unitary , in JC is a purely mathematical
mental systems. In contrast the second set of consequence of E> mapping onto itself the
observables-used to characterize the states in subalgebra of density matrices (Wigner-Kadi-
which fundamental particles are found-are son). The nontrivial physical point of the con-
covariants of these same external and internal jugation principle-a key point, commonly re-
symmetry groups. We shall see how this very ferred to as "the CPT Theorem" -is that 8 is
valuable distinction is nevertheless relative to not just an operator but is asymmetry operator.
the degree of generality of the phenomena In other words the evolution operator S, and
being considered. the Poincare invariants, mass spin, and valance
The external symmetry group is the proper (and therefore the dynamics of all fundamental
spacetime symmetry group, i.e., the group of processes), are invariant under 8, Le., E>, which
all rigid translations and rotations in spacetime we have defined as a mapping of symmetries
(Poincare group). The covariants here are phys- on symmetries, itself gives rise to an antiunitary
ical representatives of the four translation symmetry. More explicitly, the conjugation
generators-the components of the energy-mo- princple may be stated: The conjugation oper-
mentum l-and of the six rotation generators-
the space-5pace and space-time components of
ator e is represented in state space JC by (1) an
evolution-independent mapping 8 of states in JC
the angular momentum l '" (M, m). The two onto other states in JC, such that: (2) transition
~ ,-+

probabilities are preserved, with initial states


principal Casimir invariants here define the being mapped on final and final on initial; and
principal external properties of elementary (3) mass, spin, and valence are preserved.
particles: Their magnitudes give the numerical
values of the first two of these properties, mass S8- 8S =0 (1)
m and spin s. These numerical values label the
possible "symmetry types" or irreducible repre- (1/I!;',R', 1/I!;,R) ---+ (81/1!;',R', 81/1!;,R)
sen tations ("reps") of the proper spacetime
symmetry group. = (1/I!;,-R, 1/I!;',-R') (2)
We have discussed the general doubling in 8m L m 2 8=0 8s(s+ 1)- s(s+ 1) 8=0 (3)
Nature of particle types for which the conjuga-
tion-invariant observables are identical, whereas We emphasize that the conjugate of the
the values of all other type-defining observablesoriginal initial (final) state plays the role of the
are exactly opposite for the two members of final (initial) state in the conjugate process, of
the double; and we have introduced the map- which the rate is identical to that of the original
ping operator E> which acts on observables. process. Again we shall discuss later how the
Formally, E> "acting on" an observable means conjugation principle may be derived as a
transforming according to ~( )~-I. The super- theorem in the context of general theory.
selective charges: B, Q, Land Q, and A are in- V. Conjugation-Invariant and Conjugation-
cluded among the type-defining conjugation- Reversing Observables The mass squared is the
squared magnitude of the energy-momentum
reversing observables, and give - B, - Q, - L, - Q,
- Aunder this transformation. four-vector (Table 3). The spin comes in as fol-
What about the behavior of state-characteriz- lows: In general there is defined a four-compo-
nent polarization operator Wu made up by the
ing observables under E>? These also may be outer product of the linear momentum operators
sorted out once and for all into two sets, con- and the angular momentum operators. (The time
jugation-invariant and conjugation-reversing.
component Wo =P . M is the longitudinal polari-
~ ~

From now on we use the generic symbol L to


represent all conjugation-invariant observables, zation.) There are two cases generally recognized
and the symbol R to represent all conjugation- to be of physical interest, m 2 > 0 and m 2 = O.
reversing observables, whether type or state. In the massive case, Wu yields as the measure of
its invariant magnitude the spin s according to
Thus, E> is an operator which maps the sub- the formula W2 = m 2 s(s + l).
algebra of symmetry-related observables onto By a general spin-statistics theorem of rela-
itself according to tivistic quantum theory, the spin of a one-
boson system is integral and of a one-fermion
system half-integral.
Closely related to the existence of anti- In the case of zero-mass particles, W splits
particles-almost but not quite as firmly es- into two parts, each consisting of a pair of
tablished empirically-is the further fundamen- components. The pair of components perpen-
tal fact that not only may all types be arranged dicular to the momentum define, by the sum of
ANTIPARTICLES 68

their squares, the "continuous spin" invariant cussed one of these sets: Each fundamental
r2 where r is a real nonnegative number. The entity belongs (in its mathematical description)
remaining components, longitudinal and time- to an irreducible representation- "rep" -of the
like, are equal in this case, and they yield symmetry groups governing all physical systems,
another invariant, the projective index a = 0 or and the type-characterizing observables are as-
! or valence index +1 or - 1, which specifies the sociated with the Casimir operators whose
integral or half-integral character of the forward eigenvalues are the labels specifying the rep.
angular momentum (i.e., the projection along What are the other labels with which the state-
the direction of motion). For r > 0 this forward characterizing properties are associated? They
angular momentum takes on an infinite dis- are the labels for the rows of the reps and do
crete ladder of values, while for r = 0 the not have the same invariant significance as the
ladder breaks up into its individual segments rep labels since various basis systems of states
(the representation matrix reduces to the di- may be chosen to span the vector space con-
rect sum of its diagonal entries). In these r = 0 stituted by each rep. The type-observables have
cases the forward angular momentum-now an relatively stable values, defining the identity of
invariant-takes on a unique integral or half- the particle, so that it may be recognized as the
integral value, the helicity, A, and the valence same at the end as at the beginning of a process;
index ceases to be an independent invariant- whereas the state observables are more labile,
just as it happens in the massive case. All these having values which can vary between the be-
relations and cases are summarized in Table 2. ginning and end of the many processes in which
In nature only cases where the continuous in- the identity-e.g., m and s-of the particle is
variant r is equal to zero seem to occur. As we unaltered. For example, a rep of the Poincare
have indicated, the two sets-particle-defining group comprises a vector space of states, the
observables and state-defining observables-are vector space consisting, •. g., of states of all
associated respectively with two sets of labels of momenta and longitudinal polarizations asso-
mathematical groups-groups which describe the ciated with a particle of given mass and spin.
symmetries of natural systems. We have dis- Alternatively, a basis of angular momentum

TABLE 3. GENEALOGY OF THE EXTERNAL-PROPERTY INVARIANTS


MASS m AND SPIN S OR HELICITY A

Energy-Momentum Total Angular Momentum Polarization Vector

Four Components of a Three Space-space Compo- Four Components of a


Spacetime Four-vector nents of a Spacetime Spacetime Four-vector
Six-vector (Axial)
J23 ==MI J 31 ==M2 J I2 ==M3; Wo ==Pl h3 +P2 J 31 +P3J 12
and Three Spacetime WI == POJ 23 + P 2J 30 + P 3J02
Components W2 ==POJ 31 +P3J IO +P IJ 03
JIO == 1II1 J 20 == lR2 J30 == lR3 W3 ==POJ I2 +P IJ 20 +P 2J OI
3 3
Timelike Energy P~ _ ~ P~==mZ ~ W~ - W~ = m 2 s(s + 1)
Momentum. k=l k=l
Standard form Po =m Wo =0
taken when state PI =0 Ml =Sl WI = mSI
is limited to the Pz = 0 M2 =S2 W2 = mS2
subspace of P3 = 0 M3 =S3 W3 = mS3
sta tes in which
the center of
mass is at rest.

Lightlike Energy m =0 ~ W~ - W~ =r2


Momentum. k=l
Standard form Po =p Wo = W3 = pH3
taken when state PI = 0 MI -lR 2 = TI WI = pTI
space is limited P2 = 0 M2+lRJ=T2 W2 = pT2
to the subspace P 3 =p M3 = J(3
of states in In caser = 0
which the mo- Wa == A.Pa
mentum is in the A = lJ! . e/lf!.!
x 3 direction.
69 ANTIPARTICLES

might be chosen. The state-characterizing ob- modular-"unitary" group on one object), Le.,
servables include then energy-momentum P, groups of evolution-independent transforma-
and angular momentum J, or W, depending;)n tions having the form eiBct>, eiQct>, eiLct>, eiQct>,
the basis chosen. -- respectively (</> is an arbitrary parameter ranging
We have in the Poincare group an example of over all elements of each group). In these cases
how the very valuable distinction between B, Q, L, and Q are the sole generators-indeed
particle-defining labels and state-defining labels Casimir operators of their respective SUI groups.
is relative to the degree of generality of the ob- With the specification that they are genera-
servations. Suppose that we limit the allowed tors of symmetries-i.e., evolution-indepen-
changes of observation systems (or "frames of dent-it comes about that, as in the case of
reference") to space rotations, so that the ex- energy-momentum and angular momentum, the
ternal symmetry group reduces to the rotation sum total value of each of the internal
subgroup of the Poincare group. Then the fourth charges is conserved in any process. In the case
component-the energy Po -of P, which in the of mass and spin of a composite system, the
addition is vectorial rather than simply algebraic,
full group is merely a covariant, becomes an and in the case of mass, there may also be a
invariant, Po =(/>2 +m 2 )1I2, corresponding to contribution due to internal energy changes.
the Casimir operator />2 of the space rotation In contrast, the charges have values which are
group. This example illustrates the general simply additive in a composite system. As al-
proposition that as the scope of observation is ready mentioned, the sign reversal of the values
increased, it becomes more and more natural to of the charges for antiparticles is essential to
view different particles merely as different states maintaining the conservation laws in the pro-
associated with one supersystem. cesses of pair creation and pair annihilation in
We turn now to the internal properties of which antiparticles are observed appearing and
particles. As already remarked, Fermionic had- disappearing with their conjugate particles. Only
rons are called baryons, bosonic hadrons are those particles for which the values of all
called mesons; nature permits us to define charges are zero, e.g., the photon -y, the neutral
baric charge B equal to ± I for the former and pion 11'0, and the fI, p, w, and </> mesons are
o for the latter in such a way that the total identical with their antiparticles; these are
baric charge is conserved in all processes. For called "self-conjugate particles." Any number of
some unknown reason baric charge is related to these may occur together with pair creation or
spin or statistics. annihilation. Examples are
So far only fermionic leptons of two types-
electron or e-type and muon or mu-type-are
known, but the existence of bosonic leptons
involved in weak interactions has been con- depending upon whether the spins of the
jectured. Again, in accord with exact conserva- electrons are antiparallel or parallel.
tion in all processes, nature permits us to de- As indicated earlier there is an external prop-
fine: (1) electron leptic charge Q, equal to ± I for erty or type observable, the parity operator P
electrons and e-neutrinos, and 0 for muons and associated with invariance of non-weak evolu-
mu-neutrinos; (2) mu leptic charge L, equal to tion processes with respect to the discrete oper-
±I for muons and mu-neutrinos, and 0 for ation of space inversion (reflection). When one
electrons and e-neutrinos. (The universality of extends the proper Poincare group to include
the muon- and electron-Ieptic charge labeling- space inversion, the symmetry types and the
based on the phenomena observed in weak corresponding states double. Naively one says
interactions-is not quite certain.) that P2 = I, and that the parity of a one-par-
Intermediate between hadronic and weak ticle state or one-antiparticle state is +1 or - 1
interactions in strength is the electromagnetic according as the state function is preserved or
interaction. Here nature permits the definition reversed in sign when the space coordinates
of a conserved electric charge Q equal to ± I and upon which the state function depends are all
o for various particles, hadronic and leptonic, reversed. (Properly speaking the term "parity"
as the case may be. as used here means "the eigenvalue of the par-
Another way to regard the existence of ity operator when applied to the state," and
these internal charges, in which the aspect of the state must therefore be an eigenstate of P.)
conservation is derivative rather than primary, Actually the result of a double inversion can be
is in terms of symmetries. In this way too we asserted to equal the identity only up to a
are led to four independent universally ob- phase factor-a complex number of modulus
servable internal properties; baric, electric, 1, namely p2 = exp (i2CXR), where cxR is some ar-
muon- and electron-Ieptic charge B, Q, L, and bitrary phase angle which may be different for
Q, defined by symmetries associated with the be- the states associated with each superselective
havior of particles under the three fundamental family-i.e., for each set of particle types with
quantum-relevant interactions: strong, electro- given values of the charges, electric, baric, mu-
magnetic, weak. Each of the types of charge may leptic, electron-Ieftic, and hypercharg,e Y (see
be associated with an SUI ("special"-Le .. uni- later discussion): Q B, L, Q; Y}:::: {R}. Exam-
ANTIPARTICLES 70

pies of such families are (1) the self-conjugate reason why a boson can have an unambiguous
or totally neutral (Q = B = L = Q = 0) "non- parity (relative to the conventionally chosen
strange" (Y = 0) mesons; (2) all the nonstrange parity of the vacuum, P = +1) is that the con-
mesons of positive electric charge; etc. With tinuous function describing it obels the "Co-
p2 = exp (j2aR) the two possibilities for the ef- lumbus principle," Le., that a 360 movement
fect of P are multiplication by exp (jaR) and of the observing apparatus around an axis re-
-exp (jaR). Within any given family, by renor- stores the original description. Because the de-
malizing, P -+ exp (jaR) p, one can reduce to the scription of a one-fermion system does not
two alternatives + I and - 1. See PARITY. obey this principle, the intrinsic parity of a one-
More explicitly, we describe the situation as fermion system is not uniquely defined.
follows. Within any superselective sector, Le., a A fermion intrinsic parity is, however, often
superselectively separated subspace of state assigned conventionally, as for the nucleon fam-
space, we assume the "principle of maximum ily where it is taken to be +1. More generally, as
coherence" to hold: any two physical states cP a consequence of the isospin and SU 3 groupings
and 1/1 can cohere into a single physical state, of particle types with respect to hadronic inter-
with the relative phase angle of 1/1 with respect actions (see later discussion), it is natural to
to cP in the coherent superposition cP + 1/1 unam- assign those in the SU3 multiplet the same in-
biguously determined. Observationally the trinsic parity. It is important to emphasize how-
phase angle ~ appears in the interference term ever that the choice of relative parity between
2 1 cP, 1/1) 1 cos ~ in the scalar product two superselectively separated particle types
(e.g., neutron and proton) is only conventional.
(CP + 1/1, cP + 1/1) = (CP, CP) + (1/1, 1/1) This is because any two physical states rp and 1/1
+ 21 (CP, 1/1) I cos~. lying in different superselective sectors can not
cohere into a physical state, and the behavior of
If, under reflection, cP and 1/1 behave the same the relative sign of cP with respect to 1/1 under a
(CP + 1/1-+ cP + 1/1) the interference term remains space inversion cannot be deduced since there is
unchanged, and we say the relative parity of cP no interference to be observed. It is true how-
and 1/1 is + 1; if they behave oppositely (CP+ 1/1 -+ ever that within each superselective sector a
cP - 1/1 :: ~ -+ ~ + 1r) the interference term choice, albeit conventional, of the intrinsic par-
changes sign, and cP and 1/1 are said to be of rel- ity of one particle, A, determines that of any
ative parity - 1. Relative parity, being a special other particle, B, according to the rule derivable
case of relative phase factor, has the transitivity from the transitivity property
property, Le.,
(Intrinsic ParitY)B
P(A with respect to B) X P(B with respect to C)
= (Relative ParitY)B/A (Intrinsic ParitY)A
= P(A with respect to C).
By its nature as a phase factor-the eigenvalue
In the coherent superposition case where 1/1 of a discrete symmetry operation rather than of
has relative parity -1 with respect to cP, cP + 1/1 is a generator of a continuous symmetry opera-
not a parity eigenstate though it is a physical tion-the parity of a composite system (product
state. For simplicity and brevity we shall refer state) is the product of the parities of the parts.
to all physical states with definite charges and Thus given the parities of the parts the parity of
Poincare group indices, which are also parity the composite is determined, but there are cases
eigenstates, simply as particles, even though where the converse is not true: other than in
from some points of view they are composite: totally neutral cases, particle states are pre-
n-particle states, particle-antiparticle states, etc. vented from combining into states (coherent
The vacuum state (Le., with no particles pres- superpositions) with the antiparticle states by
ent) has the same values, {R} = 0, for the the different values-of one or more superselec-
sup.erselective charges as the totally neutral par- tive charges. This has the consequence that the
ticles. Thus the relative parity of any totally intrinsic parity of the antiparticle with respect to
neutral particle with respect to the convention- that of the particle is not defined. In the case of
ally chosen parity of the vacuum, P= + I, is a system made up of a particle and its own anti-
empirically determined . This is called the intrin- particle (which is therefore always equivalent
sic parity of the particle. ' Because of the extra- to a totally neutral boson and consequently in
ordinary empirical fact that the set of all fer- the same superselective sector as the vacuum)
mions is coextensive with that of all leptons or the intrinsic parity of the pair is however de-
baryons, as exhibited by the equivalence: half fined. In the case where the pair consists of a
integral spin ~ either B or L or Qf 0, the totally boson and its antiboson the intrinsic parity of
neutral particles occur only among the bosons the system is +I. (If the boson itself is totally
and only for these is the intrinsic parity unam- neutral then of course it and the antiboson
biguously determined. Another way of obtain- must have the same parity, either + I or - 1; but
ing the same result follows from the fermion if it is not totally neutral it can be assigned
side of the above empirical equivalence. The P = e ia , and then the antiboson must be as-
71 ANTIPARTICLES

signed ~ = e-iO!; a can be arbitrary, e.g., a = rr /2 lows. There are three generators II, 12 , 13
so that boson and antiboson could be assigned for the isospin group SU2 . This group has
opposite parity.) The intrinsic parity of a given identically the same Lie algebra-and therefore
fermion-antifermion pair (constituting, there- the same theory-as spatial angular momentum.
fore, a one-boson system) is, however, always By convention the observable component-rep-
-1. (If the conventional assignment of P = + 1 is resented by a diagonal matrix-is taken to be 1 3 •
made for the nucleon" then P =-1 for the anti- The ladder of diagonal elements has of course
nucleon; but if P = ei<fi for the nucleon, then the three properties found with angular mo-
P = ei(Tr-<fi) for the antinucleon-and 4> can be mentum: (l) It is equal-runged; (2) it is sym-
arbitrary, e.g., 4> = rr/2 so that both nucleon and metrical about zero-thereby specifying integral
antinucleon have the same parity, i. or half-integral units of isospin as eigenvalues;
The general result that P = + 1 for a boson (3) it is finite, i.e., begins with (I3)min =-I and
particle-antiparticle pair and ~ = - 1 for a fer- ends with (I3)max = I. The operational signifi-
mion particle-antiparticle pair is derivable the- cance of the observability of 13 , i.e.,
oretically by applying the assumption of ana-
lyticity, in the sense of complex function 13 =Q + const
theory, to the description of scattering pro-
cesses. This is done in the manner indicated plus the definition of Y as Qmax + Qmin, then
later, in connection with the derivation of the leads to the Gell-Mann-Nishijima relation
antiparticle existence theorem, by analytic con-
tinuation of the scattering amplitude for the Q=/3+ Y/2.
process
Clearly, along with Q, 13 and Yare conjugation-
a+b~a+c reversing.
The invariant isospin 1 of an isomultiplet is,
to a region where it corresponds to the process like spin s to which it is analogous, conjugation-
invariant. Under appropriate conditions there is
b ~ ii+ a + c. also an isoparity G which, like space parity P,
is conjugation-invariant. G is defined unambig-
It is confirmed observationally by selection uously only for hadronically interacting sys-
rules which hold in various pair annihilation tems for which B = Y = 0 (and of course L =
processes. Q= 0), i.e., for hadronic systems which, "neu-
We also have some observables associated with tral" in every other respect, need not be elec-
approximate internal symmetries in processes in trically neutral. All such systems can be
which only the strong interactions are impor- connected directly or through a sequence of
tant: The hypercharge Y, isospin I, and iso- intermediate systems to a basic number of
parity G are observables characterizing certain pions; the latter are stable against hadronic
isomultiplets of nearly coincident mass but decays. The isoparity G equals + 1 or - 1 ac-
differing electric charge into which the mesons cording to whether the basic number of pions is
or baryons group. These isomultiplets are of the even or odd. Individual pions then have odd
internal symmetry group SU2 • The members of isoparity and, as its name and the rule indicate,
each isomultiplet have the same baric charge isoparity is multiplicative for a composite sys-
and differ by consecutive values of the electric tem. Among mesons, G can be defined only for
charge Q. The hypercharge Y is the sum of the the Y = 0 "nonstrange" isomultiplets for which
lowest and the highest electric charges in the 1 is integer because Qmax = - Qmin = 1 cannot
isomultiplet. The isospin magnitude 1 is a be half-integral insofar as has been established
number which measures the multiplicity of the for any actual particle. (In practice such iso-
isomultiplet as 21 + 1. The full significance of multiplets are either isosinglets or isotriplets
Y comes in a further grouping of these SU2 since in none of the well established meson sys-
isomultiplets into hypermultiplets belonging to tems does 1 exceed 1.) There are 17 such ha-
the higher symmetry group SU3 which has been dronically unstable mesons for which G is very
found very successful. The isomultiplets within well known; in every case it is found to be con-
each hypermultiplet still have the same baric served in the purely hadronic decays of these
charge B but differ by consecutive values of the particles. (Review of Particle Properties, April
hypercharge Y. Further suggested higher internal 1973). G is violated in other interactions as, for
symmetries have been found partially successful. instance, in the decay of the hadronically
In the hadronically relevant isomultiplets stable lightest 1] meson (550 Mev) into three
(associated with an SU2 group) the generator 13 pions by virtual electromagnetic interactions.
is an observable which is conjugation-reversing. In the generalization to S U 3, with the group-
The hypercharge Y which, under purely ha- ing of isomultiplets to hypermultiplets, B re-
dronic interactions, is a generator of an SUI mains an invariant while Y which was (a non-
symmetry group, is again a conjugation-reversing Casimir operator) invariant for SU2 becomes a
observable, i.e., behaves like a "charge." (covariant) generator. The electric charge Q
Both of these results are confirmed as fol- which was originally the unique generator-and
ANTIPARTICLES 72

therefore, trivially, the Casimir invariant-of an (A) the merely statistical determination in
SUI group, and became a displaced generator of nature; (B) the disturbance by the act of ob-
the isospin SUz group through the Gell-Mann- servation of physical systems-which is related
Nishijima relation, becomes a regular generator to mere statistical determination; (C) within the
in the SU 3 group. context of mere statistical determination, the
To sum up then, the set of type-defining ob- causal independence of well separated experi-
servables includes (in the massive case) mass m, ments; (D) analyticity of the collision or S-
spin s, parity P (with certain qualifications in matrix elements in its arguments.
fermion systems), and also electric, baric, muon- (III) A somewhat more specific quantum
and electron-Ieptic charges (Q, B, L, Q) and mechanical assumption than the above refers to
(with certain qualifications) hypercharge Y, the operational impossibility of counting, as
isospin I, isoparity G, and SU 3 rep labels p and distinct, particles of the same kind.
q. In the massless cases occurring in nature the (I) We have already discussed in some detail
type-characterizing observables comprise be- the manner in which the existence of space-
sides helicity A, parity P, isoparity G, and muon- time symmetry allows us to define particle
and electron-Ieptic charges Land Q (the latter types and states.
three with qualifications). The values taken by (IlA) This general quantum mechanical prin-
these observables classify all existing entities ciple takes the form of assuming the existence
into particle types. of a Hilbert space of asymptotically free 1-
VI. Derivation of Antiparticle Existence and particle momentum eigenstates. A scalar prod-
Conjugation Principles from Relativistic General uct of these states with themselves and with
Quantum Theory; Other Results We turn now each other is defined. The one-particle states may
to a discussion of proposed theoretical deriva- be added. When added they obey the super-
tions of the antiparticle principle and the position principle: Any linear combination
conjugation principle from very general theory (except for certain superselection restrictions)
with a minimum of supplementary features. is itself a state. Conversely any given first state
Successful theories in physics are often built may be considered as a linear combination of a
in stages: (1) a general theory with room for complete set of states built up beginning with
further structure (thermodynamics, Maxwell's any other second state. The probability of
E-M theory without constitutive relations, etc.); finding the second state immediately after
(2) more particular theories filling out the struc- establishing the first state is given by the
ture. The primary general theory of elementary square of the probability amplitude which is it-
particles and antiparticles is relativistic general self given by the scalar product of the second
quantum theory, and the two more particular state with the first.
theories which fill out, in alternative ways, what (IlB) Takes the form of recognizing that
is to a large extent the same structure are: (1) arbitrarily chosen observables are not necessarily
general quantum field theory and (2) its some- compatible with each other. A complete set of
what stripped-down version known as S-matrix compatible observables (C.S.C.O.) is any set
theory. (In taking these theories to be adequate needed to define the states uniquely, and the
we are trusting that the problems of making individual state is labeled by the set of values
them applicable universally, i.e., also to infinite which the C.S.C.O. takes on for it. The states
range interactions"-zero mass quanta- as in are represented as rays in a Hilbert space, the
electrodynamics, can be resolved by appro- observables as linear operators. Compatibility
priate procedures or reformulations, e.g., alge- (noncompatibility) of two observables is rep-
braic quantum theory. It is to be noted that resented by commutativity (noncommutativity)
difficulties associated with the possible occur- of the corresponding two operators. In many
rence of zero mass particles are common to both instances two such operators anticommute, and
the S-matrix theory and the field theory.) this is how quantum theory describes the re-
The formal ingredients of relativistic general versals of sign of certain properties under 8 as
quantum theory, which is the common basis of indicated ahove. Thus we have
the two more particular theories, are:
(1) Relativistic elements expressing: (A) con- 8Q + Q8 =0 or 8 Q8- 1 =- Q
tinuous spacetime symmetry or invariance of
natural processes under the Poincare (inhomoge- 8B + B8 = 0 or 8B8- 1 = - B, etc.
neous Lorentz) group of translations, rotations,
and uniform velocity shifts; (B) operational pos- where the similarity transformation on the left
sibility of distinguishing before and after in of the second equation on each line represents
natural processes (distinction between forward the effect of conjugation on the corresponding
and backward light cones). (C) More specific charge operator.
than any of the preceding postulated elements (lIC) Besides the scalar product and the addi-
is the assumption that no zero mass particles tion of one-particle states, one may also con-
occur. sider taking ordered outer products of one-
(II) Quantum mechanical elements expressing: particle states with the objective of representing
73 ANTIPARTICLES

multi particle states. This is straightforward in existence of the S-matrix is postulated directly,
the quantum field theory. In a somewhat old- with the cluster decomposition property as that
fashioned but picturesque manner the quantum one of its properties which expresses the causal
field may be described as a linear superposition independence of well separated experiments.
of creation and annihilation operators. The field This leads (Froissart and Taylor, 1967) to the
with its constituent operators and associated representation of multiparticle or collision
excitation states may be described as some- states by ordered outer products of one-
what analogous to an idealized piano con- particle states, such that these are given by the
structed with an infinitude of key-hammers standard creation operator formalism of the
(creation operators) and individual pedals (de- field theory.
struction operators) associated with each possi- If in addition one more assumption is made:
ble tone (excitation state); this tone can be ex- asymptotic completeness-that there are no ad-
cited to various degrees (corresponding to the ditional states besides those that are superposi-
number of particles in that state). All nonempty tions of collision states-the quantum field
states are produced by the creation operators, theory also derives the unitarity, SS+ =S+S = I,
acting on the vacuum state; the latter cor- and certain analyticity properties of the colli-
responds to the totally quiescent piano. In this sion matrix. The unitarity expresses the fact
formalism, in the simple case of noninteraction that the total probability summed overall pro-
the basic states are n-particle states, n having all cesses is unity-the conservation of probability.
possible values. A specific n-particle state Unitarity and analyticity have to be assumed ad
In I , n2 , ... > == I.(! > is characterized by the set hoc in the S-matrix approach, as the price of
of occupation numbers nl, n2, ... (nl + n2 + avoiding the locality assumption of the field
. . . =n) specifying the numbers of particles theory .
having quantum numbers ql, q2, .... The state The dependent variables of which the S-
10, 0, ... > = Ivac> is called the vacuum state, matrix elements are functions are prescribed by
and assumption (IC) excluding zero mass par- the "relativity" or "Lorentz-Poincare invari-
ticles implies that there is a mass-gap above the ance" of the theory (IA). This requires that the
vacuum state. laws of nature be invariant under spacetime-
interval preserving transformations. Here it
In the quantum field theory the assumption means that any S-matrix element is a scalar un-
that corresponds to causal independence of well der these transformations, or that the amplitude
separated regions is that of locality, i.e., that combinations corresponding to interacting clus-
field operators at spacelike-separated points ters are functions of spacetime-invariant vari-
commute. This, together with the assumption ables only, i.e., scalar products of the four-
(IC) that zero mass particles do not occur, suf- momenta of the interacting particles (or their
fices to derive the existence and uniqueness of sums and differences).
the set of stationary collision states in which (liD) This assumption is that of analyticity of
arbitrary numbers of beams of stable particles the probability amplitudes as functions of the
collide and produce final stable particles as in- invariant variables. As we have remarked, it is
teraction products. These collision states define
derivable-in a complicated way-from locality
a unique evolution-governing collision matrix in the field theory, but it has to be introduced
(S-matrix) whose matrix elements are the prob- ad hoc in the sparser short-circuiting S-matrix
ability amplitudes for transitions between the approach. The assumption is sometimes called
asymptotic initial and final states at t =- 0 0 and "causality," but it is probably more restrictive
t = +co (assumption (IB) tells us that we can mathematically than what is implied by the
distinguish between these "in" and "out" causality concept.
states), The collision matrix has the "cluster (III) leads to a condition on the statistics as-
decomposition" or "connectedness" property; sociated with a given particle type. Identical
i.e., when particles interact the interaction multiparticle states, i.e., states containing two
(amplitude) is a sum of subinteractions (sub- or more particles of the same kind, occur in
amplitudes) by clusters, of particles two at nature only with symmetry or antisymmetry
a time, three at a time, etc., each cluster ac- (of the mathematical function describing the
companied by a remaining set of noninteracting multiparticle state) under interchange in any
particles. The contribution to the S-matrix sub- pair. This symmetry difference gives a major
amplitude for each noninteracting particle classification of all particle types; this is usually
involves a Dirac o-function expressing the con- described as a classification according to ·"sta-
stancy of momentum and energy of that tistics" or "social behavior," since those with
particle, whereas the interacting clusters con- interchange symmetry-called bosons-tend to
tribute combinations of amplitudes which are aggregate in the same momentum state, whereas
non-O-functions with momentum-energy vari- those with antisymmetry-called fermions-
ables as arguments. tend to exclude each other from the same state.
In the S-matrix approach, which seeks to cir- From these general ingredients, either in the
cumvent the assumption of a local field, the field theory version (Streater and Wightman,
ANTIPARTICLES 74

1964; Jost, 1965), or in the stripped-downS- lying within the mass shell section-sufficient
matrix version subject to a subtle proviso con- condition) the argument also demonstrates the
cerning analytic continuations (Olive, 1964; Lu property known as "crossing," i.e., that the
and Olive, 1966; Froissart and Taylor, 1967), amplitude for a process in which a given initial
one can derive a number of remarkable general particle disappears without issue is the same
results: function as for a process, otherwise identical,
1. Hermitian analyticity. Hermitian conju- except that the corresponding antiparticle ap-
gate amplitudes are associated with the same pears without antecedent:
analytic functions. Ampl (A + ... + N + J -+ A' + ... + N)
2. Existence of antiparticles. The argument
by traditional field theory is detailed later. The ,..", Ampl (A + ... +N-+] +A' + ... N').
argument as presented by the S-matrix protago-
nists is interesting and worth sketching. It de- 4. CPT or Conjugation Theorem. In the S-
pends on matching singularity structures of dif- matrix approach to the CPT theorem, similar
ferent subamplitudes when the cluster-structure considerations of residues of single particle
is inserted in the unitarity relation SS* = 1. poles are used as in discussing the crossing
This leads, e.g., to a relation between the am- theorem. The proviso, made in all the S-matrix
plitude A44 for the complete cluster interaction arguments, concerning existence of "right
paths" joining particle and antiparticle poles
A+B+C+D-+E+F+G+H are related to the two basic facts used in the
field theory proofs of the conjugation theorem:
and the amplitudes A 23, A 32 for the sequential
interactions a) vacuum expectation values of products of
field operators are boundary values of
A +B-+E+F+J C + D + J -+ G + H. analytic functions;
The bubble diagrams and the equation below b) the spacetime inversion x -+ - x can be
express this relation connected with the identity transforma-
tion through complex Lorentz transfor-
mations.
A
B
[1. E
F With the help of these two facts one can prove

C
~ 1:) G
the existence of an antiunitary e, acting on
fields, which has the properties of a symmetry.
For example, for a scalar field <I>(x) the two
p H basic facts a) and b) lead to the identities (1/10 is
the vacuum state):
( q 2-2m) A *44
1 q'=m2 = A*A*
23 32·
[1/10, <I>(xd ••• <I>(xn) 1/101
Left and right sides of this equation are analytic
functions of their variables and one can con- = [1/10, <I>(-x n )'" <I>(-xI)1/Iol
tinue the equati~n analytically to a different
part of the physical region for A 44 . In the new for the vacuum expectation values of the prod-
part of the physical region the continued pole ucts of field operators at any number of space-
represents the two successive interactions time points XI, ••• Xn. Because of the reversal
of order of operators, these identities are
equivalent to the existence of an antiunitary e
A
C) E
satisfying

/r
B F
e<l>(x)8 -I =<1>(- x).
C
D
(:> G
H
But then it follows that e maps stationary col-
lision in-states onto stationary out-states of the
C+D-+G+H+] A+B+]-+E+F corresponding antiparticles with the same mo-
menta and opposite spin projection.
The new particle] has the same mass as J but
opposite internal charges (since they obey a rule e1/lin(PI ... Pk) = 1/Iout(PI ' ... Pk').
of additivity). Thus the antiparticle principle is The symbol Pi represents the momentum Pi and
proved and becomes the antiparticle theorem
(provided the analytic continuation brings us spin of the jth particle of the colliding beams,
back to the "right side" of the physical region and P;' stands for the same momentum but
singularity of A44 *). opposite spin projection for the antiparticle. As
3. Crossing. Since, under analytic continua- an immediate consequence, we have the equation
tion, residues also continue (provided there between the S-matrix element for the particle
exists a path of analytic continuation in the reaction PI ... Pk -+ PI ... Pm and the S-matrix
complete amplitude joining the two points and element for the antipluticlereaction
75 ANTIPARTICLES

I I , ,
!!.I ... ~ ~ PI ... Pk : numerical functions of space and time. The
quantum version of the theory requires that
[IPout(PI," . !!.m), IPin(PI . " Pk)] the fields be operators. Also, the theory, neces-
sarily a many-particle theory, is best described
=[8IPout(f1I' .. !!.m), 8IPin(Pl ... Pk)] at first for the noninteracting case when we can
take the basic states to be n -particle states,
= [IPin(!!.I'··· !!.m), IPout(PI"" Pk')] characterized by the set of occupation numbers
= [IPout(PI'" . Pk'), IPin(E/'" !!.m')] nl, nz,'" (nl + nz + ... =n) specifying the
numbers of particles having quantum numbers
ql, qz, q3, ... corresponding to a complete set
5. Spin-statistics theorem. It is found in na- of commuting observables (C.S.C.O.): Ql, Qz,
tur~, so far without any demonstrated excep-
tion, that bosons have integral spin and fermions
half-integral spin. This is the spin-statistics rela-
°
Q3, .... The state 10, 0, 0, 0, ... > with all
nj = is the vacuum state. As in any vector
space, there is also the zero vector 0. The single-
tion. It has been derived in field theory with step destruction operator aOi. for particles, and
various degrees of generality. (See Streater and creation operator bOi. * for antiparticles associated
Wightman, 1964, for a very general proof.) In with the ath quantum number are introduced
the S-matrix approach (Lu and Olive, 1966), in the usual way (see FIELD THEORY) such that
again provided certain plausible features are con- they satisfy the standard commutation (anti-
tained in the singularity structure, the relation commutation) rules if the particles satisfy
between spin and statistics is established. It is Einstein-Bose (Fermi-Dirac) statistics. One then
found that connectedness-together with Lo- introduces the field operator, which is a linear
rentz invariance, unitarity, and analyticity-im- expansion in the aOi. and bOi.* (all a), and which
plies that the connected parts for processes in- formally satisfies certain field equations, the
volving particles with the wrong relation between latter usually being chosen on the basis of a
spin and statistics vanish, so that such particles classical analogy or on the basis of relativistic
are unobservable. covariance and general agreement with experi-
Besides the behavior under e of the S-matrix ment. All of the consequences of interest then
and the type- and state-characterizing observ- follow from the definition of the field operator
ables, we may be interested in the behavior and from that of the C-conjugation operator:
under 8 of fields, and interactions between
fields, which are also operators in quantum field
theory. For all but the weak interactions, an ad- CaOi.C-l ='TIcbOi. CaOi.*C-l =l1c bOi.*
ditional symmetry over and above the universal
8 symmetry holds. This is what is traditionally
CbOi. *C-l ='TIcaOi. * CbOi.C-l =~c aOi..
known as "charge conjugation" and denoted by
the symbol C. As we have already remarked, it Here 'TIc is a phase constant (I 'TIel = 1) which
corresponds to e!PT where !P is the (unitary) is nonmeasurable and can be chosen +1 by con-
space reflection operator, and T is the (anti- vention. If particle and antiparticle are identical
unitary) time-inversion (more properly "motion- then bOi. = aOi., and 'TIc becomes measurable and
reversal") operator. Besides interchanging initial equal to ± 1, two physically distinct cases.
with final states, T reverses the signs of the For a specific theory, particularly if it is given
three space-space components of L (the angular in Lagrangian form in which case each of the
symmetry observables is defined by a symmetry
momentum M) and of the three space compo- operation on the Lagrangian, these observables
~
nents of linear momentum P; !P reverses the can be written as bilinear functionals in the field
operators.
signs of the latter and of the three spacetime As an example, we consider the case of a scalar
~
components of J, the centroidal moments '!II, (spin zero) charged field <I>(x) associated with a
The net result orcombining T with e then is to mass m, and satisfying the Klein-Gordon equa-
undo the antiunitarity, restoring the original tion
order of before and after, and also to undo the
conjugations of all external st~Je observables.
(0+ m 2 )<I>(x) = °
Thus 8!PT == C is a kind of "internal conjugation For the C.S.C.O. we choose the linear momen-
operator. " tum and the (not-independent) energy. Denoting
VII. Charge Conjugation in Lagrangian Quan- the destruction operator for the particle of mo-
tum Field Theory C -conjugation ("charge
conjugation") plays an important role in the mentum k and energy ko = .,Jk2 + m 2 byak,
history and elementary discussions of funda- and the creation operator for th~ corresponding
mental particles. We here review its traditional antiparticle by h*, the field operator is given by
treatment in the Lagrangian form of quantum
field theory-a theory which was originally
inspired by the analysis of oscillations and waves
in classical physics where fields are simply
ANTIPARTICLES 76

An explicit representation for Cis References


Dirac, P. A. M., "Quantized Singularities in the Electro-
C = exp [~i ~ (ak* -Tic bk*)(ak - l1chk)] magnetic Field," hoc. Roy. Soc. London, Ser. A,
133,60 (1931).
Wigner, E. P., "Gruppentheorie" (Germany, Frederick
and for the total energy-momentum operator, Vieweg und Sohn, Braunschweig, 1931, pp. 251-
obtained for instance from the Lagrangian for 254; "Group Theory," (New York, Academic Press,
the Klein-Gordon equation above, we have 1959, pp. 233-236.

fk;
Kadison, R., "Isometries of Operator Algebras," Ann.
Math., 54,325 (1951).
d3 k Wick, G. C., Wightman, A. S., and Wigner, E. P., "The
Pp, = kp,(ak*ak + bk*bk)
Intrinsic Parity of Elementary Particles," Phys. Rev.,
88,101 (1952).
It is now possible by direct calculation to estab- Bargmann, V., 1. Math. Phys., 5,862 (1964).
lish whether Pp, (or any other observable) com- Wolfenstein, L., and Ravenhall, D. G., "Some Conse-
mutes or anticommutes with C. In this way we quences of Invariance under Charge Conjugation,"
can derive for all the free physical fields which Phys. Rev., 88,279 (1952).
have been considered applicable to nature, the Lee, T. D., and Yang, C. N., "Elementary Particles and
results stated earlier in this article concerning Weak Interactions," Office of Technical Services,
preservation and reversal of signs of the funda- Department of Commerce, Washington, D.C., 1957.
mental quantum numbers. Melvin, M. A., "Elementary Particles and Symmetry
It is possible in a similar way to examine the Principles," Rev. Mod. Phys., 32,477 (1960).
validity of invariance under C-conjugation of the Melvin, M. A., "Remarks on the Infinite Spin Case for
interactions between fields. For instance, the Zero Mass Particles," Particles and Nuclei, 1, 34
electromagnetic interaction between the elec- (1970).
tron-field current jp, and the photon-field poten- Jost, R., "TCP-Invarianz der Streumatrix und inter-
tial Ap, is jp,Ap,. The two results found for the polierende Felder," Helv. Phys. Acta, 36,77 (1963).
separate fields by the methods described in the Jost, R., "General Theory of Quantized Fields," Amer.
foregoing. Math. Soc. Publications, 1963.
Olive, D., "Exploration of S-Matrix Theory," Phys.
Cip,C-l =-jll Rev., 135, B 745 (1964).
Streater, R. F. and Wightman, A. S., "PCT Spin and
then guarantee that Statistics and All That," New York, Benjamin, 1964.
Cip,AIlC-l = jllA Il Ekstein, H., "Rigorous Symmetries of Elementary Par-
ticles," Ergebnisse der exakten Naturwissenschaften,
Similarly, C-invariance holds for the accepted 37 (Berlin, Springer, 1965).
forms of strong interactions. Lu, E. Y. C., and Olive, D. I., "Spin and Statistics in
C-invariance does not hold for the V-A (polar S-Matrix Theory," Nuovo Cimento, 45,205 (1966).
vector minus axial vector),four-fermion type of Froissart, M., and Taylor, J. R., "Ouster Decomposi-
weak interaction occurring, e.g., in J3-radioactiv- tion and the Spin-Statistics Theorem in S-Matrix
ity. As we have seen, however, for quite general Theory," Phys. Rev., 153,1636 (1967).
interactions with proper orthochronous space- Wightman, A. S., "What is the Point of So-Called 'Axi-
time symmetry, provided they are local, there is omatic Field Theory?' " Physics Today, p. 53 (Sep-
automatically invariance under the combined tember 1969).
operation CPT:: e (CPT theorem). "Proper or- Armenteros, R., and French, B., "Antinucleon-Nucleon
thochronous" spacetime symmetry of the inter- Interactions," in "High Energy Physics," Vol. IV,
actions means that they are invariant under rota- New York, Academic Press, 1969.
tions, translations in space and time, and shifts Lasinski, Thomas A. et aI., "Review of Particle Prop-
to uniformly moving frames: "locality," in prac- erties," Rev. Med. Phys., 45, Supplement, 51 (Apr
tice, means that the interactions consist of a 1973).
linear combination of products of the fields and Shapiro, I. S., "The Physics of Nuclear-Antinuclear
finite-order derivatives of the fields. Equality of Systems," Physics Reports 35, 129 (1978).
mass and lifetime, antiequality of charge and Perl, M. L., "The Tau Lepton," Annual Reviews of
magnetic moment, and (with certain restrictions) Nuclear and Particle Science, 30, 299 (1980).
conjugacy of decay schemes for a particle and its Berko, S. and Pendleton, H. N. , "Positronium,"
antiparticle all follow from CPT invariance alone. Annual Reviews of Nuclear and Particle Science
And this rests only on proper spacetime sym- 30,543 (1080).
me try , general principles of quantum theory, Fry, J. N., Olive, K. A., and Turner, M. S., "Evolution
and the locality requirement (or, in the S-matrix of Cosmological Baryon Asymmetries I. The Role of
rendition, the requirements of connectedness Gauge Bosons" and " . . . II. The Role of Higgs
and analyticity) . Bosons," Phys. Rev. D 22, 2953 and 2977. These pa-
pers contain a summary and detailed further analysis
M. A. MELVIN of the suggestions made by earlier investigators to
77 ARCHITECTURAL ACOUSTICS

account-on the basis of standard cosmology and the of the strength of a sound source in terms of
SU 5 Gauge Unified Theory of quarks and leptons sound power level, W, in decibels, defined by
(see GRAND UNIFIED THEORIES)-for the: item 1 in Table 2.
a) matter content (10- 9 ± I baryons per photon),
b) matter-antimatter asymmetry (practically no
We note that sound power W is expressed in
naturally occurring and antibaryons) in the universe.
watts.
A listener does not experience the total sound
Cross-references: BOSE-EINSTEIN ST ATISTICS
power from a source, since it radiates in all di-
AND BOSONS ; CONSERVATION LAWS AND SYM·
rections, but rather the proportion that arrives
METRY ; ELECTRON ; ELEMENTARY PARTICLES ;
at his ear. Thus we speak of sound intensity, I,
FERMI-DIRAC STATISTICS AND FERMIONS;
as the sound power passing through a small area
FIELD THEORY; GRAND UNIFIED THEORIES;
at the point of observation. The units are watts
per square centimeter or per square meter. Sound
PARITY; POSITRON; QUANTUM THEORY;
intensity level, L], in decibels, is defined by item
RELA TIVITY.
2 in Table 2.
Until recently there has been no commercially
available instrument for measuring sound inten-
ARCHITECTURAL ACOUSTICS sity so it has usually been determined indirectly
fro~ the mean-square sound pressure, p2, i.e.,
Although the practice of architectural acoustics the time average of the square of the instanta-
involves a wide variety of special problems and neous sound pressure in the acoustic wave. This
techniques, the basic reasons for acoustical de- quantity can be determined readily with a pres-
sign are simply : .. sure microphone. The relation is given by:
(a) to provide a satisfactory acoustical enVI-
ronment, not too noisy and of!en not too quiet, (1)
for people at work and relaxation;
(b) to provide good hearing conditions for where, p is the density of air (or other gas) in
speech; and .. kilograms per cubic meter and c is the speed of
(c) to provide a pleasant acoustical enVIron- sound in air in meters per second. Sound pres-
ment for listening to music. sure level, L p , in decibels, is defined by item 3
Designing for Satisfactory Acoustical Environ- in Table 2.
ment Each acoustical situation must be treated Instruments and techniques for the measure-
as a system comprised of three parts; source, ment of sound pressure levels are widely avail-
transmission path, and listener. When the pr?p- able. 2 Typical measured values of sound power
erties of the source are known, the transmISSIOn levels for many sources are given in references
path can be modified to attenuate the sound to 1 to 3.
suit the listener's needs. Paths. Sound may travel from a source to a
Sources. Noise sources are specified in terms receiver by many paths, some in the air (outdoors
of the total acoustical power radiated in each of or in a room), some through walls, and some
a number (generally between 8 and 25) of con- along solid structures. In the latter two cases,
tiguous frequency bands. 1,2 A standard set of the sound is radiated into the air from the
ten bands is listed in Table 1_ vibrations of the surfaces.
Because of the wide range of sound powers Outdoors, the relation between the sound pres-
encountered in practice, it is customary to ex- sure level measured at distance r from a source
press them in a logarithmic form_ Thus we speak and the sound power level of the source is given
by,
TABLE 1. STANDARD OCTAVE FREQUENCY
BANDS LpO =. Lw + Dlo - 20 loglo r - 11 dB (2)

Lower and Upper Geometric Mean where it is assumed that the source is near a
Frequency Limits Frequency of hard-ground plane at a distance r in meters from
of Each Band (Hz) Each Band (Hz) the receiver (also near the plane) and that the
source produces different sound intensities in
22.1- 44.2 31.5 different directions, 8, as described by a direc-
44.2- 88.5 63 tivity index, Dlo (see reference 2). If the source
88.5- 177 125 radiates sound equally in all directions, then
177 - 354 250 DI = O. In practice, sources generally have direc-
354 - 707 500 tivity indexes in the range of 0 to 12 dB in the
707 - 1,414 1,000 direction of maximum radiation. At large dis-
1,414 - 2,828 2,000 tances r there will be losses in the air itself at
2,828 - 5.655 4,000 freque~cies above 1500 Hz. Also, wind, temper-
5,655 -11,310 8,000 ature gradients, and air turbulence may reduce
11 ,310 -22,620 16,000 or augment the value of Lpo determined from
Eq. (2).
ARCHITECTURAL ACOUSTICS 78

TABLE 2

Decibel Scale Abbreviation Reference Quantity Definition

W
Sound power level LW Wref = 10- 12 watt 10 log 10 - - dB
Wref
I
Sound intensity level Iref = 10-12 watt/m 2 10 loglo- dB
Iref
= 10- 16 watt/cm 2
p2
Sound pressure level Lp Pref =20 micropascal 10 loglo - 2 -
P ref
P
= 0.0002 microbar = 20 loglo-- dB
Pref

In a room, the sound pressure level produced meters and a


is the average absorption coeffi-
by a nondirective source is given by, cient for the whole room (see Reference 2).
Typical values of R are found in Fig. 1.
In practical cases, we are often interested in
Lp=LW+10lOgIO(~+~)dB
41fr R
(3) the sound pressure level produced in a room
separated by a partition (wall) from a room in
which the source is located. We assign a trans-
where r is the distance between the receiver and mission loss, TL, in decibels to the intervening
the microphone and R is the room constant in wall. Curves of transmission loss versus frequency
square meters. R =SCi, where S is the total area for several different building structures are given
of all wall, ceiling and floor surfaces in square in Fig. 2 and reference 4. The equation relating

ROOM VOLUME, V, CUBIC FEET


1000 2000 5000 10000 20000 50000 100000 200000 500000 1000000
I I I I
1000
BOO
,/ ,/ /' I0000

600
V ,/
/ ' 'f"
/'
,/ ,/
U)
a: :/ /.,/ ./ 5000 ,...
400
,...
w ,/
/ v w
w
::;
300
~v
~~
~oo 1-- ~oo
~~
.y :/ ./
~ '/
w
"-
w
w
a: 200 \0 0 ./ ./ a:
<I
\\)~ ~''\ , / 2000 ::>
<I
::> -o.~
0 '0.
~"'() ~oo 0~,Y ,// 0

/' 7,.<) \ "';jP


U)
0
U)
~r;,-l'" ,/ ci
ri 100 o· "'~,
I ,....
,.... BO
/
0. ~ \1>'>I \)~7 ./ 1000
Z
z ,/ ./,/ 0'<) ~ 1",:.,....'/ ,...
<I
,... \
I ,-l',y3-',/
0
<I 60 /~
V
U)
~o
U)
z ./
/
./
0.
/0.+ 0 " 500
z
0
u
0
u
::;
40
'/ ./ :/ 0 9<) "> ::;

/ V
30 0
0
0
a: ./
V ./ ./
-0.;/ 0
a:
20
,/ 200

V ,/ / / V
:/
10 ./ ./
100
30 40 60 80 200 300 400 600 800 2000 3000 4000 6000 8000 20000 30000
100 1000 10000
ROOM VOLUME, V, CUBIC METERS

FIG. 1. Approximate value of room constant R for five categories of rooms ranging from "live" to "dead."
Metric units referenced at bottom and left. English units top and right. The Greek letter Ci indicates the per-
centage of the energy that is removed from a sound wave when it reflects from an "average" surface of the
room. It is called the average sound absorption coefficient.
79 ARCHITECTURAL ACOUSTICS

2' Solid Partition


~*, . . -.tIII"11I

e
60
..
..- -
~, .•' -t<' "",~ \~ .~\
, Ii
0
:'
rw k~ -.-
\ :/ CD
/i,.-- C~
~
~ II 3' Gypsum Tile

"
0
1/ ~\V'
Ptit.lttDNWt 0
V'
I
~I J
.. " _ ..........,, ""'; I
.~_ ~I
0
I-
J
:;.- 5 .'
.'
.-

-V v~/
., .. .. f-" .' "
J

0
"1 I! Staggered Stud Partition
.-rIft IIIWi.ftlC'tl elln~t1

0
:/

o ®
12~ 200 320 500 e)Q 12~0 2000 32;0 ~000
100 160 250 400 630 1000 1600 2~OO 4000 6300

THIRD OCTAVE BAND CENTER FREQUENCY IN CYCLES PER SECOND

FIG. 2. Transmission loss, TL, of six typical building structures.

Lp to the L w of the source* is, ~partment, while wall 1 would not . Finally,
mcrease the room constants by adding sound-
Lp2 =Lw, - TL + 10 loglo (~~) absorbing materials to either or both rooms, e.g.,
carpets and draperies, or acoustical materials on
ce~i~g o,r walls or. both. The sound-absorbing
effiCienCies of vanous materials are given in
+lOlOglo(_l +~)dB (4) references 2, 3, and 5.
Sw R2 It is of great importance to observe that when
where Lp2 is the sound pressure level in the a wall is placed between two rooms or when an
second room produced by a source in the first enclosure is built around a noisy machine, the
room; TL is the transmission loss of the com- structure must be hermetically sealed or if air-
mon wall ; Sw is the area of the wall in square me- flow is necessary, it must be conducted 'in and
ters, and RI and R2 are the room constants for out of the enclosure through suitable silencers.
the first and second rooms respectively in square A hole even as small in diameter as a pencil can
meters. It is assumed that the sound pressure render an otherwise satisfactory wall or enclo-
level is measured near the common wall ' in the sure acoustically inadequate.
center of the room it may be 3 to 5 dB l~wer. In cases of very high noise levels where it is
Techniq';le. It is apparent from Eq. (4) that acoustically impractical to quiet or isolate the
the techmques for noise reduction indoors are machine, then ear plugs, ear cushions, or both,
threefold. First, make every effort to reduce must be worn by personnel exposed to the noise.
the sound power radiated by the source i.e. use Criteria for Design. Acceptable noise levels in
quiet ventilat~ng fans, quiet typewrite;s, quiet rooms of various types in each of eight octave
facto~y ma~hmery, and so forth. Enclose noisy
frequency bands are shown by Fig. 3 and Table
machmery m separate rooms or in ericlosures. 3.
Mo~mt vibrating machinery on resilient pads or
Auditoriums for Speech Three goals must be
spnngs. Second, provide walls with suitably high m.et in the de~ign of auditoriums for speech.
transmission losses between rooms. F or example First, the ambIent noise levels must be suffi-
between adjoining apartments walls 4 to 6 of ciently low (see Table 3). Second speech must
Fig. 2 are usually satisfactory, ~hile walls 1 to 3 be loud enough in all parts of the' room so that
are not. On the other hand, walls 2 and 3 would faint syllables can be heard in the presence of
nor~al audience noise. This second goal is
be satisfactory between rooms of the same
achIeved in small auditoriums (under about 500
seats) by proper shaping of the front part of the
hall so that the speaker's voice is directed uni-
*When the only sound path between the two rooms formly to all parts of the hall. In large halls
is through the common wall. (over 500 seats) , electronic amplification of
ARCHITECTURAL ACOUSTICS 80

... ;1971 PREFERRED NOISE CRITERIA


PNC - CUR VES
~ 70~~~~~~~~~__~__________~
i

.....
FIG. 4. Satisfactory ceiling shape for a speech
auditorium with less than 500 seats .
...>
..J

...
.
~.o~~~~~~~~~~~__~~~
;;0
(That is to say, a loud sound should take about
'"'" 1.5 seconds to die down to inaudibility after its
~ ~~--~~~~~C-~~~~~~_~
~~~ source is cut off abruptly. This quantity, called
o
z mid-frequency reverberation time, is measured
~PNC·~
with full audience present at 500 to 1000 Hz
;;0
o
~ lO~--:----~--':""""'--~~::';'~~=--. PNC· 25

..
z
PNC·20
and averaged.) Music of the Classical period
(early Beethoven, Mozart and Haydn) was com-
posed for larger halls with medium reverberation
~ 10 APP,ol.j~le fhr~hOId of - · PNC ·15 times (about 1.8 sec). On the other hand, music
g ~~~':~:~r,~f;~inl-:(~91i:; , of the Romantic period (after 1850), was in
POQe 33, Fi Q. 14
general, composed for fairly large halls with long
"3L~ 63 Il' l~O ~OO 1000 2000 4000 8000 reverberation times (about 2.0 sec). Today, halls
OCTAVE - SAND CENTER F REOUENC IES IN H,
must not only accomodate a musical repertoire
extending over centuries, but often they must
FIG. 3. Preferred Noise Criteria (PNC) curves for seat so large an audience that they become an
various types of building spaces given in Table 3. entirely new type of space in which to perform
Measurements are made with an octave band filter music.
and the readings in each band should not exceed that In the development of the design of a hall, the
shown on the appropriate PNC curve. acoustics dictate the cubic volume and strongly
influence the orientation of every sound-reflect-
ing surface, the interior materials, and even the
seating.
speech is usually necessary. Third, the reverbera- Concert hall and opera house design is com-
tion in the auditorium should be sufficiently plex,6,7 but some guiding principles stand out.
low that speech is distinct. In auditoriums where The seating capacity should be low, below 2200
there is no sound system, this requirement means if possible. The ceiling should have an average
that either the ceiling should have an average height of 45 ft, if there are no balconies, or 55
height of less than 30 ft above the main floor, ft with balconies, measured above the floor be-
assuming that the seats are upholstered and that neath the main floor seats. The hall should be
there are no large floor areas without seats. If narrow, or other means such as suspended panels
the ceiling height is over 30 ft, sound-absorbing should be provided for producing early sound
materials will have to be added to the walls, and reflections at listener's positions. Finishes for the
perhaps to the rear ceiling to control the rever- interior should primarily be plaster on lath. Not
beration. A satisfactory shape of a 500-seat
auditorium for unamplified speech is shown
in Fig. 4.
Auditoriums for Music There appears to be
no single, ideal architectural solution for the
acoustical design of a hall for music. Successful
acoustics have been achieved with rectangular,
fan or wedge, horseshoe, and even asymmetrical,
plans. But though this is true, the many attri-
butes of musical-architectural acoustics are so
closely interrelated that if a hall is to be success-
ful, the architect must solve all requirements
simultaneously.
The music of each era of the past was com-
posed for a different acoustical environment.
Music of the Baroque period (Bach and earlier),
except for organ music, was composed for small
halls with relatively short reverberation times. FIG. 5. Drawings of Symphony Hall, Boston, Mass.
81 ARCHITECTURAL ACOUSTICS

TABLE 3 RECOMMENDED CATEGORY CLASSIFICATION AND SUGGESTED


NOISE CRITERIA RANGE FOR STEADY BACKGROUND NOISE AS HEARD IN
VARIOUS INDOOR FUNCTIONAL ACTIVITY AREAS.

Approximate
Type of Space (and acoustical requirements) PNCCurve* LA, dBA

Concert halls, opera houses, and recital halls 10 to 20 21 to 30


(for listening to faint musical sounds)
Broadcast and recording studios (distant micro- 10 to 20 21 to 30
phone pickup used)
Large auditoriums, large drama theaters, and Not to exceed 20 Not to exceed 30
churches (for excellent listening conditions)
Broadcast, television, and recording studios Not to exceed 25 Not to exceed 34
(close microphone pickup only)
Small auditoriums, small theaters, small Not to exceed 35 Not to exceed 42
churches, music rehearsal rooms, large meeting
and conference rooms (for good listening), or
executive offices and conference rooms for 50
people (no amplification)
Bedrooms, sleeping quarters, hospitals, resi- 25 to 40 34 to 47
dences, apartments, hotels, motels, etc. (for
sleeping, resting, relaxing)
Private or semiprivate offices, small conference 30 to 40 38 to 47
rooms, classrooms, libraries, etc. (for good
listening conditions)
Living rooms and similar spaces in dwellings (for 30 to 40 38 to 47
conversing or listening to radio and TV)
Large offices, reception areas, retail shops and 35 to 45 42 to 52
stores, cafeterias, restaurants, etc. (for moder-
erately good listening conditions)
Lobbies, laboratory work spaces, drafting and 40 to 50 47 to 56
engineering rooms, general secretarial areas
(for fair listening conditions)
Light maintenance shops, office and computer 45 to 55 52 to 61
equipment rooms, kitchens, and laundries (for
moderately fair listening conditions)
Shops, garages, power-plant control rooms, etc. 50 to 60 56 to 66
(for just acceptable speech and telephone com-
munication). Levels above PNC-60 are not
recommended for any office or communica-
tion situation
For work spaces where speech or telephone 60 to 75 66 to 80
communication is not required, but where
there must be no risk of hearing damage
*See Reference 2 for other rating curves.

over 20 percent of them should be wood if the listed above. Its mid-frequency reverberation
strength of the bass tone is to be preserved. Ir- time, with full audience, is 1.8 sec.
regularities on all the surfaces should be provided LEO L. BERANEK
to produce diffusion and blending of the sound.
Above all, avoid echo, noise and tonal distor-
tion. Finally, the orchestra enclosure should References
provide sectional balance in the orchestra and 1. Peterson, A. P. G., and Gross, E. E., Jr., "Handbook
permit the musicians to hear each other. of Noise Measurement," Gen Rad Co., West Con-
Boston Symphony Hall, one of the world's cord, Mass. (Ninth Ed.) 1980.
best-liked concert halls, is rectangular, as shown 2. Beranek, L. L., Ed., "Noise and Vibration Con-
in Fig. 5, and meets the general requirements trol," New York, McGraw-Hill Book Co., 1971.
ARCHITECTURAL ACOUSTICS 82

3. Harris, C. M., Ed., "Handbook of Noise Control," a series of successive approximations which are
New York, McGraw-Hill Book Co. (2nd Ed.) 1979 . still in progress. Initially, the sun, and planets
4. "Performance Data-Architectural Acoustical Mate- were observed against the "fixed stars." From
riais," Acoustical Materials Association, New York, these observations came the first approxima-
N.Y., published annually (A.l.A. No. 39-B). tions of the motions of the solar system by the
5. "Solutions to Noise Control Problems in the Con- laws of dynamics and of the effect of the chang-
struction of Houses, Apartments, Motels, and ing orientation of the earth's axis of rotation
Hotels," Owens-Corning Fiberglas Corp ., Toledo, (precession) upon the positions of the stars.
Ohio, 1963 (A.LA. No. 39-E). Successive repetitions of the observational pro-
6. Beranek, L. L., "Music, Acoustics, and Architec- cess have gradually improved our knowledge of
ture," New York, John Wiley & Sons, Inc., 1962. these and other motions affecting the funda-
7. Furrer, W., "Room and Building Acoustics" (trans- mental planes of the coordinate system, each
lated by E. R. Robinson and P. Lord), London, improvement resulting in an increase in our
Butterworths, 1964. W. Furrer and A. Lauber, knowledge of the positions and motions of the
"Raum- und Bauakustik-Laermabwehr" (3rd Ed.) stars.
Birhauser Verlag, Basel, Switzerland, 1972. Observational programs for the improvement
8. Kinsler, L. E., and Frey, A. R., "Fundamentals of of the celestial coordinate system are long and
Acoustics," New York, John Wiley & Sons, Inc., tedious and must be conducted with meticulous
2nd Ed., 1962. care. They make use of highly developed instru-
9. Beranek, L. L., "Acoustics," New York, McGraw- ments and observing techniques which, in com-
Hill Book Co., 1949. bination with adopted theories of the rotation
of the earth and its motion around the sun,
Cross-references: ACO USTlCS; HEARING ; MUSICAL enable the positions of the equator and equinox
SOUND ; NOISE, ACOUSTICAL; PHYSICAL ACOUS- to be derived anew and the positions of the stars
TICS; RESONANCE; VIBRATION. to be related to them. Each such program is
an independent effort to reconstruct the celestial
coordinate system. Meridian circles have gener-
ally been used for this kind of work. The results
ASTROMETRY of such programs are said to be fundamental
Astrometry deals with the space-time behavior and are usually published in the form of star
of celestial bodies and therefore belongs to the catalogs.
classical field of astronomical studies. It is often From time to time, when sufficient funda-
referred to as fundamental, positional or obser- mentally observed catalogues have accumulated,
vational astronomy. they are combined with similar earlier material
Early astrometric investigations were directed to form a Fundamental Star Catalog. This cata-
mainly toward establishing a suitable frame of log is usually regarded as the best representation
reference for the determination of the complex that may be had of the celestial coordinate
motions of the planets, while the studies of the system at the time of its pUblication; the right
positions and motions of the individual stars as ascensions and declinations of the stars in the
well as the various stellar systems gradually catalog define the system for the equinox and
developed as improved precision of observations epoch chosen for the catalog. The proper mo-
made it possible to discover and observe these tions in combination with the adopted values
motions. of the constant of precession permit the system
The fundamental, and perhaps most difficult, to be referred to equinoxes and equators at other
problem of astrometry is the establishment of epochs.
a reference system against which the motions of The latest and most precise of the fundamen-
the celestial bodies can be measured. tal catalogs is designated the FK4 and was pub-
The principal planes involved in the spherical lished by the Astronomischen Rechen-Instituts,
coordinate systems usually used in astrometry Heidelberg, Germany in 1963. The catalog con-
are the equator, defmed by the rotation of the tains the positions (right ascension and decli-
earth on its axis, and the ecliptic, defined by the nation) and the changes with time (precession
revolution of the earth around the sun. The and proper motion) of 1535 stars. These data
positions of both these planes vary continuously were compiled from nearly 200 star catalogs
in a most complicated manner due to gravita- containing observations over a span of 110
tional forces and couples between earth and the years. Its successor, the FK5, will appear in 1984,
moon, the sun, and the principal planets . Such with data compiled from more than 250 catalogs
motions of the reference planes are reflected in of approximately 3 million observations. Two
the positions of the stars referred to them. other fundamental catalogs that have been ex-
The motions of these planes cannot be derived tensively used are the GC, Albany General Cata-
entirely from theory alone, but must be deduced logue of 33342 Stars and the N30, Catalog of
from observed changes in the positions of the 5268 Stars.
stars which, in turn, are also in motion . This The coordinate system provided by the posi-
complication has forced the construction of the tions and motions of the stars in a fundamental
astronomical coordinate system to proceed by catalog serves as a reference system for the me as-
83 ASTROMETRY

urement of other star positions and proper rived positions have been published. Among the
motions which must be carried out for a variety catalogs of this nature may be mentioned the
of problems originating in the study of stellar AGK2 (Zweiter Katalog der Astronomischen
motions, in geodesy, in the determination of Gesellschaft) and the Yale and Cape photo-
time, in space research and others. graphic catalogs.
With the exception of the GC, which contains The AGK2 was related to the fundamental
all the stars brighter than the 7th magnitude, system represented by the FK3 (Dritter Funda-
fundamental catalogs do not contain a complete mentalkatalog des Berliner Astronomischen
list of all stars down to a certain magnitude as, lahrbuchs) through the use of simultaneous
for example, the survey catalogs do. The proto- visual observations of about 13000 moderately
type of the survey catalogs for star positions is bright stars in making the plate reductions. The
the Bonner Durchmusterung which contains the AGK2 plates were taken at the Bonn and Berge-
positions of 320000 stars to a limiting magni- dorf Observatories and covered the sky in 5° X
tude of 9.5 and north of declination - 2°. Al- 5° overlapping fields from - 2° to the north pole.
though the observations for the catalog were The resulting catalog contains the positions of
made in the middle of the past century, the over 180000 stars for the mean epoch of 1930.
catalog and the charts made from it have been Between 1956 and 1959 a second photographic
an extremely useful tool for astronomers for series of observations of these stars was carried
identification of star fields. The survey was later out at the Bergedorf Observatory. The measured
extended to the south celestial pole by the Bonn, positions on the plates were reduced to the sys-
Cape and Cordoba Observatories. tem of the FK4 by use of the positions of some
Positions of the fainter stars on a fundamental 21000 reference stars observed simultaneously
system are obtained by a close coordination be- through an international cooperative program
tween visual and photographic programs. The involving 12 meridian circles in the northern
positions of a selected number of moderately hemisphere. A comparison of the plate results
bright stars (7th to 9th magnitude) are related at the two epochs gives rather accurate proper
to the fundamental system by meridian circle motions with respect to the fundamental sys-
observations. These stars are then used as a posi- tem for the entire 180000 stars. The majority
tion reference for the photographic observations of these stars are brighter than the 9th magni-
of the fainter stars, thus tying them to the tude. A good many, however, are as faint as the
fundamental system. 11.5 photographic magnitude. The catalog is
An example of this procedure is the large named the AGK3. Except for gaps between +85 0
astrometric project initiated toward the end of to +60°, +50° to +30 0 , and - 50° to - 60°, the
the nmeteenth century and carried out by Yale Zone Catalog covers the sky from +90°
international cooperation. to - 90° declinations, while the Cape catalogs
The fundamental system adopted for this provide a complete coverage from - 30° declina-
undertaking was embodied in the FC (Funda- tion to the south pole. Both series of catalogs
mental-Catalog fur die Zonen-Beobachtungen were taken by zones of declination by use of
am Nordlichen Himmel) developed by Auwers. wide-angle cameras (from 5° X 5° to 10° X 14°)
The visual program, designated the AGK (As- and were reduced to a fundamental system (not
tronomische Gesellschaft Katalog), was carried always the same one) by use of contemporary
out through the collaboration of 12 northern meridian circle observations. The mean epochs
hemisphere observatories and resulted in the of the positions in these catalogs range from
determination of the positions with respect to the early 1930's to the late 1940's. The stars in
the FC of 144128 stars to the limiting magni- these catalogs are similar in magnitude range to
tude of 9 and north of - 2° declination. The those in the AGK2 and AGK3. A catalog espe-
extension of the visual work into the southern cially prepared for geodesy and other computer
skies was gradually carried out by other observ- oriented research based on accurate satellite
atories. The adjunct photographic program orbits is the Smithsonian Astrophysical Obser-
known as the Carte du Ciel or the Astrographic vatory Catalog (SAO). It is derived from selected
Catalogue called for observations down to ap- visual and photographic catalogs, contains posi-
proximately the 11 th magnitude covering the tions and proper motions of 259,000 stars, and
entire sky by 2° X 2° fields. Originating in 1887, covers the entire sky in zones of 10-degrees-wide
the program was completed by 1970 and in- declinations. It has been published in book
volved the participation of 18 different observa- form and as a set of star charts, but like the
tories. The positions in the catalogs are given in other catalogs previously mentioned, it is also
the form of rectangular coordinates as measured available in machine readable form from the
on the plates, but by means of auxiliary tables, Centre de Donnees Stellaires at the Observa-
these coordinates can be translated into right toire de Strasbourg, and from NASA-Greenbelt,
ascension and declination. Each field was photo- Maryland. A program of observing 20000 stars
graphed a second time with a longer exposure in the Southern Hemisphere with meridian
with a limiting magnitude of 14 to be used for circles is now nearing completion. Known as
the purpose of star charts. the SRS (Southern Reference Star) Program, it
Several other catalogs of photographically de- has been carried out through international co-
ASTROMETRY 84

operation with northern observatories (U. S. being made that those objects do not show any
Naval Observatory, Pulkovo Observatory, and systematic rotation with respect to the local
Hamburg-Bergedorf) participating with stations inertial frame of rest.
in Argentina, Chile, and Australia, respectively. A large number of proper motion studies of
The positions and proper motions of these stars galactic clusters has been carried out in order to
in the fundamental system are intended for use establish membership of the individual stars in
in obtaining new positions and proper motions the field. Because of the high internal precision
of some 200000 stars in the Southern Hemi- required for this work, these studies have been
sphere observed photographically. confined primarily to long-focus telescopes, with
An important source for obtaining proper plates taken over time intervals of SO or more
motions of the fainter stars is a combination of years.
early photographic plates with recent ones taken Several surveys of the sky for stars with high
with the same telescope. The proper motions proper motion, largely with the aim of finding
derived in this way are relative proper motions absolutely faint stars, have been carried out over
and require further reductions for transfor- the past several decades. Two such surveys are
mation into absolute proper motions in a funda- one with the 48-inch Schmidt telescope at Mt.
mental system. This procedure has been follow- Palomar and the other with the 13-inch tele-
ed in several extensive programs aimed at solving scope at Lowell Observatory. They cover 80
such problems as determining the solar motion, percent of the sky to a limiting magnitude of
and deriving secular parallaxes and galactic ro- 21 and 17 respectfully. In surveys of this magni-
tation. tude, it is essential that telescopes of not too
Proper motions for tens of thousands of stars large focal length be used to limit the number
have been obtained by this method while radial of plates needed to cover the sky and that the
velocities for a lesser number of stars have been "moving" stars be found by rapid scanning of
determined from the spectroscopic application the plates. These surveys have drastically in-
of the Doppler principle. Besides the proper creased the known number of white-dwarf,
motion and radial velocity, the distance of a star sub-dwarf, and faint red-dwarf stars, which, at
is needed to determine its motion in space. For the present time attract much interest among
stars beyond 100 parsecs from the solar system astronomers.
it becomes increasingly difficult to obtain all An important area within astrometry is the
three factors involved, and often knowledge of determination of the distances of individual
stellar motion is either based on proper motions stars. Because of the extremely small quantities
or radial velocities alone. However, by various to be measured, the ultimate in precision is
statistical devices substantial information about required. The geometric method of measuring
stellar motions has been obtained. distances is based upon the surveyor's principle:
On the basis of these studies, the sun's velocity the object is observed from both ends of a base
has been determined to be about 20 km/sec line. In determining the trigonometric parallax
towards a point in space not far from Vega, of a star, the semimajor axis of the earth's orbit
although the amount and direction of the is used as the base line. Reliable individual dis-
motion varies depending upon the chosen group tances have been measured in this way for sev-
of stars. eral thousands of stars within 30 parsecs of the
As a result of the sun's motion through space, solar system. The majority' of the photographic
the stars show a parallactic or secular shift plates in these determinations were obtained
which can be used to determine their distances. with long-focus refracting type telescopes, and
Because of the individual motions of the stars, the plates measured on manually operated mea-
this method is applicable only to groups of stars suring machines. The discovery of the large
with the assumption that their individual mo- number of intrinsically faint stars in the solar
tions are random. By means of the secular paral- neighborhood with apparent magnitudes beyond
lax method, general ideas of the distances of the practical limits of the refracting type tele-
stars up to 1000 parsecs have been obtained. scope demanded the development of a reflect-
From statistical studies of proper motions and ing-type telescope with greater light gathering
radial velocities, it was found in 1927 that the power, and with a highly stable optical system.
stars in our galaxy are moving in orbits not Such a telescope, named an astrometeric reflec-
greatly inclined to the galactic equator. The tor, has been in operation since 1964. In com-
observations are consistent with the assumption bination with a high precision automatic mea-
that the principal force governing the motions is suring machine, parallaxes have been achieved
gravitational with the center of mass near the with a precision of 2 milliarcseconds, thereby
galactic center. The period of rotation at the obtaining reliable distance determinations to
sun's distance from the center is 2 X 108 years. 1SO parsecs by this method. The importance of
There are, at the present time, two programs stellar distance determination is realized from
in progress which will attempt to establish abso- the fact that the distance of a star must be
lute stellar proper motions using the distant known before its intrinsic luminosity and its
galaxies as a reference frame, the assumption rate of energy generation can be determined.
85 ASTROMETRY

Studies of the stars in the solar neighborhood duced which take advantage of the latest techno-
have revealed that the majority of them are logical developments in automation.
components of double and multiple systems. Within recent years measurements with a
Since the motions of the stars within a system precision of a few milliarcseconds have been
are governed by their mutual gravitational at- achieved in astrometry in both the optical and
traction, it is possible to determine their masses radio spectrum by means of interferometry.
by use of Kepler's third law (see KEPLER'S In the optical spectrum, speckle interferometry
LAWS OF PLANETARY MOTION), whenever their of close binaries has achieved standard errors of
orbital motions and the parallax of the system 5 milliarcseconds in their relative positions,
become known. This is the only direct way while long baseline optical interferometry of
masses of the stars can be determined. the order of 50 meters is capable of very high
Routine observations of the motions in binary resolution of one milliarcsecond. Similar ac-
systems began about 150 years ago. Originally curacies are expected in transcontinental radio
all observations were carried out visually. Al- interferometry. High precision optical and radio
though this method continues to be used for position measurements are currently being car-
close pairs, it has been largely replaced by a ried out of radio sources and their optical coun-
more accurate photographic method for wider terparts to improve the relationship between
pairs. optical and radio reference frames, with the
Various searches for double stars have pro- possibility of establishing an inertial reference
duced some 75000 visual binary systems, but frame from the measurements of the cosmo-
for only a small fraction (approximately 100) of logical distant quasars.
these systems are data available for determining Astrometry from space observations will be a
the individual masses with an accuracy of 30 reality within the next few years. Among these
per cent or better. These masses range from projects can be mentioned the astrometric satel-
about 0.08 solar mass for a star 3000 times less lite by the European Space Agency (ESA),
luminous than the sun to 6 times the sun's mass named Hipparcos. With this satellite, in a geo-
for a star 100 times more luminous than the stationary orbit, it will be possible to observe
sun. Larger masses, as high as 50 to 100 times some 10,000 stars during a 2-3 year mission.
the solar mass, are found among the very close Positions, yearly proper motions, and parallaxes
binaries such as eclipsing and spectroscopic bi- with an accuracy of two milliarcseconds are pro-
naries. Although these objects cannot be re- jected. Similar accuracies are expected in the
solved into individual components, their orbital astrometric programs planned with the Space
motions can be determined from the periodic Telescope of NASA, which is scheduled to be in
variation in light and radial velocity (observed orbit by 1986.
Doppler shift).
Stellar masses smaller than the value of 0.08 K. AA. STRAND
quoted above have been discovered in recent
years by intensive photographic studies of near-
by single stars and components in double stars. References
These studies, representing the ultimate in ac-
curacy in photographic astrometry, have re- Eichhorn, H., "Astronomy of Positions," New York,
F. Ungar Publisher, 1974.
vealed unseen companions of such small masses Mueller, I. I., "Spherical and Practical Astronomy,"
that, according to theoretical estimates, they are New York, F. Ungar Publisher, 1969.
either stars so small that they will never bum
Prochazka, F. V., and Tucker, R. H. (Eds.), "Modern
nuclear fuel or planets of the size of Jupiter. Astrometry," International Astronomical Union Col-
Aside from demands for such utilitarian pur- loquium No. 48, Vienna, Austria, University Observa-
poses as navigation, geodesy and space research, tory Vienna, 1978.
astronomers themselves are making heavy de- Kraus, J. D., "Radio Astronomy," New York, McGraw-
mands for substantial gains in quality and quan- Hill Book Company, 1966.
tity in astrometric observations, extended to Abell, G. 0., "Exploration of the Universe" (3rd Ed.),
fainter and fainter stars. New York, Holt, Rinehart, & Winston, 1975.
The discovery of the intrinsically faint stars in Taff, 1. G., "Computational Spherical Astronomy,"
the solar neighborhood has demanded an ex- New York, John Wiley & Sons, 1981.
tensive parallax program with an entirely new Strand, K. Aa., "Basic Astronomical Data," Chicago,
telescope of special design. University of Chicago Press, 1963.
Positions and space velocities on a large scale
of the individual stars and of stellar systems
within our galaxy are essential to understand its Cross-references: ASTRONAUTICS; ASTROPHYS-
dynamics and evolution as well as the physical ICS; COSMOLOGY; DOPPLER EFFECTS; GRAVI-
properties and evolution of the individual stars TATION; INTERFERENCE AND INTERFEROME·
which populate it. TRY; KEPLER'S LAWS OF PLANETARY MOTION;
To accomplish this, new instrumental and PULSAR; ROTATION-CURVILINEAR MOTION;
analytical techniques are currently being intro- SPACE PHYSICS.
ASTRONAUTICS 86

ASTRONAUTICS * If the earth were rotating with a period of an


hour and a half instead of 24 hours, our centrip-
Future historians may record that the age of etal acceleration at the equator would cause
space flight marked a turning point in modern the effective g to vanish completely, our scales
times. Its physical principles-the laws of mo- would not register, objects would be unsup-
tion of celestial bodies-marked the turning ported, and for all practical purposes they and
point of medieval times. From Copernicus to
Kepler to Galileo to Newton, the Aristotelian we would be weightless.
myth of a man-oriented universe succumbed to Formally, we could state that any accelerating
the conception of a detached mechanically frame produces a local gravitational field gacc
oriented universe, operating through laws which that is equal and opposite to the acceleration.
were a synthesis of the new knowledge gained Thus, a rotating frame generates a centrifugal
in the formerly separate domains of terrestrial gacc opposing the centripetal acceleration. We
and celestial mechanics. Mankind never quite have at any point
recovered from that detachment.
Weight and Weightlessness In Newtonian me- leff = I + lace (1)
chanics, weight is understood to mean the force where I is the field produced by matter alone
that an object exerts upon its support. This (e.g., the earth). By identical reasoning, an ob-
would depend on two factors: the strength of ject in orbit, whether falling freely in a curved
gravity at the object's location (things weigh or in a straight path, will carry a reference frame
less on the moon) and, as Newton called it, the in which geff is zero, for its acceleration will
quantity of matter in a body (its "mass"). At always exactly equal the local g by the defini-
any given location, where gravity is fixed, mass tion of the phrase, "freely falling."
can be measured relative to a standard by noting This concept was placed on a firm footing by
the extension of a spring to which it and the Einstein who maintained that Eq. (1) is reason-
standard are successively attached. Alterna- able not only in mechanics but in all areas of
tively, the unknown and standard may be hung physics including electromagnetic phenomena.
at opposite ends of a rod and the balance point We arrive at the inevitable conclusion that we
noted. However, by an entirely separate experi- cannot distinguish by any physical experiment
ment, mass can also be measured by noting the between an apparent g accountable to an ac-
resistance of the object to a fixed force applied celerating frame and a "real" I derived from a
horizontally on a frictionless table. The mea- local accumulation of mass. This central postu-
sured acceleration provides the required basis of late of the General Theory of Relativity also
comparison with the standard. Needless to say, unified the two separate conceptions of mass.
all objects measure identical accelerations when An object resting on a platform that is ac-
freely falling in the vertical force of gravity. celerating toward it will resist the acceleration
This merely means that, unlike the arbitrary in an amount depending on its inertia. It presses
force we apply horizontally in the experiment against the platform with a force equal to that
above, gravity has the property of adjusting it- it would have if placed at rest on the surface of
self in just the right amount, raising or lowering a planet with local field equal and opposite to
its applied force, to maintain the acceleration the acceleration of the frame.
constant. General Principles of Central Force Motion
It was well known that objects appear to in- The gravitational force between point masses is
crease or decrease their weight (alter the exten- inverse square, written
sion of the spring) if the reference frame in /
which the measurement takes place accelerates 1m m
mg=- - - r (2)
A

up or down. As gravity did not really change, r2


however, most people were inclined to draw a
distinction between weight defined as mg, where where the center of coordinates from which the
m is the mass and g is the local gravity field, r
unit vector is described lies in m /, one of the
and the appearance of weight, the force of an masses. Thus, the force on m is directed -I,
obfect on its support as measured by the spring's toward m / and is proportional to 1jr2 with 1 the
extension. One way to avoid the difficulty has constant of proportionality. The quantity g is
been to speak of an effective g, which takes into the force on m divided by m (or normalized
consideration the frame's acceleration. For ex- force) for which the name "gravitational field
ample, at the equator of the earth, we measure, of m /" is reserved. Of course, if m were in the
say by timing the oscillation of a pendulum, the field of a collection of mass points, or even in a
effective g, some 0.34 per cent less than the g continuous distribution of mass, the summated
produced by the mass of earth beneath our feet. or integrated I at the location of m would no
longer be an inverse square function with re-
spect to any coordinate center. However, in one
*John Wiley & Sons, Publishers, has kindly permitted special case, the inverse square functional form
extensive use of copyright material from "Physical would be preserved: if the source mass were
Principles of Astronautics" by Arthur I. Berman, who symmetrically distributed about the coordinate
is also the author of this article. center. This would be the case if the source
87 ASTRONAUTICS

were a spherical shell or solid sphere, of density b


constant or a function only of r. The sun and
earth can be regarded, at least to a first ap-
proximation, as sources of inverse square gravi-
ta tional fields.
There are some important general statements
we can make about the motion of an object
placed with arbitrary position and velocity in a
centrally directed force field, i.e., a field such
as the one described, which depends only on ~t===~=s'==========~
distance from a central point (regardless of
whether or not the dependence is inverse
square). As the force has only a radial and no
angular components, it cannot exert a torque
about an axis through the center. This means
that the initial angular momentum is conserved.
Now angular momentum is a vector quantity
and therefore is conserved both in direction and
magnitude. It is defined by r X p, where r is the FIG. 1. Kepler's second law. The sector area S
position vector to the mass of momentum p. swept out is proportional to the time required for the
The direction of the angular momentum vector planet to move from a to b. Thus, if tcd = tab, then
is thus perpendicular to the plane containing r S'=S.
and p. As this direction is permanent, so also
must be the plane. The planar motion of the
object can be expressed in polar coordinates, so
rr
that by writing r = and p = m(rr + r¢~), we By applying Newton's law of motion F = ma,
find the specific angular momentum (angular a relationship between a, the second derivative
momentum per unit mass) called h, to be of the position vector, expressed in polar form,
and F 1m or g, as given by Eq. (2), leads to the
(3) familiar conic solution for the trajectory of an
object in an inverse square field.
This too then must be a constant of the motion.
I 'Ym'
Consider now the rate at which area is swept -;: = h2 + A cos (cf> - cf>o) ( 6)
out by the radius vector, dSldt. We recall from
analytic geometry that dS = }r2 cf>. Thus where A and cf>o are constants. A rotation of
dS h axis will eliminate cf>o, there by aligning the co-
(4) ordinate axis with the conic's major axis. Also,
dt 2 by expressing the general conic, an ellipse or
so that this is a constant of the motion as well. hyperbola, in terms of the usual parameters of
On integration, we conclude that the size of a semimajor axis a and eccentricity €, we can
sector that is swept out is proportional to the relate the geometric parameters to the gravita-
time required to sweep it out. In the case of a tional-dynamical constants, viz:
closed orbit, the total area S would then be re-
lated to the specific angular momentum as (7)
and
S=hT (5) I '
2 - ='Ym2 (l + € cos cf» (8)
r h
This sector area-time relationship is Kepler's
second law of planetary motion which was in- Note that by substituting Eq. (7) into Eq. (5)
duced from Tycho Brahe's observation of Mars and expressing the area of an ellipse as S =
without prior knowledge of gravity and its cen- 1Ta 2 (1 - €2 )112 we arrive at Kepler's third law,
tral character.
The Laws of Kepler Kepler stated two other _ 21T 3/2
laws of planetary motion: The orbits of all the T - ('Y m ')112 a (9)
planets about the sun are ellipses (a radical
departure from the circles of Copernicus), and The energy of the orbiting object can be cal-
the squares of their periods are proportional to culated with ease by evaluating it at an extremal
the cubes of their mean distance from the sun, point, say the nearest point to the gravitational
this mean being the semimajor axis of their source, called pericenter or perifocus. As the
ellipses. The third law pertained to the one energy is constant, it is immaterial where the
characteristic common to all the planets: the calculation is made. Here the velocity has only
sun. Taken together, the three laws led Newton an angular component so that the kinetic energy
to the concept of gravitational force and its for a unit orbiting mass is! V2 = !r2cf>2. The po-
inverse-square form. tential energy at pericenter is - ('Ym' /rpe) where
ASTRONAUTICS 88

rpe is the distance of the unit mass from m', the


focal point. Here rp = 0 so that by Eq. (8), E=2
1 rm'
- = -2-(1 + e) (10)
rpe h
On substituting Eq. (7), we find the total kinetic
and potential energy to be
E =1
(11)

Our conclusion: All objects in orbit with the


same major axes have identical periods and
identical energies per unit mass. Knowledge of
E is invaluable in determining an object's speed
when its distance from the source is known, and
vice versa.
In the event that the orbiting object's mass is E = 0.5 E =0
not negligibly small compared with that of the
gravitational source, one must take note that
the combined center of mass, from which the
acceleration is described, no longer may be as-
sumed to lie in the center of the gravitational
source. This complicates our equations some-
what, for the accelerating force still is expressed
relative to the center of the source (if spherical).
The adjustment that results, when center of
mass coordinates are transformed to relative co-

/
ordinates in the expression for acceleration, re-
quires our equations to take the form r(m' + m)
wherever formerly rm' appeared.

FIG. 3. Orbits of differing eccentricities and major


axes which pass through a common point. Higher
speeds correspond to higher energies and longer major
axes. For an ellipse the eccentricity € is the ratio of
the distance to the focus (gravitational center) from
the point of symmetry of the curve to the semimajor
axis of the ellipse. For a hyperbola it is the ratio of
the distance to the focus from the point of symmetry
of the hyperbolic pair to the semimajor axis.

Disturbances in the Central Field The earth,


of course, is spherical to only a first approxima-
tion. More accurately, it is an ellipsoid of revo-
lution about a minor axis-an oblate spheroid.
Still more accurately, it appears to be slightly
pear-shaped and, in addition, its figure is dis-
FIG. 2. Conics formed by the intersection of planes torted by continuous local variations. The sphe-
inclined at various angles with the axis and sides of a roidal figure, nevertheless, accounts for nearly
cone. Ellipses and hyperbolas are general curves in all the anomalous effects on satellite orbits. For
astronautics, i.e., general solutions of the two-body one thing, the gravitational force on the satellite
problem, while the circle and parabola are special is no longer centrally directed; the excessive
cases. An object in circular orbit is at constant distance mass in the equatorial plane produces a force
from the gravitational source and therefore has con- on the satellite directed out of its orbital plane.
stant potential energy; it moves at constant speed and The resultant torque causes the direction of the
therefore has constant kinetic energy. An object in a angular momentum vector to change; i.e., the
parabola is in a transition curve between an ellipse and plane containing the satellite's ellipse turns. The
hyperbola; its major axis, therefore, is infinite and its plane turns continuously about the polar axis
total energy zero. maintaining its angle with the axis and with the
89 ASTRONAUTICS

(
----_J S
(a) (b)
FIG. 4. The orbit of an earth satellite. The earth's equatorial bulge causes retrograde motion of the points of
intersection nand n' of the orbit and equatorial plane. This can alternatively be interpreted as a retrograde
motion, about the north-south axis, of the plane containing the closed orbit. The plane moves in the direction
shown by the arrow in (b), maintaining a constant angle with the axis.

equatorial plane constant. The turning rate is the speed at the astronaut's perigee in (a) can
greatest for low orbits and small angles of in- be written
clination with the equator. For polar satellites,
the plane remains fixed. A separate effect of
this equatorial bulge perturbative force is the r m-' ( 2 -ro)
u2 = - - (12)
slow turning of the ellipse's major axis within ro a
the orbital plane. This effect vanishes at an in-
clination of 63.4°; the major axis turns back- where ro is the satellite's orbital radius and a
ward at inclinations above this angle and for- the astronaut's semimajor axis. If
ward below.
Astronautical Examples From the preceding r m'
=-- ut (13)
material some interesting problems in astronau-
tics can be solved readily. Suppose (Fig. 5) an ro
astronaut releases himself at a speed ou of where U is the satellite's circular speed, found
10m/sec from a satellite in a small circular by letting c
a = ro in (12). We have all the infor-
orbit about the earth. What would be his veloc- mation needed to find the distance of separation
ity and that of his satellite when at their maxi- between ellipse and circle as
mum separation if his initial thrust is (a) the
same as that of the satellite in its orbit, (b) 4roou
toward the earth's center, and (c) normal to 2a- 2ro = : - -
the orbital plane? Also , what would be the Uc
maximum separation between astronaut and
satellite and where would it occur? 4(6.38) (10 3 km) (10 m/sec)
Writing E in terms of both the kinetic-poten- 7.9 (10 3 ) m/sec
tial energy sum at any point in orbit, and its
constant value given by Eq. (11), we find that = 32 km. (14)

w ~ ~

FIG. 5. Orbits of an astronaut released from a circularly orbiting satellite


(0) in the direction of satellite motion, (b) toward the center of attraction, and
(c) normal to the orbital plane. In the last case the astronaut moves in a new
orbital plane inclined to that of the satellite by angle i.
ASTRONAUTICS 90

By applying angular momentum conservation- Specific impulse is one performance charac-


that is, the apogee speed times the focal distance teristic which applies to the propellant's ability
at apogee equals the perigee speed times the to be ejected at high speed regardless of the
focal distance there (or simply Eq. (7»-wefind weight penalty required to do this. It is the im-
that the astronaut will have a relative speed at pulse produced per mass of propellant ejected,
apogee of 30 m/sec. or m8v/8m e , or, by Eq. (15), simply Ve. In engi-
In (b), the astronaut will orbit in an ellipse in neenng usage, it is impulse per weight ot pro-
the plane of the satellite's circle that intersects pellant ejected, or ve/ge where ge is the acceler-
the circle twice. As his injection velocity is ation of gravity at the earth's surface. Its units
radial, his specific angular momentum is that of are seconds, and it can be interpreted as the
the satellite. This, together with Eq. (12), where thrust produced by a rocket per weight of pro-
now v at launch is not the linear sum of 8v and pellant ejected per second. By itself, thrust is of
Vc but the square root of the sum of the squares little importance unless it is sustained for a sig-
of these terms, leads directly to the orbital nificant time by a large backup of propellant
parameters a and e. We find the latter as 8v/vc tankage. It is here that the mass-ratio term in
and the orbital separation at apogee as 8 km Eq. (16) would play an important role in any
with the speed there relative to the satellite as evaluation of a rocket's true performance.
10m/sec, the same as at launch. It is important to realize how sensitive is the
The astronaut moves out of the satellite's energy imparted to a moving vehicle to a given
orbital plane in (c) and into a new plane inclined boost in speed. If an object, say a rocket, moving
at angle i = 8v/v c . The maximum separation at some speed v gets a boost in the same direc-
between orbits now becomes iro = 8 km. But tion in an amount 8v, then its increase in kinetic
the astronaut never departs more than 20 m from energy is v8v + t (8V)2. Thus, if a rocket re-
his initial distance from earth, so his speed is ceives a small boost in speed, say from turning
always close to that of the satellite. on its engine for a short time, the rocket can
Rocket Propulsion A rocket operates by the thereby receive a very large increase in kinetic
simple principle that if a small part of its total energy provided it was already moving fast
mass is ejected at high speed, the remaining before getting the boost.
mass will receive an impulse driving it in the op- This principle is made use of in the firing of a
posite direction at a moderate speed. As 8 me, multistage rocket. Suppose, for example, that
the. propellant, leaves at speed Ve with respect we have a two-stage rocket of similar compo-
to the rocket, the remaining rocket mass m re- nents: The "burnout" speed of each stage is the
ceives a boost in speed 8 v such that same, lifting each to the same apogee. We com-
pare the effect of two firing programs: In the
8meVe =m 8v (15) first case, firing of the upper stage is delayed
until its apogee as booster payload is reached,
If additional equal propellant mass is ejected at but in the second case its firing takes place
the same speed, the boost in rocket speed is close to the ground immediately following
slightly greater than before as the rocket mass booster burnout. Assuming that the force of
has been slightly depleted by the prior ejection. gravity is constant at all heights, air drag is
Indeed, if the residual rocket mass eventually negligible, and the burnout time is small com-
were minuscule, its boost in speed could reach pared with the flight time, the apogee of the
an enormous value. The integrated effect of payload of the second case will be twice as high
these nonlinear boosts is found as as in the first.
Transfer Orbits If one wishes to leave one
Vt - Vo = Ve loge mo (16) orbit and enter another by rocket, an optimum
mt path is generally chosen to minimize the total
propellant required. Nevertheless, this should
where Vo and mo are the rocket speed and mass not be done at the expense of unduly long
at some arbitrary initial time and Vt and mt are flight times, complicated guidance equipment,
the same quantities at some time t later. or high acceleration stresses. These would re-
From these simple considerations, it is appar- quire unprofitable weight expenditures which
ent that the highest rocket velocities are at- would offset the frugality in propellant
tained if we could increase the propellant speed tankage.
as well as the mass ratio molmt. The mass ratio Let us examine a simple but recurring exam-
can be maximized by obvious methods such as ple of a transfer problem, that of leaving a refer-
choosing a high-density propellant which cuts ence frame or space platform in one circular
the tankage requirement or avoiding unneces- orbit and entering another larger one concentric
sarily complicated apparatus for ejecting pro- with the first. If the transfer path were radial or
pellant at high speed. A nuclear rocket, for ex- near radial (a so-called ballistic orbit) then one
ample, may perform well in its ability to eject would have to launch at a large angle to the
propellant an order of magnitude higher in direction of motion of the frame, accomplished
velocity than conventional chemical rockets; only by a velocity component opposed to the
nevertheless, the penalty required in reactor frame's motion. On reaching the outer platform,
weight and shielding severely limits its a soft landing can be made only by a substantial
effectiveness. rockef velocity boost tangent to the orbit.
91 ASTRONAUTICS

Clearly, the total propellant expenditure would


be far greater than one alternative of launching
the rocket in the direction of motion of the
\
\ first frame with just sufficient speed to reach
\ \ the outer circle, timed so that the outer frame
\ \
will meet the spacecraft. The transfer orbit will
\
\
\\ be an ellipse contangent with both circles. The
\ \ outer frame will be moving much faster of
\ \ course at the contact point as the major axis of
\ I
\ I its orbit is much greater [see Eq. (I 1)], but the
\, / difference in speed is not nearly as pronounced
,_/ J as for the ballistic transfer case. A differential
speed increment at contact completes the
maneuver.
The return trip, from an outer to inner circle,
is made by following the second half of this co-
tangent ellipse, named the Hohmann transfer
orbit after the German engineer who discovered
its optimal property with regard to propellant
expenditure. In the return case, the spacecraft
is launched in opposition to the outer plat-
FIG. 6. Four launch trajectories into a satellite orbit form's motion. This removes kinetic energy and
about a planet. (a) If that planet has an atmosphere, forces the spacecraft to fall in closer to the at-
the rocket may ascend in a "synergic" trajectory from tractive center in order to make cotangent
the planetary surface to the final orbit, i.e., it cuts contact with the inner circle. The total propel-
through the denser portions in an initially vertical lant expenditure from the outer to the inner
path and gradually bends over into a horizontal path platform is the same as for the original journey.
during burnout. (b) If there is no atmosphere it may An interesting question arises if one wishes to
ascend from the ground in a ballistic ellipse. This same leave a frame for an outer orbit when it initially
ascent path may be chosen if the departure is from a is in an elliptical orbit rather than a circle.
parking orbit or "space platform" close to ground Should we depart from apocenter where we are
level. A far better choice would be (c) the Hohmann furthest from the gravitational source and
ellipse, with pericenter at the planet's surface and closest to our destination? Or should we depart
apocenter at the satellite orbit. Burnout time is assumed instead from some other point in the ellipse?
short in both this and the ballistic case. (d) A vehicle
such as an ion rocket, which can sustain a micro thrust
for a very long time, cannot be launched from the
ground but only from a parking orbit. It will spiral
out to the desired altitude with few or many turns
about the planet, depending on the magnitude of the
thrust relative to that of the gravitational force.

FIG. 8. Alternative ways to escape from a circular


orbit to "infinity." Left: The rocket simply blasts off
into a parabola or hyperbola. Right: The rocket
reverses thrust to fall into a close-in ellipse and then
at the pericenter or near apsis of the ellipse (perigee if
orbiting about the earth) it blasts off into a parabola
FIG. 7. Reaching the center of attraction from a or hyperbola. Despite the loss of circular orbital energy
circular orbit by a double-thrust maneuver. The rocket in maneuver (b), it is preferred to (a) provided the
enters an eccentric ellipse by a forward thrust, and at energy remaining in the hyperbola after completely
apocenter reverses thrust to cancel the small orbital leaving the gravitational field exceeds the escape speed
speed there, enabling it to fall radially into the source. at the radius of the circular orbit. (If the arrows were
By this maneuver radioactive waste could be brought reversed the figures would represent alternative maneu-
to the sun at up to 40% saving in propellant. vers for entering a circular orbit from infinity.)
ASTRONAUTICS 92

I nilial orbilal motion


\ \
\ \
\ \
\ \
\ \
\
:1 \
\
I F
\
From~
Sun ===== ,
\
I
, \
I
I I
FIG. 9. Atmospheric-entry braking "ellipses," a I
trajectory controlled by gravity and the reverse force I
of atmospheric drag. Drag is concentrated at the peri- 11
center, where air density is high. This acts to decrease I
the major axis and eccentricity of the ellipse, so that I From ~
Sun ~
the orbit gradually approaches a circle. This analysis is I
somewhat idealized, however, and in an actual case the I
pericenter lowers slightly and the major axis turns in
each successive cycle. (a) (b)

Paradoxically, our best launch pOInt is at peri-


center, for here the largest possible amount of FIG. 10. Orientation of a solar sail for a voyage
energy will be transferred to the spacecraft for from earth to (a) a superior, and (b) an inferior planet.
a given expenditure of propellant. This was F is the vector sum of the forces due to the incident
noted in the previous section. A given thrust and reflected beams. When F is aligned as in (b) it con-
applied for a given time interval will do more stitutes a drag and causes the sail to spiral into the sun.
work on the spacecraft when it is moving fast,
as at pericenter, for it covers a greater distance
during the interval. This advantage offsets the glOn may take a shorter time in a spiral trajec-
undesirability of being at a lower potential tory. In a long Hohmann ellipse, for example,
energy point at pericenter. most of the journey is made at very low speed.
Powered Trajectories In the usual operation In a powered spiral, on the other hand, the
of a solid- or liquid-propelled rocket, the pro- spacecraft could be made to move fast, for the
pellant is depleted in a time negligibly small thrust, though small, is integrated over many
compared with the total flight time. The trajec- months.
tory analysis may generally be considered as The spiral concept is ideal for rockets where
that of a free orbit subject to burnout initial very high ejection velocities are feasible by
conditions as in the discussion above. If, how- using electromagnetic or electrostatic particle
ever, the propellant ejection is sustained over accelerators, but only at the expense of a
long periods, as in an ion-propelled rocket, the low propellant flow rate and relatively heavy
trajectory analysis is necessarily complicated, power-generating equipment. However, the pro-
for, in addition to the varying gravitational pellant reserve, and thrust, could then last the
force, the vehicle, of slowly diminishing mass, is required long time. Such an ion rocket with its
subject to a thrust which may be changing both very low thrust-to-weight ratio could hardly be
in direction and magnitude. Even one of the expected to take off from the ground, and
simplest thrust programs, a constant thrust in could only take off from an orbital platform.
the direction of motion, requires an electronic In the vacuum of space, the ion beam meets its
computer analysis in order to obtain the posi- ideal environment.
tion and velocity at future times (see ELEC- ARTHUR I. BERMAN
TRIC PROPULSION).
The continuous-thrust trajectory is a spiral
References
with many advantages over the orbital ellipses.
First, the lower sustained thrust precludes the Berman, Arthur I., "The Physical Principles of Astro-
high-acceleration stresses associated with rapid- nautics," New York, John Wiley & Sons, 1961.
burning chemical rockets. Much of the struc- ,Moulton, Forest Ray, "An Introduction to Celestial
tural weight usually needed to withstand these Mechanics," New York, The Macmillan Co., 1914.
stresses can be replaced by propellant. Also, Danby, J. M. A., "Fundamentals of Celestial Mechan-
flights to the extremities of a gravitational re- ics," New York, The Macmillan Co., 1962.
93 ASTROPHYSICS

Sterne, Theodore E., "An Introduction to Celestial 100 times the Earth's, temperatures of 750 K,
Mechanics," New York, Interscience Publishers, and possible seas of acetic acid. The high tem-
1960. perature is attributed to atmospheric gases,
Berman, Arthur I., "Space Flight," New York, Double- mostly CO 2 , that let sunlight in but prevent
day Book Co., 1979. the outward, long-wavelength thermal radiation
from leaving the atmosphere.
Cross-references: AERODYNAMICS;
DYNAMICS; High-speed jet-stream motions were detected
ELECTRIC PROPULSION; FLIGHT PROPULSION in the atmospheres of Venus, Mars, Jupiter, and
FUNDAMENTALS; GRAVITATION; INERTIAL Saturn. On Venus, they produce changing cloud
GUIDANCE; KEPLER'S LAWS OF PLANETARY patterns, on Mars they produce global dust
MOTION; MASS AND INERTIA; ROTATION- storms, and on Jupiter and Saturn they produce
CURVILINEAR MOTION; WORK, POWER, AND vortices that show as small white spots, but by-
ENERGY. pass the giant red spot on Jupiter (explanation
still unknown). By far the most dramatic Voy-
ager (1981) photos are those of Saturn's rings
(Fig. I). Earth-based observations showed none
ASTROPHYSICS of the detail of the hundreds of single rings,
some of them entwined by the gravitational
Starting with the advent of photography and effects of nearby small moons. Less impressive
the study of stellar spectra in the second half of rings were also found around Jupiter, and Earth-
the nineteenth century, astrophysics now in- based observations showed rings around Uranus.
cludes optical and radio observations of planets, These rings are related to stages in the origin
stars, clusters, interstellar material, galaxies and and evolution of the solar system, probably the
clusters of galaxies, and their interpretation. remnants of proto-planet clouds of gas and dust
Radiation from these external sources provides that formed the giant planets in much the same
information on the direction of the source, its way as the solar nebula formed the Sun and
velocity, composition, temperature and other planets. From all this, the new science of plane-
physical conditions, including magnetic fields, tology developed among astrophysicists, geo-
density, degree of ionization, and turbulence. physicists, meteorologists, and geochemists.
The term "astrophysics" is generally understood Comets have been observed and recorded by
to include all these aspects except the measure- men for several thousand years. Starting in the
ment of direction (ASTROMETRY -positions of 1920s, astrophysicists observed their spectra,
stars in the sky and changes due to parallax and which showed that as a comet approaches the
proper motion), and the orbits of planets, aster- Sun, sunlight boils off various molecules to
oids and comets (celestial mechanics). Because form a coma around the nucleus, then two long
of its proximity, the sun can be studied in more tails, pushed out away from the Sun by radia-
detail than other stars; its structure and its in- tion pressure and by the solar wind-outward
fluence on the nearby planets and comets are moving electrons and ions ejected by the Sun.
the concern of SOLAR PHYSICS and are closely The dust tail, showing in reflected sunlight, is
related to geophysics and stellar astrophysics. formed of dust released from the comet nucleus
Study of the motions of stars in pairs, groups, as frozen gases boil off; the ion tail is formed
clusters, associations, and galaxies is the overlap from ionized gases that show an emission-line
of celestial mechanics with astrophysics, and spectrum. These two tails deviate from one
the study of the distribution and patterns of another because the solar wind blows in a
motion of the distant galaxies is the overlap slightly different direction from solar radiation.
with "COSMOLOGY." The Oort cloud of comet nuclei (named after
Astrophysical studies of planets began in the Jan Oort, a Dutch astrophysicist) is believed to
1930s with measurement of their spectra, show- be the outer fringe of the solar nebula con-
ing the composition of the atmospheres of densed from gas and dust to form nuclei revolv-
Venus, Mars, Jupiter, Saturn, and Neptune. ing about the Sun at distances larger than 50
These spectra showed bands of CO 2 , CO, H2 , Astronomical Units (1 A.U. = 93 million miles,
CH 4 , NH 3 , and other, more complex molecules. the distance from Sun to Earth). These small
The atmospheres of Jupiter, Saturn, Uranus, bits of frozen gas and dust represent the early
and Neptune seemed to have about the same composition of the solar nebula. Every now
atomic composition as the Sun, and theoretical and then, one gets deflected from its circular
models of Jupiter showed that its core must orbit to a highly elliptical orbit, falling toward
consist of metallic hydrogen under high pres- the Sun, sweeping around at perihelion close
sure in the interior. Starting in 1959, the space to the Sun with large coma and long tails, then
age brought spacecraft missions to Mars, Venus, receding from the Sun out to aphelion, losing
Mercury, Jupiter, and Saturn, with landings coma and tails, then falling back in again.
on Venus and Mars (as well as Earth's Moon), Halley's Comet is the archetype of these long-
and measurements in situ of their atmospheric period comets, returning to the Sun every 75
composition, density, and temperature. Venus years. In 1985-86 it will reappear. European
probes in 1970-75 found the most extreme and Soviet spacecraft are expected to rendez-
conditions; surface pressure and density about vous with Halley's Comet then in an effort to
ASTROPHYSICS 94

FIG. 1. Saturn's Rings; NASA JPL color photo.

photograph the size of its nucleus and to sam- cluded dust, may help with calculations of how
ple the gas and dust boiling off. Astrophysicists the primordial solar nebula cooled and con-
want to determine what the frozen gases in the tracted about 5 billion years ago. The planets'
nucleus are; spectra of the coma show only sizes, masses, densities, rotations, distances from
molecules that have been dissociated by sun- the Sun, satellites and rings, and the thousands
light. The nature of these gases, and the in- of asteroids between Mars' and Jupiter's orbits,
95 ASTROPHYSICS

provide the rest of the data needed for a theory radio dishes thousands of miles apart; the slightly
of the origin and evolution of the solar system. different arrival times of radio waves from the
A recent discovery adds the fact that Pluto has same object provides resolution of 0.001 arc-sec
a small moon, whose orbit provides the first or better.
firm measure of Pluto's mass, much smaller than Optical telescopes, both in orbit and ground-
expected. based, are fitted with spectrographs that spread
The instruments used by astrophysicists have the incoming light into a spectrum of colors
multiplied during the past two decades, and from short wavelength to long, using a finely
their resolving power (needed, for instance, to ruled grating or an echelle to resolve wavelengths
detect Pluto's small moon close to that planet) as close as 1/100,000 of their size. This high
has increased enormously. Telescopes for the spectrographic resolving power allows very
1980s (see reference) inclUde the 2.4-meter small Doppler shifts to be measured, and the
(95-inch) Space Telescope (Fig. 2), the Very spectral lines of different atomic isotopes to be
Large Array of radio receivers near Soccorro, detected.
N.M., and the orbiting Einstein x-ray telescope Needed for orbiting telescopes and spectro-
launched in 1978. The European Space Agency graphs, electronic detectors have been devel-
is supplying Space Lab with a cooled infrared oped to substitute for photographic emulsion,
telescope to be flown on NASA's Space Shuttle. and are now being used in ground-based obser-
Space Telescope and VLA can resolve objects as vatories as well. The most successful is the
close as 0.02 arc-sec apart, and detect objects charge-coupled device (CCD). Arrays of 800
100 times fainter than previously possible. Very minute CCDs, 15 microns on a side, turn out
long baseline interferometry is achieved with digital information on each pixel (picture ele-

Equ ipment section

Scientific
instruments

FIG. 2. Diagram of Space Telescope; Longair & Warner, "Sci Res with ST" NASA CP-2111.
ASTROPHYSICS 96

ment) in an image or a spectrum. These digitized ments of many stars, it has been possible to
data can be fed directly into a computer for correct for the effects of interstellar absorption
processing. Such use of electronics and com- and limited wavelength range, as well as for
puters has changed the observer's role in astro- the inverse-square law (I/D2), to obtain total
physics. Instead of spending long cold nights at luminosities and colors. The luminosities are
the eyepiece of a telescope in an unheated often expressed in "suns", that is, multiples of
dome and developing his plates or film in a the sun's luminosity (about 4 X 10 33 ergs/sec).
photographic darkroom, he now prepares com- Analogous measurements at radio frequen-
puter programs in advance, picks the object he cies are expressed as the flux in watts/square
wants on a TV screen in the comfortable con- meter/cycle/second. Spectrophotometric mea-
trol room, and waits during a suitable "inte- surements from rockets and artificial satellites
gration time" for his processed results to be above the earth's atmosphere have been made
printed out or graphed by the computer. in the far ultraviolet, and it is to be expected
Early studies of stellar spectra revealed differ- that the intensity distribution, I(X), will soon
ences due primarily to surface temperature and be observed for all wavelengths from 10- 4 A
described by the sequence of spectral types to several km.
ranging from "0" (30000 K or more) through X-ray observations started in 1962, using
"B " "A " "F " "G " and "K" to "M" (2000 to ionization chambers and Geiger counters
3000 K): The' type' of a spectrum is set by rela- mounted on rockets and sent up for a few
tive intensities of lines and bands due to ions, minutes above the earth's atmosphere. As the
atoms, and molecules. The earth's atmosphere rocket rotated, the x-ray detectors scanned part
limits the wavelength region observable from of the sky, but the angular resolution was at
terrestrial observatories since ozone and other first poor (about 3°). Still, these early observa-
constituents are opaque at wavelengths '11.< tions showed several discrete sources (Sco X-I,
3000A, and water vapor bands block much of Sco X-2, Tau X-I, eyg X-I, etc.) named for the
the region 1.2/.1 < A < 8/.1. The ionized layers constellation in which each is located.
block radio waves longer than 20 meters. Never- Later orbiting x-ray observatories (Uhuru,
theless, thousands of spectrum lines, mostly in HEAO, Einstein Observatory) have detectors
absorption, have been identified in the range with much higher space resolution; they can
3000 < A< l2000A, and the pattern of lines locate x-ray sources to within a few arc-minutes.
within one spectral type has been found to vary Many of these sources have been identified with
with a second parameter, the "luminosity class" optical objects; they are many light-years dis-
designated by roman numerals "I" (highly tant, and their energy output is very high. Some
luminous "super giants") through "II", "III", are star-like in size, and thought to be the rem-
"IV" to "V" ("dwarfs" of relatively low nants of old supernovae-possibly neutron stars;
luminosity). others are whole galaxies, and our Milky Way
The continuum between spectral lines has an has a faint x-ray background. The measured
intensity distribution with wavelength that flux is given in counts/cm 2 -sec, each count
roughly matches Planck's theoretical distribution representing a quantum of about I A wave-
for a blackbody, B(A) dA = 2hc 2 A-5 (ehc/AkT- length, or about 104 eV, or 1.6 X 10- 8 erg.
1)-1 where T is the temperature that accounts Gamma rays of wavelength about 10- 3 A
approximately for the ionization and excitation (energy 10 MeV) can also be detected above the
of atoms producing the star's line spectrum, or atmosphere using scintillation counters-crystals
the dissociation of molecules producing bands- that emit a flash of light when penetrated by a
that is, for the spectral type. In so far as the lines gamma ray. Such detectors are sensitive, but
can be ignored, the color of a star is approxi- not accurate in direction. However, the direc-
mately that of a blackbody of temperature T tions of gamma-ray sources have been accurately
and the total ("bolometric") luminosity is determined by using three or more gamma-ray
given by Lb = 41TR2 aT4, where R is the radius detectors timing gamma-ray pulses or "bursts."
of the star and a is Stefan's constant. The These times, accurate to microseconds, deter-
"apparent brightness" of a star (observed opti- mine direction from the differences in time of
cal flux) depends upon its distance, D, and its arrival at spacecraft in known positions. Since
luminosity in the wavelength region observed- 1975, scores of gamma-ray bursters have been
approximately 4000 < A< 6sooA for visual located, and there is a diffuse gamma-ray back-
observations and 3700 < X< soooA for photo- ground all over the sky, indicating many more
graphic observations through glass optics. Re- distant sources. Solar flares emit gamma rays,
cently the introduction of photoelectric equip- and other stars probably do the same. The
ment has allowed more accurate measurements strong gamma-ray sources are probably distant
in smaller wavelength regions. Standard mea- galaxies, but bursters seem to be in our Milky
sures are designated U (ultraviolet), B (blue), Way Galaxy, and are thought to be neutron
V (visual), I (infrared), etc., and are usually ex- stars or black holes with matter falling into
pressed in magnitudes, an inverse, logarithmic them to produce the bursts. At least six are lo-
scale. An increase of S magnitudes corresponds cated at the centers of globular clusters.
to a decrease in brightness by a factor 100. By The major gap in observed spectra, 1('11.), is
successive approximations in such measure- from X = 0.08 to 2.S mm, between the far
97 ASTROPHYSICS

infrared and short radio waves. In the far infra- and that surrounded by an outer convective
red, measurements from high-flying aircraft and shell.
spacecraft show many star-like objects and a few The successful fitting of nuclear energy genera-
galaxies that are strong emitters between A = tion into such gas-sphere models of stars by
0.02 and 0.08 mm. In 1983 NASA's Infrared 1940 led to the idea of stellar evolution, the
Astronomical Satellite (IRAS), cooled by liquid conversion of hydrogen to helium in its core
helium to 16 K, surveyed the whole sky in the causing a star to age. Direct evidence of this
band 0.02 to 0.3 mm, and discovered thousands aging was first obtained from clusters of stars.
of far-infrared sources. The stars in one cluster, relatively close together
The masses of stars are determined from mo- in space, are assumed to have been formed at
tions of double stars, ranging from widely sepa- the same time, and the pattern of stellar
rated visual binaries whose relative motions can characteristics differs from one cluster to
be photographed, to close spectroscopic and another in a systematic way. The pattern is
eclipsing binaries with orbits calculated from easily recognized on a Hertzsprung-Russell
variations in radial velocity (observed Doppler ("H-R") diagram of log L vs spectral type (or
shift). Among some 50 pairs, masses of indi- color) on which the vast majority of stars ap-
vidual stars are found from 0.08 to 20 solar pear near a diagonal line, the "main sequence"
masses, and there is less definite evidence of (Fig. 3).
others as low as 0.03 and as high as 50 or 100. Several other classes of stars can be distin-
(One solar mass is 2 X 10 33 gm.) Over most guished by location on the H-R diagram ("red
of this range the luminosity L is proportional giants," "white dwarfs," etc.) and theories of
toM3. .stellar evolution account for a change in loca-
Astrophysical theory has achieved consider- tion along an "evolutionary track" for anyone
able success in explaining the spectra of stars by star. The rate of such change will vary in general;
theoretical models of the atmospheres, involving for instance, the large-mass, high-L blue stars are
the surface temperature, surface gravity, abun- expected to exhaust their hydrogen in a few
dances of chemical elements, turbulence, rota- million years, whereas yellow and red dwarfs
tion, and magnetic fields. The strengths of ab- remain for billions of years on the main se-
sorption lines are found to fit a "curve of quence. H-R diagrams for clusters confirm the
growth," the relation between measured line aging (also the theoretical star models and the
strengths and the strengths predicted by quan- assumption of cluster origin) and provide evi-
tum theory for unit abundance of the one ion, dence of the age of each cluster.
atom, or molecule involved. The theory of
stellar interiors further relates mass M, lumi- -8
nosity L, and radius R with chemical abun- NGC2362
dances, the opacity of the material, and the -6
generation of energy by nuclear reactions. More
spectacularly, it has explained stellar evolution h+XPER~
in terms of changes due to nuclear reactions. - 4
The theoretical models of stellar interiors are ".=<"c
based on stability and two modes of transferring -2
energy outward to the surface: radiative transfer '"'"
E
and convective transfer. Radiative transfer, by
repeated emission, absorption, and emission of '";;a 0

light, implies a temperature gradient dependent :;" +2


on the opacity and on the flow of radiative Si
.0
energy, or L. After calculating the opacity of <l:
+4
gaseous stellar material (about 60 per cent hy-
drogen, 35 per cent helium, and 5 per cent
heavier elements, by weight) at various tem- +6
peratures and densities, the astrophysicist can
compute the temperature, density and pressure +8 -0.4 _ 00 0.4 1.6 2.0
in shells at various depths inside a star, starting Color index (8 - V)
with a definite radius and surface temperature,
and adding up the shell masses to get the total FIG. 3. This is the H-R diagram for galactic clusters
mass. At some level, the temperature and density used by Allan R. Sandage to estimate their relative
are sufficiently high for nuclear reactions to take ages. The turnoff from the age-zero main sequence
place, the simplest being conversion of hydro- (on which the sun is located) is lowest for the oldest
gen to helium genera ting 7 X 10 18 ergs of energy clusters, NGC I 88 and M67. The stars in the upper-
per gram. The rate of energy generation over the most sequences, such as the red giants of the Perseus
whole inner core must match L. Calculations for Double Cluster, are intrinsically some 10,000 or more
convective transfer follow a similar pattern but times brighter than the sun. (From Thorton Page and
depend upon matching the adiabatic gas law for Lou W. Page, eds. The Evolution of Stars, in Sky &
over-all stability. A combined model may have a Telescope Library of Astronomy. Copyright © 1967
convective core surrounded by a radiative shell by Macmillan Publishing Co.)
ASTROPHYSICS 98

A large part of astrophysical research is de- Interstellar material in the form of bright
voted to filling in the details of stellar evolution nebulae has been known since telescopes were
and the variety of nuclear reactions involved. first used. During the first two decades of this
For example, after its hydrogen is exhausted, century, evidence was collected showing less
the core of a giant star contracts and heats up obvious clouds of dust and interstellar gas in the
to a billion degrees; then helium combines to plane of the Milky Way , based on the obscuring
form carbon, providing a new intense source of and color effects of dust and the spectral absorp-
energy, and gas is probably blown off the star. tion lines of gas (primarily sodium and ionized
Later, a small white dwarf remains. The ex- calcium). In 1945 the polarization of starlight
plosion driving off a large fraction of a star's caused by the interstellar dust was discovered,
mass probably accounts for supernovae, which and in 1950 the radio telescope added the
are seen every 100 years or so in our Milky emission by interstellar atomic hydrogen at
Way and in other galaxies. One that was seen 21-cm wavelength. This interstellar medium is
to blow up in A.D. 1054 now has a large ex- now known to extend in a thin, flat slab centered
panding cloud of ionized gas around it-the Crab in the Milky Way. When highly luminous blue
Nebula in Taurus. Near the center is a peculiar stars are in or near it, the gas is ionized by
small star thought to be the core of the super- ultraviolet radiation, and the resulting electrons
nova, possibly a neutron star with density (of produce emission lines of hydrogen, oxygen,
pure neutrons) about 10 15 gm/cm 3. Such super- helium and other elements by recombination
nova remnants (SMRs) are found in other gal- or by electron excitation. In addition to such
axies such as the Large Magellanic Cloud (LMC). "H II regions," the dimensions of which depend
In 1968 this was confirmed when astronomers on the temperature and luminosity of the ex-
in England discovered a rapidly pulsating radio citing star and the density of the medium,
source, the first of about 300 pulsars now astrophysicists have studied more complex neb-
known, one of which is near the center of the ulae in which the material density varies from
Crab Nebula. By 1970 it was fairly well agreed one place to another. The interstellar medium
that pulsars are rapidly rotating neutron stars is often denser near young clusters or individual
with strong magnetic fields that interact with blue stars, as expected from the theory of star
surrounding gas to emit a radio pulse on each formation
rotation. This pulse timing is accurately peri- Since 1963, radio astronomers have detected
odic-1.33 730 sec for the first pulsar discovered over 50 different kinds of molecules in the in-
-but the pulses measured in different radio fre- tersteller medium, mostly in dark clouds of
quencies (wavelengths) are out of step, an effect dust where light from stars cannot penetrate.
due to the ionized gas along the line of sight. In These molecules consist of the most common
several cases, the period of the pulses (star rota- atoms, H, C. N, and 0 ; two include sulfur (S),
tion period) has been found to be increasing by and one silicon (Si). Absorption lines of CH and
10- 6 sec or so each year, showing the "braking" CN were discovered in optical spectra of stars
action of the surrounding gas. (see PULSARS). about 1940, and H2 in far-ultraviolet spectra
The formation of a star starts with gravita- taken from a rocket above the atmosphere in
tional contraction of a large cloud ("nebula") 1969. The study of these molecules is the new
of interstellar gas and dust, including matter subject of astrochemistry , and involves such
ejected from previous generations of giant stars. questions as how the interstellar molecules are
The recycling of material back and forth from formed, what other ones should be there, and
nebulae to stars involves changes in composi- the conditions (density, temperature, turbulent
tion-the abundances of helium and heavy ele- motions) of the gas. Preliminary estimates are
ments increasing with time . Since 1954, astro- that most of the 8 simple diatomic molecules
physicists have therefore been concerned with may be formed by collisions in space; the more
nuc/eogenesis, the creation of the chemical ele- complex ones, such as H2C0 2, CH 3CN, and
ments in stars (or in an early stage of the evolv- NH3CO, are probably formed on grains of inter-
ing universe). Differences in composition of stellar dust in clouds where the concentration of
stars in various locations are now interpreted as hydrogen molecules (H 2) is 106/cm3 or higher.
evidence of past star making. These molecules leave the dust grains after com-
The formation of the solar system (sun, bination, but other molecules stick to the dust
planets, asteroids, meteors, and comets) is one until strong light "boils" them off. Hence dust
case of star formation studied in great detail by grains can deplete the interstellar gas in large
astrophysicists, geologists and chemists. Radio- dark nebulae-another process going on in our
active dating of minerals in the earth, moon, Milky Way.
and meteorites places this event about five bil- The molecular absorption lines are at slightly
lion years ago when a slowly rotating nebula different wavelengths when hea~ isotopes 13C
contracted, forming earth and planets, but losing or 18 0 are involved instead of 1 C or 16 0, and
a good deal of its mass in the process. Fraction- the line strengths show the relative abundances,
ation of chemical elements and compounds or "isotope ratios," which are related to the
during the condensation is linked with astro- source of the interstellar gas. Relative strengths
physical interpretation of chemical analyses of of different spectral lines from the same mole-
meteorites, and lunar and terrestrial minerals. cule show that the excitation temperature in
99 ASTROPHYSICS

instellar clouds is very low-about 3 K, and the is possible that these ejected ions are accelerated
absolute strengths of the spectral lines show to enormous speeds by the uneven magnetic
that the density of, for instance, CO is 20 mole- field in the Milky Way system. Cosmic rays are
cules/cm 3 . Two molecules, OH and H 2 0, show studied above the atmosphere to determine the
maser interaction; i.e., these molecules are ex- relative abundance of different ions, their en-
cited so as to "pump" energy into a few se- ergy spectrum, and source. Their energy flux is
lected radio wavelengths, which may come out about the same as starlight, and they probably
of the gas cloud in a single direction. affect the temperature of the interstellar gas.
The whole Milky Way system of stars, nebulae They certainly produce x-rays and gamma rays
and interstellar gas and dust is assumed to be in by collisions with atoms and molecules.
dynamic equilibrium; that is, the mass distribu- The many other galaxies well outside our own
tion can be calculated from individual motions are found to include some (classed as "spirals")
under the gravitational attraction of the whole very similar to our Milky Way galaxy in struc-
galaxy, another important part of modern astro- ture and internal motions. Others are strikingly
physics. Since 1920 the dimensions of the gal- different (classed as "ellipticals"), probably due
axy have been determined from distances of the to different conditions of formation. All the
large bright globular clusters and from the dis- techniques of astrophysics are being applied to
tances of nearer stars. The resulting model, a the study of these objects: measurement of
flat disk with a high-density nucleus (total mass their sizes, luminosities, colors and masses, their
about 1011 suns) also fits the average motions proportion of interstellar material, the forma-
of stars within a few thousand light years' dis- tion and evolution of their stars, etc. These
tance from the sun, and the radial motions of physical characte!istics are fairly well correlated
cold atomic hydrogen out to 50 000 light-years with morphological type; the spirals are similar
determined by Doppler shifts in the 21-cm to our Milky Way, but the ellipticals have al-
radio emission line. The stellar motions are de- most no interstellar material, and are much
rived from statistics of Doppler shifts and more massive for their size and luminosity than
changes in direction, allowing for random in- the spirals.
dividual motions differing from the general The evolution of galaxies is due to the ageing
circulation. of their member stars. This is a slow process,
Most of the stars in the Milky Way share in a and no change can be observed during a man's
circular velocity, vc, around the center of the lifetime, except for supernova explosions, and
galaxy, and the mass distribution is inferred light variations in the nuclei of a few "active gal-
from the observed change of Vc with distance axies." But as we look to more distant (fainter)
from the center. The globular clusters and many galaxies, we are looking back in time, seeing
other stars appear to move in orbits at high in- them at a younger age. This is because faint
clination to the Milky Way plane, forming a galaxies are hundreds of millions of lightyears
"halo" around the center having little or no away, and the light we see now left them hun-
angular momentum. It thus appears that there dreds of millions of years ago. By 1986, it may
are two populations in the galaxy: "Population be possible to look back more than 5 billion
!" stars, nebulae, gas and dust in the outer parts years, using the new Space Telescope, launched
of a thin rotating disk, and "Population II" stars into orbit then by Space Shuttle. Astrophysi-
in the nonrotating halo, probably formed at an cists have traced galactic evolution backwards
earlier time. A large fraction of Population I in and expect that elliptical galaxies were very
the disk is in the form of nonluminous inter- luminous 10 billion years ago, when their blue
stellar dust and gas, although this interstellar supergiant stars had just formed. They are eager
material is only 10 to 15 percent of the total to confirm this, and also to observe young spiral
mass of the system. During the 1970s, after the galaxies with Space Telescope.
"mass discrepancy" was recognized, several as- The mass of a single galaxy can be measured
trophysicists assumed the presence of much from the rotational velocity, Vc, near the rim
more nonluminous material in the halo. They and the radius, R. Then M = vc2R/G, where G
found some evidence of such "dark haloes" in is the gravitational constant, and Vc = v/sin i,
other galaxies (see below), and calculated that where v is measured from the relative Doppler
as much as half the mass of a galaxy is in its shift (rim to center), and i is the inclination of
dark halo. Efforts have been largely successful the disk to the line of sight. However, R is
in confirming this assumption for the Milky poorly determined because galaxies do not have
Way system. sharp rims. Moreover, the observed change of
There is a weak magnetic field between the Vc with distance from the nucleus r does not
disk stars which probablY lines up the inter- always follow the expected form of a rapid rise
stellar dust particles like small magnetic needles, to a maximum and then a decrease (vc a: l/V)
and this pattern accounts for the polarization of after r includes most of the galaxy's mass. Dur-
starlight passing through. The field also deflects ing the 1970s many such velocity curves were
moving ions and electrons, and may account for measured for spiral galaxies, and several showed
the energy of very high-speed cosmic rays. no peak; that is, Vc remained at its peak value
Lower-speed cosmic rays are ejected by solar out to r for the faintest rim stars observable.
flares, and other stars probably do the same. It This is considered evidence of a spherical dark
ASTROPHYSICS 100

400
"0
C
o
~ Blitz (110).

~ v.; •
~

~ 300
~ .La
v
a ...
~
(I)
~
I"'"
E
~ 200
...............
---
Schmidt (1965)
~ r-- _ _
'uo
Qj
> 100 --- "--
Keplerian decline
coc
o
';;
~
o
a:
5,000 10,000 15,000 20,000 25,000 30,000 35,000 40,000 45,000 50,000
Distance from galactic center (parsecs)

FIG. 4. Rotation curve graphs the circular velocity of matter in rotation about the center of the Milky Way.
Here two such curves are drawn. A rotation curve plotted in 1965 by Maarten Schmidt of the Hale Observatories
(solid black line) shows a circular velocity that declines toward the limit of the visible galaxy at 20,000 parsecs.
If all the mass in the galaxy lay inside that limit, a test mass placed farther out would rotate at lower speed, in
approximate obedience to a law first formulated (for the motion of the planets) by Johannes Kepler (broken
line). Data analyzed by Leo Blitz and his colleagues at the University of California at Berkeley now yield a rota-
tion curve (light solid line) that rises toward a value of 300 kilometers per second at 20,000 parsecs. The rise of
the newer curve implies unseen mass in great quantity outside the visible limit of the galaxy. Each point in the
newer data is the circular velocity of a cloud of hydrogen atoms. (From Bart J. Bok, "The Milky Way Galaxy,"
Scientific American, March 1981. Copyright © 1981 by Scientific American, Inc. All rights reserved.)

halo-nonluminous material extending almost mass MH, about 10 8 SUns within a radius RH =
to the edge of the luminous disk. Fig. 4 shows 2GMH/C 2 = 3 X lOi~ cm = 3 X 10- 5 light year.
the velocity curve for the Milky Way system. (This "Schwarzschild radius" RH comes from
Another method can be used to measure the setting the velocity of escape Ve equal to
masses of the many galaxies in pairs, one orbit- the velocity of light c in the equation !mvi =
ing around the other. In such a pair, MI + M2 = GmM/R.) With such a small radius (0.01 light-
(VI - V2)2S/G, where S is the separation. How- day), the black hole in the nucleus can vary
ever, the inclination of the orbit cannot be rapidly in brightness as gas falls into it.
measured, and an average value of M must be Galaxies are observed in increasing numbers
obtained from statistics of many pairs. This at larger and larger distances, roughly in uni-
method shows that the average spiral has Ms = form distribution, but with marked clustering.
4 X 10 10 suns = 8 X 1043 gm, while the average In 1981, a large void was found, about I billion
elliptical has 30 times this mass. The luminosity lightyears from us in the direction of the con-
of either type is 10 10 suns, on the average, so a stellation Bootes, where astrophysicists can de-
spiral has about four times as much mass per tect no galaxies in a region 300 million light
watt of light output as the sun does, while an years across. There is some evidence that com-
elliptical has over 100 times as much. pact clusters contain a preponderance of ellip-
Except for the dark haloes, the mass of most ticals; the total number of galaxies in the Coma
galaxies seems to be concentrated in their Cluster is about 500, of which only a dozen are
nuclei, and several of these nuclei appear to be spirals. The total mass of such clusters can be
exploding. The most dramatic is the irregular measured by the relative velocities of galaxies
galaxy, Messier 82, with jets coming out of the near the edge, and when this was done (first in
nucleus (Fig. 5). Doppler shifts in these jets 1932), astrophysicists were surprised to find
show that they are moving outward at about that the cluster mass is 5 to 10 times larger than
1000 km/sec. A few dozen Seyfert galaxies the sum of all galaxy masses in it. This mass
have active nuclei that vary in brightness and discrepancy meanS that we see only 10 to 20
show strong, broad emission lines indicating percent of the material in a large region of
explosive conditions. Both jets and Seyfert space-possibly in the rest of the observed uni-
nuclei, as well as the superluminous quasi- verse, as well. The "missing mass" may be a
stellar objects (QUASARS) are thought to be thin intergalactic gas or discrete, nonluminous
powered by black holes at the center. The power objects, or due to the fact that many galaxies
comes from mass falling into the black hole at have dark haloes and masses 10 times as large
rates of up to I solar mass (2 X 10 33 gm) per as the "average mass" used. Another possibil-
year. Such black holes are estimated to have ity was brought out in 1980, when physicists
101 ASTROPHYSICS

FIG. 5. The irregular galaxy Messier 82, photographed in red light of hydrogen with the 200-inch Palomar re-
flector. The enormous puff of luminous hydrogen gas escaping from the nucleus of this system stretches over
10,000 light years above and below the disk of the galaxy, which is about 20,000 light years across. (Palomar
Observatory Photograph.)

found evidence that neutrinos have a small mass. distances, we see now the quasars as they were
Before then, neutrinos were considered to have 5 billion years ago, and it seems likely that
a rest mass of zero. Nuclear reactions in the many galaxies exploded at that time (see
Sun's core are spewing out neutrinos at the rate QUASARS).
of 10 38 per second, and more luminous stars do Here astrophysics leads into COSMOLOGY,
so at even higher rates. Over the past 15 billion based on general RELATIVITY. Observations of
years, all space must have been filled with neu- the short-wave (3-mm) radio emission coming
trinos, although their number levelled off as from all over the sky (isotropic) seems to con-
the density increased and neutrino pairing be- firm the "big-bang" cosmological model of the
gan to destroy them as fast as they are being universe, and to disprove the steady-state theory
created. Light as they are, the large numbers which assumes continuous creation of matter.
of neutrinos add up to a mass somewhat larger The isotropic background radiation with leA)
than all the visible galaxies, and provide a good matching a black body at 3 K, is the remnant 01
part of the "missing mass" in the universe. the very hot explosion about 20 billion yean
The spectra of distant galaxies all show large ago that sent the galaxies moving outward to
red shifts which, if interpreted as Doppler shifts, the great distances where we see them today.
indicate a recessional velocity proportional to Astrophysical evidence for the missing mass in
distance (Hubble's Law). In fact, the red shift clusters of galaxies shows that the average den-
is used to measure all distances (D) larger than sity of matter in the universe is probably higher
about 40 million light-years by D =vR/H, where than 10- 28 gm/cm 3 so that the space curvature
H = 17 km/sec/million light years. Starting in predicted by general relativity fits the numbers
1958, radio astronomers began to survey the of galaxies we count at <;Iifferent distances. For
whole sky for faint radio sources, and found higher density of matter, the predicted curva-
that many galaxies are strong radio emitters. ture is larger, and at the extreme, ~pace "wraps
The most puzzling were faint, starlike (quasi- itself around" a large collapsed mass so that no
stellar) objects, soon named quasars. In 1963 light or radio waves can get in or out. This
these were found to have very large red shifts, leaves a "black hole" which exerts gravitational
therefore very far away (up to 5 billion light- attraction on distant masses, but cannot be
years) and about 100 times more luminous than seen-a possible explanation of the missing mass
the average galaxy. This large energy output is in clusters of galaxies.
probably an enormous explosion, lasting but a
brief fraction of a galaxy's life. At these large THORNTON PAGE
ASTROPHYSICS 102

References The Stern-Gerlach experiment (1924) demon-


strated the validity of space quantization of
Abell, G. 0., "Exploration of the Universe," 2nd Ed., angular momentum and established the electron
New York, Holt, Rinehart and Winston, 1969. spin as 1/2. This historic work placed quantum
Pasachoff, J., and Kutner, M., "University Astronomy," mechanics on a firmer foundation and initiated
Philadelphia, Saunders, 1978. beam investigations of atomic and nuclear elec-
Page, T., and Page, L. W., "The Evolution of Stars," tromagnetic properties. Although many low-
New York, Macmillan, 1968. precision results appeared in subsequent years,
Page, T., and Page, L. W., "Stars and Clouds of the high-precision spectroscopy began in 1937 with
Milky Way," New York, Macmillan, 1968. the introduction of the magnetic-resonance
Bok, B., and Bok, P., "The Milky Way," Cambridge, method by Rabi. In this method, transitions be-
Mass., Harvard Univ. Press, 1981. tween quantum states separated by an energy hv
Page, T., and Page, L. W., "Beyond the Milky Way," are induced by a radio-frequency field of fre-
New York, Macmillan, 1969. quency v (h is Planck's constant). From the
Page, T., and Page, L. W., "Space Science and Astron- Heisenberg uncertainty principle, the width of
omy," New York, Macmillan, 1976. the rf resonance is small, owing to the long life-
Burbidge, G., and Hewitt, A., "Telescopes for the times of the beam quantum states and to the
1980s," Palo Alto, Annual Reviews, 1981. ability to irradiate the beam with radio fre-
Longair, M. S., and Warner, J. W., "Scientific Research quency for as long as a few milliseconds along
with the Space Telescope," NASA CP-2111, Washing- its path.
ton, US Govt. Printing Office, 1979. Precision atomic-beam measurements have
Burbidge, Burbidge, Fowler, and Hoyle, "Synthesis of' contributed to many theoretical and practical
the Elements in Stars," Rev. Mod. Phys., 29, 547 developments. The deuteron quadrupole mo-
(1957). ment (1939) pointed to the necessity of a tensor
Menzel, D. H., Whipple, F., and deVaucouleurs, G., interaction in nuclear forces. The anomalous
"Survey of the Universe," Englewood Cliffs, N.J., electron moment (1949) and the Lamb shift
Prentice-Hall, 1971. (1950) in the atomic-hydrogen fine structure
were resolved by quantum electrodynamics.
Cross-references: ASTROMETRY, COSMOLOGY, Nuclear spins (f) as well as magnetic-dipole (j1)
GRAVITATION, DOPPLER EFFECT, PULSARS, and electric-quadrupole (Q) moments have been
QUASARS, RELATIVITY, SOLAR ENERGY important in providing test information for the
SOURCES, SOLAR PHYSICS. shell model (1949) and collective model (1953)
of the nucleus. Atomic hyperfine-structure con-
stants (dipole, a; quadrupole, b; and octupole, c),
ATOMIC AND MOLECULAR BEAMS which describe the interaction between the elec-
trons and the nucleus, as well as the numerous
This field of research utilizes a collision-free constants required to describe a molecule and
stream of neutral atoms or molecules as they its internal interactions, have contributed to the
traverse a vacuum chamber. With 10-6 mm Hg theory of atomic and molecular structure. The
pressure in a vacuum chamber, air molecules at cesium "clock" or frequency standard (1952)
room temperature travel on the average about represents a widespread practical application of
300 meters between collisions and move with beam technology by using the hyperfine-struc-
an average speed of about 500 m/sec. Between ture transition at 9192.631770 MHz (Ephem-
collisions these molecules are essentially "free" eris time) to regulate a quartz-crystal oscillator.
and unperturbed by molecules of the residual Other frequency standards such as the thallium
gas or by atoms in the walls of the apparatus. clock, ammonia maser (1954), and hydrogen
The mathematical description of such isolated maser (1960) also employ beam techniques for
systems is much less complicated than for denser quantum-state selection. Very recent, high-
gases, liquids, or solids containing interacting energy nuclear accelerators have been equipped
particles. with atomic-beam sources to produce polarized
Since 1911, when Dunoyer proved that a protons.
stream of neutral atoms would remain collimated Among nonresonant experiments, beams im-
in a vacuum, atomic- and molecular-beam re- lJmging on solid surfaces produce information r)ll
search has become one of the most versatile, pre- the wave nature of particles, work functions,
cise, and sensitive techniques for studying the and accomodation coefficients. Charge-exchange
properties of isolated atomic systems and inter- cross sections and interaction potentials are ob-
actions between such systems. Numerous funda- tained from experiments with crossed beams,
mental discoveries in beam research have con- one neutral and one charged. Chemical reaction
tributed to the present understanding of physical kinetics in isolated systems are studied in crossed
laws. The earliest experiments (1920) sought the beams of two reactants. Precision optical studies
molecular velocity distribution, which is impor- combine beam and laser techniques.
tant in the kinetic theory of gases. Atomic diam- Four individuals have received Nobel Prizes
eters (cross sections), van der Waals' interaction for beam research: Otto Stern (1943) "for his
potentials, and polymer vapor composition were contribution to the development of the molec-
obtained after later refinements of technique. ular-ray method and for his discovery of the
103 ATOMIC AND MOLECULAR BEAMS

1.5

J; 1'2
1.0

0.5
F =I

-'lL 0
t:.W
'"
<l
.t::
- 0.5
F=O

-1.0

-1.5
0

FIG. 1. Hyperfine-structure energies (ordinate) of an atom with J = 1/2 and 1= 1/2


in an external magnetic field (abscissa).

magnetic moment of the proton"; 1. 1. Rabi Jf/h (Hz) =aI • J + b (quadrupole operator)
(1944) "for his application of the resonance
method to the measurement of the magnetic - KJ(/J..o/h)J • H - g[{iJ..o/h)I • H.
properties of atomic nuclei"; P. Kusch (1955)
"for his precision determination of the magnetic where some symbols have been defined pre-
moment of the electron"; and W. E. Lamb viously, KJ=f.1J/(Jf.1o), K[=f.1[/(If.1o), and f.1o is
(1955) "for his discoveries concerning the fine the magnitude of the Bohr magneton. The first
structure of the hydrogen spectrum." two terms represent the dipole and quadrupole
A discussion of the energy levels of a simple hyperfine-structure interactions between the
atom and of one particular apparatus will illus- electrons and nucleus; the last two terms ex-
trate the magnetic-resonance technique. An press the interaction of the electron and nuclear
isolated atom in an external field, H, has energy magnetic moments with the external magnetic
states which are calculable from the Hamiltonian field. For the simple case of I = 1/2, J = 1/2, b =

PUMP 1
1 [ ~~ jC)1 [ /$ 1 i
C-- ~-- - --.~-- I ~
o
o ~P.I B
Rf. . r

tIi o
I FI
f
.It ' ,i i'
(VH) {V H)
'
l1
i: HA I :' ~': I
!-+-i- ---6 0- -- ,: --
He
01 D
;'-'-'~l=-,
I I " ~ I

, ... ,-----
I I .... ,
,
\0
FIG. 2. Schematic of an atomic-beam, magnetic resonance apparatus.
A TOMIC AND MOLECULAR BEAMS 104

o as in the ground electronic state of atomic References


hydrogen, the energy levels that arise from dif- W. J. Childs, "Hyperfine and Zeeman Studies of
ferent "relative orientations" of the nuclear (I) Metastable Atomic States by Atomic-Beam Magnetic-
and electronic (f) spins are shown in Fig. 1 as a
function of the magnetic-field parameter X = Resonance," in Case Studies in Atomic Physics, Vol.
3, No.4, North-Holland, Amsterdam, 1973.
[(-gJ + g/)J.l.oHj /flW. Here flW is the hyperfine-
English, T. C., and Zorn, J. C., "Mo~ecular Beam
structure separation at H = O. The levels are
Spectroscopy," in "Methods of Expenrnental Phys-
labeled by either the low-field quantum numbers
ics," vol. 3, 2nd edition, Academic Press Inc., New
(F, mF) or by the high-field n,umbers (mJ, "!/), York, 1973.
where F = II ± fl, and the m s a~ t~e proJec- Estermann, I., Ed., "Recent Research in Molecular
tions of F I or f along the field duectIon. Beams," a collection of papers dedicated to Otto
Of the ~~ny magnetic or electric resonance Stern, on the occasion of his seventieth birthday,
apparatuses, one specializ~d type ~as proved New York, Academic Press, 1959.
valuable for measuring atomIC propertIes of both
M. A. D. Fluendy and K. P. Lawley, "Chemical Appli-
stable and radioactive isotopes. In Fig. 2, the
cations of Molecular Beam Scattering," Chapman
"oven" or source 0 may take one of many
and Hall, London, 1973.
forms-a microwa~e discharge to dissociate gas-
Kopfermann, H., "Nuclear Moments," English.transla-
eous diatomic molecules, a closed tantalum
tion by E. E. Schneider, New York, AcademIC Press,
crucible (with an exit slit) heated by electron
1958.
bombardment, or one of many other devices
Kusch, P., and Hughes, V. W., in Flugge, S., Ed.,
for evaporating atoms. The atoms pass between
"Handbuch der Physik," Vol. 37/1, Berlin, Springer-
the poles of three separate electromagnets (de-
Verlag, 1959.
noted A, C, and B, successively, from oven to
Nierenberg, W. A., in Ann. Rev. Nuc1. Sci., 7, 349
detector). The inhomogeneous A ~nd B ~agnets (1957).
have eccentric cylindrical pole tIps WhICh pro- Ramsey, N. F., "Molecular Beams," London Oxford
duce a field gradient, aH/aZ. In this field, an University Press, 1956 (reprinted in 1963).
atom experiences a force F = J.l.eff«lH/aZ), where Smith, K. F., "Molecular Beams," London, Methuen
J.l.eff(= -aW/aH) is the negative slope of an en- and Company, 1955.
ergy level in Fig. 1. Within t~e h~mogeneous C
field, a superimposed rf fIeld l.nd~ces ~tate Cross-references: COLLISIONS OF PARTICLES,
changes. Thus an atom which remams m a ~mgle CROSS SECTION AND STOPPING POWER, MAG·
state [( 1, 0) for example] is deflected similarly NETIC RESONANCE, CHEMICAL KINETICS.
by the strong A and B magnet~ and fol~ows tra-
jectory 1 in Fig. 2. The stopwue, S, shIelds the
detector from fast atoms and atoms with ~~all
deflections. If, in the C-field region, a tranSItIon ATOMIC CLOCKS
occurs that causes the high-field slope (- J.l.eff) to
change sign [e.g., (l, 0) ~ (l, -1)], the A and B Time Standards The concept of time has at
deflections are opposite, and the atom follo~s least two distinct aspects: (l) duration (or time
trajectory 2 to the det~ctor D ..A res<;mance IS interval) and (2) date. Ordinary clocks can be
observed as an increase m beam mtenslty at the
detector. Values of the constants in the Hamil- used for' either aspect, while stopwatc~es mea-
tonian are deduced from the observed resonant sure duration only. The concept of date mcludes
frequencies of the atoms in known magnetic not only the calendar day but finer subdivisions
fields. Some detection methods in frequent use as well. For example, a date might be 10.:27 AM
are: EST November 13 1982. Both duratlOn and
(a) Deposition on a surface with subsequ.ent date' use the same 'set of units; that is ~ays,
assay by radioactive counting, neutron actIva- hours minutes and seconds can refer to eIther
tion, or optical means (earliest detector). date ~r duratio~.
(b) Ionization of alkali atoms on a ho~ tu.ng- Almost all clocks have three main parts: (1) A
sten wire, and measurement of the resultmg lOn pendulum or other periodic phenomenon to
current. control the clock rate; (2) a counting element
(c) Electron-bombardment. i~ni~ation ~ith to count the cycles of the periodic el~ment j~st
subsequent mass analysis to dlscnm.mate agamst mentioned' and (3) a display mechamsm to dIS-
background gas ions. The beam lOns ar~ f!e- play the ~urrent count (Le., time). Atomic
quently counted by using electron multIplier clocks use natural resonances in atoms or mole-
tubes. cules for the periodic part. It is possible to find
(d) Other detectors employing the principles natural resonances which are very insensitive to
of radiometers, pressure manometers, thermo- environmental conditions as well as being in-
piles, bolometers, laser fluorescence, and changes sensitive to the detailed construction of the
in space charge. clock. This allows independent laboratories .to
fabricate atomic clocks which agree in rate WIth
other comparable clocks to within a few parts
HOWARD A. SHUGART in ten to the 14th power.
105 A TOMIC CLOCKS

The Early Atomic Clocks In 1949, Harold ten to the tenth. Although expensive, several
Lyons at the U.S. National Bureau of Standards standards laboratories used them.
announced the operation of the world's first In a joint experiment between the US Naval
atomic clock. This clock was based on micro- Observatory (USNO) and the National Physical
wave absorption in ammonia and was stable to Laboratory (NPL) at Teddington, England, the
about one part in a hundred million, which was frequency of the appropriate transition in cesium
roughly comparable to the best clocks of that was measured in terms of the standard of the
time (pendulum and quartz). second as then defined (the Ephemeris Second).
In the 1950s and early '60s, scientists and en- This experiment involved NPL's laboratory ce-
gineers investigated various atomic and molecu- sium frequency standard and USNO's deter-
lar resonances as well as various means of ex- mination of Ephemeris time. The experiment
tracting the desired information to run a clock. lasted about three years and resulted in a fre-
In the mid 1940's, I. I. Rabi suggested using quency value for cesium of 9,192,631,770 Hz.
cesium in an atomic beam as a frequency stan- The accuracy was thought to be about one or
dard. With innovations due to N. Ramsey, the two parts in ten to the ninth, due mostly to the
essential design of the cesium beam clock was difficulties in measuring Ephemeris time, not
complete as we know it today, with, of course, the cesium resonance.
numerous small refinements. In the '60s and '70s competing firms devel-
In the mid-'S Os, C. H. Townes and others at oped smaller and more advanced cesium beam
Columbia University demonstrated the ammonia models. Although the basic design remained
MASER (Microwave Amplification by Stimu- about the same, the beam tube was made smaller
lated Emission of Radiation). In the early '60s, and the electronics were reduced greatly in size
N. Ramsey and others developed the hydrogen with the advances in solid.,state electronics. The
maser. This maser is unique in that the hydrogen performance of cesium clocks was close to the
maser stores hydrogen atoms in a quartz bulb theoretical limits imposed by the granularity of
for periods of a second or so. This relatively the beam (Le., "shot-noise"), and design limita-
long interaction time results in a very sharply tions such as size, power, and reliability.
defined resonance (Le., a high Q). The hydro- Several primary standards laboratories built
gen maser has demonstrated the highest fre- laboratory versions of the cesium beam stan-
quency stability of any atomically controlled dard. The basic philosophy was to build a device
device for sampling periods out to a few hours. which could allow the laboratory to measure
Even today, hydrogen masers are used in special any physical parameter which might possibly
applications where this high stability is needed. influence the frequency. Experiments were per-
Because the hydrogen atoms collide with the formed to test agreement of theory and perfor-
walls of the quartz storage bulb, the hydrogen mance of the standard. The laboratory then
resonance is shifted. Various wall coatings have made a final accounting of uncertainties caused
been used in attempts to avoid this and other by such things as imperfect knowledge of the
problems, but so far the hydrogen maser has magnetic fields within the beam tube and many
not been able to compete with the cesium beam others. The resulting figure is an accuracy esti-
device for absolute frequency accuracy. mate of that laboratory's frequency standard.
Also in the '50s, scientists and engineers de- Historically, the individual accuracy estimates
veloped the rubidium gas cell. This device opti- of the few primary frequency standards in exis-
cally pumps rubidium atoms (contained in a tence have normally (but not always) been in
sealed cell) to a higher energy state. A micro- accord with the measured differences between
wave signal relaxes the atoms to the lowest standards. Any disparity never has been really
hyperfine level in the ground state provided large.
this microwave signal is suitably close to the On Friday, October 13, 1967, the General
natural rubidium resonance. The resonance con- Conference of Weights and Measures (CGPM)
dition is sensed typically by monitoring the announced a new definition of the second:
light from the optical pumping as a function of The second is the duration of 9,192,631,770 pe-
the microwave frequency. A buffer gas is added riods of the radiation corresponding to the transi-
to the cell containing the rubidium vapor to re- tion between the two hyperfine levels of the
duce the effects of collisions with the cell walls. ground state of the cesium-133 atom. [13th
The commercial versions of rubidium devices General Conference of Weights and Measures
are less expensive and less stable than cesium (1967), Resolution 1.1
clocks, and typically display a systematic fre- International Time Scales Historically, time
quency drift. Still there are several thousand has always been closely tied to the position of
commercial ru bidium cells in use today. the sun in the sky. An ordinary sundial, for ex-
ample, indicates apparent solar time. Keppler's
The Cesium Beam Atomic Clock In the mid- laws allowed scientists to correct apparent solar
'50s commercial cesium beam clocks became time for the earth's elliptic orbit and the inclina-
available for the first time. The early versions tion of the earth's spin axis to the ecliptic plane
were accurate and stable to about one part in (the plane of the earth's orbit). This correction
ATOMIC CLOCKS 106

is called the "Equation of Time," and can often International Atomic Time is a coordinate
be found engraved on sundials. These correc- time scale. The reference elevation is mean sea
tions amount to as much as plus or minus 15 level and frequency corrections are applied in
minutes during the year. This corrected time transferring time measurements between labora-
is often called UTO, Universal Time, "zeroth" tories. The largest gravitational "blue-shift" is
approximation. 1. 8 parts in ten to the 13th for the National
Astronomers discovered that UTO measure- Bureau of Standards (elevation 1.7 km). Accu-
ments at different observatories were not in rate and precise comparisons of hme scales on
agreement. This disagreement amounted to an international basis now require relativistic
about ±30 milliseconds and was traced to the corrections.
fact that the earth is not precisely fixed to In spite of the importance of TAl, many op-
its axis of rotation. That is, the pole wanders erations require a time reference tied to earth
around within a circle of about 15 meters in position (Le., UTI). Unfortunately, the earth is
diameter. Of course, the natural response was a poor clock compared to atomic clocks. Still,
to correct time for the wandering of the pole. earth time is quite important. For example,
This new time was called UTI. small boat owners often use sextants for celes-
In the 1930s, scientists discovered that UTI tial navigation. For these purposes (and others),
had periodic variations. Although corrections pure atomic time is not useful since it is not
were applied to UTI to get UT2, this latter related to earth position. Beginning in 1972,
time scale does not indicate the actual varia- nations agreed to a compromise time scale be-
tions in the earth's rotation rate nor is it sig- tween the needs for atomic time and earth time.
nificantly more uniform than UTI. Today, one The compromise time scale, called Coordinated
seldom hears of UT2. Universal Time (UTC), runs at precisely the
Back near the turn of the century, Simon same rate as TAl, but offset an integral number
Newcombe, at the US Naval Observatory, com- of seconds. When UTI accumulates nearly an
puted a table of future positions of the sun, entire second error relative to UTC, the UTC
moon, and some of the principal planets for clocks are reset one complete second, called a
several years in advance. The computations "leap second."
were based on the best theories available. Such The rate of rotation of the earth is variable at
a table is called an "Ephemeris." Newcombe a few parts in ten to the eighth. (Atomic clocks
discovered, however, that the various celestial are almost a million times less erratic.) Unlike
objects systematically departed from their pre- leap years, leap seconds are not predictable years
dicted positions. He noticed that if the time in advance. Since 1972, eleven leap seconds have
were somehow in error, then the tables would been added (Fall, 1982) to UTC; bringing the
be in good agreement for all of the objects. total time difference to 21 seconds. (TAl was
Newcombe correctly recognized that the earth's set to agree with earth time January 1,1958.)
rate of rotation was not constant. The official time almost everywhere in the
The natural response was to use the Ephemeris world is based on UTC, with an integral number
backwards. That is, when the sun reached its of hours difference for the appropriate time
predicted position, the time was the value listed zone.
in the Ephemeris. Conceptually, this kind of Who Needs It? Many of the uses of precise
time, called Ephemeris Time, is based on the time are related to distance measurements. In
orbital motion of the earth, and should be more fact, a new definition of the meter is expected
uniform in its rate than the earth's rotation, to be in terms of the distance traversed by a
since the earth can undergo geometrical changes light wave in a specified fraction of a second.
that alter its rotational rate. Thus, the cesium beam will be the standard for
The definition of the second, the unit of mea- both the second and the meter. The speed of
sure in the International System (S.I.) of units, light then becomes a defined constant rather
was defined as the fraction 1/86,400 of the than a subject for measurement.
mean solar day (24 X 60 X 60 = 86,400), prior At this point, there are probably a few thou-
to 1956. In 1956 the second was redefined as sand commercial cesium beam devices in opera-
the fraction 1/31,556,925.9747 of the tropical tion. They can be found in television stations,
year 1900. Since 1967, the definition of the telephone control systems, NASA tracking sta-
second has been in terms of cesium, as noted tions, scientific laboratories, standards labora-
above. tories, and military systems, to name a few. The
The definition of the International Atomic Department of Defense is currently developing a
Time Scale (TAl) incorporates the definition of satellite navigation system, called the Global Po-
the second. The formal definition reads: sitioning System (GPS), which is dependent on
state-of-the-art timing systems. Atomic clocks
International Atomic Time (TAl) is the time refer- are essential to the success of the GPS program.
ence coordinate established by the Bureau Interna-
tional de l'Heure (BIH) on the basis of the readings Future Clocks For over 25 years now, the
of atomic clocks functioning in various establish- cesium beam frequency standard has surpassed
ments conforming to the definition of the second, all potential rivals. The accuracies specified by
the time unit of the International System of Units. the national standards laboratories have steadily
[14th CGPM (1971), Resolution 1.1 dropped to finer and finer tolerances. Today·s
107 ATOMIC ENERGY

standards (1982) quote accuracies of a very few within the atomic nucleus. Events that release
parts in ten to the 14th. Still there are new po- atomic energy involve basic changes in nuclear
tential rivals, as well as ideas to improve the ce- structure and result in the formation of one or
sium beam standard. more different nuclides, which may be isotopes
The leading candidate in several laboratories of the original atom or altogether different ele-
is trapped ions. The basic idea is to take a num- ments. The release of atomic energy is thus a
ber of ions of some particular element and hold more fundamental process than the release of
them in a Penning (or rf) trap. The trap can be chemical energy, which merely involves a re-
arranged to cause only insignificant frequency grouping of intact atoms into different molecu-
shifts in the ions, and can hold the same set of lar forms.
ions for extended periods of time. This allows To date, three basic atomic energy mech-
long interaction times with the interrogating anisms have been exploited in practical applica-
signal and hence provides very narrow reso- tions: (I) the fission of certain heavy nuclides;
nance linewidths. (2) the fusion of certain light nuclides; and
A modification of the trapped ion method (3) the process of radioactive decay. These will
cools the ions to temperatures below one Kel- be discussed in the order listed.
vin. The cooling is accomplished by irradiating Fission In fission, a heavy nuclide splits into
trapped ions with a laser set slightly below an two lighter and predominantly unstable nu-
optical transition frequency. The motion of clides, commonly referred to as fission prod-
warm ions will Doppler shift the laser radiation ucts, with the accompanying emission of several
into resonance with the ion, and the ion will neutrons and the release of approximately 200
absorb a photon. At some time later, the ion MeV of energy. Nuclides that readily undergo
will reradiate the photon in a random direction. fission on interaction with low-energy or "slow"
The ions thus absorb photons with an energy neutrons « 0.5 eV) are referred to as fissile
bias below the ion's natural resonance fre- materials. There are three primary fissile mate-
quency but reradiate photons with no such rials:
bias. The energy imbalance is supplied by the (l) Uranium 235, which is a natural constit-
kinetic energy of the ions-thus cooling them. uent of the uranium element and accounts for
In present atomic frequency standards, the 0.71 per cent by weight of that element as
motion of the atoms cause the primary limita- found in nature.
tions on improved accuracy. Although a lot of (2) Plutonium 239, formed by neutron ir-
effort has been expended on attempts to cancel radiation of uranium 238.
first order Doppler shifts, significant difficulties (3) Uranium 233, formed by neutron irradia-
remain. Second order Doppler shifts are also sig- tion of thorium 232.
nificant. Laser cooling may solve both of these Uranium 238, which does not undergo fission
problems. on interaction with slow neutrons, does so on
JAMES A. BARNES interaction with high-energy or "fast" neutrons
(> 0.1 MeV). Table I lists representative fission
References energy distributions for the four nuclides cited.
Ashby, N., and Allan, D. W., "Practical Implications of A useful rule of thumb is that an energy re-
Relativity for a Global Coordinate Time Scale," lease of 200 MeV per fissioning atom corre-
Radio Science, 14,649 (1979). sponds to an output of approximately one
Helwig, H., Evenson, K. M., and Wineland, D. J., megawatt-day of thermal energy per gram of
"Time, Frequency, and Physical Measurement," fissioned matter.
Physics Today, 31, 23 Dec. (1978). Practical applications of fission are based on
Markowitz, W., Hall, R., Essen, L., and Parry, J., "Fre- the principle of a self-sustaining fission chain
quency of Cesium in Terms of Ephemeris Time," reaction, i.e., a reaction in which a neutron
Phys. Rev. Lett., 1, 105-107 (Aug. 1958). emitted by atom A triggers the fission of atom
D. J. Wineland, "Laser Cooling of Atoms," Phys. Rev. B, and one from atom B triggers the fission of
A, 20, 1521 (1979). atom C, and so on. For this to be achieved
requires the assembly of a "critical mass" of
Cross-references: ATOMIC AND MOLECULAR fissile material, i.e., an amount sufficient to re-
BEAMS, ATOMIC PHYSICS, ATOMIC SPECTRA, duce the probability of neutron losses to a
DOPPLER EFFECT, FREQUENCY STANDARDS, threshold value. The amount required depends
KEPPLER'S LAWS OF PLANETARY MOTION, on a number of factors, notably the concentra-
LASERS, MASER, MICROWAVE SPECTROSCOPY, tion of the fissile material used and the compo-
MICROWAVE TRANSMISSION, OPTICAL PUMp· sition and geometry of the reaction system.
lNG, QUANTUM THEORY, RELATIVITY. There are two basic application concepts. One
is the essentially instantaneous fission of a mass
of highly concentrated fissile material in such a
way as to generate an explosive force. This, of
ATOMIC ENERGY
course, is what occurs in atomic weapons.
The terms "atomic energy" and "nuclear en- Atomic explosives are also of interest in con-
ergy" are used interchangeably in the contem- nection with peaceful uses such as large-scale
porary literature to mean energy that originates excavation projects.
ATOMIC ENERGY 108

TABLE 1. ENERGY DISTRIBUTION IN FISSION

Quantity of Energy (MeV)


Slow Fission Fast Fission
Type of Energy 235 U 239pu 233U 238U

Kinetic energy of fission products 165 172 163 163


Kinetic energy of neutrons emitted 5 6 5 5
Instantaneous emission of gamma rays 8 7 7 7
Beta emission during fission product decay 9 9 9 9
Gamma emission during fission product decay 7 7 7 7
Totala ; 194 201 191 191
aExclusive of nonrecoverable energy associated with neutrino emission during fission pr.o~uct decay. All num-
bers are rounded to the nearest integer. It should be mentioned that 8 or 9 MeV of additional energy become
available in a nuclear reactor as the result of neutron capture and subsequent gamma-decay phenomena.

The other application concept is that of the become operative. This means that one or both
controlled and gradual fission of an atomic fuel nuclei must be accelerated ("heated") to veloc-
in a nuclear reactor, which may be designed for ities sufficient to overcome the strong electro-
one or more of the following principal pur- static repulsion that exists between part~c~es
poses: having the same electrical charge. The velOCIties
(I) To provide fluxes or beams of neutrons required correspond to particle "temperatures"
for experimental purposes. This category of use of the order of tens or hundreds of millions of
includes research and materials-testing reactors. degrees, which in turn correspond to particle
(2) To produce materials by neutr<;>fl ir!adia- energies of thousands or tens of thousands of
tion. Examples are reactors used pnmanly to electron volts. The term "thermonuclear" re-
produce plutonium for atomic weapon sto~k­ actions is reserved for fusion reactions in which
piles or for the production of vanous radiO- both nuclei are traveling at high velocity (as
isotopes for use in science and industry . distinct from reactions between an accelerated
(3) To supply energy in the f~rm of heat f?r projectile particle and a static target nucleus, as
such applications as the g~neratlOn of ele.ctnc in particle accelerator experiments).
power, the propulsion of ShIPS or space vehIcles, The only practical application of thermonu-
or the production of process steam. clear reactions developed to date is in thermo-
nuclear weapons (so-called "hydrogen bombs")
The first demonstration of a fission chain in which the energy released by a charge of
reaction was achieved by E. Fermi and co- fissile material serves to create the conditions
workers on December 2, 1942 when the world's required to bring about the reaction of "fusion-
first nuclear reactor (Chicago Pile No . l) was able" materials. The first test of a thermonuclear
successfully operated in a convert~d s9-uash .veapon, which was the first demonstration of a
court beneath Stagg Field at the UniVersIty of man-made thermonuclear reaction, took place
Chicago. on October 31, 1952 at a U.S. testing site in the
The most important application of fission Marshall Islands. Peaceful uses of thermonuclear
promises to be in the electric power field. The explosives are being studied and have the advan-
basis for this expectation is that, if exploited tage, relative to straight fission-based explosives,
efficiently, known and inferred deposits of that problems of radioactive contamination are
atomic fuels represent a potential energy reserve greatly reduced. This reflects the fact that the
many times larger than that of the fossil fuels nuclides formed by fusion are stable and hence,
(coal, oil and natural gas) on which the world's apart from neutron activation effects, the forma-
electric energy economy largely depends at tion of radioactive substances is limited to the
present. fission component of the explosive.
By the end of 1981 approximately 150,000 Research has been in progress for two de-
electrical megawatts of atomic power capacity cades on techniques for controlling the fusion
were in operation, under construction, or process as a means of supplying energy for elec-
planned for construction in the United States. tric power generation. The thermonuclear re-
Fusion Fusion is a general term for reactions actions of primary interest in this context are
in which the nuclei of light elements combine the deuterium-tritium reaction:
to form heavier and more tightly bound nuclei
with the simultaneous release of large amounts
of energy. In order for this to occur the inter- D+T~4He+n + 17.6 MeV
acting nuclei must be brought sufficiently close
together to permit short-range nuclear forces to and the deuterium-deuterium reactions:
109 A TOMIC PHYSICS

13He+n+3.2MeV experimental basis in a number of applications


such as navigational satellites, automatic weather
D+D stations and coastal light buoys, all of which re-
\t quire a compact power source that can operate
T+ P +4.0 MeV
unattended for sustained periods (months or
years). In the case of space applications, alpha-
Deuterium is a stable isotope of hydrogen with emitting radio nuclides such as plutonium 238 or
a natural abundance of 0.0015 per cent. Tritium curium 244 are mainly used as the fuel. In ter-
is an unstable hydrogen isotope with a radio- restrial applications, the principal fuel used to
active half-life of 12.3 years and is produced date is strontium 90, a beta emitter.
from lithium 6 by the neutron-alpha reaction.
The latter thus represents a relatively expensive JOHN F. HOGERTON
"fuel" for thermonuclear reactions; however,
the ignition temperature of the deuterium-
tritium reaction is roughly an order of magni- References
tude lower than that of the deuterium-deuterium Hogerton, John F., "The Atomic Energy Deskbook,"
reactions and the energy release is greater. New York, Van Nostrand Reinhold, 1963.
In most controlled fusion systems as presently Glasstone, Samuel, "Sourcebook on Atomic Energy,"
conceived, the fuel is in the form of an ionized New York, Van Nostrand Reinhold, 1958.
gas, or "plasma," confined by magnetic pressure
within a high-vacuum apparatus. In effect, the Cross-references: FISSION, FUSION, ISOTOPES,
plasma is held in a "magnetic bottle," thereby NUCLEAR REACTIONS, NUCLEAR REACTORS,
preventing fuel particles from dissipating heat NUCLEONICS, RADIOACTIVITY.
in collisions with the physical walls of the appa-
ratus. The objective is to achieve a situation in
which an adequately hot plasma of adequate
density can be magnetically confined for a long ATOMIC PHYSICS
enough interval of time for the desired reaction
to take place. One approach is to constrict and Throughout the twentieth century the study of
confine a high-current discharge of fuel ions and atoms, their internal structure and their interac-
hold the resulting dense plasma in confinement tion with electromagnetic radiation and with
while its temperature is raised by adiabatic com- other particles has played an important part in
pression or other methods. Another approach attempts to understand the physical world. The
is to accelerate fuel ions to high energies and subject of atomic physics has been the testing
then trap them in a magnetic field, maintaining ground of fundamental theories and at the same
confinement long enough for a dense plasma to time the source of many applications.
accumulate. The concept of an atom as an indivisible par-
In experiments in various experimental de- ticle, the ultimate building block of bulk mat-
vices, the time-tern perature-density multiple has ter, is attributed to the ancient Greeks. During
steadily been increased; however, there appears the nineteenth century such atoms, of which it
to be no conclusive evidence that true thermo- was recognized that there had to be many dif-
nuclear conditions have been achieved in any ferent kinds, were identified with the chemical
laboratory. Beyond laboratory demonstration elements of the periodic table. But the era of
of controlled fusion per se lies the problem of modern atomic physics may be said to have be-
demonstrating that devices can be designed to gun with two discoveries, that of the electron
produce more power than they consume and and that of the nuclear atom. The idea of the
beyond that lies the problem of demonstrating indivisible atom was to be abandoned.
the economic feasibility of practical thermo- The first discovery was made in 1897 by
nuclear power plants. J. J. Thomson, who was able to isolate an elec-
The chief incentive for thermonuclear power trically charged particle, an electron, and to
development is the promise of a virtually in- measure the ratio of its charge e to its mass m.
exhaustible energy source, assuming the ultimate Following his work it came to be believed that
use of deuterium as the primary fuel. electrons were constituents of atoms. It was a
Radioactive Decay As radioactive atoms un- little later that R. A. Millikan measured di-
dergo decay by alpha, beta or gamma emission, rectly the electron charge itself: this charge e
heat is generated by the interaction of the radi- was the same for all electrons (by convention,
ation with surrounding matter. Devices that uti- it is taken to be negative). The two results taken
lize this heat to produce electricity are known together led to the conclu~on, already guessed
as isotopic power generators. Research has been at, that the mass of the electron is only about
conducted on a range of such devices for spe- 1/2000 of the mass of a hydrogen atom, the
cialized applications requiring from fractions of lightest element in the periodic table.
a watt to tens of watts of electricity. Thermo- The second discovery was made by Sir Ernest
electric or thermionic techniques are used to Rutherford in 1911. At his instigation Geiger
convert the heat to electricity. At present, iso- and Marsden carried out experiments in which
topic power generators are being used on an alpha particles (which are positively charged)
ATOMIC PHYSICS 110

were scattered from thin foils. The result, very stability of the lowest energy level nor for the
surprising at the time, that many alpha particles rate at which radiation would take place when
were scattered through large angles was ex- an atom decayed from a higher level to a lower
plained quantitatively by Rutherford. He con- one. Even more difficult would be the attempt
cluded that the mass of an atom of the foil is to predict in detail what would happen in an
concentrated in a nucleus whose radius is of electron-atom or atom-atom collision. Whereas
the order of 10- 14 m, very much smaller than the idea had already been grasped that a light
the radius of an atom, 10- 10 m, and that the wave, through its quantum nature, can have the
nucleus carries a positive charge Ze where Z, an feature of a particle, namely localization in
integer, is the atomic number appropriate to space, what was missing was the equally extra-
the ordering of chemical elements in the peri- ordinary idea that a particle, sayan electron,
odic table. The model of a neutral atom, then, can have the feature of a wave-a spreading-out
is that it consists of a small massive nucleus of in space. The latter idea was put forward by
positive charge Ze surrounded by Z electrons, Louis de Broglie in 1924: he attributed to a
each of negative charge e, filling a much larger particle of momentum p a wavelength A = hlp.
volume. Electrostatic forces bind the whole Planck's constant appears again in this relation
thing together. and its size is crucial: electrons bound in atoms
However, the nuclear atom, resembling a have momentum such that their de Broglie
miniature solar system, would not be stable wavelength A is of the order of the size of the
according to the laws of classical physics be- atom. These electrons are only localized to the
cause the orbiting electrons, being charged, extent that they are somewhere inside the atom,
would emit electromagnetic radiation in a con- which is now visualized as a nucleus surrounded
tinuous spectrum of frequencies, lose energy by an electron charge cloud having no well de-
and spiral into the nucleus. This does not hap- fined boundary.
pen: atoms are stable. It is true that they can These quantum ideas were put into mathe-
emit radiation, but only when they are given matical form in 1925. Schrodinger's wave me-
enough energy, for example when free atoms chanics of matter was soon generalized in the
are bombarded by electrons in a light source formal theory of quantum mechanics by Born,
such as an electric arc. The emitted spectrum Heisenberg, Dirac and others. Quantum me-
consists not of a continuous frequency dis- chanics, which includes the famous Uncertainty
tribution but rather of discrete frequencies Principle, is the basis for all calculations of
characteristic of a particular atom. Spectro- physical problems in which the magnitude of
scopists working in the nineteenth century had Planck's constant cannot be considered to be
studied these "spectral lines," many of which negligible, and by this criterion the theory is
appear as light in the visible region, and had certainly needed to describe atomic physics.
measured their frequencies with considerable The birth of quantum mechanics has been a
precision. So more than one difficulty was revolution in the theory of physics, and this
raised by the model of the nuclear atom, and revolution has had in its turn a profound influ-
the problem was tackled in 1913 by Niels Bohr. ence on philosophical thinking.
He had at his disposal the concept of a "quan- The hope that one now understands the prin-
tum" of energy of radiation which had arisen ciples of atomic physics is one thing, but the
in Planck's treatment of blackbody radiation task of making quantitative predictions is
and in Einstein's theory of the photoelectric another, for it involves making detailed calcu-
effect. A quantum of radiation has energy E = lations which are especially difficult in many-
hv where v is the frequency of the radiation electron atoms. Fortunately, approximation
and h is a universal constant known as Planck's methods can be used in treating the forces which
constant, having the value 6.626 X 10- 34 J HZ-I. govern atomic structure, at least in simple atoms
Bohr developed a set of rules which required with few electrons. The largest forces are the
the angular momentum of the orbiting electron electrostatic attraction between nucleus and
in hydrogen to be "quantized," that is, to have electrons, and the electrostatic repulsion be-
only integral values in units of hI21f. It followed tween the electrons themselves. These are dealt
that the energy of the atom was also quantized with first. Electrons are found to have, in addi-
with the consequence that the frequency of tion to their charge, an intrinsic angular mo-
emitted radiation had to satisfy the relation mentum (called electron spin) together with an
hv = Em - En where Em and En are the discrete associated magnetic moment. The interaction
allowed energies of the mth and nth "energy between this magnetic moment and the mag-
levels" of the atom. Bohr's calculation of the netic field which an electron experiences within
allowed frequencies in the spectrum of atomic an atom is a small perturbation in our hierarchy
hydrogen, based on the mechanics of his model, of approximations. This effect leads to a "fine-
agreed with precise spectroscopic measurements. structure" splitting of the energy levels of an
This was a brilliant achievement. atom. Even smaller is a "hyperfine-structure"
Yet Bohr's calculations only worked well for splitting which arises from the interaction be-
hydrogen, the one-electron atom. Further, tween the electron and the magnetic moment
there was still no adequate explanation for the with which the nucleus is endowed (the nu-
111 ATOMIC PHYSICS

cleus, too, has the property of spin angular tralline can be measured accurately because the
momentum). frequency bandwidth of the line is made very
On the theoretical side the aim has generally narrow. Moreover, the atoms, which are made
been to calculate the energies of excitation to travel as a directed beam through a vacuum,
available to an atom in terms of all the inter- do not collide with each other and so may be
actions mentioned above, along with other regarded as completely unperturbed by their
properties such as the average lifetime of an neighbors: they interact only with the applied
excited atom before it decays to a lower energy radio-frequency field. By this method a large
state, emitting electromagnetic radiation (typi- number of measurements, particularly of hyper-
cal lifetimes are of the order of 10- 8 s). In all fine-structure splittings, have been made in
this the role of the modern computer has been nearly all the elements. Such work provided a
important, especially for many-electron atoms body of data on nuclear spins and nuclear mo-
whose complexity would otherwise present in- ments at a time when nuclear physicists could
tractable problems. On the experimental side make good use of it in constructing models to
there have been two main ways of approaching describe the nucleus; the precise measurement
the study of atomic structure. One is by mea- of hyperfine structures also led to a better
suring the frequencies and intensities of spectral quantitative understanding of the behavior of
lines (spectroscopy) and the other is by causing the atomic electrons when they are close to
collisions between atoms and particles-electrons the nucleus.
or other atoms, for example. Whereas optical Technological developments soon followed.
spectroscopy had been a traditional experi- The hyperfine structure of the lowest energy
mental discipline since the middle of the nine- level of cesium could be measured so precisely
teenth century and collision physics began in and reproducibly that a cesium atomic-beam
the 1920s, both experimental approaches en- apparatus was adopted as a time standard, that
joyed the benefits of the enormous technical is, the second is now defined in terms of the
advances made during and after the Second frequency of an atomic clock:
World War.
The ever increasing precision of spectroscopic Av(Cs) = 9 192 631 770 Hz.
methods has had important consequences. One
During this period of prolific activity in the
of these is exemplified by the Lamb-Rutherford 1950s the maser was invented. The first maser
experiment (1950) which is justly famous not was made to work by Townes and his colleagues
only because it was a tour-de-force of experi- in 1954 and subsequently many other workers
mental technique but also because it played a contributed to the development of various kinds
central part in an effort to develop a new and of maser. This device (the word is an acronym
deeper understanding of physics-always the for microwave amplification by stimulated
primary goal. Lamb adapted the methods of emission of radiation) relies for its operation on
radio-frequency spectroscopy to measure the the maintenance, in a dynamic equilibrium, of a
small separation between two energy levels in population inversion in an assembly of atoms:
the lowest excited state of atomic hydrogen, this means that, given two energy levels of an
a separation subsequently called the "Lamb atom, there are more atoms in the upper level
shift." Even the sophisticated relativistic quan- than in the lower. In this way it is possible to
tum mechanics developed by Dirac had not make an oscillator tuned to the frequency of
predicted a Lamb shift. The theory of quantum an atomic spectral line. A particular version,
electrodynamics (also initiated by Dirac in the hydrogen maser developed by Ramsey, has
1927), in which electromagnetic fields were led to a measurement of the hyperfine structure
taken to be quantized, was needed to explain in the lowest level of hydrogen which is utterly
the Lamb shift, and it was Lamb's precise ex- remarkable for its precision:
periment which stimulated efforts to make
accurate calculations from the theory. Quan- Av(H) = 1 420405 751.7662 ± 0.0030 Hz.
tum electrodynamics is regarded as a striking
success because of its ability to achieve precise An optical analog of a maser, called a laser, was
agreement (parts in 10 6 ) between theory and first made to work in 1960. It is not an over-
experiment in the interaction between radia- statement to say that the laser, in its various
tion and matter. The theory is also the fore- forms, has brought about a revolution in optical
runner of more general quantum-field theories technique. For example, the use of a dye laser,
needed in the attempt to understand elementary which is a tunable, nearly monochromatic,
particle physics. powerful light source, is now widespread in
The particular radio-frequency method to spectroscopic work. Many new experiments
which Lamb's experiment is related is called requiring high resolution and high power have
atomic-beam magnetic resonance. This has been performed on atoms and molecules and
been developed by I. I. Rabi and his co-workers, many more are likely to be performed during
beginning in 1938, and has been very fruitful. the 1980s. Just one example of new work is
The method ensures, in an especially elegant that on very highly excited levels of neutral
way, that the frequency of the peak of a spec- atoms. These so-called Rydberg atoms may be
ATOMIC PHYSICS 1I2

regarded as being very large in the sense that References


the outermost electron' has a large "orbit" of
the .order of 10- 6 m radius, and they are also Enge, H. A., Wehr, M. R., Richards, J. A., "Introduc-
fragile because only a tiny perturbation is re- tion to Atomic Physics," Addison-Wesley, 1972.
quired to ionize them. Born, M., "Atomic Physics," London, Blackie & Son
1%~ ,
The study of controlled bombardment of
Woodgate, G. K., "Elementary Atomic Structure,"
at0rr.ts (or molecules) by other particles, in
particular electron-atom, atom-atom and ion- Oxford Univ. Press, 1980.
atom collisions, has had a development parallel Massey, H., "Atomic and Molecular Collisions" Lon-
to that of spectroscopy. The experiments have don, Taylor & Francis Ltd., 1979. '
to be most carefully performed because in November 1981 issue of Physics Today, American
obtaining quantitative results on the beha~ior Institute of Physics, 1981.
of a variety of atoms undergoing collisions it Fortson, E. N., Wilets, 1., "Parity Nonconservation in
Atoms: Status of Theory and Experiment," in "Ad-
is necessary to distinguish one of several po~si­
vances in Atomic and Molecular Physics," Vol. 16,
ble processes from another. For example, elec- p. 319, Academic Press, 1980.
trons used as bombarding particles may be
scattered by atoms, they may excite them they Cross-references: ATOMIC ENERGY; COLLISIONS
may ioniz~ them and they may be captur~d by OF PARTICLES; CONSERVATION LAWS AND
them. RelIable results on the probability of SYMMETRY; ELECTROMAGNETIC THEORY;
such events have to be achieved over a wide ELECTRON; ELECTRON SPIN; ELEMENTS, CHEM-
range of bombarding energies with good resolu- ICAL; LASER; MASER; MOLECULES AND MO-
tion. Only then can they be used both as a LECULAR STRUCTURE; NUCLEAR STRUCTURE;
s~vere test of the quantum-mechanical descrip-
PERIODIC LAW AND PERIODIC TABLE; QUAN-
tion of atoms and as a means of understanding TUM ELECTRODYNAMICS; QUANTUM MECHAN-
the collision processes occurring in more com- ICS; SPECTROSCOPY.
plicated environments.
Naturally, refinement of experimental tech-
nique has brought progress in many different
dire~tions . For example, as a result of the pio- ATOMIC SPECTRA
neenng work of Dehmelt on ions trapped in
electromagnetic fields it has become possible Fundamental Facts Light from electric dis-
to study a single, isolated ion trapped for sev- charges in gases shows line spectra due to free
eral hours on end-a remarkable feat. Some atoms excited by electron collisions. Noble gases
modern work has overlapped other fields: tests and metal vapors produce almost pure atomic
of fundamental symmetry principles and con- spectra, while discharges in molecular gases
servation laws, often regarded primarily as the show both molecular band spectra and atomic
concern of high-energy particle physicists, have line spectra. Some spectral lines can also be ob-
been conducted by means of ingenious experi- served in absorption when white light is made to
p~ss through the gas into a spectroscope. Under
ments on atoms; and close to the interests of
nuclear physicists has been the study of "exotic high spectroscopic resolution, all lines are found
atoms," those containing unstable particles to have nonzero width. This is due to random
playing the role of nucleus or electron for ex- motion (DOPPLER EFFECT) and disturbing in-
ample positronium, muonium muo~ic and fluences of neighboring atoms, molecules, ions
kaonic atoms. ' or electrons (pressure broadening); but even after
Atomic physics is an applied science in the all~w.ance for these effects, a spectral line has a
sense that in other branches of physics it is deflmte, generally very small, width due to radi-
necessary Y> know about atoms. Obviously the ation damping (natural width). Precision mea-
astrophysIcist needs to know about lifetimes s~rement of wavelengths or resolution of very
and structures of excited states of atoms be- fme structures requires light sources giving nar-
cause he is trying to learn about a source of row lines-discharges at low gas- and current-
radiation (a star) which is in no way under his density and low temperature-or even atomic
control! Similarly the study of atmospheres beams at right angles to the line of sight. Con-
both planetary and stellar, relies heavily on th~ tinuous atomic spectra are generally weak under
results of collision physics and of spectroscopy laboratory. co~ditions; in emission, they are due
conducted in .the laboratory, as does work on ~o re~ombmatIo.n of an electron with a positive
plasmas associated with nuclear fusion. Solid- Ion; m absorptIon, to the reverse process of
state physicists clearly must understand the photoionization.
atoms which are the components of their assem- :rhe term atoI?ic spectra includes positive ions,
blies of interacting systems. None of these other With the followmg terminology: spectrum of Na,
fields can be thought of as divorced from atomic arc spectrum, NaI; of Na+, Na*, - - . : first, sec-
physics itself. ond, ... spark spectrum, or NaIl, NaIll, .. - _
~pectra of ~ighly ionized or stripped atoms are
G . K. WOODGATE Important m astrophysics and occur in high-
113 A TOMIC SPECTRA

temperature plasmas. Systems with the same the more rigorous Schrodinger theory, by solving
number of electrons, such as Na, Mg+, AI++ show the wave equation with the assumption of con-
marked similarities and are called iso-electronic stant energy E. Provided E < 0 (bound state), it
sequences. assumes discrete values, those of the stationary
In the ultraviolet, visible or infrared, the spec- states of motion or the eigenvalues of the wave
troscope, in the form of a grating or interferom- equation. Emission and absorption arise from
eter, measures primarily the wavelength (A.) of transitions between two energy levels En, En',
the spectral lines. It is generally expressed in with the frequency of the light given by
angstrom units (A) defined as 10- 8 cm (10 nm)
or, by recent international convention, the frac- Vn,n' = (En' - En)/h (2)
tion 1/6056.12525 of the wavelength of a line of
the isotope 86 of krypton, in air under standard where h is Planck's constant. Equation (1) is a
conditions. The wave number (v or a), the recip- special case of Eq. (2), with En = -hZ 2 Rc/n2.
rocal of the wavelength in vacuo, is measured in Allowance for the motion of the nucleus of
cm- I or kayser (K), or in millikayser (mK). The finite mass M causes R to differ slightly for dif-
frequency v is derived by multiplying by c, the ferent M; it is given by R=/(l + m/M) where
velocity of light in vacuo; in the range of micro- R= = 109737.3 cm- I .
waves and radio frequencies, v is measured The solution of the SCHR6DINGER EQUATION
directly (I mK = 29.9793 MHz). for a mass point in space leads to 3 quantum
In contrast to frequencies, intensities of lines numbers. In polar coordinates, with a force
are strongly dependent on experimental condi- derived from a central potential VCr), the quan-
tions, and special experiments are required for tum numbers nr, J.1 and m give the numbers of
deriving quantities expressing the strength of a nodes of the wave function in the range of the
line as a characteristic constant of the atom. coordinates r, {) and <p. Introducing the azi-
This can be defined in various forms; the f-value muthal quantum number I = 1m I + J.1 and the
is a number giving the ratio of the absorptive or principal quantum number n = nr + I + 1, we
dispersive power of the line to that of the classi- find the set n, I, m to have the following mean-
cal, harmonic electron oscillator of the same ing: the z-component of the angular momentum
frequency; the transition probability or Einstein is Lz = mfi, where fi = h/27T, the square of its ab-
A-value is the probability, per second, of an solute value is ILI2 = l(l + l)fz2, and the energy
excited atom emitting a light quantum. E depends on n and I only. For the special case
Hydrogen-like Spectra The spectra of atoms of the Coulomb field V ~ 1/r, E depends on n
containing one electron only (H, He+, Be++ ... ) alone: En =constant/n 2 . An energy level En has
are very simple if the fine structure is disre- to be considered as consisting of anum ber g of
garded; they form the basis of the classification states of different I and m. This situation is de-
and theory of atomic spectra. Balmer's empirical scribed as degeneracy, and g is the statistical
discovery of a numerical relationship between weigh t of the level. The degeneracy in m is due
the wavelengths of the visible hydrogen lines led to the central symmetry of the force field and
to a formula expressing the wave numbers of all occurs in all atoms in the absence of external
hydrogen-like spectra by one constant R, the fields. The degeneracy in I is peculiar to the
charge number Z (=1 for H; =2 for He+, ... ) Coulomb field in nonrelativistic treatment.
and two integral numbers, n, n' > n: Alkali-like Spectra The spectra of the alkali
atoms and their isoelectronic ions (Li, Na, ... ,
v=Z2R(1/n2 - l/n'2)= Tn - Tn' (1) Be+, Mg+, ... ) show lines arranged in series sim-
ilar to those of hydrogen. Their wave numbers
A series arising from a sequence of values n' is can be represented by empirical relations which
characterized by regularly decreasing spacings are generalizations of Eq. (1). Series of term
and intensities of the lines towards increasing values Tn can be defined in such a way that
wave numbers. Substitution of n = I, 2 and 3 in Tn ~ 0 for n ~ 00, and the observed wave num-
Eq. (I) with Z = 1 gives the Lyman, Balmer and bers are equal to term differences Tn - Tn'
Paschen series, in the ultraviolet, visible and near (Ritz combination prin-ciple). In contrast to
infrared respectively . The wavelengths of the Eq. (I), however, there are several series of
Balmer lines,H Q , H{i and Hoy (n' = 3, 4,5) are terms, so that apart from n, a second index
6562.8, 4861.3 and 4340.5 A. The Lyman 0: number I has to be introduced. The term values
line, the resonance line of hydrogen, has the Tn,1 can then be identified with quantized values
wavelength 1215.7 A. of - E / ch and the index numbers n and I with
The relation of Eq. (1) can be derived theo- quantum numbers, as implied by the letters
retically by applying nonrelativistic quantum chosen, if we assume that in these atoms one
theory to a model consisting of a point electron electron moves in a central force field different
of mass m and charge - e and a fixed point nu- from a Coulomb field. This valency or optical
cleus of charge Ze. In the Bohr-Sommerfeld electron has to be imagined as more loosely
theory, this is done by imposing quantum con- bound than the others and moving in the field
ditions on the classical orbit of the electron; in of electrostatic attraction by the core consisting
ATOMIC SPECTRA 114

of the nucleus of charge Ze and the remaining with a magnetic dipole moment Jl. The inter-
Z - 1 electrons. At large distances from the core, action of Jl with the magnetic field due to the
the field is like that caused by a single charge e orbital motion, the spin-orbit coupling, causes
as in hydrogen. At smaller distances, the optical anyone energy level of given n and I (except for
electron penetrates into the electron cloud of I =0) to split into two levels. They are character-
the core and experiences an increased attraction. ized by a new quantum number j associated
This picture leads to a qualitative understanding with the total angular momentum resulting from
of the term diagrams and spectra of alkali atoms vector addition of Land S. It can have the two
as exemplified for Na in Fig. 1, where terms values j = I ± s = I ± !. Optical transitions are
with I = 0, 1, 2, 3,4, ... are conventionally de- subject to the selection rule !:1j = ± 1 or O. The
scribed as S,P,D,F, G, ' .. terms. For any given width of the doublet splitting increases with
n, an energy level is the further below that of core penetration and thus with decreasing I; it is
hydrogen (the term value the larger) the smaller most prominent in P terms. It decreases rapidly
I becomes, because a smaller angular momentum with increasing n and increases from Li to es. In
decreases the centrifugal force and brings the the term symbol, j is written as suffix and the
electron closer to the core. Emission or absorp- doublet character is indicated by superscript 2.
tion of radiation according to Eq . (2) does not For example, two transitions forming the yellow
occur for all pairs of levels but is subject to the resonance doublet of Na are written 3 2S 112 -
selection rule !:11 = ± 1. Transitions from P levels 3 2PII2 and 3 2S112 - 3 2P312. The absolute value
to the lowest S level form the principal series. of n (= 3 in this case) can be deduced by com-
Its lines can also be observed in absorption since parison with hydrogen (Fig. 1).
the lower level is the ground level; the first mem- Since for all term values Tn -+ 0 for n -+ 00,
ber is the well-known yellow resonance line. the extrapolation of a series Tn - Tn' for n' -+ 00
Transitions from D or higher S levels to the gives the term value Tn. If this is ~ the ground
lowest P level form the diffuse and sharp series, term, ch Tn is the ionization energy of the atom.
those from F levels to the lowest D level, the A convenient conversion formula is 8066 cm- 1
Bergmann or fundamental series. = 1 eV.
A feature that cannot be explained by this Under high resolution, hydrogen-like spectra
model is the doublet structure, a doubling of all are found to have a rather complex structure
except the S levels. It is due to the fact that the known as fine structure (the same name is often
electron possesses a spin, i.e., an intrinsic angular applied to the much wider doublet or multiplet
momentum S of fixed absolute value given by structures); it can be explained by a relativistic
ISI 2 =s(s+ 1)1'1 2 , where s=!, and connected velocity dependence of the electron mass re-
moving the degeneracy of states of different I,
and by magnetic spin-orbit coupling causing a
"8- doublet splitting. In fact, these two effects are
2 2 2 2 e related since the spin itself is relativistic in ori-
Sl/2 Fj/2,312 D3/2 ,5/2 F512 ,7/2 :; gin. The theory gives the result that a hydrogen
5 r--'---"---,-::':"':''-=--,-:':''::':-'.....::--':'''''::':''-.:r--o
level of given n depends on j only, so that, e.g.,
n =2, 1= 0, j =! should coincide with n =2, I =
5 5000 I, j = !. In fact, such terms show a small differ-
4 4 ence known as Lamb shift. Its existence can be
explained by quantum electrodynamics.
I
4 5 ~Oj
,# 10000 A more complete description of the state of an
alkali atom has to treat the core as a dynamical
~ system of many particles. This cannot be done
1 rigorously, but as a useful zero-order approxima-
4 15000 'j'
E
3 u tion, quantum numbers n and I can be assigned
~ to individual electrons; in extension of the sym-

!
'0 20000 f-
> bolism used before, values 1= 0 , 1, 2 . . for . in-
Q)
dividual electrons are described by small letters
I 2
2
25000 s, p, d ... , and the number of electrons of the
same n and I, called equivalent electrons, by a
superscript. Thus two 2p-electrons (n = 2i I = I)
,'"~." 0<0
~-::.
-s 30000 are said to form the configuration 2p . The
II number of equivalent electrons in one atom is
limited by the Pauli principle to 2(21 + 1), i.e.,
35000
to 2 for s-electrons, 6 for p-electrons and 10 for
d-electrons. When this limiting number is com-
40000 pleted, a closed subshell is said to be formed.
Thus the 10 electrons of the atom Ne form the
FIG. 1. Term diagram of Na. Approximate wave- three closed subshells Is2 , 2s2 , and 2p 6, and the
lengths in angstroms. The doublet splitting of the entire atom contains the two closed shells n = 1
terms is not shown. (1s2) and n = 2 (2s2 2p 6) . The same applies to
115 ATOMIC SPECTRA

the 10 electrons forming the core of the atom sion or absorption of a spectral line forms a good
ofNa. approximation for most of the strongest spectra
In a complete subshell or shell the orbital an- in the optical range, from near infrared to near
gular momenta of the electrons cancel one an- ultraviolet. Somewhat arbitrarily but conve-
other out, as do the spins of the individual elec- niently one can define complex spectra as those
trons; the total charge distribution has spherical in which more than one electron outside closed
symmetry. In an alkali atom the quantum num- >
subshells assumes a value of I O. One then has
bers I and s of the outer electron therefore de- to consider the interaction of more than two
termine the orbital- and spin-angular momentum angular momentum vectors of orbits and spins,
of the entire electron structure of the atom. The and a simple interpretation in terms of a vector
ground level of Na, e.g., can be described either model is meaningful only in certain limiting
as an s-level or an S-level, since L = 1 = o. conditions. Very often, especially in low-lying
The classical concept of the core as a rigid levels, a description in terms of the Russell
charge distribution is expressed by the assump- Saunders coupling scheme (L, S coupling) is pos-
tion that the quantum numbers of the core elec- sible because the electrostatic interaction pre-
trons remain constant during the excitation of dominates over the magnetic spin-orbit inter-
the outer electron. action. As a result, we can define terms, each of
Other Simple Spectra Helium, and also Li+, which is characterized by a set of values L, S (or
Be++, ... , have two electrons, and the atoms in a multiplicity 2S + 1), and each term is split into
the second column of the periodic table Be, Mg levels, each characterized by a value of j, the
(also B+, Al+, ... ) have two electrons outside highest of which is equal to L + S. The classifi-
closed shells. The analysis of the spectra leads to cation and terminology are obvious generaliza-
two systems of terms, singlets and triplets, with tions of those for two-electron spectra. For con-
only weak intercombinations; the S terms are figurations of 3; 4; 5 electrons the possible
single also in the triplet system. These facts can multiplicities are respectively: doublets and
be formally described, in analogy to alkali spec- quartets; singlets, triplets and quintets; doublets,
tra, by vector addition of the two spins to form quartets and sextets, etc. The strongest lines
the two possible resultant spin quantum num- arise from transitions between terms of the same
bers S = 0 and 1. The first alternative produces multiplicity. Such line multiplets are often rec-
singlets (j = I), the second triplets (j = 1 - 1, I, ognizable by their characteristic groupings of
I + 1) unless 1 = O. This implies a strong inter- the lines and their intensity ratios. The level
action forming the resultant S of the two spins spacings are governed by the interval rule: they
and a weaker interaction forming j. The latter are in the ratio of the j values, e.g., the levels of
interaction is the same magnetic spin-orbit inter- 4 D 712 512 312 112 have spacings in the ratio
action that causes doublet splitting in alkali 7:5:3:1.' ,
atoms, but the former is, in a less obvious way, Other, often much more complex forms of
due to the electrostatic repulsion between the coupling occur, especially in the higher levels,
two electrons. In all the terms concerned, the and Land S then lose their meaning. One im-
symbol for the configuration shows only one portant property of any level which always
electron to be excited, e.g., Is2p in He, but remains well defined is the parity, and the La-
owing to the identity of the electrons, it is not porte rule states that even terms combine only
possible to attribute the excited state 2p to one with odd terms and vice versa. If the configu-
particular electron. The situation is analogous ration is defined, a level is even or odd if }:,I is
to that of two identical, coupled, linear oscil- even or odd. As one proceeds to higher levels
lators showing two normal modes of vibration, or to heavier elements the concept of the con-
each involving both oscillators in a symmetrical figuration becomes less distinct. A given energy
and an tisym metrical way. Application of Pauli's level may still be said to belong to a certain
principle to the wave mechanical description of configuration, but a more accurate description
this two-electron system leads to two energy often requires the inclusion of one or more
states, one with parallel and one with anti-paral- other configurations of the same parity in the
lel spins, S = I and 0 respectively. Elements in form of a perturbation. In other cases the con-
the subgroup (Zn, Cd, Hg) show similar singlet figuration interaction is so strong that even the
and triplet spectra; in the heavier elements, how- lowest approximation has to be based on the
ever, the magnetic spin-orbit interaction is no concept of mixed configurations.
longer weak compared with the electrostatic re- Hyperfine Structure (hfs) and Isotope Shift
pulsion, and the division into singlet and triplet These structures are usually of the order of
terms has a very restricted meaning. fractions of I cm- I and generally require inter-
In the elements of the third column, B, AI, Ga, ferometric methods for their study. Hyperfine
In, TI, the single electron outside a closed sub- structure is primarily due to the magnetic inter-
shell produces doublet spectra, but in contrast action of the nuclear magnetic moment MN with
to the alkali spectra, the ground term is a P term. the field produced by spins and orbital motions
Complex Spectra The assumption that the of the electrons. A level of given j splits into
quantum numbers n and I of only one of the hyperfine levels, each characterized by a quan-
electrons outside closed shells change in emis- tum number F, where F can assume the values
A TOMIC SPECTRA 116

j+ I, j+I - I, ... , Ij-I I. The nuclear spin lis a In the spectroscopy of hyperfine structures
characteristic property of each nucleus and has and isotope shifts, a limiting factor is often the
integral or half-integral values for even or odd Doppler width of spectral lines caused by the
values of the atomic number. The structure of random motion of the atoms. Cooling of dis-
hyperfine multiplets is similar to that of fine charge tubes and the use of atomic beams for
structure muitiplets, with P, I, j taking the absorption or emission of light have often greatly
place of j, S, L. However, there is often also an reduced this effect but rarely quite eliminated
electrostatic interaction between the electrons it. The recent development of tunable lasers, in
and the nucleus if the nuclear charge distribution the form of dye lasers, has opened up new pos-
has no spherical symmetry. Deviations from the sibilities by providing the spectroscopist with a
interval rule in hyperfine muitiplets have led to highly monochromatic source of continuously
the discovery of such nuclear deformations adjustable frequency and very high intensity.
described mainly by the quadrupole moment Q. The possibility of selective and powerful excita-
Hyperfine structures in ground states can be tion offers obvious advantages for emission
measured very accurately by methods of atomic spectroscopy, but the less obvious advantages
beam resonance. Hyperfine structure studies for absorption spectroscopy are best explained
lead to values of I, approximate values of /IN by two examples. The absorption spectrum is
and Q, and accurate values for the ratio of the scanned by means of the laser whose spectral
two latter for different isotopes. width determines the spectroscopic resolving
For different isotopes of an element the spec- power. If, instead of the transmitted beam, the
tral lines, or the centers of gravity of their hy- fluorescent light is focused on the detector,
perfine muitiplets, are often displaced against the recorded signal is proportional to the beam
each other; this isotope shift has two quite intensity, and the great radiation density of
different causes. The atomic nucleus, though the laser beam allows the study of very weak
heavy, does not stay at rest during the motion absorption lines. In this way absorption lines
of the electrons, and its mass has a small influ- due to two-photon transitions have been ob-
ence on the energy levels, thus causing a mass served. These can occur between two atomic
shift between lines of different isotopes. For levels of equal parity, of energy E and E', at ex-
atoms with only one electron such as H or He+, actly half the frequency corresponding to the
the effect is fully described by a factor I/O + level difference,
m/M) in the term values. For other spectra this
factor, often referred to as normal mass shift, is ,,= ~(E' - E)/h
at best a rough approximation; the influence of
the other electrons is rarely negligible and not with a probability proportional to the square
easy to calculate. All mass shifts, however, de- of the light intensity, in accordance with the
pend on M in a regular, monotonous way for concept of the simultaneous absorption of two
any given spectral line, and are generally much photons. In the case of Na (see Fig. I). Laser
smaller for heavier elements. These facts often light of wavelength 6022.3 A has been found to
allow them to be separated from the more im- cause transitions from the ground state 3 2 S1/2
portant field shifts (or volume shifts) which to 5 2S 1/ 2 which were detected by fluorescence
markedly increase with atomic weight in the of wavelength 6161 A (5 2S 1/2 - 3 2P3/2). The
list of elements, a fact pointing to the nonzero virtual energy level introduced by the theory of
size of the nucleus as a cause. Within the vol- this process half-way between E and E' is then
ume occupied by the nuclear charge, the elec- slightly below 3 2P3/2' Its closeness to this level
trostatic attraction acting on the electron is accounts in the theory for the comparative ease
much smaller than that caused by a positive of observing the two-photon absorption. If the
point charge. This raises the energy levels com- atomic vapor is exposed to two laser beams
pared with their fictitious values for a point traveling in opposite directions, e.g., by being
nucleus. For any two isotopes, the difference placed inside the laser cavity, the spectrum
in neutron number and the resuiting difference shows a very sharp, virtually Doppler-free peak
in nuclear size or shape leads to a difference of caused by two photons of opposite direction,
the energy levels. In the transition to another flanked by much weaker wings of full Doppler
term, a line shift between the two isotopes is width due to photon pairs of the same direction.
then observed, equal to the difference of the Of probably wider application is the technique
level differences. The field shift of levels is of crossing a laser beam with an atomic beam at
particularly large for s-electrons, (l = 0), on right angles. In contrast to earlier work it is
account of their high probability density near now possible to use atomic beams of much
the center, and the observed line shift is there- lower density and therefore of much higher col-
fore large in lines involving transitions of one limation, since the probability of an atom get-
or two electrons from their s-states. Studies of ting excited during its passage through the laser
field shifts have provided valuable information beam is close to I. Such methods of Doppler-
on nuclear structure, especially the shapes and free spectroscopy are proving to be of great
sizes of charge distribution and their dependence value in work on isotope shift and hyperfine
on the number of neutrons. structure.
117 ATOMIC SPECTRA

Magnetic and Electric Effects The effects F of the term and yield an accurate value of the
caused by magnetic fields play a great part in hyperfine splitting for B = O. The various tech-
research on atomic spectra, particularly in mag- niques differ mainly in their methods of detect-
netic resonance methods. An external magnetic ing resonances.
field removes the degeneracy due to the spher- In the atomic beam resonance method (see
ical symmetry of atomic force fields and causes ATOMIC AND MOLECULAR BEAMS) the beam
the energy to depend on the magnetic quantum passes through two strong, magnetic fields hav-
number m . This leads to the formation of the ing gradients of opposite directions, one before
Lorentz triplet or normal Zeeman effect in and the other after passage through the reso-
singlet spectra and to the more complex struc- nance region with a homogeneous field; the
tures of anomalous Zeeman effects in multiplet fields are all in the same direction. Though the
lines. Zeeman effects in hyperfine structures are quantization is different in weak and strong
especially important for the determination of fields, the total component of the angular mo-
nuclear spins. The Stark effect, due to electric menta of the atom remains the same during its
fields, is of somewhat less importance . It causes passage (mF = mj + mj), unless resonance oc-
the energy to depend on I m I only, thus not re- curs, and the two deflections cancel out. A re-
moving the degeneracy completely. While mag- sulting deflection can thus be used for the de-
netic splittings are proportional to the field tection of resonance.
strength, Stark splittings are generally propor- In the methods of double resonance and opti-
tional to the square of the electric field. cal pumping, changes of intensity or polarization
Direct Measurement of Hyperfine Structures. of light in emission or absorption are used for
The optical spectroscopy of hyperfine structures the detection of resonances. These phenomena
aims at the determination of a very small quan- depend on the population of sublevels of differ-
tity (a level difference in an atom) by means of ent m within a large assembly of atoms. Since
measuring the difference between two much the Boltzmann factor is close to I, all these
larger quantities (wavenumbers or frequencies). levels have normally the same population, and
If, instead of these , the small level difference resonance transitions between them have statis-
can be measured directly, as a single, low fre- tically no effect. Absorption of polarized light
quency, one can expect a gain in accuracy on in a resonance line, however, causes an anom-
general grounds; more specifically, since the alous population in the excited state, as shown
Doppler width is proportional to the frequency, by the polarization of the fluorescent light.
such direct measurements are virtually Doppler- When a radiofrequency field is applied, reso-
free. Many hyperfine structures, and also some nances in the excited state tend to restore nor-
other small, spectroscopic effects such as Lamb- mal population and can thus be detected by the
shifts , have been measured in this way to an resulting depolarization of the fluorescence.
accuracy beyond that of optical methods. It The method is known as double resonance . If
has to be remembered that isotope shifts can circularly polarized light is used in the fluores-
not be measured by such direct methods be- cence experiment, the optical transition always
cause they are not due to a level difference in increases (or decreases) the value of m to m + I
one atom. (or m - I), according to the sense of polariza-
Radiofrequency Methods. All hyperfine levels tion and produces a state of partial orientation
and magnetic sublevels within a given configura- in the excited state. The reemission, in accor-
tion have the same parity, and electric dipole dance with the selection rules, then shifts the
transitions between them are forbidden . An population in the ground state in the same
electromagnetic wave can, however, act through sense. A state of partial orientation of the angu-
its magnetic field on the spin and orbital mo- lar momenta and the magnetic moments has
tions and their precession about the direction thus been produced solely by the action of
of an external field B . Whenever the frequency light , a process known as optical pumping. Such
of the radiation field equals the Larmor fre- a state can be remarkably long-lived; it can
quency (l/h times the spacing of the energy easily be monitored by the strength of absorp-
levels), transitions m ~ m ± I can occur, with tion in the resonance line. Resonances in the
equal probability in both directions, and are ground state can then be produced and detected
described as magnetic resonances. A change of in the same way.
m implies a change of the component of angu- Level Crossing and Quantum Beats. A group
lar momentum and magnetic moment along the of methods for measuring hyperfine structures
direction of B. This field is described as weak, and radiative lifetimes of excited states is based
intermediate, or strong according to its effect on the properties of atomic systems in or close
as compared with the hyperfine structure split- to a condition of degeneracy. A plot of the en-
ting. Each resonance point is found by scanning ergy levels of different m, of any hyperfine state,
B at constant frequency. In weak fields, transi- as function of B shows 2F + I lines crossing one
tions between the magnetic levels of each hy- another at B = O. Some of the levels arising
perfine level are measured separately and allow from different values of F and differing m also
the nuclear spin I to be determined; measure- cross at intermediate fields. Any such crossing
ments in intermediate fields involve all values of point represents a degenerate system ; a transi-
A TOMIC SPECTRA 118

tion from this to the ground state "corresponds" edges of the K-, L-· .. bands. The state of an
to a classical, isotropic oscillator. It responds to atom with a "hole" in one of the inner shells,
a light wave by oscillating in the direction of caused by x-ray absorption or electron bom-
the electric vector, emitting light waves of the bardment, represents a highly excited state of
appropriate direction and polarization. Slow the atomic ion. It leads to emission of the x-ray
changes of B from 0 or from its value at other line spectra by transition of an electron from a
crossing points remove the degeneracy and higher shell to the vacant state of the inner
cause precessional motions of the oscillator, shell.
thereby changing or destroying the polariza- Spectra in the gap between x-rays and the
tion of the fluorescence. This change takes near ultraviolet have recently received increas-
place when the frequency of precession (Em - ing attention, partly owing to their importance
Em±d/h is equal to the reciprocal of the radia- to astrophysics. Work in this far and extreme
tive lifetime of the excited state which can thus ultraviolet requires vacuum spectrographs, usu-
be measured. For zero-field crossing, these phe- ally with gratings used at grazing incidence.
nomena, known as the Hanle effect, are often Emission spectra " of highly ionized atoms
used for measuring radiative lifetimes. More in- (stripped atoms) canbe observed in the solar co-
formation is gained from crossings at interme- rona and in stellar nebulae, and in the laboratory
diate fields. The exact value of B for any cross- in condensed sparks and high-temperature plas-
ing point can be determined by the very distinct mas. Their strongest lines are in the extreme
change of polarization, and this allows the ultraviolet; e.g., in work ranging down to about
width of the field-free hyperfine splitting to be 20 A, all the spectra of the isoelectronic se-
derived; this is known as the method of level quence from NaI (Z = 11) to Cu XIX (Z = 29)
crossing. are known. The study of absorption spectra of
In the emi..sion of light from two closely ad- neutral atoms has been extended into the ex-
jacent levels excited by the same source, the treme ultraviolet, partly by the use of the con-
atom passes through a stage during the radia- tinuous background radiation from synchro-
tive lifetime 7 when it can be described by a trons. Such spectra are mainly due to two kinds
mixture of two time-dependent functions. This of processes: either the excitation of one of the
causes a light emission with an intensity modu- electrons in closed shells, e.g., of one of the
lation superimposed on the exponential radia- electrons Is, 2s, or 2p in Na,or the simul-
tive del.".ty. The emission from different atoms taneous excitation of two electrons. A simple
having different velocities is not optically co- example . of the second type is the absorption
herent but contains common Fourier compo- series 1s2 IS - 2snp I P in He between 206 and
nents of the beat frequency. In excitation by 165 A. In both types of absorption spectra,
pulses of duration «7 these beats can be syn- many lines show a peculiar, strongly asym-
chronized and thus made observable, e.g., on metric profile known as Fano profile. It can
an oscilloscope; they are known as quantum be explained by configuration interaction in
beats. This time-resolved spectroscopy has the following way. The same energy as that of
been applied to different kinds of close level the two-electron excitation state 2snp I P can
structures, also in beam-foil spectroscopy where be reached from the normal atom by removal
the foil produces the sudden excitation and the of a single electron with excess kinetic energy,
beam provides the time resolution. i.e., the absorption line falls in the range of the
Far Ultraviolet and X-ray Spectra Atomic continuous absorption extending towards higher
spectra of frequencies about 1000 times those frequencies from the limit n = 00 of the absorp-
of optical spectra have long been known as tion se9.es IS2 IS - Imp I P. Configuration inter-
characteristic x-ray spectra (see ENERGY LEV- action· between the discrete state and the con-
ELS). Though generally observed in condensed tinuous state of the same energy and parity
matter, they are characteristic of the constituent causes the anomalous line profile by a kind of
atoms, only slightly modified by chemical bonds interference effect. The absorption of light
or crystal structure. They are due to the inner within this line leads either to re-emission of the
electrons of the atom which are very tightly same frequency or to nonradiative transition to
bound by the strong attraction of the nuclear the ionized state, a process known as autoioni-
charge, only partly screened by the other elec- zation. In x-ray spectroscopy, the analogous
trons of the same or lower n. The energy of the process had been detected by the appearance of
inner electrons depends primarily on n, as in fast, free electrons and is known as Auger effect
hydrogen-like spectra, and in x-ray terminology (see AUGER EFFECT). The process of autoioni-
one classifies electrons and shells by the value zation is closely related to the inverse process
of n as K -, L-, M, ... electrons and shells for of resonance capture of electrons by ions.
n = I, 2, 3, .... Since the inner shells are filled Particle accelerators normally used in nuclear
in atoms of not too small Z, the absorption structure research have been successfully ap-
spectrum shows mainly continuous bands, due plied to the optical spectroscopy of multiply
to the removal of an electron from one of these ionized atoms. In a technique known as beam-
shells. The photoionization energies for elec- foil method, the high-speed ion beam is made
trons of n = I, 2, ... mark the low-frequency to pass through a very thin foil (usually of car-
\19 AUGER EFFECT

bon) and the spectrum is observed at different Principal Features Auger showed that:
distances from the foil. The rate of decrease of (I) The photoelectron and its Auger electron
intensity allows life times of excited states to arise at the same point.
be determined. In spite of difficulties due to (2) The Auger-electron track length is inde-
cascading processes numerous transition prob- pendent of the wavelength of the primary
abilities could be measured in this way. Modu- x-rays, but the photoelectron track length in-
lation structures superimposed on the expo- creases with x-ray energy.
nential decay are sometimes observed and are (3) The direction of ejection of the Auger
known as quantum beats. They are due to finer electron is independent of that of the photo-
level structures and can be understood in the electron.
way indicated above. (4) Not all photoelectron tracks show a coin-
cident Auger track.
H.G.KuHN
Filling of vacancies by the Auger effect can
occur for any vacancy for which there are two
References electrons in the atom sufficiently less tightly
Kuhn, H. G., "Atomic Spectra," 2nd ed., London, bound that a net positive energy is available for
Longmans, and New York, Academic Press, 1971. the ejected Auger electron. Because photon
Woodgate, G. K., "Elementary Atomic Structure," emission is more easily detected and has played
Oxford Univ. Press, 1980. such an important role in the development of
Hanle, W., and Kleinpoppen, H., "Progress in Atomic quantum theory, it is not generally realized that
Spectroscopy," Vol. A, Plenum Press, New York Auger emission is much more probable. Only
and London, 1978; Vol. B, 1979. for vacancies in the K shell in atoms with
Corney, A., "Atomic and Laser Spectroscopy," Ox- atomic number above 32 and in vacancies in the
ford Univ. Press, 1977. outer two electron states of an atom does pho-
Berry, H. G., "Beam Foil Spectroscopy," Rep. Progr. ton emission dominate. The Auger effect also
Phys. 40, 155 (1977) . occurs after capture of a negative meson by an
atom. As the meson changes energy levels in
Cross-references:
ATOMIC AND MOLECULAR approaching the nucleus, the energy released
BEAMS, ATOMIC PHYSICS, AUGER EFFECT, may be either emitted as a photon or trans-
ELECTRON SPIN, ENERGY LEVELS, MOLEC- ferred directly to an electron which is emitted
ULAR SPECTROSCOPY, OPTICAL PUMPING,
as a fairly high-energy Auger electron (keV for
RAMAN EFFECT AND RAMAN SPECTROCOPY
hydrogen, MeV for heavy elements). Finally we
SCHRODINGER EQUATION, SPECTROSCOPY:
note that each Auger process increases the posi-
X-RAYS, ZEEMAN AND STARK EFFECTS.
tive ionization of the atom by changing one
initial vacancy into two final vacancies.
Energy Spectra of Auger Electrons Auger
spectroscopy is the measurement of the num-
AUGER EFFECT ber, energy, and intensity of lines present. The
spectrum of Auger electrons resulting from a
Definition and History The Auger effect is given vacancy is more complex than the corre-
the filling of an electronic vacancy in the atom sponding photon spectrum. The energy of the
by one electron from a less tightly bound state Auger electron resulting from the filling of a
with the simultaneous emission not of a photo~ vacancy V of energy E(V) by production of
but of a second electron from another less yacancies Xi and Yj of energies E(Xi) and E(Yj)
tightly bound state. IsE(V- XiYj)=E(V)-E(Xi)- E(Yj)- t:.EXiYj,
Following experiments by Barkla (1909) and where t:.EXiYi can be interpreted as either the
Sadler (1917) in which the number of charac- increase in binding energy of the Yj electron
teristic K x-rays emitted by material absorbing due to an Xi vacancy, or vice versa . Experimen-
higher-energy x-rays appeared to be substan- tal energy determination in Auger spectra then
tially less than the number of x-rays absorbed consists of determining t:.EXiYi for each transi-
in the K shell, Kossel (1923) suggested that the tion. Exact calculation of the number of pos-
remaining vacancies might be filled by a radia- sible Auger transitions, their energies, and their
tionless transfer of the excess energy to an relative probabilities necessitates the use of a
emitted electron. This interpretation was reiter- relativistic intermediate-coupling theory. In the
ated by Barkla and Dallas (1924), who observed above notation, X and Y refer to the total
an increase in the number of electrons emitted quantum number of a group of levels, and i and
when x-rays were absorbed. Wilson (1923) had j to the individual substates within the group.
observed in a cloud chamber, simultaneous ejec- At low resolution the Auger spectrum from a
tion of two electrons from the same atom. It vacancy with initial total quantum number n
remained for Auger (1925, 1926) to make sys- always consists of at least three well separated
tematic investigations of this phenomenon in groups which can be characterized by vacancies
argon. The effect has since been called the asn~(n+ l)(n+l),n~(n+l)(n+m),and
Auger effect, and the ejected electrons have n ~ (n + m) (n + m), where (n + m) represents
beeri called Auger electrons. all final vacancies with total quantum numbers
AUGER EFFECT 120

greater than or equal to (n + 2). Thus for an L-MN and L-NN spectra. Considerable medium
initial K vacancy we have K-LL, K-LX, and resolution data exist for M and N Auger spectra.
K-XY groups where X and Y stand for all vacan- While the global K-Auger intensity is easy to
cies with quantum numbers equal to or greater obtain by integrating under the spectrum, the
than n + 2 (in this case 3). Similarly, for an L 3 total L 1 , L 2, and L 3 intensities are difficult to
primary vacancy we have L3-MM, L3-MX, and obtain because of the overlap of the three
L3-XY, and similarly for Ll and L2. But for bands.
the L shell the groups do not appear well sepa- Relative Probability of Auger and X-ray Emis-
rated because the L 1 and L2 groups overlap sion The evaluation of the relative probability
each other and the L3 groups. of x-ray and Auger emission for different initial
When a higher resolution is used each band is vacancies, and the determination of the relative
seen to be composed of numerous lines of intensities of various Auger lines constitute one
which many are ordinary lines resulting from a of the most important aspects of Auger effect
single initial vacancy and others are satellites research.
resulting from multiple initial vacancies, mul- The fluorescence yield, Wi, for any initial
tiple Auger processes, and other complex vacancy i is defined as the fraction of vacancies
phenomena. filled by emission of photons. The Auger yield
Complete interpretation of the ordinary lines is defined correspondingly as the fraction filled
necessitates relativistic-intermediate coupling by emission of Auger electrons. The Auger
which is a combination of L-S coupling (small yield is divided into two parts, one (denoted by
binding energies) and j-j coupling (very large ail which transfers the vacancy to a level with a
binding energies). As an example of these inter- higher total quantum number, and the other
pretations we show in Table 1 the designations (denoted by iii) which transfers the vacancy to
for the K -LL group. a lower-energy vacancy with the same total
Auger spectra also enable the relative intensi- quantum number. The latter process is called
ties of the lines or groups of lines to be deter- the Coster-Kronig effect.
mined. At high resolution the K-LL intensities For the K shell the following equation holds
of all the ordinary lines can be accurately mea-
sured. No theoretical calculation gives the cor- 1 = wK + aK·
rect relative intensities for all values of atomic
number but relativistic j-j coupling gives quite Although in principle WK(Z) should be readily
good agreement above 80. K-LX and K-XY lines calculable, the number of (frequently relativis-
are less well resolved and theories are less devel- tic) electron wave functions which must be
oped. For L-MM spectra the experimental in- known for each Z, and the number of permuta-
tensities are much less precise due to the much tions and combinations of these functions
lower energies and the overlapping of the bands. which must be handled in order to calculate
Nonrelativistic L-S and j-j coupling theoretical the individual probability of every line and, by
intensities have been calculated, and crude summing, the total K-Auger probability per
agreement is obtained with the former for low unit time, present a formidable problem even
atomic numbers and with the latter for high with the aid of a sophisticated computer. The
atomic ~umbers. Very little good data exist for K fluorescence yield is therefore normally

TABLE

L~Coupling Intermediate i-i Coupling


(light elements) Coupling (heavy elements)

K - 2s2s ISO K-LILI ISO K-LILI


1Pl K-L 1L 2 1P 1
K-LIL2
K- 2s2p K-L 1L 2 3pO
3p K-L 1L 3 3P2 K-LI L 3
0,1,2
K-L 1L 3 3P2
ISO K-L2L2 ISO K-L2L2
ID2 K-L2 L 3 1D2
K-L2 L 3
K- 2p2p K-L2L3 3P 1 parity
forbidden
3p K-L 3L 3 3PO
0,1,2
K-L 3L 3 3p2 K-L 3L 3

For K·Auger electrons, measurements and comparison with theory


are extensive.
121 AUGER EFFECT

found by fitting theoretical expressions fOT WK of photoelectrons by x-rays, there has been a
containing empirical constants to the experi- strong recent upsurge in interest in several other
mental values. For 20 < z < 55, WK is given to types of experimental work which involve
a few per cent by Auger spectra closely:
(1) Vacancies created in radioactive decay.
wK = (1 + 7.8 X 105 Z-4)-1 (2) Vacancies created in electron-ion and ion-
ion collisions in gases.
and somewhat less accurately for 10 < z < 20. (3) Auger electrons emitted from the surface
For z < 10 the experimental errors are large layers of solid targets in ultrahigh vacuum when
because of solid state and molecular effects, the surface is bombarded by electrons up to 3
and the theoretical estimates may also be con- keV and sometimes by ions.
siderably in error. When radioactive decay occurs, internal con-
For Z > 50 it is better to estimate aK since version, shake-off, accompanying beta-decay,
relativity has an important effect on aK but not and electron capture all create vacancies. In
on wK which is very close to unity. The follow- fact, the only method of studying the relative
ing equation probably predicts aK to 10 to 15 probability of orbital capture in the various
per cent and hence wK to a per cent or better shells and subshells of a nuclide involves the
for z > 50: study of the x-ray or Auger spectrum of the
aK = [1- (1 +7.8 X 10 5 )-1] (l +3.5 X 10-5 Z2 product nuclide. The interpretation of these
spectra necessitates the knowledge of the con-
+ 3 X 10-8 Z4). stants discussed above.
The importance of the Auger effect for radio-
For the L shell in the j-j coupling limit the active nuclei, as well as the almost 100 per cent
following equations hold : probability of Auger and Coster-Kronig emis-
sion for the M and higher levels, is indicated by
1 = W3 + a3 measurement of the total charge accumulated
by certain radioactive nuclei. For example, a
= W2 + a2 + [23 vacancy produced in xenon 131 m by internal
= WI + a 1 + [12 + [13· conversion gives rise in some cases to as many
as 21 Auger processes, leaving a xenon ion with
In addition, the average L-fluorescence yield a charge of - 22e.
WL for an atom with an L vacancy, having The highest-resolution Auger spectra are those
probability n I, n2, and n 3 of being in each of from vacancies created in gaseous atoms or
the three subshells, is molecules at low pressure by bombardment by
electrons or other ions. Such spectra are partic-
wL = 1 - OL =nl (WI + [12 W 2 + [13 W 3 ularly rich in lines (satellites) arising from mul-
tiple vacancies, and their interpretation has con-
+[ld23w3)+n2(w2 +[23 w 3)+n3 w 3· tributed to our understanding of the processes
The values of the nine L-shell constants are which take place in such collisions.
much less well known than the two constants Since about 1967 there has been great inter-
for the K shell. Below atomic number 50 the est in the study of surface impurities by means
Wi are all less than 6 per cent and appear to be of the Auger spectra of solid surfaces in ultra-
less than 50 per cent for all elements. W3 is high vacuum bombarded by low energy, up to
given reasonably accurately by 3 keV, electrons. The experiments are usually
done by modifying a Low Energy Electron Dif-
W3 = (1 + 0.82 X 108 Z-4)-1 Z > 65 fraction (LEED) apparatus to provide it with a
retarding grid. By varying the retarding voltage,
Z>50. an integral spectrum of the scattered and sec-
ondary electrons is obtained. By applying a
W2 is much less accurately known but the small alternating voltage to this grid and tuning
agreement between theory and experiment is the detector amplifier to twice the frequency, a
fairly good. The knowledge of WI is relatively very sharp differential line spectrum of scat-
poor. The Coster-Kronig yields go through sud- tered and Auger electrons is obtained. Impurity
den changes in value as certain transitions be- detection of better than "5b of a monolayer is
come energetically possible or impossible. For claimed in some cases. Although it can be
example, L2 - L3 M transitions are forbidden shown that a deflection-type spectrometer with
for Z from 50 to 73. Experimental values for an alternating voltage on the detector has in
[13 and [23 have precisions ranging from 5 to principle a better signal-to-noise ratio than the
20 per cent, while [12 has only a precision of retarding grid, the combination of the latter
about 30 per cent. Agreement with theory is with electron diffraction studies of the same
not very good. The wL and aL clearly depend surface constitutes an extremely powerful tool
on the type of excitation. for the study of surface physics and chemistry.
Although initial interest in the Auger effect
arose through creation of vacancies by ejection SHERWOOD K. HAYNES
AUGER EFFECT 122

References The general term "aurora" (aurora borealis or


Burhop, E. H., "The Auger Effect and Other Radia- northern lights in the northern hemisphere,
tionless Processes," Cambridge, The University Press, aurora australis or southern lights in the southern
1952. hemisphere) refers to the luminous emissions
Listengarten, M. A., "The Auger Effect" (a Review), commonly seen in the night sky in polar regions.
Bull. Acad. Sci. USSR, 24 (9),1050 (1960). The aurora is much brighter than airglow (see
Burhop, E. H. S., and Asaad, W. H., "The Auger Ef- AIRGLOW), which is a uniform, subvisuallumi-
fect," in "Advances in Atomic and Molecular Phys- nosity and is worldwide in occurrence.
ics," New York, Academic Press 1972. Auroral forms typically take the shape of arcs
Chang, C. C., "Auger Electron Spectroscopy," Surface or bands that align along parallels of geomag-
Sci., 25, 53-79 (1971). netic latitude (see Fig. 1), and often exhibit a
Sevier, K., "Low Energy Electron Spectroscopy," New vertical striated structure (rays) that align along
York, John Wiley & Sons Inc., 1972. the local direction of the earth's magnetic field
Haynes, S. K., "Phenomenological Systematics of L- (see Fig. 2). The height of the aurora may be as
Auger Spectra," Summary of the IUPAP Conference low as -80 km and as high as 300-500 km, but
on Inner Shell Ionization Phenomena, Atlanta, most auroras occur around 110 km. Because
Georgia, April 1972. the extent of auroras commonly exceed the
limited field of view from a ground station,
Cross-references: ATOMIC PHYSICS, ISOTOPES many of the auroral forms reported from the
PHOTOELECTRICITY, X·RA YS. ground (arcs, bands, coronas, etc.) are only ap-
parent shapes, distorted by perspective effects.
The aurora occurs in two oval-shaped (nearly
AURORA circular) bands encircling the north and south
geomagnetic poles, locating about 25° of lati-
The earliest auroral descriptions date back to tude from these poles on the night side of the
the Old Testament and early Chinese chronol- earth, and about 12° from the poles on the day-
ogies. Up until the Renaissance period, there side (see Fig. 3). This instantaneous auroral dis-
was little scientific understanding and most tribution is called the "auroral oval." As the
reports involved superstitious, prognostic, or aurora is typically brighter near local midnight,
religious interpretations. The metaphor "aurora the region of highest probability of seeing an
borealis" seems to have been first used by Gal- aurora is the projection of the midnight sector
ileo, and many famous 17th-19th Century sci- of the auroral oval onto the rotating earth; this
entists (including Halley, Descartes, Celsius, is a circular region about 25° latitude from the
Dalton, Biot, and Angstrom) speculated on the geomagnetic poles, called the "auroral zone."
nature of the phenomenon. However, little real During periods of geomagnetic storms (see
progress in understanding was made until the GEOPHYSICS) the auroral oval shifts to lower
era of polar exploration resulted in systematic latitudes (see Fig. 4) and at rare times is seen
data gathering from high latitude regions. over the tropics.

(a)

(b)
FIG. 1. (a) Wide angle photograph showing typical appearance of an auroral arc as seen from the
ground. (b) Pictures of an auroral arc as seen from space (Spacelab) (R. H.Eather).
123 AURORA

1
I

,.,.---

FIG. 2. An illustration of the variety of auroral structures (NASA).

Auroras are observed to occur simultaneously the International Brightness Coefficient (see
in the northern and southern hemispheres, Table 2).
showing remarkably similar shapes and radiances The auroral light is produced by bombard-
at opposite ends of the same magnetic field line. ment of the upper atmosphere by energetic
Such auroras are said to be magnetically con- electrons and protons in the energy range of
jugate (see Fig. 5). about one-tenth to tens of keY, with electrons
The auroral light arises from spectral emis- normally being the dominant energy source.
sions by atmospheric gases in atomic, molecular The energy of the particles determines how far
and ionized states. The most important auroral they penetrate into the atmosphere, and so
emissions, and approximate relative radiances, determines auroral height. Because atmospheric
are listed in Table 1. The strongest emission is composition changes significantly with height
the yellow-green line of atomic oxygen. above the turbopause, the relative importance
The total integrated radiance of the aurora of various spectral features (Table I), and hence
may vary over many orders of magnitude, and auroral color, also are functions of height. If
is often classified on a scale of I to 4 called auroral radiance does not exceed the color
AURORA 124

Antarctica

FIG. 3. A composite of satellite pictures taken over Antarctica near midwinter. The geomagnetic
south pole is marked with an X, and the geographic south pole with •. Local time is indicated around
the edges of the picture. The active bright nightside aurora is illustrative of a intense substorm; the
dayside aurora is much less intense, and shows a common fanning out of structures from near midday
(S. I. Akasafu).

threshold of the eye (-IBe 2, see Table 2), the The energetic electrons (and protons) that
aurora appears whitish in color. When this generate the aurora have their primary source
threshold is exceeded (scotopic vision), the in the solar wind (see MAGNETOSPHERIC RA-
strong auroral emissions in the blue, green DIATION BELTS and Fig. 6). (A secondary prob-
and red regions of the spectrum (Table 1) can able source of auroral particles is the earth's
combine to give a wide range of hues. Typically, upper ionosphere.) Details of the mechanism
however, bright auroras appear yellowish-green whereby solar wind particles gain entry to the
(557.70I), sometimes with blue- (Ni) or red magnetosphere are not clear but probably in-
On
(N 2 , lower borders, and sometimes with
red upper regions (630.0/636.4 01). The auroras
volve merging of the earth's magnetic field with
the interplanetary magnetic field imbedded in
seen at low latitudes during large magnetic the solar wind. It is likely that particles entering
storms are often excited by lower energy elec- the magnetosphere through the dayside cusp
trons, giving high red auroras. regions have fairly direct access to the dayside
125 AURORA

FIG. 4. Satellite pictures showing the location of the auroral oval over western Europe for different
levels of magnetic activity. For quiet conditions (left panel) the aurora locates north of the northern
coast of Scandinavia. For weak magnetic activity (center panel), the aurora is seen over central Scan-
dinavia. During moderate disturbances, auroras are seen from southern Scandinavia (right panel) (Air
Force Geophysics Lab, R. H. Eather).

FIG. 5. Photographs taken from aircraft flying under the same magnetic field line in the northern
hemisphere (over Alaska) and the southern hemisphere (between New Zealand and Antarctica). The
pictures are mirror images of each other (auroral conjugacy) (T. Neil Davis).

TABLE 1. Main Auroral Emissions in Visible Region.

Spectral Region Approx. Relative


Emission (nm) Height Radiance*

01 557.7 yellow-green 110 km 1.


630.0/636.4 red 175 km .1 --+ 10
N2+1stneg 391.4 violet 1l0km .6
427.8 blue 1l0km .2
N2 lst pos red 90-100 km .5
N2 2nd pos violet 90-100 km 1.
02+ 1st neg yellow-red 90-100 km .1
H Balmer 486.1 H, 656.3 H 120km .1

• Radiance refers to an emitting surface and is not strictly the correct


term in this context. Auroral "radiance" is actually an integrated column
emission rate, and is often quoted in units of Rayleighs, where 1 R = 10' 0
photons m - 2 (column) sec.
AURORA 126

TABLE 2. Auroral Radiance* Classification.

International 557.7 OJ Approximate


Brightness Column Emission Description of Luminance
Coefficient Rate (kR)* Brightness (ft Lamberts)

1 1 Milky Way 10- 4


2 10 thin moonlit cirrus 10-3
3 100 moonlit cumulus 10-2
4 1000 provides ground illumination 10- 1
similar to full moonlight

*See note at bottom of Table 1.


NOTE: Approximate photographic exposure required (fl.4 lens, ASA 400) is 1-2 sees
for IBC 3 aurora.

ionosphere and generate dayside aurora. These ably occurs by magnetospheric electric fields
precipitating particles are typically low energy that drive particles from the flanks of the mag-
(0.1-1 keY) and generate weak, reddish aurora netosphere to the plasma sheet region, and ad-
(often subvisual) at heights of :<: 150 km. The ditional (possibly primary) energization by field
particles generating the more intense nightime aligned electric fields has been discovered at
aurora have higher energies (-I-lOs keY) and a low altitudes (10-15,000 km) above the auroral
source region in the plasma sheet. The accelera- ovals.
tion mechanism required to energize the solar The auroral oval displays considerable dy-
wind particles to keY energies have not been namic behavior on a time scale of the order of
definitively identified. Some energization prob- hours (Fig. 7). Periodically, intense auroral

- - ---
MAGNETOSHEATH
~

-
-
S OLAR WIND

-
-
-
-
-
- -
FIG. 6. Sketch showing the main particle region in the magnetosphere. The nighttime aurora occurs
at the base of magnetic field lines that thread the plasma sheet. Dayside auroras occur at the base of
the polar cusp field line (G. Paschman).
127 AURORA

FIG. 7. These fisheye pictures show the complete sky, and illustrate how auroral forms may rapidly
change during a substorm. This sequence shows changes over a 10 min. period of sub storm development
CR. H. Eather}.
AURORA 128

brightenings appear in the midnight region, fol- lite based detectors. It is such coordinated ex-
lowed by a characteristic poleward expansion in periments, between ground station arrays and
the midnight sector, westward travelling surges multiple satellites both inside and outside the
in the evening sector, pulsating forms in the magnetosphere, that will eventually result
morning sector, and equatorward shifts of day- in definitive answers to three very important
side aurora. This sequence of events, lasting unanswered questions-what is the entry mech-
!-l hour, is called an auroral substorm. The anism(s) for solar wind particles into the mag-
frequency and intensity of substorms increases netosphere; what is the acceleration mech-
during worldwide magnetic storms. The trigger- anism(s) inside the magnetosphere; and what
ing process that initiates this substorm sequence triggers and drives the substorm process?
is not yet identified. The uncertainties concern
whether it is an internal (to the magnetosphere) ROBERT H. EATHER
relaxation process that suddenly releases stored
energy in the tail, or whether the process is ex-
ternally driven by fluctuations in the solar wind. References
There are many phenomena associated with Chamberlain, J. W., "Physics of the Aurora and Air-
the aurora other than the visible light. The glow," Academic Press, New York and London,
auroral ovals carry large electric currents (the 1961.
auroral electrojet) that represent closure paths Eather, R. H., "Majestic Lights-The Aurora in Sci-
for large-scale magnetospheric current systems. ence, History and the Arts," Amer. Geophys. Union,
These currents have strong magnetic effects at Washington, D.C., 1980.
ground level. Ionization in the aurora results in Omholt, A., "The Optical Aurora," Springer-Veraig,
radio-wave reflections at lower frequencies and Berlin-Heidelberg, 1970.
absorption and scintillations at higher frequen- Vallance Jones, A., "Aurora," D. Reidel, Dordrecht-
cies. Electromagnetic emissions from the aurora Holland, 1974.
include x-ray, ultraviolet and infrared, vlf, radio-
wave, and ulf emissions. Auroral heating leads
to a wide variety of chemical changes. Cross-references: AIRGLOW, ELECTRICAL DIS-
The aurora is the only visible manifestation CHARGE IN GASES, GEOPHYSICS, IONOSPHERE,
of the dynamic electromagnetic environment MAGNETIC FIELD, MAGNETOSPHERIC RADIA-
around the earth. This facilitates coordinated TION BELTS, PLASMAS, SOLAR PHYSICS, SPACE
experiments between ground-based and satel- PHYSICS.
B
BETATRON rapidly than l/r, where r is the radius of the
orbit. If this latter requirement is met, the radial
A betatron is a particle accelerator which uses a focusing will be insured; the required centripetal
sustained induced voltage to accelerate charged force to hold the particle in the circle going
particles to full energy during the whole period around the core decreases as l/r. Consequently,
of acceleration of the particle. Since this method if the magnetic force decreases less rapidly than
of acceleration seemed most applicable to elec- this, it will be too strong at large radii to permit
trons, the name betatron was used to indicate the orbit to remain circulating at a large radius,
that it was the agency for producing high-speed and it will be too weak at small radii to main-
electrons. The accelerating action in a betatron tain the particle circulating at a small radius.
is similar to the action of an electrical trans- The particle thus will oscillate about the so-
former in which a high-voltage winding of many called equilibrium orbit radius when the mag-
turns is used. In a transformer, the voltage can netic force has the right value to supply the
be stepped up from a primary voltage Vl to the required centripetal force. The axial or radial
secondary voltage V 2 , oscillations of the particle about the eqUilibrium
V2 = V l N 2 /N l orbit are called betatron oscillations, and this
name appears in the scientific literature refer-
where N2 is the large number of turns of the ring to particle motions and focusing in other
secondary coil and Nl is the small number of accelera tors.
turns in the primary winding. For example, It is possible to use "strong focusing" magnets
diagnostic x-ray transformers producing high with much more rapid variation of magnetic
voltage, such as 100 000 volts, have very many field with radius to provide axial and radial
turns of fine wire and consequently raise the focusing. In this case a succession of strong
primary voltage by a large factor. focusing and defocusing magnets must be used
The accelerating or voltage generating struc- which alternately focus vertical (axial) and
ture of a betatron is really a transformer; particle radial motion. The net result is a focusing
guiding, or focusing, magnets are arranged action, just as for optical lenses. Such magnets
around the transformer core where a secondary are called alternate gradient focusing magnets,
winding might be put. A toroidal vacuum vessel and their value is that they can limit the ampli-
is placed between the poles of the focusing tude of oscillation of a particle about the
magnets so that electrons can travel hundreds equilibrium orbit to a very small size. This
of thousands of times around the core. Each makes possible very large circumference high
time the electron circulates around the core it energy accelerators without immense magnetic
acquires an energy equivalent to the voltage guide field spaces. For example, the magnets
which would have been induced in one turn for the large synchrotrons at CERN and at the
of wire at that instant. The very long path re- Fermi National Accelerator Laboratory provid-
quires magnetic and electrostatic field errors to ing energies of several hundred Ge V are such
be small, particularly when the particles are alternate gradient magnets; but in these acceler-
starting at low speed and are perturbed by small ators the acceleration is not done by a continu-
magnitude field errors. ously rising betatron flux within the orbit. In-
In order to guide the electrons the focusing stead, small transformer cores which operate at
magnet can be such that its magnetic field de- radio frequencies link the orbit so that whenever
creases with increasing radius. Then the lines of the particle returns to the vicinity of the core
force going from pole to pole bulge outwardly the flux is rising in the direction to accelerate
across the orbital plane. This provides forces, the particle. Thus the extremely large iron core
which are always perpendicular to the field line, of a betatron is not needed.
which have a component directed back toward The usual betatron has a focusing magnetic
the orbital plane in case the particle strays away field which rises proportionately with the
from this plane. This bulging field is also used increase in the transformer's magnetic flux
for vertical focusing cyclotrons. However, it is within the orbit. Thus the guiding field increases
necessary that the magnetic field decrease less proportionately with the momentum gained by

129
BETATRON 130

FIG. 1. The 320 MeV betatron at the University of Illinois near completion. The six guide
field magnets placed around the central leg of the big accelerating core would form a syn-
chrotron if a radiofrequency accelerating cavity were used for acceleration instead of the
transformer. 5 watts of x-rays were produced by this accelerator. The light radiation loss
from the revolving 320 MeV electrons was nine percent, which had to be made up by extra
flux driven through the core. The original 2-MeV typewriter-size betatron, if made in the
image of this 320 MeV betatron and biased, would be the size of a match box.

the transformer action, and it provides sufficient or industrial betatrons is of the order of 100 to
magnetic force to hold the particle at a constant 200 roentgens/min at a meter from the target
radius. for x-rays. With the extracted electron beam,
The first betatron of this type produced 2 Me V the ionization doses would depend on how
and radiation equivalent to 2 grams of radium. widely the electrons are spread at the point of
It is now in the Smithsonian Museum in Wash- treatment, but comparable doses are obtained.
ington, D.C. This accelerator is the size of a The large 320-MeV betatron at the University
typewriter. The largest betatron (see Fig. 1) of Illinois, which is shown in the figure, produced
could generate beams of 320 MeV. The x-rays intensities of the order of 20 000 roentgens/min
and electrons from it were used for experiments at a meter or other terms, of the order of 5
to produce mesons and numerous nuclear disin- watts.
tegrations. At such energies circulating electrons There are possibilities for increasing the inten-
radiate an important fraction of their energy as sity of radiation from betatrons. One is the
light. This is a limitation on the use of betatron fixed-field alternating-gradient betatron (FF AG
action for very high energy electrons, but syn- betatron). In this case, the focusing field is con-
chrotrons are better able to make up this energy stant in time, and the particle orbit can be
loss. The most commonly used betatrons are caused to spiral either outwardly or inwardly
for 25 to 35 MeV. These provide x-rays of with increasing energy to the high field region
maximum penetration in iron for industrial of the direct current magnet or permanent mag-
radiography, and they provide x-rays and elec- net poles. The electrons can thus be injected
trons with optimum depth dose characteristics and accepted by the guide field continuously;
for x-ray or electron beam therapy of the human and with a transformer accelerating core with a
body. sinusoidal flux variation a beam of x-rays could
The intensity of radiation from the medical be produced about 20 percent of the time.
l31 BIOMEDICAL INSTRUMENTATION

Betatrons with time-varying focusing fields just Thus the flux change within the orbit during
give one pulse every cycle of the transformer acceleration is twice as big as the flux would be
core, and the pulses are only of the order of a if the flux density B were uniform within the
microsecond duration. Because of the large orbit. Therefore, the transformer core must be
duty factor available with FFAG betatrons, it adjusted so the proper excess flux provided
should be possible to achieve intensities of within the orbit meets the conditions of this
10000 watts of electrons. Although FFAG last relation if the orbit is to be at a constant
combination betatrons and synchrotrons have radius, the assumption made in the above
been made, full advantage has not been taken derivation.
of a large duty factor achievable by incorporat- The flux condition allows the core of the
ing a full-size transformer core within the beta- transformer to be biased with <1>1, reversed
tron orbit. compared with <1>2. Thus the iron in the trans-
Another possibility for increased intensity is former can be started at -16 000 gauss and
operation at a higher frequency or loading each reversed to +16 000 gauss while the orbit field
cycle with more electrons. Medical and indus- B goes from zero to its maximum value. Biased
trial betatrons are filled to approximately their betatrons are thus much smaller than unbiased
space-charge limit at the injection energy of betatrons.
about 60 000 volts. But as the energy increases
the space-charge limit rises because the beam's D. W. KERST
rising current magnetic pinching of itself coun-
teracts its space-charge repulsion. At relativistic Cross-references: ACCELERATOR, PARTICLE; AC-
energies the limit on current is very high-hence CELERATORS, LINEAR; ACCELERATORS, VAN
injection at higher energy allows much more DE GRAAFF; CYCLOTRON; SYNCHROTRON.
current to be held in the guide field. Some at-
tempts at holding high currents in large aperture
guide fields have been made. BIOMEDICAL INSTRUMENTATION
In the case of the conventional betatron with
constant orbit radius, the relation between the This article is restricted to examples of physical
strength of guiding field and total flux change measurements on humans which are useful in
within the orbit can be found as follows: the diagnosis and management of disease. It
The momentum of the particle in the orbit ignores measurements of physiologic variables
for research purposes, measurements in other
e animal species or in plants (for crop manage-
P=-BR ment, for example), and measurements based
c primarily on chemical processes. It also excludes
devices to augment patient function, such as
while the rate of change of momentum artificial organs, limbs, and other prostheses.
Many measurements involve the use of a trans-
dp ducer to convert some physical variable into
-=eE another variable more easily sensed by the ob-
dt
server. Often the transduction process involves
where e is the charge of the electron, c is the an electrical signal as an intermediate stage,
velocity of light, B is the magnetic field in because of the ease with which such signals can
gauss, R is the radius in centimeters, and E is be processed (amplified, filtered, etc.). An elec-
the electric field in electrostatic units. trical intermediary is not always required: blood
pressure is measured with a cuff and stetho-
1 d<l> scope, and x-rays blacken a photographic film.
21TRE= - - Measurable mechanical quantities which may
c dt be useful are displacement, velocity, accelera-
tion, force or pressure. Pressure within the eye
is the electrostatic volts per turn, where <I> is the is measured (tonometry) by determining the
flux linking the orbit. Combining the last two force required to flatten a defined area of the
equations cornea. The blood pressure maximum (systole)
e d<l> and minimum (diastole) can be measured with
dp
the familiar cuff and stethoscope. Measurements
dt 21T RC dt of blood pressure as a function of time are usu-
ally made with catheters inserted in an artery.
Thus, after a lapse of time These either have a small pressure transducer at
their tip or contain fluid which transmits the
e pressure to an external transducer. Recently,
p = - - (<1>2 - <1>1).
21T RC schemes have been developed which measure
the blood pressure changes noninvasively by
Combining this with the first equation yields applying a time-varying external pressure which
unloads the arterial wall (Wesseling; Yamakoshi).
21T R2B = <1>2 - <1>1' The classical instrument for measuring res pi-
BIOMEDICAL INSTRUMENT AnON 132

ratory flow is the water spirometer. A bell jar, . stimuli. The amplitude and time delay of the
balanced by a counterweight and closed at its various components of this signal provide diag-
lower, open end by a water seal or bellows, nostic information (Thornton).
moves up and down as the patient breathes into Ultrasound is used extensively as a diagnostic
the space it encloses. Movement of the bell jar tool. The basic pulse-echo technique detects
can be related to volume changes in the lung by discontinuities in acoustic impedance at ana-
using the gas law. Flow rate is the time deriva- tomical features and locates them by the time
tive of the lung volume. Pneumotachometers required for the echo to return to the trans-
measure flow directly. Some pneumotachom- ducer. Two-dimensional maps are made by
eters use a small turbine in the air stream; others recording the direction in which the transducer
measure the pressure drop across a section which points along with the echo time. By pointing
offers a fixed resistance to the air flow. Still the ultrasound beam at the heart, motion of the
other models measure the convective cooling of heart wall and valve leaflets can be recorded as
a hot wire by the flowing air. a function of time. The Doppler effect is also
Thermal and dye dilution techniques are used exploited in ultrasonic diagnostic to measure
to measure cardiac output. For thermal dilution, the velocity of the reflecting surface. Detecting
catheters are threaded through a vein into the only moving interfaces improves the signal-to-
right atrium and the pulmonary artery. A known noise ratio for monitoring fetal heart or lung
amount of cool liquid injected into the right movement. The magnitude of the frequency
atrium warms up as it passes through the right shift when ultrasound is scattered from red cells
ventricle into the pulmonary artery. The time is used to measure blood flow. Details of various
constant for warming can be related to the flow ultrasound techniques can be found in the
rate from the heart. Cardiac output can be references by Wells.
determined by a similar mathematical analysis One frequently measures directly the electri-
when a dye or a radioactive isotope spreads cal potential difference between two points on
after it is injected. the body. Various kinds of measurements are
The most obvious application of acoustics is shown in Table 1. These potentials arise be-
in the diagnosis of hearing disorders. Audio- cause the interior of a cell is normally at a
grams show the threshold of sensitivity of the potential of 70-90 m V less than the outside.
ear to tone bursts of different frequencies. Mea- As a nerve cell conducts or a muscle cell pre-
surements of the acoustic compliance of the ear pares to contract, the cell depolarizes and the
drum as external air pressure is changed (tym- interior becomes more positive than the outside.
panogram) can be used to distinguish different The resulting currents flowing in the body give
problems in the middle ear. Evoked response rise to potential differences at the surface.
audiometry or auditory brainstem response (These currents also produce magnetic fields
records small electrical signals from the scalp which can be measured with SQUID magnetom-
due to audible stimuli. Because these signals eters (Wikswo; Geselowitz)).
have an amplitude of about 100 nV and are The measurement of these potentials requires
masked by other signals of about 50 /lV, the great care in the design of the electrodes. It is
signal must be extracted by averaging over many necessary to minimize battery effects (polariza-

TABLE 1. Measurements of Electrical Potential Difference Between Points on the Body.


Frequency
Measurement Source or Stimulus Measurement Amplitude Response

Electroencephalogram nerve activity in the brain electrodes on scalp 1O-100/LV 0-100 Hz


Electrocardiogram muscle activity in the heart surface electrodes on the 1 mV 0-100 Hz
limbs and chest
Electromyogram electrical activity in needle electrodes 1 mV 10 Hz-3 kHz
skeletal muscles during surface electrodes 50/LV
contraction; motor and
sensory nerve conduction
Electroretinogram flash of light cornea to lid 200-500/LV 1-600 Hz
periodically moving pattern 10 /LV 0.5-100 Hz
Electrooculogram eye movement skin on nose and temples, l/LV 0-10 Hz
or corners of eye
Visual evoked response flash of light; changing scalp electrodes over visual l/LV 1-600 Hz
pattern (occipital) cortex
Auditory brainstem sound stimulus scalp 100 nV 50 Hz-2 kHz
response
133 BIOMEDICAL INSTRUMENTATION

tion) and spurious signals which can be gener- of high atomic number into the esophagus,
ated in the outer layers of the epidermis (Web- stomach, bowel, kidneys, blood vessels, or
ster; Plonsey). hepatobiliary tree, in order to visualize the
The principles of geometrical optics are used anatomical features.
in ophthalmology to examine the eye and deter- Tomography (Greek tomos: a cut or section)
mine errors of refraction. The ophthalmoscope has been used to blur all but one plane within
works on the principle that an object at the the body by pivoting both the camera and the
focus of a lens produces a beam of parallel film around a point in the plane of interest.
rays. If both the patient and examiner have Computed tomography produces genuine sec-
normal vision and the patient's retina is illumi- tions by reconstructing a function I (x, y) from
nated, the parallel beam of light emerging from
the patient's eye will be focused on the retina a series of projections F(x) = jl(x,Y)dY taken
of the examiner. A laser, used as a flying-spot
scanner, has been used to produce raster images at many angles through the body (Fig. I). In
of the retina (Webb). Total internal reflection transmission tomography, the attenuation of a
has made possible fiber optic endoscopes which pencil beam of x-rays is measured, and the
are quite flexible and which can be used to function I(x, y) is the x-ray attenuation coeffi-
examine the patient's colon, stomach, lungs, cient (Pullan). Emission tomography measures
urinary bladder, etc., with much less discomfort the r-rays emitted by a radioactive pharmaceu-
than with the old rigid-tube instruments. Moire tical agent distributed along a line through the
patterns have been used to record body surface body, and I(x, y) is the concentration of the
contours. radioactive substance.
The most widespread use of lasers in medicine Ultrasonic tomography is also being devel-
has been for photocoagulation, which is outside oped. For ultrasound I(x, y) can be the recipro-
the scope of this article. Speckle patterns have cal of the velocity (if F(x) corresponds to the
been used for refracting the eye. Holographic propagation delay of a pulse) or I (x, y) can be
techniques have been suggested for the diagnosis the ultrasonic attenuation coefficient. The
and quantification of glaucoma. analysis is complicated by the fact that ray op-
The use of x-rays, nuclear radioactivity, and tics is not a good approximation to the propa-
ultrasound for diagnosing and treating patients gation of the ultrasound.
is called medical physics in the United States. Digital radiography uses electronic detectors
Conventional radiography measures anatomical similar to those used in computed tomography
features by using a point source to cast a shadow to record a two-dimensional projected image
of the patient on a sheet of film. X-ray photons similar to that of a conventional x-ray. In addi-
of 60-80 keY are used so that the photoelectric tion to providing greater dynamic range than
effect, which depends strongly on atomic num- film, this technique allows images to be made
ber, will help separate bone from soft tissue. It by digital subtraction. For example, a recording
is often necessary to introduce contrast agents without an intravenous contrast agent, sub-

y
f (x,y)

I
I
I

1
I

rJf\
I

F(,)

F '( x')

..
If.
(0) (b)

FIG. 1. Computed tomography reconstructs a functionf(x,Y) from a series of projections F(x),


F'(x'), etc. (From Hobbie; reproduced by permission of John Wiley & Sons, Inc.)
BIOMEDICAL INSTRUMENTATION 134

tracted from one with the agent, provides an NMR is also used to measure blood flow. The
image of the vascular system. Similar images in measurement relies on the fact that once the
the past usually required an intra-arterial injec- nuclei have been aligned, the RF resonance
tion of larger amounts of the contrast material, field misaligns them even as it produces a mea-
with greater discomfort and risk for the patient. surable signal. If the fluid is flowing, new aligned
A single scan of a limb with a monoenergetic nuclei are brought into the sensitive volume at a
photon source and a scintillation detector allows rate proportional to the flow, and the signal is
quantitative determination of bone mineral increased (Battocletti).
content. The source for this small, desk-top An article this short can only survey the field
machine is a radioactive nucleus which emits a of biomedical instrumentation. The references
gamma ray photon (Cameron, 1963). which follow will provide the reader with more
X-ray fluorescence is used to measure the detailed information about any topic men-
concentration of heavy metals such as lead or tioned. Topics which do not have sl?ecific ref-
mercury in the body. When the x-ray fluores- erences above are discussed in the books by
cence lines are favorably situated this method is Cameron (1978), Cobbold, and Webster.
sensitive to 10-20 ppm (Bloch).
Nuclear medicine measures the 'Y-ray emis- RUSSELL K. HOBBIE
sion of radioactive pharmaceuticals taken up in
various organs of the body. Detectors range
from a small unit placed over the organ of in- References
terest to a large disk of scintillator viewed by
many photomultiplier tubes, which can record Battocletti, J. H., Halbach, R. E., and Salles-Cunha,
a plane-projected image of the body. Emission S. X., "The NMR blood flow meter-Theory and
tomography, mentioned above, uses an array of history," Med. Phys. 8:435-443 (1981).
detectors surrounding the patient (Brownell). Bloch, P., and Shapiro, I. M., "An x-ray fluorescence
X-ray and nuclear medicine techniques provide technique to measure the mercury burden of den-
complementary information. X-ray pictures tists in vivo," Med. Phys. 8:308-311 (1981).
typically have resolutions of 1 mm and measure Brownell, G. L., Budinger, T. F., Lauterbur, P. C., and
anatomical features; the resolution of a nuclear McGeer, P. L., "Positron tomography and nuclear
medicine image may be a factor of 10 worse, magnetic resonance imaging," Science 215:619-626
but since the radioactive pharmaceutical has (1982).
been selectively taken up by some organ, the Cameron, J. R., and Skofronick, J. G., "Medical Phy-
nuclear medicine image shows the physiologi- ics," New York, Wiley, 1978.
cal function of different parts of the organ Cameron, J. R., and Sorenson, J., "Measurement of
(MacIntyre ). bone mineral in vivo: An improved method," Sci·
Neutron activation analysis is used to measure ence 142:230-232 (1963).
specific elements in the body. A neutron source Cobbold, R. S., "Transducers for Biomedical Measure-
such as 2s2Cf irradiates the region of interest. ments: Principles and Applications," New York,
Some of the nuclei in the body are made radio- Wiley, 1974.
active; the excited nuclei emit 'Y-rays of charac- Geselowitz, D. B., "Magnetocardiography: an over-
teristic energy, the intensity of which is propor- view," IEEE Trans. Biomed. Eng. BME·26:497-504
tional to the concentration of the element (1979).
being measured. Hobbie, R. K., "Intermediate Physics for Medicine and
Nuclear magnetic resonance measures the ab- Biology," New York, Wiley, 1978.
sorption and emission of radiofrequency pho- Lai, C.-M. and Lauterbur, P. C., "True three-dimen-
tons as a nuclear spin magnetic moment changes sional image reconstruction by nuclear magnetic
orientation in an external magnetic field. The resonance zeugmatography," Phys. Med. Bioi. 26:
photon frequency is proportional to both the 851-856 (1981).
external magnetic field and the nuclear magnetic MacIntyre, W. J., several chapters in J. R. Greening,
moment. The strength of the signal is propor- ed., "Proceedings of the International School of
tional to the concentration of the nuclei for Physics 'Enrico Fermi,' Course LXXVI," Amster-
which the apparatus is tuned. Studies are most dam, North Holland, 1981, pp. 163-232.
often made of protons (hydrogen), although Plonsey, R., "Bioelectric Phenomena," New York,
other atoms such as 31 P also possess a nuclear McGraw-Hill, 1969.
magnetic moment. Pulsed NMR also allows Pullan, B. R., several chapters in J. R. Greening, ed.,
measurement of relaxation times, during which "Proceedings of the International School of Phys-
the excess nuclear orientation energy is ex- ics 'Enrico Fermi,' Course LXXVI," Amsterdam,
changed with the surrounding lattice or with North Holland, 1981, pp. 7-65.
neighboring nuclei. Since the NMR resonance Thornton, A. R. D., several chapters in J. R. Greening,
frequency is proportional to the external mag- ed., "Proceedings of the International School of
netic field, it is possible to image either hydrogen Physics 'Enrico Fermi,' Course LXXVI," Amster-
concentrations or relaxation times by varying dam, North Holland, 1981, pp. 345-397.
the magnetic field across the patient. The re- Webb, R. H., and Hughes, G. W., "Scanning laser
construction techniques are similar to those in ophthalmoscope," IEEE Trans. Biomedical Engineer·
computed tomography (Brownell; Lai). ing BME-28:488-492 (1981).
135 BIONICS

Webster, J. G., ed., "Medical Instrumentation Applica- the unique methods for information transfer,
tion and Design," Boston, Houghton-Mifflin, 1978. memory storage, and retrieval, united with
Wells, P. N. T., "Biomedical Ultrasonics," London; unique ways of correlating and integrating data
Academic Press, 1977. from many sensors or sensor systems. These
Wells, P. N. T., several chapters in J. R. Greening, ed., unique methods depend mainly on: (a) many
"Proceedings of the International School of Physics converging and diverging information transfer
'Enrico Fermi,' Course LXXVI," Amsterdam, North channels and many connections between chan-
Holland, 1981, pp. 398-440. nels, and (b) the special properties of biological
Wesseling, K. H., "Physics of the cardiovascular sys- components at the points where these channels
tem," in J. R. Greening, ed., "Proceedings of the are interconnected. (2) The superior capabilities
International School of Physics 'Enrico Fermi,' of biological components or particular elements
Course LXXVI," Amsterdam, North Holland, 1981, of the components (e.g., receptor cells or nerve
pp. 66-162. muscle junctions) are derived from specific ways
Wikswo, J. P., "Noninvasive magnetic detection of of interconnection and probably as well from
cardiac mechanical activity: (1) Theory, (2) Experi- specific molecular properties. (3) The relevant
ments," Med. Phys. 7:297-306 and 307-314. data describing biological systems that will rep-
Yamakoshi, K.-I., et aI., "Indirect measurement of resent major improvements over existing phys-
instantaneous arterial blood pressure in the human ical and engineering hardware are analyzed and
finger by the vascular unloading technique," IEEE studied. (4) The present rapid advances in
Trans. Biomedical Engineering BME-27:150 (1980). microminiaturization techniques suggest that,
for the first time, the possibility exists for the
Cross-references: BIOPHYSICS; HEALTH PHYSICS; development of physical components or systems
MEDICAL PHYSICS; MOLECULAR ENGINEERING. that incorporate these superior biological prin-
ciples and processes.
The definition of bionics suggests the method-
BIONICS ology of procedure. We design, grow, or in some
way obtain nonbiologicai systems that function
Bionics is defined as the branch of knowledge in a way "resembling" living systems. The phys-
pertaining to the functioning of living systems ical component simulates the biological way of
and the development of nonliving systems which doing or carrying out its function . To obtain
function in a manner characteristic of, or re- this objective, we first choose the biological
sembling, living systems. The definition of components that perform the desired function;
bionics infers the use of scientific skills and second, we compile the descriptive biological
techniques from biological, physical, mathema- data; third, we translate these data into engi-
tical, and applied sciences in carrying out re- neering terms; and fourth, we apply the trans-
search in which: (l) the functions of chosen lated data for the physical simulation of the
biological components and systems are studied function .
and analyzed to determine underlying principles This process requires application of relevant
and processes that may lead to methods for im- mathematics to describe clearly, and as rigor-
proving physical components or systems, and ously as possible, the biological function by
(2) the theories and techniques of chemistry, some mathematical theory or model. This pro-
physics, and mathematics are applied to advance cess may also require various techniques from
our knowledge of the principles upon which the physical sciences to arrive at the necessary
these functions are based. data defining the biological function of interest.
Bionics research depends on the acceptance While the bionics research procedure requires
of certain postulates. These postulates are of the description, mathematically, of the function
two types- one essentially operational, the other to be performed by the nonbiological system,
essentially technical. The important operational it may also require mathematical and physical
postulates are: (1) Common experience shows descriptions of the properties of the materials
that biological systems perform operations that used to construct the nonbiological analog.
no non biological system can now perform Gaining enough data to describe the given
efficiently, e.g., operations such as pattern biological function may require study of the
recognition and identification, discrimination, biological component or one of its elements at
and learning; (2) biological components per- the molecular level. Similarly, solid-state or
form such functions as detection, filtering, and molecular techniques may be required to con-
information transfer more efficiently and with struct the physical system that is the appro-
greater certainty over broader bandwidths than priate simulation of the desired biological func-
do present nonbiological components; (3) in- tion .
tensive study, analysis, and application of Successful bionics activity may be modified
the principles that make superior biological by several factors . These include the complexity
performance possible can lead to nonbiological of the biological functions; the kind, quality,
systems that equal or may, in some cases, and quantity of data available to describe the
exceed biological systems capabilities. The tech- functions; and the existence of relevant mathe-
nical postulates are : (l) The functional ad- matical and physical techniques essential to the
vantages of biological systems are implied in simulation of the functions.
BIONICS 136

In terms of the bionics objectives, the scope The goals for the components may be of several
of the work must include research on the fol- types, but the emphasis has been on designing
lowing components and/or systems: (a) recep- the components so as to maximize certain
tors or sensors; (b) receptor systems-including parameters associated with signal propagation.
central interconnections and interconnections There is evidence that similar processes may
among receptor systems; (c) central nervous occur in physiological neurons. A preliminary
system networks and the interconnections a- extensive assessment (circa 1980) has been made.
mong parts of the nerve network; (d) effectors A few of the relevant papers are listed in the
and actuators; (e) effector systems-including references under Applications.
the feed-back and feed-forward connectors to
and from the central processor; (f) the inte- CECIL W. GWINN
grated system made up of sensors and their
input channels, the central correlating, control References
and computing networks, and the channels fro~
the central system to the effectors and actuators. General
This complete research program will extend
over a long period of time. However, progress "Cybernetic Problems in Bionics," London, Gordon
and Breach, 1968.
has been made. Data have been acquired, ana-
Beer, S., "Decision and Control," New York, John
lyzed, and put into engineering terms to provide
a set of specifications for the functional prop- Wiley & Sons, 1966.
erties of several types of neurons. The neuron Buckley, W. (Ed.), "Modern Systems Research for the
is presumed to be that physiological component Behavioral Scientist," Chicago, Aldine, 1968.
underlying observed psychological parameters Sommerhoff, G., "Logic of the Living Brain," New
such as learning and adaptation. Therefore, the York, Wiley & Sons, 1975.
construction of a network of artificial neurons Applications
should, to some extent, simulate these observed
behavioral parameters. In contrast, data have Barron, R. L., ansi Gwinn, C. W., "Applications of Self
been acquired and analyzed from the field of Organizing Control to Aeronautical and Industrial
experimental psychology on functions such as Systems," Design Engineering Conference ASME,
learning. These data can be translated into engi- New York, April 1971.
neering and can suggest types of components Mucciardi, A. W., "Self Organizing Probability State
which can simulate the learning function direct- Variable Parameter Search Algorithms for Systems
ly. This example illustrates a procedure which is that must Avoid High-Penalty Operating Regions,"
common in bionics research. One can start at IEEE Transactions on Systems, Man, and Cyber·
the operational level with an observed set of netics, SMC-4 (No.4) (July 1974).
functional parameters and attempt to synthe- Zeger, A. and Burgess, L., "Adaptive Techniques for
size the class of mechanisms that could simulate Radar Control," General Adaptronics Corp. under
these functions, or one can start with the bio- contract F33615-72C-1842, with U.S. Air Force
logical component which is presumed to give Avionics Laboratory, AFAL-TR-394, Nov. 1973.
rise to these functions and attempt to simulate Mucciardi, A., Shankar, R., et aI., "Adaptive Nonlinear
this component. The choice of one method Signal Processing for Characterization of Ultrasonic
over the other depends greatly on the assurance NDE Waveforms," Adaptronics Inc., under contract
one has of the validity of the data at one level with the U.S. Air Force Materials Laboratory,
or the other. F33615-74-C-5122, AFML-TR-75-24, Feb. 1975
Applications, using the former (functional and AFML-TR-76-44, April 1976.
simulation) type of component, have been suc- Hampel, D., "LSI Electronically Programmable Arrays
cessfully made to flight control systems, variable (Configurable Polynomial Arrays)," RCA Corp.
geometry jet engine control, radar detection and under contract with U.S. Air Force Avionics Labora-
search problems, and various industrial process tory, AFAL-TR-76-228.
control problems such as nondestructive ma- Klopf, A. H., "Brain Function and Adaptive Sys-
terials inspection techniques. The functional tems-A Heterostatic Theory," Air Force Cambridge
simulation type of component has been success- Research Laboratories AFCRL-72-0164, Bedford,
fully realized in both microcomputer and LSI MA,1972.
(large scale integrated circuit) forms. Barto, A. G., and Sutton, R. S., "Goal Seeking Compo-
At the biological component level of simula- nents for Adaptive Intelligence: An Initial Assess-
tion a somewhat different approach has been ment," University of Massachusetts under contract
initiated, mostly by the work of Dr. A. H. Klopf to U.S. Air Force Wright Aeronautical Laboratories
(see references). F33615-77-C-1l91, AFWAL-TR-81-070, April 1981.
Heretofore goals for adaptive networks were Klopf, A. H., "The Hedonistic Neuron: A Theory of
presumed specified at the systems level or at Memory, Learning and Intelligence," Washington,
most at the subsystem level; rarely at the indi- D.C., Hemisphere Publishing Corp., 1982.
vidual component level. Dr. Klopf's work has
been exploring the possibilities of goal seeking Cross-references: BIOPHYSICS, CYBERNETICS,
systems realized with goal seeking components. FEEDBACK.
137 BIOPHYSICS

BIOPHYSICS jumper's trajectory h to the work done by the


leg muscles in accelerating the body upward by
Biophysics is a very broad field with poorly distance s gives Fs = mgh where F is the average
defined boundaries. It encompasses all applica- muscular force. For humans, s is about! meter
tions of the principles of physical science to the and the maximum force F is about mg, suggest-
functioning of living organisms. It has consider- ing that we can raise our center of gravity about
able overlap with areas of biomedicine, bioengi- t meter by jumping. This is supported by ex-
neering, biochemistry, and physiology, as well perience. In another example, we find that the
as strong connections to space medicine, sports peak power output of a human athlete is about
medicine, fluid dynamics, psychology of per- 3600 W whether running at maximum speed
ception, molecular biology, membrane physiol- (10 m/sec), throwing a 1 kg football 50 meters
ogy and other related topics. This article will or a 7 kg shotput 16 meters, or broad jumping
provide a very brief summary of some selected 3.5 meters. Each of these calculations is based
topics; it will neither cover all areas nor discuss on certain idealized but sensible models of the
those chosen in great depth. body, in accordance with the practice of physi-
In a strict sense, biophysics is as old as science cists trying to understand nature.
itself. In Western intellectual history, those who The food requirement, energy and oxygen
made significant contributions to physical sci- consumption, and metabolism changes for these
ence also had important roles in the biological activities are now well documented. Studies of
sciences. Aristotle wrote extensively (often in- animal motion are now showing that there are
correctly) about human and animalfunctioning, many different ways to use the metabolic energy
Newton and Galileo studied many aspects of available from the food supply to move about,
human physiology, and Rene Descartes made conserve water, escape from predators, etc.
significant contributions to our understanding. Measurements and studies of energy consump-
Thomas Young, a physician who described tion in the ordinary processes of life have pro-
optical interference and translated the Rosetta vided considerable knowledge about physiolog-
Stone, provided a theory of color vision that ical functions as well as the basis for a wide
has a significant remnant in modern theories of variety of diagnostic tools and methods for the
this most difficult subject. Galvani's discovery practice of modern medicine. The power re-
of bioelectricity was another important event in quired to keep warm, maintain respiration and
the history of biophysics. circulation, and provide necessary chemical
More modern progress has resulted from sur- gradients constitute the basal metabolism rate
prising and unrelated discoveries. The work of (BMR). Control of the flow of this energy be-
Becquerel and Roentgen provided X-ray meth- tween storage areas and external reservoirs is
ods that produced an incredible amount of new the subject of a major field of study. Elemen-
knowledge, even up to the application of the tary aspects of feedback control are apparent
diffraction technique of Bragg for the analysis in some processes; others are so very compli-
of the structure of DNA. Arrhenius and other cated that they haven't been unravelled. There
chemists provided quantitative descriptions of are many pathological circumstances that can
the behavior of ions in solution. Coupled with be traced to improper or inadequate control of
advances in fluid dynamics, these discoveries energy exchange.
induced further progress in understanding the On the microscopic scale, many aspects of
behavior of biological fluids. The organic chem- energy transfer are still not known. We know that
ists provided information about metabolism and mitochondria are the power plants of cells, con-
other processes leading to understanding of verting the energy of metabolism into a phos-
amino acids and proteins and modern achieve- phor bond in ATP, the basic fuel for life. The
ments such as Sanger's synthesis of insulin. ATP transports stored energy for muscle con-
Contemporary biophysics contains very many traction, Na+ expulsion, etc., but many details
diverse areas of investigation. Some of these are of the process are not understood. Activity and
extensions of older subjects while others repre- progress in the study of energy in biological
sent completely new areas of investigation. A progress spans the range from how horses run
small number of topics from each of these cate- to the phosphorylization of ATP.
gories is discussed below. Another very important subject in biophysics
One of the oldest subjects in biophysics is the is sensory perception. The discussion in this
study of energy expenditure, conservation, and article will deal primarily with vision and hearing
conversion in the performance of physiological because both the stimuli and the responses are
functions. Ordinary mechanics can be used to more amenable to quantitative description than
calculate the optimum performance expected are taste, smell, and kinesthesis (touch). The
from human athletes in a variety of events. The sciences of optics and acoustics are very highly
results of these studies compare favorably with developed and as such allow careful study of
Olympic records, thereby 'Supporting the hy- animal response to light and sound.
pothesis that mechanical action for men and We begin with the optics of the human eye.
machines follow the same rules. For example, The 1/4 (approximately) system has a resolu-
equating the potential energy at the top of a tion limited to a few fJ.m on the retina by diffrac-
BIOPHYSICS 138

tion, and this is remarkably close to the size of In order to understand how inhibition affects
the receptor cells (this near agreement is star- perception, we consider an array of receptors as
tling). Measurements of visual acuity reveal that shown in Fig. 1. Suppose that each cell in the
our angular resolution of detail is indeed equal array could inhibit the response of its nearest
to this 10- 4 radian limit. But the naive notion neighbor, but only by some fixed fraction of the
of point-to-point transmission of images from original cell's response. If the array is subject to
eye to brain that would allow connection of a nonuniform stimulus that has a maximum
these two facts is shattered by the additional located at one of the cells, that cell will exert a
information that 10 8 cells 2 11m in diameter stronger inhibitory effect on its neighbors than
cover the retinal surface, but there are only 10 6 they will exert on it, resulting in an enhance-
individual neurons in the optic nerve. This ment of its relative strength. In the numerical
100 : 1reduction is the most elementary of many example of Fig. 1, we assume each cell can in-
pieces of information that demand a more so- hibit the response of its neighbors by 25%. The
phisticated description of human vision. cell whose uninhibited response would be 80 is
The Nobel prize winning work of H. K. Hart- reduced by 25% of 50 (= 12.5) and 25% of
line (1967) and ofD. Hubeland T. Weisel (1981) 100 (= 25) so that its net response is only 42.5
have shown that lateral inhibition (LI) among (similarly for the other cells). We notice that the
receptor cells plays an indispensible role in our center cell that had enjoyed only a 25% higher
visual process. LI arises when the response of a level of response to distinguish it as being at the
particular receptor cell to a fixed stimulus is maximum before inhibitory effects is now ele-
reduced by the stimulation of an adjacent cell. vated to a 40% higher level over its neighbor.
It provides a mechanism for enhancing contrast Inhibition can act in this way to sharpen sensory
and increasing sensitivity to edges and bound- perceptions.
aries. Furthermore, there are higher levels of LI Resolution of detail in a biological optical
that result in considerable data processing, lead- system is not as simple as a discussion of ideal
ing many scientists to the feeling that the retina components with well understood physical
is an extension of the brain. limits. Indeed, the system is subject to all the
It has long been known that there are many irregularities of biological growth as well as the
ways nerves can form junctions (synaptic junc- effects of the scattering of light from impurities
tions) and that some of them are inhibitory. in ocular fluids. The problem of information
That is, if two axons feed into a single ganglion retrieval from severely distorted images by a
cell, it is possible that the ganglion's response to system of limited resolution is a fertile area for
a fixed stimulus from one of the axons could be physical study.
reduced by a stimulus from the other axon. In The visual system provides much more infor-
this case we say that the other axon has an in- mation than simply the image in the field of
hibitory effect. view: one of its most striking capabilities is color

Stimulus
16 20 32 50 80 100 80 50 32 20 16

8 14 22 43 60 43 22 14 8_lnhibited
response
(a)

..
..2
'"
E
.~

B Inhibited and sharpened


~
c
o
~

E=======-______===:::==::::::::::====~
'"
cr:

Position
(b)

FIG. 1.
139 BIOPHYSICS

perception. Although different wavelengths of is a capacitor ripple filter on an ac current source.


light are perceived as different colors, it has been Classical fluid dynamics may apply instanta-
conclusively demonstrated that different color neously to some parts of this sequence of events,
perceptions can be achieved with the same wave- but the boundary conditions are changing so
length, and the same color perception with dif- rapidly that quantitatively accurate descriptions
ferent wavelengths. There is an extraordinarily are not available.
large body of data on color vision, but no single, Flow in the peripheral arteries is somewhat
clear, simple theory has emerged. The role of LI simpler, but branches and elastic walls still pro-
in the process is probably as important as it is vide difficulties. The simple relationship be-
in detail perception. tween radius R and pressure difference AP for
Our hearing process, particularly tonal dis- an elastic tube of wall thickness t (R/Ro =
crimination, is also enhanced by LI. The Nobel I/O - x), x =RoLlP/yt < 1) that might be used
prize winning work of G. von Bekesy (961) in Poiseuille's equations is inapplicable because
showed that tonal discrimination over a large the Young's modulus Y is not necessarily con-
region of the spectrum of audible sound stems stant-arterial walls contain both elastin and
from excitation of spatially distinct regions of collagen, resulting in a nonlinear stress-strain
the basilar membrane, a thin sliver of tissue relation.
curled up in the cochlea in the inner ear. But Flow in the smallest vessels is complicated by
tones having small frequency differences can the presence of blood cells. An average viscosity
excite overlapping regions of the membrane, is no longer a tenable description, and very
resulting in limits to our ability to distinguish complicated conditions can arise. Some capillary
between slightly different tones. The detection vessels are so narrow that the cells must deform
of vibration of the basilar membrane is also sub- to squeeze through them. Sometimes a few red
ject to LI, much like image detection, in a sys- cells can join into larger objects called rouleaux,
tem that functions to enhance discrimination. further complicating the problem by producing
The ability of trained musicians to sense off-key a rather large distribution of cell sizes. In some
notes to a precision of less than 1% attests to circumstances there is a tendency for the cells
the capability of the system. to concentrate in the central core of flow leaving
Another area of major interest in biophysics a layer of liquid adjacent to the vessel walls,
is transport. This includes the gross transport of and thus producing a decrease in the apparent
fluids through the circulatory system for the viscosity (Fahreus-Lindquist effect).
purpose of distributing heat and nutrients, as For these and other reasons, a physicist
well as the microscopic transport of molecules approaches the problem of blood flow with
through membranes and fluids. The most ob- some trepidation. The models must be carefully
vious example of fluid transport in human thought out, applied in the proper domains, and
physiology is blood flow. Classical fluid dy- the results must be interpreted with the proper
namics can be used to describe flow when fric- limitations in mind.
tion can be ignored (P + pgh + pu2 /2 = con- The other meaning of transport in biophysics
stant-Bernoulli's equation) and also when applies to movement of molecules, ions, and
viscous friction plays a dominant role (Q = 1TR4 dissolved gases across membranes and fluid
LlP/817L-Poiseuille's equation). Each of these boundaries. This problem has very many facets,
equations describes ideal fluids (continuous, only some of which will be described here. To
homogeneous, incompressible, etc.) flowing begin, the transport of solutes across concentra-
under ideal conditions (smooth straight pipe tion gradients in the usual kinetic sense (osmosis,
with rigid walls, no turbulence, steady flow, diffusion, etc.) is very important in many circum-
etc.). Application of these models to human stances. There are several distinct phenomena
circulation must be done with the greatest of involved in various examples, but they can most
care because blood contains both rigid and de- readily be divided into the two categories of
formable bodies of various sizes, flows in vessels kinetic and carrier-mediated diffusion. Kinetic
with elastic walls that bend and divide, and is diffusion is the usual topic described by the
pumped into an elastic aorta by a cyclic heart random walk process. The techniques of statis-
that produces pulsatile flow. tical analysis provide a quantitative description
Ventricular contraction produces a rapid injec- called Fick's laws, whose equations are familiar
tion of blood into the aorta. Since resistance to to many physicists. Carrier-mediated diffusion is
flow in the peripheral circulatory system is not significantly different in some concentration
rather high, the elastic walls of the aorta expand domains, but the transported substance may
to accommodate the extra volume: the kinetic reach concentrations that saturate the capabil-
energy of the blood is briefly stored as poten- ities of the carriers, resulting in a nonlinearity
tial energy in the aortic walls and then slowly that doesn't appear in Fick's equations. Never-
returned as the collapsing aorta squeezes the theless, a differential equation similar to that of
blood through the rest of the system. The flow catalyzed chemical reactions describes the
is very fast at first (Reynolds number ~ 2000) process.
and then slows as the dominant pumping comes One of the most important areas of current
from the aortic walls. A good electrical analogy study is transport through membranes. This
BIOPHYSICS 140

process may be suitably described by kinetic applicable to real gases which are sufficiently
methods for substances soluble in the mem- rarefied, for in that case the molecules are al-
brane's lipids, but this is usually not the case. In most always sufficiently far apart for the inter-
the more usual case, transport occurs through action force between them to be negligible. An
pores or channels that are either large or com- individual molecule of such a gas moves freely
parably sized with the solute molecule, and between collisions, but experiences abrupt ran-
different conditions prevail in these cases. Some- dom changes in velocity in each collision, so
times channel transport is carrier mediated, that no exact statements can be made concern-
sometimes the walls of the channel or the nature ing its state at a particular time. However, when
of the solute molecule dictate the dominant the gas comes to eqUilibrium at some fixed tem-
process of transport, and sometimes the proper- perature, predictions can be made concerning
ties of the channel depend on time or on the the average fraction of molecules which are in a
chemical environment. Transport against a con- given single state, or equivalently, the fraction
centration gradient, called active transport, is of time spent by one molecule in that state. (The
most well-known for the "sodium pump" that concept of a single state will be described below.)
maintains a lower Na+ concentration inside Since these average fractions are equivalent to
electrically active cells (nerve and muscle). The probabilities, one can say that predictions can be
details of this process are not yet completely made for the probability distribution for a
understood. molecule over its possible single states. To de-
The uptake of nutrients, the exchange of O2 scribe this distribution, let the set of energies
and CO 2 in the lungs, the response to chemical available to each molecule be denoted by €j. If I
transmitters of nervous information, and many and m stand for two single states of a molecule
other processes involve membrane transport. and €I and €m the energies of the molecule when
Each of them has special characteristics and it is in these states, then the ratio of the proba-
mitigating circumstances that are amenable to a bility PI of finding a molecule in the single state
physicist's model-making approach to nature. A 1 to the probability Pm of finding a molecule in
great deal of understanding of many of these the single state m is
processes has been achieved by· the application
of physical methods to biological problems; PI exp (-€dkT)
many others are under active investigation. (1)
This article has presented a brief summary of Pm exp (-€m/kT)
some restricted aspects of three major topics in This distribution of molecules over different
biophysics. The interested reader is referred to states is called Boltzmann's distribution, after
the references which follow and the work cited the founder of statistical mechanics, Ludwig
in them for further information on these topics Boltzmann (1844-1906). Equation (I) indicates
as well as discussions on very many others. that the ratio of probabilities for a molecule to
HAROLD METCALF be in two selected states depends only on the
energies of these two states and on the thermal
energy kT, which is proportional to the equilib-
References rium temperature T. When T is measured in
Ackerman, Ellis, and Williams, "Biophysical Science," kelvin (T = 273.16 K at the triple point of
Prentice-Hall, Englewood Cliffs, New Jersey, 1979. water) then the universal constant k, called
Hobbie, R. K., "Intermediate Physics for Medicine and Boltzmann's constant, has the value 1.38041 X
Biology, John Wiley & Sons, New York, 1978. 10-23 joules/kelvin.
Metcalf, H., "Topics in Classical Biophysics," Prentice- Using the fact that the sum of the probability
Hall, Englewood Cliffs, New Jersey, 1980.
Plover all accessible states for a molecule must
Ruch and Fulton, "Medical Physiology and Bio- be unity leads to an alternative form of Eq. (l):
physics," Saunders, Philadelphia, 1960. exp (-edkT)
Selkurt, "Physiology," Little, Brown and Co., Boston, PI = (2)
MA,1976. L exp (-€j/kT)
j
Cross-references: BIOMEDICAL INSTRUMENT A-
TION, BIONICS, HEALTH PHYSICS, MEDICAL Equation (2) indicates that the probability for a
PHYSICS, MOLECULAR BIOLOGY, X-RAYS. molecule to be in a single state 1 varies exponen-
tially with the negative of the energy in the state,
divided by kT. The exponential exp (-et/kT) is
BOLTZMANN'S DISTRIBUTION called the Boltzmann factor. The sum of the
Boltzmann factor over all single states for the
Boltzmann considered a gas of identical mole- molecule which appears in the denominator of
cules, able to exchange energy upon colliding, Eq. (2) is called the partition function for the
but otherwise independent of each other. Such molecule. This alternative form of Boltzmann's
a system, in which the interactions between the distribution indicates that the probability of
molecules between collisions is neglected, is finding the molecule in a single state 1is smaller,
141 BOL TZMANN'S DISTRIBUTION

the larger is the energy €/, and that the rate of


decrease of probability with increasing energy is (3)
faster, the smaller is the temperature T.
Boltzmann's original derivation of the proba- where h is Planck's constant, h = 6.6256 X
bility distribution described by Eqs. (1) and (2) 10- 34 joule-seconds, and n x , ny, and n z are in-
was restricted to the molecules of a gas. Sub-
sequently, Gibbs (Josiah Willard Gibbs, 1839- dependent integers, positive or zero, which are
1903) recast the reasoning to show that the called quantum numbers. The probability ampli-
Boltzmann distribution holds not only for a tude, or energy eigenstate t/J is the following
molecule, but also for an arbitrary system in function of particle coordinates x, y, and z:
thermal equilibrium with a much larger system, 8 )1/2 n 1TX
the reservoir, with which it can only exchange t/Jnx,ny,nz(x,y,z) = ( abc sin - ; -
energy. The combination of system plus reser-
voir forms a closed system, insulated from all
external influences. With this greatly extended . ny1TY. n z 1TZ
applica bility, the Boltzmann distribution is of . sm - - sm - - . (4)
vast utility in analyzing the equilibrium behavior
b c
of a wide range of both classical and quantum In the case that no two dimensions a, b, c of
systems. In such applications, it is essential to the box are in a ratio of integers, the energy
understand the meaning of the concept of single levels corresponding to various sets of values of
state used above. This is simpler to describe for the three quantum numbers are all different,
quantum systems, and will lead to a somewhat with one and only one probability amplitude t/J
different but related description for classical associated with each. Energy levels of this type
systems, which can be regarded as limiting cases are nondegenerate. If, however, any pair of a, b,
of quantum systems. c are in a ratio of integers, there will occur cer-
Quantum mechanics asserts that a complete tain values of the energy €n nA nA correspond-
A
description of a physical system is given in terms x' y' z
of pro bability amplitudes and provides a method ing to two or more distinct sets of values of the
for calculating these amplitudes. The probability three quantum numbers n x , ny, and n z , and to
amplitudes describe the state of the system, for two or more independent probability amplitudes
using them one can predict the probabilities for t/J. This is the case of a degenerate energy level.
the result of measurement of a set of compatible For example, if the container is a cubical box
dynamical variables of the system, such as the with a = b = c, most of the energy levels will be
three Cartesian components of momentum for degenerate, ~ince ~ll eigenstates with n 2 x + n 2 y +
a free particle, Px, Py, and Pz, or alternatively, n2 z =q, a fIxed mteger, have the same energy,
its three coordinates x, y, and z. Special impor- €q =h 2 q /8ma 2 . Say q = 30, then any of the six
tance is attached to states, called eigenstates, assignments of integers 1, 5, and 2 to n x , ny,
for which measurements of particular dynamical and nz yield the same energy, but correspond to
variables yield one unique result, rather than a different amplitudes t/J. This example illustrates
range of possibilities. The special values of the the connection between degeneracy and geomet-
dynamical variable corresponding to each eigen- rical symmetry-for the cubical box the direc-
state are called the eigenvalues for that dynam- tions x, y, and Z are all equivalent.
ical variable . For our present purposes we are It may often be more suitable to express the
interested in the eigenvalues of the energy for a Boltzmann distribution in terms of probabilities
system, corresponding to the €/ in Eqs. (1) and peel) for a system to have a given energy €l,
(2), and its corresponding eigenstate. Confined rather than PI, the probability for the system to
systems, like a gas of molecules in a container be in a single state. In that event, the ratio of
have a discrete set of energy eigenvalues. It may probabilities for finding the system in two
be that a particular one of these discrete energy energy levels will include the degeneracy factors
eigenvalues €l corresponds to not one, but rather g for each level, and Eq. (1) will become:
several linearly independent probability ampli-
tudes, each of which describes the system with peel) = gl exp (- €zlkT)
(5)
the same energy €l. Each of these linearly inde- P(€m) gm exp (-€m/kT)
pendent probability amplitudes represents what
we previously called a single state. One says that For example, if the difference in the two ener-
such an energy level €l is degenerate, and a gr gies €l- €m is much less than the thermal energy
fold degenerate energy level is one which is kT, the probability ratio will just be the ratio of
equally well described by gl linearly indepen- the degeneracy factors for these two energy
dent eigenstates (single states). As an example, levels.
take the physical system to be a single structure- Atoms and molecules are properly described
less particle of mass m freely moving in a con- in terms of quantum mechanics, but a descrip-
tainer, which is a rectangular box with length a, tion in terms of classical mechanics may some-
breadth b, and height c. The energy eigenvalues times be a useful approximation. We now con-
for this system are sider the concept of single state in classical
BOLTZMANN'S DISTRIBUTION 142

mechanics. Here the state of a single particle is p and p + tlp is just (2a/h)3 times the volume in
completely described in terms of its position n x , ny, n z space, or a 3 4rrp2 tlp/h 3 • One now
coordinates x, Y. and z, and corresponding sees that this number of states, which is the
momenta Px, Py, and pz. The specification is phase space volume a 3 4rrp2 tlp divided by h 3 ,
complete, since the laws of classical mechanics could also have been obtained by dividing the
are such that knowledge of position and momen- six-dimensional phase space into cells of size h3 ,
tum at anyone time permits prediction of these then counting the number of cells which lie in
variables at any other time. Specification of the that region of phase space which corresponds to
state of the particle is then equivalent to specify- the spatial volume a3 and the region between p
ing a point in the six-dimensional Cartesian space and p + tlp in the momentum space. This result
spanned by three coordinate axes x, y, Z and implies that each quantum state for the particle
three momentum axes Px, Py, Pz. As the posi- occupies a volume h 3 in the classical phase space.
tion and momentum of the particle change with It is an example of a general result: for a system
time, its representative point moves through this with f degrees of freedom, each of its quantum
space, which is called the phase space. The states occupies a volume II in the 2f-dimen-
particle has three degrees of freedom, and is sional phase space in the classical limit. We may
represented in a six-dimensional phase space. In remark that the finite size h for a cell in the
general, a system with f degrees of freedom has phase space x, Px of a particle with one degree
its state described by f coordinates and f con- of freedom corresponds to Heisenberg's uncer-
jugate momenta, and is represented by a point tainty principle, according to which the position
in this 2f-dimensional phase space. and momentum of a particle cannot be more
Connection between the classical description exactly defined than is consistent with the
of a state with its continuously changing coordi- relation !1xtlpx ~ h between products of their
nates and momenta and the quantum mechan- uncertainties. In view of this relation, it would
ical description in terms of discrete states for be meaningless to make a finer division of the
dynamical variables labelled by quantum num- phase space, as it is impossible to decide by
bers is made by considering the classical limit of experiment in which of these cells a particle lies.
quantum systems. In this limit, the discrete We can now apply Boltzmann's distribution
eigenvalues of dynamical variables change by law to find the momentum distribution for a
only a very slight fractional amount with small free particle in a cubical container in thermal
changes in the quantum numbers which deter- equilibrium at temperature T, in the classical
mine them. For example, the classical limit for limit. The particle has a (kinetic) energy € =
the particle in a box, with quantized energies (Px 2 + pi + pz2 )/2m. Suppose we want the
given by Eq. (3) is achieved for large energies, probability P(Px, Py, pz) dPxdpydpz to find its
or large quantum numbers n x , ny, and n z , when momenta in the range between Px and Px + dpx,
energy levels become very closely spaced. Con- Py and dpy, Pz and dpz. Corresponding to this
sider the case of a cubical container with volume specification are a number of quantum states
a 3 and energies €n = h 2 n 2/8ma 2 , with n 2 = equal to a3dpxdp dpz/h 3 . Multiplying this
n2x+n2y+n2z. For large values of the quan-
number of states (degeneracy factor) by the
tum numbers n x , ny, and n z , small changes in
Boltzmann factor e-e/kT and dividing by the
these numbers will produce a change tln in n,
and a corresponding change tl€n in energy, such integral over all states gives the desired probabil-
ityas

Px2 + pi + pz2)
exp ( - 2mkT dPxdpydpz
(6)

that tl€n/€n = 2!1n/n is smaller, the larger is n. Equation (6) is called Maxwell's distribution,
Then the number of quantum states for which after James Clerk Maxwell (1831-79), who ob-
n lies between nand n + tln will be the volume tained it in 1860 before Boltzmann's more
V of one octant of a spherical shell in n x , ny, general derivation in 1871. We may note that
this probability distribution is the product of
n z space with radius n and thickness tln, or V = three independent factors, each of which defines
4rrn2 tln/8. In this classical limit, we expect the the probability distribution for a separate com-
particle energy to depend on momentum as ponent Px, Py, or Pz of the momentum. The
€n = Pn 2 /2m, with p2 = Px 2 + pi + pz2. Com- probability distribution for momenta given by
paring the two expressions for the energy gives Eq. (6) is applicable to the translational motion
Pn = nh/2a. Then the number of quantum states of the molecules of a gas, and in fact does not
for which the particle momentum lies between depend at all on the type or strength of the inter-
143 BOND, CHEMICAL

actions between the molecules in the classical Cross-references: BOSE-EINSTEIN


ST A TISTICS;
limit. FERMI-DIRAC STATISTICS; STATISTICAL ME-
For systems of weakly interacting indistin- CHANICS; RADIATION, THERMAL.
guishable constituent elements such as atoms
molecules, electrons, and photons, the Boltz~
mann distribution properly describes how such BOND, CHEMICAL
elements are distributed among their own indi-
vidual states only when the average number of Every atom consists of a relatively compact
elements or particles in every single state is nucleus bearing positive charge, surrounded by
considerably less than unity. Physically this a cloud of electrons in sufficient number exactly
case corresponds to the gas of particles being to balance the nuclear charge by their negative
sufficiently rarefield. When this condition is not charge. Being thus electrically neutral atoms
satisfied, quantum mechanical symmetry re- might be expected to exert among th~mselves
quirements on probability amplitudes for the only the negligibly weak gravitational forces
system under exchange or any two identical predictable for such low mass. However there
particles lead to two new distribution laws. are two principal means by which more' signif-
Particles with integral intrinsic spin, like pho- icant interatomic attractions can occur and
tons, are described by the Bose-Einstein distri- these are largely responsible for the exi;tence
bution. This distribution is applicable, for and properties of chemical substances.
example, to the treatment of electromagnetic One is the result of the fact that the electrons
radiation, described as photons, in a hollow surround the nucleus in an easily deformable
enclosure which has come to thermal equilib- cloud. Although opposite charges tend to dis-
rium. It leads to the Planck radiation law for tribute themselves if possible so that their cen-
the density of radiant energy from a black ters coincide, when two atoms come near to one
body. Particles with half-integral intrinsic spin another, the repulsion between the two nega-
like protons, neutrons, and electrons, are de~ tively charged electronic clouds results in mu-
scribed by the Fermi-Dirac distribution. This tually induced distortions of the clouds so that
distribution is applicable to the treatment of the c~ar~e centers no longer coincide, creating
conduction electrons in metals and to dense atomlc dIpoles. Although these dipoles oscillate
stellar interiors. rapidly, their net effect is to create an electro-
An interesting physical system which involves static attraction between the atoms. Such attrac-
both the Boltzmann and the Bose-Einstein or tions are known as van der Waals forces. Al-
Planck distributions consists of a rarefield gas though such forces become very significant as
of atoms which absorb or emit electromagnetic exerted between molecules consisting of many
radiation. The radiation may be considered as a atoms, they are so slight per atom pair that at
gas of noninteracting photons. When thermal ordinary temperatures the attractions among
equilibrium is established between the gas of small molecules are usually completely overcome
atoms and the gas of photons, the atoms of the ?y the disruptive forces of kinetic energy, mak-
rarefield gas will be distributed among their mg them gaseous. They are insufficient to hold
individual energy states according to the Boltz- atoms or small molecules together in a con-
mann distribution, while the photons are dis- densed state, liquid or solid, unless most of the
tributed among their individual energy states kinetic energy has been removed by cooling.
according to the Bose-Einstein or Planck dis- However, large molecules do cluster together
tribution. The physical consistency of this at ordinary temperatures as a consequence of
result was first proved by Einstein in 191 7, for these intermolecular attractions, commonly so
slowly moving atoms, and generalized by Dirac strongly that decomposition occurs before the
in 1927 to the case of atoms with arbitrary temperature is high enough for melting or
speeds. vaporization.
In contrast, much stronger per atom pair is the
H. A. GERSCH interatomic force known as the chemical bond.
This depends on the electronic structure of each
atom, described by quantum theory as based
on occupancy by electrons of certain energy
regions of limited number and availability. In
References atoms of only six of the chemical elements,
Wannier, G. H., "Statistical Physics," John Wiley & helium, neon, argon, krypton, xenon, and radon,
Sons, New York, 1966. this electronic configuration, with two outer-
Reif, F., "Statistical and Thermal Physics," McGraw- most electrons in helium, eight in all the others,
Hill, New York, 1965. results in a sufficiently thorough covering up of
Kittel, C., and Kroemer, H., "Thermal Physics," W. H. the nuclear charge that no region of the atom
Freeman and Co., San Francisco, 1980. remains within which an electron from another
Krylov, N. S., "Works on the Foundation of Statistical atom might be stably accommodated. In atoms
Physics," Princeton Press, Princeton, 1979. (Appen- of all the other chemical elements, all of which
dix by Ya. G. Sinai.) have fewer than eight outermost electrons, how-
BOND, CHEMICAL 144

ever, there remain regions, called orbitals, with- understanding of chemical bonds is therefore at
in which an electron from elsewhere would, the very heart of understanding chemistry. All
despite repulsions by the other electrons, ex- substances except the six elements mentioned
perience a net electrostatic attraction between above normally exist in the form of atoms
itself and the effective positive charge of the bonded together. Therefore all chemical reaction
nucleus. Each such orbital can accommodate involves the breaking of existing bonds and the
two electrons only, and there are four outer- formation of new ones. At ordinary tempera-
most orbitals in all atoms beyond helium in tures the usually dominant tendency is for
atomic number. Vacancies exist wherever the bonds to be broken in favor of forming new,
number of outermost electrons is less than stronger bonds. Therefore a knowledge of bond
eight, this being the maximum that the outer- strength and the origin of this strength is also
most shell can hold in any isolated atom. (For essential to understanding chemistry.
more on the effective nuclear charge that can Chemical bonds occur between atoms of all
be sensed in such vacancies, see PERIODIC the chemical elements except the six previously
TABLE AND PERIODIC LAW.) The presence of mentioned, and in all chemical compounds. If
outermost vacancies provides the possibility the atoms linked together are all alike, the sub-
that outer electrons from another atom may be stances is a chemical element, but if the sub-
accommodated, these electrons now being held stance contains bonds between unlike atoms
by both nuclei. Thus the atoms are held together (different elements), it is a chemical compound.
by simultaneous attraction of both nuclei for The following qualities of chemical bonds are
the same electrons. All chemical bonding is of special interest: number of bonds an atom
essentially of this nature. can form, multiplicity, bond length, bond angle,
There are four ways by which vacancies may and bond strength. Under most circumstances,
be utilized to form chemical bonds: atoms tend to form the maximum number of
(1) If the atoms have more outermost vacan- bonds of which they are capable. They can form
cies than electrons, then the outermost electrons one covalent bond for each outermost half-
of each can spread out into the vacancies of the filled orbital they can supply. The number of
others, resulting in what is called delocalized or single bonds or their equivalent in multiple
metallic bonding. bonds that an atom can form is usually called
(2) If each atom provides one outermost its valence. A two-electron bond is called single,
orbital that is half-filled, the single electron of a four-electron bond double, and a six-electron
each can be accommodated within the vacancy bond triple. Bond length is important because
of the other so that a pair of electrons is shared bond energy is greater at shorter internuclear
between the two atoms, the atoms being held distances. It is measured experimentally as the
together because both nuclei are attracted to distance between the two nuclei. When an atom
the same bonding electrons. This is called a forms more than one bond, the angle between
covalent bond. The shared electrons are essen- bonds is important in determining the geometry
tially concentrated within the internuclear of the molecule. In general, electron locations
region, and thus localized. If additional half- in the exterior of an atom consist of bonds or
filled orbitals are available on each atom, mul- lone pairs. These locations tend to follow the
tiple bonds between the same two atoms may laws of like charge repulsions and be separated
form. as far from one another as possible. If all outer-
(3) If one atom has an outermost vacant or- most electrons are in two locations, these must
bital and the other an outermost filled orbital, be at opposite sides of the atom, and if in bonds,
or electron pair, this pair may be accommodated the bond angle is 1800 • If all electrons are in
within the vacant orbital to provide electron- three locations, these are at the corners of an
pair sharing, termed coordinate covalence. equilateral triangle planar with the nucleus, and
(4) If each atom has the requisites for form- any bond angles are 120 0 • If all the electrons
ing a covalent bond but one attracts electrons are in four locations, these are at the corners of
much more strongly than the other, the former a regular tetrahedron, corresponding to bond
may acquire essentially all the bonding elec- angles close to 109°28'. Five bonds usually
trons, thus gaining a unit negative charge at the assume a trigonal bipyramid structure, two tri-
expense of the other atom left with unit posi- gonal pyramids having a common base. Six
tive charge. The electrostatic attraction between bonds most commonly are directed toward the
the opposite charges will hold the atoms to- corners of a regular octahedron. Most molecular
gether in what is commonly called an ionic structure or geometry can be satisfactorily
bond. The existence of truly ionic bonds is in rationalized through this simple type of reason-
serious question, most so-called ionic bonds ing. As for bond strength, to be discussed in
being partly covalent and therefore more accu- detail presently, chemical bonds exhibit a wide
rately described as polar covalent. range of strength, but even the weakest are
All substances derive their special physical recognized as bonds if they are sufficiently
and chemical properties from the nature of the stable to allow experimental studies of the com-
atoms that compose them and the way in which bined atoms. In general, chemical bonds are far
these atoms are attached to one another. An stronger, per atom pair, than the van der Waals
145 BOND, CHEMICAL

forces described previously, and quite able to they hold together has been the unrealized, and
persist at ordinary temperatures despite the probably unrealizable, goal, of theoretical chem-
kinetic energy which threatens their disruption. ists applying quantum mechanics for more than
Although all bonds tend to break at sufficiently half a century. (See PHYSICAL CHEMISTRY.)
elevated temperatures, some persist even at The immense complexity of mathematical cal-
several thousand degrees C. culations involved requires the extensive use of
Chemical bonds may join atoms together into computers even for approximations, and so far
finite molecules or into indefinitely extensive defies efforts to obtain exact results. One cause
three-dimensional .arrays of atoms in solids, lies in the difficulty inherent in problems in-
called nonmolecular. Molecular substances volving, as do atoms, many-particle interactions.
themselves may become solid, if sufficient Another, with respect to chemical bonds, lies in
kinetic energy is removed, through van der Waals the fact that the energies of interactions among
forces which hold the molecules together in atoms are generally very much smaller than the
crystalline array. In molecular substances, each total energies of these atoms. For example, the
molecule contains the same whole number of total energy of a molecule of potassium bromide,
each kind of atom, and thus can be assigned a KBr, is more than 8 million kJ per mole but
molecular formula which indicates its exact differs from that of the separate atoms by only
composition but does not necessarily reveal the about 370 kJ. To evaluate the bond energy as
exact arrangement of the atoms in the molecule. the difference between the total energies of the
For nonmolecular substances, all of which are separated atoms and the molecule is to deter-
solid, the composition is commonly accurately mine accurately a small difference between very
representable by definite chemical formulas, large numbers, in practice impossible.
but many examples exist wherein the compo- Nevertheless by far the major effort has been
sition is slightly variable and the formulas directed toward quantum mechanical applica-
cannot represent the composition accurately tions, even though these require numerous
in exactly whole numbers of atoms. These simplifying assumptions and approximations.
differences are indicated by calling the exact Two ways of looking at a covalent bond that
molecular compositions or nonmolecular com- have received most attention are: (I) the atomic
positions stoichiometric and the inexact com- orbital, or valence bond approach, according
positions of certain nonmolecular compounds to which electron sharing is visualized as per-
nonstoichiometric. Only in the solid state, and mitted through the overlap of bonding orbitals
then only for certain combinations of elements, on each atom so that they occupy a region in
is nonstoichiometry possible in what are called common within which the bonding electrons
pure compounds. Most naturally occurring sub- are attracted to both nuclei; and (2) the molec-
stances are mixtures of different chemical sub- ular orbital approach, according to which,
stances, to which assignment of specific chemical ideally, all the electrons of a molecule belong to
formulas would be meaningless and misleading. all the nuclei, occupying molecular orbitals
Covalent bonds involve bonding electrons which are the equivalent of atomic orbitals in
localized within the internuclear region, but atoms. Wave equations for these molecular orbi-
bonding electrons spread out if they can. Their tals are usually obtained as a linear combination
delocalization in metals, not possible in non- of atomic orbitals (LCAO). For a diatomic
metals where there are no extra vacancies into molecule, coalescence of two bonding atomic
which the electrons can spread out, removes the orbitals is pictured as forming two new molecu-
limitation of valence imposed by the number of lar orbitals, one concentrating the electrons
available orbitals. Regardless of the number of within the internuclear region and called a
normal valence electrons, the atoms of metals bonding orbital, and the other dispersing them
tend most commonly to pack together as closely away from the internuclear region and called an
as is possible for like sized spheres, which gives antibonding orbital. Each electron within a
each interior atom 12 closest neighbors, or in bonding orbital serves to hold the atom together
other close packing wherein each interior atom and each electron within an antibonding orbital
has 8 direct neighbors and 6 more just a little cancels the bonding effect of one bonding
farther away. The special properties of the electron.
metallic state, good conductivity of heat and It is the molecular orbital approach which re-
electricity, malleability, and ductility, are the ceives most current attention, and numerous
consequences of the delocalization of electrons, molecular properties are studied in this way.
allowing them to flow among the atoms with- However, not surprisingly, far less complicated,
out affecting the bonding, and allowing the re- more empirical studies have led to a more prac-
arrangement of atoms without the breaking of tical understanding of chemical bonds. In par-
specific localized bonds. Atoms of metal may ticular, a simple concept of covalence allows a
usefully be regarded as positive ions held to- practical method of calculating bond energies
gether by the negative electron glue (bonding from atomic properties which produce accurate
electrons) that fills the interstitial space. results for most molecular species and for many
The exact description of chemical· bonds or nonmolecular solids as well. Thus it allows an
the molecules or nonmolecular solids which understanding of the origins of the heats of
BOND, CHEMICAL 14(;

formation and reaction compiled by thermo- acquisition of unit charge is a constant times the
chemists, and the prediction of the strengths square root of the atomic electronegativity.
of bonds and the direction of chemical The partial charge is the ratio of the actual
reactions. change in electronegativity in forming the
The concept is based largely on that of electro- compound to the change corresponding to unit
negativity, which is a measure of the attractive charge.
force between a bonding electron and the effec- The energy E of a polar covalent bond, in kJ
tive nuclear charge acting over the distance of per mole, when bond length and radii are ex-
the covalent radius. Atoms of different elements pressed in picometers, is
differ in their electronegativity, or ability to
attract bonding electrons. Consequently the E = tcEc + tie; = tcRc(EAAEBB)1/2/Ro
only chemical bonds in which the bonding elec-
trons are evenly shared are between like atoms. + 138909ti/R o
In all compounds, the bonds involve uneven
sharing of electrons, which results in acquisition where Rc is the sum of the nonpolar covalent
of a partial charge by each atom. The initially radii and Ro the bond length.
more electronegative atoms acquire more than The homo nuclear single covalent bond ener-
half share of the bonding electrons, leaving the gies (E AA and EBB) are determined directly or
initially less electronegative atoms with a par- indirectly from experiment. Here a very impor-
tial positive charge and themselves acquiring tant anomaly is recognized. Normally, as the
partial negative charge. This uneven sharing electro negativity increases and the radius de-
results in an equalization of electronegativity creases from left to right across the major groups,
within the compound, since atoms initially low one expects the homonuclear single bond energy
in attraction for electrons become partially to increase steadily also. From Ml to M4 (e .g.,
positive, thus increasing the attraction for elec- Li-C) it does. Beyond M4, however, there are
trons, while atoms initially high in attraction two changes. The outermost shell now acquires
for electrons become less so as the result of its first lone pair of electrons, having five to be
acquiring partial negative charge. Thus each accommodated within only four orbitals. At
bond between unlike atoms is polar covalent, the same time, the homonuclear bond energy
meaning that electrons are unevenly shared. decreases. It is assumed that the lone pair some-
The energy of such a bond is considered to con- how weakens the bond, so the effect is called
sist of two contributions. One is the nonpolar the lone pair bond weakening effect (LPBWE).
covalent energy Ec which would result if the The effect is very large in nitrogen, oxygen,
two electrons were evenly shared at the observed and fluorine, and significant but much smaller
bond length. This is easily evaluated as the in the heavier elements of these groups. It is
geometric mean of the two homonuclear bond halved when the atom forms a double bond and
energies, corrected for any difference between eliminated when a triple bond is formed. Similar
the sum of the nonpolar covalent radii and the reduction of this weakening is also observed in
actual bond length. The other is the ionic energy certain single bonds. Although not yet thor-
E j which would result if one atom monopolized oughly understood, the LPBWE provides quan-
these two electrons creating a positive and a titative explanation of many common and
negative ion. The ionic energy replaces a part of important chemical facts, such as the existence
the total possible covalent energy, but the ionic of nitrogen, oxygen, and carbon dioxide as
contribution is always larger than the part of the gases instead of solids like phosphorus, sulfur,
covalent energy which it replaces, so that and silicon dioxide.
polarity always strengthens the bond. For this The bond energy equation given below, with
reason, chemical change tends to occur in the appropriate modification for solids, has been
direction of forming more polar bonds, and the successfully applied to thousands of different
earth's matter exists primarily as compounds bonds in hundreds of compounds, including
having polar bonds rather than pure elements most of the common functional types of or-
held together by nonpolar bonds. ganic molecules, with results agreeing usually
A polar covalent bond is thus pictured as a within 1-2% of the experimental values.
blend of a covalent and an ionic contribution, The bond energy obtained in this way is the
both based on hypothetical extremes. The contributing bond energy (CBE), defined as that
blending coefficients are easily evaluated from part of the total atomization energy which this
the partial charges, the ionic coefficient tj being bond provides. Only for diatomic molecules is
half the difference between the two charges, this the same as the bond dissociation energy
and the sum of the ionic coefficient and the co- (BDE). When breaking a bond liberates a frag-
valent coefficient tc being 1.00. In turn the ment-free radical-which consists of more than
partial charges are evaluated from the initial one atom, the liberated bonding electron may
electronegativities. The electronegativity in the either decrease or increase the strength of the
compound is evaluated as the geometric mean remaining bonds. If it decreases the remaining
of all the initial atomic electronegativities. The bond strength, extra energy must be absorbed
change in electro negativity corresponding to when the original bond is broken, making it that
147 BOSE·EINSTEIN STATISTICS AND BOSONS

much more difficult to break and increasing the Cross-references: CHEMISTRY; ELECTRONS, ELE-
bond dissociation energy. If breaking the bond MENTS, CHEMICAL; PERIODIC LAW AND PERI-
i~creases the remaining bond strength, energy so ODIC TABLE; PHYSICAL CHEMISTRY.
liberated will permit the original bond to be
broken that much more readily, decreasing the
bo?d dissociation energy. This energy of re-
adJustment may be termed the reorganizational BOSE-EINSTEIN STATISTICS AND BOSONS
energy of the radical. It is applicable whenever
that particular is liberated by a bond dissocia- Bose-Einstein statistics is a type of quantum
tion. For example, all bonds to the phenyl statistics concerned with the distribution of
group, C6HS -, require about 50 kJ per mole particles of a particular kind among various al-
more to break than would be expected from lowed energy values taking into account the
the contributing bond energy, this being the quantization of the energy values. Quantum
reorganizational energy of the phenyl radical. statistics is a branch of STATISTICAL ME·
Such values can be very useful in organic chem- CHANICS which treats the average or statistical
istry, especially in understanding free radical properties of a system composed of a large num-
reactions. ber of particles using standard mathematical
Finally, no discussion of bonds, however brief techniques and the properties of the constituent
sho!!l? 0n:tit mention of the special bonding particles. It is different from classical statistical
exhIbIted III compounds of hydrogen. Since an mechanics only in that the particles of the sys-
atom of hydrogen has only one electron used tem are described quantum mechanically.
in its bonding, the nucleus remains quite un- Let us consider a system of N non-interacting
protected when the hydrogen bears partial particles. Three different distributions of the
positive charge. If a small atom on another particles among the various energy levels are
molecule bears partial negative charge and a possible depending upon the assumptions that
lone pair of outer electrons, a substantial are made about the particles. If it is assumed
attraction can occur between the positive that each arrangement or distribution which
hydrogen of one molecule and this electron conserves energy is equally probable and also
pair, bridging the two molecules together. that the particles are distinguishable, and if each
Commonly termed a hydrogen bond but better permutation of particles among the possible
called a pro tonic bridge, this kind of bond is levels is counted as a different distribution one
only 5-10% as strong as a typical covalent bond obtains an average for the relative numb~r of
but nevertheless very widespread and important. particles in the various levels known as the
It is an important factor, for example in deter- Maxwell-Boltzmann distribution. If the particles
mining the structure of liquid and s~lid water are treated as indistinguishable and only the
and of proteins. Hydrogen can also form hydri- number of different combinations of particles
dic bridges, in which the hydrogen atom is is counted, the Bose-Einstein distribution is ob-
bonded to two other atoms simultaneously tained. A third distribution, known as the
through what is sometimes called a three-center Fermi-Dirac distribution, results if, in addition
bond, corresponding to half a bond to each to indistinguishability, it is required that the
other atom. In a protonic bridge, a positive particles obey the Pauli exclusion principle
hydrogen bridges two negatively charged atoms which permits no more than one electron in
each having a pair of electrons. In a hydridic each quantum state.
bridge, a negative hydrogen with a pair of elec- These three distributions may be expressed
trons bridges two positively charged atoms each mathematically as follows where n(e) gives the
having a vacant orbital. Intermediate examples number of particles per energy level at energy
of bridging by hydrogen are also observed. e when the particles are in thermal equilibrium
Understanding chemical bonding remains the at temperature T:
most important and fundamental problem in
theoretical chemistry . Continuing evolution of 1
(1) n(e) = e(E-E o )/kT Maxwell-Boltzmann
these concepts is to be expected into the indefi-
nite future.
R. T. SANDERSON 1
(2) n(e) = e(E-Eo)/kT _ 1 Bose-Einstein

References
McWheeny, R., ''Coulson's Valence," Oxford Univer-
(3) n(e) = e(E-€o)/kT + 1 Fermi-Dirac
sity Press, New York, 1980.
Sanderson, R. T., ''Chemical Bonds and Bond Energy," where k is the Boltzmann constant and eo is
Academic Press, New York, 1st Ed. 1971; 2nd Ed. related to the number of particles present and
1976. depends on the temperature in such a way that
Sanderson, R. T., "Polar Covalence," Academic Press, for energies large compared to kT (so that the
New York, 1983. probability of occupation for a level becomes
BOSE-EINSTEIN STATISTICS AND BOSONS 148

considerably less than unity), all three distribu- ties, F. London4 suggested that liquid 4He, be-
tions reduce to the Maxwell-Boltzmann distribu- low the lambda point (often called He II) is
tion. composed of two interpenetrating fluids, a
The appropriate form of statistics to apply to normal and a "superfluid." The superfluid is
an assembly of particles can also be discussed in composed of those molecules in the ground
terms of the symmetry properties of the wave state or lowest energy state whereas the normal
functions describing the particles. Two classes fluid would consist of the remaining molecules.
of w.~ve function 1/1 (a solution of the The finite viscosity is due to the normal compo-
SCHRODINGER EQUATION for two or more nent. As the temperature is reduced toward
identical particles) result from interchanging all absolute zero the number of molecules in the
the coordinates, both spatial and spin, in the superfluid component increases until at absolute
wave function. It should be noted that this sym- zero one would expect to have only a super-
metry class does not change as a function of fluid . Since 4He is a boson and the Pauli exclu-
time. The wave functions for particles obeying sion principle does not apply, there is no limit
Fermi-Dirac statistics (fermions) are anti- to the number of molecules which can exist in
symmetric, while those for bosons (Bose-Ein- the lowest energy state. Thus this completely
stein statistics) are symmetric. Therefore, for a different type of "condensation" caused by the
system of bosons, if all the coordinates of any quantum mechanical properties of the molecules
pair of identical particles are interchanged in the not only provides a qualitative explanation of
wave function, the new wave function will be the strange superfluid nature of He II but also
identical with the original. provides one of the few examples of a macro-
Photons, all mesons (except the mu meson scopic system for which the mechanical behavior
which is really a lepton) and all nuclei of even cannot be explained by classical mechanics. It is
mass number are bosons, while nucleons (Le., significant to note that no superfluid behavior
neutrons and protons), quarks, all nuclei of odd has been observed for 3He which has spin! and
mass number and all leptons, such as electrons, obeys Fermi-Dirac statistics and thus is unable
neutrinos and the muon are fermions. All known to condense into the lowest energy state even at
bosons have angular momentum nh/2rr, where temperatures as low as a few thousandths of a
n is an integer or zero and h is Planck's constant. degree Kelvin.
The statistics of some nuclei have been deter-
mined experimentally by the observation of the ROBERT L. STEARNS
relative intensities of successive lines in the band
spectra of homonuclear, diatomic molecules. References
One application of Bose-Einstein statistics to
1. King, A. L., ''Thermophysics,'' San Francisco, W.
a physical situation is the treatment of a "pho-
H. Freeman Co., 1966.
ton gas." It is possible to obtain the Planck
2. Mandl, F., "Statistical Physics," New York, John
distribution law for blackbody radiation by
treating the electromagnetic radiation inside an Wiley & Sons, 1971.
3. Landau, L. D., and Lifshitz, I. M., "Statistical
enclosure at constant temperature as a gas of
particles of zero rest mass which obey the Bose- Physics," Reading, Mass., Addison-Wesley Pub. Co.,
Einstein distribution law. This treatment pro- 1958.
vides an il!teresting example of the wave-par- 4. London, F.,Phys. Rev. 54:947 (1938).
ticle duality found in nature, since it is in
marked contrast to the original derivation which
was based on the wave nature of electromag- Cross-references: FERMI-DIRAC STATISTICS AND
netic radiation. FERMIONS, PHOTON, SCHRODINGER EQUATION,
Another interesting application is the qualita- STATISTICAL MECHANICS, SUPERFLUIDITY.
tive explanation of the superfluid properties of
liquid helium which occur below the so called
lambda point of 2.186 K. At atmospheric pres-
sure 4He condenses into a liquid at 4.3 K. If the BREMSSTRAHLUNG AND PHOTON
pressure is then reduced the liquid boils until BEAMS·
the temperature reaches the lambda point, where
the boiling immediately stops. The lack of boil- An electron can suffer a very large acceleration
ing is caused by a very large increase in the ther- in passing through the Coulomb field of a nu-
mal conductivity, which becomes essentially cleus, and in this interaction the radiant energy
infinite, so that all parts of the liquid are at the (photons) lost by the electron is called brem-
same temperature . Below the lambda point, sstrahlung,l (bremsstrahlungt sometimes desig-
liquid 4He seems to behave as if it had nearly nates the interaction itself). If an electron
zero viscosity when the viscosity is measured by
passing the liquid through a fine capillary, but *This work was supported by the U.S. Atomic En-
when the viscosity is measured by observing the ergy Commission.
drag on parallel plates moved through the fluid T"Bremsstrahlung" -German; bremsen, to brake and
it is not zero. To explain these peculiar proper- Strahlung, radiation.
149 BREMSSTRAHLUNG AND PHOTON BEAMS

whose total energy Eo 2: BOO/Z MeV traverses 100 ...---,--,--..,.--,----.,r---,


matter of atomic number Z, the electron loses
energy chiefly by bremsstrahlung. This case is

__
considered here. 80
_~x RAYS
Bremsstrahlung in the coulomb fields of the ELECTRON BEA~ I~.J.~
atomic electrons is adequately included by re-
placing Z2 in the formulas by. Z(~ + 1): For 60

<
Z 5, more complicated correctIOn IS reqUIred. I o
Protons and heavier particles radiate relatively a::til a::~ 00375cm'
little because of their large masses (radiation o 40 TARGET THICKNESS
rate is proportional to the square of the acceler- Q
ation, inversely proportional to the square of
the mass). If a very energetic electron traverses 20

one radiation length (X o ) of any matter, brem-


sstrahlung reduces the electron's energy to l/e
of its incident value on the average. Some ex- 0L---L---~--~--~~~~
amples are: o 100 300

PRODUCT OF DEGREES AND MeV.


FIG. 2. Theoretical bremsstrahlung angular distribu-
Element Air C Al Fe Cu w Pb tions from thick tungsten targets for relativistic en-
ergies. These data are obtained from the Natl. Bur.
Std. Handbook, 55. Ra is defined as the fraction of
Radiation
the total incident electron kinetic energy that is radi-
length
ated per steradian at the angle a. I
Xocm 29800 20 9.1 1.7 1.42 0.32 0.51
The angle a is measured with respect to the incident
electron's direction. Since the electron may scatter be-
fore it radiates, a 1- () where a is shown in Fig. 3.
The energy dependence of radiation loss per
centimeter by an electron of energy Eo, travers-
ing matter of density n atoms/cm 3 is given by observed photon distribution actually an aver-
dE/dx = - nE o1>rad where 1>rad is given by the age over electron scattering angles of the basic
curves in Fig. I. bremsstrahlung distribution. The basic brem-
A beam of energetic electrons incident upon a sstrahlung angular distribution has a zero at
radiator produces a bremsstrahlung beam that.is a = 0, which is quite different from the curves
directed sharply forward. Photon angular dIS- of Fig. 2. The basic spectral shape is a weak
tributions for typical "thick" tungsten targets function of photon angle and, in thick radia-
are shown in Fig. 2. Curves for other heavy ele- tors electron scattering modifies this shape
ments are similar if all radiator thicknesses are slightly (Fig. 3). Examples of thick radiator
measured in units of the radiation length. In spectra are shown in Fig. 4 for various incident
such thick radiators the incident electrons scat- electron energies. The bremsstrahlung spectra
ter appreciably, as ~ell as radiate, making any depend upon screening of the nuclear coulomb
field by atomic electrons through the parameter
16
r = 51k/[E o(E o - k)Zl/3), where k is the pho-
ton energy in million electron volts. For com-
14
plete screening (r ~ 0), the thick radiator spec-
trum is given by

HI
12
I~
"-
::
"0 10

8
d;: = 41~~;2 + (ffoY - %ffo]
. In (lB3Z- 1/3) + 1. %....} cm 2/MeV
~

6
9 Eo
4
where E is the final electron total energy in mil-
lion electron volts and ro is 2.B2 X 10-13 cm.
201 10 100 1000
01 10
For no screening (r ~ 1),
2 To IMeV)
dab 4Z2r02
FIG. 1. Dependencelo~~he total radiation cross - - = - - - [ 1 + (%....)2 _ 2:. %....]
dk 137k Eo 3 Eo
section cPrad = (1/Eo)
o
k do on the initial elec-
. [In 2EoE -
0.51k
1.J cm /MeV
2
2
tron kinetic energy, Eo. The parameter ¢ = Z2r02/137.I
BREMSSTRAHLUNG AND PHOTON BEAMS 150

Eo = 40 MeV
Z=78

>- .8
I-
en
Z
W
I-
Z .6

W INTEGRATED
> OVER

~
..J
.4

W
a::
.2
FIG. 4. Dependence of the Born-approximation ab-
solute cross section (integrated over photon direc-
00 tions) on the photon and electron energy. These curves
.2 !4 .6 .8 LO
are from reference 1.
k
EO
100
FIG. 3. Dependence of the spectral shape (Schiffs
calculation) on the photon emission angle, 9. k is the
.,
C
photon energy in MeV. These curves are from refer-
ence 1. Eo is the incident electron energy in MeV. .
.,"
Q.
50
8.<;
a:-
Intermediate screening (2 < 'Y < 15) leads to
much more complicated formulas. l
A remark concerning formulas is in order. 0
Generally, expressions for a given cross section 100
are very different depending upon whether the
electron energy is small or very large, upon
whether the screening is zero, intermediate, or
complete, and upon whether one is dealing with
the most usual electron-nucleus collisions or
with purely electron-electron collisions. Most
calculations have been done in Born approxima-
tion. The reader is referred to Koch and Motz l
for an excellent review article on the subject.
The absolute number of bremsstrahlung pho-
tons in the photon energy interval dk radiated
by a single electron of energy Eo traversing a
radiator of thickness dt and n atoms/cm 3 is
given by (dab/dk)n dt dk, where da/dk can be
found from Fig. 4.
It must be noted that photon-electron show-
ers begin developing in approximately one radi-
ation length, and these formulas and curves
apply only to the basic bremsstrahlung interac- o~~--~--~~~~~~~--~~--~
0.5 1.0
tion itself or to radiators somewhat thinner
than one radiation length. x = k/Eo
Conventional bremsstrahlung beams are par- FIG. 5. Photon intensity and photon polarization
tially polarized only from extremely thin radia- from a diamond target; experimental spectra com-
tors « 10-3 radiation lengths) because the pared with the averaged intensity and theoretical
angular region of polarization is sharply peaked polarization for photon angle equal to 50 mrad and
about the angle () = moc 2 /E o . Electron scatter- electron energy Eo = 4.8 GeV. The solid line repre-
ing in the radiator broadens the peak and shifts sents Hartree potential, the dashed line shows expo-
the maximum to larger angles. Polarization is nential potential.
151 BREMSSTRAHLUNG AND PHOTON BEAMS

defined by ized photons. Unfortunately, yields are low by


normal intensity standards, but not unus.ably
dUl(e, Eo, k) - dun(e, Eo, k) so.
pee, Eo, k) =dUl(e,Eo,k)+dun( e,Eo,k) Coherent bremsstrahlung from an electron in-
cident upon a properly oriented single crystal
where an electron of energy Eo radiates a pho- (e.g., diamond) is discussed in a definitive re-
ton of energy k at angle e. 1 and II directions view article by Palazzi. 2 This effect depends
are with respect to the plane defined by the in- upon the interactions being coherent from the
cident electron and the radiated photon. When scattering centers in a given crystal plane. A
the electron is relativistic before and after the typical spectral shape (Fig. 5) and polarization
radiation, the electric vector is most probably (Fig. Sa) prove to be extremely useful even
in the 1 direction. Polarization in conventional though the spectrum is not monochromatic and
beams is difficult to observe because thin, low- free from background. Techniques for orienting
yield radiators are required. Practical thick-tar- the crystal radiator are given by Luckey and
get bremsstrahlung shows no polarization ef- Schwitters. 3 Typical examples of high-energy
fects whatever. One usually deals with this polarized photon beams and their a~plications
unpolarized bremsstrahlung and therefore aver- to physics are found in Ballam et aI., Bingham
ages over all possible states of polarization of et aI., 5 and in Bologna et ai. 6
the incident photons. Polarized photon beams from high-energy ac-
It is clear that if one makes use of polarized celerators are frequently generated through the
photons when investigating electromagnetic in- inverse Compton effect. Nearly 100 per cent
teractions, additional information on spin and polarization can be achieved with reasonable
angular momentum states can be obtained. The spectral shapes, but photon yields are some-
need for this additional information is so com- what lower than from the coherent bremsstrah-
pelling that special techniques are frequently lung process discussed above . Linearly polarized
employed at the highest-energy electron accel- photons from a high-power pulsed laser are di-
erators (e.g., SLAC, DESY*) to generate polar- rected into a nearly head-on collision with a
*SLAC-Stanford Linear Accelerator Center, Stan- high-energy electron beam. Those photons that
ford, California; DESY - Deutsches Elektronen-Syn- are back-scattered through approximately 180
chrotron, Hamburg, Germany. degrees carry a large fraction of the electron's

Laser

High
Energy
Photon
Beam

Final
Laser Beam Layout Collimator

Not to Scale e- Beam


Dump

"
B2

Inverse Compton
Scattering
Weak Vertical
Steering Magnet Region

FIG. 6. A nearly-head-on collision of a high energy electron and a laser photon is accomplished at SLAC in the
manner shown schematically here. The electron beam employed is part of the central beam facility at SLAC.
Angles are exaggerated and there is no scale. Practical considerations eliminated the possibility p,f crossing the two
beams at angles much less than 3 mrad. The high energy photon beam fmally enters the 82 hydrogen bubble
chamber where its interactions with hydrogen are studied.
BREMSSTRAHLUNG AND PHOTON BEAMS 152

90r------.------.-------.------.-------.------.---.

BO

70 12 GeV Electron Beam

on
.t::
c: 60
:::>
~
~
:c 50
...
!!.
.;;> 40
c:
$
.:

.t:.
.
c:
0
0
30
a..
20 Arrow Indicates Kinematic
Limit Of 2.97 GeV

10

0.5 1.0

Photon Energy (GeV)_

90

80

16 GeV Electron Beam


70

~
c:
:::> 60
~
~
:e 50
!!.
?:
'w
c:
$
40
c:

.c:
0
0
.t:.
30
a..

20 Arrow Indicates Kinematic


Limit Of 4.B7 GeV

10

Photon Energy (GeV) _


FIGS. 7 and 8. Energy spectra obtained from operation with 12 and 16 GeV electron beams, giving 2.97 and
4.87 GeV scattered photons respectively. These spectra were obtained by measuring e+e- pairs produced in the
bubble chamber along the known beam line.
BREMSSTRAHLUNG AND PHOTON BEAMS 154

11. Chahine, C.,Phys. Rev. Lett. 47, 1374 (1981). into "chains" whose length could be measured
12. Duinker, P., Rev. Mod. Phys. 54,325 (1982). and whose "links" could be counted. The micro-
13. Ballagh, H. C., et aI., Phys. Rev. Lett. 47, 556 scopic observation of these uniform particles
(1981). enabled Perrin and his students to verify the
Einstein results and to make four independent
Cross-references: ATOMIC AND MOLECULAR measurements of Avogadro's number. These
BEAMS, COLLISIONS OF PARTICLES, QUANTUM results not only establishrd our understanding
CHROMODYNAMICS, QUARKS. of Brownian motion, but they also silenced the
last critics of the atomic view of matter.
Probably the simplest example of Perrin's
experiments was his test of the Law of Atmo-
BROWNIAN MOTION spheres. If we assume that the air is at rest and
Brownian motion is the randomly agitated be- has the same temperature from ground level
havior of colloidal particles suspended in a fluid. upward, it can be shown that the pressure (and
The phenomenon is named for its discoverer, concentration) of the air falls off exponentially
Robert Brown, an English botanist. In 1828 he with increasing altitude. For particles of mass
observed the "perpetual dance" of microscopic m and density p suspended in a medium of
pollen grains suspended in water. Initially, this density p' at absolute temperature T, the ratio
effect was interpreted as being due to the of the particle concentrations n 1 to n2 at
motions of living matter, but it was later found heights hI and h 2 is given by
that any tiny particles in suspension exhibit
Brownian motion. ~=exp [_ mg(p- p')No(h 1 - h 2 )]
In 1888, M. Gouy attributed the motion to
the bombardment of the visible particles by n2 pRT
invisible thermally excited molecules of the
suspension. In 1900, F . M. Exner expressed the where No is Avogadro's number, g is the accel-
view that the kinetic energy of the visible parti- eration of gravity, and R is the universal gas
cles must equal that of the surrounding sus- constant. Although the concentration of air
pension particles, and he attempted to estimate varies slowly with height, the concentration of
molecular velocities on this basis. the relatively heavy particles varied significantly
In a series of papers published from 1905 to over a height change of a few millimeters. By
1908, Einstein 1 successfully incorporated the observing the concentration variation as a func-
suspended particles into the molecular-kinetic tion of height, all quantities in the given equa-
theory of heat. He treated the suspended parti- tion were known except Avogadro's number
cles as being in every way identical to the sus- which could therefore be determined.
pending molecules except for the vast difference
of their size. He set forth several relationships JAMES A. RICHARDS, JR.
which were capable of experimental verification
and he invited experimentalists to "solve" the References
problem .
Several workers undertook this task. The most 1. Einstein, Albert, "Investigation of the Theory of
notable of these was Perrin.2 Perrin's special the Brownian Movement," A. D. Cowper, trans-
success was due to his technique of preparing lator, New York, Dover Publications, 1956.
particles to suspend which were of uniform and 2. Perrin, Jean, "Atoms," D. L. Hammick, translator,
known size. The uniformity was achieved by London, Constable, 1923.
fractional centrifuging, and the size was estab-
lished by noting that they could be coagulated Cross-references: ATOMIC PHYSICS, HEAT.
153 BREMSSTRAHLUNG AND PHOTON BEAMS

kinetic energy and retain the full polarization collisions of beams of high energy electrons and
of the original laser photons. Several hundred positrons.1 2 At energies near 15 GeV for each
photons of ~ 95 per cent polarization and 5 colliding particle, jets of hadrons (collimated
GeV energy are routinely obtained from each groups of strongly interacting particles, e.g.,
pulse of a two-joule ruby laser in conjunction pions, protons, etc.) are sometimes produced.
with the linear accelerator electron beam at The usual two-jet structure has been seen to be
SLAC. 7 (See Fig. 6.) accompanied occasionally by a third less ener-
Kinematics of these "laser beams" are given getic jet of hadrons. A model and theory for
by interactions of high energy particles, quantum
chromodynamics (QCD), furnishes a quantita-
EoO - a) tive explanation of the observed three-jet struc-
k = 1 +a(r8)2 8« I
tures in terms of a fundamental (?) particle
(quark) emitting a field particle (gluon) in the
where k = scattered photon energy in the scattering process. Gluons in the interaction,
lab, Eo = electron energy In the lab, as well as quarks, have the property of material-
a = [I + (4fk;jm»)-I, m = electron mass (0.5 izing in the laboratory asjets. The least energetic
MeV), and r = Elm. ki = laser photon energy jet is usually interpreted as the materialization
(ruby laser light has k; = 1.786 X 10- 6 MeV) of a gluon emitted with relatively lower energy
and 8 = lab angle (in radians) of scattered pho- than the parent quark. In analogy with a charged
ton. (Electron beam has 8 = 0 rad.) For exam- particle (usually an electron) emitting a corre-
ple, if Eo = 20.000 MeV, k; = ruby laser energy, sponding field particle (a photon), this latter
then k = 7070 MeV. At photon energies in the process, properly termed bremsstrahlung, has
several thousand MeV range (several GeV given its name to the process of gluon emission
range), 8 must be restricted to ~ 10-5 radians by a quark. An excellent review article by
to restrict the lower spectral limit of the pho- Duinker 12 includes this subject as well as ref-
tons to approximately 90 per cent of the maxi- erences to the current literature.
mum photon energy. Typical spectra are shown Hadronic jets have also been observed in the
in Figs. 7 and 8. basic process of scattering of neutrinos by
Measurement of the photon flux in an accel- nucleonsP The hadronic jet structure occa-
erator bremsstrahlung beam is required in order sionally arising from these interactions is, again,
to make quantitative determinations of cross quantitatively explained in QCD only by invok-
sections and to normalize observations. An in- ing the idea of hard gluon bremsstrahlung.
strument called a "quantameter"8 is used quite These latter two processes are found in colli-
successfully for this purpose and is accurate to sions of particles at the highest accelerator
the order of I per cent. It basically provides energies currently available. As energies increase
sufficient matter (copper plates) to contain the with the advent of newer and larger accelerators,
entire electron photon shower volume gener- gluon bremsstrahlung will likely remain an im-
ated by the incident bremsstrahlung beam as portant part of the scattering process.
well as to sample and integrate the intensity of
the showers over their entire extent. ROBERT W. KENNEY
"Inner bremsstrahlung" is an interesting ex-
ample of true bremsstrahlung. In beta decay
interactions and in orbital electron capture, one
sees, on the average, a low-intensity photon References
continuum, the quantum limit of which is equal 1. Koch, H. W., and Motz, J. W., Rev. Mod. Phys.
to the transition energy of the interaction. 31, 920, 1959. Extensive survey of formulas and
These photons are bremsstrahlen emitted by excellent presentation of curves for numerical
the electron in its transition to the final state. calculation.
The Feynman approach to theoretical treat- 2. Palazzi, G. D., Rev. Mod. Phys., 40,611 (1968).
ment of the bremsstrahlung process is detailed 3. Luckey, D., and Schwitters, R. F., Nucl. Inst.
by Williams. 9 A definitive review article on Meth., 81,164 (1970).
bremsstrahlung, especially from gaseous tar- 4. BaHam et aI., Phys. Rev. Letters, 24, 1364 (1970);
gets, has been written by Blumenthal and 24,960 (1970); 23, 498,817 (E) (1970).
Gould. 10 They include the closely allied topic 5. Bingham et aI., Phys. Rev., Letters, 24, 955
of synchrotron radiation and Compton scatter- (1970).
ing as well as some other interesting radiative 6. Bologna et aI., Nuovo Cimento, 42A, 844 (1966).
effects. They provide an excellent and quite 7. Ballam et aI., l'hys. Rev. Letters, 23,499 (1969);
current bibliography. Sinclair, C. K. et al., IEEE Trans. Nucl. Sci, 16,
A recent paper by Chahine ll has shown the 1065 (1969).
importance of the long radiative tail to wide 8. Yount, D., Nuci. Ins. Meth., 52, 1 (1967).
angle hard bremsstrahlung in very inelastic scat- 9. "An Introduction to Elementary Particles," 2nd
tering of mu mesons. Its bibliography is useful in Ed., W. S. C. Williams, New York, Academic
further study of this aspect of bremsstrahlung. Press, 1971 (page 313 ff.).
A process called hard gluon bremsstrahlung 10. Blumenthal, G. R., and Gould, R. J., Rev. Mod.
has been newly observed (1979) in the head-on Phys., 42, 237 (1970).
c
CALCULUS OF PHYSICS
70
To label a topic as the "calculus of physics" is 60
not intended to imply the establishment of
some new type of mathematics, but rather that 50
a point of view different from that comfortable V 40
to the professional mathematician is to be em- 30
ployed in its discussion. Concepts are intro-
duced for the immediacy of their application to 20
the description of physical phenomena, and a 10
heuristic approach is used to introduce them.
We shall not hesitate to ignore interesting but o
uncommon exceptions to our statements and x
shall make use of pictorial representations and
special cases to illustrate our points. FIG. 1
Functions Since many of the processes of
physics are continuing, with the state of things value of x one may obtain y by multiplying x
at a given instant developing smoothly out of by itself and the result by x again. In this case, x
the state of things in the previous instant, the is the independent variable, y the dependent
means of describing these processes compactly variable. The concept of continuity is contained
is with the help of continuous functions. In in the idea that between any two values of x
some sense, any way of naming the members of another can be found; and the concept of con-
a set of objects when given a member of another tinuous function, that to all such values of x a
set comprises a functional relationship. Thus if value of y can be assigned according to the
one is given the set of numbers prescription in Eq. (1).
Our example is one of the simplest types of
x=I,2,3,4'" function: an algebraic function. Functions of
several variables may be considered: functions
and the relation of complex variables, trigonometric functions,
y =x 3 , (1) exponential functions, etc. Because of their im-
then one immediately knows that portance in representing physical processes, let
us consider the trigonometric functions a little
y = 1, 8, 27, 64, . . '. more closely.
The trigonometric functions are functions of
Eq. (1) is one way of representing the function. the variable (J (or x Qr y, or any other symbol
Another would be to tabulate x and y side by you choose), which need not be an angle in the
side: narrow sense, although in physics applications
x 2 3 4 we shall insist that the variable be dimensionless
as an angle is. (The distinction between units
y 8 27 64 in which the size of a quantity is expressed
relative to some standard and dimensions, which
and another is to draw a graph of y against x, are fundamental attributes of a quantity in
as in Fig. 1. The points contain exactly the same terms of mass, length, and time, will not hold us
information as the table; the curve contains here. Suffice to say that, e.g., a second is a unit
much more information, but not as much as for the dimension time, and the ratio of an arc
Eq. (1). Thus it would be a hypothesis to say length to a radius of the arc, which may be ex-
from the table alone that to x = ! there cor- pressed in the unit radian, is dimensionless.)
responded y = i. The curve (if it could be The tables of trigonometric functions provide a
read accurately enough) would allow one to discrete representation, and the familiar graphs
make that assignment, but would not allow one are even more useful in visualizing their prop-
to conclude that x = 5 corresponds to y = 125. erties. Figure 2 is a graph of the function y =
Equation (1) contains all of this information, sin (J and Fig. 3 of y = cos (J. The horizontal
and more. We understand from it that to any scale of each of Figs. 2,3 should be thought of
ISS
CALCULUS OF PHYSICS 156

"'
.!!
'E 150

Sin 11

FIG. 2

o 2 3 4 5
Time, t, in hours

FIG. 5

FIG. 3
x2 x3
eX = l+x+-+-+ .. ·+ .. ·
as extending to the left and right indefinitely 2! 3!
with the curves repeating the behavior as shown
every 3600 • The vertical scales for the sine and x2 x3
e-X = l-x+---+···+···.
cosine functions need be no larger than shown, 2! 3!
since their curves oscillate between plus and
minus one. Derivative We can now introduce a few of
These are examples of continuous functions of the concepts of the branch of mathematics
a single variable (0 in these illustrations). They which deals with functions and their properties,
are periodic functions, i.e., they repeat their the calculus. To be specific, we should like to
values periodically as the independent variable mention the derivative and the integral of a
continuously changes. It is the property of function. The reader will know something of
periodicity which makes these functions suitable these already: e.g., the speed of an automobile
for representing certain physical phenomena. is the time derivative of its position. This is
In Fig. 4 we represent two more useful func- shown in Fig. 5. The slope of each line repre-
tions for physics applications: the exponential sents the speed with which the automobile trav-
curve, y = eX and its inverse, Y = e-X = l/e x . eled from New Haven to Boston: 50 mph,
As a matter of notational convenience, one 37! mph, 30 mph. The speed, v, is given by
frequently replaces y by [(x), to be read as tlX/D.t, where D.t is the time required to tra-
"function of x." As a rule, continuous func- verse the distance D.x. From the graph, one sees
tions can be represented by sums of algebraic that the slope is also the tangent of the angle
functions of the independent variable. For made by the line with the horizontal axis. These
example: are general properties of the derivative.
The question arises as to what happens if the
03 05 07 line is not straight. Then one defines the deriva-
sinO=O--+---+"'+'"
3! 5! 7! tive in the same way, but expects its value to
change from point to point along the curve. At
cos 0 = I - 0- 2 + 0- 4 - 0- 6 + ... + ... each point, one draws the tangent line and cal-
2! 4! 6! culates its slope. This is the derivative. The ab-
stract definition of derivative is based on the
where 0 must be expressed as a dimensionless notion of limit. In mathematical language, if a
ratio; i.e., in radians rather than degrees, and function approaches a fixed value as close as
one pleases while the independent variable ap-
proaches a given value arbitrarily, the function
is said to approach a limit as the parameter ap-
proaches its value. The notation is limx .... a [(x) =
[(a), to be read as "the limit of [(x) as x ap-
proaches a is [(a)." For simple algebraic
y functions the concept is rather obvious: If
[(x) = ax + bx 2 , then lirnx .... 2 [(x) = 2a + 4b.
The reader should be warned, however, that the
situation is not always so obvious. For example,
let [(x) = (ax + bx 2 )/cx. As x -+ 0, both numer-
ator and denominator approach 0, so
lirnx .... o [(x) seems to be 0/0, which is indeter-
x
minate. However, the limit is actually finite;
namely, it is a/c. The derivative has been heuris-
FIG. 4 tically defined as D.f(x)/ D.x, but this has signifi-
157 CALCULUS OF PHYSICS

160 d 2[/dx 2 for the derivative of the derivative (this


Cls would be the acceleration if [(x) were distance
128 and x were time). The reader will imagine that
more derivatives may be taken, and wonder if
.-
.....
~
96 the process is limitless. For some functions, e.g.,
eX, there is no hifhest derivative to be taken.
64 For others, e.g., x ,all derivatives above a given
one are zero-the zero values begin with the
32 fourth derivative in this case.
Examples of derivatives which can be used in
0 calculations are da/dx = 0, where a is a con-
stant, dx/dx = 1, dx 2 /dx = 2x, dxn/dx = nx n - I
t, sec where n is any number, and d(axn)/dx = nax n -I,
FIG. 6 where a is a constant.
de x de- x
- = eX - - = - e-x
dx ' dx '
cance only so long as the ratio remains determi-
nate as D.x ~ O. Formally, one says
df(x) . f(X2) - f(xt>
----ae
dsinO
= cos 0 , ----ae
dcosO
= - sin O.
- - = hm or
dx X2~Xl X2 - XI '
Integral The other important operation of
df(x) D.f(x) the calculus is integration. It may be simply de-
- - = lim - -
dx Clx-+O D.x fined as the inverse of taking the derivative,
although such a definition has only limited use-
For all the cases we shall be interested in dis- fulness-mainly it lulls the unwary into thinking
cussing, the limit exists, is equal to the deriva- he may know something of the process. An op-
tive, and is equivalent to the tangent to a curve erational definition, lacking elegance, may be
of f(x) vs X at the point in question. This is more nearly indicative of the true nature of the
illustrated in Fig. 6, which is a graph of s = 16t2 . integral: It is a function so constructed that its
At 2 sec, the tangent has a slope of 64 ft/sec, derivative yields the function whose integral
and at 3 sec it is 96 ft/sec. Since the speed is was to be found. From our examples of deriva-
changing with time, we can graph it and find its tives, the curve of Fig. 7 may be written as
rate of change or derivative, as shown in Fig. 7.
The slope is seen to be a constant: 32 ft/sec 2 . dx/dt = 32t, (2)
What is its meaning? This is the acceleration of and of Fig. 6 as
the object which is moving according to the
graph in Fig. 6. Thus the acceleration is defined (3)
as the time derivative of the speed. For an ob-
ject moving with constant speed, the accelera- We have been at some pains to show that
tion is zero-as it should be to conform with dx/dt = 32t, so by our definition of integral, x
common sense. is the integral of dx/dt. The notation for inte-
While speed and acceleration are among the gral is shown in Eq. (4):

J
most familiar examples of derivatives, we
should note that the derivative of a function
need not be taken with respect to time. If f is a x = (dx/dt) dt. (4)
function of an arbitrary variable, x, then the
derivative is equal to D.f(x)/ D.x, where D.f and
D.x are measured on the tangent drawn at point Although this is rather a special case, it con-
x. The notation, which we shall have occasion tains a number of interesting features. If it were
to employ, is df/dx for the derivative, and a legitimate operation to "multiply" dx/dt by
dt, the expected product would be dx, and Eq.
(4) would become

(5)

which somehow looks like an identity. In fact,


if we think of dx replaced by D.x and J by
"sum of," then
o 2 3
t, sec x = ("sum of') D.x

FIG. 7 is pretty obvious. Pictorially, we may think of


CALCULUS OF PHYSICS 158

TABLE I In fact, for the simpler functions it is pos-


sible to obtain the integral without adding
t Interval o to 1 sec 0 to 2 sec 0 to 3 sec areas: The means we have already used of look-
ing for a function whose derivative is the func-
tion in hand is one method of doing so. Of
Area 16 64 144 course, with the advent of high~peed digital
computers, the task of adding up many little
strips to calculate the integral numerically is
the integral of a function as the area under a reduced to preparing a program to control the
curve giving the graphical representation of the computer-and the program will work for any
function. In the general case, the notation reads function which can be tabulated.
Sample integral formulas which can be used
are
Jf(X) dX,
JdX = x + c, J x dx = ~x 2 + c,

J
where the dx performs some of the functions
in the derivative notation: It identifies the in-
dependent variable and implies how the opera-
ax2
ax dx = -
f xn dx =--x
1 n 1
+c +
tion is to be carried out. 2 ' n+1 '
To give an example, let us return to Fig. 7
(n *-1)
f f
and calculate the area under the curve. Table 1
contains the results for the area up to 1 sec, up cos 0 dO =- sin 0 + c, sin 0 dO = cos 0 + c,
to 2 sec, etc. It is clear that the numbers in the
"area" row can be obtained from Eq. (3) by
evaluating it for t = 1, 2, 3 sec, respectively. feXdx=ex+c, f e-xdx=-e-x+c,
What of the integral as area when the curve
is not as simple as our example? Even the case
graphed in Fig. (6) appears to be beyond the where c is a constant which cannot be deter-
definition. We are rescued from this dilemma mined in the integration.
by recalling that we deal with continuous func- McALLISTER H. HULL, JR.
tions, so we may employ as small a .!lx as we
please. Thus the integration becomes the sum- References
mation of many areas whose bases are .!lx and
heights the values of f(x) at the point in ques- 1. Hull, M. H., "The Calculus of Physics," New York,
tion. Some error will remain: The lined areas W. A. Benjamin, 1969.
shown in Fig. 8 will not always cancel as they 2. Goldstein, L. S., Lay, D. C., and Schneider, D. I.,
must for exact total area calculation. That is "Calculus and its Applications," New Jersey
our calculation of the area of the strip ~ Prentice Hall, 1980. '
f(x) . .!lx omits the piece with vertical shading 3. Thomas, G. B., and Finney, R. L., "Calculus and
lines and incorrectly includes the piece with Analytic Geometry," Massachusetts, Addison-
horizontal shading. However, as .!lx becomes Wesley, 1980.
smaller, these two pieces will come nearer and 4. Goodman, A. W., and Saff, E. B., "Calculus: Con-
nearer to canceling for each strip. The penalty cepts and Calculations," New York, Macmillan,
for increased accuracy is increasing the number 1981.
of strip areas to obtain and sum. Cross-references: DIFFERENTIAL EQUATIONS IN
PHYSICS, MATHEMATICAL PRINCIPLES OF
QUANTUM MECHANICS.

CALORIMETRY
Calorimetry is the science of measuring the
fix) quantity of heat absorbed or evolved by matter
,, when it undergoes a change in its chemical or
:::
I I ,
physical state. The apparatus in which the mea-
I I I surement is performed is a calorimeter and the
, ' ,-tox
I' ,
experimenter is frequently referred 'to as a
.., I
calorimetrist.
,' ,
, ' I
, I , When matter is involved in a chemical or
, I ,
, I physical process, its total energy content is
x usually altered. The difference in energy be-
tween its initial and final states, .!lE, must be
FIG. 8 transferred to, or from, the environment of the
159 CALORIMETRY

system. This energy exchange between the sys- nature of the processes suitable for calorimetric
tem and its environment is in the form of heat study. However, the basic principles are general,
or work or both. In calorimetry, the energy and their consideration constitutes a common
exchanged as heat is quantitatively evaluated. requirement in practically all designs. Suitable
The heat absorbed by the system, q, is related devices and procedures for three essential mea-
to the work done by the system on its environ- surements are usually required, but one or two
ment, W, and the increase in internal (total) can sometimes be omitted by operating under
energy of the system, t:.E, by the thermody- certain restrictions. The measurements are; (1)
namic relationship the temperature of the calorimeter and its con-
tents, (2) the quantity of energy that is added
q = t:.E + w. (1) to the calorimeter from an external source, and
When calorimetric measurements are performed (3) the quantity of heat that is exchanged be-
at constant pressure and only pressure-volume tween the calorimeter and its environment.
work is involved, q is equal to the increase in Most calorimetric operations involve a tem-
heat content or enthalpy t:.H. Most calorimetric perature change, since the heat liberated (or
measurements are performed under these condi- absorbed) during the process is stored in the
tions , but when other conditions are imposed, calorimeter and its contents by virtue of their
appropriate consideration must be made in the combined heat capacity. Thermocouples, ther-
thermodynamic treatment of the data. mopiles, and resistance thermometers are com-
The process selected for calorimetric study monly used for temperature measurements. The
may be a simple change in the physical state of quantity of energy liberated or absorbed in a
matter, such as a change in temperature of the calorimetric process is most commonly evalu-
material, or it may consist of a series of complex ated in terms of electrical energy. This is done
chemical reactions such as are encountered in by three similar methods. (I) In an exothermic
the combustion of many fuels. In fact, nearly process where heat is liberated, the calorimeter
any process involving a chemical or physical is cooled to the original temperature; the tem-
change in matter might well become a necessary perature rise is then duplicated using an electri-
subject for calorimetric investigation. cal resistance heater. (2) The heat absorbed in
Calorimetric determinations of energy changes an endothermic process is supplied by an elec-
are essential in many theoretical and practical trical heater at such a rate as to keep the tem-
problems. Heat capacity or specific heat data perature constant. (3) In heat capacity measure-
are vital to the design of heat exchange equip- ments an electrical heater supplies known
ment. The thermal properties of steam and amounts of energy to the sample. The resultant
certain metals are major considerations in the temperature change is then monitored. Elec-
design of modern boilers and turbines. The trical energy and temperature can be measured
heats of combustion of fuels are essential in very accurately by modern methods.
rocket, engine, and gas turbine design. The heat The quantity of heat exchanged between the
liberated by chemical reactions must be con- calorimeter and the environment is a more diffi-
sidered in the development of chemical process cult problem. When two adjacent bodies (such
equipment. Often the equilibrium constant re- as a calorimeter and its environment) are not at
quired to determine directions and extent of exactly the same temperature, heat is transferred
chemical reactions is most conveniently ob- from the warmer to the cooler body. This trans-
tained by a simple calculation from the free fer is made by three major processes: (1) gaseous
energy change t:.F. For a great many processes, convection, (2) radiation, and (3) conduction.
numerical values of t:.F can be obtained from Gaseous convection can be completely avoided
the change in heat content t:.H, and the en- by evacuating the space between the calorimeter
tropies of the participating substances S, using vessel and its environment. When evacuation is
the thermodynamic relationship impractical, convection can be minimized by
suitable geometrical considerations in the design
t:.F = t:.H - Tt:.S, (2) of the calorimeter. It is very important to avoid
or at least minimize convection, since the heat
where T is the absolute temperature. The en- transported is a complex function of the tem-
tropies of the individual substances can generally perature difference and an accurate evaluation
be evaluated from heat capacity measurements is impossible. For small temperature differences,
that extend to very low temperatures. In addi- radiation is usually not a serious problem at low
tion to these primarily practical design consid- temperatures but is a major contributor to heat
erations, calorimetric measurements can provide exchange at elevated temperatures. Heat ex-
information about the microscopic parameters change by radiation can be limited to a few
of a material, such as its structure or the energy percent of the blackbody (maximum) values by
levels of its electrons and atoms. This informa- the use of suitable reflecting surfaces on the
tion in turn can provide insight into the funda- outside of the calorimeter and on the adjacent
mental interactions in the material. environment. Conduction by air or other gases
The design and constructional details of cal- is also usually minimized by evacuating as much
orimeters vary widely because of the diversified as possible of the space between the calorimeter
CALORIMETRY 160

proper and its environment. Conduction in solid may, in particular, be used by chemists to pre-
materials, used for supporting the calorimeter dict chemical reactions. The calorimetric vessel
and for electrical leads is optimized by proper consists of a vacuum-tight metal container in
choice of materials and geometrical design. good thermal contact with an electrical resis-
In the absence of convection and for small tance heater and a thermocouple or resistance
temperature differences, the heat transferred, thermometer. The sample under study is sealed
Q, is predominantly via conduction and is es- in the container along with a small amount of
sentially proportional to the temperature dif- gaseous helium. The helium aids in attaining
ference ~ T and time t, in accordance with thermal equilibrium at low temperatures be-
Newton's law of cooling, cause of its high thermal conductivity. The
calorimetric vessel is suspended in an evacuated
Q = k~Tt. (3) chamber by some material, such as a strong
thread, having low thermal conductivity. This
There are two approaches chosen currently. For chamber is often within a massive copper block
large samples, this heat exchange is minimized which provides a uniform and stable thermal
by reducing the quantities on the right-hand environment. The temperature of the protective
side of Eq. (3). The constant k is a measure of block is kept at a temperature near that of the
the thermal link to the environment and is mini- calorimetric vessel. The heat exchanged is evalu-
mized by design considerations. Adiabatic cal- ated by observing the temperature difference
orimeters maintain the calorimeter and the en- ~T as a function of time and applying Eq. (3)
vironment at the same temperature so that the in an integrated form. The constant k is evalu-
temperature difference ~ T equals zero. In this ated by observing the change in temperature of
approach, corrections are made when the heat the calorimeter vessel and its contents under
transferred is not exactly zero. For smaller equilibrium conditions. During this rating period
samples, the connections necessary for measur- the temperature change is due entirely to heat
ing temperature and supplying electrical energy exchanged with the environment. Some calo-
are a source of significant heat exchange and rimetrists use the adiabatic principle and main-
the corrections become rather large. The second tain the temperature of a protective shield as
approach therefore, instead of minimizing the near as possible to that of the calorimeter. This
heat transferred, actually uses the heat exchange procedure results in the elimination of heat
to measure the heat capacity. The rate of change exchange corrections but is not entirely free
of the temperature of the sample d~T/dt times from objections. Although low-temperature
the heat capacity of the sample, C, is equal to calorimeters are used chiefly for heat capacity
the rate of heat exchange: determinations, heats of transition, heats of
fusion, and heats of vaporization are also
dQ Cd~T
measured.
-=---- (4)
dt dt At very low temperatures, calorimetry is used
to measure energies associated with the order-
By accurately measuring the quantities on the ing of the magnetic moments of nuclei, transi-
right hand side of Eq. (3), Eq. (4) may be used tions into the super conducting state of metallic
to obtain C. elements and compounds, and other phenomena
In calorimeters containing liquids, there is a which require only small amounts, or quanta,
possibility of a fourth mechanism for transport- of energy to be activated. A decade decrease in
ing heat. This method involves the transport of temperature means a decade decrease in the size
matter from the calorimeter and its subsequent of the energy quanta which can be studied, and
condensation on the surrounding surfaces. The in this sense the range from 0.1 to 0.01 K covers
effect can be avoided by keeping the environ- a range equivalent to that between 1000 K and
ment warmer than the liquid or by completely 100 K (Le., from far above, to far below room
enclosing the liquid. However, even in a com- temperature at 300 K). Temperatures down to
pletely enclosed system the possibility of vapor- 0.3 K are achieved by reducing the pressure
ization into the space above the liquid with over a bath of the helium isotope of mass 3, a
increasing temperature must be considered for relatively simple process. Using a more complex
volatile liquids. process, a helium dilution refrigerator produces
There are many different varieties of calorim- millikelvin temperatures and thus heat capacities
eters, each being particularly suited for a specific can be measured in the decade around 0.01 K.
type of measurement. Some general features of One method which has been used to measure
several representative types are discussed below. the heat capacity of small samples such as thin
Low temperature calorimetry, used down to films weighing less than 1 mg (10- 4 -1 0- 6 moles)
the temperatures available with liquid and solid at low temperatures is the so-called relaxation
hydrogen, -10 K, has become an important method. The sample is heated by an electrical
source of heat capacity data for the evaluation heater to a small ~T above its environment.
of entropies of substances from measurements The electrical leads provide a previously mea-
extending from near the absolute zero to room sured heat link to the environment, so that when
temperature or slightly above. This information the heater is turned off the temperature of the
161 CALORIMETRY

sample will relax back to that of the environ- tion of operating at one fixed temperature can
ment with a time constant proportional to the be removed.
heat capacity of the sample. Here, as in all heat Quantitative measurements of the heat liber-
capacity measurements, the heat capacity of ated (or absorbed) during the solution of a solid
thermometers, heaters, and electrical leads is or of another liquid by a solvent are performed
lumped together with that of the sample. To in solution calorimeters. Heats of solution, dilu-
isolate the heat capacity of the sample, there- tion, and mixing are common determinations of
fore, all of these addenda must be accurately this type. In addition to participating in the
known and subtracted from the total. process under investigation, the solvent is used
An interesting type of measurement is time- as a means of attaining uniform temperature
dependent heat capacity. In some materials, and composition throughout the calorimeter.
it is possible to put energy into only the elec- This feature necessitates stirring, which is usually
trons or only the ion lattice because the time accomplished with mechanically or magnetically
needed for the electrons and the lattice to driven stirrers. Sometimes the calorimeter itself
exchange energy and reach equilibrium is very is rotated. Regardless of the method used, the
long (up to seconds). Measuring the heat capac- quantity of heat introduced by the stirring must
ity as a function of the time probes both the be determined either directly or indirectly and
equilibration processes and the electron and/or a suitable correction must be applied. Another
lattice heat capacities. feature characteristic of solution calorimeters is
The dropping method is the most common of the method of adding the sample. Either it must
the accurate high-temperature procedures for be equilibrated with the solvent in the calorim-
measuring heat contents. This apparatus consists eter, or its heat content relative to the calorim-
of a carefully regulated furnace and a suitable eter temperature must be determined. A com-
calorimeter, such as a Bunsen ice calorimeter, mon method for solids is immersing a capsule
operating near room temperature. The sample containing the sample in the solvent and break-
under investigation is sealed inside a container ing it at the desired time.
that will not undergo chemical reaction at the The heat of combustion of fuels and similar
highest temperature of the measurements. The materials is usually measured by bomb calorim-
sample and container are thermally equilibrated etry. The solid or liquid sample is contained in
with the furnace and then dropped into the a bomb (pressure vessel) containing excess oxy-
calorimeter. The empty container is afterwards gen or other suitable gas under pressure. The
studied in an identical manner and the differ- bomb is immersed in a calorimeter containing a
ence in the two measurements gives the heat liquid, usually water. The reaction is initiated
content of the sample relative to the room tem- by igniting the sample with a measured amount
perature reference. Heat capacities are derived of electrical energy, and the heat evolved is
from a series of such measurements as a func- measured in terms of the temperature rise of
tion of temperature and the thermodynamic the calorimeter. Electrical energy is usually used
relationship to duplicate the temperature rise and thus eval-
uate the heat liberated. However, sometimes a
c =o(H) = [(oH-H o )l standard sample of a substance having a known
p (oT)p oT Jp' (5)
heat of combustion, such as benzoic acid, is
used to calibrate the apparatus. In bomb calo-
rimetry, corrections to standard conditions must
where Cp is the heat capacity at constant pres- be applied (Washburn corrections) since the sys-
sure, H the heat content, Ho the heat content tem is under pressure and because solutions are
at the reference temperature, and Tthe absolute usually formed.
temperature. There are many other important types of
The Bunsen ice calorimeter is an example of calorimeters, such as flow calorimeters, micro-
an isothermal calorimeter that is operated at a calorimeters, flame calorimeters, etc. Nearly
fixed temperature. The calorimeter is usually any process can be studied by the investigator
surrounded by ice, making it also adiabatic and who is ingenious enough to devise the appro-
thus free from heat exchange. Bunsen's design priate apparatus and who has the resources and
makes use of the very large difference between patience to undertake an extensive project.
the specific volume of ice and water. The calo- Although calorimetric measurements are in
rimeter contains a closed chamber which is full general time-consuming and tedious, they are
of ice and water. A pool of mercury is main- essential for a fundamental and practical under-
tained in the bottom of the chamber, and as the standing of many important chemical and
ice melts, additional mercury enters and keeps physical processes.
the chamber full. The calorimeter has a universal
calibration in the form of energy per unit mass T. H. GEBALLE
of mercury. In early versions, the quantity of F. HELLMAN
ice melted was used as a measure of the heat
liberated in the calorimeter. By replacing the Cross-references: ENTROPY, HEAT, HEAT CAPAC·
ice with other suitable substances, the restric- ITY, HEAT TRANSFER, THERMODYNAMICS.
CAPACITANCE 162

CAPACITANCE (DF = dissipation factor). The power factor IS


related to DF by
Definition and Fundamental (Quasi) Static
Properties If a constant voltage V[V = volts) PF=DFV1/(1 +DF2) (10)
is applied between two conductors insulated
from each other, electrical charges Q [As = cou- The loss factor = (DF) . E is proportional to the
lomb) are so distributed that the conductors energy loss/cycle/voltage 2 /volume.
form equipotentials. The measure for the charges Capacitors are used for: (1) frequency de-
stored is the capacitance C [F =farad = 106 fJ.F = termining or selective networks [LC circuits and
10 9 mfJ.F = 10 12 pF) of the capacitor so formed. filters; cf. Eq. (6»); (2) energy storage [Eq. (2»),
for instance, the capacitor being slowly charged
Q =G- V (1)
and quickly discharged [Eqs. (9) and (10») in a
(Q in coulombs, C in farads, V in volts.) It is short burst of energy; and (3) integrators and
often more convenient to express this storing differentiators [in conjunction with R; cf.
capacity in terms of energy Eq. (3»).
Geometry Uniform Fields. For a uniform
E=(!)V2C (2) field as, for instance, given between two closely
spaced parallel metallic plates (area A in square
(E in watts.) C is defined by meters, distance I in meters) and disregarding
edge effects
C=~fidt or C=IjdV (3)
C[F) =Eo EAII (11)
V dt
For capacitor discharge (Eo = starting voltage), with E oE = dielectric constant of free space =
(361T X 10 9 )-1 [F/m) and E = the relative di-
(4a) electric constant (dimensionless) of the material
between the plates.
and for capacitor charge (Eb = battery voltage) Discontinuity in Uniform Fields. If, in the
ec/Eb = I - £-t/7 (4b) above case, the dielectric consists of two sheets
of different materials with E, (having thick-
with the time constant T ness II) and E2 (having thickness 12 )
T=CR (5) El =2 (12)
where R is the resistor through which the ca- E2 E,
pacitor is being (dis)charged.
For sinusoidal excitation of angular frequency where En is electric field strength = Vn/ln . (13)
w, the reactance of the lossless capacitor is Equation (12) is of great practical significance
if one of the E 's is very high, since then the
V[V)/I[A) = (-)jX[n) = l/jwC[n) (6) sheet with the low E carries nearly all voltage
(for this reason, for example, higher-E ceramic
If, in electrical circuits, capacitors are connected capacitors have to have fired-on electrodes).
in parallel, their capacitances add Nonuniform Fields. The most common ca-
pacitance with nonuniform fields is the coaxial
(7a) capacitor (inside diameter d, outside diameter
D). Its capacitance is

If capacitors are connected in series, their elas- C'[pF/m) = 55.6E/ln (D/d) (14)
tances (the reciprocal of capacitance, S) add
Extreme cases of nonuniformity, often causing
(7b) corona, exist on the sharp edges of plate ca-
pacitors. Remedy: For field equalization, de-
form plates to follow equipotential lines of half
Losses in the dielectric may be expressed by a potential in a capacitive field with twice the
complex relative dielectric constant spacing of the original, flat plates (Rogowski
profile).
E =E' - jE" (8) Dielectrics The dielectric "constant" is often
not constant but a function of crystal orienta-
where E'IE" = Qe determines the dielectric qual- tion (anisotropy), temperature, voltage, and
ity factor. For Qe > 10, the loss resistance of frequency (dispersion).
the capacitor is given by The objective of developing a good fixed ca-
(1/wC)/rs = Qe = R p /(1/wC) (9)
pacitor is to have the largest capacity in the
smallest possible volume for a given operating
where rs is the equivalent series and Rp is the voltage. Ideally, the capacitance is not to change
corresponding parallel loss resistance. Qe = l/DF with voltage, temperature, time, mechanical
163 CAPACITANCE

stress, humidity, and frequency, and (in most


cases) is to have a minimum of losses. The great-
est capacitance can be achieved by maximizing
£ (Case a) and A (Case b), and minimizing I
(Case c) [cf. Eq. (11)].
Typical for Case (a) are ceramic capacitors
made in discoidal or tubular form (and now re-
cently also as coaxially laminated capacitors).
There are four classes of ceramic dielectrics:
(I) Semiconducting, so-called layerized, ce-
ramics with dielectric constants above 105.
These can be used only for very low voltages FIG. 1. High-frequency behavior of capacitors.
(transistor circuits), are quite lossy, and have a
strong dispersion of £ in the megacycle range.
(2) High-£' dielectrics (mostly barium tita-
nates) with £' in the order of 6000. These are of failure" investigations. Typical failure mecha-
quite temperature- and voltage-sensitive (non- nisms are, for instance, precorona discharge in
linearity and hysteresis) and are used as guaran- adsorbed air layers, or silver migration.
teed-minimum-value capacitors (GMV). N on-ideal Behavior at Higher Frequencies
(3) So-called stable dielectric capacitors with Equation (6) presumes ideal conditions. An ac-
an £' of 2000 or, if doped with rare-earth ma- tual capacitor, particularly if considered over
terials, with an £' of 3000 to 4000. These are many decades of frequency, and more so, if
much less dependent on temperature and applied used as a shunting element across lines, is much
dc voltage. more aptly describable as a three-terminal net-
( 4) Linear, high-Q (in the order of several work. Figure I marshals the four key deviations
thousand) temperature-compensating capacitors from the ideal behavior:
made with a prescribed (P positive, N negative, a. At very high frequencies, inductive input-
or NPO) temperature coefficient of the capacity output coupling may override the shunting ef-
for incorporation in temperature-stable tuned fect of the capacitor. Remedy: use feed-through
circuits (compensation of the temperature co- capacitors where input and output leads are
efficient of the inductance). The £ of such ma- separated by a shield.
terials lies between 10 and 100. b. Again, at high frequencies, unless as a
Case (b) (large A) is exemplified best by remedy a feed-through configuration is se-
stacked plates [silvered mica (for military use; lected, a series L in the shunt branch results in
excellent Q, temperature coefficient about - 100 the capacitor behaving as an inductor above
ppm) or ceramic (monolithic)] or rolled di- the resonance frequency thus determined (w =
electric strips [polystyrene (excellent Q; com- (LC)-1/2 ).
mercial use; also about - 100 ppm T.C.); c. If the capacitive reactance at high frequen-
"Mylar"; oil-impregnated paper; "Teflon" etc.]. cies becomes very small, the resulting transfer
Case (c) (small l) is represented by polarized impedance may be determined by the series loss
capacitors (to make them unpolarized, two ca- resistance. A typical case is a tantalytic capacitor
pacitors are connected in series in polarity op- behaving this way. Remedy: It must be paralleled
position, usually in the same housing), and it by a smaller capacitor of less high frequency
includes the older, larger, and cheaper types losses.
like the aluminum foil electrolytics. The newer, d. At very high frequencies, let us say 100
more costly, but much smaller, types (having MHz and above, even ceramic feed-through ca-
much less leakage current) are tantalum oxide pacitors start to resonate internally (transmis-
capacitors. Ta20s stands continuously the ex- sion line effect) rendering them useless above
traordinary field strength of 3 X 106 Vfcm certain frequencies. Remedy: See, for instance,
with an £' of 25, I being measured in angstroms. bibliography I.
The Q is about 100. For microminiaturization, Nonlinear Capacitance The dielectric of
silicon monoxide or dioxide or tantalum oxide highly nonlinear capacitors is the depletion
films of very smalll are utilized. layer formed at the p-n junction by application
Rating The reliability of a capacitor is pre- of proper bias. These back-biased diodes have a
dominantly determined by the dielectric and reasonable Q and are widely used as nonlinear
the seal of the housing. One has to distinguish reactances in parametric amplifiers for VHF
between failure value and withstand value. The and higher frequencies and for varactor tuning
failure value of dielectric strength is the voltage in TV receivers. Nonlinear ceramics are less
at which the material fails and is conventionally suitable for this purpose because of their high
given as the average failure voltage. In contrast, losses and great temperature dependency.
the withstand value is a voltage below which no "Parasitic" Capacitances Capacitances playa
failure can be expected. significant role in voltage limiters used exten-
Deterioration of capacitors with time (aging) sively for interference control. If one employs
can be greatly reduced by systematic "physics sparkgap limiters, one has the advantage of a
CAPACITANCE 164

small shunt capacity. Thus, with R = Zo (char- is now set aside until the pressure is reduced to
acteristic impedance of line), the time constant P2 and the volume is expanded to V2 • The gas
is small and the bandwidth of the line is large- is now in state 2 characterized by P2 , V 2 , and
as it often needs to be. But sparkgaps have dis- Th. The transition from one thermodynamic
advantages like slow response time, possibility state to another is called a thermodynamic
of sustaining arcs, etc. Solid state voltage limiters process. A process is called reversible if done so
(single-crystalline Si Zeners or polycrystalline slowly that no temperature differences arise
metal-oxide varistors) do not have such draw- within the gas and if the piston moves without
backs, but have large e's which may be reduced friction. The cylinder base is kept at tempera-
(a) or exploited (b). To maintain large band- ture Th, so during this process the temperature
width, a small series capacitance is to be added of the gas remains at Th; that is, it is an iso-
(a). On the other hand (b), metal-oxide varistors thermal process. When this is shown on a pres-
have variable resistance, very high below the sure-volume diagram, the process appears as a
knee voltage and very high above it, and in portion of the Th isotherm. To hold the tem-
parallel a considerable capacitance. Their dielec- perature constant, some heat energy Qh must
tric constant is on the order of 3000 and is not flow from the reservoir into the gas. In ex-
bias sensitive and much less temperature de- panding against the weight on the piston, the
pendent than corresponding ceramics. Feed- gas does work W 12 which is represented on the
through capacitors made of such material are P- V diagram by the crosshatched area. The cyl-
not yet commercially available, but they are inder is next moved to an insulated pad where
very small integrated interference suppression the pressure is further decreased by setting aside
devices that filter at low voltages and limit at more weights, and the gas again expands. No
high voltages. heat energy flows into the gas from the outside
during this expansion, and this is called an
H. M. SCHLICKE adiabatic process. The temperature decreases
and when it reaches T e , the temperature of the
Reference second reservoir, the process is stopped. The
gas is now in state 3 characterized by P , V 3 ,
Schlicke, H. M. "Electromagnetic compatibility," and Te. During this process the gas does3 work
Marcel Dekker, New York, 1982. W23 against the load. The cylinder is then
moved to the second reservoir where enough
Cross-references: CIRCUITRY, DIELECTRIC THE- weights are slowly replaced to bring the gas to
ORY, POTENTIAL. state 4, the point on the Te isotherm from
which state I can be reached by an adiabatic
CARNOT CYCLES AND CARNOT ENGINES heat process. During this isothermal compression,
energy Qe flows from the gas into the
Background Steam engines were first built to reservoir, and the piston does work W 34 on the
do heavy, tedious jobs such as pumping water gas. To show that this work is done on the gas
out of deep mines. Not only could they pull while previously the work was done on the
harder than a team of horses, but they did not piston, the appropriate portion of the cross-
get tired. These engines were built by rule of hatched area has been removed. The cylinder
thumb by practical men, for at that time there is finally placed on an insulated pad, the re-
was no thermodynamics to guide the design. maining weights are slowly added, and the gas
Still engines could be rated by how much water returns adiabatically to state 1. Again the fact
was pumped for a bushel of coal burned. For a that the piston does work W4I on the gas, is
century the quality of these engines improved, shown by the removal of the cross-hatched area.
and the obvious question was whether there One Carnot cycle is now completed.
was any limit to this improvement. Carnot's Definition and Characteristics A Carnot cycle
work addressed that question. He set out to is any reversible cyclic thermodynamic opera-
devise a heat engine against which all imagin- tion composed of four processes which are al-
able heat engines could be compared. ternately isothermal and adiabatic. (The work-
Description The engine Carnot devised is of- ing substance need not be an ideal gas, but
ten represented, as shown in Fig. 1, by a device traditionally this is used in discussions.) Since
which uses an ideal gas as its working substance. no natural process is strictly reversible, the
A quantity of gas is confined in a cylinder with Carnot cycle is an idealization.
a wall so well insulated that no heat can flow Although heat energy has entered and left it
through it. The cylinder's heat-conducting base and work has been done on it, the gas under-
rests on the first reservoir, whose constant tem- goes no detectable physical changes for having
perature is Th, and the gas assumes this tem- passed through the Carnot cycle. Heat energy
perature. A weighted insulating piston holds the Qh was removed from the hotter reservoir, and
pressure of the gas at PI at which pressure its a smaller amount Qe flowed into the cooler
volume is VI. The gas is then said to be in reservoir so that the heat en~rgy budget of the
thermodynamic state I characterized by PI, VI, gas increased by Q = Qh - Qe. The net work
and Th. Little by little the weight on the piston done by the gas on the piston is W = W12 +
165 CARNOT CYCLES AND CARNOT ENGINES

p.

I
1
~ 14
I I
I~W~ 3'Tc
VI V. V...
FIG. 1. The steps of a Carnot cycle and the corresponding pressure-volume diagrams. The piston location at
the start of each process is shown in solid lines; at the end, in dashed lines. The pressure on the gas can be
estimated by the area of the weight shown on top of the piston. The dots indicate that the molecules of the ideal
gas are close together when the volume is small and are farther apart when the volume is large.

W23 + W34 + W41 , and the mechanical energy only on the temperatures of the two reservoirs.
budget of the gas decreased by W which is rep- When these temperatures are measured on the
resented on the P-V diagram by the area en- absolute scale in Kelvin, then 77 = I - (TcITh).
closed by the phase lines. The first law of (2) No heat engine operating in cycles be-
thermodynamics requires that W - Q = 0 so no tween two reservoirs can have a greater ef-
energy residue is left in the gas. If the cycle is ficiency than a Carnot engine operating between
traversed as described, heat energy Qh is re- those reservoirs.
moved from the higher-temperature reservoir. The Carnot engine is thus a standard against
Part of this remains in the form of heat energy which other heat engines can be compared.
Qc as it flows into the cooler reservoir, and part Historically it was the source of a number of
of it is converted to mechanical energy as the ideas that are now basic to the study of thermo-
work W done on the piston . A Carnot cycle dynamics.
operated in this direction is called a Carnot Although the Carnot cycle is the most effi-
engine. If the direction of operation were reo cient heat engine, it does not produce any
versed, the cycle would be called a Carnot reo power P. The requirement of reversibility means
frigerator. In such a device mechanical energy, that a very long time t is required to complete
the work W done by the piston, is converted even one cycle. Thus the power (defined as P =
into heat energy which combines with the heat Wit) is zero. To speed up the cycle a tempera-
energy Qc which flows from the cooler reservoir ture difference between the working substance
into the gas. All this heat energy Qh flows out and reservoirs can be introduced. Under this
of the gas into the hotter reservoir. condition the power efficiency 77' is found 1 to
The efficiency 77 of any engine is the fraction be 77' = I - (TciTh )1/2. The efficiency of real
of the heat energy input Qh which is converted heat engines is described rather well by this
into mechanical energy W; that is, 77 = WIQh. equation.
The following properties of Carnot engines are History of the Carnot Cycle In 1824 Sadi
derived in many textbooks of thermodynamics. Carnot 1a (1796-1832) analyzed a heat engine
(I) The efficiency of a Carnot cycle depends assuming that heat can perform mechanical
CARNOT CYCLES AND CAR NOT ENGINES 166

work in falling from a higher temperature to a lao Carnot, S., "Reflexions sur la Puissance Motrice
lower just as water can do work falling from a du Feu," Paris, Bacheller, 1824. Reprinted (to-
higher level to a lower, and assuming that no gether with Refs. 2and 4 below) by Dover Pub-
heat would be lost just as no water was lost. lications, New York.
(His work preceded by more than 20 years the 2. Clapeyron, E., reprinted in Ann. Physik [2] 59,
theory of Joule and Helmholtz on the mechan- 446-451,566-586 (1843). See Ref. 1a.
ical equivalence of heat.) In his study he pro- 3. Thomson, W., "Mathematical and Physical Papers,"
posed an ideal heat engine that operated in a Vol. I, pp. 100-106, Cambridge Univ. Press, 1882.
continuous cycle and was reversible. He then 4. Clausius, R., Ann. Physik [2] 79,368-397,500-
showed that it is impossible in a cyclic opera- 524 (1850). See Ref. lao
tion to obtain work from a single constant- 5. Thomson, W., Ann. Physik [2] 79, 174-316
temperature heat source and that no more work (1850).
can be obtained from any process than is re- 6. Clausius, R.,Ann. Physik [2] 93,481-506 (1854);
quired to reverse it. 125,390 (1865).
His ideas escaped notice until 1834 when 7. Boltzmann, L., "Lectures on Gas Theory" (trans-
Clapeyron 2 recognized their merit, suggested lated by S. G. Brush) Berkeley, Univ. California
some of the details of the device described Press,1964.
above, and plotted its behavior on a P-V dia- 8. Britwistle, G., ''The Principles of Thermody-
gram. Again the ideas were neglected until Wil- namics," Cambridge Univ. Press, 1925. Preston,
liam Thomson (later Lord Kelvin) learned of T., ''The Theory of Heat," Third edition, London,
Carnot's work through Clapeyron's memoir. In The Macmillan Co., 1919.
1848, Thomson described 3 how a Carnot engine 9. Shortley, G. H., and Williams, D. E., "Principles
could be used to define a temperature scale that of College Physics," Englewood Cliffs, N.J.,
was absolute in the sense that it did not Prentice-Hall, 1959.
depend on what thermometric substance was Zemansky, M. W., "Heat and Thermodynamics,"
used. It was based on a series of Carnot 4th Ed., New York, McGraw-Hill Book Co., 1957.
engines, each of which did the same amount 10. Sandfort, J. F., "Heat Engines," Garden City,
of work. This was the first important idea N.Y., Doubleday & Co., 1962.
drawn from a study of the Carnot cycle. 11. Wilson, S. S., Sci. Am. (August 1981, 134-145.
In 1850 Clausius4 , who learned of Carnot's
ideas through Thomson and Clapeyron, showed Cross-references: HEAT, THERMODYNAMICS.
how Carnot's assumption (no loss of heat) could
be reconciled with the newer views of Joule CAVITATION
and Helmholtz (which now form the basis of
tne tirst law of thermodynamics). It was only Cavitation is defined as the formation of one or
required that the engine exhaust less heat en- more pockets of gas (or cavities) in a liquid. *
ergy, by the amount of the work done, than it Here the word formation can refer, in a general
accepted. Thomson independently reached the sense, both to the expansion of a newly created
same conclusionS in 1851. cavity or of a preexisting one to a size where
In 1854 Clausius 6 in his study of the Carnot macroscopic effects can be observed. The cav-
cycle identified the physical property he later ity's gas content refers to the liquid's vapor,
named "entropy." This was the second im- some other gas, or combinations thereof. Some-
portant idea drawn from study of the Carnot times these cavities are referred to as bubbles or
cycle. In 1877 Boltzmann 7 took the principle voids, depending on the relative amount and
of Clausius that real processes evolve naturally type of gas.
toward states of higher entropy (which is the Cavitation usually occurs in response to a re-
second law of thermodynamics) as a basic point duction of the pressure sufficiently below the
in his theory and thus had no need to consider vapor pressure or the gas saturation pressure of
the Carnot cycle from which that principle was the liquid or to the elevation of the tempera-
derived. It is now common practice to discuss ture above the boiling point, although chemical,
thermodynamics axiomatically rather than his- electrical, and radiation-induced phenomena
torically so that the Carnot cycle no longer can be important.
plays the important role it once did. The most Although liquids, by definition, cannot sus-
complete discussions are found in the older tain shearing forces without flow, they can sus-
volumes,8 but some recent books do describe tain uniform tension. The tensile strength of
the cycle in detai1. 9 An historical account lO a liquid refers to the limiting tension, or nega-
with an elementary presentation of the theory tive pressure, that a liquid can sustain before
and a review l l of Carnot's work with excerpts cavitation occurs. Ultimate cavitation thresh-
from his original paper have been published. olds can be reached only in the absence of weak
spots or cavitation nuclei (usually gas or vapor
ROBERT A. LUFBURROW pockets) which exist on solid surfaces of con-
References tainer walls and suspended impurities, or which
1. Curzon, F. L., and Ahlborn, B., Amer. Journ. *The first two paragraphs are from Ref. 3 and are re-
Physics, 43,22-24 (Jan. 1975). printed with the permission of Academic Press.
167 CENTRIFUGE

are generated by the interaction of ionizing phenomena can be attributed to the concentra-
radiation with the tensed liquid (see also BUBBLE tion of energy which occurs when bubbles grow
CHAMBERS). to at least twice their original size in one or no
Because of the prevalence of these weak spots more than a few acoustic cycles. For this to
(typical tap water has in excess of 10,000 par- occur, peak acoustic pressures must be at least
ticles per cubic centimeter), the ultimate tensile an atmosphere greater in magnitude than the
strength of a liquid, theorized to range from hydrostatic pressure, and the acoustic frequency
150 to 1500 atmospheres for most materials must be sufficiently low (as must be the liquid's
that are liquid at room temperature, has rarely viscosity) so as to allow for adequate bubble
been measured. Commonly measured tensile growth during the tension part of the cycle.
strengths measured by static or dynamic means There is also a relatively more stable and
are typically two to three orders of magnitude periodic type of acoustic cavitation that occurs
lower (approaching the vapor pressure of the when a bubble filled with relatively inert gas
liquid). Static tensile stresses can occur in the (often air, in practical circumstances) oscillates
laboratory, in a liquid filled capillary that is with an amplitude that is small compared to the
spun at sufficient speed, and also in nature, it bubble size. It may slowly grow by a process
is presumed, in the xylem paths of tall trees, as called rectified gas diffusion when the acoustic
water transpiring from the leaves is drawn up pressure is only a fraction of an atmosphere. If
from the tree's roots. the bubble reaches a resonant size (for example
Dynamic tensile stresses are produced hydro- 6500 Hz for a I-millimeter-diameter air bubble
dynamically and acoustically. Hydrodynamic in water at an ambient pressure of one atmo-
cavitation results from the local reduction of sphere), it will go through much larger oscilla-
pressure produced by the motion of a body in tions, producing acoustic emission in the liquid
a liquid (Bernoulli effect), as would be found not only at the driving frequency but also at its
in the vicinity of a rotating propeller of a ship harmonics (2f, 3f, ... ), subharmonics ([/2,[/3,
or near the surface of a fast submarine travel- ... ), and ultraharmonics (2f/3, 3f/4, ... ). The
ing at an insufficient depth for the hydrostatic bubble may then become unstable and transient,
pressure to suppress the flow-induced tension. eventually breaking up into smaller bubbles.
Acoustically generated cavitation occurs when The flow-induced effects induced in the liquid
a sufficiently energetic acoustic wave produces by this cavitational activity are used in a variety
periods of high negative pressure exceeding the of practical applications, from removing em-
ambient hydrostatic pressure, as can happen bedded dirt (as with ultrasonic cleaners) to im-
near the surface of a high-power acoustic trans- proving the efficiency of chemical reactions.
mitter used for long range communication in
the ocean (see SONAR). ROBERT E. APFEL
In both hydrodynamic and acoustic cavita-
tion, the size of cavitation nuclei and the de- References
gree they are wetted by the liquid will dictate
the cavitation threshold (which is usually very Knapp, R. T., Daily, J. W., and Hammitt, F. G., "Cavi-
low in the hydrodynamic case); the number of tation," New York, McGraw-Hill Book Co., 1970.
cavitation nuclei will dictate the amount of Neppiras, E. A., "Acoustic Cavitation," Physics Re-
cavitational activity. And in both hydrody- ports 61,160-251 (1980).
namic and acoustic cavitation, the duration of Apfel, R. E., "Acoustic Cavitation," in Edmonds, P.,
the period of tension will determine how big Ed., "Ultrasonics," Vol. 19, pp. 356-413, New
the bubbles grow before the bubble will col- York, Academic Press, 1982.
lapse under the positive ambient hydrostatic
pressure. Inside a collapsing bubble the pres- Cross-references: ACOUSTICS, ULTRASONICS, LI-
sure and temperature will rise, with the severity QUID STATE.
of the collapse increasing with maximum bub-
ble size and with the scarcity of trapped per-
manent gas, which tends to cushion the collapse. CENTRIFUGE
Collapsing bubbles can generate in the surround-
ing liquid shock waves and hydrodynamic jet- The centrifuge is a device consisting of a rotat-
ting. Such effects in the vicinity of a solid sur- ing container in which substances of different
face can lead to severe erosion, as is commonly size, shape, or density are separated by the
found on screw propellers of ships. centrifugal field developed along the radial
In addition to these effects, a number of in- direction. In most cases the centrifuge is used
teresting phenomena can occur in acoustic with fluids for producing sedimentation of
cavitation. Temperatures inside a collapsing macromolecules of molecular weight M> 10 4
bubble in fairly outgassed liquid can reach grams/mole or daltons. 1 ,2 It has been used ex-
thousands of degrees Celsius and pressures can tensively also in the separation of gaseous com-
reach thousands of atmospheres. Light can be ponents and for determining the strength of
produced (called sonoluminescence) and sono- materials as well as a number of other applica-
chemical effects may occur due to free radical tions. 3,4 Centrifugal fields as large as 10 9 times
formation inside the bubble. These transient gravity have been achieved; however, for most
CENTRIFUGE 168

purposes the fields are of the order of 10 4 -lOs


times gravity. The effective centrifugal force F H
v = - - = (RT
-----
1 )1/2
on a particle of mass m and density p in a fluid of or T N 311'TlaT
density p', is expressed by F = m(p - p')w 2r/p,
where w is the angular velocity and r is the where N is the Avogadro number. As Vor becomes
radial distance of the particle from the axis of larger than v, particles will no longer settle out;
rotation. For a spherical particle, F is opposed i.e., net transport becomes zero at a sufficiently
by the frictional force of the fluid on the par- small centrifugal field so that an equilibrium
ticle according to Stokes' law, 611'Tlav, where Tl distribution of the particles along the column
is the coefficient of viscosity, a the radius and v of solution in the radial direction will be reached.
the speed of the particle (provided that the When this equilibrium state is achieved between
Reynolds number, 2 avp /Tl, is less than unity the tendency to sediment and to back diffuse,
and wall effects are neglected). Thus, the aver- the concentrations at any two levels in the col-
age rate of sedimentation v of spherical particles umn can be related to M by
is described by C2'Y2 M(ap/aC),.,.w 2(rl- r12)
In - - = (3)
111'a 3 (p - p') w 2r = 611'Tlav = fv (1) C1'Y1 2RT
where f is the frictional coefficient. Nonspheri- where the numerical subscripts refer to two arbi-
cal particles can be handled by the introduction trary positions in the radial direction and 'Y is
of appropriate shape factors. Since the measured the activity coefficient. In dilute solutions, as is
quantity in sedimentation is v = dr/dt and the usually the case in analytical work, 'Y ~ 1 and
quantities w, r, p, p', and Tl are also measurable, the practical equation for determining the
a and hence m can be determined. When m is weight-average molecular weight Mw in sector-
expressed in terms of the molecular weight M, shaped cells (to avoid wall effects) is given by
the buoyancy of the particles as a function of
concentration C (mass/volume) is given by 2RT C2 - C 1
(ap/aC),.,., where p is now the density of the so- Mw ~ (ap/aC),.,.w2(r22 - r12) Co (4)
lution; this derivative is virtually constant for
dilute macromolecules in a given solvent of where Co is the initial concentration of the so-
much smaller molecular species. (In the general lution before sedimentation is initiated. The
case for multicomponent solvents, such as when concentrations at radial distances rl, r2 are
salts are added to reduce the effect of charge on normally determined (by various optical means)
the particles, the chemical potential J1 of the at the meniscus and at the bottom of the solu-
solvent mixture is held constant rather than its tion column, respectively. Since charge effects
composition as C changes. S) In dilute solutions may be large with sedimenting macromolecules,
the molar frictional coefficient is RTjD where suppression of this effect is usually carried out
D is the diffusion coefficient, which is mea- by raising the ionic strength (via a neutral salt)
sured independently. Thus, the equation for if the system cannot be studied at the isoelec-
calculating M from the rate of sedimentation, in tric point. In Eqs. (2)-(4) accurate densities as
the absence of substantial charge effects, is a function of C must be available from which
to calculate (ap/aC),.,.;6 the chemical potential
M= RT(dr/dt) (2) J1 of the solvent is maintained constant for this
D(ap/aC),.,.w 2r· purpose by first dialyzing to equilibrium the
sedimentable species against the chosen solvent
The rate of sedimentation in a unit field via a semipermeable membrane. s
(dr/dt)/w 2r is called the sedimentation coef- Preparative Centrifuges For the purification
ficient s, which has the dimension of time. The and concentration of substances in solution or
coefficient s is often expressed in Svedbergs suspension in a fluid, the centrifugal field is
(10- 13 sec), a unit named in honor of T. Sved- made high enough so that appreciable separa-
berg, the pioneer in analytical centrifugation. tion of each material in a mixture is realized.
The value of s or the Svedberg is very impor- Often one substance is sedimented to the bot-
tant in characterizing a substance in solution. tom of the centrifuge cell (i.e., toward the
As noted, the net transport of particles through periphery of the rotor), thereby concentrated
a solvent medium in a centrifugal or gravitational as a pellet or simply enriched relative to the
field is opposed by the tendency to restore the slower sedimenting substances which are pres-
previously distributed condition through the ent. Repeated centrifugation of the redissolved
process of diffusion. Thus BROWNIAN MOTION pellet material then further purifies it from the
from thermal agitation prevents sediment able slower moving impurities. Conversely, the pel-
particles from settling out if v, although finite, leted material may be the contaminant which is
is sufficiently small. The average displacement freed from the medium containing the desired
X of a particle in time T owing to Brownian mo- substance. Centrifuges for these purposes are
tion is X2 = 2DT, and for a sphere of radius a called preparative centrifuges; the variety, con-
the average velocity Vor over a period T is ex- venience, and sophistication of preparative in-
pressed by struments for a wide range of purposes in both
169 CENTRIFUGE

industry and research is great indeed. Equa-


tions (1) and (2) are used for estimating the re-
quired rotor speed, size and style of rotor, and
the time needed for achieving the desired de-
gree of differential sedimentation of mixtures. Analytical Centrifuges When a centrifuge is
These equations hold strictly only when the designed for analytical work it is called an ultra-
sedimentation is radial and so that no turbu- centrifuge. 1 The ultracentrifuge is employed in
lence or remixing occurs. Moreover, the tem- one of two general methods. The first method
perature must be uniform throughout the makes use of Eqs. (1) and (2), whereas the sec-
sedimentable column so that convection does ond is based on Eqs. (3) and (4); both methods
not occur. The force generating thermal convec- are applied extensively. In the first method,
tion is roughly proportional to the density called velocity sedimentation, comparatively
gradient along the column times the centrifugal large centrifugal fields (e.g., 2 X 10 5 times grav-
field. Since the latter is often large, the temper- ity) are generated in order to produce an easily
ature gradient must be small. In high-speed cen- measured rate of sedimentation v of a molecu-
trifugation the rotor is usually spinning in a lar species. The value of v is usually determined
good vacuum «10- 5 torr) in a temperature- by optical means and s is computed. The latter
controlled instrument, so that heating and is normalized to the rate (per unit field) under
thermal gradients are avoided. In general, bio- standard conditions, such as at 20 0 in a medium
macromolecules, including enzymes, are not in- corrected to the density and viscosity of water
activated by the increase in pressure along the (S20,w). Because s depends on the concentra-
radial path in the fluid at high rotor speeds. Or- tion of the sedimenting molecules its value is
dinarily, these pressures amount at most to a further reduced to that at vanishing concentra-
few hundred atmospheres, which are insuffi- tion (S020,w) by extrapolating s at finite con-
cien t to irreversibly alter the structure of these centrations to that at C = O. In this way differ-
macromolecules. ent macromolecules can be compared on the
A major development in preparative centri- basis of their rates of sedimentation. The slopes
fugation is toward the separation of substances of the regression lines, s versus C, reflect on the
along a density gradient. When a column of shape and other properties of the sedimenting
fluid is prepared such that the density increases molecules and in their interactions with other
smoothly along the sedimenting path and then chemical species present. 2 The high centrifugal
a thin layer of solutes or suspended particles to field quickly produces a small density gradient
be separated is applied to the top of the col- along the radial path which stabilizes the liquid
umn, the solutes tend to segregate into distinct column. As a result, highly precise control of
bands during the centrifugation. For the prepa- the temperature and speed of the rotor is not
ration of the density gradient, nearly inert, mandatory, although it is desirable. The time
denser materials, such as sucrose, cesium chlo- requirements of velocity sedimentation is com-
ride, or osmotically inert silica sols are often paratively small (1-2 hours). Also, if the solu-
used and methods exist for constructing these tion contains more than one sedimenting species,
gradients in the fluid column. This technique the characteristic value of s for each can be
has become a great boon in several kinds of measured by observing certain precautions. 2
research. In some methods the mixture of test Thus, the method is useful for comparing the
materials moves down the column until each effects of treatments on the product output
material has reached a level where it becomes and for following the degree of purification
isodense with the layer of fluid of a particular during isolation procedures. Historically, veloc-
density (isopycnic centrifugation). In this way ity sedimentation serves as a criterion for the
each test material exhibiting a different density degree of homogeneity of a purified macro-
in solution concentrates as a thin band at dif- molecular species. Furthermore, the self asso-
ferent levels along the column because the ciation and interaction of species in a mixture
density gradient, made up of much smaller often can be determined. These and other
molecules or stable sols, is approximately in- aspects yielding important information have
variant relative to the transport of the macro- been studied both theoretically and experi-
molecules or larger entities in the centrifugal mentally on a broad front. 2 ,9,IO The velocity
field. The isolation attempts with live cells, method, however, requires knowledge of D or
viruses, nucleic acids and proteins have met another measurement reflecting f, which are
with remarkable success by the use of the determined less accurately than is s. Hence, the
isopycnic method. 7 For example, the clear value of M is more uncertain than by the sec-
separation of complementary strands of nucleic ond method (below).
acid and the parent double-stranded molecule The second method, known as equilibrium
have been achieved owing to the fact that each sedimentation, is on more firm theoretical
complementary strand, while of the same size, grounds, being based upon equilibrium thermo-
has a slightly different density in solution or dynamics. Also, the value of M, not being de-
partial volume, (3 V/3m)p.8 This quantity is pendent upon a separately determined function
contained in the buoyancy term of Eqs. (2), of f, is more reliable. The equilibrium method
(3), and (4), wherein permits an analysis of the various factors con-
CENTRIFUGE 170

tributing to non-ideal behavior of solutions a = Xl (0)/ Xl (a)


from which much valuable information can be
gained about the total system. Further, the o X2(0) x2(a)
equilibrium method has been employed success-
fully in resolving equilibrium constants for
= exp [(M2 - Mdw 2r2/2RTJ (7)
self-associating and other interacting macro- where Xl and X2 are the concentrations of spe-
molecules in mixed systems. 9,IO,1l In general, cies 1 and 2, respectively. The details of the
high purity of the sedimenting species is re- theory of isotope separation by the gas centri-
quired. Also, this method usually entails rela- fuge have been considered by numerous au-
tively small rotor speeds in order that the test thors,16-19 and the separative performance de-
molecules do not sediment to the bottom but pends upon the internal circulation of the ;ro-
rather distribute at equilibrium along the col- cess gas which has been analyzed in detai1. 2
umn of solution as a function of the radial dis- The operation of a gas centrifuge is illustrated
tance. Thus, much more time is needed to in Fig. 1. The rotor is suspended at the bottom
achieve the relevant data than by the velocity by a low friction bearing and at the top by a
method. However, various procedures have frictionless magnetic bearing. In the case of the
been devised to shorten the time requirement. separation of uranium isotopes, uranium hexa-
Molecules as small as M - 10 3 daltons as well fluoride (UF 6) is introduced into the spinning
as particles of M> 10 7 daltons have been stu- rotor from the stationary central post and re-
died successfully, the latter using fields of only moved from stationary pipes called scoops lo-
a few gravities by use of a magnetically sus- cated at either end of the rotor. In practice gas
pended rotor in order to reduce rotor preces- centrifuges are spun in a vacuum at very high
sion effects to a safe level. l2 For larger species, peripheral speeds, 400 mls or greater, so that
the gravitational field of the earth can be used. 13 the process gas is compressed into a thin strati-
Gas Centrifuge The gas centrifuge has been fied layer adjacent to the cylindrical wall of
used for removing fine particles suspended in the rotor. If we consider the process gas to be a
gases, for the separation of gaseous mixtures, binary mixture of the two isotopic species
and for the separation of isotopes, with the last U23s F 6 and U238F 6, then the heavier molecules
of these being one of the most important large containing U238 will tend to be concentrated
scale applications. The first suggestion of using near the cylinder wall and the lighter molecules
a centrifugal field to separate isotopes was in containing U235 will tend to be concentrated
1919,14 and the first successful demonstration near the axis.
was made in 1934 by J. W. Beams at the Uni- In addition to removing mass, the stationary
versity of Virginia with the isotopes of chlo- scoop at the bottom of the rotor induces a
rine. IS In recent times, large scale gas centrifuge countercurrent flow by removing angular mo-
plants have been built for separating the isotopes mentum and pumping the gas radially inward,
of uranium to produce uranium enriched in the forcing it to travel up near the axis and down
fissionable isotope U23S for fuel in light water along the cylinder wall. In order to prevent an
power reactors. These plants employ tens or opposing circulation, the scoop at the top is
hundreds of thousands of centrifuges depending shielded by a baffle which rotates with the
on the size and peripheral speed of the rotor. rotor and has holes which allow the gas to enter
When a centrifugal field is applied to an ideal the scoop chamber and be removed through the
gas, it sets up a pressure gradient scoop. A similar countercurrent flow can be in-
duced by heating the bottom of the rotor and
dp Mp cooling the top of the rotor, which is analogous
-=--w 2 r (5) to the high altitude winds driven by thermal
dr RT
gradients in the earth's stratified atmosphere.
This countercurrent flow produces a net trans-
where p is the pressure, M the molecular weight port of heavy isotopes to the bottom of the
of the gas, R the gas constant and T the abso- rotor and a net transport of light isotopes to
lute temperature. For the case of an isothermal the top of the rotor, which establishes a con-
centrifuge, Eq. (5) is readily integrated to yield centration gradient in the axial direction.
Therefore, the gas removed by the top scoop is
p(r) =p(O) exp (Mw 2r 2/2RT) (6) enriched in U235 (product) and the gas re-
moved by the bottom scoop is depleted in
which gives the pressure p(r) at any radial posi- U235 (tails). Cascades are formed by centri-
tion r in terms of the pressure at the axis p(O). fuges connected in series to obtain the desired
For a mixture of two ideal gases of molecular enrichment and in parallel to obtain the desired
weights MI and M 2 , each gas would have a pres- throughput.
sure governed by Eq. (6) and the ratio of the The maximum theoretical separative capacity
two equations gives the radial separation under of a gas centrifuge has been shown to be 16 ,18
equilibrium conditions (Le., no gas circulation).
An equilibrium separation factor between the oU(max) = - -
rrZpD (AMw a2 2)2 (8)
two gases is therefore given by 2 2RT
171 CENTRIFUGE

TOP SCOOP
,
---:Jtr1J.t;::::",",<:"--l!- MA GHETIC BEAR IHG
DAMPING ASSEMBLY

ROTATING SAfFLE

MOLECULAR PUMP
CASING

BOTTOM SCOOP

NEEDLE & BOTTOM


DAMPING ASSEMBLY

FIG. 1. Gas centrifuge.

where Z is the length of the rotor, /::;.M is the 2. Schachman, H. K., "Ultracentrifugation in Bio-
difference in molecular weights, and 0 U is the chemistry," Academic Press, New York, 1959.
separative capacity in moles per unit time. 3. Beams, J. W.,Physics Today, 12,20 (1959).
Therefore, long, high speed rotors are desirable 4. Lavanchy, A. C., Keith, F. W., Jr., "Encyclopedia
features of the gas centrifuge process. The peri- of Chemical Technology" (R. E. Kirk and D. F.
pheral speed is limited by the strength of the Othmer, Eds.), 3rd. Ed., Vol. 5, p. 194, Wiley
rotor material. As in the case of a rod that is (1979).
rotated, the rotor will have certain natural fre- 5. Casassa, E. F., and Eisenberg, H., J. Phys. Chern.,
quencies which will be determined by the ma- 65, 427 (1961); Advan. Protein Chern., 19, 287
terials of construction, the rotor length to (1964).
diameter ratio, and the damping of the suspen- 6. Kupke, D. W., in "Physical Principles and Tech-
sion systems. Centrifuges which operate at ro- niques in Protein Chemistry," Part C (S. J. Leach,
tational frequencies which are below the lowest Ed.), pp. 1-75, Academic Press, New York, 1973.
natural flexural frequency of the rotor are 7. Meselson, M., Stahl, F. W., and Vinograd, J.,Proc.
called sub critical centrifuges, and those which Nat. Acad. Sci., U. S., 43, 581 (1957); Wolff, D.
operate at rotational frequencies above the A., Methods in Cell Biology, 10, 85 (1975).
first flexural natural frequency are called super- 8. Meselson, M., and Stahl, F. W., Proc. Nat. Acad.
critical centrifuges. Sci., U. S., 44, 671 (1958).
9. Williams, J. W., Van Holde, K. E., Baldwin, R. L.,
DONALD W. KUPKE and Fujita, H., Chern. Revs., 58 (1958).
RALPH A. LOWRY 10. Fujita, H., "Mathematical Theory of Sedimenta-
HOUSTON G. WOOD, III tion Analysis," Academic Press, New York, 1962.
11. Williams, J. W., "Ultracentrifugation of Macro-
References molecules," Academic Press, New York, 1972.
12. Weber, F. N., Jr., Elton, R. M., Kim, H. G., Rose,
1. Svedberg, T., and Pederson, K. 0., "The Ultra- R. D., Steere, R. L., and Kupke, D. W., Science,
centrifuge," Clarendon Press, Oxford, 1940. 140,1090 (1963).
CENTRIFUGE 172

13. Weber, F. N., Jr., Kupke, D. W., and Beams, J. W., A

l
Science, 139,837 (1963).
14. Lindemann, F. A., and Aston, F. W., Phil. Mag., e'I
,,
\
37,523 (1919). \
\
15. Beams, J. W., Rev. Mod. Phys., 10,245 (1938).
16. Cohen, K., "Theory of Isotope Separation," \
\
McGraw-Hill, New York, 1951. el \
17. Soubbaramayer, "Topics in Applied Physics: Ura- \
\
nium Enrichment" (S. Villani, Ed.), Vol. 35, p. \ P.

/x
183, Springer-Verlag, New York, 1979.
18. Hoglund, R. 1., Shacter, J., and Von Halle, E.,
"Encyclopedia of Chemical Technology" (R. E.
Kirk and D. F. Othmer, Eds.), 3rd ed., Vol. 7,
p. 639, John Wiley & Sons, New York, 1979.
19. Benedict, M., Pigford, L. H., and Levi, H. W., "Nu-
clear Chemical Engineering," 2nd Ed., McGraw- ,- /
Hill, New York, 1981. , ,-
20. Wood, H. G., and Morton, J. B., J. Fluid Mech., e'2
101, 1 (1980).
Cross-references: BROWNIAN MOTION, ISOTOPES,
MOLECULAR WEIGHT, ROTATION-CURVILIN- (a)
EAR MOTION.

Bz
CERENKOV RADIATION
This is a feeble radiation in the visible spectrum,
which occurs when a fast charged particle tra-
verses a dielectric medium at a velocity exceed-
ing the velocity of light in the medium. It is
thus a shock-wave phenomenon, the optical
analog of the "supersonic bang." The radiation
arises from the local and transient polarization
of the medium close to the track of the particle.
Consider, Fig. lea), an arbitrary element S of
the medium to one side of the track AB of a fast (b)
electron, the track defining the z-axis. At a
particular instant of time, when the electron is
at say el , the local polarization vector PI will be A
directed along S el', to a point el' slightly be-
hind el, owing to the retarded fields. As the
particle goes by, the vector P2 will turn over and,
when the electron reaches e 2, will be directed
to a point e2'. The variation of P with time, may
be resolved into radial and axial components
Pp and Pz , as shown in Fig. l(b). Owing to
cylindrical symmetry, this polarization, viewed
at a point distant from the particle, appears as (c)
an elementary dipole lying along the axis z,
Fig. l(c). As the particle plunges through the
medium, radiation arises from the coherent
growth and decay of this sequence of elementary B LIGHT RAY
dipoles. Two essential features of the radiation FIG. 1
become at once apparent. First, since it is only
the Pz component which is important, the field
variation, Fig. l(b), is that of a double a-func- tant to realize that the radiation arises from the
tion. Thus, from Fourier analysis, if the circular medium itself, not directly from the particle.
frequency is w, we will expect a spectrum of the Since the medium is stationary, the intensity
form w . dw, i.e., radiation which is bluer than and angular distributions do not contain the
that from an equi-energy spectrum. Secondly, relativistic factor (mc 2 IE); in this respect, it is
since the radiating element is an axial dipole, essentially different from Bremsstrahlung or
the angular distribution, for this element alone, synchrotron radiation.
e,
will be of the form sin 2 Fig. l(c).It is imp or- The description above applies only to one
173 CERENKOV RADIATION

element along the track. The most characteristic obtain the Cerenkov relation:
feature of Cerenkov radiation, its coherence, is
at once apparent when we now consider an ex- cos () = (l1{3n) (1)
tended region of track. In Fig. 2(a) it is easily
seen that there is only one angle () at which it is From Eq. (1) it is at once evident that there is a
possible to obtain a coherent wave front. If the threshold velocity given by (3 = (lIn), below
velocity of the particle is v (={3e, where e is the which no radiation takes place. At ultrarela-
velocity oflightin vacuo), and n is the refractive tivistic velocities, as (3 ~ I, the Cerenkovangle ()
index of the medium, the particle travels a tends to a maximum value ()(max) = cos- l
distance AB in a time D.t, given by AB = {3e • D.t; (lIn). The polarization vectors E and H of the
in the same time the radiation, emitted at A, radiation which, owing to symmetry, takes place
travels a distance AC = (eln)D.t, from which we over the surface of a cone, are shown in Fig.
2(b).
The radiation yield, from the theory of Frank
A and Tamm, is

-dW = e-
2

dz e2 /3n>1
l [ I] 1- R2n2
tJ
W' dw, ergs/cm path

(2a)
PARTICLE or

[1. _1.] .
TRACK WAVE
DIRECTION
dN = 21T(e 2 ) • sin 2 ()
PI \ dz he A2 Al
\
\
\
\
photons/cm path (2b)
,,
\
\
P2 \
, between wavelength limits Al and A2 (in cm).
\ The spectral distribution is (dW Idw) 0:. w or
\
,, C (dWldA) 0:. A-3, expressed as energy per unit
, \
circular frequency or per unit wavelength, re-
spectively. The radiation has, therefore, a con-
(0) ~~o~"'\ tinuous spectrum toward the blue and ultra-
violet. There is no radiation in the x-ray region,
~~~ for which n < 1. For example, in the case of a
B
fast electron in water, n = 1.33, we find from
Eq. (2b), that when {3 ~ 1 and ()(max) = 41°,
the yield (dNldz) is ~ 200 photons/cm, between
Al and A2 of 3500 and 5500A, respectively,
A Before elaborating on the many applications
of the effect, we will first trace a few of the
theoretical developments following the original
classical theory of Frank and Tamm,2 which ap-
plied to isotropic transparent media, and which
neglected quantum effects. A quantum treat-
ment of the effect, which shortly followed,3 re-
vealed results which deviated slightly from
those obtained by the classical treatment. These
deviations in the threshold condition and in the
angular and intensity distributions of the emitted
light were, however, extremely small, as one
might have expected, since Izw «E, where E
is the kinetic energy of the particle; e.g., fzw
would be -3 eV for blue light, while E would
be typically ~MeV.
There followed, in the forties and fifties, a
spate of theoretical papers, mostly in Russian
journals, extending the general theory to many
specialized cases. These are too numerous to
discuss in detail, but some of them will be men-
tioned in passing. If the medium is anisotropic
(e.g., a crystal), or optically active, the thresh-
FIG. 2 old condition, angular distribution, intensity
CERENKOV RADIA nON 174

distribution, and especially the polarization At this stage let us discuss the situation in
characteristics of the radiation, depend in a which the medium is a gas rather than a liquid
complex way on the relationship of the direc- or solid. Take air for instance. In this case, we
tion of the path of the particle relative to the have n = l.000293, Omax = l.3°, Emin(elec-
orientation of the crystal axes (anisotropic trons) = 21 MeV, and dN/dl = 0.3 photons cm- 1
media), or the axis of gyration (optically active path.
media). These energy yields, both in gases and even in
Much work was also devoted to cases in which solids and liquids, are extremely small, relative
the fast-moving single-point charge is replaced to the ionization energy losses, which amount
by one which also carries a magnetic moment, to -2 MeV gm- 1 cm- 2 for relativistic particles.
and to cases in which the moving particle is However, while the energy yields in Cerenkov
either an electric or magnetic dipole; continu- radiation are small compared with the ioniza-
ing further, work was also carried out on the tion losses, the emitted photons, having ener-
radiation caused by the motion of multipoles gies -2-3 eV, are so numerous, and light de-
and oscillators through the medium. In the tectors such as photomultipliers so efficient,
latter case, below the normal Cerenkov thresh- that it is possible to utilize the effect in many
old, a straightforward Doppler effect is ob- branches of high-energy nuclear and cosmic-ray
served, while above the threshold, a "complex" physics.
Doppler effect occurs, but not truly related to Let us now turn to the experimental and ap-
Cerenkov radiation. plication aspects of the phenomenon of Ceren-
Another case of some interest, one which kov radiation. While the early experiments car-
occurs mainly in the UV part of the spectrum, ried out by Cerenkov himself were based on
is the region of anomalous dispersion, where relatively strong radioactive sources and used
one meets a "complex" index of refraction; in the photographic technique, the first detection
this case the Cerenkov spectrum deviates from of single charged particles in a liquid is attributed
the smooth A-3 law, above, and is broken up to the writer,S as is the first detection of the
into bands which occur in those regions where Cerenkov light from single cosmic-ray particles
the real part of the dielectric constant € > ~-2. in air,9 in conjunction with a colleague.
Most of these theoretical problems mentioned While most of the theoretical work on the
so far, apply to transparent media and to the topic has been carried out in the USSR, it
optical region of the spectrum. • would appear that the West has contributed by
It was early realized, however, that Cerenkov far the larger fraction of the work on the ex-
radiation could also be expected to occur in perimental side.
ferrites (both isotropic and anisotropic), hav- The phenomenon has found considerable ap-
ing a permeability Jl, under the same condi- plication in the fields of high-energy and nuclear
tions of threshold, i.e., n = ~-1, where n in this physics, and in cosmic-ray research. The unique
case is (€Jl)1/2. This can occur in the microwave directional and threshold properties of the radia-
and radio-frequency regions of the spectrum. tion may be used in a number of different ways.
In these cases, referring back to Fig. 1, it is only For example, by velocity selection, it is possi-
necessary that the distance between a point S ble to distinguish between particles of different
and the track of the particle shall be <A, so mass having the same energy, and it is also pos-
that, in the microwave region of the spectrum, sible to measure particle velocities directly, by
the particle need not pass through the "body" measuring O. Other examples may be cited: The
of the medium, but can travel down a tunnel, e 2 dependence, Eq. (2) above, has been used to
eliminating the otherwise accompanying ioniza- determine the charge spectrum of the primary
tion losses. Some enhancement of the radiation, cosmic rays, and transparent lead-loaded glasses
poor in this case, because of the A-3 relation, have been developed as total absorption spec-
can however be achieved, if it is possible to trometers for high-energy -y-rays.
"bunch" the electrons. Loaded waveguides are Reference 6 contains a review of some of the
also potential media for Cerenkov radiation. early types of Cerenkov detector, while Refs.
We mention briefly three other associated ef- 10 and 11 cover somewhat later developments.
fects, that radiation, in limited bands, can occur More recently, developments have taken place
in a plasma, that there are various diffraction in very sophisticated detectors, especially ring-
and scattering effects,4 and that the phenom- imaging systems based on gaseous media which
enon of Cerenkov radiation has a certain rela- have proved to be especially valuable in high-
tionship with that of transition radiation. 5 en~rgy particle physics. 12,13
Much of what we have discussed above is of Cerenkov radiation in the atmosphere pro-
mostly academic interest as far as applications duced by cosmic-ray showers from ultra-high-
are concerned, though these topics lay the energy primary particles 14 has become a most
foundations for the many applications. valuable tool in high-energy -y-ray astronomy.
While a summary of the results of these This in fact provides the only tool, and it is
theoretical studies will be found in Ref. 6, the ground-based, for searching for point sources of
most comprehensive review of these aspects of -y-rays in the energy region 10 1 1-10 13 eV.1 S ,16
the phenomenon will be found in the work of A further application of Cerenkov radiation is
Bolotovskii. 7 the search for neutrinos of cosmic origin in sea-
175 CHEMICAL KINETICS

water, in a deep ocean trench off the island of 16. Turver, K. E., and Weekes, T. C., Phil. Trans. Roy.
Hawaii, a project known as DUMAND (Deep Soc. London A301, 615 (1981).
Underwater Muon and Neutrino Detector),17 17. Roberts, A., Ed., "Proc. 1978 DUMAND Summer
the volume of water involved in this detector Workshop," DUMAND Scripps. Inst. Oceanogr.,
being -1 km 3. A somewhat similar type of ex- 1978.
periment is likewise under investiagation in at- 18. News Section, "Search and Discovery," PhysiCS
tempt to observe the decay of the proton. 1S Today 33, 17 (1980).
Three further aspects of the phenomenon will 19. Anderson, W., and Belcher, E. H., Brit. J. Appl.
also be made in passing: (1) the application of Phys. 5(2), 53 (1954).
the effect to produce extremely weak standard 20. Marshall, 1., Astrophys. J. 124,469 (1956).
light sources for testing astronomical equip- 21. Veksler, V. I., CERN Symposium, Geneva, 1, 80
ment;19 (2) the interpretation of a particular (1956).
type of radio e!llission from sunspots;2° and 22. Fazio, G. G., Jelley, J. V., and Charm an, W. N.
(3) the inverse Cerenkov effect,21 proposed as Nature 228, 260 (1970).
a way of accelerating charged particles. 23. Moss, S. H., and Smith, K. C., Int. J. Radiat. Bioi.
By contrast, there have been two applications 38(3),323 (1980).
in biological areas. One of these is that flashes
of light observed by astronauts involved in the Cross-references: COHERENCE, DIELECTRIC THE-
NASA Apollo program during periods of trans- ORY, DIPOLE MOMENTS, FOURIER ANALYSIS.
lunar flight have been attributed, partially or
possibly totally, to Cerenkov radiation pro-
duced by primary cosmic rays of relatively high
nuclear charge traversing the retinas of the eyes CHEMICAL KINETICS
of the astronauts on these missions. 22 Besides offering useful rate equations to de-
And finally we mention that there is evi- scribe the speeds of chemical reactions, chemical
dence 23 that Cerenkov radiation may be a kinetics attempts to describe exactly how each
contributing or even dominant effect in the re- reaction occurs. It does so in terms of one or
pair mechanism by photoreactivation of the more elementary steps, which are reactions
DNA in the bacterium Esherichia coli subse- having no observable intermediate chemical
quent to damage by ionizing radiation. species. The ultimate goal is a theory interre-
lating energy, structure, and time for these single
J. V. JELLEY chemical events. Many of the ideas, developed
since 1850 when the first quantitative rate
study was made, have been extended by analogy
References to explain electron-hole processes in semicon-
1. Cerenkov, P. A., C. R. A cad. Sci (USSR) 2,451 ductors, various solid-state processes, and ther-
(1934). monuclear reactions.
2. Frank, I. M., and Tamm, I., C. R. A cad. Sci. Reaction rates depend on the nature of the
(USSR) 14,109 (1937). reactants, temperature, pressure, kind and in-
3. Ginsburg, V. 1., Zh. Fiz. SSSR 2, 441 (1940). tensity of radiation, nature of catalyst or sol-
4. Dedrick, K. G.,Phys. Rev. 87, 891 (1952). vent, and many other factors. The extent of a
5. Frank, I. M., and Ginsburg, V., Zh. Fiz. SSSR, reaction can be followed by withdrawal of
9(5),353 (1945). samples for early chemical analysis. It is more
6. Jelley, J. V., "Cerenkov Radiation and its Appli- common, however, to analyze the main reaction
cations," Pergamon Press, London, 1958. mixture continuously and nondestructively by
7. Bolotvskii, B. M., Usp. Fiz. Nauk 75(2), 295 spectroscopic means or by observing physical
(1961). properties like density, electrical conductivity,
8. Jelley, J. V., Proc. Phys. Soc. A64, 82 (1951). optical activity, dielectric constant, and so on.
9. Barclay, F. R., and Jelley, J. V., Nuovo Cimento, The rate v of a reaction aA + bB ~ eE + iF is
Series 10, 2(1), 27 (1955). best related to the rate of change of the con-
10. Hutchinson, G. W., Prog. Nucl. Phys. 8, 195 centrations [A) and [B) of reactants by re-
(1960). writing the equation as 0 = eE + iF - aA - bB.
11. Fabjan, C. W., and Fischer, H. G., Rep. Prog. Phys. If the change is an elementary step, its rate is
43(8),1003 (1980).
12. Gilmore, R. S., Malos, J., Bardsley, D. J., Lovett, 1 d[A) 1 d[B) 1 d[E)
v=- - - - = - - - - = + - - - =
F. A., Melat, J. P., Tapper, R. J., Giddings, D. I., a dt b dt e dt
Lintern, 1., Morris, J. A. G., Sharp, P. H., and
Wroath, P. D., Nuc. Instrum. and Methods, 157,
507 (1978). + ld[F) =k[A)a[B]b
13. Ekelot, T., Sequinot, J., TocqueviUe, J., and f dt
Ypsilantis, T. Physica Scripta 23, 718 (1981). Here k is a rate constant independent of concen-
14. Jelley, J. V., Phil. Trans. Roy. Soc. London. A301, tration. If the step is not elementary, its rate is
611 (1981). often presumed to be expressible as propor-
15. Porter, N. A., and Weekes, T. C., Smithsonian tional to certain empirically observed powers
Astrophys. Obs. Spec. Rep. No. 381 (1978). m, n, ... of the concentrations of species pre-
CHEMICAL KINETICS 176

sent in the reaction vessel. The values of m, lizing collision (0 = A + M - M - A*, with k 2 )
n, ... need not be integers and cannot be pre- or internal change (0 = At - A *, with kd that
dicted from the balanced chemical equation if leads to the activated state At. The reaction is
the step is not elementary. The over-all order of called unimolecular because the activated com-
a reaction is the sum of these exponents, and plex At contains only one reactant molecule.
the order with respect to a particular species is The rate of decomposition of A at any instant
its own exponent. The order of a reaction can is v = k2' [M) [A) - k2 [M) [A *), and the rate of
be determined in several ways. In the method of decomposition of A* isv*=k l [A*)- k 2 '[M)
initial rates, concentrations of all but one reac- [A) + k 2 [M) [A*). Since the v's are time deriv-
tant are held constant, if possible at great values, atives, these are simultaneous differential equa-
and v is observed at the start of reaction for tions.
several values of [A). The order with respect to It is generally impossible to solve the simul-
A is the slope of a graph of log v vs. log [A). taneous differential equations that describe a
The first major reaction rate theory was mechanism. The least restrictive and most useful
founded' on the kinetic theory and classical simplification is generally the steady-state ap-
mechanics. It still is very useful when reactants proximation, wherein the concentration of a
approach each other in an attractive potential species oflow concentration is assumed to reach
field (e.g., ions), when the distribution of ener- an effectively constant value after a certain
gies is nonequilibrium (e.g., electrical discharge reasonable time (induction period) has passed.
in gases), or when the molecules involved are If [A *) reaches a steady-state concentration,
tremendous in size. In general, however, the col- v* = 0 and the v* differential equation becomes
lision theory suffers from its inability to satis- an algebraic one for [A *) . Moreover, the rate of
factorily predict effective cross sections (mo- decomposition of A then is v = k 1 [A *) =
lecular sizes for reaction) or the effect of iso- klk2'[M)[A](kl + k 2 [M)fl. This rate equa-
topic substitution. tion becomes second order if k2 [M) ~ k l ; this
Most modern theories suppose the existence occurs at low pressure or when A * changes
of an undetectable transition state of high rapidly into At. If A is a simple molecule of
energy and fleeting existence. The configuration few atoms, the activation energy easily becomes
of this activated complex lies, as it were, atop effective in one bond to cause decay. On the
the mountain pass of lowest height between other hand, the order becomes first if k2 [M)~
energy-valleys of reactants and products. It is k 1 ; this corresponds to high pressure or an A
mechanically stable to all vibrations except the with many degrees of freedom to accommodate
one that describes the progress of reaction over the activation energy.
the saddle-point at the pass. This one motion is A generally more restrictive way to simplify
assigned a very low (sometimes imaginary) fre- the mathematics of a sequence of reactions is to
quency, but otherwise the activated complex is assume that one step is so much slower than
just another molecule. It is supposed to be in the others that it alone limits the rate. All steps
dynamic equilibrium with reactants and its free besides the rate-limiting one are assumed to be
energy can be calculated from its partition func- at equilibrium in this approximation. If, in uni-
tion by the usual methods of STATISTICAL molecular decomposition, the rate-limiting step
MECHANICS. The rate constant k n then takes is A* ""* At, then the rate is k 1 [A *). The reac-
the form (kT/h) exp(-AA/kT) where k is the tion is then first order in A because v = 0 for the
gas constant per molecule, T is the absolute equilibrium A + M = M + A *. If, however, the
temperature, h is Planck's constant, and AA is rate-limiting step is A + M ""* A * + M, then the
the increase in free energy on going from reac- rate is k 2'[M) [A).
tants to activated state. This resembles the well- Bimolecular reactions, wherein the activated
known relation kn = s exp(-E/kT) discovered state consists of two reactant species, are very
empirically in 1889 by Arrhenius. In it, sand E common. The rates of the fastest of these are
are approximately independent of T, and E is limited by the rates of diffusion of reactants
called the activation energy. and have rate constants of the order of 10 10
A recent theory of promise treats reactants as liter mole-I sec-I in aqueous solution. Typical
a wave packet that gradually spreads in time. examples are the aqueous neutralizations NH/
The rate of reaction is taken to be the probabil- + H 2 0 ""* NH3 + H30+ and H30+ + F- ""* HF +
ity that the packet will be found in a configur- H2 O. Typical bimolecular gaseous reactions are
ation that is indentified with products. the linear chain reactions X + H2~HX + Hand
Thermal decomposition of an initially pure H + X2 ""*HX + X where X is H, D, Cl, Br, or I.
gas is seemingly a simple change, yet its order A nice way to initiate these reactions of H2
often changes gradually from first to second as and X 2 is by a photon: X2 + photon ~ X + X.
the pressure falls. Moreover, there is always the Thermal dissociation of X2 is also sufficient to
question why like molecules do not all decay at start reaction.
once. The answers lie in understanding the Carbon compounds undergo many reactions,
mechanism, which is presently taken to be col- but most of them can be classified into a few
lision with any other molecule M (0 = A* + types. Nucleophilic substitution, wherein a basic
M - M - A with rate constant k2') to yield an reactant replaces another initially on C by a net
energized molecule A * that may suffer a stabi- reaction X + RY ~ XR + Y, may be first order
177 CHEMICAL KINETICS

in RY alone or in both X and RY. If first order Much industrial research is devoted to finding
in just RY, the mechanism is labeled SN I and catalysts which speed (or slow) specific desired
the rate-limiting step is conceived as production (or undesired) reactions. Academic research on
of the active carbonium ion R + by the process rates tends, however, to search for mechanisms
RY -+ R+ + Y-. If second order (SN2), the rate- and to propose models useful for prediction.
limiting step is considered to be production of Very fast reactions are of particular interest and
the bimolecular activated complex X ... R ... they offer several kinds of challenge to experi-
Y. Elimination reactions typically yield a double menters. If a decrease in temperature does not
bond with loss of part of the organic reactant affect the mechanism, a very fast reaction can
RCH 2CH 2Y. If first order in organic reactant perhaps be slowed to fit the response time of
(type E 1 ), the rate-limiting step is said to be the analytical devices in use. Typically for anal-
production of the carbonium ion RCH 2 CH 2+, ysis in real time one uses visible and ultraviolet
which then swiftly eliminates H+ to become spectrometers or a mass spectrometer. If a
RCH == CH 2 . If first order in both base and reaction is fast, the reactants must be mixed in
organic reactant (type E2)' the rate-limiting a time very short relative to the time of the
step is thought of as production of a bimolecular reaction; for times less than a millisecond, reac-
complex which eliminates H+ and Y - almost tants are generally mixed before reaction, which
simultaneously. A fifth class of organic reaction is later initiated by a pulse of pressure or radia-
(SNi) describes how an electrophilic reagent tion (e.g., flash photolysis) or introduction of a
(e.g., NOt in mixed HN0 3 and H 2S0 4 or Br+ catalyst. Or, instead of working in real time, a
in Br2 with FeBr3) may attack an aromatic ring reaction may be maintained in a steady state
like that in benzene to form a positively charged (e.g., a flame) with concentrations noted as a
intermediate that soon loses H+ to a base in the function of position, with of course some direct
solution. link between position and time. For reactions
A catalyst is a species that changes the rate of that occur in a time of the order of 10- 3 sec,
a reaction and yet is regenerated by that reaction typical experimental methods are: nuclear mag-
so that it seems to be unchanged in the net netic resonance, sudden changes in pressure or
reaction. Catalysts do not affect the equilibrium temperature, flash photolysis, shock waves, and
state but they do lower the activation energy various methods of controlling flow. For reac-
and sometimes may provide a needed steric tions done in 10- 6 sec, one has: electric field
arrangement. The most general mechanism of displacement, ultrasonic methods, electron
catalysis is A + C I -+ D + C 2 followed by B + paramagnetic resonance, dielectric relaxation,
C 2-+ E + C I to give the net change A + B -+ and pulsed radiolysis. A few special techniques
D + E. Many so-called catalysts of industry need (e.g., fluorescence quenching) are useful at
regeneration (C 2-+C I) by a reaction other than 10- 9 sec. In liquid solution, the rate of a fast
that catalyzed. For example, silica-alumina reaction may be independent of the nature of
cracking catalysts used in making gasoline must the reactants and become, instead, a function
be cyclically burned free of carbon deposited of liquid viscosity, with a temperature coeffi-
during cracking. A catalyzed reaction is almost cient appropriate to viscosity's change with
always first order in catalyst concentration (or temperature. Rate of reaction is thus controlled
surface area) and usually has an order that is by diffusion; for this, the activation energy of
less than the true order by unity. the chemical change seldom exceeds about 20
Enzymes are biological catalysts. Many act by kJ per mole.
the well-known Michaelis-Menten mechanism Very fast gaseous reactions and their mecha-
E + S ~ C -+ P + E, where enzyme E attacks nisms are often studied by crossing molecular
substrate S with a rate kA[E] [S] to form a beams in a vacuum. A heated reservoir with
complex C that may yield products P at a rate small hole allows molecules to effuse (ca. 10- 7
kp[C] or may revert to S with rate kR[C]. The torr). A beam is defined by slits or orifices, and
rate of disappearance of S is Us == k A [E] [S] - a selection of velocities is then made by ro-
k R [C] and the rate of appearance of C or dis- tating sectors in the beam, which at the start
contains the usual Boltzmann distribution of
appearance of E is UE ==kA[E][S]- kR[C]- velocities. For easier detection, a much more
kp[C]. The total concentration of enzyme in intense beam of high-energy molecules can be
the system is [E] 0 == [C] + [E]. In the steady made by fluid flow through a supersonic noz-
state, uE == 0 == kA([E] 0 - [C] )[S] - (kR + kp) zle, but extremely rapid pumping is needed to
[C] so that maintain the necessary vacuum. The simplest
collision in crossed molecular beams is "elas-
_ k [C] _ kpkA[S] [E] 0
Us - p - tic," with momentum and translation energy
(kR +kp)+kA[S] conserved by whole molecules. Inelastic col-
lision modifies their internal energies. Reactive
The rate of disappearance of S is always first scattering of beam by beam involves a transfer
order in total enzyme but may change from first of atoms during an encounter. To link the re-
to zero order in S as [S] increases. The maxi- sults of such scattering to a bulk reaction rate,
mum rate at which E can act occurs when [S] is one must average properly over angles, ener-
great and Us == kp[E]o. gies, and so on. While typical reactive collisons
CHEMICAL KINETICS 178

in gases occur at distances of the order of (or of the field and that remains meaningful over
less than) 100 A, certain other reactions involv- an extended period of time. In particular, the
ing a preliminary transfer of an electron may distinction between chemical physics and phys-
occur at distances of the order of 200 A. These ical chemistry is rather arbitrary, depending
last are rather poetically called "harpooning." more on the background and training of the
individual conducting the work than on the
WILLIAM F. SHEEHAN nature of the work itself. Many universities
have a curriculum option that requires formal
References training in both chemistry and physics depart-
ments. The graduates of such programs tend to
Most physical chemistry textbooks contain introduc- think of themselves as chemical physicists;
tions to chemical kinetics. Some of their many others who have a primarily chemistry back-
authors are: R. A. Alberty and F. Daniels, G. M. ground, may identify themselves as physical
Barrow, G. W. Castellan, S. Glasstone, 1. N. Levine, chemists even though their research is very
E. A. Moelwyn-Hughes, W. J. Moore, R. M. Rosen- similar.
berg, and W. F. Sheehan. Chemical physics did not emerge as an iden-
Ausloos, P. (Ed.), "Kinetics of Ion-Molecule Reac- tifiable research field until well into the twen-
tions," New York, Plenum Press, 1979. tieth century. The atomistic approach to chem-
Avery, H. E., "Basic Reaction Kinetics and Mecha- istry was introduced by Dalton and others at
nisms," London, The Macmillan Press, 1974. the start of the nineteenth century, and impor-
Benson, S. W., "The Foundations of Chemical Kinet- tant contributions were made by physicists such
ics," New York, McGraw-Hill Book Co., 1960. as Avogadro. However, the lack of a detailed
Bradley, J. N., "Fast Reactions," Oxford, Clarendon model for the atom and molecule made it im-
Press, 1975. possible to relate the microscopic structure of
Erdey-Gruz, T., "Kinetics of Electrode Processes," matter to macroscopic behavior. Thus, physics
New York, Wiley-Interscience, 1972. and chemistry pursued their own evolutionary
Glasstone, S., Laidler, K. J., and Eyring, H., "The development through most of the nineteenth
Theory of Rate Processes," New York, McGraw-Hill century, with rather limited interaction.
Book Co., 1941. These paths began to intersect at the turn of
Hinshelwood, C. N., "The Kinetics of Chemical the century: J.Willard Gibbs, a professor of
Change," London, Oxford University Press, 1940. mathematical physics, developed a powerful
Mulcahy, M. F. R., "Gas Kinetics," New York, John tool for chemists in the form of statistical me-
Wiley & Sons, 1973. chanics; Peter Debye created detailed physical
Roberts, D. V., "Enzyme Kinetics," London, Cam- models that explained the properties of elec-
bridge Univ. Press, 1977. trolyte solutions in terms of forces between
Slater, N. B., "Theory of Unimolecular Reactions," electrically charged particles; William and
Ithaca, NY, Cornell Univ. Press, 1959. Lawrence Bragg developed x-ray techniques for
Wilkinson, F., "Chemical Kinetics and Reaction Mecha- determining the structure of crystals, which
nism," New York, Van Nostrand Reinhold Co., opened the chemistry of solids to detailed in-
1980. vestigation for the first time; Niels Bohr recog-
There are several timely review articles in various vol- nized that the theory which explained atomic
umes of "Advances in Chemical Physics," 1. Prigo- spectra should also permit a physical expla~a­
gine, Ed., London and New York, John Wiley & tion of the periodic table of the elements. Wlth
Sons, 1958. the advent of quantum mechanics in the 1925-
Values of rate constants for specific reactions are 30 period, physics and chemistry were drawn so
listed in National Bureau of Standards Circular 510, closely together that there was no longer a de-
its supplements, and in NBS Monograph 34. For finable boundary. Quantum mechanics pro-
critically evaluated rate constants and mechanisms, vided for the first time a quantitative theory
see S. W. Benson and H. E. O'Neal, "Kinetic Data on capable of explaining the forces between atoms
Gas Phase Unimolecular Reactions," NSRDS-NBS that lead to the formation of molecules. During
21 (Feb. 1970), Washington, D.C., U.S. Government a remarkably prolific period the foundations
Printing Office. were laid for a detailed understanding of mo-
lecular structure, the kinetics of chemical reac-
Cross-references: CHEMISTRY, PHYSICAL CHEM· tions, the relation of thermodynamic properties
ISTRY, STATISTICAL MECHANICS. to structure, the interaction of light with mat-
ter, and many other subjects of long-standing
CHEMICAL PHYSICS importance in chemistry.
Another important factor was the develop-
Chemical Physics is a term used to describe a ment by physicists of instruments that allowed
rather broad set of research activities that fall chemical phenomena to be probed in new ways.
on the borderline between the traditional dis- Infrared and mass spectrometers were used suc-
ciplines of chemistry and physics. As with all cessfully by chemists in the 1930s, and the
interdisciplinary endeavors, it is difficult to postwar period saw the introduction of more
arrive at a precise definition of chemical physics and more sophisticated instrumentation-micro-
that is generally acceptable to the practioners wave, nuclear magnetic resonance, electron spin
179 CHEMISTRY

resonance, and Mossbauer spectrometers, to Division of Chemical Physics of the American


name only a few. More recently, the laser has Physical Society. This division has a member-
opened many new avenues for chemical re- ship of about 2,800, which gives a rough idea
search. In a pattern which has been repeated of the number of scientists in the United States
many times, a new instrumental technique is who regard themselves as chemical physicists.
developed by physicists, applied first to prob- The areas of special interest were ranked as
lems of current interest in physics (usually in- follows in the survey:
volving atoms or simple molecules), and then
adapted by chemists to study larger molecules Spectroscopy and light scattering
and more complex chemical phenomena. Molecular interactions and reactions
The birth of chemical physics as an identi- Photochemistry
fiable research field can perhaps be dated from Surfaces
the establishment of The Journal of Chemical Statistical mechanics and thermodynamics
Physics in 1933. The driving force for this Scattering processes
event was the rapid increase in research papers, Polymers
resulting from the new theoretical and experi- Here the importance of surface science is more
mental tools described above, which could not evident.
be accommodated by the traditional journals In recent years the overlap between chemical
of either physics or chemistry. More than any physics and solid state (condensed matter)
other single measure, the contents of this jour- physics has become stronger. A number of ideas
nal have defined the scope of chemical physics. derived from the chemist's traditional approach
More recent journals and publication series to bonding have been introduced into solid
using the term chemical physics in the title state physics. Likewise, the distinction between
have tended to adopt the same scope and em- atomic physics, which in the past focused on
phasis as The Journal of Chemical Physics. interactions between electrons, photons, and
It is therefore instructive to analyze the con- atoms, and the part of chemical physics that
tents of The Journal of Chemical Physics. The deals with molecular interactions and reactions
articles appearing in the journal are currently has become blurred. There is also a trend toward
sorted into five categories, which are listed be- a closer relation between chemical physics and
low along with the approximate distribution of biophysics.
papers in early 1982: In summary, chemical physics is a research
Spectroscopy and light scattering 35% area of somewhat diffuse and fluid boundaries
Molecular interactions and reactions, which is characterized by use of the theoretical
and experimental tools from physics to further
scattering, photochemistry 22% the understanding of phenomena of interest to
Statistical mechanics and chemistry. It bridges the gap between two his-
thermodynamics 20% torically different approaches to natural science,
Quantum chemistry, theoretical whose distinction is gradually disappearing.
electronic and molecular structure 18%
Polymers, surfaces, and general chemical DAVID R. LIDE, JR.
physics 5%
Titles of papers on spectroscopy show emphasis References
on topics such as laser-induced fluorescence, The Journal of Chemical Physics (American Institute
double resonance, and other techniques that of Physics, New York).
permit measurement of the fine details of Chemical Physics (North Holland Publishing Company,
molecular structure. Studies of energy distri- Amsterdam).
bution in molecules, ionization processes, and Chemical Physics Letters (North Holland Publishing
reaction kinetics as determined by molecular Company, Amsterdam).
beam experiments are prevalent in the second Advances in Chemical Physics (John Wiley & Sons,
category. The application of statistical me- New York).
chanics to the understanding of phase transi-
tions and critical phenomena is now an active
topic. Papers in quantum chemistry range from Cross-references: CHEMISTRY ; MATHEMATICAL
diatomic molecules to complex aromatics but PHYSICS; PHYSICAL CHEMISTRY; PHYSICS;
stress the calculation of the full range of molec- THEORETICAL PHYSICS.
ular properties, not only energy levels and sta-
bility. The relatively small number of papers
classified as surface physics is somewhat mis- CHEMISTRY
leading, in view of the growing interest of
chemical physicists in this area of research. Chemistry is the branch of natural science that
The explanation probably lies in the existence is concerned with the nature, composition, and
of a number of competing journals. structure of matter. The chemist attempts to
Another measure of the topics currently em- provide a consistent model of the macroscopic
phasized in chemical physics research is pro- behavior of materials in molecular terms. Be-
vided by a 1981 survey of members of the cause of the focus on the behavior of matter,
CHEMISTRY 180

chemistry is in a central position in the array of Boyle and Antoine Lavoisier, two of the earliest
natural sciences and has strong interdisciplinary proponents of exact quantitative experimenta-
ties to both physics and biology. In fact the tion, exerted profound influences on late eigh-
core of conceptual models of chemistry is fa- teenth and early nineteenth century chemistry.
miliar to the physicist and includes primarily Their techniques were mainly physical measure-
quantum mechanics and thermodynamics. ments.
The drive to understand matter at the molec- Around the middle of the nineteenth century,
ular level has lead the chemist to follow the age Friedrich Kekule and Archibald Couper inde-
old goal of transformation of matter both at pendently proposed a system for writing graphic
the molecular and, more recently, the atomic formulas for chemical compounds. Their con-
level. In this field man is able to synthesize new cepts were based apparently on the notion that
combinations of elements, substances and ma- physical forces held together the atoms in com-
terials. Actually, chemists have created in the pounds. About the same time, general accep-
laboratory several chemical elements (43, 61, tance was finally accorded Amedeo Avogadro's
87, and 93 to 105 inclusive) that apparently are hypothesis (proposed about 50 years earlier)
not normally present in or on our planet. Even which stated that equal volumes of gases at the
a cursory examination of the concept of iso- same temperature and pressure contain equal
merism (the existence of two or more different numbers of molecules. The importance of the
compounds that have identical compositions) establishment of a direct link between observ-
yields the conclusion that literally trillions and able and molecular scale behavior contained in
trillions of different compounds of carbon Avogadro's hypothesis cannot be overestimated.
could be synthesized. In fact, there are possible In addition to this interest in conceptual mod-
at least 62 X 10 12 different compounds (called els, considerable activity revolved about phe-
isomers) all having an identical composition nomenological models. These studies culminated
indicated by the formula C4o Hs2 . This specific in the development of the periodic table of the
combination of atoms is just one of millions of elements by Mendeleev. In this scheme regulari-
other possible combinations, and some of these ties in the chemical and physical behavior are
would have thousands of isomers. The current organized and provided a powerful predictive
chemical literature contains documented evi- tool for the discovery of new elements, com-
dence for either the existence, or the synthesis pounds, and physical properties.
by man, of well over a million different com- During the period 1860-1920, the first great
pounds of carbon. And carbon is only one of period of synthesis of new compounds occurred.
the 105 known chemical elements. Both organic and to a lesser degree inorganic
As with most areas of science, the explosive chemistry flourished, and hundreds of com-
growth of chemical knowledge in the twentieth pounds were prepared, correctly analyzed and
century, particularly since the Second World identified, and logically classified as to struc-
War, has lead to the development of several sub- ture and reactivity. And yet, throughout this
disciplines of chemistry. The four main areas period, chemists had no knowledge whatso-
which are generally recognized are analytical, ever about the structure and composition of
inorganic, organic, and physical chemistry. In- atoms. Also they had a rather shallow concep-
terfacial areas bridging the natural sciences tion of chemical bonding and molecular geom-
range from medicinal and biochemistry to geo- etry. The growth of organic chemistry during
chemistry, astrochemistry, theoretical chemis- the nineteenth century is an epic example of
try, and chemical physics. the productivity of sound inductive and deduc-
A unique and important factor associated tive reasoning.
with chemistry is the existence of a strong in- As was the case with physics, the quantum
dustrial connection. The chemical industry is revolution in the early decades of the twentieth
one of the major components in the economy century had a profound effect on the direction
of any developed nation. The continually in- and development of modern chemistry. Al-
creasing need for commodities such as struc- though the influence of quantum theory is
tural materials, fabrics, fertilizers, pesticides, pervasive throughout chemistry there are three
and pharmaceuticals will enhance the chemical particularity significant cases which bear men-
industry. The consumption of sulfuric acid, for tioning. The organization of elements in the
instance, has for several decades been consid- periodic table is directly correlated to elec-
ered a significant indication of general industrial tronic configurations hence establishing elec-
activity. This strong academic/industrial link is tronic structure as the basis of chemical be-
shown in the development of various subdisci- havior. Due to the pioneering efforts of Lewis,
plines such as polymer and industrial chemistry. Pauling, and others, quantum theory provides a
Historical Background The history of al- basis for both a qualitative and quantitative
chemy is a fascinating record of man's earliest understanding of the chemical bond. The ex-
investigations of matter. However, the mixture plosive growth of various forms of atomic and
of rational and mystic approaches hampered molecular spectroscopy which accompanied the
significant progress. Consequently, it was not development of quantum theory has added a
until the rationalist eighteenth century that the significant battery of methodologies which are
serious origins of chemistry developed. Robert available for modern chemical research. These
181 CHEMISTRY

techniques have proven invaluable both in the sit ion of matter resides in the branch of chem-
measurement of amounts of materials present istry called analytical chemistry. Any means of
in a given sample and in the elucidation of determining molecular structure is often called
molecular structure. Types of spectroscopic an analytical technique. Until relatively re-
measurements include: atomic absorption/emis- cently most analyses were performed by using
sion, ultraviolet, photoelectron, fluorescence, specific chemical reactions and techniques in
infrared, microwave, nuclear magnetic reso- liquid solutions. Recent advances in spectros-
nance, electron spin resonance, nuclear quad- copy and other physical techniques have yielded
rupole resonance, and Mossbauer. The last men- a variety of instruments that greatly facilitate
tioned technique is an interesting example of chemical analyses. The ability of the analytical
the rapidity of information transfer between chemist to measure trace quantities of ma-
physics and chemistry in that within a few terials has played a central role in the develop-
years after Mossbauer's discovery of the effect, ment of awareness of the hazards of environ-
it had become a significant tool for investiga- ment pollution.
tion of the geometric and electronic structure Biochemistry Investigations of the chemical
of iron and tin compounds. Other, nonspectro- phenomena in, and the constituent compounds
scopic, physical methods are also of importance. of, living organisms are performed mainly by
In particular, one can cite mass spectrometry, biochemists. Because every chemical reaction in
which is of value in the determination of molec- any living organism involves compounds of car-
ular structure and the study of isotope masses bon, biochemistry is in part the application of
and diffraction methods (x-ray and electron) organic chemistry to investigations of vital sys-
for the determination of geometrical structure. tems. However, physical and analytical and
During and following World War II there was more recently inorganic chemistry are essential
considerable investigation of the transuranic to biochemistry.
elements and of technetium, francium, and Inorganic Chemistry This branch of chemis-
promethium in relation to their production try is mainly that of all forms of noncarbon
and use. All these radioactive elements were compounds. Although its potential scope is
produced by extraordinary nuclear reactions, huge, it has attracted, until recently, much less
whereas all ordinary chemical reactions involve attention than have organic and physical chem-
only the electrons outside the nucleus. Ordi- istry. The availability of sophisticated physical
nary chemical reactions are, in a sense, extra- methods have made detailed knowledge of inor-
nuclear. ganic materials possible. The very breath of this
The utilization of chromatographic techniques subdiscipline has lead to the existence of sev-
such as column, paper, thin-layer, vapor phase eral interdisciplinary activities. The most im-
(gas), and high performance liquid chromatog- portant of these are organometallic chemistry,
raphy has allowed chemists to effect the sepa- in which one studies carbon derivatives of the
ration of the components of complex mixtures. elements, and bioinorganic chemistry, which
The use of ion exchange in the resolution of focuses on the role of inorganic species, espe-
ionic mixtures and in the removal of ions from cially metal ions, in biological systems.
solutions has been extended during the past Organic Chemistry This division of chemis-
twenty years. try is essentially that of the compounds of car-
The applications of the molecular orbital and bon. Originally organic chemistry was confined
ligand-field concepts have been highly success- to materials in or from living organisms, prob-
ful in the description of the bonding in and ably because nearly every compound either iso-
geometry of complex molecules. These concepts lated from or produced by a living organism is
have firm physical bases. a compound of carbon. Although there are
Chemists, like most scientists, take a special more compounds of hydrogen than of any other
delight in doing experiments that are consid- element, the compounds of carbon are next in
ered to be theoretically impossible; conse- line. The property of catenation (ability of
quently the discovery in 1962 by Bartlett of a identical atoms to bond together) is exhibited
stable compound of xenon, Xe+(PtF 6)- , was an most extensively by carbon. The hundreds of
epic event. Since the discovery of the noble different carbon-atom skeletons of the thou-
gases during the period 1894-1900, most chem- sands of known organic compounds attest this
ists had assumed or believed that these elements fact. All foods, nearly every fabric, every ordi-
were chemically inert. Therefore, Bartlett's dis- nary commercial fuel, and almost all pharma-
covery prompted much activity to produce ceuticals are organic in the sense that they con-
other noble gas compounds. tain compounds of carbon. The great tradition
The importance of chemistry in modern biol- of syntheses of new compounds has developed
ogy, and especially in medicine, has been reem- to an art form in modern organic chemistry.
phasized by knowledge of the role of the nucleic One now can prepare structures ranging from
acids, such as DNA and RNA, in the genetic highly symmetrical geometric forms to huge,
scheme. The increased use of chemotherapy in complex naturally occurring molecules such as
medicine is common knowledge. vitamin B-12.
Analytical Chemistry The qualitative and Physical ChemIStry The quantitative measure-
quantiative determination of elemental compo- ments of the properties and behavior of the
CHEMISTRY 182

elements and their compounds are the major CIRCUITRY


concern of the physical chemist. Nearly every
technique and concept has been adopted from Basic Concepts An electric circuit may be de-
physics. The development of new chemical fined as a path or group of interconnected paths
concepts follows logical consideration of quan- capable of carrying an electric current or an
titative data. The major branches of physical arrangement of one or more closed paths to
c~em~stry are spectroscopy, nuclear chemistry,
accommodate an electron flow. Electrons bear
kmetIcs, thermodynamics, quantum and statis- electric charges, and the unit of electric charge
tical mechanics, and solution and surface chem- is the coulomb, defined as the quantity of
istry. Physical organic and inorganic chemistry charge possessed by 6.24 X 10 18 electrons, and
have gained prominence during the past 25 the electrical through variable is the current.
years. The availability of modern high speed The unit of current is the ampere, defined as
computers has allowed for the application of the flow of one coulomb (6.24 X 10 18 elec-
the methods of quantum chemistry to mole- trons) per second (1 ampere = 1 coulomb/sec).
cules of increasing size and complexity. Calcu- The passage of a current through a circuit ele-
lations are usually divided into ab initio and ment yields a change of potential designated as
s~miempirical methods, the latter having a
the voltage drop or voltage (in volts) and con-
hIgher degree of presupposition and/or param- vention dictates that in passive elements such as
eters derived from experimental data. resistance R, inductance L, capacitance C, short
There are certain discernable trends in the circuits and open circuits, the current flow
current direction of chemical research which through the element is the direction of voltage
are worth noting. The first of these is the in- drop or voltage across the element. In source
creasing tendency of interdisciplinary activi- elements such as ideal voltage sources and
ties between subdisciplines. The most promi- ideal current sources, the current flow is in the
nent . of t~ese include bioinorganic, bioorganic, direction of the voltage rise across the source.
physIcal morganic, physical organic and or- The use of the words through with regard to
ganometallic chemistry. On a larger s~ale, joint current and across with regard to voltage may
programs involving chemistry, physics and be observed.
engineering may be found in the rapidly in- Three types of equations are used in the anal-
creasing number of material sciences programs. ysis and synthesis of electrical circuits. These
One should also note the pervasive utilization are the continuity, compatability and elemen-
of analytical instrumentation in virtually all tal equations. A continuity equation is a rela-
areas ranging from synthetic organic to experi- tionship among through variables, and in the
mental physical chemistry. In similar fashion, electrical case this is the Kirchhoff Current
methods traditionally associated with physical Law: The algebraic sum of the currents enter-
chemistry, such as X-ray diffraction and molec- ing a point (node) in an electrical circuit (net-
ular orbital calculations, are now extensively work) must be zero (L i = 0). A compatability
used by inorganic and organic chemists. There equation is a relationship among across variables
has also been a strong interest in chemistry re- and in the electrical case, this is the Kirchhoff
lated to industrial processes giving rise to con- Voltage Law: The algebraic sum of the voltage
tinued growth not only of polymer chemistry drops around a closed path (loop) in an elec-
but also the study of catalysis and surface
chemistry.
tri~al circuit (network) must be zero (L
LlV = 0).
It IS to be noted that in applying the Kirchhoff
CHRISTOPHER W. ALLEN Laws, currents flowing away from a point are
treated as a negative current flowing toward the
point and negative voltage drops within a path
References are treated as voltage rises .
A readable introduction to chemistry may be found in: . Ele~ental Equations The third type of equa-
Dickerson, R. E., Gray, H. B., and Haight, G. P., Jr. tIon IS the elemental equation which is a rela-
"Chemical Principles", 3rd Ed., Benjamin/Cummings: tionship between the through and across vari-
Menlo Park, CA, 1979. ables for a particular element. For the cases of
Brief surveys of current activity in various subdisci- R, Land C, the elemental equations are
plines of chemistry may be found in Chemical and
Engineering News: 58(33), 30 (1980) (analytical); v=Ri; i = vIR =Gv (1)
59(9), 28 (1981) (biochemistry); 59(46),42 (1981)
(inorganic); 58(11), 34 (1980) (organic); 58(22), 20
(1980) (physical); 58(50), 30 (1980) (polymer). which is known as Ohm's Law and where it is
to be noted that G = l/R,

J
Cross-references: BOND, CHEMICAL; ELEMENTS,
CHEMICAL; ISOTOPES; MOLECULAR WEIGHT' di
MOLECULES; PHYSICAL CHEMISTRY; PHYSICS; v=L - '
dt'
.
1= LI i dt (2)
SPECTROSCOPY.
183 CIRCUITRY

and ZL(S)=Ls; YL(S) = l/Ls (9)

i =C :~; v= ~ Jv dt. (3)


Zc(s) = l/Cs; Y c(s) = Cs.
Then if a direct current (d-c) voltage v = v(s = 0)
(10)

Equations (1), (2), and (3) demand that the is applied ZR(O) = R, ZL(O) = 0, Zc(O) = 00
elements be linear, that is, they possess an out- and YR(O) = l/R =G, YL(O) =00 and Yc(O) =O.
put (either i or v) that is in direct proportion If an alternating current (a-c) voltage is ap-
to the input (either i, v, di/dt or dv/dt) which plied, Euler's equations can be used to show that
shows that R, L, and C may be viewed as simple s = jw (j = yCi) and ZR(jW) =R, ZL(jw) =
proportionality constants. Equations (1), (2), jwL =jXL,ZC(jW) =-j(l/wC) =-jXc, YRCiw) =
and (3) may also be derived experimentally or l/R = G, YLCiw) = -j(1lwL) and Yc(jw) =
by an interpretation of electromagnetic field jWC. In the foregoing XL and Xc are respec-
phenomena. tively called the inductive reactance and the
For the case of the short circuit capacitive reactance.
The Kirchhoff Laws can be employed to yield
i = i; LlV = 0 (4) the familiar voltage and current divider relation-
ships. For n impedances in series with an applied
which states that a short circuit passes all of the voltage Vin(S), the voltage drop across the ith
current with no voltage drop and for the open impedance is given by the voltage divider rela-
circuit tionship
i = 0; LlV = v (5)
Ll Vj(s) =
which states that an open circuit possesses the
impressed voltage but passes no current.
Circuit Connections Circuit elements may be
connected in series in which they are connected k=I,2,3, ... ,i, ... ,n. (11)
in such a manner as to provide a single path for
the flow of current or in parallel in which they In similar fashion, for n admittances in parallel
are connected so as to provide a division of cur- (recall that Y(s) = l/Z(s» with an applied cur-
rent among the elements. It is apparent that in rent Jin(S), the current through the ith admit-
series circuits, the Kirchhoff Voltage Law must tance is given by the current divider relationship
prevail and in parallel circuits, the Kirchhoff
Current Law must apply to the element connec-
tion points. Equivalent single values of R, L,
and C may be determined (R eq , Leq and Ceq)
where for the series circuit
m n k = 1, 2, 3, ... , i, ... , n. (12)
Req = L: R i ; Leq = L: L j ; Ceq = p 1
i=1 j=1
L:-
C
k=l k
Circuit Analysis Electrical circuit or network
analysis involves the determination of unknown
currents or voltage drops when the circuit is
(6) specified and subjected to a forcing function.
and for the parallel circuit For a particular quantity of interest such as the
current through a series connection of circuit
elements when subjected to a voltage forcing
1 function, the response is composed of a tran-
Leq = -n--l-; sient and a steady state response. For example,
L:-
j=1 Lj
consider the connection of a resistor, capacitor,
inductor and a switch all in series with a volt-
age source Vin(t). Kirchhoff's Voltage Law can
(7) be employed to yield the integro-differential

f
It is useful to define a quantity called the im- equation
pedance to exponentials or merely the imped-
ance as the ratio of an exponential voltage, v = d· 1
L d; + Ri + C i dt = Vin(t) (13)
Vest to an exponential current i = Jest. With the
admittance to exponentials taken as the recip-
rocal of impedance, Eqs. (l), (2), and (3) can where it is presumed that the switch closes at
be used to show that, for R, L, and C, t = O. The total response is given by
(8)
CIRCUITRY 184

where ie(t) is the complementary function (also Any of the oscillating forms of (16) describe
termed the natural or transient response) and the underdamped case.
ip(t) is the particular integral (known also as The particular integral of Eq. (13) may be
the forced or steady state response). found by standard methods such as undeter-
The complementary function derives from mined coefficients or variation of parameters.
the assumption of an exponential solution, The complete general response is then the sum
i(t) = lest and its substitution into the homo- of ic(t) and ip(t) and the arbitrary constants in
geneous integro-differential equation yields ie(t) are evaluated after this sum is taken from

L
die
at 1
+ Ric + C
f ie dt =0
two different initial conditions to form the par-
ticular response.
Other Methods Other methods for analysis of
the complete response are in frequent use. One
which leads to the factored and algebraically of these is the method of Fourier analysis, which
adjusted auxiliary equation is useful when the circuit is subjected to a peri-
odic input. The input function is represented as
R 1 a Fourier series. Because the circuit is linear,
S2 + -s+- the principle of superposition (the total re-
L L LC lest = O. sponse to the total stimulus is equal to the sum
s of individual responses due to individual stimuli
acting alone) permits the response to be evalu-
Here it is observed that because I = 0 or est = 0 ated from a term-by-term solution summation
will yield trivial solutions from the individual terms of the input Fourier
series.
S2 + R s + _1_ = 0 Fourier analysis may be extended beyond
L LC .
Fourier series to first the Fourier integral, then
With a = RI2L and Wn 2 = 1 jLC, this equation to the Fourier transform, and finally to the
becomes the simple quadratic Laplace transform. This permits the entire
analysis to be conducted algebraically in the
S2 + 2as + Wn 2 = 0 complex frequency domain instead of on a
differential equation basis in the time domain
with two solutions for the exponent s in the via the Laplace transform of f(t):
assumed ie(t) = lest,
SI, S2 = -a ± ";a 2 - Wn 2 •
The form of the response is dictated by the
f[f(t)] =F(s) = 1
o
00
f(t)e-stdt (17)

discriminant a2 - Wn 2 = Wd 2 • If a> Wn, then


Wd is positive and SI and S2 are both negative, where s = a + jw, a complex frequency.
real, and distinct. The complementary solution Finally, by making use of the idealization of
will be a pair of exponentials, each with an arbi- the unit impulse defined as a pulse of unbounded
trary constant of integration magnitude acting over an infinitesinual time
period so that its behavior is not defined by its
ic(t) = II e(-CHWd)t + 12 e(-Ol-Wd)t (14) magnitude but by its behavior under integration
which is the overdamped case.
If a =Wn, then SI and S2 are negative, real (18)
and equal. In this case SI = S2 =S =-a and the
solution will be given by
one can employ the Faltung or convolution inte-
ic(t) = II e-Ol t + 12 te- Olt. (15)
gral (or simply the convolution of two functions)
This is the critically damped case.
If a < Wn, then a 2 - Wn 2 is negative and SI =
-a + jWd and S2 - a - jWd, a pair of complex con- (19)
jugate roots with negative real parts. A damped
oscillation occurs
where fl (t) is the actual input function, r2 (t) is
ic (t) = e -at (II cos Wdt + 12 sin Wdt) (16a) the circuit response to the unit impulse func-
tion, r(t) is the actual circuit response, and Tis
which may be written in terms of a sinusoidal a dummy variable of integration. This proce-
function and a phase angle dure is known as Borel's Theorem.
(l6b) Steady State a-c and doc Analysis The con-
cepts of circuit impedance and admittance per-
or mit a direct analysis of the steady state condi-
tion. The simplest possible case is that of the
ladder network in which every element current
185 COHERENCE

or voltage can be determined from any or all of semiconductors, monolithic circuits, capacitors,
the following : the two Kirchhoff Laws, voltage and inductors can also be added consistent with
or current dividers, and systematic combina- the device function.
tion of impedances (admittances) in series or Thick-film hybrid circuits consist of conduc-
parallel. tor, dielectric', and resistor patterns which are
When the network possesses loops in the graph screen printed on ceramic substrates. Miniature
theoretical sense, the method of loop (mesh) components such as monolithic integrated cir-
analysis to yield loop currents or node analysis cuits, semiconductor devices, capacitors, induc-
to yield node-to-datum voltages can be used. tors, and transformers are then added after the
Both of these methods involve systematic appli- basic circuitry patterns are formed.
cation of one of the Kirchhoff Laws to each Monolithic bipolar circuits consist of bipolar
loop or node (loops and nodes are evident from junction and monopolar junction field-effect
the topology of the circuit)-the voltage law for transistors, Schottky diodes, junction diodes,
loop analysis and the current law for node resistors and capacitors formed by oxidation,
analysis. The inevitable result is a set of n simul- diffusion, ion-implantation, chemical vapor de-
taneous, linear (and linearly independent) alge- position and photoengraving on and perhaps
braic loop or node equations which are easily within a single crystal semiconductor substrate.
solved by several methods. Interconnection is accomplished through the
For certain networks, the superposition theo- use of thin film circuitry.
rem (discussed above) or the network theorems Monolithic metal-oxide-semiconductors (MOS)
of Thivenin (also Helmholtz) and Norton may consist of MOS field-effect transistors, capaci-
be employed to advantage. The Thevenin and tors, conductors and resistors which are formed
Norton Theorems permit calculation of the by oxidation, diffusion, photoengraving, chem-
performance of a portion of the circuit (possi- ical vapor deposition, thin-film deposition, and
bly a single circuit element) from the terminal ion-implantation on and within a single-crystal
properties of the rest of the network. semiconductor or substrate.
Nonlinear Circuits and Circuits with Time-
Varying Elements Analysis of circuits with non- ALLAN D. KRAUS
linear and time-varying elements poses a problem
that is substantially more difficult than the Cross-references: ALTERNATING CURRENTS; CA-
analysis of their linear counterparts. For non- PACITANCE; CONDUCTIVITY, ELECTRICAL;
linear elements such as the pn junction diode or ELECTRICITY; FEEDBACK; INDUCTANCE; LA-
tunnel diode (resistors), the varactor diode (ca- SER; MASER; SEMICONDUCTORS; SEMICONDUC-
pacitor), and the common inductor driven to TOR DEVICES.
flux saturation by large currents, analysis can
be conducted using perturbation techniques or
by successive small signal approximations.
Examples of time-varying elements are the po- COHERENCE
tentiometer or rheostat (resistor) and the mov- Basic Definitions The term coherence as it is
ing plate capacitor used in communications sys- used in electromagnetic radiation studies is best
tems for tuning. Analysis of circuits with such explained by a discussion of Young's interfer-
elements requires a valid mathematical model ence experiment. Referring to Fig. I, we con-
of the element variation. For example, for a sider a self-luminous radiating source S (like a
resistor which is time varying, Eq. (13) becomes

J
mercury arc lamp) placed a distance 11 from an
opaque screen A. In the screen A two pinhole
d' + R(t)i + C
I openings PI and P2 are made a distance d apart.
L d; i dt = Vin(t) (20)
The radiation passing through the pinholes im-
pinges upon and is recorded upon a photo-
and the degree of analysis difficulty will depend graphic plate B a distance 12 from screen A.
on the form of R(t). If 11 is a very large distance from A (say
Microelectronics and Microelectronic Circuitry 11 ~ bd/"A., "A. being the average radiation wave-
Microelectronics, as the prefix micro implies, length) and the spectral width of the radiation
pertains to the area of technology that is asso- is made very narrow by filtering, then the
ciated with the fabrication of and operation of fringes observed on B in the neighborhood of 0
circuits, devices and systems (and subsystems) will be very sharp, and we say that the radiation
utilizing extremely small components. Four fields impinging upon PI and P 2 are very coher-
technologies are in widespread use: ent. This is in accord with our usual notions
Thin-film hybrid circuits consist of conduc- about the radiation from a point source. If, on
tor, dielectric, and resistor materials which are the other hand, we use the same source and II is
deposited on glass, ceramic, or crystalline sub- taken to be small (say of the order of b) and
strates by evaporation, sputtering or electro- the dimension b is large compared to d then
plating. The form of the deposition is obtained fringes will not be observed on B, and we say
by patterning through photoengraving, or by that the radiation fields impinging upon PI and
printing through a mask. In this technology, P2 are incoherent. Again this is to be expected
COHERENCE 186

Source
characteristic
S/ dim,"';.' b

~---------JI --------~----------~--------~
A B
FIG.!. Young's interference experiment.

since in this case we observe the radiation just diation at PI and the radiation at P2 . To give a
as it emerges from the lamp. As the distance more precise definition of coherence, Wolf 9
11 is increased from the order of b, faint fringes considered the correlation function rep 1 , P2 , T),
will begin to appear upon B. As 11 increases, the commonly termed the mutual coherence func-
fringes will become progressively stronger until tion, defined as
they become very pronounced when 11 ~ bdlX.
The intermediate states, when the fringes are rep 1 , P2 , T) = (V(P 1 , t + T) V*(P 2 ,t) (2)
present but not necessarily very strong, are where V(P 1 , t) is the radiation field at PI and
termed states of partial coherence. This experi- time t + T (in a scalar approximation), V* is the
ment may be performed using a variety of radiation field at P2 at time t and the brackets
sources. The experiment, as we have empha- () denote a time average. For convenience Wolf
sized, measures the coherence of the radiation used a complex notation (V* is the complex
when it reaches PI and P2 ; it does not measure conjugate of V), but this need not concern us
the coherence of the source. here.
A quantitative measure of the fringe strength It can be shown that the magnitude of the
called the visibility 0, was given by Michelson. 7 normalized form of rep 1 , P 2 , T), r(P 1 , P2 , T),
It is defined as where
0= [max - [min (1)
[max + [min
where [max is the maximum intensity recorded
in the vicinity of 0 and [min is the minimum
intensity. 0 varies between 0 and I and, is equal to the visibility D when the radiation
roughly, we term radiation fields at two points is quasi-monochromatic (narrow spectral width)
coherent when 0 = I (strong fringes) and inco- and all path lengths in the problem are small
herent (no fringes) when 0 = O. compared to clD.v (where c is the velocity of
In the above introduction, d was held fixed light and D.v is a characteristic spectral spread).
but it is most important to realize that the In this case it is appropriate to fix T at some
fringe visibility at 0 is a function of the spacing value To usually taken to be zero.
d and that D may be close to I for one spacing The definition given by Eq. (2) was intended
of d and close to zero for another spacing of d. to consider polychromatic fields in addition to
For example, if we fix 11, we will generally de- the quasi-monochromatic fields so often studied
crease the visibility of fringes at 0 by making d using the visibility O. Beran and Parrent 1 have
larger (more precisely it will decrease and then shown that the full function r(P 1 , P2 , T) is, in
increase in a succession of oscillations with the principle, measurable by an extension of the
successive peaks being reduced in magnitude.) techniques used in a Young's interference ex-
The visibility may thus be viewed as a measure periment. Modern discussions of coherence now
of the correlation (coherence) between the ra- center on the calculation and measurement of
187 COHERENCE

the mutual coherence function f(P I , P2 , r). etry, the study of laser radiation and the study
For calculation it is convenient to note that of the radiation from turbulent gases. The mea-
Wolf 9 has shown that f(P 1 , P2 , r) satisfies the surement of the contracted fourth-order
pair of wave equations moment
I 3 2 f(PI' P2 , r) Rik(P 1 , P2 , 0) = (IEi(P1, t)1 2 IEk(P 2 , t)12) (8)
\;h 2f(P 1 ,P2,r)= '2
c
ar2 (i= 1,2) (4)
(called intensity interferometry) has received
The vector properties of the electromagnetic considerable attention since it entailed consid-
field may easily be introduced into the mutual eration of the quantum aspects of the electro-
coherence function. In general one must con- magnetic field. This moment may be thought of
sider the tensor as the correlation of instantaneous intensities
h(PI, t) == IEi(P I , t)12
&ij(P 1 , P2 , r) = (Ei(P I , t + r)Ej*(P 2 , t» (5)
at two points and was originally studied as an
where Ek(Ps , t) is the kth component of the alternate method to the measurement of
electric field at the space point Ps and time t. In f(P I , P2, 0) for determining the angular diameter
general, this function has proved most useful of visible and radio stars.4, 5 In measuring
in the study of polarization effects for fields in f(P 1, P2, 0), we may use the Young's interfer-
which the radiation has a principal direction of ence experiment described above; the quantum
propagation. The form of &ij(P 1 , P2 , r) has, nature of the field rarely explicitly enters since
however, been derived for the radiation in a averaging times are usually long enough to per-
black body cavity. mit a classical analysis. To measure Rik, how-
The above considerations have been for radia- ever, we need to correlate the two signals
tion fields that are stationary in time. That is, Ii(P 1 , t) and h(P 2 , t) which are recorded using
the statistical parameters, as opposed to the de- photomultipliers or coincidence counters. Since
tailed structure of the radiation field, are inde- the relationship between the impinging electric
pendent of the absolute scale of time. For fields field and the ejected photoelectron is a statisti-
in which this is not true we must introduce the cal one, classical considerations did not suffice
concept of an ensemble average (similar to that for a deep understanding of the problem or for
used in statistical mechanics). The mutual co- consideration of the very important signal-to-
herence function &ij(P 1 , P2 , r) must be re- noise problems resulting from inadequate aver-
placed by the function &ijE(P I , tl ; P2 , t2) de- aging times.
fined as A quantum field formalism for coherence
theory has been studied by a number of authors.
References 6 and 14 summarize and reference
most of the basic work. Optical correlation phe-
where the overbar denotes an average over an nomena have been studied extensively and par-
ensemble of systems. ticular consideration has been given to the fun-
Higher-order Coherence Functions and Quan- damental differences between laser light and
tum Aspects As the concept of coherence grew thermal light. Laser models have been devel-
and the statistical formulation of the theory oped using coherence concepts. In addition it is
,~ame more into the fore it was realized that a possible to study the basic interaction between
two-point moment like &ij(P I , P2 , r) or light and matter from a coherence point of view.
&ijE(P I , t l ; P2 , t 2 ) was inadequate to com- For example, the Kramers-Heisenberg disper-
pletely describe the radiation field. Two fields sion formula may be generalized to include the
could have the same mutual coherence function effects of partially coherent incident electro-
and yet differ in the statistical content of the magnetic fields, and spontaneous radiation may
field. To completely describe the field, it was be studied when the initial atomic states are
necessary to consider higher-order moments correlated.
like Lijk/(P 1 , t l ;P2 , t2;P 3 ,t3;P4 ,t4)defined Applications (Measurement of the Angular
as Diameter of Stars) Coherence theory is of
great use in the treatment of imaging and map-
Lijk/(P 1 , tl; P2 , t 2 ; P 3 , t3; P4 , t4) = ping problems. It is especially convenient for
treating problems involving the effects of the
turbulent atmosphere on resolution and acoustic
scattering in the ocean. 2, 10, 11 Radiometry has
and, in fact, for a complete description it was also been studied using coherence theory.12,13
necessary to consider the probability density To present the reader with a definite example
function P[Ei(P1 , t I) 1 defined roughly as the of the use of the coherence theory formalism,
probability of the occurrence of a particular we will conclude this brief discussion with an
realization of the field. outline of the measurement of the angular di-
There are a number of problems that require ameter of visible stars.
consideration of higher-order moments. These Let us suppose that the source in Fig. 1 is a
moments are necessary in intensity interferom- visible star so that II is many light-years. There
COHERENCE 188

is no telescope big enough to resolve any star, 10. Ishimaru, A., "Wave Propagation and Scattering
and to make direct measurements of the angular in Random Media," Vols. I and II, New York,
diameter of a star, Michelson introduced the Academic Press, 1978.
use of interference experiments to essentially 11. Flatle, S. (Ed.), "Sound Transmission Through a
give one a bigger effective aperture. Fluctuating Ocean," Cambridge, U.K., Cambridge
For purposes of visible star measurement we Univ. Press, 1979.
can replace the star of diameter D by a circular 12. Marathay, A. S., Optica Acta 23, 785 (1976).
disk of diameter D lying in a plane parallel to 13. Walther, A.,J. Opt. Soc. Am. 58, 1256 (1968).
A. The radiation leaving the star is assumed to 14. Arecchi, F. T., BonifaciO, R., and Scully, M. O.
be incoherent so that for all points on the disk (Eds.), "Coherence in Spectroscopy and Modern
we may take r(P I , P2 , 0) =co(P I - P2 ). Using Physics," New York, Plenum Press, 1978.
Eq. (4) this allows us to solve for r(P I , P2 , 0) 15. Fran90n, M., "Optical Image Formation and Pro-
on the earth (screen A) if we filter the starlight cessing," New York, Academic Press, 1979.
to insure the validity of the quasi-monochro- 16. Marathay, A. S., "Elements of Optical Coherence
matic approximation. We find Theory," New York, John Wiley & Sons, 1982.

Cross-references: INTERFERENCE AND INTER-


JI kdDl2] FEROMETR Y, LASER, MASER.
r(PI, P2 , 0) = const. __ II
.la.-'::"!-~ (10)
kdDI2
II COLLISIONS OF PARTICLES
where k is the average wave number of the light Introduction Scattering experiments provide
and J I is a first-order Bessel function. the principal technique by which physicists at-
If we define the angular diameter of the star tempt to understand the structure and interac-
as 8 = DIll, we see that J I equals zero when tions of matter on a microscopic scale. Scatter-
8 = 1.22'Kld, where 'K = 2rrlk. When J I equals ing theory provides the basis of analyzing and
zero, there are no fringes on the screen B. interpreting scattering experiments. Other ap-
Hence to find the angular diameter of the star, plications are the scattering of acoustic waves
we need only increase d from zero, when there by sonar systems and electromagnetic waves
will be high contrast fringes, to the separation d by radar systems. Detailed descriptions of scat-
when there are no fringes. Putting this latter tering theory are available in several refer-
ences. I ,2,3,4
value into our expression for 8 gives us the
angular diameter of the star. The effects of A description of the development of scatter-
turbulence may be taken into account in this ing theory may be divided into several topics.
problem if Eq. (4) is generalized to include a The oldest and simplest branch of scattering
variable index of refraction. theory is that of potential scattering, or scatter-
Basic References For a fundamental treat- ing of two particles which interact through a
ment of coherence theory we refer the reader local potential. I Potential scattering was studied
to the following basic texts: Born and Wolf,2 extensively in the first two decades following
the development of quantum mechanics in the
O'Neill,S Beran and Parrent, I Klauder and Su- analysis of elastic scattering of particles by
darsham,6 Fran-;:on I5 , Marathay.16 atoms and of nucleon-nucleon scattering. The
MARK J. BERAN latter topic, in particular, led to the introduc-
tion of an elaborate theory of scattering by
noncentral interactions. 5 The development of
References nuclear physics, with the observation of reso-
1. Beran, M., and Parrent, G., Jr., "Theory of Partial nance reactions, indicated the need for more
Coherence," Englewood Cliffs, N.J., Prentice-Hall, general descriptions of scattering. The result-
1964. ing theory of resonance reactions 6,7 has leaned
2. Beran, M., Radio Science, 10, 15, (1975). only rather lightly on the details of the Schod-
3. Born, M., and Wolf, E., "Principles of Optics," inger equation. Quantum field theory was de-
London, Pergamon Press, 1970. velofed to describe electromagnetic phenom-
4. Hanbury Brown, R., and Twiss, R., Proc. Royal ena. Of major importance in the development
Soc. London Ser. A, 242, 300 (1957). of scattering theory was the introduction of re-
5. Hanbury Brown, R., and Twiss, R., Proc. Royal normalization techniques into FIELD THEORy.9
Soc. London Ser. A, 243, 291 (1957). It might be claimed that modern scattering
6. Klauder, J., and Sudarshan, E., "Fundamentals of theory began with the integral equation formu-
Quantum Optics," New York, W. J. Benjamin, lation of Lippmann and Schwinger lO and the in-
1968. troduction of S-matrix theory by Heisenberg"
7. Michelson, A., Phil. Mag. (5),30,1 (1890). and others. 12 This work has stimulated much of
8. O'Neill, E., "Introduction to Statistical Optics," the development of theoretical physics in the
Reading, Mass., Addison-Wesley, 1963. last two decades. Of particular significance are
9. Wolf, E., Proc. Royal Soc. London Ser. A, 230, the clarification of the study of rearrangement
246 (1955) . collisions and the development of the so-called
189 COUISIONS OF PARTICLES

dispersion techniques and current S-matrix to describe oa


may ordinarily be reduced. The
formalism. most commonly encountered of these sym-
The Scattering Cross Section The properties metries are: (1) energy and momentum conser-
of scattering interactions are usually expressed vation; (2) rotational invariance; (3) the Lo-
most conveniently in terms of the scattering rentz invariance of the scattering cross section
cross section. To define this term, we consider Oa.
the following scattering experiment: A beam of Energy and momentum conservation imply
particles (called beam particles) is directed on a that oa
contains the Lorentz invariant factor
scatterer consisting of target particles. As a re-
sult of collisions between beam and target par- o(P/ - P)o(E' - E),
ticles, there are particles which emerge from
the reactions (called reaction products) and where P and E are the respective initial total
these are detected in particle detectors. To de- momentum and energy and the primed vari-
scribe this quantitatively, we suppose that the ables refer to the final state. The quantities
scatterer contains Nt target particles and that oNs and Nt in Eq. (1) represent integer num-
this is uniformly illuminated by a flux F B (ex- bers of particles and are thus Lorentz invari-
pressed as the number of beam particles per ants. The beam flux FB is also invariant under
unit area per unit time arriving at the target) of Lorentz transformations along the beam axis
beam particles. We suppose also that the scat- parallel to Pl' For such transformations is oa
terer is sufficiently small that the beam is negli- therefore invariant. It is, in fact, customary to
gibly attenuated in passing through it. Then, if define oa
to be invariant under an arbitrary
there are oNs scattering interactions per unit Lorentz transformation. This will be the case
time which lead to detected particles, we define if FB in (1) is replaced by FI, the value of FB
the scattering cross section oa as in the laboratory frame of reference (for which
P2 = 0).
The particle detectors for which Eq. (I) is
(1) defined will ordinarily record events for a re-
stricted domain of the final spins and mo-
menta s/ .•. kp . From this and its Lorentz in-
In the limit that the detectors subtend very
small solid angles, as seen from the target, we variance we infer that oa
is of the form
define the differential scattering cross section
da. When a single detector subtending a solid
angle on is used to define oa,we may define
the cross section per unit solid angle as

da= l'1m
-
oa (2)
dn 50->0 on Here ek, is the energy of a particle with mo-
mentum kl and integration is over the domain
The total scattering cross section a is obtained of detection. Since d 3 k l /el etc. are Lorentz
by summing oaover all scattering events: invariant, we see that I must be a Lorentz in-
variant function of the momenta and spin
a = Loa (3) variables.
We have noted that in the laboratory frame
(Eq. (3) does not exist for scattering by a cou- of reference the target particles are initially at
lomb force). rest, so P2 = O. In the center-of-mass, or bary-
The scattering cross section may be expressed centric, frame of reference the total momentum
in terms of the square of the magnitude of a is zero, so P2 = -Pl'
scattering amplitude (or S-matrix, or r = matrix Potential Scattering We briefly illustrate the
element) and is completely described as a func- discussion of the preceding section with the
tion of the momenta and internal states of the example of nonrelativistic scattering by a local
particles in the initial and final states. Thus, for central potential V(r). The SCHRODINGER EQUA-
the two particles prior to collision 13 we may TION for scattering in the barycentric coordi-
take the momenta PI and P2 and the internal nate system iS 14
state quantum numbers Sl and S2 as variables.
(For example, 81 and S2 may describe spin orien-
tation, isotopic spin, etc. For colliding molecules
[Vr 2 + K2 - v(r») 1/I,/(r) =0 (5)
these variables will describe vibrational, rota-
tional and electronic states). We may suppose Here -ilK is the momentum of particle "I" in
there to be J1 particles in the final state follow- the barycentric system and v(r) =(2Mr /.fP) V(r),
ing the collision and specify this state by the with Mr the reduced mass of the two particles.
momenta and internal variables kl ... kp , Sl/ . . . In the limit of large separation r between the
sp/. The scattering cross section may be ex- particles the wave function 1/1,,+ has the asymp-
pressed in terms of these variables. Because of totic form [our notation is such that we r~pre-
symmetries, the number of variables required sent a unit vector in the direction of K by K)
"
COLLISIONS OF PARTICLES 190

For an unpolarized initial state, corresponding


(6) to a uniform mixture of the (2S 1 + 1)(2S 2 + 1)
spin states, the cross section for scattering par-
A A ticle "1" into the direction r with any spin ori-
Here t(K' r) is the scattering amplitude for entation is
scattering particle "1" from the direction ~
into the direction l'. The corresponding cross do
section per unit solid angle is dn = (2S 1 + 1) (2S 2 + 1)
(7) X L L
VI ',V1' Vl,"2
I(VI',v 2'ltlvl,v 2)12 (14)

The wave function t/l1(+ may be expanded into where the sums extend over all spin orientations.
partial waves as follows: Following scattering by noncentral forces, the
particles will in general have preferred spin
orientations, or be polarized. When, for exam-
ple, particle "1" has spin one-half with a spin
operator aI, we define its polarization vector
P(VI, V2) by the equation
Here Pr is the Legendre polynomial of order 1,
Or is the scattering phase shift [see Eq. (10) be- I
l?w], and wI(I<; r) satisfies the differential equa-I P(V I ,V2)= { ,,~ ,[(v t ,v{lr lv l,V2)]*
hon v. ,v. tV,

[ !f.-
dr2
+ K2 - l(l + 1) - VCr)] W = 0
r2 I
(9)
X (VI" lall VI')(VI', V2' It IVI, V2) }

This is to be integrated subject to the condition


that WI is regular at r = 0, For large r, WI has the X{ f:, I(VI', v2'lflvl, V2)1 2 }-1
asymptotic form Vl,V2

(15)
wI(K;r) ~ ~ sin (Kr - ~ + 01) (10)
For an un polarized initial state, the polarization
is
It is Eq. (10) which permits the determination
of the phase shift 0I. The quantity
(16)
S/(K) = exp [2iol(K)] (II)
is an eigenvalue of Heisenberg's S-matrix. ll The study of polarization following scattering
For scattering by noncentral forces, the po- has provided an important tool for analyzing
tential VCr, S b S2) is a function of r (and some- nuclear and elementary particle reaction.1 5 ,16
times the orbital angular momentum operator) In particular, the role of noncentral interac-
and the spin operators SI and S2 of the two tions in nucleon-nucleon scattering has been
colliding particles (if either has no spin, we con- studied in great detail. 1 7
sider its spin operator to be zero). Spin eigen- Formal Scattering Theory To describe a gen-
functions U(VI, V2) may be introduced as de- eral scattering reaction Lippmann and Sch-
pending on the orientations VI and V2). Then winger lO ,I8 introduced a scattering matrix j'ba
the wave function t/lK,VI,V z+ is to be labeled with to describe scattering from an initial state Xa to
the initial spin orientations VI and V2' The as- a final state Xb. 18
ymptotic form corresponding to Eq. (6) is
(17)
t/lK,VI.'V'+ ~ (2rr)-3/2 [eiK.rU(V I , V2) where Ka+ is the steady-state wave function for
the event and V is the scattering interaction.
+e lKr
- "~ (VI,V2 t'>AI VI,V2)U(VI,I
" 1 t(",r) Since momentum is conserved for an isolated
r Vl',V2' scattering, we may write

Here (VI', v2'1 t(i( , l')1 VI V2) is the scattering am-


plitude for scattering to a final spin orientation where Pa and Pb are the total momenta of the
VI, V2'. The cross section per unit solid angle is particles in the initial and final states, respec-
in this case tively, and Tba is defined only for states band
a corresponding to Pb = Pa.
The scattering cross section oa
[Eq. (1)] is
expressed in terms of Tba as
191 COLLISIONS OF PARTICLES

nuclear and atomic physics. 21 The eikonal ap-


proximation has been used in the description of
rearrangement collisions of slow ions, atoms,
and molecules. 22
(19) The careful description of three-body scatter-
ing given by Fadeev 23 has led to active study
Here Vrel is the relative velocity of beam and of this and related phenomena. 24
target particles, Eb and Ea are the respective Variational principles have also been devel-
total energies of the particles in states b and a, oped for application to several body collisions. 2S
and the sum on b extends over those states Field Theory Quantum field theory was orig-
which lead to the reaction products striking the inally developed to describe electromagnetic
detectors and thus to register an event. We em- phenomena. It was applied in a promising con-
phasize that Eq. (19) is Lorentz invariant and text during the 1930s to (3-decay and to the
conforms to the general structure of Eq. (4). meson theory of nuclear forces. The great opti-
The Heisenberg S-matrix 11 is given by the mism following the development of renormali-
expression zation theory9 faded for want of adequate
mathematical techniques for handling strong in-
teractions. Semiphenomenological calculations
where Dba is a Dirac D-function. The S-matrix of Chew and others 26 gave useful insight into
is unitary, so the strong interaction phenomena. Heisenberg
suggested in 1946" that a proper quantum the-
L Scb t Sba = Oca (21 ) ory of scattering would deal with only observ-
able quantities such as the S-matrix and should
b
not require off-the-energy-shell matrix elements
On substituting Eq. (20) into this, we obtain of such quantities as j" [Eq. (18) J. Considerable
the equivalent expression of unitarity impetus for this point of view has been given by
the development of dispersion theory, follow-
i[1ca-1ca t ) = 2rr L 1cb t O(Eb- E a)1ba ing early suggestions of Wigner and others.2 7 The
b first attempt at a systematic formulation of a
dispersion relation within the context of quan-
(22) tum field theory was made by Gell-Mann, Gold-
berger, and Thirring. 28 Further development
which is defined only for states c and a on the followed applications of formal scattering the-
same energy shell (corresponding to Ec =Ea). ory to quantum field theory.29 The develop-
The fundamental problem of scattering theory ment of the Mandelstam representation 3o pro-
is to determine the j" -matrix on the energy shell vided an important step toward obtaining a
(or, equivalently, the S-matrix). The first step "dynamical principle." A further important
in doing this is to make use of general symmetry step was the proposal by Chew and Frautschi 31
principles (such as Lorentz invariance) to limit and Blankenbecler and Goldberger,32 who sug-
the functional forms allowed. Following this a gested that the only sinularities of the S-matrix
dynamical principle is needed. Such dynamical are those required by the unitarity condition
principles (reviewed in Chapters 5 and 10 of [Eq. (22») and that families of particles should
Ref. 2) have been proposed in a great variety be associated with Regge Trajectories. 33
of forms including integral equations, varia- Considerable impetus was given to the appli-
tional principles and conditions of functional cation of quantum field theories by the devel-
analyticity. opment of gauge theories of weak interactions,
Scattering from Composite Systems Scatter- following a model of Yang and Mills. 34,3 S Quan-
ing from systems composed of two or more tum chromo dynamics, an SU(3) gauge theory,
particles is generally very complex. This is in has provided insight into the "strong" particle
part due to the occurrence of sequential inter- interaction scattering phenomena. 36
actions, in part due to the dynamics of the
scattering system, and in part due to the possi- KENNETH M. WATSON
bility of rearrangement phenomena.
Description of sequential interactions can be References
given in terms of the multiple scattering and op-
tical model equations. 19 1. The early development of scattering theory is well
Rearrangement collisions (i.e., collisions in described in the classic work of Mott, N. F., and
which bound particles rearrange themselves) Massey, H. S. W., "The Theory of Atomic Colli-
have been studied extensively following the de- sions," Oxford, Clarendon Press, 1933.
velopment of formal scattering theory. Much of 2. Goldberger, M. 1., and Watson, K. M., "Collision
the early work 2o was stimulated by the observ- Theory," Huntington, N.Y., Krieger, 1964.
ance of apparent paradoxes. Later work has 3. Newton, R. G., "Scattering Theory of Waves and
tended to be directed toward specific applica- Particles," New York, McGraw-Hill, 1966.
tions. The formulation of Fesbach, for exam- 4. Joachain, C. J., "Quantum Collision Theory,"
ple, has led to a variety of applications in Amsterdam, North-Holland, 1975.
COLLISIONS OF PARTICLES 192

5. Rarita, W., and Schwinger, J., Phys. Rev. 59,436 24. Weinberg, S., Phys. Rev. 133, B232 (1964); New-
(1941); Christian, R. S., and Hart, E. W., Phys. ton, R. G., Nuovo Cimento, 24, 400 (1963); Wat-
Rev. 77,441 (1950); Christian, R. S., and Noyes, son, K. M., and Nuttall, J., "Topics in Several
H. P.,Phys. Rev. 79,85 (1950). Particle Dynamics," San Francisco, Holden-Day,
6. Breit, G., and Wigner, E. P., Phys. Rev. 49, 519, 1967.
642 (1936). 25. Schwartz, C., Phys. Rev. 124, 1468 (1961);
7. Wigner, E. P.,Phys. Rev. 70, 15,606 (1946); Wig- O'Malley, T. F., Sprach, L., and Rosenberg, L.,
ner, E. P., and Eisenbud, L., Phys. Rev. 72, 29 J. Math. Phys. 2, 491 (1961), and earlier refer-
(1947); Sachs, R. G., "Nuclear Physics," Reading, ences; Sugar, R., and Blankenbecler, R., Phys.
Mass., Addison-Wesley Publishing Co., 1953. Rev. 136, B472 (1964).
8. See, for example, Mandl, F., "Introduction to 26. This subject is reviewed by Wick, G. C., Rev. Mod.
Quantum Field Theory," New York, Interscience Phys. 27,339 (1955).
Publishers, 1959. The older work is admirably de- 27. The history of this subject is reviewed in Ch. 10
scribed in G. Wentzel, "Quantum Theory of of Ref. 2.
Fields," New York, IntersciencePublishers, 1949. 28. Gell-Mann, M., Goldberger, M. L., and Thirring,
9. Feynman, R. P., Phys. Rev. 76, 749 (1949); Dy- W., Phys. Rev., 95,1612 (1954).
son, F. J., Phys. Rev. 75,486 (1949); Tomonaga, 29. Lehmann, H., Symanzik, K., and Zimmerman,
S., Progr. Theoret. Phys. (Kyoto) 1, 27 (1946); W., Nuovo Cimento, 1, 205 (1955) .
Schwinger, J.,Phys. Rev., 74,1439 (1948). 30. Mandelstam, S., Phys. Rev., 112, 1344 (1955);
10. Lippmann, B., and Schwinger, J., Phys. Rev., 79, 115,1741,1759 (1959).
469 (1950). 31. Chew, G. F., and Frautschi, S. C., Phys. Rev.
11. Heisenberg, W., Z. Naturforsch. 1,608 (1946). Letters 8, 41 (1962).
12. Wheeler, J. A., Phys. Rev., 51, 1107 (1937); 32. Blankenbecler, R., and Goldberger, M. L., Phys.
Moller, C., Kgl. Danske Videnskab. Selskab, Mat. Rev. 126,766 (1962).
Fys. Medd. 23, 1 (1948). 33. Reviews of the S-matrix theory of scattering can
13. The case that more than two particles collide is be found in Chew, G. F., "S-Matrix Theory of
important for the discussion of chemical reactions Strong Interactions," New York, W. A. Benjamin,
in gases and liquids. This is discussed, for exam- 1961. Omnes, R., and Froissart, M., "Mandelstam
ple, in Ch. 5 and Appendix B of Ref. 2. Theory and Regge Poles," New York, W. A. Ben-
14. The notation used here follows that of Ref. 2. jamin, 1963.
15. Wolfenstein, L., and Ashkin, J., Phys. Rev. 85, 34. Weinberg, S., Rev. Mod. Phys. 46, 255 (1974);
947 (1952); Simon, A., and Welton, T.,Phys. Rev. ibid. 52, 515 (1980).
93, 1435 (1954); Wolfenstein, L.,Am. Rev. Nucl. 35. Taylor, J. C., "Gauge Theories of Weak Interac-
Sci. 6,43 (1956). tions," Cambridge, U.K., Cambridge Univ. Press,
16. A comprehensive account of the theory is given in 1976.
Ch. 7 of Ref. 2. 36. Marciano, W., and Pagels, H., Phys. Rep. 36C
17. Moravcsik, M. J., and Noyes, H. P., "Theories of (Nov. 3, 1978).
Nucleon-Nucleon Elastic Scattering," Ann. Rev.
Nucl. Sci. 11,95 (1961). Cross-references: ATOMIC AND MOLECULAR
18. Gell-Mann, M., and Goldberger, M. L.,Phys. Rev. BEAMS, CONSERVATION LAWS AND SYMME-
91,398 (1953). TRY, CROSS SECTIONS AND STOPPING POWER,
19. Watson, K. M., Phys. Rev. 89,575 (1953); 105, ELEMENTARY PARTICLES, FIELD THEORY,
1388 (1957); Francis, N. C., and Watson, K. M., SCHRODINGER EQUATION, GAUGE THEORIES.
Phys. Rev. 92, 291 (1953); see also Chapter 11 of
Ref. 2.
20. Foldy, L., and Tobocman, W., Phys. Rev. 105,
1099 (1957); Epstein, S., Phys. Rev. 106, 598 COLLOIDS, THERMODYNAMICS OF
(1957); Lippmann,B., Phys. Rev. 102,264 (1956);
Brening, W., and Haag, R., Forschr. Physik. 7, 183 All colloids can be physically described by ag-
(1959); Cook, J., J. Math. Phys. 6, 82 (1957). A glomerates of microparacrystals (mpc's). Their
somewhat more flexible interpretation has been size depends on the paracrystalline lattice dis-
made in Chapter 4 of Ref. 2. tortions (see MICROPARACRYSTALS and MI-
21. Feshbach, H., Ann. Phys. (New York) 5, 357 CROPARACRYSTALS, EQUILIBRIUM STATE OF).
(1958); ibid. 19,287 (1962); Hahn, Y., and Spruch, Within each microparacrystal the distortion
L., Phys. Rev. 153, 1159 (1967); Hahn, Y .,Phys. energy U at the boundaries reaches such high
Rev., C1, 12 (1970); Chen, J. C. Y., Phys. Rev. values that the netplanes are destroyed because
156, 159 (1967); Weller, H., and Roberson, N., the angles of the valence bonds are overstrained.
Rev. Mod. Phys. 52,699 (1980). Figure 1 shows schematically an atom and its
22. Chen, J. C. Y., and Watson, K. M.,Phys. Rev. 174, valence bond in unstrained position; hence
152 (1968); ibid. 188, 236 (1969); Hatton, G., r = 0 has a minimum potential flU. flU increases
Chen, J., Ishihara, T., and Watson, K.,Phys. Rev. with the square of the deviation r from the un-
A12, 1281 (1975). distorted position. Averaging over all atoms one
23. Fadeev, L. D., Zh. Eksperim. i Teor. Fiz. 39, 1459 obtains for flU a value proportional to rZ. In
(1960); English trans., Soviet Phys. JETP, 12, the case of microparacrystals this value is pro-
1014 (1961). portional to the number n of netplanes (see
193 COLOR

term proportion to N 4 :

flG = a{ N;a + i N3fl Gu + iN4AOg2).

u a
Here is the mean netplane distance. For g = 0
the well known dependence of flG on N is
t plotted as the dashed line in Fig. 2. When g =
4.2%, the si~ of the critical nuclei increases by
10% and at N = 13 there is a stoppage of growth
(solid line in Fig. 2). In the limiting case, g =
7.5% (dotted line in Fig. 2), the critical nl!,c1ei
have the same sizes as the mean number N of
the equilibrium state. Now one is concerned
with microparacrystals in a melt. The thermo-
dynamics of solid colloids can be understood
as having g-values smaller than those of the
limi ting case. 1

FIG. 1. Atom with one of its valence bonds to a ROLF HOSEMANN


neighboring atom and its potential U which depends
on the value r by straining the valence angle. Reference
1. Hosemann, R., Colloid & Polymer Sci. 260, 864-
870 (1982).

i
MICROPARACRYSTALS, EQUILIBRIUM STATE
N/2 Cross-references: PARACR YST ALS; MICROPARA-
CRYSTALS; MICROPARACRYSTALS, EQUILIB-
OF), hence flU = LAr/ = 2AN2 nfl t 2 = RIUM STATE OF.
i 0
A oN 4 g 2 /4, where fl t 2 =g2J2 is the distance
variance between adjacent netplanes: flU ~
A og2 N. The free energy flG of a cubic micro- COLOR
paracrystal now contains, in addition to the Definitions
well known terms of the surface free energy a 1. Color (n) The noun color is used in dif-
and the volume enthalpy flG u (which are pro- ferent ways by different people to convey some-
portional to N2 and N3, respectively), a third thing about what we see when we look at an
object. It has been used to describe differences
among spectral power distributions of objects,
fiG differences among chemical constitutions of
colorants, and conditions for color measure-
8 150 ment by colorimetry, in addition to the per-
ceived appearances of objects or media. There
6 is, then, no single, universal, definition 9f color.
f m The Commission Internationale de l'Eclairage
(CIE) recommends two definitions: perceived
4 color and psychophysical color. CIE perceived
50 color is "an aspect of visual perception by which
2 N- an observer may distinguish differences between
two fields of view of the same size, shape, and
O~~~~--~~~~--~--~
6 8 10 12 14
o structure, such as may be caused by differences
in the spectral composition of the radiation
g=O% concerned in the observation." CIE psycho-
2 , - g=4.2% 50 physical color is said to be "a characteristic of
.,. g=7.5% - a visible radiation by which an observer may
4
, \ ,, _0_=1'2 distinguish differences between two fields of

5 rfrr)\ .,,
,,
dlt\Gyl -100 view of the same size, shape, and structure,
such as may be caused by differences in the
spectral composition of the radiation concerned
N=3 ' \ -150 in the observation."
B~ ______-L~____~~______________~ 2. Color Perception An attribute of vision
consisting of chromatic and achromatic com-
FIG. 2. Free enthalpy of solid microparacrystals ponents. Chromatic components of color per-
(0 < g < 7.5%), crystals (g = 0%), and liquid micro- ceptions allow organisms to distinguish among
paracrystals (g - 7.5%). For details see text. hues, such as those called red, green, yellow,
COLOR 194

blue, etc., and to distinguish among their satu-


rations or degrees of chromatic content. Achro- o
matic components of color perceptions refer to ~

intensive aspects of the perceptions such as those !;;;


commonly designated by the words white, gray, ..c
black, bright, dim, light, -or dark. '"~ -1
3. Color Stimulus The agent by which an "iii
organism is stimulated to see color; color stimuli
commonly consist of radiant energy called light. .
;;
II:

....o
In general, then, color stimuli consist of visible
radiation entering the eye and producing the
sensation of color, either chromatic or achro-
matic. ."
c
4. Color Measurement The practice or meth- o
odology of attempting to measure the relation- :- 0
ships between color stimulation and color per-
.
II:
'E
ception; one form of psychophysics, the science c
of determining correspondences between stimuli o
0.
0.
and responses. A special case of color measure- o
ment is called colorimetry, which deals mainly
with the specification of stimuli that match in
color appearance.
!
Perception of Color All color perceptions
have at least three attributes: hue, saturation or
400 500 600 700
chromatic content, and brightness. When a color Wavelength (nm)
stimulus is perceived as part of an array of other
stimuli, it may also have other attributes derived FIG. 1. Estimated p, 'Y, (3 cone receptor systems sen-
from the perceived relationships of its basic sitivities (upper dashed curves) and sensitivities of
attributes to those of the other stimuli in the achromatic (w·bk) and chromatic (r-g and y-b) neural
field of view. These are called relative hue, processes (solid curves), both as functions of wave-
relative chromatic content (or saturation), and length.
relative brightness (or lightness). These and
other derivative attributes of color perceptions
are conscious awarenesses that arise in the brain sensitivity in the region of 560 nm. A second
as a result of stimulation of the organism and class of cones (r) is most sensitive to middle
signal processing carried out within the organ- wavelengths of light and has a peak sensitivity
ism's nervous system. around 530 nm. The third class of cones (13) is
The mechanism of human color vision consists most sensitive to light of short wavelengths near
of an optical system for collecting and forming 420 nm. Estimations of the distributions of
images of light, neural networks that detect and spectral sensitivities of p, r, and {3 cones are
respond to light and that transmit encoded sig- illustrated by dashed curves in Fig. I.
nals, and neural cells in the brain that elaborate Discrimination among color stimuli of differ-
and interpret the signals. The visual mechanism ent spectral powers occurs as a result of the dif-
then includes the eye, the neural pathways, and ferent spectral sensitivities of the cones. How-
the brain. ever, regardless of differences in spectral power,
The lens of the eye forms an image of light on when two color stimuli excite the cones equally,
the retina at the rear of the eyeball. The retina the stimuli cannot be distinguished as different;
contains six kinds of cells called neurons. One that is, when the rate of photon absorption is
class of neurons consists of photodetectors, the same for each stimulus, the stimuli appear
which are capable of responding to light or identical. This principle of univariance is the
changes in light. They are called receptors. basis for metameric color matching, where two
There are about 10 8 rod receptors and 8 X 10 6 spectrally different stimuli can be made to match
cone receptors, so named because of their re- in color by properly adjusting their relative
spective shapes. Each receptor contains one of radiances.
four kinds of photolabile pigments, the stereo- The remaining neurons of the retina combine,
chemical forms of which can be altered by ab- elaborate, and encode electrophysiological sig-
sorption of photons (light quanta). The distri- nals set up in response to absorption of photons
bution of rods and cones varies over the retina; by the receptors. Many receptors may be inter-
there are many more cones than rods near the connected, and their signals converge on higher
visual axis. Rods are useful for night vision but order (Le., more proximal) neurons. The axons
do not contribute significantly to color vision of the most proximal neurons of the retina
or acuity. There are three classes of cones, dis- (ganglion cells) form the optic nerve, a sheath
tinguished by their different spectral sensitivities of about 10 6 fibers that transmit signals to the
to light. One class of cones (p) is most sensitive lateral geniculate nucleus of the thalamus (the
to long wavelengths of light, with a maximum next highest level of the visual mechanism). The
195 COLOR

encoded signals that leave the eye through the different matches from those of normal ob-
optic nerve are of two kinds: spectrally oppo- servers. Dichromatic observers require only two
nent and spectrally nonopponent (most are also lights, and monochromatic observers are satis-
spatially opponent). Spectrally opponent cells fied with a single light.
signal to different extents and with different The wavelengths at which the opponent re-
polarities as a function of wavelength of stimu- sponse functions of Fig. I are zero identify the
lation, whereas spectrally nonopponent cells spectral stimuli that elicit unitary hues for nor-
signal to different extents with the same polarity mal observers under the viewing conditions
as a function of wavelength. The spectrally non- represented in the graph. The supra threshold
opponent signals (A) are thought to be the result r-g curve is zero at about 475 nm. That leaves
of additive combinations of signals from P and only the b signal at that wavelength, so it cor-
'Y cone networks. The spectrally opponent sig- responds to the spectral stimulus that elicits a
nals (C 1 and Cz) are thought to consist of dif- blue color response with no trace of any other
ferences among classes of cone signals. The hue (for the viewing condition in question).
neural signals are conventionally represented The y-b curve crosses the zero line at about
mathematically as: 505 nm, leaving only g, so that would be the
wavelength for a unitary green blue. The r-g
A=allP+a12'Y (1) curve again becomes zero at about 580 nm,
C1 =aZlP - azz'Y (2)
so that wavelength corresponds to a unitary
yellow hue. Unitary red hue can be produced
Cz = a31 P + a3Z 'Y - a3313 (3) only by a combination of short- and long-wave-
length stimuli for the viewing conditions illus-
although there are a number of variations on trated. The hues corresponding to all other
these expressions. stimuli contain proportions of two hues. These
A represents a neural signal analogous to other perceptions are therefore known as binary
achromatic perceptions: white to black or bright hues. All hues are either unitary or binary. That
to dim (sometimes symbolized w-bk). C 1 repre- is, they may appear red or green (but not both
sents a neural signal corresponding to redness at the same time), yellow or blue (but not both
(for positive values) or greenness (negative at the same time), yellow-red, yellow-green,
values); it is therefore sometimes symbolized blue-red, or blue-green, and in various degrees
r-g. Cz represents a neural signal corresponding of lightness or darkness. Even the color normally
to yellowness (positive values) or blueness called brown can be described as a dark yellow-
(negative values); it is therefore sometimes red color. Table 1 lists several relationships be-
symbolized y-b. When C 1 = Cz = 0, only A tween attributes of perceived colors and condi-
remains for any suprathreshold stimulus and tions of the A, C 1, and Cz neural signals of
so it is seen as achromatic (white, gray, black, Eqs. (1)-(3).
etc.). The spectral distributions of sensitivities Additional elaboration and analysis of neural
for the A, C 1, and C z neural mechanisms signals takes place in the thalamus and occipital
depend upon those of p, 'Y, and 13 and upon the cortex of the brain, but the general form of
interconnections in signal processing; there are color-coded signals seems to be the same as
also differences between threshold and supra- those leaving the retina. These signals are orga-
threshold processing conditions. Estimated nized into receptive fields representing the com-
spectral sensitivities of the neural functions are bined responses of many receptors and inter-
shown as the solid curves in Fig. 1. The viewing neurons. In this way, stimulation in one area of
or adaptation condition represented by that the retina influences signal responses in adjoin-
figure is a physiological neutral state such as ing and nearby areas. These interactions give
that associated with dark adaptation or, for the rise to contrast and induction effects. When,
dashed extension of the r-g curve, suprathresh- for example, a gray paper is viewed against a
old daylight adaptation. white background, it appears darker than when
Figure I corresponds to observers with normal it is seen on a dark background. The higher level
color vision. Deviations from normal color vision of activity elicited by the white background
are thought to arise when one or more of the inhibits response to the gray paper, making it
cone classes is missing or reduced in effective- appear darker. Chromatic induction or contrast
ness and when neural processing differs from may also occur to alter perceived hues and satu-
that subsumed for people with normal color rations. These kinds of interactions make it
vision. These deviations are of three kinds: difficult or impossible to predict color appear-
anomalous trichromatic vision, dichromatic ances from measurements or specifications of
vision, and monochromatic vision. Their names only single focal stimuli; all stimuli in the field
derive from the minimum number of color of view must be considered and their interac-
stimuli required by an observer to match colors tions must be taken into account.
satisfactorily. A person with normal trichromatic The quantitative nature of these interactions
vision can match the hues of all stimuli with is not well known. Signal processing is both
additive combinations of three appropriately complex and nonlinear in its effect. For ex-
selected lights. Those with anomalous trichro- ample, although the visual mechanism can
matic vision also require three lights but make respond over a range of luminances (spectrally
COLOR 196

TABLE 1. EXAMPLE OF THE WAY IN WHICH


ATTRIBUTES OF COLOR PERCEPTION COULD BE
RELATED TO NEURAL SIGNALS OF THE
VISUAL MECHANISM.

Let:*
A =QuP + Q1Z'Y = w-bk

C1 =QZ1P -Qzz'Y = '-g

Attribute of
Color Perception Symbol Condition

Achromaticness A C1 =C2 =0, A > °


Chromaticness K (C 1 +C2»0
Saturation (relative Cl +Cz
chromaticness) S 0..; ..; 1
C1 +Cz +A
Brightness B C 1 +Cz +A>0
Lightness (relative C1 +Cz +A
brightness) L 0..; ..;1
(C 1 +C2+ A )w
Constant hue Hi CtfCz =constant
Unitary hues Hr C1 > 0, C2 =°
Hg C 1 < 0, C2 =°
Hy C1 =0, Cz >0
Hb C1 =0, C2 <°
·Where + stands for some unspecified form of combination
and w refers to a white reference.

weighted radiances) of about 10 12 , neural signals produced by valence-shell electronic excitations


are greatly compressed and nonlinearly related from higher levels back to the ground state. In
to input powers (a range of about 10 2 in the heated solids, the emission spectra of lights are
receptors and 10 1 in other neurons). This com- characteristic of the temperature of the material.
pression requires that. to be effective, the visual In gases, the emission spectra are characteristic
mechanism must adjust or adapt the sensitivities of the molecular or atomic structure of the
of its component processes to suit the level and material. In either case, the stimulus function
quality of stimulation provided it. The process 'I' (A) represents the radiance of the light source:
of adaptation permits us to see a "white" paper 'I'(A)=Le(A).
as white by moonlight or bright sunlight and by Light can also cause low-energy valence-shell
yellowish incandescent lamp or blue sky illumi- electronic excitations in atoms. Spectral selec-
nation. tivity is produced when m·olecules selectively
Color Stimuli Color stimuli have extents absorb certain frequencies of light because
(spatial and temporal), intensities, and qualities. of these electronic transitions between quantum
Spatial extent (size and location) can be speci- electronic states. Both organic and inorganic
fied by length and its derivative measures or by compounds absorb light frequencies by transi-
angular subtense and location in the visual field. tion of the molecule from the ground state to
Temporal extent can be specified by time and an excited state, but the physical phenomena
its derivative measures. Intensity and quality that cause the transition may differ in two cases.
are specified by spectral power or radiance In either instance, absorption of a particular
concentration. frequency of light by excitation of electrons
Normal color stimuli can be sources of direct whose energy difference corresponds to that
radiation (lights) or of indirect radiation (e.g., frequency removes light of that wavelength
reflecting or transmitting objects). Light can be from the spectrum of irradiation. Selective alter-
197 COLOR

ation of light can also occur by other physical average daylight. Over these ranges, the nominal
phenomena such as polarization and Rayleigh process of color matching is linear (stimulus
scattering and by fluoresence and photophores- radiances can be increased or decreased by any
cence. The light which is not absorbed (scattered positive factor and the match will continue)
or re-emitted) is reflected or transmitted. In and persistent (the match holds over changes in
these cases, the color stimulus function is desig- sensitivity occasioned by variations in chromatic
nated by the product of the irradiance function adaptation). Therefore, any set of color mixture
and the reflectance function ['Ir (")0..) =p (")o..)E(")o..») primaries that is a linear transform of some
or the transmittance function ['Ir(")o..) =7(")0..) • appropriate set will serve as a standard. The CIE
E (")0..»). In the general case, the product of the has selected a set for the CIE 1931 Standard
relative radiance of a medium with the irradiance Colorimetric Observer (for 2°) and a second set
of the light source defines the color stimulus for the CIE 1964 Supplementary Standard
function ['Ir (")0..) =f3e (")o..)E (")0..»). When L e (")0..) is Colorimetric Observer (for I 0°) and has speci-
expressed in W . m- 2 • sr- 1 or E(")o..) in W . m- 2 , fied the amounts required of them for the
then 'Ir(")o..) specifies both intensity and quality standard observer to match the colors of spec-
of the color stimulus. When those values are tral components of an equi-energy spectrum.
expressed relative to the corresponding value These amounts are symbolized as x(")o..), y(")o..),
for some standard of reference (e.g., the Lam- and z(")o..) [or XlO (")o..),YlO (")0..), ZlO(")o..) for the CIE
bert reflector or a specified thickness of air) 1964 observer) . The color mixture coefficients
then 'Ir(")o..) specifies only the quality of the at each wavelength (x~, y~, z~) represent the
spectral stimulus. tristimulus values of the components of the
Color Measurement Color measurements may equi-energy spectrum and are illustrated in Fig.
be classed in three broad categories: color 2. When these color mixture coefficients are
matches, color differences, and color appear- used, the condition of color match for two
ances. In all three, an atempt is made to relate stimuli 'Ir 1 (")0..) and 'Ir 2 (")0..) can be specified as:
'Ir(")o..) or its derivative expression to some invari-
ant criterion of color perception.
The criterion for color matching (upon which
colorimetry is based) is the invariance of per-
ceptual identities. The amounts of three refer- (4)
ence stimuli or primaries that are required to
produce the same color appearance as that of
some sample are specified as a color match by
color mixture. Because of the principle of uni-
variance combined with the fact that there are
but three kinds of cone receptors, a minimum of (5)
three primaries can be used to match the colors
of all stimuli, provided that the primaries are
perceptually independent (i.e., none of them
can be matched in color by any mixture of the
other two). Certain conventions have been (6)
adopted internationally to simplify the practice
of colorimetry and enhance communication.
The spatial extents of stimuli are restricted to
about 2° and 10° diameter in visual subtense
centered on the visual axis. Certain geometric
conditions are also specified for p(")o..)E(")o..) and I. 51---+---+--\-----+----\------+---
7 (")o..)E(")o..). In order to avoid dealing with the
nonlinear complexities of the visual mechanism, If)

only color matches (conditions of equal quan- ~


tum catches by the receptors) are addressed by §!
CIE conventions adopted in 1931. These color- ~ 1.0
imetric conventions lead to specifications of the ~
conditions for color matches in terms of the 115
amounts of standard primaries required by a ~
standard observer; they do not specify color 0.51--!-+---++-I-----,/--.J-+---\---+-------1
perceptions or differences among them. Color
matches for stimuli subtending 1_2° diameter
have been found to hold over a range of lumi-
nances from about 1-2 cd' m- 2 to around
1,000-2,000 cd' m- 2 and over changes in
chromatic adaptation among daylight sources Wavelength, nm
or Planckian radiators whose spectral radiance FIG. 2. CIE 1931 Standard ColorimetericObserver's
functions are not greatly different from that of color mixture functions.
COLOR 198

or similarly for the eIE 1964 observer [using


XlO(X), YlO(X), zlO(X)I. The limits of integra-
tion are usually 380 nm to 780 nm. The inte- o ~ r--....S30
grals of Eqs. (4 )-(6) are called tristimulus values
and symbolized X, Y, and Z. The scale factor k
IsIO
~
~
may have any convenient value but most often
is assigned one of two values: k = 680 lumens'
W-l (which yields tristimulus values that are °
consistent with luminance since y(X) = V(X),
the eIE 1924 luminous efficiency function) y
500
~
or k = 100 [jE(X).Y(X) d~-l (which yields 1\ ~~
°
tristimulus values consistent with luminance
factor).
.4 \
~ 610 610

IL~80nm
490\ 650
If W1 (X)=W2(X) (the spectral distributions
of both stimuli are identical), then Eqs. (4 )-(6)
will be satisfied for all observers under all illu- °.2 8~ V
minants; such matches are sometimes said to
480
V
41;~ k.('V
be isomeric. If Eqs. (4)-(6) are satisfied but
'II 1 (X) i= 'II 2 (X), then the match is a metameric
0
one that may not satisfy observers whose color o .10 0.2 0.4 0.6 0.8
mixture functions differ from those of the eIE x
standard observer, and the match may not hold
FIG. 3. CIE 1931 chromaticity diagram with co-
for the standard observer under a different ordinates y versus x. Locus of chromaticities corre-
illuminant. sponding to spectral color stimuli is shown by the
It is often convenient to plot the relative curve, along which wavelengths are indicated, and the
amounts of tristimulus values. They are called locus of non spectral mixtures of additive combina-
chromaticity coordinates and are derived from tions of spectral stimuli having wavelengths of 380 and
the tristimulus values as follows: 780 nm is shown by the line connecting the ends of
the spectrum locus.
X
x= (7)
X+Y+Z
Y
(8) 0 .9
y= X+Y+Z
Z 540
z= (9)
X+Y+Z

~~0 0
550
0.7
Since x + y + z = 1, only two are needed for a 505 560
complete specification of chromaticity; x and y
0.6
are conventionally used. When the chromaticity 570
500
coordinates of the components of the equi-
energy spectrum are plotted in a diagram of y 0.5 580
versus x, and the chromaticity coordinates for
the extremes of wavelength integration are
\J 590

joined together, the roughly "horseshoe" area


of Fig. 3 results. The diagram is called a chro- \) D 0 00
0
600

maticity diagram, and the area depicted con- 0 0 0


tains chromaticities corresponding to all color 0 700
stimuli. 0 0
When a normal trichromatic observer matches 0
the color of anyone color stimulus many times,
a distribution of chromaticities results represent- 0 .1
ing the variability of determination. Figure 4
illustrates bivariate normal ellipses representing
such variability for 25 different color stimuli; °0L---~~~~~--~
.OL4--~.0~5---.~0~6--~.0~7~~
.08
the ellipses are plotted as approximately ~ the X
size of the threshold of perceptibility. The el-
FIG. 4. Bivariate normal ellipses representing ~ of
lipses differ in size and orientation. If a thresh- the threshold variability of color matching for 25
old color difference is assumed to be percep- color stimuli. [Based on data reported by MacAdam,
tually equal everywhere in color space, then the D. L., "Visual sensitivities to color differences in day-
chromaticity diagram of Fig. 4 must be judged light," J. Opt. Soc. Am., 32, 247-274 (1942).)
199 COLOR

to be nonuniform. A uniform chromaticity dia-


gram would be one in which the distributions
of color matches are represented as circles of con-
stant diameter throughout the graph. Straight
lines in such a diagram would represent geo-
desics, lines depicting the relative sizes of per-
ceived color differences. The (constant) dis-
tances corresponding to threshold differences in
color might be taken to represent the additive
unit of such a color metric.
Many attempts have been made to derive such
a uniform chromaticity diagram or scale (UCS).
None have been completely successful. In 1976,
the CIE provisionally recommended two trans-
formations of CIE 1931 tristimu1us values as
approximations to a uniform color metric: CIE
1976 L*u*v* and CIE 1976 L*a*b* diagrams.
They are related to the tristimu1us values X, Y,
Z as follows:
x _
L*=116(Y/YW )1/3_16 (10)
FIG. 5. Loci of constant Munsell hue (approximately
u* = 13L*(u' - u'w) (11) radial lines) and chroma (approximately concentric
circles) in a elE 1931 chromaticity diagram for
v* = l3L*(v' - v'w) (12) samples having luminance factors of 0.1977 (a Munsell
value of 5).
where
u' = (4X) (X + 15 Y + 3Z)-1
v'=(9Y)(X+ l5Y+3Z)-1 standards, that attempts to array color stimuli
according to some plan. The Optical Society of
and America uniform color order system arrays
color stimuli approximately according to geo-
L* = l16(Y/Yw )1/3 - 16 (13) desics of suprathreshold color differences. The
Swedish natural color system arrays color stim-
a* = 500 [(X/X w )1/3 - (YjY w)1/3] (14) uli according to the magnitudes of their relative
hues, saturations, and lightnesses. The Munsell
b*=200[(Y/Yw )1/3 - (Z/Zw)I/3]. (15) notation arrays samples according to a combi-
nation of their hues, chromas (a form of rela-
The subscript w refers to a "white" achromatic tive chromatic content), and values (a function
reference stimulus. The coordinates u', v' form of lightness), and their differences in color.
a supplementary chromaticity diagram since Figure 5 illustrates the chromaticities of the
they can be defined for a plane of constant Munsell notation array for color stimuli having
luminance; the L*a*b* metric does not have luminance factors of 0.1977. The approximately
planes of constant luminance and, therefore, radial curves represent constant hues, and the
has no associated chromaticity diagram. Color approximately concentric circles correspond to
differences (constrained to small or near-thresh- constant chromas. These relationships permit
old in size) are specified in the two color spaces specification of differences or magnitudes in
of Eqs. (10)-(15) as the Euclidean distances: terms of CIE tristimulus values. However, all
current systems for specifying near-threshold
tlEuv = [(tlL*)2 + (tlU*)2 + (tlV*)2] 1/2 and suprathreshold color differences are inexact
to various degrees and, as with conventions for
(16) specifying color matches, they should be con-
strained to certain conditions to be useful.
and
C. J. BARTLESON
tlEab = [(tlL *)2 + (tla*)2 + (tlb *)2]1/2.
(17) References
CIE has not yet recommended a method for Billmeyer, F. W., Jr., and Saltzman, M., "Principles of
specifying large differences in color or magni- Color Technology," 2nd Ed., New York, Intersci-
tudes of color appearances. In practice, any of a ence, 1981.
number of color order systems are used for such Boynton, R. M., "Human Color Vision," New York,
specification. A color order system is a system- Holt, Rinehart and Winston, 1979. ,
atic structure, usually represented by material CIE [Commission Internationale de l'Eclairage],
COLOR 200

','Colorimetry ," CIE Publication No. 15, Bureau vacancy with a trapped electron for charge
Central de la CIE, Paris, 1971. compensation. Both the "particle-in-the-box"
Grum, F., and Bartleson, C. 1. (Eds.), "Optical Radia- and "hydrogenic" models can be used as crude
tion Measurements, Vol. 2. Color Measurement," approximations to determine the energy levels
New York, Academic Press, 1980. for the F center. The former model predicts
Grum, F., and Bartleson, C. J. (Eds.), "Optical Radia- that the absorption energy of F centers is in-
tion Measurements, Vol. 5. Visual Measurement," versely proportional to the square of the lattice
New York, Academic Press, 1984. constant (potential well size). This approxima-
Hunt, R. W. G., "The Reproduction of Colour," 3rd tion is surprisingly accurate in predicting the
Ed., London, Fountain Press, 1975. absorption energies of F centers in alkali halides
Hurvich, L. M., "Color Vision," Sunderland, Mass., and is helpful for other materials such as MgO,
Sinauer Associates, 1981. MgF 2 , and CaF 2 . However, much more sophis-
Jameson, D., and Hurvich, L. M. (Eds.), "Handbook ticated theoretical treatments are necessary to
of Sensory Physiology, Vol. VII/4. Visual Psycho- understand the width of the absorption bands,
physics," Berlin, Springer-Verlag, 1972. the temperature dependence of these bands
Judd, D. B., and Wyszecki, G., "Color in Business, and the lifetimes of the transitions. Table I
Science and Industry," 3rd Ed., New York, John lists the energy for the maximum of the F-center
Wiley & Sons, 1975. absorption in a number of materials.
Wyszecki, G., and Stiles, W. G., "Color Science," 2nd Although the negative ion vacancy is a "good"
Ed., New York, John Wiley & Sons, 1982. electron trap only about 80% of the electron
charge is centered within the vacancy. This
Cross-references: LENS; LIGHT; OPTICS, GEO· means that the electron spends most of its
METRICAL; OPTICS, PHYSICAL; PHOTOMETRY; time in the potential well interacting with next
PHOTON; REFLECTION; REFRACTION. neighbor positive ions, but for a significant
amount of time the electron interacts with
other neighbors as well. Since the motion of the
surrounding ions continually changes the di-
COLOR CENTERS mensions of the potential well for the electron,
the F band absorption is very broad. Of course,
Color center is a gen'eric term first coined to as the temperature of a sample is lowered and
characterize the entity responsible for the col- the ion motion is decreased the width of the
ored appearance of alkali halide crystals after absorption band is also decreased. At very low
exposure to x-rays. Today this term is broadly temperature (less than - 200°C) ion motion is
used to describe the microscopic defects that almost entirely the result of zero-point vibra-
are responsible for the optical property changes tions and the width of the absorption remains
throughout the ultraviolet, visible, and infra- constant.
red regions of the spectrum that occur in ir-
radiated or chemically treated materials. Al-
though impurities may associate with color TABLE 1. COLOR CENTER
centers to produce optical absorption or ABSORPTION IN SELECTED
emission bands, color centers are generally MATERIALS.
defined as vacancies, interstitials, or clusters
of these types of defects. These defects can be Wavelength of the
produced in transparent insulator-type crys- Maximum of the
tals. Recent studies have emphasized both Absorption Bands (nm)
oxide and fluoride materials, but the great at 300K
wealth of experimental information and tradi-
tion still resides within the alkali halide crystal F F2 (M) Fa (R)
series. For a number of years it was thought that
color center research would only yield proto- LiF 250 447 310
type information for more complex materials 380
with application potential. In the last few years, NaF 340 499 450
however, it has been acknowledged that color KO 560 822 680
center research is important in such diverse 740
applied projects as nuclear waste disposal and KI 660 1010 810
tunable infrared F-center lasers. 905
The F center is the best characterized color RbBr 680 957 805
center. This center is formed when a negative 860
ion is displaced from a lattice site into an inter- MgF2 260 322 300
stitial position in the lattice or to the surface 355
of the crystal. The displacement of this negative 370
ion results in a potential well formed by the RbMgFa 290 387 300
next neighbor positive ions that surround the 340
negative ion site. This potential well traps an KMgF3 270 446 396
electron and forms the F center-a negative ion
201 COLOR CENTERS

When an F center absorbs a photon of the ap- MgO, the F center consists of two electrons
propriate energy the electron is promoted to an trapped at the negative ion vacancy (charge
excited state. For device applications how this compensation is again essentially complete).
electron returns to the ground state is most When charge compensation is not complete,
important. For example, the electron can return e.g., one electron trapped at a divalent anion
to the ground state by interacting with neigh- site, the center is referred to as an F+ center,
boring ions to give up its excess energy as heat indicating a net positive charge at this site.
to the crystal lattice. This is referred to as a F centers may also be trapped by impurities.
nonradiative transition and can be highly When this occurs the notation FA center is
temperature dependent. The electron can also utilized.
pass to the ground state through the emission 2. F-center aggregates. In many instances F
of a photon (luminescence). This emission is centers aggregate to form clusters. The simplest
especially important for phosphor applications such cluster is the F2 or M center, which con-
or for laser action. The emission band is broad sists of two F centers in neighboring positions.
for the same reason that the absorption is broad. This center resembles a hydrogen molecule and
Since in every instance some energy is shared its optical properties have been approximated
with the lattice neighbors while the electron is using such a model. When the two negative ion
in the excited state, there is an energy difference vacancies share only one electron the center is
between the absorption and the emission. This referred to as an F 2+ center. This center can be
is known as the Stokes shift. The emission for approximated as an H2 + molecule. Because of
most alkali halide F centers occurs in the near the stability and optical properties of these
infrared where tunable lasers are sorely needed. centers, a number of infrared laser system using
Such lasers have been constructed using F cen- them have been constructed. When three F
ters and impurity perturbed F-cluster centers. centers occupy next neighbor positions the
Long-term stability at room temperature is complex is given the notation F 3 or R center.
the most serious deficiency of these small As with the F2 centers, because of the planar
solid state lasers. nature of these defects nonequivalent dipoles
In some materials the excited state of the F exist and more than one absorption band is
center is close to the conduction band in energy. associated with these defects.
Thus, at higher temperatures it is possible under 3. Hole traps. In addition to electron centers,
light excitation for the excited state electron certain defects can trap holes and give rise to
to move into the conduction band. This results optical absorption and emission. The two best
in ionization of the F center, and photoconduc- studied such centers are the X2 - or VK center
tivity is observed. The last method of deexcita- and the so-called H center. The X2- center is
tion of the excited electron which will be con- formed in highly ionic crystals such as the
sidered is energy transfer. It is possible when alkali halides and alkaline earth flourides. This
other defects with similar energy levels are defect has not been detected generally in oxide
nearby for energy to be transferred from one materials. The VK center consists of a hole
defect to another. This mechanism has been trapped between two negative halide ions. It
most thoroughly studied for impurities, but may be pictured as a X2 - molecule that occupies
pertains to color centers as well when the con- two normal halide sites in a crystal in which X-
centration of such centers is high. Clearly, the denotes the negative ion halide. The H center
understanding of optical properties of materials consists of a hole that is shared by four nega-
is necessary for the development of practical tive ion halide ions occupying three normal
applications. Because of the ease of growth halide ion lattice sites in a crystal. The H center
and preparation of alkali halide crystals and the forms an interstitial defect. Since the halide is
atomic simplicity of these materials, numerous interstitial, a negative-ion vacancy must exist
experimental and theoretical advances have elsewhere in the crystal. H centers are stable only
occurred through the study of color centers. at low temperature. As the temperature of a
The great number of high technology tech- material is increased, the H centers aggregate
niques which have been developed for detailed to form interstitial clusters or dislocation
investigation of color center phenomena are loops.
now being applied to more complex materials There are a number of ways to produce color
such as quartz. centers. The most generally used methods in-
Although the F center is the best known and volve irradiation with x-rays, electrons, protons,
understood color center, a number of defects or neutrons ; the use of chemical heat treatment;
have been observed and characterized. A partial or the passage of a dc electrical current through
list of these defects is given below. Table I the sample while it is at a high temperature.
lists the absorption energy for these defects in The latter two treatments generate mostly
several types of crystals: F-type centers without introducing intersti-
1. The F center. In a monovalent anion ma- tials. When crystals are irradiated both inter-
terial such as KCI or MgF2 the F center consists stitials and vacancies are formed. In highly
of an electron trapped at a negative ion vacancy ionic materials photochemical production of
(charge compensation is essentially complete). these defects occurs during irradiation. Vacancy-
In the case of a divalent anion material such as interstitial pairs are produced with as little as
COLOR CENTERS 202

20 eV energy expenditure. The production of in "Laser Advances," (B. S. Wherrett, Ed.) p. 39,
electron-hole pairs in the crystal is sufficient to New York, John Wiley & Sons, 1980.
form X2 - centers which then degenerate into
F centers and H centers. In the photochemical Other Articles
process one 1 MeV electron can produce as Kabler, M. N. and Williams, R. T., Phys. Rev. B18,
many as 10,000 F centers. For those materials 1948 (1978).
in which X2 - centers are not easily formed, Catlow, C. R. A., Diller, K. M., and Hobbs, L. W.,
such as oxide crystals, this type of damage Phil. Mag. 42,123 (1980).
mechanism is negligible. Instead the impinging
radiation has the same effect as in metal crys- Cross-references: EXCITONS; LATTICE DEFECTS;
tals. Interstitial-vacancy pairs are produced LUMINESCENCE; PHOTOCONDUCTIVITY; RADI-
when the impinging particle has sufficient ATION, IONIZING, BASIC INTERACTIONS.
energy to displace an ion into an interstitial
position. In MgO the required energy is about
60 eV so that electrons with a minimum energy COMPRESSIBILITY, GAS
of 330 KeV are required to produce ionic
damage. These materials are much more diffi- (a) The compressibility of a gas is defined as
cult to damage by irradiation than are the more the rate of volume decrease with increasing
ionic halides. Typically about one stable F cen- pressure, per unit volume of the gas. The com-
ter is produced for every ten I-MeV electrons. pressibility depends not only on the state of the
The thermal, electrical, mechanical, and gas, but also on the conditions under which the
optical properties of most crystals are changed compression is achieved. Thus, if the tempera-
considerably when color centers are present. ture is kept constant during compression, the
Only a few parts per million of interstitial type compressibility so defined is called the isother-
color centers can change the hardness of a ma- mal compressibility {3T:
terial appreciably. Because of these changes, the
study of color centers in those materials with
potential for practical applications is particularly
I (3V)
{3T =- V 3P T =p' 3P T'
I (3 P) (1)
important. Oxide materials such as MgO, Al2 0 3 ,
and quartz have been studied as have most If the compression is carried out reversibly
flouride systems. New work is progressing on without heat exchange with the surroundings,
wide band gap semiconductors and even more the adiabatic compressibility at constant en-
complicated mixed crystals. tropy, {3s, is obtained:

i(~;)s = ;(:~t·
W. A. SIBLEY (2)
{3s=-
References
The two compressibilities are related by
Books
Schubnan, J. H. and Compton, W. D., "Color Centers {3S/{3T = Cv/Cp. (3)
in solids," London, Pergamon Press, 1962.
Fowler, W. B. (Ed.), "Physics of Color Centers," Here P is the pressure, V the volume, p the
New York, Academic Press, 1968. density, T the temperature, S the entropy, and
Markham, J. 1., "F-Centers in Alkali Halides," Supple- C v, Cp specific heats at constant volume and
ment No.8, Solid State Physics, (F. Seitz and D. pressure, respectively.
Turnbull, Eds.), New York, Academic Press, 1966. The compressibility factor of a gas is the ratio
Crawford, J. H. and Slifkin, L. M. (Eds.), "Point De- PV/RT (cf. GAS LAWS). This name is not well
fects in Solids," Vol. 1, p. 201, New York, Plenum chosen since the value of the compressibility
Press, 1972. factor is no indication of the compressibility.
Henderson, B., and Wertz, J. E., "Defects in the Alka- (b) The experimental behavior of the com-
line Earth Oxides," London, Taylor and Francis pressibility as a function of pressure and tem-
Ltd., 1977. perature is as follows: dilute gases obey the
Stoneham, A. M., "Theory of Defects in Solids," laws of Boyle and Gay-Lussac, PV=R T, to a
Oxford, Clarendon Press, 1975. good approximation. The compressibilities {3T
and {3s of a dilute gas are then given by
Review Articles
Sibley, W. A., and Pooley, D., "Radiation Studies of (3 -!.{3 _(Cv/Cp) (4)
Materials Using Color Centers," in "Treatise on Ma- T - p' S - P .
terials Science and Technology," Vol. 5, (H. Herman,
Ed.), p. 45. New York, Academic Press, 1974. Compressed gases, however, show large devi-
MoUenauer, L. F., "Color Center Lasers," in "Methods ations from the behavior predicted by Eq. (4).
of Experimental Physics," Vol. 15B, (E. L. Tang, Ed.) This is demonstrated in Fig. I, where the iso-
p. 1, New York, Academic Press, 1979. thermal compressibility of argon, divided by the
Litfin, G. and Welling, H., "Color Center Lasers," corresponding value for a perfect gas at the
203 COMPRESSIBILITY, GAS

100 force of impact increase as well, giving rise to


an extra increase of pressure. Therefore ~S <
~T' The actual magnitude of the temperature
rise depends on the internal state of the mole-
cules; the more internal degrees of freedom
available, the more energy can be taken up
inside the molecule and the smaller the temper-
ature rise on adiabatic compression. If molecules
have many internal degrees of freedom, the
difference between ~T and f3s, just like the dif-
ference between Cp and C v , becomes small.
Molecular theory also shows that if the mole-
cules have negligible volume and do not interact
with each other, the gas follows the perfect gas
laws of Boyle and Gay-Lussac so that Eq. (4)
holds for the compressibilities f3T and f3s. How-
ever, in real gases the molecular volume is not
negligible. Consequently, in states of high com-
pression little free space is left to the molecules
and thus the real gas and the liquid have low
compressibilities as compared to the perfect
gas (Fig. I). On the other hand, the mutual
attraction molecules experience as they ap-
proach each other makes a real gas easier to
compress than a perfect gas. This explains the
initial rise of f3T/f3 p erf. at temperatures not too
Ol+-------------~-------------+----
o far above critical.
(d) Experimental values for the compressi-
DENSITY bility could be obtained in principle by measur-
ing the pressure increase on a small volume
FIG. 1. The ratio (3/(3perf. of the isothermal com- decrement. In practice a measurable pressure
pressibility of argon to that of a perfect gas at the increase is obtained only in regions of low com-
same density, as a function of the density, at 0.91, pressibility, i.e., in the dense gas. Usually, com-
1.02, and 1.08 times the critical temperature. The
critical density is indicated by Pc. pressibilities are determined in indirect ways.
Thus, the isothermal compressibility, being pro-
portional to (3 V/3P)T, can be deduced from
same density, Eq. (4), is pictured as a func- experimental PVT data if these data are suf-
tion of density for various temperatures. At ficiently accurate and densely spaced. For ob-
all temperatures the compressibility at high taining the adiabatic compressibility from PVT
densities falls to a small fraction of the value data, some additional information is needed,
for a perfect gas. As the critical temperature is for instance SPECIFIC HEAT data in the per-
approached from above, a large maximum occurs fect gas limit. For reviews of experimental
in ~T at densities near critical. The isothermal methods for determining PVT relations and
compressibility diverges strongly at the critical deriving thermodynamic properties see Refs.
point (just like Cp, see Ref. ld) and is infinite la, b.
everywhere in the two-phase region, where the The adiabatic compressibility is readily ob-
pressure does not rise on isothermal compres- tained from speed-of-sound data through the
relation
sion. The adiabatic compressibility, however,
diverges only weakly (like Cv, see Ref. ld) (5)
when the critical point is approached, and it is
finite in the two-phase region.
(c) Simple notions taken from molecular Eq. (5) is valid only when compressions and
theory can be used to explain the general expansions of the sound wave are truly revers-
features of the compressibility in its tempera- ible and adiabatic, i.e., if the frequency is low
ture and density dependence. The pressure of and the amplitude small. Experimental tech-
the gas is caused by the impact of the molecules niques for determining the speed of sound are
on the wall. If the volume is decreased at con- discussed in Ref. 1c.
stant temperature, the average molecular speed The isothermal compressibility, through the
and force of impact remain constant, but the fluctuation theorem of statistical mechanics, is
number of collisions per unit area increases and reJated to the density fluctuations in the sys-
thus the pressure rises. If the gas is compressed tem. 2 ,3 These density fluctuations are respon-
adiabatically, the heat of compression cannot sible for the scattering of light. Thus, the iso-
flow off, thus the average molecular speed and thermal compressibility, in principle, could be
COMPRESSIBILITY, GAS 204

directly obtained from the intensity of scat- Many semiempirical equations of state with
tered light. 2,3 In gases, sufficient intensity is varying degrees of theoretical foundation have
obtained only in regions of large density fluctu- been developed and can be used with Eq. (I)
ations, i.e., near the critical point. Experimental to calculate the compressibility. Van der Waals'
methods and results are discussed in Refs. equation, a two-parameter equation which gives
Id,3. a qualitative picture of the PVT relation of a
(e) Theoretical predictions for the isothermal gas and of the gas-liquid transition, is an ex-
compressibility are obtained from evaluation ample. For surveys of useful semiempirical
of the statistical-mechanical partition function equations see Refs. I b, 2 and 4.
or the radial distribution function. In either
case, a model for the intermolecular potential J. M. H. LEVELT SENGERS
is required. An approximate calculation of the
partition function valid at low densities leads to References
the virial expansion;2 this expansion expresses
PV/RT in a power series in density, the coef- 1. Vodar, B., and Le Neindre, B. (Eds.), "Experi-
ficients being related to the interactions of two, mental Thermodynamics" (I.U.P.A.C.): Volume
three, etc. particles; the compressibility then II: "Experimental Thermodynamics of Non-
follows straightforwardly. Reacting Fluids," New York, Plenum Press; Lon-
The virial expansion is not useful for dense don, Butterworths, 1973/74. The following
gases, because convergence is slow and higher articles are particularly pertinent.
virials are hard to calculate. For dense systems, a. Trappeniers, N. J., and Wassenaar, T., "PVT
methods have been developed for evaluation of relationships in gases at high pressure and moder-
the radial distribution function g(r), which is ate or low temperatures."
the ratio of the average density of molecules at b. McCarty, R. D., "Determination of thermo-
a distance r from any given molecule to the dynamic properties from the experimental
average density in the gas. The compressibility PVT relationship."
is related to g(r) by the fluctuation theorem c. Van Dael, W., "Measurement of the velocity of
sound and its relation to the other thermo-
dynamic properties."
kT~T=I/p+J[g(r)-I] 41Tr 2 dr (6) d. Levelt Sengers, J. M. H., "Thermodynamical
properties near the critical state."
2. Hirschfelder, J. 0., Curtiss, C. G., and Bird, R. B.,
Integral equations for g(r), relating it to the "Molecular Theory of Gases and Liquids," New
molecular potential, its derivatives, and higher- York, John Wiley & Sons, 1964.
order distribution functions, have been de- 3. McIntyre, D. M., and Sengers, J. V., "Studies of
veloped. They can be solved after approxima- fluids by light scattering," in "Physics of Simple
tions about the form of these higher-order Liquids," (H. N. V. Temperley, J. S. Rowlinson,
distribution functions are made. These solutions and G. S. Rushbrooke, eds.), Ch. 11, p. 449,
are hard to obtain for any but spherical inter- Amsterdam, North Holland Publishing Co., 1968.
actions, and invariably suffer from internal 4. Boublik, T., Nezbeda, I., and ffiavaty, "Statistical
inconsistency. For systems of hard spheres, Thermodynamics of Simple Liquids and Their
good solutions are available that have been Mixtures," Amsterdam, Elsevier Pub!. Co., 1980.
verified by comparison with results from com-
puter simulation. Cross-references: DENSITY AND SPECIFIC GRAV-
The most promising route toward predicting ITY, GAS LAWS, KINETIC THEORY.
the equation of state of dense fluids of non-
spherical molecules has been the approximate
evaluation 'of the partition function by the so-
called perturbation method. Here it is assumed
that the partition function is known for a refer- COMPTON EFFECT
ence system, which may be a system of hard Introduction The Compton effect refers to the
spheres. The departure of the real interaction collision of a photon and a free electron in
from that in the reference system is then treated which the electron recoils and a photon of
as a perturbation in a high-temperature expan- longer wavelength is emitted as indicated in
sion of the partition function. Considerable Fig. I. It is one of the most important processes
progress has been made in recent years with the by which x-rays and ')'-rays interact with matter
prediction of properties of systems of molecules and is also one which is accurately calculable
with nonspherical hard cores, "soft" repulsions, theoretically.
dipolar and quadrupolar interactions, etc. A discussion of the effect is found in most
Validation of the results of the perturbation textbooks on atomic physics. Particularly com-
method is increasingly performed by compari- plete presentations have been made by Evans. 1 ,2
son with the results of simulation of molecular History Barkla and others ( 1908) made many
systems on high-speed computers. Ref. 4 observations on the scattering of x-rays by dif-
gives a complete survey of and references to the ferent materials. The diffuse scattering was in-
more successful integral equation and perturba- terpreted qualitatively by J. J. Thomson in
tion methods. terms of the interaction of electromagnetic
205 COMPTON EFFECT

waves with electrons which he had shown to be


a constituent of all atoms. As more experiments
were carried out with light elements, it was
established by J. A. Gray (1920) that the dif-
fusely scattered x-rays were less penetrating.
This implied that the scattered radiation had a
longer wavelength than the incident radiation.
This could not be reconciled with Thomson's
theory which represented x-rays as continuous
electromagnetic waves with wavelengths un-
changed by scattering. THIN
The effect which now bears his name was SCATTERER
established quantitatively by Arthur Holly
Compton (1923) when he published careful ~.

spectroscopic measurements of x-rays scattered c


at various angles by light elements. He found
that x-rays scattered at larger angles had system-
atically larger wavelengths. In searching for an FIG. 1. Diagram showing the initial and fmal ener-
explanation of the data, he discovered that the gies and momenta for Compton scattering.
observations were accounted for by considering
the scattering as a collision between a single
photon and a single electron in which energy
and momentum are conserved. where e is the velocity of light, h is Planck's
The important place which the effect occupies constant, and the angles are those indicated in
in the development of physics lies in his inter- Fig. 1. The relativistic relation between the
pretation of the effect in terms of the newly kinetic energy T of the recoiling electron and
emerging quantum theory. The essential duality its momentum p is
of waves and particles was demonstrated in an
especially clear way, since the collision con- pc = .jT(T+ 2me 2 ) (4)
served energy and momentum while both the
incident and scattered x-rays revealed wave-like where m is the mass of the electron. These
properties by their scattering from a crystal. In equations can be combined to obtain relations
recognition for this contribution, Compton was which are useful in the interpretation of data.
awarded the Nobel Prize in 1927. The Compton shift is
A complete theory for the effect was worked
out in 1928 by Klein and Nishina using Dirac's , e e h
relativistic theory of the electron. The calcula- X - X = ~ - -; = me (1 - cos 8) (5)
tion was one of the brilliant successes of the
Dirac theory. It represents quantitatively, within This relation was first found experimentally by
the experimental uncertainties, all phenomena Compton who noted that the shift in wave-
associated with the scattering of photons by length (X' - X) depended on the angle, but not
electrons for energies up to several billion elec- on the wavelength, of the incident photon. The
tron volts. Because of the confidence with quantity (h/mc), which is the shift at 90°, is
which photon interaction with electrons can be called the Compton wavelength of the electron
interpreted, the Compton effect has been im- and is one of the useful constants (2.4262 X
portant in the analysis of the energy and the 10- 10 cm).
polarization of gamma rays from many sources.
Kinematics The relations between the ener-
gies and directions of the incident and scattered hv' = (6)
me 2
photons and the recoil electron are determined 1 - cos 8 +--
by the conservation of energy and of the com- hv
ponents of momentum parallel and at right
angles to the incident beam. In the usual case, In this form, the energy of the scattered photon
where the electron is initially at rest and the is seen to vary from that of the incident photon
energy and momentum of the incident photon at 0° to less than (me 2 /2) at 180°. At high en-
are hv and (hv/e), the equations are: ergies the angle 8 for which hv' is (hv/2) is ap-
proximately 2(me 2 /hv) radians.
hv = hv' + T (I)
The kinetic energy of the recoiling electron is

T = hv(l - cos 8)
hv hv' (7)
- = - cos 8 + P cos I/> (2) me 2
e e (I - cos 8) + hv
O=-Sln
hv' . 8 -PSln'l'
. '" (3) The relation between the scattering angles of
e the electron and photon is
COMPTON EFFECT 206

ALUMINUM

0·01
FIG. 2. Differential cross section for photons scat- 6
tered at angles, e, for a number of incident energies. ,,

,,
0 .00 II.-.J........L...J......L.......u.......J....-'--.L........LL......L..-,--.l....--'-------">.-,---/
0 .0 1 2 • 6 0 .1 2 4 6 I Z 4106 2 4 6 100

cot I/> = (I + l!.!:.) (I -.smcose e)


mc 2
(8)
PHOTON ENERGY MeV

FIG. 3. The attenuation coefficients, Ct, for the ab-


sorption of photons in aluminum as a function of
Graphs of these kinematic relations and of the energy. The broken lines represent the separate con-
scattering cross section are given by Evans(2) tributions of the photoelectric effect, the Compton
and by Nelms(3). effect, and pair production to the absorption.
Scattering of Unpolarized Radiation The dif-
ferential cross section for the scattering of un-
e
polarized radiation at an angle is given by the where T varies from 0 to Tmax = 2(hv)2/(2hv +
Klein and Nishina equation. mc 2 ). A number of these distributions are
shown in Fig. 4.
Scattering of Plane Polarized Radiation The
dar 2 (VI)2 (v
dr1. = ~ v ;; + vv' - sin2 e) (9) differential cross section for the scattering of
plane polarized radiation by unoriented elec-
trons was also derived by Klein and Nishina. It
where ro is the electron radius = e2 /mc 2 = represents the probability that a photon, pass-
2.8177 X 10- 13 cm, and v'is obtained from ing through a target containing one electron per
Eq. (6). The cross section is shown as a function square centimeter, will be scattered at an angle
e
of for several energies in Fig. 2. The classical e into a solid angle dr1. in a plane making an
Thomson cross section r02 (1 + cos 2e)/2 can be angle 71 with respect to the plane containing the
seen to hold for low energies where v' "'" v. electric vector of the incident wave.
The total cross section obtained by integrat-
ing this cross section over angle is important in
the attenuation of well-defined beams in pass- da = r02
ing through a material. The relative importance dr1. 2 v
(~)2 (l: + ~ _
v' v
2 sin2 e cos2 71) (10)
of Compton scattering as compared to the pho-
toelectric effect and pair production is illus- 1.0
trated for aluminum in Fig. 3 where the attenu- z I NC IOEN T PHOTON ENERGY. h. MeV
ation coefficient a is shown as a function of
/
0

energy. The fraction of the photons surviving '"...0-v 0.8 0 .5


without an interaction upon passing through ..J
1.0
x g/cm 2 of aluminum is e _Ctx. The Compton ef-
W
, 0.6
;;;
,..,
fect is the major one between O.S and 2 MeV. ~ ci
Extensive tables and graphs for other elements E u 0.4
are available. 2,4
I
t--
':'
In detectors whose response is proportional S!
to the energy deposited by the recoil electrons,
.
~ 0 .2 3.5
the distribution of electron energies associated • ~

with a photon of known energy is of interest: a '",.;


The distribution is given by the relation a 0 .5 1.5 2.5 .3 3~

RECOIL ELECTRON ENERG Y T INMeV

da _ rrr02mc 2
dT - (hV)2
{2 + (~)2
hv - T
FIG.4. The energy distribution of the Compton
recoil electrons for several values of the incident
photon energy hI!. Based on figure in "Compton
[ (mC~)2 + hv - T _ 2mc 2(hv - T)]} Effect" by R. D. Evans in "Handbuch der Physik,"
Vol. XXXIV, pp. 234-298, 1958, J. Fluge, Ed., by
(hv)2 hv hvT permission of Springer-Verlag, publishers. 2
207 COMPTON EFFECT

+ 1
The cross section has its maximum value for
T/ = 90°, indicating that the photon and elec-
tron tend to be scattered at right angles to the +06
electric vector of the incident radiation.
This dependence is the basis of several instru-
ments for determining the polarization of pho- ~ +02
tons. For example, it was used by Wu and Shak- .....
nov s to establish the crossed polarization of the 6"
""0

two photons emitted upon the annihilation of a


positron electron pair; by Metzger and Deutsch6 ...
<I
a:
to measure the polarization of nuclear gamma
rays; and by Motz? to study the polarization of -0.6
bremsstrahlung.
Scattering of Circularly Polarized Radiation
- I
The scattering of circularly polarized photons o' 30' 60' 90' 120' ISO' 180'
by electrons with spins aligned in the direction SCATTERING ANGLE. 8
of the incident photon is represented by
FIG. 5. The ratio of the partial cross section depen-
do
dn
=r
0
2 (~)2
v
[(~v' + ~v _ sin2 e) dent on the spin orientation of the electrons to the
average cross section as represented by the second and
first terms of Eq. (10). Based on figure in "Compton

±(~ - ~) cos e] Effect" by R. D. Evans in Handbuch der Physik, Vol.


(11 ) XXXIV, pp. 234-298, 1958, J. Fluge, Ed., by per-
mission of Springer-Verlag, publishers. 2
The first term is the usual Klein-Nishina for-
mula for unpoJarized radiation. The + sign for
the additio.1al term applies to right circularly per cent, this is often sufficient to get accurate
polarized photons. The ratio of the second term and reliable measurements of circular
to the first is a measure of the sensitivity of the polarization.
scattering as a detector of circularly polarized Cross sections for some practical arrange-
radiation and is shown in Fig. 5. ments and discussions of earlier work are pre-
In practice, the only source of polarized elec- sented by Tolhoek. 8 Applications to the deter-
trons has been magnetized iron where 2 of the mination of the helicities of photons, electrons,
26 electron spins can be reversed upon changing and neutrinos in confirming the two-component
its magnetization. Although the change in the theory of the neutrino are reviewed in consider-
absorption or scattering is usually only a few able detail by L. Grodzins. 9

1.5
"0-' ""'" IIIj
~
1.0
,I!Illil
C;
I
!I !
'l:l
......
b
'l:l
05

I
I

o 200 400 600 BOO 1000


INCIDENT GAMMA RAY L AB ENERGY (MeV)

FIG. 6. The differential cross section for the scattering of high-energy


photons by protons at 90° in the center of mass system. Based on
figure in Steining, Loh and Deutsch, Phys. Rev. Letters, 10, 536
(1963).1 0
COMPTON EFFECT 208

Scattering from Moving Electrons Compton 3. Nelms, A. T., "Graphs of the Compton Energy-
backscattering of low-energy photons by high- Angle Relationship and the Klein-Nishina Formula
energy electrons can produce photons with from 10 keV to 500 MeV," Natl. Bur. Std. Cire.,
energies comparable with those of the electrons. 542 (1953).
Milburn 10 and Arutyunian et a1. 11 pointed out 4. White, G. R., "X-Ray Attenuation Coefficients
that backward scattering of an intense polarized form 10 keV to 100 MeV," Nat!. Bur. Std. Rept.,
laser beam by a beam of high energy electrons 1003 (1952).
could produce a useful beam of monoenergetic 5. Wu, C. S., and Shaknov, I., "Angular Correlation
photons of intermediate energy. The maximum of Scattered Annihilation Radiation," Phys. Rev.,
energy of the backscattered photons is given by 77,136 (1950).
6. Metzger, F., and Deutsch, M., Phys. Rev., 78, 551
, 4 (E/mc 2 )2 (1950).
hI! = hI! (12) 7. Motz, J. W., "Bremsstrahlung Polarization Mea-
1 + 4hI!E/(mc 2 )2
surements for 1 MeV Electrons," Phys. Rev., 104,
where E is the total energy of the electron. For 557 (1956).
photons from a ruby laser (A = 6943 A, hI! = 8. Tolhoek, H. A., "Electron Polarization Theory
1.79 eV) the photons backscattered from 6-GeV and Experiment," Rev. Mod. Phys., 28, 277
electrons will have energies of 848 MeV. Photons (1956).
from a ruby laser interacting with the electron 9. Grodzins, L., "Measurement of Helicity," in
beam at the Stanford Linear Accelerator Center Frisch, O. R., Ed., "Progress in Nuclear Physics,"
produced monoenergetic polarized photons of New York. Pergamon Press, 1959.
1.44, 2.8, and 4.7 GeV for the study of photon 10. Milburn, R. H.,Phys. Rev. Letters 10,75 (1963).
interactions in a hydrogen bubble chamber.12 11. Arutyunian, F. R., and Tumanian, V. A., Phys.
Higher-intensity facilities for producing inter- Letters 4,176 (1963).
mediate-energy photons for photonuclear re- 12. Ballam, J., et aI., Phys. Rev. Letters 23, 498
search has been proposed by Italian and U.S. (1969).
groups.14 These facilities would use photons 13. Federici, L., et aI., Lettere 01 Nuovo Cimento, 27,
from high-intensity lasers interacting with 1.5 339 (1980).
GeV and 2.5 GeV electron beams in storage 14. Sandorfi, A. M!) et aI., IEEE NS 30, 3083 (1983).
rings. 15. Steining, R. F., Loh, E., and Deutsch, M., "The
Proton and Deuteron Compton Effect Par- Elastic Scattering of Gamma Rays by Protons,"
ticle-like scattering of high-energy photons by Phys. Rev. Letters, 10,536 (1963).
protons and deuterons has been observed and 16. Jones, R. S., Gerber, H. J., Hanson, A. 0., and
has been referred to as the proton and deuteron Wattenberg, A., "Deuteron Compton Effect,"
Compton effect. The kinematic equations are Phys. Rev., 128, 1357 (1962).
identical to those for electrons except that the 17. Buschhorn, G., Criegee, L., Franke, G., Heide, P.,
mass is that of the proton or deuteron. Kotthaus, R., Poelz, G., Timm, U., Vogel, G.,
Although the cross sections are smaller than Wegener, K., Werner, H., and Zimmerman, W.,
that for electrons, by the square of the ratio of Proton Compton Scattering between 2.2 and 7
the masses, the scattering is easily distinguished GeV," Phys. Lett., 378, 207 (1971).
by the characteristically higher energy of the 18. Boyarski, A. M., Coward, D. H., Ecklund, S.,
radiation at large angles. At energies above the Richter, B., Sherden, D., Siemann, R., and Sin-
pion threshold, the cross section is dominated clair, C., "Forward Compton Scattering from
by pion nucleon resonances. The experimental Hydrogen and Deuterium at 8 and 16 GeV," Phys.
cross sections for the scattering by protons, as Rev. Lett., 26,1600 (1971).
presented by Steining, Loh, and Deutsch,IS are
shown in Fig. 6. Some experimental results and Cross-references: COLLISIONS OF PARTICLES,
calculations on the coherent scattering from CONSERVATION LAWS AND SYMMETRY, ELE-
deuterium are described by Jones, Gerber, Han- MENTARY PARTICLES, QUANTUM THEORY,
son, and Wattenberg. 16 X-RAYS.
Measurements at photon energies up to 7 and
16 Ge V and comparisons with theoretical pre-
dictions of the vector dominance model have
been reported by groups from Hamburg l ? and COMPUTERS
Stanford. ls
Introduction A computer may be defined as a
A. O. HANSON device capable of solving problems by accepting
data (input), performing prescribed operations
on the data (processing), and providing the re-
References
sults of these operations (output). The basic
1. Evans, R. D., "The Atomic Nucleus," Chapter distinction between electronic calculators,
23, New York, McGraw-Hill Book Co., 1955. which also fit this definition, and computers is
2. Evans, R. D., "Compton Effect," In Flugge, S., that the latter provide speed in performing com-
Ed., "The Encyclopedia of Physics," Vol. 34, pp. plex operations, virtually infinite program and
234-298, Berlin, Springer-Verlag, 1958. data storage, and the ability to interact with the
209 COMPUTERS

environment (including other computers) on a multaneously (in parallel) while the solution is
real time basis. displayed in graphical form in continuous
Computers may be hydraulic, mechanical, fashion. Since operations are performed in
electromechanical, or electronic devices. They parallel, increased problem complexity demands
are broadly classified as being digital, analog, or more computer components, not more time.
hybrid, i.e., analog and digital linked together. Solution time is therefore a function of the time
The first two types differ fundamentally in the characteristics of the problem, not the machine.
manner in which data are stored and operated The speed of the analog computer, therefore, is
upon. Analog computers operate on continuous orders of magnitUde faster than the digital
variables, in the form of voltage or current (the computer in solving complex problems that in-
electrical analog of a number or physical quan- volve calculus. This speed is paid for by a pre-
tity), that represent continuous data. Digital cision of results that is mders of magnitude less
computers operate on discrete numerical data than for a digital computer. Hybrid computers
represented by a series of binary digits. The attempt to combine the best features of both
data and instructions are stored internally and types i.e., the speed of the analog with the pre-
are indistinguishable in memory. Digital com- cision of a digital. This is brought about through
puters perform calculations by adding binary a digital "front-end" which is used to set up
numbers according to instructions derived from and check out the analog computer and its pro-
coding an arithmetical expression or series of gram. The actual computation time is equiva-
expressions (an algorithm) that represent the lent to that of a pure analog machine, but the
problem to be solved. Even though the problem total time (including set-up time) is significantly
may include differential or trigonometric ex- reduced. Indeed, problems may be solved that
pressions, each is reduced to simple addition or would be cost prohibitive on a digital computer
subtraction (addition of the complement) since and, for all practical purposes, impossible to set
addition is the only direct operation a digital up on an analog computer.
computer can perform. Instructions, which are As described below, in physics all three
written sequentially, are executed sequentially types of computers are utilized. Digital com-
(one at a time). Calculations usually culminate in puter applications range from theoretical cal-
a numerical display or graphical representation culations to pure data collection and experi-
of the results. Because of the sequential nature ment control. Applications of analog computers
of the digital process, the amount of time spent involve primarily solutions of differential equa-
in solution is proportional to the problem com- tions which serve as models of physical systems.
plexity. Analog computers, on the other hand, Hybrid systems come into play for extremely
are composed of elements which perform sum- complex simulation. Examples include determi-
mation, integration, multiplication and differ- nation of tactical envelopes for missiles, optimi-
entiation directly. Rather than a serially coded zation of control settings in a nuclear power
algorithm, the instructions are in the form of plant, and development of chemical kinetic
basic modifications of the analog computer's models. Whatever the branch of physics, one
circuitry by a user-wired patch board panel. can be sure that computers today play some
There is no internal data or program storage as significant role.
such. Instructions are _execu ted effectively si- Computers of the Past The abacus has been

1
A

A 0-----1-1

CIA + BI
A
INPUTS Bo----I

Co-------~

INTEG RATER

OUTPUT Yo-------------;

Y = f[CIAA+ BI] dt
BALANCE POTENTIOMETER

FIG. 1. Typical analog computer configuration to solve the equation for y. Requires: 1 adder, 1 inverter,
2 multipliers, 1 integrator.
COMPUTERS 210

called the earliest computing machine. For Herman Hollerith, who had worked for the
more than 3000 years after the recording of its census office and was a mechanical engineer,
invention, this was the sole device available to solved the problem by devising a punched card,
aid in arithmetic calculations. Finally, in the which could contain all pertinent data, and a
year 1642, a Frenchman, Blaise Pascal, in- series of machines for punching, counting and
vented the first mechanical adding machine. His sorting. Using this equipment, census figures
stylus operated "arithmetic machine" had the were published in less than three years with
ability to handle carryovers from one column unheard-of accuracy. Hollerith revised his
to the next. Nearly 200 more years elapsed be- punched cards in 1894. They contained 80
fore C. X. Thomas, also a Frenchman, working columns, each with holes for 0 through 9; this
on a concept proposed by the German, Gott- is exactly the format in predominant use today.
fried Wilhelm Leibniz, built an "arithmometer" Hollerith's efforts eventually led to the forma-
in 1820 which, besides addition and subtraction, tion of the International Business Machines Cor-
could perform multiplication and division using poration, the world's largest manufacturer of
the concept of repeated addition and subtrac- computers. Tremendous quantities of work
tion. have been accomplished and are still possible
While this was going on, an independent de- using punch card techniques. However, this
velopment in the weaving industry saw French- methodology is practical only for sorting,
men Jacques de Vaucanson, in 1741, and counting and selecting in a limited number of
Joseph Marie Jacquard, in 1804, use holes ways.
punched, first in metal drums, later in punched Electronic and electrical techniques were re-
cards, as the control or programmer for textile quired to make practical, accurate computa-
looms. On a more esoteric plane, Englishman tions of the vast quantities of data to solve
Charles Babbage, in 1823, began construction problems and prove theories which today serve
of his "difference engine." This device, based as the cornerstone of modern science. These
on the fact that an equation of degree n will machines are based upon the foundations laid
have a constant nth difference, was used to by George Boole, who pioneered in the field of
make calculations for trigonometric and log- symbolic logic, and Allan Turing, who hypothe-
arithmic tables. Babbage succeeded in con- sized a universal computer. George Boole's
structing a machine to solve a second degree algebra provides a mechanism for representing
equation. But his "analytical engine," which logic in mathematical symbols and rules for
had a memory, control, an arithmetic unit, and calculating the truth or falsity of statements.
an input/output section could not be built at Digital computers carry out these operations an
that time with sufficient precision to produce infinite number of times. Allan Turing's paper,
reliable results. Earliest analog computers of the "Computable Numbers," in 1937, described a
type designed by Vannevar Bush and others in hypothetical Turing machine that can solve any
the 1930's actually followed these same me- type of mathematical problem which can be re-
chanical principles until electronics began to duced to coding in a given set of commands
take over in the late forties. This activity was within the memory capacity of the machine.
culminated perhaps by the Maddida, which ap- First application of these principles resulted
peared in 1951. Analog computers since that in the Bell Telephone Laboratories relay com-
time have differed mainly in the application of puter in 1939. Five years later, in 1944, the
advanced modular electronics and in linkages second significant relay computer, which was
with digital computers. Developments of cal- also the first general purpose digital machine,
culators after Pascal and Thomas are not reli- was constructed at Harvard by Professor
ably documented. Two American mechanical Howard Aiken, with funds provided partially
adders, which spawned present day companies, by IBM. Known as the Mark I, this machine was
are worthy of mention, however. These include really a huge electro-mechanical calculator. The
the Felt "macaroni box" made in 1885 by a Electronic Numerical Integrator and Calculator,
firm which later became the Victor Comptom- "ENIAC," the first electronic computer, was
eter Corporation, and the "listing accountant" developed in 1946 by J. P. Eckert and J. W.
made by a forerunner of the Burroughs Cor- Mauchiy of the Moore School of Engineering in
poration. Philadelphia. Funds were provided by the U. S.
The real push behind the development of Army with the promise that the machine would
modern digital computers came about because be suitable for calculating ballistic tables. This
of U. S. Government requirements, first to machine contained 18,000 vacuum tubes which
count people and tabulate corresponding data; replaced the former mechanical relays as switch-
later, during World War II, for developing artil- ing elements. As might be expected, it was huge,
lery trajectory tables and performing calcula- weighing over 30 tons, and terribly unreliable.
tions for the Manhattan project. In preparation Nevertheless, it existed until 1955 after over
for the 1890 census, it became clear that, using 80,000 hours of operation had been logged.
available counting and sorting techniques, it Development proceeded thereafter at a much
would be nearly time to perform the 1900 more rapid rate. IBM continued work, produc-
census before 1890 figures could be determined. ing the Selective Sequence Electronic Calculator
211 COMPUTERS

(SSEC) in 1947. The Moore School developed a


second machine known as the Electronic Dis- MEMORY
crete Variable Computer (or EDVAC) which UNIT
became operational in 1952. EDV AC is ac-
cepted as the first stored program computer.
Unlike earlier machines which were programmed
at least partially by setting switches or using
patch boards, this machine and all others which
were to follow, stored instructions and data in
identical fashion. EDV AC used acoustical delay CONTROL
lines, which were simply columns of mercury UNIT
through which data passed at the speed of
sound, as the main memory. This type of stor-
age has given way to magnetic core memory
and semiconductor memory in computers of
today.
In 1946, John Von Neumann, a mathemati- ARITHMETIC
cian at the Institute of Advanced Study (lAS), AND
Princeton, New Jersey, presented a paper en- LOGIC
UNIT
titled "Preliminary Discussion of the Logical
Design of an Electronic Computing Instru- FIG. 2. Basic configuration of a digital computer.
ment." This paper, which was prepared jointly Problems are of the form
under a contract with the U. S. Army, suggested Input A = 3.0, B = 4.0
the principles under which all digital computers
Output C = SQRT (A * A + B * B)
which followed would be built. This included
internal program storage, relocatable instruc-
tions, memory addressing, conditional transfer,
parallel arithmetic, internal number base con- processes and make basic decisions. These small
version, synchronous internal timing, simulta- machines are referred to as minicomputers.
neous computing while doing input/output, and Large word computers serve as theoretical num-
magnetic tape for external storage. An lAS com- ber crunchers, often with smaller machines pre-
puter, employing most of these concepts, actu- processing input data or controlling output de-
ally went into operation in the early fifties. All vices. Computer word size ranges from eight
machines which followed used virtually the same bits, or a single word accuracy of one part in
principles. UNIVAC I (Universal Automatic 256, to sixty bits for an accuracy of one part in
Computer) was the first commercially available 260 or a maximum positive decimal number of
digital computer (circa 1950). This machine was about 1 152 921 504 000 000 000.
a direct descendent of ENIAC and EDV AC, In setting (programming or coding) instruc-
having been built by Remington Rand follow- tion sequences, the user arranges for the control
ing acquisition of the Echert-Mauchly Computer unit to determine, through an instruction de-
Corporation. Eventually, 48 UNIVAC I's were coder, whether the instruction itself refers to
built making Remington Rand the number one some memory location where data resides or
computer manufacturer until International Busi- calls for some basic arithmetic or logical opera-
ness Machines (IBM) began in earnest in 1954 tion to be performed. These operations are per-
with the introduction of its 700 line. This formed between data registers. Registers always
machine and its successor, the 650, made IBM provided include: an accumulator, a program
the number one computer manufacturer in the counter, a memory address indicator, and an
world, a position which it holds today. overflow indicator. Operations themselves may
Digital Computer Hardware As shown in modify the stored instructions by performing
Figs. 2 and 3, general-purpose digital computers calculations on them, branch to a new set of
are comprised of four basic components: an instructions, and/or cause the machine to inter-
arithmetic or computing unit, a high speed in- act with the outside world. Response from this
ternal storage unit, input-output devices, and a interaction may cause the machine to "decide"
control unit. Data and instructions are indis- what operations to perform next depending
tinguishable within the computer; both are rep- upon the options built into the program. The
resented by binary patterns of semiconductor computer can make no decision of its own ac-
or magnetic core states. Each binary unit is cord; it is simply directed to "conclude" which
called a "bit". A fixed number of bits is re- one of a given set of alternatives best fits a
ferred to as a computer "word". A word is, given situation. This being the case, it is impor-
generally, the smallest directly addressable tant to remember that computers don't solve
whole unit of memory. The number of bits in problems, people that program them do. The
the word determines the accuracy of the basic advantage possessed by the computer is speed
machine and its cost. Small word machines are of operation. Even the slowest machines can
employed in real world applications to control perform an addition of two numbers in less
COMPUTERS 212

7 TRACK, 37.5 INCHES CENTRAL PROCESSOR


READ·RATE:
PER SECOND 64 k MEMORY, 600 nSEC CYCLE TIME
1200 CARDS PER MINUTE
7.5 kHz AT 200 B.P.1. FLOATING POINT PROCESSOR
20.8 kHz AT 556 B.P.1. 01 RECT MEMORY ACCESS
30.0 kHz AT 800 B.P.1. REAL TIME CLOCK HARDWARE
ARITHMETIC ELEMENT

BUFFERED
1/0
DATA
CONTROLLERS PUNCH·RATE :
CHANNELS
250 CARDS PER MINUTE
INPUT SIGNAL CONDITIONER
INTERFACE SYNC. UNIT

READER:
9 TRACK, 37.5 INCHES PER SECOND READS 400 C.P.S.
30 k B·BIT C.P.S. AT 800 B.P.1. 5,7, OR 8 LEVEL TAPE
1200 LINES PER MINUTE PUNCH:
136 COLUMNS PER LINE PUNCHES 150 C.P.S.

300 STEPS PE R SECON 0


0.01 INCH STEP WIDTH
11 INCH PLOT WIDTH

I/O TERMINALS
SEND·RECEIVE RATE:
300 WORDS PER MINUTE

CAPACITY: 3 MILLION 16·BIT WORDS


DATA FLOW: 1.25 mHz BIT RATE

FIG. 3. Typical 3rd generation, medium size computer.

than 3 microseconds. Using special circuitry, multiprogramming. This assumes, of course,


multiplication of any two numbers takes less that no single program requires total system
than ten microseconds. These arithmetic func- resources for more than a small time period.
tions, plus logical operations like AND, OR, Hardware controllers have been optimized to
XOR, repeated thousands of times in one sec- take advantage of this usual case and permit
ond enable the machine to use iterative tech- virtually simultaneous user access to the central
niques to do all operations necessary for cal- processor.
culating mathematical tables, as well as to keep Digital Computer Software Computer soft-
up with the real world in monitoring and con- ware refers to all programs which direct ma-
trol applications where data at rates of up to chine activities. This software can be generally
100,000 characters per second are encountered. divided into three classes: an operating or ex-
Figure 3 shows the configuration of a typical ecutive system; language assemblers, translators
third generation, medium size machine (first and compilers; user applications programs,
introduced after 1968, and employing medium mathematical program libraries, and data analy-
scale integrated circuitry with a multiprogram- sis packages. The first class generally goes under
ming operating system). Its purpose is to show acronyms similar to Disk Operating System
the diversity of peripheral gear which is tied to (DOS), Mass Storage Operating System (MSOS),
a computer and the speed at which this equip- Real Time Executive (RTE), UNIX, CPIM, or
ment operates. The numbers shown are repre- other such names. It is the function of this
sentative and by no means show the highest master software to schedule users and users'
capacities available. This configuration will sup- programs, allocate processor and peripheral
port a dedicated operation (one user at a time) resources, perform internal "housekeeping,"
or several users simultaneously in a multipro- and handle emergencies. It is the operating sys-
gramming or time shared mode. These latter tem that permits time sharing and multipro-
terms refer to the ability of the system to run gramming to be carried out. All but the most
more than one user program simultaneously. basic systems offer this software. The second
The difference between time sharing and multi- software class operates within or is subject to
programming, as the terms are usually used, is the OS. It includes such things as FORTRAN
that several users are connected simultaneously (Formula Translator) compilers, BASIC (Begin-
to the system in time sharing, while more than ners All-Purpose Symbolic Instruction Code)
one program is run at the same time under interpreters, a system editor. and an assembler.
213 COMPUTERS

These and other conversion programs reside point. On small dedicated computer systems a
within the machine and are used to change single application package may be the only soft-
English language statements or data to machine ware loaded other than input/output drivers
bit patterns which are decoded internally. Stan- which communicate with peripheral devices.
dards have been devised for FORTRAN, BASIC, This is true of microprocessors, experiment
ALGOL, COBOL, PL/l and other languages so controllers, front-end machines and, in general,
that a program written on one manufacturer's all computers that have less than 8000 words of
machine will (almost) run on another. Assembly central memory . Software clearing houses that
languages, however, are machine independent supply computer programs worldwide for ex-
and in no way compatible. The final class of clusive manufacturers, as well as for general use,
software includes those programs designed and have gone into business to meet the demand for
written to accomplish a specific task. For ex- application programs.
ample : find the roots of a polynomial expression Scientific Applications The evolution of the
of a certain type; or given a set of data describ- digital computer and modern scientific know-
ing the elongation versus temperature and load ledge are closely linked. We have seen that de-
of an alloy specimen, as well as known constants, velopment of the ENIAC was the direct result
calculate stress and strain values at the break of the need to perform millions of calculations

PERIPHERALS

• TELETYPEWRITER • CARD READER


CENTRAL PROCESSOR • PAPER TAPE I/O • MAGNETIC DRUM
• MAGNETIC TAPE • MAGNETIC DISK
• LINE PRINTER

DISPLAY

RAW DATA

• 10 TO 14 BITS HIGH LEVEL OR


• SINGLE-ENOED
• INTEGRATING LOW LEVEL
• DIFFERENTIAL
• WIDE-BAND • SAMPLE & HOLD
• HIGH CMR • SOLID STATE

CRITICAL
INTERRUPT INTERFACE INPUT SIGNAL
CONDITIONER 14---- DISCRETE
& FAN-IN CONTROLLER
EVENTS

DIGITAL INPUT CONTROLLER INPUT SIGNAL 1+--- DISCRETE


& MULTIPLEXER CONDITIONER DATA

EVENTS
COUNTERS & INPUT SIGNAL OR PULSE
INTERFACE CONTROL CONDITIONER
TRAINS

CONTROL &
PARALLEL WORD PARALLEL WORD RELAY
DISPLAY
OUTPUT CONTROLLER OR POWER AMP OUTPUTS
(DISCRETES)

CONTROL &
PARALLEL WORD
RELAY OUTPUTS t-----~ DISPLAY
(DISCRETES)

LOGIC LEVEL
CONTROL

CONTROL

FIG. 4. Typical data acquisition and analysis configuration.


COMPUTERS 214

on the Manhattan Project that would have trken homes. Thousands of people are purchasing
years during a time when years were not wail- personal computers each week, not only low-
able. A second example is the fact that no at- end microcomputers, but also high-end systems
tempt was made to find a rigorous solution of such as the IBM PC. They 1!.ave proven to be
the "3-Body Problem" of celestial mechanics incredibly popular and several hundred thousand
until the advent of the modem digital com- of them were sold in only the past three years.
puter. Today these servants of scientists and Further, for every person who buys a personal
engineers are found in their smaller versions in computer, two or three more are considering a
more physics laboratories than not. Theoretical purchase. This has been attributed to both the
data or that produced by passive data acquisi- videogame explosion and the public's recogni-
tion devices is then transmitted to larger sys- tion of the power and capability of the micro-
tems for further processing. Where theoretical computer. These computers are reasonably
studies are involved, large systems perform inexpensive, and designed to allow the average
computations that would simply not be practi- person to learn about the computer and use it
cally possible otherwise. Examples of typical to solve everyday problems.
applications or user software now in use to solve The personal computer is a genuine computer
particular scientific problems include programs which has most of the features of the big main-
to: calculate Cartesian coordinates for all atoms frame computers. As computers became smaller,
in a molecule, compute numerical eigenvalues they also became faster, and today processing
and matrix elements for the quantum mechan- time is measured in microseconds and nano-
ical radial equation, and determine the coulomb seconds (billionths of a second). For example,
lattice energy of an ionic crystal. There is also the IBM PC is four times faster than the IBM
a complete system of programs for quantitative 360. It can be equipped with enough capacity
theoretical chemistry. In the area of direct data to handle the accounting and inventory control
acquisition and instrument control, small com- tasks of most small businesses. It can also per-
puter systems of the configuration shown in form computations for engineers and scientists,
Fig. 4 are most often employed. These systems and it can be used to keep track of home fi-
are built as part of electron spin and nuclear nances, budget management, investment analy-
magnetic resonance spectrometers, all types of sis, and many, many other applications ranging
optical spectrophotometers, real time Fourier from educational to recreational to business
analyzers, and x-ray spectrometers, to name only functions.
a few. They are used to control and acquire Figure 5 shows the basic configuration of a
data in applications ranging from remote seis- personal microcomputer. Like other computers
mic stations to laboratory spark chamber moni- it consists of the CPU and the system board
toring. In physics education they find uses that connects the CPU to other devices for
ranging from teaching basic principles to per- input and output (I/O) and storage. There are
forming semiautomatic pattern recognition for three basic types of memories: ROM (Read-
bubble chamber film data analysis. Only Memory) which is not user-modifiable and
Personal Computers The latest entry into which contains programs you would never want
the computer industry is the personal computer to change; RAM (Random Access Memory), in
(PC). In the past eight years, the personal com- which data can be either written or read and
puter has moved from the province of hobbyists which can be thought of as the microprocessor's
to take a place in business offices and private work space; the mass storage device, which al-

CPU Address bus (16)


microprocessor
and support

! ! ! !
circuitry
Control bus
(10:12)
To expansion
port (if
RAM Serial Parallel
ROM available)
interface interface

Databus (8) ~ t t t
I/O bus I/O bus

CRT, Disk drive,


keyboard, printer,
printer, speech·
modem, etc. synthesizer, etc.

FIG.5. Block diagram of a basic personal microcomputer system.


215 COMPUTERS

lows the storing of great quantities of informa- will allow "intelligence" to be built into all
tion and programs on magnetic disk or tape computer peripherals and make possible special
outside the computer itself. Parallel and serial purpose modules to be employed for handling a
interfaces connect the microcomputer to other single arithmetic or control function. This will
devices such as printers, keyboards, etc. Early permit many more operations to be performed
computers had only serial ports that received in parallel, thus decreasing time to solution. The
and returned data serially, or, one bit at a time. low cost of these modules will cause them to
Processing time in computers was vastly im- begin to appear almost everywhere. Few scien-
proved by the development of parallel ports tific instruments will be built where they are
which handle and move large amounts of data not employed as control elements or to provide
simultaneously. The process utilizes electrical at least intermediate results directly. Software
conductors (printed circuit boards) that will development will respond to make use of this
carry multiple electrical impulses around the distributed internal processing and permit better
innards of the computer. These multiple impulse machine and manpower utilization. More em-
carriers are known as busses, and several used phasis will be placed on high level languages
together are called bus systems. similar to PLjI and on the development of ap-
Although the industry is yet dominated by plications packages which will handle a given
8-bit microprocessors, all new systems are 16- problem from start to finish. Less distinction
bit. Within a few years they will have replaced will be made between systems and language
the 8-bit processors as the brains of medium- routines. A single tool will be available to ac-
priced personal computers. The power of the complish a given objective. Problems like code
16-bit microprocessor-with its ability to ad- conversions between units will be handled by
dress over 1 million characters of computer hardware. Modular programming techniques
memory-is needed to run software that is more and top-down software management will harness
flexible and easier to use than the software software development making distributed intel-
commonly used today. Microprocessor chips ligence and application of automatic data pro-
(such as Intel Corporation's 8088 and Motorola's cessing a reality in all fields.
MC68000) are already turning up in the more
advanced personal computers and manufacturers DENNIS E. WISNOSKY
of the Corvus Concept and the IBM Personal
Computer, among others, are putting that power References
to good use. Further, during 1984 the first
32-bit biped PCs will be introduced. This will Weik, Martin H., "Standard Dictionary of Computers
put on a desk top computer power of limitless and Information Processing," New York, Handen
dimensions to most people. Book Co., 1969.
Future Developments As we have shown, Davis, Gordon B., "An Introduction to Electronic
the computer industry only recently celebrated Computers," New York, McGraw-Hili Book Co.,
its silver anniversary, yet there is almost no area 1965.
of our life which it has not affected. In terms Bernstein, Jeremy, "The Analytical Engine: Comput-
of dollars, the hardware, software and man- ers, Past, Present, and Future," New York, Random
power investment is well over 100 billion. De- House Inc., rev. ed. 1981.
spite these impressive figures, which point to Boo1e, George, "An Investigation of the Laws of
success, many serious shortcomings in computer Thought," New York, Dover Publications, 1954.
technology and its applications are evident. Quantum Chemistry Program Exchange, Indiana Uni-
Most experts agree that hardware advances at versity, Bloomington, Indiana 47401.
the component level will slow down. Peripheral Korn, Granizo Art!}ur, "Minicomputers for Engineers
devices such as line printers, card readers, etc., and Scientists," New York, McGraw-HilI Book Co.,
appear to be approaching their capacity limits. 1973.
These devices must give way to new technologies Perone, Sam P. and Jones, David 0., "Digital Com-
or be reduced in cost to the extent that they puters in Scientific Instrumentation," New York,
can proliferate infinitely. It seems more reason- McGraw-Hili Book Co., 1973.
ale that they will give way to more direct user "Digital Products and Applications," Digital Equip-
involvement through input/output terminals ment Corporation, Maynard, Massachusetts, 1971.
and direct data transfer. The direction of central "CDC System 17 Computer Systems Applications
and mass memory supports this contention since Guide," Control Data Corporation, Minneapolis,
costs, speed, and density continue to improve Minnesota, 1973.
drastically. This trend will continue for some Petrocelli, O. R., Ed., "The Best Computer Papers of
time with mechanical devices giving way to new 1971," Princeton, Auerbach Publishers Inc., 1972.
approaches employing chemical and basic mo- Frederick, Franz, "Guide to Microcomputers," Wash-
lecular phenomena. Real future advances will ington, D.C., Association for Educational Communi-
occur as a result of changes in arithmetic and cations and Technology, 1980.
control elements and in the software systems Goldstein, Larry and Goldstein, Martin "IBM Personal
which drive computers. Micro-processors made Computer: An Introduction to Programming and
possible by large scale integrated circuitry (LSI) Applications," Robert J. Brady Co., Prentice-Hall
COMPUTERS 216

Publishinb and Communications Company, Bowie, the rate of homogeneous nucleation of droplets
Maryland, 1982. from supersaturated vapor.
Dertouzos, Michael L. and Moses, Joel, "The Computer The remarkable agreement 3,S of this macro-
Age: a Twenty-year View," MIT, 1979. scopic theory with observations of the critical
"All About Personal Computers," Debran, N.J., Data- supersaturations for appreciable nucleation rate
pro Research, 1982. of various liquids in cloud chambers stood for
Mark, Frank, "Discovering Computers," London: many years as the basis of our knowledge of
Stonehenge, 1981. nucleation. For example, referring to the data
of C.T.R. Wilson 6 and of C. F. Powell? who re-
Cross-references: ELECTRONICS; MEASURE- ported a "fog limit"* for homogeneous nuclea-
MENTS, PRINCIPLES OF; CYBERNETICS; SEMI- tion of water droplets at a supersaturation ratio
CONDUCTOR DEVICES. of about 5.0 at 275 K,t agreement with the
macroscopic theory is excellent. 8 However in
recent years it has been pointed out 4,S,8,9,lO
that the external partition functions for free
CONDENSATION translation and rotation had been neglected in
The condensation of a vapor to form a liquid, the macroscopic theory. When these contribu-
amorphous phase, or crystal generally occurs tions are included, the theory predicts a critical *
by the mechanism of nucleation and subse- supersaturation ratio of about 3.0 for water
quent growth. Nucleation is a thermally acti- vapor at 275 K, in poor quantitative agreement
vated process which leads to a stable fragment with Powell's observations. This situation stim-
of the condensed phase. In the absence of ulated a great deal of experimental and theo-
surfaces of certain condensed phases, reactive retical work in the field. On the experimental
foreign molecules, or other potent catalysts to side, two new techniques were developed to
the nucleation process, it is usually the slower measure critical supersaturation ratios for homo-
step and occurs at an appreciable rate only geneous nucleation: (I) the diffusion cloud
under conditions considerably removed from chamber ll ,l2,l3 and (2) the supersonic nozzle
equilibrium. For the usual case where nuclea- method. l4 ,ls Work is still in progress, but it
tion catalysts are present, the process is charac- appears that virtually all substances with the
terized as heterogeneous nucleation, but if possible exception of argon l6 and a~monia,l?
there are no such catalysts at all, it is called follow the original macroscopic theory which
homogeneous nucleation. ignores the contributions from free translation
In principle, statistical thermodynamics would and rotation. O~ the theoretical side, much ef-
appear to offer the most attractive approach fort has been dIrected toward ascertaining the
to nucleation rate theory. For example, Band l magnitude of another correction to the macro-
and Hill2 have given formal treatments for the scopic theory, which tends to counterbalance
equilibrium concentration of clusters of mole- the effects of free translation and rotation.
cules in a vapor. However, the internal partition This is termed the replacement partition func-
functions have thus far eluded quantitative tion, and it describes the free energy due to the
evaluation. Further, in the case of metallic six internal degrees of freedom the isolated
systems, there is at present little knowledge of cluster does not have because it is not a part of
the electronic energy states in clusters con- the bulk phase. Efforts to calculate this quan-
taining only a few atoms. tity by classical phase integral methods l8 ,l9,2o
Accordingly, even to the present day, most have not been entirely successful because of the
treatments follow that of Volmer 3 and co- great difficulty in defining an embedded cluster
workers, who evaluated the free energy of for- in a li~uid. In fact, some treatments yield
mation of clusters by ascribing macroscopic large l8 ,l replacement ~artition functions while
thermodynamic properties to them. Thus the others give small ones. 0 However, the calcula-
free energy of a droplet is described as the sum tion is somewhat easier for crystals, and the
of a surface term (area times surface tension) result 20 indicates that the replacement parti-
and a volume term (volume times the negative tion function is not large and hence does not
bulk free energy change). An attractive feature appreciably offset the contributions from free
of this approach is that it permits ready visuali- translation and rotation in the case of homo-
zation of the origin of the free energy barrier geneous nucleation of crystallites from the
to nucleation in terms of the maximum in the vapor. Unfortunately, there are very few ex-
above sum as a function of size. However, a perimental data for this case, but the theo-
very unattractive aspect is that the calculated retical result has been used in astrophysics
size of the critical nucleus, i.e., the cluster size calculations. 21 In summary, one might con-
at the top of the free energy barrier, is only
about 100 molecules, which leads one to doubt *Denotin~ production of many droplets, i.e., of the
the applicability of macroscopic concepts in order of 10 cm-3.
the present examples. Nevertheless, following tMeaning the ratio of actual to equilibrium partial
standard methods,4 this spherical-drop model pressure of vapor.
leads directly to a rather simple expression for *Critical for a given, usually high, nucleation rate.
217 CONDENSA nON

clude at the present time that the good agree- ment with observed supersaturations for
ment between the original macroscopic theory appreciable nucleation rate is not good for either
and experiments on liquids is fortuitous and the old or the new theory. Nevertheless it has
due to (i) a large replacement partition func- been established that in most cases nucleation
tion or (ii) the circumstance that the macro- occurs by the processes of adsorption, surface
scopic surface tension overestimates the droplet diffusion, and statistical fluctuation to form the
entropy and underestimates the potential crystalline nuclei. Also, particularly for metals
energy.22 ,23 on ionic and semiconductor substrates, many
One might think that the above issues could nucleation rates have been quantitatively mea-
be settled by computer calculations using sured by means of both kinematic and in situ
fairly realistic potential functions to calculate transmission electron microscopy. Indeed,
the cluster free energies, and much work has within the past 15 years there has been a re-
been done in this area. 22 ,24 In one of the more markable increase in the degree of sophistica-
elaborate efforts,25 Monte Carlo methods were tion of such exrceriments. Ultra high vacua (of
used to calculate the surface tension and chemi-
cal potential of bulk Lennard-Jones argon,
°
the order of 10- Torr) leading to much greater
cleanliness are now common and the character-
and these results were applied to estimate the ization of both substrate and deposited crystal-
free energy of isolated L-J argon droplets. The lites has been greatly enhanced by the applica-
resulting free energies of formation were then tion of LEED, Auger spectroscopy, mass
compared with the actual Monte Carlo free spectrometry, transmission electron diffraction
energies of isolated liquid L-J argon clusters. It and a host of other new techniques. However,
was found that the contributions from free interpretation of the rate data is often obscured
translation and rotation were required to de- by ignorance of the actual (often defect) sites
scribe the Monte Carlo cluster data. This means for nucleation on the substrate surface. Inter-
that, unlike the case of physical experiments on estingly enough, it has been possible to demon-
real liquids, the original macroscopic theory strate and measure the mobility of small (10-
must be modified by the contributions from free 20 atom) clusters,32 e.g., of solid gold on the
translation and rotation in order to describe (100) surface of alkali halide crystals. In
L-J argon. However, a major shortcoming of connection with the above experiments, se-
this and other computer calculations is that lected-zone dark-field electron microscopy
pairwise potentials were used. This is a serious (SZDF) has been developed 33 to determine
difficulty in view of the importance of three- the crystal structure, orientation, and degree
body potentials in describing surface properties. of perfection (absence of twinning) of each of
On the other hand, molecular dynamics com- the myriad of isolated microcrystallites on a
puter simulations on highly supersaturated substrate surface. SZDF employs annular ob-
L-J vapor have been useful in predicting spi- jective-lens apertures of different geometries
nodal decomposition as another condensation to select only a well-defined number of Debye-
mechanism, which is an alternative to nuclea- Scherrer diffraction rings. Thus it is possible
tion and growth of droplets. Spinodal decom- to map, within one image, all specimen areas of
position has long been recognized 26 as a mech- random azimuthal orientation that diffract
anism of phase separation in binary solid into the selected range of Bragg angles.
systems in which there is no symmetry change Historically, field emission microscopy has
and for which the free energy-composition curve yielded interesting information on nucleation
is continuous. At high supersaturations the ini- of metals in deposition from the va~or onto
tial phase is unstable for all infinitesimal clean tungsten field emitter tips.34, 5 Many
density variations having a wavelength greater metals, e.g., copper and gold, form critical
than some "critical" wavelength. The molecular adsorbed coverages of several monatomic layers
dynamics work on L-J vapor 27 exhibited the before nucleation of three-dimensional crystals
characteristic interconnected structure of vapor occurs. This is the Stranski-Krastonow mecha-
and liquid, and the process was consistent with nism 36 for deposition of crystals from the vapor
the recognized laws for spinodal decomposition. and may be expected for situations in which
There are at present few, if any, physical experi- the binding energy of the adatoms to the sub-
ments to corroborate this prediction. strate is high. In some cases the initial mono-
A considerable amount of work has been layers are thought to be pseudomorphic'" with
done on the heterogeneous nucleation of metal the substrate. 37 Another important mechanism
crystals from thermal vapor beams onto sub- for cases of high binding energy is the mono-
strates. 5,28,29 Many theoretical approaches fol- layer-by-monolayer overgrowth (MO) or Frank-
low a macroscopic treatment similar to that van der Merwe mode,38 which also may occur
outlined above. In view of the high supersatura- with pseudomorphism in the initial monolayers.
tion ratios, typically 106 -1 0 33 , and resulting In some cases the monolayers are initiated by
small critical nucleus sizes (I to 5 atoms) nucleation of 2-dimensional discs. 39 Misfit dis-
involved in this case, an "atomistic" theory30,31
has been introduced to replace the older macro- "'Of exactly the same crystal structure and lattice
scopic theory. However, quantitative agree- parameter.
CONDENSA nON 218

locations to accommodate the strain are thought currence of this type of growth from the
to form at the substrate-deposit interface as the vapor at high supersaturation.
deposit thickens. Ultimately, the deposit be- The fact that real crystals do indeed grow
comes noncoherent with the substrate and at much lower supersaturation ratios, of the
attains its natural crystal structure. The misfit order 1.01, continued to present a theoretical
dislocations even facilitate phase transforma- problem for many years. Then in 1949 Burton,
tion in cases where the bulk deposit and sub- Cabrera, and Frank 43 showed that certain
strate are of different crystal structure. Ex- emergent dislocations of the screw orienta-
amples of the MO mechanism are SnSe on tion * must provide a source of monomolecular
(001) SnTe and Pt on (001) Au. 39 Recent steps for growth at low supersaturations. Fur-
studies 39 have provided much quantitative ther, they demonstrated that the resultant
information, such as Burgers vector, spacing growth form on the close-packed surface, the
and mechanism of origin, on misfit dislocations. growth spiral, cannot exterminate itself as do
Also, they have revealed ordered structures other types of steps or ledges. In the usual
of monolayer or sub-monolayer thickness, case, crystal growth by this mechanism is
e.g., Pb on the (Ill) Ag surface. This may have thought to be controlled by surface diffusion
a relevance to the preparation of epitaxial thin of the admolecules. Experimental verification
films. of these predictions is now voluminous. 44
Growth is the process by which the stable Inasmuch as crystal growth from the vapor
nuclei continue to grow and thereby consume is linked to the motion of monatomic steps on
the supersaturated vapor. In general, several the surface, it becomes of great interest to
mechanisms are involved in the growth process, describe the step dynamics arising from surface
and some of these are thermally activated. diffusion gradients. Most of the studies thus
However, the free energies of activation are far have been concerned with the closely re-
usually low, and hence most growth processes lated reverse process of crystal evaporation,
proceed at an appreciable rate even under con- and the theory of step dynamics has been ex-
ditions close to equilibrium where the gross tended to cover stationary-state evaporationS
evaporation flux almost equals the gross conden- with the steps emanating from crystal edges or
sation flux. The first step in growth from the from dislocations and the bunching of evapora-
vapor is thought to be adsorption of the imping- tion steps under transient conditions. 4S ,46 Re-
ent molecule. The bulk of both experimental cently the evaporatior, rate from (IOO) KCl
and theoretical work indicates that the im- single crystal surfaces WiS measured as a function
pingent atoms or molecules are in most cases of undersaturation unjer stationary-state con-
thermally accommodated and adsorbed at ditions,47 and remarkable agreement with
the surface before being either re-evaporated theory was found. In general the evaporation
or integrated into the liquid or crystalline coefficientt will lie between ! and 1 and the
structure. catalytic effect of dislocations in providing
In the case of liquids, it is thought that the additional monatomic steps disappears as the
molecular mobility is sufficiently high that the undersaturation ratio rises above 0.9. Similarly,
adsorbed molecules are taken almost immedi- in recent electron microscope work on vacuum
ately into the liquid structure. However, the evaporated (100) NaCI surfaces,48 the transient
situation is quite different for crystals, whose fluctuations in monomolecular step spacing
surfaces are still most conveniently visualized caused by a sudden change in evaporation
in terms of the or~nal "atomic building block" temperature were quantitatively described by
model of Kossel and Stranski.41 Thus, in the theory.
the case of certain surfaces of high index, the Most of the theoretical studies of growth in
kinks in the steps of the atomically rough deposition on substrates have not been in terms
surface provide ready sinks for adsorbed mole- of the fundamental microscopic mechanisms,
cules. In fact, experiment shows that such namely, the screw dislocation spiral mechanism
planes grow so rapidly that they quickly and the monatomic disc nucleation mechanism.
eliminate themselves from the crystal growth Perhaps this is because of the experimental
form, leaving the smoother surfaces of low difficulty in observing growth rates of the small
index. These closely packed planes contain no crystallites on substrates. Rather, the main
steps and kinks to serve as sinks for the ad- thrust of the theoretical work on growth thus
molecules diffusing on the surface. Accord- far has had as its goal prediction of the crystal-
ingly, for a perfect crystal, it is thought that lite number density and size distribution. This
growth can proceed only by nucleation of is of course a complicated problem affected by
new monomolecular layers, whose edges nucleation, depletion of the adatom concen-
provide the sinks, and their lateral propaga- tration about growing clusters, surface dif-
tion. A typical supersaturation ratio for ap-
preciable growth by this mechanism is of *Or dislocations with a component of the Burgers
the order of I.S for molecular substances, and vector perpendicular to the surface.
there is a large amount of theoretical and tRatio of actual to eqUilibrium gross evaporation
experimental evidence42 for the general oc- flux.
219 CONDENSA TlON

fusion fields, competition between crystal- 15. Wegener, P. P., and Wu, B. J. C., p. 325 in "Nucle-
lites for the available ada toms and the effect ation Phenomena," Vol. 7, Adv. Colloid Inter-
of capillarityt in retarding crystallite growth. face Sci. (A. C. Zettlemoyer, Ed.), New York,
Most of the studies have confined themselves Elsevier, (1977).
to the case of incomplete coverage of the 16. Hoare, M. R., Pal. P., and Wegener, P. P., J. Col-
substrate. There are many treatments of these loid Interface Sci. 75,126 (1980).
phenomena, but one useful approach is nu- 17. Dawson, P. B., Willson, E. J., Hill, P. G., and
merical integration of the divergence of clus- Russell, K. C.,1. Chem. Phys. 51,5389 (1969).
ter current in size space with consideration 18. Reiss, H., p. 1 in Ref. 15.
of capillarity in the boundary conditions. 49 19. Kikuchi, R., p. 67 in Ref. 15.
The results give a reasonably good description 20. Nishioka, K., and Pound, G. M., p. 205 in Ref.
of the observed time evolution of cluster size 15.
distribution. :I: 21. Czyzak, S. J., Hirth, J. P., and Tabak, R. G.,
In the interests of brevity, the complex and "The Formation and Properties of Grains in the
interesting effects relating to diffusion in the Interstellar Medium," Vistas in Astronomy 1982
vapor,s adsorption of impurities,S chemical (in press).
reaction,42 and dissipation of the heat of con- 22. Abraham, F. F., "Homogeneous Nucleation
densations have been omitted from the above Theory," Supplement 1 to Advances in Theoreti-
discussion. The subject of crystal growth cal Chemistry, New York, Academic Press, 1974.
morphologies is, of course, huge and beyond 23. Binder, K., and Kalos, M. H., J. Statistical Physics
the scope of the present article. S ,42 22,363 (1980).
24. Hoare, M. R., Advances in Chem. Phys. 40 (1979).
G. M. POUND 25. Miyazaki, J., Pounc!, G. M., Abraham, F. F.,
and Barker, J. A., I. Chem. Phys. 67,3851 (1977).
References 26. Cahn, J. W., J. Chem. Phys. 42,93 (1965).
27. Mruzik, M. R., Abraham, F. F., and Pound, G. M.,
1. Band, W., "Quantum Statistics," New York, Van J. Chem. Phys. 69,3462 (1978).
Nostrand Reinhold, 1955. 28. Voorhoeve, R. J. H., "Molecular Beam Deposi-
2. Hill, T. L., "Statistical Mechanics," New York, tion of Solids on Surfaces: Ultra Thin Films,"
McGraw-Hill Book Co., 1956. Treatise on Solid State Chemistry, (N. B. Hannay,
3. Volmer, M., "Kinetik der Phasenbildung," Dresden Ed.) New York, Plenum, 1976.
and Leipzig, Steinkopff, 1939. 29. Matthews, J. W. (Ed.), "Epitaxial Growth,"
4. Frenkel, J., "Kinetic Theory of Liquids," London, Parts A & B, New York, Academic Press, 1975.
Oxford Univ. Press, 1946. 30. Walton, D.,J. Chem. Phys. 37,1282 (1962).
5. Hirth, J. P., and Pound, G. M., "Condensation and 31. Rhodin, T. N., p. 31 in "Proceedings of a Con-
Evaporation, Nucleation and Growth Kinetics," ference on Single Crystal Films at Bluebell, Penna."
Oxford, Pergamon Press, 1963. (M. H. Francombe and H. Sato, Eds.), Oxford,
6. Wilson, C. T. R., Phil. Trans. Roy. Soc. London Pergamon, 1964.
192,403; 193,289 (1899). 32. Metois, J. J., Zanghi, J. C., Erre, R., and Kern,
7. Powell, C. F., Proc. Roy. Soc. London 199,553 R., Thin Solid Films 22, 331 (1974).
(1928). 33. Poppa, H., p. 215 in Ref. 29, Part A.
8. Feder, J., Russell, K. C., Lothe, J., & Pound, 34. Jones, J. P., Proc. Roy. Soc. (London) 284A,
G. M.,Adv. Phys. 15(57),111 (1966). 469 (1965).
9. Dunning, W. J., "Nucleation," (A. C. Zettlemoyer, 35. Gretz, R. D., and Pound, G. M., Applied Phys.
Ed.) p. 1, New York, Marcel Dekker, 1969. Letters 11,67 (1967).
10. Lothe, J., and Pound, G. M., "Nucleation," 36. Stranski, I. N., and Krastonow, L., Akad. Wiss.
(A. C. Zettlemoyer, Ed.) p. 109, New York, Deut. Math. Nat. KI. 146,797 (1938).
Marcel Dekker, 1969. 37. Bauer, E., and Poppa, H., Thin Solid Films 12,
11. Franck, J. P. and Hertz, H. G., Z. Physik 143,559 167 (1972).
(1956). 38. Frank, F. C., and van der Merwe, J. H., Proc.
12. Katz, J. L.,J. Chem. Phys. 52,4733 (1970). Roy. Soc. AlOO, 125 (1949).
13. Katz, J. L., Mirabel, P., Scoppa, C. J., and Virkler, 39. Honjo, G., and Yagi, K., "Studies of Epitaxial
T. L., J. Chem. Phys. 65,382 (1976). Growth of Thin Films by In Situ Electron Micros-
14. Wegener, P. P., and Parlange, Jean-Yves, Natur- copy," p. 197 in "Current Topics in Materials
wissenschaften 57,525 (1970). Science, Vol. 6," (E. Kaldis, Ed.), Amsterdam,
North Holland, 1980.
40. Kossel, W., Nachr. Akad. Wiss. Goettingen Math.
tSurface tension. Phys. KI. 1,135 (1927).
:l:Recently (unpublished work) continuous and uni- 41. Stranski, I. N., Z. Phys. Chem. 136,259 (1928).
form thick fIlms of amorphous metals and alloys have 42. Strickland-Constable, R. F., "Kinetics and Mech-
been prepared by deposition from vapor beams onto anism of Crystallization," London, Academic
very cold substrates. Here the mechanism is one of Press, 1968.
random impingement and adherence of the immobile 43. Burton, W. K., Cabrera, N., and Frank, F. C.,
atoms to the surface. Phil. Trans. Roy. Soc. London A243, 299 (1950).
CONDENSATION 220

44. Dekeyser, W., and Amelinckx, S., "Les Disloca- Hence,IIA =(lIp) (VIQ) or V =I(pQIA), so that
tions et la Croissance des Cristaux," Paris, Masson, V = IR. The ratio pQIA is called R, the electrical
1956. resistance of the conductor. It is a property of
45. Mullins, W. W., and Hirth, J. P., J. Phys. Chern. a particular sample because it involves the di-
Solids 24, 1391 (1963). mensions A and Q, whereas p and a are in-
46. Surek, T., Pound, G. M., and Hirth, J. P., Surface trinsic properties of the constituent material.
Sci. 41,77 (1974). With V in volts and I in amperes, R is measured
47. Nordine, P. C., and Gilles, P. W., J. Chern. Phys. in ohms. The electrical resistivity p =RA IQ is
74,5242 (1981). then given in ohm-meters if A is in square
48. Bethge, H., Hoeche, H., Katzer, D., Keller, W. K., meters and Q in meters, while a is given in
Bennema, P., and van der Hoek, B., J. Crystal (ohm-meters)-l or mhos per meter.
Growth 48,9 (1980). The major factors determining the magni-
49. Robertson, D., and Pound, G. M., J. Crystal tude of the electrical conductivity for a material
Growth 19,269 (1973). are the conduction electron, or ion, density, and
the nature of the interatomic forces (which
Cross-references: CRYSTAL STRUCTURE ANAL- decide the mobility of the charge carriers). The
YSIS, CRYSTALLIZATION, CRYSTALLOGRAPHY, actual current flow also depends on the size of
FIELD EMISSION, STATES OF MATTER, VAPOR the electric field. Part of the energy carried by
PRESSURE AND EVAPORATION. the current is inevitably consumed as Joule
heat, but there are many commercial devices in
which such heat or light conversion is put to
good use (electric fires, toasters, cookers, light
CONDUCTIVITY, ELECTRICAL filaments, fuses, etc.). The rate per unit volume
The electrical conductivity of a substance is an at which energy is converted is 12 R or VI, in
intrinsic property denoting the ability with units of watts or joules per second. Where it is
which electric charge can flow through the sub- required to add resistance to an electrical circuit,
stance. The meaning of a definite conductivity resistors made of carbon, graphite, or metallic
is most commonly associated with solids, al- alloys are often selected. When wires of low re-
though electrical conduction also occurs in sistance are needed, copper is the most com-
liquids, electrolytes, and ionized gases. Elec- mon material (p '" 1.7 X 10-8 Qm) although at
trons are the usual charge carriers in solids, but ordinary temperatures silver is the best conduc-
ionic conduction can be important for some tor (p '" 1.6 X 10-8 Qm).
materials, such as the alkali halides and com- In alternating current circuits, the conduc-
pounds of the KAg4 Is class, while proton con- tivity or resistivity depends on the frequency
duction has been demonstrated for ice. of the applied electric field. Deviations from
A suitable definition for the electrical con- the dc value are not appreciable at low frequen-
ductivity of an isotropic material is provided by cies but may become significant for microwave
Ohm's law. This is the statement that the or higher frequencies.
direct-current density J within a conductor is Gases can conduct electricity if they are
proportional to the dc electric field E. At a ionized. Practical applications include discharge
given temperature and pressure, the constant of tubes, electronic vacuum tubes, and the arc
proportionality is the electrical conductivity a, discharge. Natural ionization of the atmosphere
thus: results from cosmic rays or radioactive sources
in the ground. The conductivity of the atmo-
J = aE sphere is quite low at ground level but it in-
creases rapidly with altitude up to 50 km be-
If the material is anisotropic, the magnitude of cause of the greater cosmic radiation and the
J depends not only on the magnitude of E but lower density of scattering centers.
on its direction as well. J and E are then non- Many liquids or solutions known as electro-
parallel for some orientations of the material, lytes (besides a few solids) can be decomposed
and a is a tensor of the second rank. Since the by an electric current into charged particles
ability of a material to conduct electricity is in- called anions and cations. Such processes in-
fluenced by the mechanisms resisting the flow volve a transfer of matter through the conduc-
of charge, it is also helpful to work in terms of tor. The conductance of a solution is defined as
the specific electrical resistivity p, which is the the current flowing per unit charge applied to
reciprocal of a. the electrodes immersed in the solution and
If a voltage V between the ends of a conduc- per unit concentration of electrolyte between
tor of length Q and uniform cross-sectional area the electrodes. It is dependent on the number
A maintains a current I through the conductor, and mobilities of the ions in the solution. In
Ohm's law may be set in a practical form by geophysics there is an important field of ac-
combining the relations: tivity involving electrical conductivity mea-
I surements of the surface layers of the crusts of
V 1 the earth and the moon. But in laboratory
J = aE, E=- J=- a=-.
Q' A' p physics a principal area of activity lies in
221 CONDUCTIVITY, ELECTRICAL

solid-state physics, and the remainder of this gained in this way is just as steadily returned to
article will be devoted to such work. the lattice as Joule heat via collision processes
Solids may be classified in various ways with it. The current density J is then -nev for
(according to their binding, ductility, crystal- an electron density of n per unit volume and
line or amorphous nature, etc.) but a particu- drift velocity v. The mean-free path between
larly convenient one considers their conduction collisions commonly exceeds 100 interatomic
properties. Two distinctive qualities are of spacings at room temperature, or even 10 7
interest: (i) the magnitude of a or P at a suit- spacings in pure metals at low temperatures.
able comparison temperature (say, room tem- An assumption that the collisions are elastic
perature), and (in the temperature variation of permits the use of a relaxation time r, at least
a or p. Three broad classes of solids may thus under the conditions pertaining at high tem-
be characterized: peratures for thermal scattering or at low tem-
(1) Metals having high conductivities (when peratures for impurity scattering. r is a quantity
pure) with specific resistivities at room tem- inversely related to the probability per unit
perature lying in the range 1.6 X 10-8 !"lm (for time of an electron undergoing a collision. A
silver) to 140 X 10-8 !"lm (for manganese and basic equation of the form P = m/(ne 2 r), in
plutonium). Less pure metals and alloys may which m is closely related to the electron mass,
have resistivities up to 1000 times bigger than can be derived without undue difficulty, so that
these, while very pure metals at liquid helium the major barrier to calculating the magnitude
temperatures may have resistivities 105 times of P or a of a metal revolves about understand-
smaller. In general, the resistivities of metals in- ing and evaluating r and its temperature de-
crease with temperature. Also the effect of add- pendence.
ing small amounts of impurities is that of adding A conceptual hurdle here is how can an elec-
a temperature-independent contribution to the tron proceed more than a few atomic spacings
resistivity (Matthiessen's rule). If PT is the without being scattered by the massive, closely
thermal resistivity and Po is the impurity con- spaced lattice ions? The reason is that the
tribution, the total resistivity p = PT + Po. lattice is the source of a periodic electrostatic
(2) Semiconductors which have much lower field and that the electron waves are modulated
conductivities than do metals, with resistivities by a function having the same period. It can
in the range 10- 5 to 105 !"lm. In contrast to then be shown that such waves are propagated
metals, their resistivities decrease with rising with no loss of energy if the lattice is perfectly
temperature and very rapidly with the addition periodic. In practice, the lattice potential is
of impurities. never perfect, for it is disturbed by both
(3) Insulators whose electrical conductivities thermal vibrations and impurity atoms or phys-
are lower still, with resistivities ranging from ical defects. The thermal vibrations are quan-
106 to 10 16 !"lm. The feeble conductivity is tized with discrete energy values called phonons.
little affected by impurity additions, but it im- For temperatures exceeding the Debye tem-
proves rapidly as the temperature is raised. perature, the phonon density and hence the
The basic differences between these classes electron-phonon scattering and the resistivity
can be understood in terms of atomic and increase almost in proportion to T. At very low
quantum-mechanical principles which explain temperatures, PT ex T5. This is directly related
the varying degree of availability of free elec- to the T3 variation in the Debye phonon spec-
trons or mobile ions for conduction purposes. trum which gives a T3 specific heat at low tem-
The main features are outlined below. An addi- peratures. Obtaining realistic estimates of the
tional distinct class, that of superconductivity, resistivity magnitudes for metals other than the
which is a spectacular quantum situation in alkali metals remains a matter of considerable
which the low-temperature state of a number complexity. A useful semiempirical equation is
of metals and a few semiconductors is one of the Griineisen-Bloch relation because 'it facili-
zero resistance, is not dealt with here (see tates the analysis and discussion of experimental
SUPERCONDUCTIVITY). data. It represents the variation of the thermal
Metals are characterized by their high density resistivity of a wide selection of metals rather
of free conduction electrons which transport well. In some multivalent metals, but more
negative charge (-e) through the interstices of importantly in semiconductors, electrons be-
the crystal lattice composed of positive ions. have in a way which can be described by the
The origin of the conduction electrons is some displacement of positive charge carriers called
or all of the valence electrons from the pre- holes. Experimental results are sometimes dis-
viously neutral atoms. In the absence of an cussed as if the current arises from the flow of
electric field the conduction electrons have electrons and holes.
high-speed random motions (~106 ms- I ), and Magnetic metals have additional resistive ef-
there is no directed charge flow in any particular fects due to scattering from localized-spin
direction. But when a field is applied, they ac- assemblies. An important field is dynamic co-
quire a steady net drift, of much lower speed operative phenomena using the divergence of
than their kinetic speeds, in exactly the opposite dp/dT at magnetic critical points as a tool.
direction to the field. Part of the kinetic energy Another active field is the Mott transition
CONDUCTIVITY, ELECTRICAL 222

whereby certain materials can be switched from Cross-references: ELECTRICITY, FERMI SURF ACE,
a metallic to an insulating condition using HALL EFFECT AND RELATED PHENOMENA,
small changes of pressure, temperature, or PHOTOCONDUCTIVITY, SEMICONDUCTORS,
electron-to-atom ratio. Resistivity is also used SOLID STATE PHYSICS, SUPERCONDUCTIVITY.
to study atomic order-disorder and crystal
phase transitions, and in the study of defect
production and migration. Its temperature de-
pendence is used for thermometry and its
strain dependence for strain gauges. Its magnetic-
field dependence is the basis of the major field
of magnetoresistivity. When used together with CONSERVATION LAWS AND SYMMETRY
the related Hall effect, valuable information is
provided on the effective sign, number, and Among the most basic of the laws of nature are
mobility of the charge carriers in semiconduc- the conservation laws. A conservation law is a
tors, while in metal single crystals certain de- statement saying that in a given physical system
tails of the FERMI SURFACE can be deduced. under specified conditions, there is a certain
Insulators and pure semiconductors have no measurable quantity that never changes regard-
free electrons available at 0 K for conduction. less of the actions which go on within the sys-
Diamond, silicon, and germanium are typical ex- tem. One of the tasks of physics is to determine
amples. All their four valence electrons are which properties of a given system are actually
fully occupied in forming chemical bonds in conserved during the course of specific types of
the solid. Raising the temperature energizes, interactions.
and frees for conduction duties, a small frac- In classical (pre-quantum and pre-relativity)
tion of these electrons. The empty energy physics the following conservation laws were
states left behind (holes) also aid in conduc- known:
tion. The fraction of carriers is -1 in 10 9 for (1) Conservation of Mass. In a closed system
Si and Ge at room temperature; the number the total mass is constant.
varies approximately as T3/2 exp(-tJ.E/2kT) (2) Conservation of Energy. In a closed sys-
where M is the energy to excite a bound elec- tem the total amount of energy is constant. (In
tron and k is the Boltzmann constant. M is relativistic physics these two laws are identical
-5.2, 1.2, and 0.75 eV for diamond, silicon, due to the equivalence of mass and energy.)
and germanium respectively. The chief differ- (3) Conservation of Momentum. The total
ence between the behavior of diamond (a momentum of a system is constant if there is no
typical insulator) and pure Si and Ge (typical outside force acting on the system. (The mo-
semiconductors) is the greater ease with which mentum of an object is defined as the mass
temperature can induce conduction in the multiplied by the velocity; the total momentum
latter. The conduction of semiconductors, but of a system is the vector sum of all the individ-
not of insulators, is readily improved by adding ual momenta of the parts.) This means that the
certain impurities. This has the effect of intro- internal forces within the system have no
ducing electrons (n-type) or of producing holes effect on the total momentum.
(p-type). Such materials are termed extrinsic, (4) Conservation of Angular Momentum. The
or impurity, semiconductors. Their great prac- total angular momentum of a system is con-
tical application is in transistors and diodes. stant if there is no torque acting on the system
(See SEMICONDUCTORS and SOLID-STATE PHYS- from without. (The angular momentum of an
ICS.) Conduction in many semiconductors and object relative to a point 0 is its momentum
insulators is also increased by the photoelec- multiplied by the perpendicular distance be-
tric action of incident light radiation (see tween its line of travel and the point 0.)
PHOTOCONDUCTIVITY). Historically these laws arose out of a philo-
sophical belief that the universe was created
G.T.MEADEN with a definite amount of motion which re-
mained unchanged following the original cre-
ation. As a result of the attempt to clarify what
kind of "motion" was conserved, the concepts
References of momentum and kinetic energy were devel-
Blatt, F., "Physics of Electronic Conduction in Solids," oped. As early as the seventeenth century
New York, McGraw-Hill, 1968. Huygens recognized that both momentum and
Kittel, C., "Introduction to Solid State Physics," kinetic energy were conserved in the collisions
Fifth Edition, New York, Wiley, 1976. of elastic balls. During the nineteenth century
Meaden, G. T., "Electrical Resistance of Metals," New the existence of various "forms" of energy was
York, Plenum, 1965; London, Iliffe, 1966. recognized, and the more general law of con-
Meaden, G. T., "Conduction Electron Scattering and servation of energy arose out of measurements
the Resistance of the Magnetic Elements," Can· involving reactions in which energy was trans-
temporary Physics, 12,313-337 (1971). formed from one form to another (e.g., me-
Ziman, J. M., "Electrons and Phonons," Oxford, chanical, thermal, electrical). The measure-
Clarendon Press, 1960. ments showed that within certain limits of
223 CONSERVATION LAWS AND SYMMETRY

accuracy, the total amount of energy in a terms of a potential energy function (more pre-
closed system was unchanged by any of the re- cisely a mathematical function called the Hamil-
actions tested. tonian of the system). If the potential energy
From the modern point of view, it is not of the system is known for any position of
necessary to make measurements involving large- these objects in space, then we can predict the
scale systems, for the macroscopic behavior of future motion of the objects in the system.
matter results from the interactions between A detailed solution of the equations of motion
relatively few types of elementary particles. will describe the position and velocity of each
Therefore it is sufficient to investigate the con- particle in the system at any time during their
servation laws as they apply to the basic inter- interaction. However, certain general predic-
actions between fundamental particles. tions can be made without going through the
At present only four fundamental types of complete solution of the problem, if there
interactions have been recognized: the gravita- exist certain symmetries of space and time. The
tional, the weak nuclear, the electromagnetic, following examples illustrate the various geo-
and the strong nuclear force. Each of these metrical or space-time symmetries encountered
interactions individually obeys the classical in classical physics.
conservation laws. As a result those laws must (1) If the potential energy function does not
be obeyed in any kind of action involving inter- depend explicitly on one of the space coordi-
actions between particles. This rule, of course, nates, then the component of momentum as-
applies to every activity in the universe. sociated with that coordinate never changes-it
For example, when we compress a spring, the is a constant of the motion, and thus obeys a
potential energy of the spring is increased. The conservation law. Particular situations most fre-
modern picture visualizes the energy as stored in quently encountered are as follows :
the electric fields between the atoms of the (a) An object moves in a three-dimensional
spring as they are pushed closer together. Thus, space where its potential energy is a constant.
the spring's potential energy is ultimately of an That is, the expression describing the potential
electrical nature. does not explicitly contain the coordinates
The development of a conservation law is x, y, or z, so it does not make any difference
seen to depend on a combination of theoretical where the origin of the coordinate system is
concept and experimental measurement: The located. This means that the description of the
scientist forms in his mind an abstract concept system is invariant with respect to a translation
of a physical quantity such as energy which can of the origin of the coordinate system in any
be measured by a given set of operations. direction. As a result of this symmetry the
Measurements then show that (within limitation momentum of the object in all three dimensions
of error) this quantity is conserved under a is constant. In technical terms, conservation of
given set of conditions. Modern measurements linear momentum is associated with translational
have been able to verify the conservation laws symmetry (or homogeneity) of space.
to very high degrees of accuracy. Conservation (b) An object moves in a world which is
of ene~y has been verified to within I part out flat, so that the force of gravity is in the verti-
of 10 1 , using the Mossbauer effect. (See Ref- cal (z) direction. The potential energy depends
erence 1.) on the height of the object above the ground,
An important function of the conservation but does not depend on its location in the
laws is that they allow us to make many pre- horizontal plane. That is, the description of the
dictions about the behavior of a system without system is invariant with respect to a translation
going into the mechanical details of what hap- of the coordinate system in the x-y plane. Since
pens during the course of a reaction. They give there is symmetry in the x-y plane, the object's
us a direct connection between the state of the momentum is conserved as far as motion in
system before the reaction and its state after that plane is concerned, but is not conserved in
the reaction. In particular we can say that any the z direction.
action which violates one of the conservation (c) Two spherical bodies interact in such a
laws must be forbidden. For example, many way that the potential energy depends only on
problems involving rotational or orbital motion the distance between the two bodies. This inter-
are solved very simply by noting that the action has spherical symmetry, and the system
motion must be such that the angular momen- is invariant with respect to a rotation of the co-
tum of the system remains constant. No further ordinate system about any axis; i.e., it is iso-
information concerning the forces or accelera- tropic. In spherical coordinates there are two
tions are required . angle variables, so there are two components of
With the development of the Hamiltonian angular momentum to be conserved. As a re-
method of solving physical problems, and par- sult the two bodies orbit around their common
ticularly with the growth of importance of center of mass in such a way that the magni-
quantum mechanics, it has become clear that tude of the total angular momentum is con-
the conservation laws are closely connected stant, while the plane of the orbit in space never
with the concept· of symmetry in nature. This changes. In other words, conservation of angular
is based upon the fact that the interaction be- momentum is due to the isotropy of space.
tween two or more objects can be described in (2) If the interaction between two objects
CONSERVATION LAWS AND SYMMETRY 224

does not depend explicitly on the time coordi- to do with the constancy of certain essential
nate, then the actions which take place do not numbers during the course of particle reactions.
depend on when we start measuring time. That These "number laws" are as follows:
is, the properties of the system are invariant (l) Conservation of Electric Charge. If P is
with respect to a translation of the origin along the number of positive charges in a system, and
the time axis. As a result of this symmetry it is N is the number of negative charges, then
found that the total energy of the system is Q = P - N is the net number of charges. The
conserved. In other words, conservation of charge number Q is unchanged by any reaction.
energy is associated with a symmetry in the For example, the creation of a positive charge
time dimension. must always be accompanied by the formation
Use of a four-dimensional coordinate system of an equal negative charge (e.g., an electron-
in accordance with Einstein's principle of rela- positron pair is created by a high-energy
tivity allows us to combine both space and photon). Conservation of electric charge is as-
time symmetries into a single space-time sym- sociated with a symmetry property of Max-
metry. With this scheme the three dimensions well's equations known as gauge invariance,
of space and the one dimension of time make which states that the absolute value of the
up a single four-dimensional space. Analogously, electric potential (as opposed to the relative
energy is regarded as the fourth component of a value) plays no part in physical processes. In
four-dimensional vector whose first three com- quantum field theory conservation of electric
ponents are the three components of momen- charge is connected with the fact that the
tum. The symmetries associated with translation properties of a system of particles do not de-
and rotations in this space-time continuum are pend on the phase of the wave function describ-
called Poincare symmetries. ing the system .
With the rise in importance of elementary . (2) Baryon Conservation. Baryons are a class
particle physics, a new type of symmetry has of elementary particles including the proton,
proven very valuable. These are "internal sym- the neutron, and several heavier particles such
metries" -symmetries involving the internal as the lambda, the sigma (plus, minus, and
properties of particles. The general philosophy neutral), and the omega (minus). Baryons are
underlying the study of elementary particle particles that interact with the strong nuclear
interactions is that anything can happen as long force. Each baryon is given a baryon number I,
as it is not expressly forbidden by a law of each corresponding antibaryon is given a baryon
nature. Among elementary particles there are a number -I, while the light particles (photons,
vast number of conceivable reactions that electrons, neutrinos, muons, and mesons) are
might take place. However, most of these re- given baryon number O. The total baryon
actions are forbidden by "selection rules," number in a given reaction is found by alge-
which are essentially conservation laws. For ex- braically adding up the baryon numbers of the
ample, the total electric charge during any re- particles entering into the reaction. During any
action cannot change. This rule immediately reaction among particles the baryon number
forbids such reactions as the conversion of a cannot change. This rule ensures that a proton
neutron into a proton plus a neutrino. An elec- cannot change into an electron, even though a
tron must also be created to balance the neutron can change into a proton. Similarly, to
charge. create an antiproton in a reaction, one must
The study of symmetries and conservation simultaneously create a proton or other baryon.
laws is especially important in elementary par- Baryon conservation ensures the stability of the
ticle physics because the exact nature of the proton against decaying into a particle of
strong and weak nuclear interactions is not smaller mass. Both conservation of charge and
known, so one cannot make detailed predic- baryon conservation are absolute selection rules.
tions concerning the results of reactions involv- (3) Lepton Conservation. Leptons are a class
ing these forces . However, a knowledge of sym- of light particles that include electrons, neutri-
metry principles gives one a great amount of nos, and muons, as well as their antiparticles:
general information concerning these reactions, the positrons, antineutrinos, and antimuon.
so one can estimate the probability of each re- Each lepton is assigned a lepton number + I ,
action taking place. while each antilepton has a lepton number -I.
While a law such as conservation of energy is All other particles have lepton number zero. In
true for all interactions, a number of the in- any reaction the algebraic sum of lepton num-
ternal symmetries lead to conservation laws that bers is conserved . This rule determines the
do not apply to all of the four fundamental course of beta decay, muon decay, and other
interactions. Such symmetries are therefore reactions governed by the weak interaction.
called "approximate" or "broken" symmetries. (4) /sospin Conservation. Since the strong
The implication of the term is that undistorted nuclear force acting between two neutrons is
nature would be completely symmetrical, but found to be the same as the force acting be-
that the presence of certain forces leads to an tween two protons, as well as between a neutron
asymmetry, or breaking of the symmetry. and a proton, it is found useful to consider the
One important class of conservation laws has neutron and proton as two states of the same
225 CONSERVATION LAWS AND SYMMETRY

particle (the nUcleon). These two states are con- such reactions can provide no clue that will dis-
sidered to differ only by the different positions tinguish between the directions right and left.
of a vector property called the isospin (or iso- Prior to 1956 it was believed that all reactions
topic spin). This concept arises by analogy from in nature obeyed the law of conservation of
the fact that two electrons in an atom can exist parity, so that there was no fundamental dis-
in a state of spin "up" and spin "down." These tinction between left and right in nature. How-
two states are indistinguishable in the absence ever, in a famous paper by C. N. Yang and T. D.
of an external magnetic field because of sym- Lee it was pointed out that in reactions involv-
metry of space with respect to rotations around ing the weak interaction, parity was not con-
an arbitrary axis. Similarly, a proton is a served, and that experiments could be devised
nucleon with isospin "up" and the neutron is a that would absolutely distinguish between right
nucleon with isospin "down." In particle phys- and left. This was the first example of a situa-
ics, whenever a system can exist in a discrete tion where a spatial symmetry was found to be
state, characterized by a definite quantum broken by one of the fundamental interactions.
number, there exists a property (in this case The principle of charge conjugation sym-
isospin) that is conserved. In the absence of metry states that if each particle in a given sys-
electromagnetic interactions there is no differ- tem is replaced by its corresponding anti-
ence between the two isospin states because of particle, then nobody will be able to tell the
symmetry with respect to rotation in "isospin difference. For example, if in a hydrogen atom
space." Electromagnetic interactions make a dif- the proton is replaced by an antiproton and the
ference because of the charge on the proton. electron is replaced by a positron, then this
Isospin conservation implies equality of p-p, antimatter atom will behave exactly like an or-
n-n, and n-p forces, except for the effect of the dinary atom, if observed by people also made
electr011lagnetic force. Thus isospin conserva- of antimatter. In an antimatter universe the
tion is only an approximate symmetry. laws of nature could not be distinguished from
(5) Strangeness Conservation. Strangeness isa the laws of an ordinary matter universe.
property of elementary particles found useful However, it turns out that there are certain
to classify hyperons (particles more massive types of reactions where this rule does not
than nucleons) into families. Each particle is hold, and these are just the types of reactions
assigned a strangeness quantum number S which where conservation of parity breaks down. For
is related to the electric charge Q, the isospin example, consider a piece of radioactive material
number T, and the baryon number B by the emitting electrons by beta decay. The radio-
formula Q = T + (S + B)/2. (T = ! for a proton active nuclei are lined up in a magnetic field
and - ! for a neutron; other particles may have which is produced by electrons traveling clock-
T = 0 or 1, depending on the type.) Strangeness wise in a coil of wire, as seen by an observer
is conserved in reactions involving the strong looking down on the coil. Because of the asym-
interaction. The selection rules resulting from metry of the radioactive nuclei, most of the
strangeness conservation are very important in emitted electrons travel in the downward direc-
explaining why some reactions take place much tion. If the same experiment were done with
more slowly than others. similar nuclei composed of. antiparticles and the
A very important set of conservation laws is magnetic field were produced by positron cur-
related to symmetries involving parity (P), rent rather than an electron current, then the
charge conjugation (C), and time reversal (T). emitted positrons would be found to travel in
Parity is a property that is important in the the upward, rather than in the downward, direc-
quantum-mechanical description of a particle or tion. Interchanging each particle with its anti-
system of particles. It relates to the symmetry particle has produced a change in the experi-
of the wave function that represents the system. ment.
If the wave function is unchanged when the co- However, the symmetry of the situation can
ordinates (x, y, z) are replaced by (-x, -y, -z) be restored if we interchange the words "right"
then the system has a parity of + 1. If the wave I and "left" in the description of the experiment
function has its sign changed from positive to at the same time that we exchange each parti-
negative (or vice versa) when the coordinates cle with its antiparticle. In the above experi-
are reversed, then the system is said to have a ment, this is equivalent to replacing the word
parity of -1. During a reaction in which parity "clockwise" with "counterclockwise." When
is conserved, the total parity number does not this is done, the positrons are emitted in the
change. downward direction, just as the electrons in the
Changing the coordinates (x, y, z) into original experiment. The laws of nature are thus
(-x, -y, -z) converts a right-handed coordinate found to be invariant to the simultaneous ap-
system into a left-handed coordinate system. In plication of charge conjugation and mirror in-
terms of symmetry, the meaning of conservation version.
of parity is that in any situation where parity is Time reversal invariance describes the fact
conserved, the description of the reaction will that in reactions between elementary particles,
not be changed if the word "left" is changed to it does not make any difference if the direction
the word "right" and vice versa. This means that of the time coordinate is reversed. Since all re-
CONSERVATION LAWS AND SYMMETRY 226

actions are invariant to simultaneous application universe are the same and that they obey the
of mirror inversion, charge conjugation, and same physical laws in the farthest galaxy as
time reversal, the combination of all three is they do on earth. One of the goals of science
called CPT symmetry and is considered to be a then, is to determine the relationships that exist
very fundamental symmetry of nature. among these entities so that the set of indepen-
A new type of space-time symmetry has been dent fundamental quantities may be reduced to
proposed to explain the results of certain high- a minimum. Thus, 19th Century chemistry led
energy scattering experiments. Called "scale the way to the understanding of the vast com-
symmetry," it pertains to the rescaling or "dila- plexity of matter in terms of less than a hundred
tion" of the space-time coordinates of a system "irreducible" chemical elements. In the first
without changing the physics of the system. half of the 20th Century physics expanded the
(See Reference 5.) Other symmetries, such as realm of chemistry by the discovery of isotopes
chirality, are of a highly abstract nature, but and increased the number of atomic elements
aid the theorist in his effort to bring order into to perhaps a thousand, but at the same time
the vast array of possible elementary particle reduced the number of fundamental entities to
reactions. three-the proton, the neutron and the electron-
It is a temptation to say that "nature likes out of which the thousand different isotopes
symmetries" in order to prove the theoretical and billions of different molecules are built.
necessity of a conservation law. However it Physics is now undergoing a similar simulta-
must be realized that only human beings can neous expansion and reduction. The electron has
like anything. The search for symmetries in been joined by the muon, tauon, and neutrinos
nature leads to experiments that test the (and their anti-particles) and the proton and
theory. While a symmetry idea may suggest a neutron are now associated with pions, kaons,
conservation law, the conservation law must be and an array of "strange" and "charmed"
tested by experiment to see if nature really be- mesons and baryons. On the other hand there
haves that way in a given situation. are emerging theories of a more elementary par-
ticle-the quark- out of which the observed
MILTON A. ROTHMAN particles are "constructed," and the "grand
unification," which will describe gravitation,
References electromagnetism, and the strong and weak
nuclear forces as manifestations of a single
1. Rothman, M. A., "Discovering the Natural Laws : entity.
The Experimental Basis of Physics," New York, It is this universality that makes it meaningful
Doubleday & Co., Inc., 1972. to attempt to measure the properties of these
2. Rothman, M. A., "The Laws of Physics," New fundamental entities with all possible precision.
York, Basic Books, Inc., 1963 . Although the numerical values of such quantities
3. Swartz, C. E., "The Fundamental Particles," as the mass and electric charge of a proton, or
Reading, Mass., Addison-Wesley Publishing Co ., the velocity of propagation of electromagnetic
1965. waves are expected to be constant in time and
4. Sakurai, J. J., "Invariance Principles and Ele- space, our knowledge of those values is vari-
mentary Particles," Princeton, N.J., Princeton Uni- able and in general changes with each new
versity Press, 1964. measurement. There is no valid evidence how-
5. Jackiw, R., "Introducing Scale Symmetry," Phys· ever, at the present time, that these quantities
ics Today (January, 1972). are not indeed constant. It may be that the
values of the fundamental constants are chang-
Cross-references: ANTIPARTICLES; ELECTRO- ing over time scales comparable to the age of
WEAK THEORY; ELEMENTARY PARTICLES; the universe and that this so-called "age" is not
GAUGE THEORIES; GRAND UNIFICATION the indication of a time of beginning but is
THEORY; IMPULSE AND MOMENTUM ; IRRE- instead the time constant describing the rate of
VERSIBILITY; PARITY; POTENTIAL; QUANTUM change of an ever-existing, unending universe.
CHROMODYNAMICS; QUARKS; ROTATION- Should such a model indeed prove correct , it
CURVILINEAR MOTION; WEAK INTERACTIONS; would not i!J. principle change the concept of
WORK, POWER AND ENERGY. fundamental, invariable constants of nature.
Although it would fundamentally alter our
theoretical understanding, and change the
CONSTANTS,FUNDAMENTAL* models one uses for the description of the
universe, these changes would still be described
Perhaps the basic concept of modern physical in terms of fixed parameters, which are then
theory is that there are fundamental entities the fundamental constants.
underlying the structure of the universe, that Fundamental Units The existence of fixed,
the atoms and molecules which make up the constant values associated with atomic and
molecular physics has important implications
*Preparation of this article supported in part by for metrology and the establishment of units
National Bureau of Standards (US), Grant NB81- and standards. By the end of the nineteenth
NADA2087. century it had already been suggested that basic
227 CONSTANTS, FUNDAMENTAL

physical processes could be used to establish atomic definition of the second was interna-
standards for units of measurement. Since tionally adopted in 1967: "the second is the
atomic processes are uniquely reproducible and duration of 9192631770 periods of the radia-
universally available they can be better standards tion corresponding to the transition between
than arbitrary artifacts such as a metre bar or the two hyperfine levels of the ground state of
a prototype kilogram. Artifacts can change or the cesium-133 atom" (F = 4, mF = 0 ++ F = 3,
be damaged. Does the build-up of a surface film mF = 0).
or the absorption of atmospheric gases on the In the decade between 1972 and 1982 inter-
prototype kilogram change the size of the comparisons of the wavelengths of many ab-
international unit of mass? How can we know? sorption-stabilized gas lasers demonstrated their
On the other hand the mass of a proton, or of reproducibility at the level of parts in 10 10 with
a carbon atom in its ground electronic state, is stability of parts in lO ll . During the same period
invariant, the same everywhere and at all times. it became clear that the 86Kr transition defining
Similarly, one may consider the Bohr radius or the metre could not provide an unambiguous
the Compton wavelength of an electron as a standard of wavelength, because of its inherent
fundamental atomic unit of length. Because of spectral profile and lack of spectral purity,
the limitations of measurement, the use of such beyond the order of several parts in 10 9 • At the
microscopically small standards of length same time , nonlinear optical techniques were
would sacrifice accuracy in the comparison of being developed that provided efficient micro-
the standard with macroscopic lengths. The wave harmonic generation. These, coupled with
optical interferometer, however, allows one to frequency synthesis developments and the in-
compare lengths in terms of the wavelength of a herent monochromaticity and stability of ab-
monochromatic light source. Therefore in 1960 sorption-stabilized lasers, made possible the
the platinum-iridium prototype metre bar in direct comparison of the frequency of optical
use since 1889 was replaced by the definition transitions (A ~ 10- 4 cm) with that of the
of the metre as the length equal to 1650763.73 cesium clock (A = 3.26 cm). When both the
wavelengths in vacuum of the electromagnetic frequency and wavelength of a wave are known
radiation arising from the optical transition its velocity is simply c = At. In this way the ve-
2PlO-5d s in the isotope of krypton of mass 86. locity of light was found to be c = 299792458 ±
The unit of time (the second) is ordinarily 1.2 m/s. The precision of this result is limited
defined as 1/86400 of a day. As the available not by the measurements themselves but by the
precision of measurement and the demands of inherent uncertainties associated with the kryp-
science and technology increased, this defini- ton lamp. Such a situation implies that the
tion proved inadequate, since the period be- wavelength standard should no longer be kryp-
tween two successive noons changes throughout ton but should instead be a much more repro-
the year due to the nonuniform speed of the ducible and monochromatic laser. If, however,
earth in its orbit around the sun. The specifica- one can transfer frequency (or wavelength)
tion was made more precise, and the "day" was measurements from the optical region to the
defined for time-keeping purposes to be the microwave region and hence know the wave-
mean length of the day averaged over a full length of the radiation used to define the dura-
year. However, this definition also proved in- tion of the second, or the frequency of the
adequate because the rotation of the earth on radiation used to define the length of the metre,
its axis is not constant: as a result of tidal forces is it necessary to have two separate standards?
between the earth and the moon and of physical Should one not use a single atomic transition
changes in the distribution of mass over the to define both the unit of length and the unit
earth's surface during the year, the rate of rota- of time? In essence, why not define the metre
tion is decreasing by 0.03% per million years in terms of the second and an accepted value
and varying seasonally by as much as a part in for the velocity of light? This is exactly what
10 7 • In 1960 the international General Con- has now been done and in October 1983 the
ference on Weights and Measures therefore metre was redefined by adopting c = 299792458
adopted a definition of the second as "the frac- m/s as an exact quantity.
tion 1/31556925 .9747 of the tropical year for The unit of temperature is related in a less
1900 January 0 at 12 noon ephemeris time." direct manner to atomic processes, but since
This definition directly demonstrates one of the the liquid or gaseous state of matter is a result
problems associated with nonatomic standards. of intermolecular forces, the realization of the
The year (the time between successive vernal international temperature scale is in principle
equinoxes) is not a constant so that the mean also atomic, defined by fundamental constants,
motion of the earth in its orbit must be speci- and reproducible without reference to a specific
fied at a specific instant . With the development arbitrary artifact, by chosing the triple point of
of atomic clocks and the ability to make accu- water to have an assigned temperature of 273.16
rate and precise measurements of frequency it degrees Kelvin, exactly.
was clear that such clocks could provide con- Only the unit of mass is defined on the basis
venient and reproducible standards for the mea- of a man-made object; the kilogram is nothing
surement of time. Following the recommenda- more than the mass of the prototype platinum-
tion of the General Conference in 1964 an iridium kilogram kept in the laboratory of the
CONSTANTS, FUNDAMENTAL 228

Bureau International des Poids et Mesures in Planck radiation law (describing the distribution
Paris. of electromagnetic energy and how it changes
These units and standards, which define the with temperature) is he. In fact, the thermo-
international metric system of units ("Systeme dynamics of black-body radiation requires,
International" or SI), are not without signifi- solely on the basis of dimensional analysis, that
cance to those English-speaking nations who the constant he must exist independently of
have not yet adopted the metric system. There is quantum theory. From these three constants
no independent English system; the foot is in- one may construct a quantity with the dimen-
ternationally defined as 0.3048 m and the pound sions of a mass, m * = (fze/G )1/2 = 21.8 J,lg, a
is exactly 0.45359237 kg (see Table 1). quantity with the dimensions of length 1 * =
Universal Constants In view of the attempts (ItG/e 3 )1/2 = 1.62 X 10- 35 m, and a quantity
to replace arbitrary or man-made standards of with the dimensions of time, t* = 1*/e = (fiG/
measurement with standards based on atomic e5 )1/2 = 5.38 X 10- 44 s. The physical signifi-
phenomena it is appropriate to ask whether cance of these quantities is not clear. The length
there is a truly universal set of fundamental 1* and the time t* may be related to an ulti-
units of length, mass and time. The so-called mate quantization of space and time and may
Planck units are based on a recognition that the represent inherent limits of measurement, so
existence of the universe itself provides a basis that it would be impossible to distinguish two
for constructing such quantities. We may postu- points which are closer together in space or
late that the universe contains matter and time as separate points. The same kind of inter-
energy. The Newtonian constant of gravitation, pretation cannot be given to m *, an object of
G, is the fundamental quantity defining the inter- that mass is within the range of size visible to
action between matter in the large-without the eye, certainly easily seen in a microscope of
bringing into question its composition or inter- even low ma~nification, and would contain the
nal structure. Similarly, Planck's constant (di- order of 10 7 atoms! It remains for future
vided by 21T), -Ii, and the speed of electromag- theorists to provide a meaning to these funda-
netic radiation, e, are the fundamental quantities mental quantities.
associated with light and electromagnetic en- Experimental Determinations The only test
ergy. A more fundamental basis can indeed be one has of validity of physical theory is the
found. In the special theory of relativity the agreement between the results of measurements
constant e appears as a factor which unifies the and the predictions of the theory. Discrepancies
description of space and time. Its numerical between theory and experiment may be due to
value depends on the units in terms of which experimental errors or to misinterpretation of
one measures length and time. It has the dimen- the experimental results, or they can demon-
sions of a velocity (length/time) but its identi- strate the need for revision or rejection of the
fication with the velocity of electromagnetic theory and its replacement by a new theory.
radiation is a consequence of the theory and The difficulties inherent in measuring physical
the fact that the photon has zero rest-mass. In constants to an accuracy of a few parts per mil-
the general theor; of relativity the fundamental lion are great. Direct measurements of the mass
constant is G/e , determining the extent to of the electron or of the electronic charge are
which the energy density produces a curvature not as accurate as measurements which deter-
of space. The fundamental constant of the mine instead various combinations of these
quantities. Whereas R. A. Millikan was able to
measure the elementary charge on an electron
TABLE l. METRIC EQUIVALENTS. in 1912 to one part in a few thousand (and this
experiment can hardly be improved upon today),
1 inch 0.0254 m our current knowledge of the electronic charge,
2.54 cm with an accuracy of approximately I part in
1 foot 0.3048 m 500,000, comes from combining the ratio of
1 mile 1609.344 m the Faraday constant to the Avogadro constant
1 pound (avdp) 0.45359237 kg with measurements of the Sommerfeld fine
453.59237 g structure constant, the gyromagnetic ratio of
1 ounce (avdp) 28.349523125 g the proton, the Josephson effect, the realization
1 pound (troy) 373.2417216 g of the SI ohm, the magnetic moment of the
1 ounce (troy) 31.1034768 g proton relative to the Bohr magneton, and a
1 gallon (231 in 3 ) 3.785411784 L* half-dozen or so other measurements which
1 quart 0.946352946 L affect the final result to a greater or lesser de-
1 ounce (fl) 29.5735295625 cm 3 gree. In fact, the present knowledge of the
numerical values of all of the so-called funda-
mental constants of physics come from such
. NOTE These numbers are exact by definition.
For most calculations they may be rounded off
indirect measurements. Of these physical con-
considerably. stants, only the universal gravitational constant
The legal definition of length is in terms of G is measured independently of the others. In
the yard. 1 yard =0.9144 m. the first place, no theoretical relation is known
*L == litre = 1 dm 3 = 1000 cm 3 • which relates G to the other physical constants,
229 CONSTANTS, FUNDAMENTAL

and in the second place, the accuracy with constants in the sense that these estimates are
which G is known is at least an order of mag- the most accurate. For these reasons, least
nitude poorer than the accuracy of the atomic squares adjustment has been used for all of the
constants. significant determinations of the values of the
With the growth in our knowledge of natural fundamental physical constants over the past
laws and of the technical means of making pre- 35 years.
cise physical and chemical measurements, an Numerical Values The numerical values given
increasing number of relationships have been in Tables 2 and 3 are based on the most accu-
discovered between the fundamental constants rate and consistent measurements available as
of physics and chemistry. The situation regard- of June 1982. Significant measurements are the
ing our knowledge of these constants is a spider- frequency-voltage relation in a superconducting
web of interconnected data which can be vi- Josephson junction which determines the ratio
sualized as a bridge truss made up of elastic 2e/h; the Faraday constant, NAe; the ratio of
members, or, more simply, an interconnected the magnetic moment of the proton to the mag-
network of springs, in which the length of each netic moment of the electron J1.p/J1.e, or to the
member represents the experimentally measured magnetic moment of the muon, J1.p/J1.jl; the
relationship between constants, and the stiffness proton magnetic moment measured in units of
is a measure of the accuracy of this measure- the Bohr magneton J1.p/J1.B, or in units of the
ment. The problem is to determine the positions nuclear magneton J1.p/J1.N = (mp/me)J1.p/J1.B; the
of the nodes of this network. One recognizes quantized Hall effect in semiconductors which
that the alteration of anyone member will yields e 2 /h; and the Avogadro constant NA.
produce an effect which will be transmitted The spectroscopy of the energy levels of one-
throughout the entire structure. This will be electron atoms (positronium, muonium, hydro-
true whether we change the length of a member, gen, and deuterium) and of two-electron helium
or its stiffness, or remove it entirely. provide the basis for determining the Ryberg
In order to determine the values of the physi- constant and the Sommerfeld fine structure
cal constants from such an overdetermined set constant. An accurate determination of the fine
of data, it has become common to use the structure constant may also be obtained from
method of least squares. This can be considered the observed anomalous magnetic amount of
as equivalent to the problem of minimizing the the electron and (with less precision) from the
stored potential energy in our multidimensional anomalous magnetic moment of the muon. On
network. The fundamental requirement, how- the other hand, the agreement of the observed
ever, is basically one of establishing a method magnetic moment of the muon with that pre-
of analysis which is consistent and independent dicted by the theory of Quantum Electrody-
of the choice of variables used to describe the namics (QED) and the value of Ct: derived from
situation. The method of least squares not only the electron magnetic moment provides perhaps
does this but also provides a procedure which the most accurate verification of QED, and may
yields "best" estimates for the values of the be considered as providing a probe of the elec-

TABLE 2. DEFINED VALUES AND EQUIVALENTS.

Meter (m) the length of path traveled by light


in vacuum during a time of
1/299792458 of a second.
Kilogram (kg) mass of the international kilogram
Second (s) 9192631770 cycles of the radiation
of the hyperfine transition F = 4,
mF = 0 to F = 3, mF = 0 of the
ground state of the atom 133 Cs.
Degree Kelvin In the thermodynamic scale,
(K) 273.16 K = triple point of water
WC) = T(K) - 273.15
(freezing point of water,
0.0000 ± 0.0002°C)
Unified atomic +, the mass of an atom of the 12C
mass unit (u) nuclide
Standard 9.80665 m/sec 2
acceleration of 980.665 cm/sec 2
gravity (gn)
Normal atmosphere 101325 N/m 2
(atm) 1013250 dyne/cm 2
Thermochemical 4.184 J
calorie (calth) 4.184 X 10 7 erg
CONSTANTS, FUNDAMENTAL 230

TABLE 3. GENERAL PHYSICAL CONSTANTS a

Constant Symbol Value

Speed of light c 299792458 m . s-l


Gravitational constant G 6.673 X 10-11 N . m 2 . kg-2
Elementary charge e 1.60218 X 10-19 C
Avogadro constant NA 6.0221 X 10 23 mol- 1
Mass unit u 1.66054 X 10-27 kg
Electron mass me 9.1094 X 10-31 kg
5.4858 X 10- 4 u
Proton mass mp 1.67263 X 10-27 kg
1.00727647 u
Neutron mass mn 1.67493 X 10-27 kg
1.0086649 u
Faraday constant F 96485 C . mor 1
Planck constant h 6.6261 X 10-34 J . s
11 = h/2rr 1.05458 X 10-34 J . s
Fine structure constant C\( 0.00729735
C\( -1 137.0360
Josephson Frequency 2e/h 483594 GHz/V BI
483597 GHz/V
Magnetic flux quantum <1>0 2.06785 X 10-15 T . m 2
Rydberg constant Roc 10973731.5 m- 1
Bohr radius ao 0.529177 X 10-10 m
Compton wavelength he = h/mec 2.42631 X 10-12 m
Electron radius re 2.81794 X 10-15 m
Thomson cross section 8rrre 2 /3 6.65246 X 10-29 m 2
, 26751.5 X 104 s-l r1
Gyromagnetic ratio of protons in H2O 'Yp
'Yp' /2rr 42.5763 MHz' T- 1
Bohr magneton J.l.B 9.27406 X 10-24 J . T-1
Nuclear magneton J.l.N 5.05081 X 10-27 J . r 1
Proton magnetic moment in H2 O J.l.p'/J.l.B 1.5209931 X 10-3
J.l.p'/J.l.N 2.792776
Free proton magnetic moment J.l.p/J.l.B 1.5210322 X 10-3
J.l.p/J.l.N 2.792848
J.l.p 1.410615 X 10-27 J . T-1
First radiation constant 8rrhc 4.9925 X 10-24 J . m
2rrhc 2 3.7418 X 10-16 J . m 2 8- 1
Second radiation constant hc/k 0.014388 m . K
NAhc 0.1196266 J. m 1 . mol- 1
Gas constant R 8.3145 J . r l • mol-1
Boltzmann constant k 1.38066 J . K- 1

aBased on an analysis of the data available June 1982. The numerical values are expected
to be accurate to within one or two units in the last digit given. All quantities are in SI
(Systeme International) units:
C = coulomb N = newton
G =gauss T = tesla
Hz = hertz (cycles per second) u =atomic mass unit
J = joule W =watt
K = kelvin (degrees) Wb =weber
231 CORIOLIS EFFECT

tromagnetic interaction to distances as small as constant speed in a straight line in a rotating co-
10-20 m. ordinate system is equal in magnitude to the ap-
parent acceleration just described. An expres-
E. RICHARD COHEN sion for it may be derived simply and quite
rigorously, but not generally, for the case of a
References body moving with constant radial speed in a
rotating system. It consists of two distinct com-
1. Cohen, E. R., Crowe, K. M., and DuMond, J. W. M.
ponents both easily eva~uated for the case men-
"The Fundamental Constants of Physics," New tioned. One component arises from the change
York, Interscience Publishers, 1957.
2. Rossini, Frederick D. "Fundamental Measures and
in direction of the radial velocity of the body,
Constants for Science and Technology," Cleveland, the other from its change in tangential velocity
Ohio, CRC Press, 1974. due to changing distance from the center of
rotation (see ROTATION-CIRCULAR MOTION).
3. Taylor, B. N., Parker, W. H., and Langenberg, D. N.
"The Fundamental Constants and Quantum Elec- Suppose the rotating system has a constant
angular velocity w, and the body moves radially
trodynamics," New York, Academic Press, 1969. with constant speed u. It is initially at distance
rl from the center of rotation with velocity VI .
Cross-references: ELEMENTARY PARTICLES; FRE- After a small time interval At, it is at distance
QUENCY STANDARDS; GRAVITATION; MEA- r2 with velocity V2 (see Fig. 1). The two veloc-
SUREMENTS, PRINCIPLES OF; PHOTOMETRY; ities VI and V2 are related since the final veloc-
PHOTON; QUANTUM THEORY; QUARKS; RELA- ity V2 is the vector sum of the initial velocity
TIVITY; SPECTROSCOPY; SYMBOLS, UNITS AND VI and the change in velocity Av. Refer to Fig.
NOMENCLATURE IN PHYSICS; TIME. 2 for a vector diagram of the preceeding state-
ment (see STATICS). If we consider instanta-
neous values, At approaches zero, the angle
wAt approaches zero, and the chord Au ap-
proaches its arc in length. We may then use the
CORIO LIS EFFECT well known angle-arc relationship (arc length) =
(radius) (angle in radians), and write
A marksman fires his rifle due north. In the
absence of wind, he might well expect it to Au = uw At, or Au/At = wu
travel in a straight line and land due north of
him. But will it? The physicist would, in gen- Now Au/At is the acceleration component (al)
eral, answer no on the basis that the earth is resulting from the change in direction of v.
rotating and is not, therefore, an inertial frame The tangential velocity of a body equals the
of reference (see ROTATION -CIRCULAR MO- product of its angular velocity and its radius of
TION). G. G. Coriolis first analyzed this effect rotation. Equating changes in these quantities
in 1844, and he is acknowledged in its name. yields
For large artillery projectiles this effect may be
significant, but for hand carried weapons it is Aut = w(r2 - rd = wAr
usually negligible. For instance, a typical .22-
caliber rifle bullet might be horizontally de- Or, dividing by At
flected 0.2 meter in traveling one kilometer.
That a projectile will normally follow a path Aut/At = wAr/At
which is curved in the horizontal leads con-
versely to the idea that to follow a straight path
over the rotating earth requires the application
of a sidewise force. Even though this appears to
violate Newton's first law (see DYNAMICS), it is
perfectly true in a non-inertial system-hence,
the reason that this force is sometimes called
"fictitious. "
A body which is moving with constant speed
in a straight line in an inertial system is not ac-
celerating and is subject to no net force. An
observer in a rotating coordinate system will,
however, observe the same object to follow a
curved path. The observer may treat this appar-
ent deflection from a straight line as an acceler-
ation which is always perpendicular to the path
of the object. It is called the Coriolis accelera-
tion. The apparent force applied to the body to w
cause the deflection from a straight path into a
curve is called the Coriolis force.
The true acceleration of a body moving with FIG. 1
CORIOLIS EFFECT 232

tude and v is the true radial speed with respect


to the earth's axis.
V/Vh = sin </>
v = Vh sin </>

and the Coriolis acceleration becomes


ae = 2WVh sin </>

and the Coriolis force is


Fe = 2mwvh sin </>

The Coriolis effect applies to any object mov-


ing on the surface of the earth, and a more gen-
eral treatment will show it to be completely
AV independent of the direction of motion. The
quantity 2w sin </> is commonly known as the
FIG. 2 Coriolis parameter. Since the earth rotates 21T
radians in 24 hours, or at a rate of 7.27 X 10- 5
Now flvtJflt is the acceleration component (a2) radians/sec, the parameter is quite small, and
equals exactly 10-4 sec-I at a~o~t 43!O lati-
arising from changing tangential velocity, and tude. This small value for the Conolis parameter
flr/flt is the radial speed v. Hence a2 also equals
means that in everyday life its effects are small
WV.
and go largely unnoticed. For instance, the
Since both al and a2 lie in the same direction, Coriolis force on an automobile driving at turn-
being perpendicular to the radial velocity and pike speeds might typically be five newtons.
to the right in the figure, their magnitud.es may The acceleration can cause considerable deflec-
be added together with the sum equallllg the tion of long range artillery, however, and ap-
magnitude of the Coriolis acceleration ae propriate corrections must be made. .
al + a2 = ae = 2wv The Coriolis effect plays a large role III the
great mass movements of the oceans and at-
Using Newton's second law F = rna, the Coriolis mosphere. In the northern hemisphere, the ap-
force is parent deflection is always to the right of the
direction of motion. For example, air drawn
Fe = 2mwv toward a center of low pressure is deflected to
the right and eventually flows around the low
The earth is not, of course, a rotating plane. pressure area in a counterclockwise motio~. This
The Corio lis acceleration reaches a maximum at motion characterizes frontal storms tYPical of
the poles and vanishes at the equator. If Vh de- temperate climates. If the pressure force and
notes horizontal velocity on the earth's surface, Coriolis force are equal, the resulting wind ve-
the resulting Coriolis acceleration may be eval- locity is said to be geostrophic (Le., "turned by
uated by reference to Fig. 3 where </> is the lati- the earth"). Above one kilometer, the winds are
closely geostrophic.
Though it has been only a little over a cen-
N. POLE tury since the first analysis of this effect, the
Coriolis force has through the centuries influ-
enced man's environment through its control of
the motions of winds and waters, and hence the
distribution of the sun's heat over the earth.
Ancient man, not aware of the motion of his
planet, was nevertheless profoundly influenced
by it.
JULIAN M. PIKE

~------------~EQUATOR
References
Byers, Horace Robert, "General Meteorology," fourth
edition, New York, McGraw-Hill Book Co., 1974.
Coriolis, G. G., "Traite de la Mecanique de Corps
Solides," Paris, 1844.
Fowles, Grant R., "Analytical Mechanics," third edi-
tion, New York, Holt, Rinehart and Winston, 1977.
FIG. 3 Stephenson, Reginald J., "Mechanics and Properties
233 COSMIC RAYS

of Matter," third edition, New York, John Wiley studied extensively, using detectors carried to
& Sons, Inc., 1969. great heights by means of balloons, rockets and
spacecraft. The charge spectrum of "normal"
Cross-references: DYNAMICS; MECHANICS; ROTA- (solar system) material, accounted for by nu-
TION-CURVILINEAR MOTION. cleosynthesis in the big bang followed by pro-
cessing in stars, is characterized by strong
contrasts. The odd-Z elements are much less
COSMIC RAYS abundant than those with even Z, the light
elements Li, Be, and B are practically absent,
Cosmic rays are high energy subatomic particles and so forth. The cosmic-ray charge spectrum
which populate the interstellar space within shows much less contrast. The differences can
galaxies, and to some extent between galaxies. largely be accounted for by spallation of primary
The primary cosmic rays are atomic nuclei and heavy nuclei in collisions with interstellar hy-
electrons which have been accelerated by the drogen. The path length required to explain the
action of electric and magnetic fields in a hier- anomalous composition of the nuclei arriving
archy of astrophysical processes. Only a few of at the earth (the so-called "escape length") is
these processes have been clearly identified. about 7 g/cm 2 below 1 GeV per nucleon, de-
The primary particles produce secondary cosmic creasing to less than I g/cm 2 above a few hun-
rays in collisions with other forms of matter. dred GeV per nucleon. The abundances of the
The kinetic energy of these particles is typi- primary elements (those which are actually
cally 1-10 GeV; they have far more energy accelerated) show a suggestive correlation with
than the alpha, beta, and gamma rays emitted the first ionization potential of the atoms.
by radioactive atoms. Collisions between cosmic Studies carried out by measuring artificial etch-
rays and target nuclei release enough energy to pits in meteoritic crystals have shown that the
create mesons, both directly and via decay. cosmic-ray charge spectrum extends beyond Fe
Most of the cosmic rays which reach the earth's at least to Z about 80, the vicinity of Pt and Pb
surface are in fact muons from the decay of in the periodic table. These "ultra-heavy" nuclei
charged pions. The pions are produced high in are especially interesting probes since their in-
the atmosphere by collisions between primary teraction mean free paths are much shorter
cosmic rays and the nuclei of oxygen and nitro- than the escape length. The conditions they
gen atoms. require in order to be synthesized are also more
Investigations of the unexpected behavior of restrictive than for lighter nuclei. Large counter
these muons, and of other cosmic-ray phenom- systems and large sheets of etchable plastic
ena, stimulated the development of large ma- are now being used to study the ultra-heavy
chines for producing "artificial cosmic rays," component.
and evolved directly into the discipline now Measurements have also been made of isotopic
called elementary particle physics. Many of the abundances in cosmic rays. In case of primary
experimental devices that are used in this new constituents it is expected that the results will
field were originally developed to study cosmic provide especially clear evidence regarding the
rays. They include Geiger-Muller and propor- varieties of nuclear processing that the cosmic-
tional counters, and devices for delineating ray material has been through prior to its ac-
tracks, descendents of the Wilson cloud chamber. celeration. One of the first findings is that the
Positrons, muons, charged pions, kaons, the abundance ratio 22Nej2oNe (extrapolated to
first hyperons-all were discovered using cosmic the source) is significantly greater than for solar
rays. system material. However in other cases (26Mg/
Because the cosmic-ray energy spectrum ex- 24 Mg and 30Si/ 28 Si) the abundances are not
tends, roughly as an inverse power law, to ener- dissimilar.
gies much greater than accelerating machines The stable isotopes of secondary constituents
can duplicate, a good deal of activity still centers provide information on the escape length and
on the use of cosmic rays for studying elemen- its energy dependence. The unstable isotopes
tary particle physics. With the advent of proton- provide a clock for measuring the average time
antiproton colliders achieving TeV center-of- spent in traveling from the sources to the solar
mass energies, this kind of work will be restricted system. By combining the lifetime with results
henceforward to cosmic-ray energies greater on the path length one can obtain the density
than 1015 eV. The intensity of such cosmic rays of the scattering material, averaged over those
is so small that it will be difficult indeed to do parts of the galaxy through which the cosmic
worthwhile experiments. At the present time rays have propagated. Results of this kind have
much cosmic ray work is devoted to a second come mainly from the isotope lOBe (half-life
application, obtaining information about the l.6 X 106 years), and they are limited thus far
astrophysical setting in which cosmic rays are to energies less than I GeV per nucleon. They
produced and propagate. indicate that the lifetime is at least 10 7 years.
Cosmic rays provide the only directly acces- This value, which is supported by data on the
sible sample of matter from outside the solar electron and positron energy spectra, described
system. Hence the chemical composition (nu- below, is unexpectedly large. It implies a mean
clear charge spectrum) of this material has been density of scattering material equal to 0.3 g/cm 3 ,
COSMIC RAYS 234

substantially less than the accepted value for Work on cosmic 'Y-rays is further advanced.
interstellar matter in the galactic disk. These One of the original objectives was to determine
results indicate that cosmic rays spend much of the spatial distribution of cosmic-ray flux in
their lifetime in regions of lower than average this galaxy, using the 'Y-rays from decay of 7To
density, perhaps outside the disk. mesons produced by collisions of nuclei with
Attempts to detect nuclei of antimatter in interstellar gas and dust. Progress has been
cosmic rays have shown that for charge num- made, but this is proving to be more difficult
bers Z ~ 3 their abundance is less than 10- 4 . than expected because of competition from
Antiprotons have been found in two different 'Y-rays produced by cosmic-ray electrons. Two
energy bands. They appear to be secondaries mechanisms are involved, bremsstrahlung and
produced in the sources or in passage through collisions with low energy photons (microwave
the interstellar medium. If the anomalously through infrared to visible). It is found that the
high intensity in the lower of these bands is disk-shaped region in which electrons are pres-
confirmed it may indicate that cosmic-ray ent is considerably thicker than the disk con-
protons have a different life history than the taining most of the gas and dust of the galaxy.
heavier nuclei. With their limited angular resolution, the instru-
About 1% of the cosmic rays that strike the ments used to date respond mainly to the elec-
earth are electrons. High energy electrons have tron-initiated 'Y-rays. However these 'Y-rays are
difficulty in propagating very far in the galaxy themselves a source of important astrophysical
because they lose energy in producing synchro- information.
tron radiation under the influence of the mag- The most abundant cosmic rays, having ener-
netic fields that are present. On the other hand, gies on the order of I-lOGe V, are strongly
the fact that they do produce this radiation has affected by magnetic fields within the solar
made it possible to show that cosmic-ray elec- system. The first proof that the bulk of cosmic
trons (and presumably nuclei) are present in rays are charged particles rather than 'Y-rays
large numbers in supernova remnants such as was provided by the latitude effect, a reduction
the Crab Nebula, and in curious structures (jets in cosmic-ray intensity at low latitudes resulting
and radio lobes) associated with unusually active from the shielding effect of the earth's dipole
external galaxies. Observationally, synchrotron field. Experiments detecting an east-west asym-
radiation is distinguished by its non thermal metry in the intensity of obliquely incident
spectrum and high degree of polarization. From cosmic rays proved, on the basis of geomagnetic
the intensity of this radiation one can deduce theory, that primary cosmic rays have predomi-
the amount of power that must be supplied to nantly a positive charge.
electrons in these sources. It proves to be a sig- The weaker but more extensive fields in inter-
nificant fraction of the visible power, indicating planetary space show their influence in more
that the mechanism by which cosmic rays are subtle ways, through time variations correlated
accelerated is remarkably efficient. with solar activity. The effects range from an
If the electrons were a secondary component, II-year periodicity in the local cosmic-ray
produced by nuclear collisions, half of them intensity to sudden intensity modulations as-
would be positively charged. Measurements sociated with individual solar flares. The equip-
show that positrons make up only one-tenth ment used to study these variations ranges, in
of the total, so most of the electrons must be turn, from networks of earth-based monitors to
directly accelerated. Since the positrons are satellites and interplanetary probes. Both the
secondary and are subject to the life-shortening short term and long term variations are under-
synchrotron process, measurements of their stood in terms of an interaction between cosmic
energy spectrum afford an alternative approach rays and the "solar wind." The solar wind, an
to determining the cosmic-ray lifetime. extension of the sun's corona, consists of out-
Cosmic 'Y-rays and neutrinos with energies less ward-streaming plasma carrying with it a mag-
than 10 or 20 MeV are a mixed bag. Some of netic field that is partly. ~egular, resulting in
these 'Y-rays come from nuclear reactions in convection, and partly turbulent, resulting in
stellar atmospheres and some from e ± annihila- diffusion. At the boundary where the solar
tion, others are thermal radiation from accre- wind meets the interstellar medium some par-
tion on neutron stars and black holes; some of ticle acceleration seems to take place, making
these neutrinos come from nuclear reactions that region an important plasma physics labo-
inside stars. But 'Y-rays and neutrinos with higher ratory .. Particle acceleration also occurs from
energies are true cosmic rays, albeit secondary. time to time near the sun's surface, in flares.
High energy neutrinos produced locally, by Many cosmic-ray investigations are aimed pri-
cosmic rays striking the earth, have been 0 b- marily at deriving information about the sun
served in deep mines. Initial steps have been and the interplanetary medium.
taken toward construction of DUMAND, a It is important for use of the 14C radiocar-
deep underwater muon and neutrino detector. bon dating technique in archaeology to know
Having a target mass of order 10 9 tons (target whether the intensity of cosmic rays reaching
volume -1 km 3 ), it will be capable of detecting the earth has varied appreciably over the past
high energy neutrinos from distant events such l0 4 years. This unstable isotope, generated in
as supernova explosions. the earth's atmosphere mainly through the reac-
235 COSMIC RAYS

tion 14N(n, p)14C, is one of several produced repeats for several generations. At high enough
by cosmic-ray bombardment. By comparing 14C energies many of the charged pions collide be-
dates with those determined by counting tree fore decaying, because of relativistic time
rings it has been found that the apparent pro- dilation, further augmenting the hadronic
duction rate of 14C has varied by some 20% cascade. The composite cascade, consisting
over the past 3000 years. However, carbon par- of hadrons, muons, and electrons (plus r-rays)
ticipates in complex exchanges between the is called an air shower.
atmosphere, biosphere, and hydrosphere. It is The cosmic radiation observed at ground level
not clear what relative importance should be consists largely of muons from very small air
given, therefore, to variations in the earth's showers whose electron and hadron compo-
magnetic field, in solar activity, or in climate, nents have been absorbed in the upper atmo-
when interpreting these data. Extensive mea- sphere. Above 10 11 eV primary energy the
surements of cosmogenic radionuclides in electromagnetic cascades can be det~cted at
meteorites indicate that the intensity of cosmic ground level by means of atmospheric Cerenkov
radiation in interplanetary space has been light collected with large mirrors. This light is
constant within experimental error (about a very faint, so the observations must be made
factor of 2) for the past 100 million years. on clear, moonless nights. The same technique
Below I GeV per nucleon the energy spectra applies, of course, to showers produced by
of the various cosmic-ray constituents are in- r-rays. Cosmic r-rays from several pulsars have
creasingly affected by the solar wind. Below been detected in this way.
100 MeV per nucleon, corrections for the When the primary energy reaches about 10 14
residual solar modulation, even for quiet-sun eV the electrons reaching mountain altitude are
data, are so great that the shapes of the spectra numerous enough to be detected at distances of
in interstellar space are uncertain. For at least order 100 m from the shower core, or axis. This
3 decades above 1 GeV per nucleon the energy marks the beginning of a new experimental
spectra have approximately inverse power-law regime in which large counter assemblies are
form. The exponent has about the same value, used to make detailed measurements, preferably
2.6, for all primary nuclei. The spectra of ele- on several components, of individual showers.
ments that are wholly or predominantly sec- The equipment used includes ionization ca10-
ondary, such as Li, Be, B, are somewhat steeper. rimeters for studying the high-energy hadrons,
These differences are successfully explained by counters deep underground for detecting "hard"
models that properly take into account frag- (high energy) muons, counters with smaller
mentation during propagation from the source amounts of shielding to measure "soft" (-1
or sources to the solar system. By inverting the GeV) muons, unshielded counters for register-
calculation one obtains "source abundances," ing the total number of particles at ground
for comparison with solar system material, for level (most of which are electrons), and detec-
example. It is found that for self-consistency in tors of atmospheric Cerenkov light accumulated
these results the path length should decrease over the growth and decay of the shower above
with increasing energy, as remarked above. ground level. The simpler and less costly of
Above 103 GeV per nucleon the evidence these detectors are deployed in large numbers,
from "direct" experiments, those in which the forming arrays with collecting areas ranging
primary nuclei are identified by measuring their from a small fraction of a square kilometer to
charge, is still scanty. Such experiments must several tens of square kilometers. Using elec-
be carried out above the atmosphere using bal- tronic coincidence circuitry, the array counters
loons or spacecraft, but the particle energies select the events to be measured. Commonly
must be measured calorimetrically, using thick they are given the added task of determining
absorbers. The two requirements combine to the trajectory of each selected shower: the core
limit the product of area, time and solid angle, location from the intensity pattern, the direc-
and hence the total count for a given intensity. tion from the timing pattern. Aside from the
Ground based experiments can have much larger trajectory requirements, which determine the
sensitive areas and can use the atmosphere as geometrical factor of the experiment, the selec-
an absorber, but they must rely on indirect tion depends primarily on the shower energy
evidence for identifying the primary particles. being greater than some threshold. The actual
The ground-based experiments are of two energies of the individual selected showers are
types, those which detect unaccompanied high calculated after the fact, in most cases using
energy muons, and those which detect air data from additional detectors over and above
showers. The muons arise from the decay of those which produced the trigger.
charged pions (and kaons) produced when the Large air shower arrays, operating continu-
primary cosmic rays collide with air nuclei, high ously for many years, have measured the cosmic-
in the atmosphere. In the same collisions, neu- ,ray energy, spectrum over a range extending
tral pions are produced which give rise, by their from about 10 17 eV to more than 10 20 eV per
decay, to photons, which in turn produce elec- particle, while smaller arrays, operating at high
tromagnetic cascades via pair production and mountain altitudes, have covered the range
bremsstrahlung. The colliding nuclei retain half from 10 14 to 10 17 eV. This spectrum can be
of their energy, on the average, so this process extended to lower energies by adding together
COSMIC RAYS 236

the low energy spectra described above for all composition of these nuclei (for equal energy)
values of Z. The result exhibits several features. indicates that from 10 12 eV to the knee of the
The (integral) slope, which at 10 12 eV has energy spectrum the average mass increases. It
become 1.7, a trifle greater than at lower then appears to decrease markedly over the
energies, first decreases to -1.5 and then, above next 1.5 to 2 decades. After that it seems to
the so-called "knee" at a few times 10 15 eV, remain constant. At the knee, where the aver-
increases rather abruptly to -2.0. After this age mass seems to be greatest, the fraction of
rapid fall has continued for 3 decades the spec- Fe nuclei may be 50-100%; at the highest ener-
trum again flattens, having an "ankle" at -10 19 gies the fraction of protons is at least 50%,
eV, above which the slope is only -1.6. probably greater.
Air shower experiments also explore the di- Seventy years have passed since the balloon
rectional characteristics of high energy cosmic flights by Victor Hess which established the
rays. At low energies, disregarding effects of existence of cosmic rays. Except for small
the geomagnetic field and solar wind, cosmic contributions from solar activity, there is still
rays are remarkably isotropic in direction. This no clear evidence of where or how they are
is understood as resulting from repeated mag- produced. It is becoming more and more diffi-
netic scatterings as the cosmic rays diffuse cult to account for the non solar cosmic rays
through the interstellar medium. Up to energies reaching the earth by means of a single source
corresponding to the knee, the magnitude and or class of sources. There is much to be said in
direction of the small observed anisotropy favor of an association between supernovae and
«0.1 %) is explained by the motion of the solar the bulk of galactic cosmic rays, those with
system relative to its surroundings (the Comp- energies not much higher than 10 15 eV, but
ton-Getting effect) and a tendency of cosmic even in that domain alternative models have
rays to stream outward from the denser parts of some support. The possibility of accounting for
the galaxy in the direction of the local inter- the knee in terms of a "pulsar bump" made up
stellar magnetic field. At energies corresponding of particles enriched in heavy nuclei cannot
to the knee the anisotropy begins to increase. yet be ruled out. It is very probable but not
The direction of the maximum intensity begins absolutely certain that the highest energy par-
to change, becoming perpendicular to the galac- ticles are extragalactic. Does the ankle at 10 19
tic disk at about 10 17 eV. At that point the eV mark a transition from predominantly galac-
anisotropy is several percent. As the energy tic to extragalactic? Alternatively, if the knee
increases further, so does the anisotropy, but marks the onset of enhanced leakage from the
no clear pattern emerges until the highest ener- galaxy, one might expect the extragalactic com-
gies are reached. Above 5 X 10 19 eV, where the ponent to become dominant within a couple of
Larmor radius of a proton in a regular magnetic decades, by a few times 10 17 eV, but there the
field of reasonable strength (3 microgauss) is spectrum seems to be smooth.
greater than the size of the galaxy (20 kilo- In the highest energy region, at least, the study
parsec), the arrival directions are predominantly of cosmic rays remains a frontier, both in phys-
perpendicular to the plane of the galactic disk. ics and in astrophysics.
It is generally believed that these cosmic rays
must be extragalactic. But they cannot come JOHN LINSLEY
from the extreme depths of space. Because of
the Doppler effect, photons of the 2.7 0 micro- References
wave background radiation will appear to a Progress in Cosmic Ray Physics, 1-3, and subsequent
10 20 eV proton to be 100 MeV 'Y-rays capable volumes entitled Progress in Elementary Particle and
of producing photopions. The resulting energy Cosmic Ray Physics, Amsterdam, North Holland
loss sets a limit of about 30 megaparsec on the Publishing Co.
range of such protons. Photodisintegration im- Ginzberg, V. L., and Syrovatskii, S. I., "The Origin of
poses a similar limit on the range of heavier Cosmic Rays," New York, Pergamon Press, 1964.
nuclei. Hayakawa, S., "Cosmic Ray Physics," New York,
The identity of the particles that produce Wiley-Interscience, 1969.
large air showers must be deduced from second- Greisen, K., "The Physics of Cosmic X-ray, ,),-ray, and
ary features of the showers, such as the "elonga- Particle Sources," New York, Gordon and Breach,
tion" (atmospheric depth of maximum develop- 1971.
ment) or the ratio of muons to electrons. But Hillas, A. M., "Cosmic Rays," New York, Pergamon
these secondary features also depend on the Press, 1972.
character of elementary particle collisions at Origin of Cosmic Rays, Dordrecht, The Netherlands,
energies where direct evidence is unobtainable. Reidel, 1981.
The extreme proposals can be ruled out by Conference Papers, 17th International Cosmic Ray
rather general arguments: practically none of Conference, 1-14, Dordrecht, The Netherlands,
the large air showers are initiated by relativistic Reidel, 1982.
dust grains, and few if any by neutrinos or
'Y-rays. Just as at low energies, most of the very Cross-references: ASTROMETRY; ASTROPHYSICS;
high energy cosmic rays are atomic nuclei. DOPPLER EFFECT; ELEMENTS, CHEMICAL; REL-
The preponderance of indirect evidence on the ATIVITY; SOLAR PHYSICS; SPACE PHYSICS.
237 COSMOLOGY

COSMOLOGY arguing that the so-called spiral nebulae belong


to the Milky Way. When it was realized that the
The Size and Shape of the Universe Although spiral nebulae are actually extragalactic "island
philosophical speculations about the origin, universes" containing literally billions of stars,
structure, and evolution of the Universe have the known size of our Universe had to be in-
continued since the time of the ancient Greeks, creased by factors of hundreds and then thou-
the observational basis for scientific cosmology sands, and even tens of thousands.
has only recently been obtained. For nearly
two centuries, the universe was, for example, Although the once-fashionable island universe
thought to be limited to a finite, static system theory of spiral nebulae went into decline after
of stars which make up our Milky Way Galaxy. the spectroscopists demonstrated the gaseous
Observations supporting a finite stellar Universe nature of certain diffuse nebulae, the extraga-
with an observable edge were presented by Wil- lactic interpretation of the spirals revived its
liam Herschel in the late eighteenth century and popularity after Vesto M. Slip her serendipitously
reaffirmed by Jacobus Kapteyn in the early 20th discovered their large radial velocities. In the
Century . By assuming that all stars shine with early twentieth century there was a widespread
roughly the same intrinsic brightness, they could belief that the spiral nebulae are primitive solar
use counts of the number of faint stars in dif- systems in the making, and Percival Lowell asked
ferent directions to determine the structure of Slip her to use the instruments at the Lowell
the stellar system. In this way both Herschel Observatory to investigate their expected rota-
and Kapteyn concluded that the stars are con- tions. As a result, in 1914 Slipher "accidentally"
centrated in a flattened disk with the Sun at the discovered their astonishingly high velocities
center and with the greatest extent in the plane which sometimes exceeded 1,000 kilometers
of the Milky Way. Herschel was unable to pro- per second. Because it was hard to believe that
vide a distance scale for his picture, however objects with such enormous speeds could long
and it remained for Kapteyn to specify that th~ remain a part of the Milky Way system, these
edges of the stellar Universe are located at about observations proved to be a major impetus for
1,500 pc in the direction perpendicular to the reviving the island universe theory. In other
galactic plane and at about 8 times that distance words, the velocities of the spiral nebulae ex-
in t~e direction of the plane. (One parsec, or ceeded the escape velocity of the Milky Way,
pc, IS equal to ten thousand billion kilometers and they were therefore probably extragalactic
or 10 13 km.) , objects which lie outside the gravitational con-
One of the first objections to the Sun-centered trol of our Galaxy.
Universe was provided by Harlow Shapley in This was nevertheless, a controversial conclu-
the early 1920s. His observations indicated that sion, for it seemed to contradict Adrian van
the globular clusters outline an immense galactic Maanen's measurements of the proper motions
system that is nearly a factor of ten larger than of spiral nebulae. If these objects lie well outside
the visible stellar system. According to this our Galaxy, then their angular motions corre-
scenario, the Kapteyn Universe is simply a local spond to absurdly high linear speeds approach-
star cloud located in the outer regions of the ing the velocity of light. Van Maanen's obser-
grander Milky Way structure. Although Shapley vations were simply incorrect, however and
did not realize it at the time, interstellar dust it was eventually shown that the spiral nebulae
absorbs the light of distant stars, and for this lie at the enormous distances suggested by their
reason the star counts vastly underestimated the high radial velocities. The great debate over the
scale of the stellar system. This was not all for extragalactic nature of spiral nebulae was in
it was soon realized that the entire syste~ of fact, finally settled in 1925 when Edwin Hubble
stars is rotating like a gigantic pinwheel about a announced the discovery of Cepheid variable
remote, massive center. For instance, the in- stars in the Andromeda nebula (M 31) and the
vestigations of Bertil Lindblad and Jan Oort, in great spiral in Triangulum (M 33). The observed
1925 and 1927 respectively, demonstrated that period and apparent brightness of these stars
the Sun is located about 10,000 pc from the could be combined with the period-luminosity
galactic center, and that the Sun revolves around relation to infer a distance of about 480,000 pc
this center at a very rapid speed of about 250 for these two nebulae. This meant that they are
kilometers per second. In addition the enor- considerably more distant than the most remote
mously massive center, which contains a mass parts of our own Milky Way Galaxy. In fact, we
equivalent to about 100 billion stars, gravita- now know that all of the spiral nebulae are
tionally controls the motion of the Milky Way, remote galaxies, or island universes of stars
producing differential rotation in which the which are scattered throughout extragalacti~
more remote stars move at slower speeds. In space at distances as large as 10 billion pc.
any event, Harlow Shapley's arguments for a Hubble next embarked on a survey of the realm
dramatically larger size for the Milky Way were of the nebulae. Just as the counts of stars as a
correct. The known Universe had to be centered function of brightness and direction in the sky
far from the Sun, and its boundary had to be had been used to show that the Milky Way is a
increased by a factor of ten. As it turned out finite disklike system, nebula counts could be
however, Shapley was decidedly incorrect in used to explore how the nebulae, or galaxies,
COSMOLOGY 238

are distributed in space. As Hubble probed that Vr is much smaller than c. In any event,
deeper into space to fainter and fainter magni- seven years after Slipher's pioneering work,
tudes, the number of galaxies increased propor- C. Wirtz looked for correlations between these
tionally just as would be expected for a uni- velocities and other observable properties of the
form distribution of galaxies in ordinary flat galaxies. Wirtz found that, when suitable aver-
Euclidean space. Expressed algebraically, the· ages of the available data were taken, "an ap-
number N(m) of nebulae per unit solid angle proximate linear dependence of velocity and
brighter than visual magnitude m was: apparent magnitude is visible .... The depen-
dence of the magnitudes indicates that the spiral
N(m) = 1.43 X 10- 5 X 100.6m
nebulae nearest to us have a lower outward
galaxies per steradian. (1 ) velocity than the distant ones." Thus, both
(Astronomers use a logarithmic scale of bright- Slip her and Wirtz were remarkably close to
ness, or magnitudes, in which larger apparent "discovering" the expanding Universe; but it
magnitudes refer to apparently fainter objects.) remained for Edwin Hubble to provide the data
Although some galaxies are aggregated into which conclusively indicated that the most dis-
groups or clusters, Hubble found that they are tant galaxies are rushing away from us the
uniformly distributed when viewed on very fastest.
large scales. That is, the counts of faint galaxies It actually was not until 1929 that Hubble
in different directions and to different depths provided better quantitative data for the dis-
in the same direction indicated that there is a tances of the spiral nebulae and mentioned
homogeneous and isotropic distribution of somewhat incidentally the linear correlation of
galaxies. This meant that we are not in a priv- distance D and redshift. Because he was pri-
ileged position in the Universe, and that the marily interested in determining the solar motion
Universe has no preferred shape or center. The with respect to the distant nebulae, he did not
distribution of galaxies would appear the same, specifically state the now-famous Hubble law
apart from small scale fluctuations, when studied cz=Vr=HoXD. (4)
from any galaxy. In addition, there was no sign
of the number of galaxies thinning out as Hubble (The parameter Ho is now called Hubble's con-
looked further into the depths of space. There stant.) Hubble's first formulation of this red-
simply was no indication of an edge of the shift-distance relation referred only to rela-
Universe. In fact, observations with the most tively nearby galaxies (velocities less than 1,000
powerful modern telescopes have reaffirmed kilometers per second and distances less than
Hubble's basic conclusions about the large scale 2 megaparsecs or 2 million parsecs), but he ini-
structure of the Universe. It has a nearly ho- tiated a program to explore the relation to the
mogeneous distribution of mass with no visible largest distances possible. By the early 1930s
edge and no preferred center. Milton Humason had extended the spectro-
The Redshift-Distance Relation As early as graphic observations to the limits of the 1DO-inch
1914 Vesto M. Slip her had published the radial telescope at Mount Wilson, and together Huma-
velocities for fifteen spiral nebulae or galaxies, son and Hubble found the redshift-distance
noting that most of them are moving away relation to hold for velocities as large as 20,000
from the Earth at velocities which exceed those kilometers per second and for distances as large
of any other known astronomical object. Ac- as 32 megaparsecs. This was done by reformu-
cording to Slip her, this indicated "a general lating Hubble's law in the form:
fleeing from us or the Milky Way." To be pre-
cise, Slipher actually measured the difference log (cz) = 0.2m + B, (5)
between the wavelengths Ao of the spectral
lines of the galaxies and the wavelengths AL of where m is the apparent magnitude of the object
the same lines observed in terrestrial labora- and the constant B depends on Hubble's con-
tories. This difference specifies the redshift z stant H o and the intrinsic brightness of the
through the relation galaxy. Because of the impossibility of measur-
ing the distances to the weaker nebulae, Hubble
and Humason assumed that all nebulae have the
(2) same intrinsic brightness and inferred a velocity-
AL distance relation from the log red shift-apparent
The most straightforward interpretation of the magnitude relation given in Eq. (5). They were
redshift is in terms of the Doppler effect due to also able to derive the value of Hubble's con-
the recession of the galaxies. According to this stant H o = 558 kilometers per second per mega-
interpretation, the galaxies are moving away parsec. This work was then amplified and ex-
from the Earth with radial velocities Vr along tended by Humason, Nicholas Mayall, and Allan
the line of sight given by Sandage. In 1956 they published the log red-
shift-apparent magnitude relation for 474 extra- .
Vr =c X z for Vr « c (3) galactic nebulae, using data obtained during a
25-year period at the Mount Wilson, Palomar,
where c = 2.99792456 X 105 kilometers per and Lick Observatories. Within the accuracies
second is the velocity of light, and it is assumed of the data, the relation is linear and the slope
239 COSMOLOGY

has a value of 0.2, which is the expected value and quasars can be used to probe the remotest
for a homogeneous, isotropic expanding Uni- regions of the Universe. In fact, quasars with
verse. Moreover, by this time Walter Baade had redshifts as large as 3.5 have been discovered,
shown that the period-luminosity relation for corresponding to a distance of about 10 billion
Cepheids in spiral arms is quite different from pc. These quasars are so far away that they
that for Cepheids in galactic nuclei or in globular emitted the radiation we now detect when the
clusters. When this difference was taken into Universe was less than 20% of its present age.
account, the extragalactic distance scale was Moreover, their radial velocities are so large that
revised, and Hubble's constant was revised astronomers must use the relativis:::.c expression
downward to about 180 kilometers per second
per megaparsec. (During the past 15 years Allan (z + 1)2 - 1]
Sandage and his colleagues at the Mount Wilson V =c X [ (6)
and Palomar Observatories have published at r (z + 1)2 + 1
least five successively smaller values for Hubble's
constant, the most recent value being about 50 to infer a radial velocity Vr from the measured
kilometers per second per megaparsec.) red shift z, As illustrated in Fig, 1, the quasars
The log redshift-apparent magnitude relation and radio galaxies nevertheless exhibit a log
was extended much deeper into space and fur- red shift-apparent magnitude relation which is
ther into the past when radio galaxies and quasi- consistent, within the errors, with that expected
stellar objects, or quasars, were discovered. Al- for a homogeneous, isotropic expanding Uni-
though normal galaxies like the spiral nebulae verse. Unhappily, the great dispersion in the
emit most of their radiation at optical wave- observed data makes it impossible to determine
lengths in the visible part of the spectrum, the the details of the expansion. The data suggest
radio galaxies and quasars emit tremendous instead that the distant quasi-stellar objects
amounts of energy at radio wavelengths. In are about a factor of 10 brighter than the nearer
fact, these objects were first discovered by radio galaxies, which are in turn about a factor
radio astronomers who measured their angular of 10 brighter than the very nearby normal
sizes and determined their positions in the sky galaxies. Because the light with which we view
using interferometric techniques. These posi- distant objects left them some time ago, this
tions were then used to locate optical counter- change in brightness suggests that younger extra-
parts whose redshifts could be measured. Thus, galactic objects are brighter than older ones.
for example, the first known radio galaxy, The Expanding Universe: Origin and Evolution
Cygnus A, was identified with an extragalactic The most obvious and simplest interpretation
object in 1954. Its redshift was 0.057, at the of the observed galaxy redshifts is that they are
time one of the largest known. Nearly a decade due to the Doppler effect of receding galaxies,
later the first two quasars, 3C 273 and 3C 48, and that the Universe is therefore expanding.
were identified with blue objects with starlike Nevertheless, there was initially quite a different
images and redshifts of 0.16 and 0.37, respec- explanation, for the Universe was still believed
tively. In retrospect, the designation quasi- to be limited to the Milky Way system of stars.
stellar object appears to this writer a misnomer, Around 1917 theoreticians were trying to find a
however, for most quasars are comparable to solution to Einstein's gravitational theory which
galaxies in overall extent, and their starlike would keep this finite, static system from col-
images are actually due to their enormous lapsing under its own weight. Willem de Sitter
distances. showed that there is one solution in which the
These were mind boggling discoveries, for the red shifts of galaxies increase with the square of
quasars had to be receding from our Galaxy at their distance, but in this case the finite Universe
speeds which are a substantial fraction of the remains static and the galaxies do not move
velocity of light, and both radio galaxies and apart. That is, the redshift- distance relation
the quasars had to be intrinsically much brighter becomes a mathematical curiosity of the Gen-
than the nearby spiral galaxies. Although Nobel eral Theory of Relativity, and has nothing to do
prize winner Martin Ryle once reasoned that with an expanding Universe. Curiously enough,
the discrete radio sources are stars located within Hubble at first attributed his linear redshift-
the Milky Way, and Geoffrey Burbidge once distance relation to this so-called de Sitter ef-
argued that quasars do not participate in the fect. Most astronomers, including Hubble,
expansion of the Universe, there is now a gen- eventually accepted the expanding Universe,
eral consensus that both of these superluminous whose observed properties can be explained by
objects are the active nuclei of young galaxies simple Newtonian gravitation, but additional
whose remote distances are specified by Hubble's astronomical evidence was at first required to
law. Even their unusually bright radiation seems "prove" that the Universe is expanding.
to be explained by a central energy source in- Indirect evidence for an expanding Universe
volving a massive black hole. At any rate, the is provided by the fact that the sky is dark at
extragalactic radio objects have approximately night. If a static Universe is composed of a
the same intensity over a wide range of wave- uniform and infinite distribution of stars, the
lengths, in contrast to the spectrum of the night sky should be about as bright as the Sun.
normal galaxies. As a result, the radio galaxies This so-called Olbers' paradox arises from the
COSMOLOGY 240

:1.
6.5 H~ .

6.0 - 1

~
"
5 .5 1. w
7.
-.
~

5 .0 ~
01
~.5 .1 2. .3

.
N
U
~.0
....
"-
I
0,
0
1.
..
-'
-'
iii
3 5 .01 .2 'f!
8-' '"a: .1 ....
3 0 ><
.e5 u
«
ID
2.5 .0121 - . 02
-
2.0 • OUASI-STELLAR SOL~CES
.e1 ~
.eec -'
1.5 .
~ RADIO GALAXIE S
NORMAL GALAXIES .e001 .0e'2
.001
1.0 .0007
6. 28.
APPARENT MAGNITUDE

FIG. 1. The author's composite Hubble diagram, in which are plotted 663 normal galaxies,
230 radio galaxies, and 265 quasi-stellar objects. The vertical axis at right gives the look-back
time in billions of years, calculated on the assumption of a Hubble constant of 50 kilometers
per second per magaparsec. The solid lines denote the theoretical log red shift-apparent magni-
tude relations for a homogeneous isotropic Universe with different values of the deceleration
parameter q o. Because of the wide range in the intrinsic properties of these extragalactic objects,
the observations provide no defmitive statement about the value of qo and the fate of the
expanding Universe.

fact that every line of sight in a static infinite For example, in a Universe which expands for-
Universe will eventually intersect the surface of ever in flat Euclidean space, the big bang oc-
a star, and the sky should therefore be as bright curred at a time
as a typical star's surface. This paradox is re-
solved in an expanding Universe in which the 2
to = - - = 13 billion years ago, (7)
redshift moves the starlight of distant galaxies 3Ho
out of the visible optical wavelengths and into
the larger invisible wavelengths. In fact, in order where Hubble's constant Ho = 50 kilometers
to avoid a theoretically bright night sky, astron- per second per megaparsec. The astrophysically
omers have been compelled to conclude that determined ages of the oldest stars agree, within
the radiation from energetic young galaxies has a factor of two, with this estimated time since
to be greatly redshifted, and that an expanding the big bang. This agreement therefore provides
Universe is therefore required. additional evidence for an expanding Universe
If we accept the notion of an expanding Uni- which originated in a big bang.
verse, the redshift-distance rel{ltion provides a During the early stages of the expanding Uni-
method for studying the past history of the verse, thermal radiation provided the dominant
Universe, while also providing a time scale for form of mass-energy, and the expansion was
creation in an evolving Universe. For instance, therefore gravitationally controlled by the radi-
the light we detect from radio galaxies and ation. As shown by George Gamow in the 1940s,
quasars was emitted during past epochs, or the temperature T(t) at time t since the big
higher redshifts, when the Universe was com- bang is, under these conditions, given by
paratively young. The fact that these objects
are more abundant at higher redshifts provides C2 ] 1/4 10 10 0
T(t) = [ ~ - K. (8)
support for the idea that we live in an evolving 321rGat 2 t1/2
Universe that changes with time. In fact, a
backwards extrapolation of the presently ob- Here, c is the velocity of light, G is the New-
served expansion indicates that there must have tonian constant of gravitation, a is the radiation
been a time in the past when all of the matter constant, and in the numerical approximation
in the Universe was highly compressed and the time t is in seconds. As Gamow realized, the
extremely hot. This was the time of the big temperatures during the first few minutes are
bang which gave rise to the expanding Universe. hot enough for protons and neutrons to com-
241 COSMOLOGY

bine to form helium nuclei. The amount of to be extremely isotropic. In fact, the most
helium synthesized is about 27% by mass, and precise measurement of the large-scale isotropy
this abundance agrees with that observed in of the Universe now comes from the microwave
various spots in our Galaxy, in nearby galaxies background, which is isotropic to better than
and in quasars. Moreover, the observed abun- 0.1 % on angular scales greater than one minute
dance of helium in our Galaxy far exceeds the of arc. (An anisotropy of a few millidegrees
amount which would be synthesized in all of has been attributed to either the motion of our
the stars in our Galaxy over its entire lifetime. Galaxy and its neighbors relative to the back-
On the other hand, the observed abundance of ground radiation at a velocity of 500 kilometers
the heavier elements in our Galaxy shows ap- per second, or to a Universe which is anisotropic
preciable variation, and the much lower abun- at the 0.1 % level.) At any rate, the high energy
dances can be synthesized within stars. In other density, thermal spectrum, and isotropy of the
words, the observed abundance of helium can- 3°K microwave background cannot be explained
not be produced within the stars, and even if it by the radiation of conventional astronomical
was there would be an excessive abundance of objects, and it must be attributed to the relic
the heavier elements. We can only conclude radiation of a denser, hotter epoch. This dis-
that the observed helium originated in the early covery therefore provides unambiguous evidence
stages of the expanding Universe before galaxies for the singular origin of the expanding Uni-
and stars were formed. As a matter of fact, verse, while also providing an important con-
detailed calculations indicate that the amount of straint on the radiation temperature throughout
helium produced depends upon the "frozen-in" the evolution of the Universe.
proton-neutron ratio, and that this ratio is very The radiation temperature Tr (t), the radia-
sensitive to the rate of expansion. If this rate tion energy density pr(t), and the matter den-
differed by even 50% from the accepted value, sity Pm (t) at any time t can, for example, be
the helium abundance would disagree with that inferred using the boundary conditions
observed.
Direct astronomical evidence for the expand- Tr (t 0)= 3 degrees Kelvin
ing Universe was provided in 1965 when Arno
Penzias and Robert Wilson accidentally detected Pr (to) = 4 X 10-34 grams per cubic centimeter
the 3°K microwave background radiation that Pm (to) = I X 10- 31 grams per cubic centimeter
proved to be the relic radiation of the big
bang. When testing the microwave (or short (9)
wavelength radio wave) receiving system in-
tended to be used for measurements of the where the time to = 13 billion years and the
high-latitude continuum radiation of our Gal- mass density is for galaxies. For a Universe of
axy, Penzias and Wilson found a few degrees radius R (t) at time t we have
of unexpected noise temperature. Because the Pm (t) ex [R (t)]-3
excess noise showed no sidereal, solar, or direc-
tional variation, they first supposed that is Pr(t) ex [R (t)]-4 (10)
arose in the receiving antenna itself, but they
concluded that there remains an external noise Tr(t) ex [R (t)]-1
contribution with an antenna temperature of
3.5 ± l.OoK at the wavelength of 7.3 centi- so that
meters. In a companion letter in the same issue
of the Astrophysical Journal, Robert Dicke and .:Pm
. . :;.::..(t)Tr(t)
.:. . .:--'-. . .:. . :. . = constant for all times. (11)
his colleagues explained the excess 3°K radia- Pr(t)
tion as the residual temperature of the primeval
explosion that initiated the expansion of the We are presently in a matter dominated era in
Universe. This concept had nevertheless already which Pm exceeds Pro At the decoupling time
been developed by George Gamow, Ralph these two forms of energy density were equal,
Alpher, and Robert Herman in the late 1940s, and before that time radiation dominated
when they showed that a present temperature matter.
of about 5°K results when the 10 billion degree Fate of the Universe What is the fate of the
primeval fireball has been cooled by expansion Universe? Will it continue to expand forever,
in the enlarging cavity of the world. In any or will it stop its expansion in the future and
event, if the cosmic noise discovered by Penzias begin to contract? The answer to this profound
and Wilson is the relic fireball radiation, it question is found by considering the condition
should have a thermal spectrum. Because of the for which the outward motion of a remote
interactions of matter and radiation, the prime- galaxy is just balanced by the inward gravita-
val radiation will very rapidly reach thermody- tional pull of all of the other galaxies in the
namic equilibrium, and the resulting thermal Universe. This critical condition occurs when
spectrum will be preserved for all time as the the velocity of the remote galaxy is just equal
Universe expands and cools. The expected ther- to the escape velocity of the rest of the Uni-
mal spectrum was promptly confirmed, and the verse. When Hubble's Law is used to specify the
microwave background was additionally shown galaxy's velocity and the galaxy's kinetic energy
COSMOLOGY 242

is placed equal to the gravitational potential Hubble, E., "The Realm of the Nebulae," New Haven,
energy resulting from the gravitational tug of Yale Univ. Press, 1936; New York, Dover Pub.,
all the other galaxies, we obtain a critical mass 1958.
density: Lang, K. R., "Astrophysical Formulae," New York,
3H02 Springer-Verlag, 1980.
Pc = 81TG = 4.7 X 10- 30 Lang, K. R., and Gingerich, 0., "A Source Book in
Astronomy and Astrophysics, 1900-1975," Cam-
grams per cu bic centimeter, (12) bridge, MA, Harvard Univ. Press, 1979.
Sandage, A. et al. (Eds.), "Galaxies and the Universe,"
where G is the Newtonian gravitational constant
Chicago, Univ. of Chicago Press, 1975.
and the numerical value corresponds to Ho = 50 Sciama, D. W., "Modern Cosmology," Cambridge,
kilometers per second per megaparsec. U.K., Cambridge Univ. Press, 1971.
If the present mass density Pm (to) of the Silk, 1., "The Big Bang: The Creation and Evolution of
Universe exceeds Pc then the present expansion the Universe," San Francisco, W. H. Freeman, 1980.
will eventually reverse, the galaxies and stars Weinberg, S., "Gravitation and Cosmology: Principles
being destroyed and finally engulfed in a hot, and Applications of the General Theory of Relativ-
dense fireball analogous to the one which initi- ity," New York, Wiley, 1972.
ated the present expansion. In other words the Weinberg, S., "The First Three Minutes: A Modern
Universe will eventually stop its expansion, View of the Origin of the Universe," New York,
contract, and undergo another big bang. Alter- Basic Books, 1977.
natively, if the present mass density is less than
or equal to the critical value, the Universe will
continue to expand forever. The galaxies will Cross-references: ASTROMETRY; ASTROPHYSICS;
disperse and fade into a black void. Will the DOPPLER EFFECT; ELEMENTS, CHEMICAL; REL-
expanding Universe end with a whimper or a ATIVITY; SOLAR PHYSICS; SPACE PHYSICS.
bang? Edwin Hubble answered this question in
1926 when he essentially showed that the mass
density of galaxies is less than the critical den-
sity, and that the Universe will therefore con- CRITICAL MASS*
tinue to expand forever. The mass of fissionable material required to
Of course, we now have greater confidence in produce a self-sustaining sequence of fission
the conclusion that the expansion will continue reactions in a system (a reactor, for example) is
indefinitely. Astronomers have reaffirmed the the critical mass for that system.
fact that the mean density of matter in the lumi- The chain of reactions will be self-sustaining
nous visible parts of galaxies falls short of the if, on the average, the neutrons released in each
critical value by a factor of about 30. Dark or fission event initiate one new fission event. The
invisible matter has been inferred from dynami- system is said to be critical when that condition
cal considerations as well as from radio and exists.
X-ray measurements of galactic halos and the Neutrons released from fissioning nuclei may
intergalactic medium in clusters of galaxies. escape from the system; they may be captured
Nevertheless, when these contributions have in non-fissioning reactions, or they may prod uce
been added, the mass density of the Universe is new fissions. The critical mass depends on the
at least 10 times too small to halt its expansion. relative probabilities of these processes and on
This has been confirmed by satellite measure- the average number of neutrons released per
ments of the abundance of interstellar deute- fission. Evaluation of these probabilities is the
rium, which was presumably created in the concern of criticality calculations which are
big-bang explosion. These measurements indi- important in the design of neutron chain
cate that the present mass density of the Uni- reactors.
verse in both visible and invisible forms is The escape probability becomes larger for
Pm (to) ::s; 5 X 10- 31 grams per cubic centi- smaller systems, inasmuch as the ratio of surface
meter. Theoreticians will continue to create to volume increases as a system is made smaller.
esoteric theories and observational astronomers Thus, there is a critical size below which the
will continue to collect uncertain data; but the chain reaction in a given system cannot be made
available evidence indicates that the conclusions self-sustaining. The concept of critical size is
will not change. The evolving Universe will often discussed along with critical mass.
continue to expand forever, and the observable Neutrons colliding with non-fissionable nuclei
properties of this expansion will probably con- in the system may be absorbed and thus lost to
tinue to be explainable in terms of Newton's the chain reaction. In fact, not every neutron
theory of gravity and flat Euclidean space. absorption by a fissionable nucleus results in a
fission. Non-fission absorption must be taken
KENNETH R. LANG into account in calculating the critical mass. For
example, a system containing pure 235 U can be
References made to have a low critical mass. If the same
Harrison, E. R., "Cosmology: The Science of the
Universe," Cambridge, U.K., Cambridge Univ. *Research sponsored by the USAEC under con-
Press, 1981. tract with Union Carbide Corporation.
243 CRITICAL PHENOMENA

configuration were loaded with a sufficient CRITICAL PHENOMENA


~~lntity of natural uranium (0.0057 per cent
U, 0.72 per cent 235 U and 99 27 per cent Introduction The field of phase transitions
238 U) to contain the sa~e total' amount of and critical phenomena has seen extraordinary
235U, it would not be critical because at certain
developments within the past few years 1 and
energies the 238 U readily absorbs neutrons with- in this brief account we shall attempt t~ pro-
out fissioning. vide an overview of some of these, concentrat-
The probability that a neutron striking a ing our discussion on the concepts of scaling
fissionable nucleus will cause it to fission de- and universality.
pends on the fission cross section (see FISSION) We shall organize the introduction to critical
which in turn depends on the energy of the phenomena about three simple questions: (1)
neutron, increasing as the neutron energy gets "What happens?", (2) "Why study?", and (3)
lower. Thus the addition of a moderator that is "What do we actually do?"
a material which takes up energy f;om th~ "What Happens?" What happens near the
neutrons without absorbing them will lower critical point is easily explained by means of
the critical mass of a system. Wate; and carbon the example of a simple magnet. The most strik-
are good moderators. ing macroscopic property of a magnet is that it
The critical mass also depends on the average is in fact "magnetized" -it can, for example
number of neutrons released per fission. This pick up thumb tacks! Suppose we measure th~
number changes slightly with neutron energy. number of thumbtacks or magnetization as we
For 23 5 U it is about 2.45 for thermal neutrons heat the magnet at a uniform rate. As we heat,
and about 2.65 for I MeV neutrons. The num- the tacks fall off one by one, but the rate of
bers are slightly higher for 23 3 U. falloff suddenly diverges to infinity at a certain
A complete criticality calculation must take temperature (see Fig. I) and there are no
into account the fission cross section as a func- thumbtacks left for temperatures higher than
tion of neutron energy and the average neutron this temperature. We call this temperature the
yield perfission as a function of neutron energy. critical temperature, Te , and we call phenomena
Also to be considered are the geometrical dis- associated with the critical point critical
tributions in the system of the fissionable nuclei phenomena.
the absorbing nuclei, and the moderator, and Examples of other critical phenomena are the
how the neutrons scatter from them. Further- singularities in two important "response
more, the configuration of reflecting material functions ":
outside of the fuel volume has a marked in- (I) the constant-field specific heat (see Fig.
fluence on the critical mass. A complete cal- 2(a))
c~la~ion .would construct the spatial and energy
dlstnbutlOns of neutrons in the system through CH == response in heat content to a
the use of a mathematical procedure that models change in temperature
the history of neutrons from their release to
their capture or escape. Neutron diffusion
theory or transport theory is usually used for
such calculations. Actual calculations use ideal-
== T(::)H' (1)

izations and produce approximate results. Often where here S = S(T, H) denotes the entropy of
criticality experiments are required to verify the system, and
results. (2) the isothermal susceptibility (see Fig.
Final~y, a~ example of critical masses, a sphere 2(b))
32 cm In dIameter containing 235 U dissolved in
water has a critical mass of about 2.1 kg. The
same sphere with 233 U has a critical mass of
about 1.1 kg. I The Oak Ridge National Lab-
oratory graphite reactor was loaded with 31
tons of natural uranium. This contains about
203 kg of 235 U.
A thorough discussion of neutron chain reac-
tors is given in reference 2.
CHARLES D. GOODMAN
References
1. Callihan, A. D., Morfitt, 1. W., and Thomas, 1. T.,
Proc. Intern. Conf. Peaceful Uses At. Energy,
Geneva, 5, 145 (1956).
2. Weinberg, A. M., and Wigner, E. P., "The Physical
Theory of Neutron Chain Reactors," Chicago,
University of Chicago Press, 1958.
Cross-references: CROSS SECTIONS AND STOPPING
POWER; FISSION; NUCLEAR REACTIONS; NU- FIG. 1. .Behavior (schematic) of some common phys-
CLEAR REACTORS. ical quantities near the critical point.
CRITICAL PHENOMENA 244

XT == response of magnetization to a
change in magnetic field

(2)

where M = M(T, H) is the magnetization and H


is the magnetic field.
Thermodynamic functions are macroscopic in
nature . What is happening at a microscopic level
to account for the anomalies in the thermody-
namic functions? Simply said, the motion of (0 ) Experiment
increasing numbers of particles is becoming cor-
related; e.g., the correlation function,

- - -
C2 (T, H, r) == (SoSr) - (so) (Sr), (3)
which describes the degree of correlation among
the constituent magnetic moments Sr of our
magnet, is becoming long-range. Here the angu-
lar brackets denote thermal averages.
In fact, the "moments" of the correlation
function,

iJa(T,H) == filY C2(T,H,,!)d!,, (4) (b) mft


FIG. 3. Behavior (schematic) of common physical
are found to diverge for all positive a and even quantities near the critical point, (a), compared with
for a limited range of negative a (roughly a > the predictions of the mean field theory, (b). Here
- 2)! In particular, the zero-field isothermal su~ K == lIt .
ceptibility is directly proportional to the zeroth
moment
so the divergence of the susceptibility can be
seen to be directly related to the increase in
range of C2 (T, H, r).
A particularly useful measure of the range of
S(T,H=O) the correlation function is the correlation
length, HT, H), defined by the equation
~2(T, H) = iJ2(T, H)/iJo(T, H) (6)

This quantity can be measured experimentally,


e.g., by light scattering, and it is found to di-
verge near the critical point (see Fig. I).
"Why Study?" There are, of course, many
T answers to the question "why study critical
phenomena," but certainly the simplest answer
(0 ) is that there exist rather striking discrepancies
between the predictions of closed-form theories
and our findings in the real world. For example,
in Fig. 3 we compare, in a qualitative fashion,
the experimentally observed temperature de-
pendences of the functions discussed above
with the predictions of the mean field theory
(mft) of cooperative phenomena. The mean
field theory corresponds to a model in which
H each magnetic moment interacts with all other
( b) magnetic moments in the entire system with an
equal exchange interaction. Such a model is
FIG. 2. Definitions of two response functions, the unlikely to be a realistic model for the task of
constant-field specific heat CH(T, H =0), and the describing the effects noted above, and indeed
isothermal susceptibility evaluated in zero field XT(T, we know that interaction energies in a system
H =0). Both functions are singular at the critical fall off with distance between the magnetic
point of a simple magnet. moments. Thus one reason for studying critical
245 CRITICAL PHENOMENA

phenomena is that it provides a testing ground


for theories that concern the microscopic inter- o
actions in matter.
A second reason for the burgeoning interest
in critical phenomena is that a wide variety of
seemingly disparate physical systems are found
to behave quite similarly near their "critical
points." For example, phenomena analogous to
those shown in Fig. I are found in a liquid-gas
system (see LIQUID STATE), in a ferroelectric,
in a superfluid, in a superconductor, and in a -4 -3 -2
binary alloy or mixture. Accordingly, workers
from a variety of disciplines have been drawn IT - Tc I
log ---"--
together by the problem of critical phenomena, Tc
adding to the interdisciplinary flavor of the FIG. 4. Sketch of definition of the critical-point
field. exponent 13 [M-ITI13j to illustrate the general defini-
"What Do We Actually Do?" The answer to tion (8).
this question may be given on many different
levels, but basically it is that we simply want to
understand on a microscopic level what forces
exist between the constituent particles in mat- and T == T - Tc ~ o. Accordingly, we define the
ter, and how they "contrive" to produce the three exponents 0:, (3, and 'Y in Table 1 to de-
anomalies discussed above and sketched in Fig. scribe, respectively, the behavior of the specific
heat, the spontaneous magnetization, and the
I. As a first step in this direction, it is useful to
provide a quantitative measure of the isothermal susceptibility along this path.
phenomena. The second column gives a typical range of
It is found that in almost all systems the lim-
experimental values for the exponents, while
iting behavior of the functions of interest is the third column gives the values predicted by
well described by a simple power law-so that ifthe mean field theory. The reader would do
one plots one's data on log-log paper, the datawell to verify from inspection of Fig. 3 that the
fall on a straight line sufficiently near the criti-
mean field exponents are correctly listed. That
cal point (see Fig. 4). It is important to empha-
the agreement is far from perfect is consistent
size that the power law behavior sets in for with the discrepancies between Figs. 3(a) and
3(b) (experiment and mft).
some systems only quite near the critical point,
and when we write Theorists have endeavored to calculate the
exponents for as many models as possible. For
[(x) '" xl) , (7) example, the fourth column shows the expo-
nents calculated for the two-dimensional Ising
meaning "[(x) varies as x to the e power for x model, for which many of the zero-field prop-
near zero," we mean, formally, that erties are known exactly. We see that these ex-
ponents agree no better with experiment than
e
== lim log [(x) . (8) do the mean field exponents-in fact, they err
x-+o log x in the opposite direction, with the experimental
numbers lying in between the mft and two-
There is in general a different exponent for dimensional Ising predictions.
each function and each path of approach to the We shall subsequently argue that the reason
critical point. The thermodynamic functions that the two-dimensional Ising model disagrees
considered above all concern approaches to the with experiments on three-dimensional systems
critical point (T = T c , H = 0) in which H = 0 is that the system dimensionality plays a crucial

TABLE 1. DEFINITIONS AND TYPICAL VALUES OF SELECTED


CRITICAL-POINT EXPONENTS FOR A SIMPLE MAGNET.

Mean Field
Exponent Definition Experiment Theory (mft) d = 2 Ising d = 3 Ising

a,',a CH - (-r)-O:' -0.1 to 0.2 0 0 ::::: 1/8


- T-O:
13 M - (-T)13 0.2 to 0.4 1/2 1/8 ::::: 5/16
r', r xr - (-T)-r' 1.1 to 1.5 1 7/4 ::::: 5/4
- T-r
0:' + 213 + r' :::::2 2 2 :::::2
CRITICAL PHENOMENA 246

role in determining critical-point exponents. In- tion f(x, y, z, ... ) is a GHF if we can find func-
deed, the fifth column, showing the results of tions gx(A), gy(A), ... ,U(A) such that for all
numerical approximation procedures for the positive A,
three-dimensional Ising model, is seen to agree
rather better with the experimental results. f(gx(A)X,gy(A)y,.··J = gt(A)f(x,y, ... ], (10)
Scaling Hypothesis One could continue dis-
cussing exponents for other systems, for other where the functions gj (A) are arbitrary except
functions, or for other paths of approach to the that 3they possess inverses. It is elementary to
critical point. Until a few years ago this discus- show that (10) is equivalent to the statement
sion would serve to summarize most of the re- that there exist numbers ax, ay, ... , at such
search activity in critical phenomena. However, that
we are sooner or later going to seek a deeper f(AaXX,AaYy,· .. )=Aatf(x,y, ... ). (11)
understanding of the exponents, and this under-
standing is just beginning to arise. It is coming The scaling hypothesis is just that-a hypothe-
not from the exact solutions of model systems- sis. It involves GHFs, and because of the prop-
for which the complexity of the derivation all erties of GHFs, it can be made about a variety
but totally obscures any physical insights con- of functions. One can make a scaling hypothe-
cerning the magnitude of the exponent ob- sis about three different classes of functions:
tained-but rather from an altogether different thermodynamic functions (TF), static correla-
approach. tion functions (SCF), and dynamic correlation
This approach began historically with the in- functions (DCF).
troduction of rigorous relations among the criti- TF hypothesis: Close to the critical point
cal-point exponents-these relations took the T = H = 0, the singular part of the Gibbs poten-
form of inequalities, and generally involved tial per spin G(r, H) is "asymptotically" a
three exponents. The inequality involving a/,~, GHF.
and -y/ is simply SCF hypothesis: Close to the critical point
and for large I r I , the static correlation function
a/ + 2~ + -y/ ~ 2, (9) C2(r, H, r) is aGHF.
DCF Hypothesis: Close to the critical point
and is generally called the Rushbrooke inequal- and for large I r I and t, the dynamic correlation
ity.2 That model systems appeared to satisfy function C (T ,fl, r, t) is a GHF.
Eq. (9) (and most of the other "rigorous" in- 2
equalities) as equalities (Table I), and that most One can show that these statements are not
experimental systems were not inconsistent entirely independent of one another and that in
with the possibility that (9) is an equality, did fact:
not go unnoticed. However, all attempts to rig- DCF hypothesis => SCF hypothesis
orously prove (9) as an equality have been sin-
gularly unsuccessful. => TF hypothesis.
It is in the finest tradition of theoretical phys-
ics that when one cannot solve the original The TF hypothesis predicts that Eq. (9) holds
problem, one seeks to replace it with a simpler as an equality, and thus is consistent with the
problem that one can solve. In this instance, the numbers shown in Table I. However, it also
"breakthrough" occurred in 1965-two years makes other predictions, not anticipated before.
after Rushbrooke proposed the Rushbrooke in- For example, it predicts that all data taken near
equality-bl many investigators working inde- the critical point can be made to "collapse"
pendently. Their work generally goes by the onto a single curve providing the data are
name of the "homogeneity" or "scaling" hy- plotted in the correct units. The single curve is
pothesis, and is perhaps most easily formulated called a "scaling function," and examples of
in terms of a class of functions called general- scaling functions are shown in Fig. 5 and de-
ized homogeneous functions (GHFs). A func- fined in Table 2. The scaling function has just

TABLE 2. SUMMARY OF SCALING FUNCTIONS


PREDICTED BY THE SCALING HYPOTHESIS FOR
THE M-H-T EQUATION OF STATE. HERE Ii IS
DEFINED BY H - Mli. SKETCHES OF THE
THREE FUNCTIONS ARE SHOWN IN FIG. 5

Function Dependent Variable Independent Variable

Hr ==H/ I rl(3li Mr==M/lrl(3


HM==H/IMIIi rM == r/ 1M 11/(3
MH==M/IHII/ii rH==r/IHI 1/(31i
247 CRITICAL PHENOMENA

ning a n dimensional space, i.e.,

§ == (S 1, S2, ... , sn), (12a)


with

Thus the Hamiltonian under consideration iss


n
JC=- I: I: JI.-'!' (a) SI. a SI.,a .(13a)
1:. ,1.,' 01=1

Note that when n = I, this Hamiltonian re-


duces to the Ising model, since the spins be-
come simply one-dimensional "sticks" capable
of assuming the two discrete orientations +I
(up) and -1 (down).
For n = 2, the Hamiltonian (13a) describes a
.... system of two-dimensional vectors, and is gen-
" .... , , erally called the XY of "plane rotator" model.
, It has also been called the Yaks-Larkin model,
because Yaks and Larkin have considered it a
lattice model for the superfluid transition in a
Bose fluid.
For n = 3, (13a) describes the general aniso-
tropic Heisenberg model, which has proved par-
ticularly useful in describing a variety of mag-
netic materials near their critical points.
For ii -+ "", (13a) reduces to the spherical
model."
A study that has recently been done concerns
which parameters in the general model Hamil-
tonian (13a) are important for determining the
values of critical-point exponents, and which
FIG. 5. Sketches of the three possible "scaling are "irrelevant." To this end, one considers
functions" for interpretation of MHT equation of (1) the lattice dimensionality, d;
state data near the critical point. A dashed line (2) the spin dimensionality, n,
indicates that data are not taken for this "branch" of (3) the "lattice anistropy" or "nonuniform-
the scaling function due to MH symmetry. ity of the interaction" -i.e., the de~endence of
Jr-r,a upon the direction of r - r .7 There are
many important materials for which the inter-
recently been calculated directly for the Heisen- action strength in one crystal direction is differ-
berg model, and agreement with experimental ent than in another direction, so this is not "of
data on magnetic systems is striking.4 merely academic interest."
Universality Hypothesis The universality hy- (4) the "spin space anistropy," or depen-
pothesis has arisen from attempts to answer the dence of J,-r,a upon Q. Probably no material is
question "On what features of a model do criti- perfectly isotropic and there are certainly some
cal properties (exponents, scaling functions, materials for which the anistropy is believed to
etc.) depend?" Universality states that the only playa very important role. 8
properties are the dimensionality d and the (5) the range of interaction, or dependence
"symmetry" n, where n actually denotes that of Jr_"a upon r - r'. Again, there are probably
dimensionality of the "order parameter" or spin no materials for which the commonly made as-
space. sumption that only nearest neighbors of one
To make this concept more precise, let us in- another interact, and that all other pairs of
troduce a general Hamiltonian which encom- spins are completely coupled, is valid. 9
passes, as special cases, a very large proportion (6) the spin quantum number, S . 10 The Ham-
of the models that exhibit critical phenomena. iltonian (13a) was for classical spins, and cor-
This Hamiltonian utilizes the concept of classi- responds to the S = 1/2 Ising model and the
cal spin vectors, situated on sites! and!' in a S -+ "" limits of the other systems. Materials in
d-dimensional lattice, and interacting with en- nature have a wide range of spin quantum num-
ergy parameters J,-r,a that depend upon the ber S, and accordingly we must consider what
vector (r - r'). The fmportant thing is that the happens to these models for general quantum
spin vectors are taken to be unit vectors span- number S.
CRITICAL PHENOMENA 248

The conclusion of this work is that the only 13. Herzfeld, I., and Stanley, H. E., J. Mol. Bioi. 82,
features of Eq. (13a) that are important are lat- 231 (1974).
tice dimensionality d and the symmetry of the 14. Stanley, H. E., in "Proceedings of the Enrico
ground state (which can be measured by an Fermi Summer School on Phase Transitions,"
"effective dimensionality" of the spin). This K. A. Muller, Ed., London and New York, Aca-
conclusion is sometimes called "the universality demic Press, 1973.
hypothesis," and it has the implication-if be- 15. Stanley, H. E., Ed. "Cooperative Phenomena near
lieved-that one need only consider two param- Phase Transitions: A Bibliography with Selected
eters, d and n. 11 ,12 Hence (l3a) may be re- Readings," Cambridge, Mass., MIT Press, 1973.
placed bys 16. Stanley, H. E., Ed., "Biomedical Physics and Bio-
materials Science," Cambridge, Mass., MIT Press,
n 1972.
JCu(d, D) == -J L L S.lOt S.!+6 Ot (l3b)
.!,S Ot=I Cross-references: ANTIFERROMAGNETISM, COM-
PRESSIBILITY, FERRIMAGNETISM, FERROELEC-
where here {j denotes a vector between nearest- TRICITY, GAS LAWS, GASES: THERMODYNAMIC
neighbor pairs of lattice sites. PROPERTIES, HEAT, HEAT CAPACITY, LIGHT
Conclusion and Outlook In this all-too-brief SCATTERING, LIQUID STATE, MANY BODY
introduction to the subject of critical phenom- PROBLEM, MATHEMATICAL BIOPHYSICS, POLY-
ena we have perforce concentrated our dis- MER PHYSICS, SUPERCONDUCTIVITY, SUPER-
cussion on a single type of system, namely a FLUIDITY.
simple magnet. One principal attraction, as
noted above, is that a great variety of physical
systems appear to behave quite similarly near CROSS SECTION AND STOPPING POWER
their critical points. Indeed, the role of cooper-
ative phenomena (and the proposed role of Cross section, a, is a conceptual quantity widely
phase transitions) in certain biological systems used in physics, particularly in nuclear physics,
has long been a subject of fascination for many, to represent the probability of collision between,
and it may be that the concepts that are even- particles. For example, if a beam of neutrons
tually uncovered in our attempts to under- @ is incident from the left on a nucleus ®, a
stand physical systems near their critical points certain fraction of the neutrons will be removed
will prove useful in elucidating biological from the beam by interaction with A.
behavior. 13-16
H. EUGENE STANLEY -{§)~-----

References
~------
1. Stanley, H. E., "Introduction to Phase Transitions
----@-----
and Critical Phenomena," London and New York, ~------------
Oxford University Press, 1971. Second Edition
1983. By definition, a is the fraction of neutrons,
2. Rushbrooke, G. S., J. Chern. Phys., 39, 842 contained in I cm 2 of beam, that interact with
(1963). A.
3. See, e.g., the recent GHF approach of A. Hankey For a thin layer of material of thickness dx
and H. E. Stanley,Phys.Rev.,86,3515 (1972); a containing N nuclei/cm 3 the number of nu-
comprehensive list of references to all earlier work clei/cm 2 is N dx. If the flux of incident neu-
is found therein. trons is ¢J/cm2 , the fractional decrease in
4. Milosevic, S., and Stanley, H. E., Phys. Rev., 85 traversing the thin layer will be:
2526 (1972); 86, 986 (1972); 86, 1002 (1972).
5. Stanley, H. E., Phys. Rev. Letters, 20, 589 (1968). - (d¢J/¢J) =Na dx (I)
6. Stanley, H. E., Phys. Rev., 176,718 (1968).
7. Paul, G., and Stanley, H. E., Phys. Rev., 85, or for a finite thickness x and incident flux ¢Jo:
2578 (1972).
8. lasnow, D., and Wortis, M., Phys. Rev., 176,739 ¢J = ¢Joe-Nax (2)
(1968). For a given type of nucleus there are, in general,
9. Paul, G., and Stanley, H. E., Phys. Rev., 85, 3715 a number of possible interactions, a, b, c, ....
(1972). The total cross section at is the sum of the cross
10. Lee, M. H., and Stanley, H. E., Phys. Rev., B4, sections for the individual interactions:
1613 (1971).
11. Kadanoff, L. P., in "Proceedings of the Varenna at = Ga + Gb + Gc ••• etc. (3)
Summer School on Critical Phenomena," M. S.
Green, Ed. London and New York, Academic A convenient unit for nuclear cross sections is
Press, 1972. the barn, an area of 1O-24 cm 2, which is ap-
12. Griffiths, R. B., Phys. Rev. Letters, 24, 1479 proximately equal to the cross-sectional area
(1970). of medium weight nuclei.
249 CRYOGENICS

The stopping power, Sp, of a material for an refrigerants. Liquid oxygen at 90 K is used
incident particle is the quantity - dT/dx, i.e., extensively in the steel industry and in provid-
the energy loss per unit length of path, generally • ing hospitals with oxygen gas for breathing. In
expressed in ergs per centimeter. This type of addition, liquid oxygen and liquid hydrogen are
attenuation is used primarily in considerations used as rocket fuels in the space program. Liquid
of the passage of heavy charged particles, nitrogen, which is relatively inert, quite inex-
such as protons, deuterons and alpha particles, pensive (~$O.l5/liquid liter), and has a large
through matter. For example, for a particle of latent heat at 77 K of 1.6 X lOs joules/liter, is
any spin having a rest mass M(~mo, the rest used as a powerful refrigerant in a variety of
mass of an electron), charge ze, and velocity applications including the frozen foods industry,
V(={k), the energy loss (as excitation and ioni- the biological storage of tissue and live cells,
zation) per element of path - dT/dx to a homo- the refrigeration of sensitive detectors of elec-
geneous medium containing N atoms/cm 3 , each tromagnetic radiation, and in the production
of atomic number Z, is given by: and storage of colder liquid cryogens such as
hydrogen at 20.3 K and helium at 4.2 K. Per-
haps the largest use for liquid nitrogen, however,
dT 41Te 4 Z2 [ 2m o V2 ] is in the storage and transfer of nitrogen to
- - = - - N Z In - - -In (1- (32) - {32
dx moV2 I provide clean and relatively inert atmospheres
in the chemical and semiconductor industries.
(4) Liquid 4He at 4.2 K is at present used almost
exclusively in laboratory applications, but may
where I is the mean atomic excitation potential soon find its way into hospitals and even into
calculable from the Thomas-Fermi electron dis- the business world thanks to new technologies
tribution function to be I = kZ ~ 11.5 Z electron being developed based on superconductivity in
volts. metals. Superconductivity is a phenomena in
The relative stopping power S is the inverse certain metals where below a transition tem-
ratio of the length of a material to the length of perature Tc the electrical resistance of the
a standard substance having equivalent stopping metal vanishes completely, and electrical cur-
power (usually referred to aluminum as So = I) rents can flow with no dissipation of energy.
at 15°C and 76 cm pressure: For most elemental superconductors, Tc lies in
the range of I K to about 20 K. If a magnetic
(dTldx) I NIBI plBIAo
S= =--= (5) solenoid wound of superconducting wire is
(dT/dx)o NoBo PoBoA I energized and its leads are then joined together,
the current source can be removed and the mag-
where P is the density, A the atomic weight, netic field will remain as the current continues
and B = Z In 2mo J12 II from Eq. (4). to flow through the solenoid. Magnetic fields as
high as 120,000 Gauss can be produced with
CLARK GOODMAN such solenoids, which decay no more than one
part in 108 per day. Less stable superconduct-
Cross-references: COLLISIONS OF PARTICLES, ing magnets have produced fields as high as
NUCLEAR REACTIONS, NUCLEAR REACTORS, 165,000 Oe. A conventional room temperature
RADIOACTIVITY. magnet able to produce such a high field would
dissipate many millions of watts of electrical
energy in maintaining the field, and this energy
as heat would then have to be removed or the
CRYOGENICS magnet would melt. Applications for supercon-
ducting magnets include uses in nuclear magnetic
Cryogenics is the science of very low tempera- resonance (NMR) studies and in some large
ture refrigeration and the study of phenomena electric motors. NMR spectrometers capable of
which occur at these temperatures. In a general observing resonances in a whole human body are
sense, cryogenics can be divided into two parts: being built utilizing superconducting magnets
the methods used to obtain and maintain very which can produce three-dimensional images
low temperatures, and the special techniques of the subject. Such devices may soon replace
associated with measuring physical properties, x-ray machines as diagnostic medical tools.
particularly temperature, in the low tempera- In addition to the absence of electrical resis-
ture environment. Cryogenic temperatures are tivity, superconductors possess a variety of
usually defined as those below about 90 K, (90 other exotic properties. For example, the cur-
Kelvin) the liquification point of oxygen, al- rent flowing in a closed superconducting loop is
though in the laboratory temperatures as low quantized in units of hl2e, where h is Planck's
as 0.005 K (5 mK) are becoming commonplace. constant and e is the absolute value of the charge
Applications Industrial cryogenic applica- on an electron. Superconducting quantum in-
tions include the shipment and storage of liqui- terference devices (SQUIDs) are able to use this
fied gases such as nitrogen, oxygen, and natural fact and the tunneling properties of electrons in
gas fuel, and the use of these fluids through superconducting weak links to measure minute
evaporation, particularly liquid nitrogen, as magnetic fields. For example, the fields pro-
CRYOGENICS 250

duced by electrical activity in the human brain, ing from room temperature such refrigerators
at the level of 10- 12 Gauss are being studied must operate in series with other refrigerators
with SQUID devices in a number of low tem-· using working fluids with higher temperature
perature laboratories. Other applications of critical points, the hottest being above room
superconducting quantum tunneling have re- temperature.
sulted in computer logic elements smaller and By far the most widely used refrigeration
faster than conventional semiconductor devices. process involves the expansion of a gas under
Such elements may ultimately be utilized to elevated pressure through a porous plug, the
produce super powerful computers operating Joule-Thompson process. In such an expansion
at 4.2 K. an ideal noninteracting gas will neither warm
Cryogenics serves three major purposes in the nor cool; however, with real gases the expan-
physics laboratory. First, many physical proper- sion alters the average potential energy between
ties are temperature dependent, and can be the gas particles, and as a consequence the
more easily understood if studied as functions temperature of the gas is changed. If the kinetic
of temperature rather than at a single tempera- energy of the gas particles is sufficiently low
ture. Second, thermal energies often mask im- (at low temperatures) so that the atoms or
portant features which, when discovered and molecules feel mainly the attractive part of
understood, allow a far deeper understanding of the interatomic (intermolecular) forces during
physical behavior to be obtained. One example, collisions, the expansion will result in cooling;
of such a feature was the discovery by de Haas however, if the temperature is sufficiently high
and Van Alphen in 1930 of oscillations in the so that the gas particles feel mainly the repulsive
magnetic susceptibility of bismuth at low tem- short range interactions, the expansion will
peratures in a high magnetic field. This observa- result in warming. In the limit of hard sphere
tion later provided the key to understanding gas particles with no attractive interactions and
complicated electronic properties of many met- an infinite repulsive interaction, the volume
als. Thirdly, phenomena such as superconduc- excluded by the particles themselves causes the
tivity can only exist at low temperatures. Super- pressure to drop further during the expansion
fluidity in liquid 4He below 2.17 K and 3He than it would for an ideal gas. As a result of this
below .0027 K are other examples of ordered excess drop in pressure, more mechanical energy
states which can only exist at very low tempera- is expended in forcing the gas though the plug
tures. In these states the viscosity of the fluids than can be recovered from the expanding gas
vanish and they are able to flow without dissi- on the low pressure side. This difference in
pating energy: a true case of the frictionless energy results in heating of the gas. Not surpris-
pulley. In the case of liquid 3He the fluid be- ingly, the maximum temperature at which cool-
comes anisotropic, exhibiting liquid-crystal-like ing will occur, the inversion temperature, is
textures and many unique magnetic properties. related to the boiling temperatures of the liqui-
In all superfluids heat can propagate as a wave, fied gases, since both temperatures depend on
as opposed to the usual diffusive heat flow, and the interatomic forces. Helium (4He), for in-
in the case of one of the superfluid phases of stance, boils at atmospheric pressure at 4.2 K
3He, this heat wave carries magnetization with and has an inversion temperature of 51 K. Hy-
it. drogen boils at 20.4 K and has an inversion
In other low temperature studies, physicists temperature of 205 K. Nitrogen and oxygen
have been able to form crystalline lattices of boil at 77.3 K and 90.2 K respectively, and have
free electrons on the surface of superfluid 4He inversion temperatures of 621 K and 893 K.
and measure their mechanical properties. In Since helium and hydrogen have inversion tem-
additional low temperature studies of two- peratures below room temperature, they cannot
dimensional electron gases, such as those found be used for this type of cooling process directly,
in MOS-FET devices, a quantized Hall effect is but must be precooled by a prior refrigeration
observed when the samples are placed in a very process to well below room temperature. Joule-
intense magnetic field. This unusual quantum Thompson expansion is the process most gen-
effect is providing a new and better standard erally used in the liquification of oxygen and
for the unit of electrical resistance. nitrogen, however, and it is the final refrigera-
Refrigeration Most refrigeration processes tion process used in the liquification of helium.
used to obtain very low temperatures involve If rather than allowing a gas to expand through
the change of state of some fluid refrigerant. a porous plug as in the Joule-Thompson process,
The most simple refrigerators involve the liqui- one allows the gas to expand against a piston or
fication of a gas under elevated pressure with other moving wall (such as the vane of a gas
the latent heat of condensation being removed turbine), any gas will cool at any starting tem-
to the outside world, followed by evaporation perature. This occurs because on the average
at a much lower pressure and temperature. Such when the gas particles recoil off the moving
processes will operate only between their criti- surface in an expanding volume they have lower
cal temperatures and their triple point tempera- kinetic energies than they had prior to their
tures. Oxygen, for example, has a critical point collisions with the moving wall. From a macro-
at 50 atmospheres pressure and 154 K. Its triple scopic viewpoint the gas is doing work against
point is at 54.4 K. To provide continuous cool- the moving piston, Po V. Expansion engines
251 CRYOGENICS

using either moving pistons or turbines are The 3He which is diluted into the 4He-rich
almost always employed in the liquification of fluid is withdrawn from it by connecting the
helium, although a three-stage Joule-Thompson region where the mixing occurs, the mixing
process to liquify helium using nitrogen, hydro- chamber, to a still by a long capillary tube. The
gen and helium is possible. The disadvantage of still temperature is regulated at about 0.8 K,
expansion engines is that they involve closely where liquid 3He has a much higher vapor pres-
fitting moving parts which require very high gas sure than liquid 4He. Thus a pump which re-
purity. Ice and frozen air are constant hazards moves the vapor above the liquid in the still
in such machines. distills the 3He out of the 4He-rich phase. The
Liquid 3He and 4He hold a special place in 3He removed from the still is pressurized to
the field of cryogenics, largely because they are about 0.1 atmosphere and is reliquified in a
the only stable materials available to man which pumped 4He bath. It is then returned to the
remain in a fluid state as their temperature is mixing chamber through a series of heat ex-
lowered arbitrarily dose to absolute zero. (Both changers which transfer heat from the warm
can be s?lidified near absolute zero, however, returning liquid to the cold dilute 3He rising
by applymg pressures on the order of 30 atmo- up to the still.
spheres.) This fact not only makes 3He and There is one additional refrigeration process
4He unique materials insofar as low tempera- involving 3He called Pomeranchuk or compres-
ture physics is concerned, it also allows them sional cooling. It relies on the unusual property
to be useful refrigerants to exceedingly low that the liquid phase of He 3 below 0.3 K is
temperatures. more highly ordered than the solid phase. This
Liquid 3He and 4He have boiling temperatures is due to the random orientations of the 3He
at atmospheric pressure of 3.2 K and 4.2 K, nuclear spins in the solid, which give it a sub-
respectively. By pumping away the vapor above stantial disorder even at low temperatures. If
the liquid phase, however, one can lower the one solidifies liquid 3He below 0.3 K (about
boiling points to about 0.25 K for 3He and 34 atmospheres at 0.02 K), as solid 3He forms
0.7 K for 4He. Below these temperatures the it absorbs heat from the surrounding liquid
vapor pressures above the liquids, which are and the system cools. A carefully designed
decreasing exponentially with decreasing tem- Pomeranchuk cell precooled with a dilution
perature, are insufficient to allow significant refrigerator to about 25 mK can reach about
amounts of liquid to be evaporated, and the 0.8 mK for short periods of time and can
cooling capacities of 3He and 4He pumped maintain 1.5 mK for several hours. Lower
cryostats drop rapidly to zero. temperatures are not feasible because at 1.0 mK
In the mid-1960s a novel refrigeration device the nuclear spins in the solid order antiferro-
called the 3He-4He dilution refrigerator was magnetically, and the difference in the degree
developed which overcame the problems of the of order between the liquid and solid drops
vanishingly small vapor pressures of 3He and rapidly to zero. Although not an important
4He at low temperatures. This device virtually cooling process for external samples, this self-
revolutionized low temperature physics during cooling of 3He has been crucial in experiments
the 1970s, and one can now purchase a dilution designed to study the unusual superfluid phases
refrigerator capable of continuous refrigeration of liquid 3He below 2.7 mK and the nature of
down to 0.004 K (4 mK) for about $100,000. nuclear magnetic ordering in solid 3He below
The key to the operation of the dilution re- 1.0 mK.
frigerator is the finite solubility of 3He in super- In many ways the most powerful and by far
fluid 4He. As 3He is added to liquid 4He at low the most common technique now used to obtain
temperatures (below about 0.2 K), the 3He temperatures below those available with dilution
mixes with the 4He to a maximum concentra- refrigerators is that of adiabatic demagnetiza-
tion of about 6%. In the mixing or dilution tion. In this technique a collection of spins,
process the distance between the 3He atoms either electronic or nuclear, is partially ordered
increases and in the process the 3He absorbs by the application of a large magnetic field. The
heat. The superfluid 4He has very low specific magnetic moments associated with the spins
heat and thermal conductivity at these temper- tend to align along the direction of the magnetic
atures, and in fact, the 3He atoms do not scatter field. The system is precooled, usually with a
off the 4He atoms in the superfluid. At these dilution refrigerator, until some significant frac-
temperatures superfluid 4He is often referred tion of the spins are aligned (the net polariza-
to as a dense vacuum. In a crude sense, one can tion varies as 1IT for small polarizations). Then
therefore consider the mixing of 3He in super- the magnetic refrigerant is thermally isolated
fluid 4He to be a form of evaporation. However, from the precooling stage, and the magnetic
the density of 3He in the 4He is so high and the field is lowered. As the field decreases, thermal
temperature is so low that this simple model excitations flip the aligned spins against the
breaks down. In fact, the heat absorbed by a magnetic field and in the process thermal energy
quantity of 3He mixing with superfluid 4He is is absorbed. If the heat capacity of the magnetic
proportional to T2. A moderately powerful coolant totally dominates the heat capacity of
dilution refrigerator has a cooling capacity at the sample being cooled, the_ ratio of the tem-
10 mK of only a few micro-watts. perature to the sum of the external and effec-
CRYOGENICS 252

tive internal magnetic fields stays constant. For anything whose resistance is temperature de-
nuclear coolants such as copper, the internal pendent in a reproducible way can serve as a
fields, which arise largely from the magnetic secondary resistance thermometer.
moments associated with the spins, are only a Below about 20 mK a variety of Curie law
few Gauss. If the external field is lowered from thermometers are generally used. These rely
80,000 Gauss to 80 Gauss, the temperature of on the temperature dependence of the polari-
the copper nuclear spin system will drop by a zation of either electronic or nuclear magnetic
factor of 1000. If the starting temperature is moments in a magnetic field. Detection of the
10 mK, the final nuclear temperature will be polarization can be either through a mutual
10 ilK. Electronic moments and hyperfine inductance measurement, a static magnetization
enhanced nuclear moments may be polarized measurement, nuclear magnetic resonance, or a
more easily at higher temperatures due to their measurement of the anisotropy of gamma radi-
larger magnetic dipole moments, but due to ation emitted from polarized radioactive nuclei.
higher internal fields, they do not get as cold. Nuclear magnetic resonance of platinum wire or
The lowest temperature one can reach with powder is the thermometer of choice below
even a very dilute paramagnetic salt such as about I mK, and the lowest temperatures mea-
cerium magnesium nitrate is about 2 mK. sured this way are somewhat below SO ilK.
Although one can achieve nuclear spin tem- Certain other thermometry schemes are also
peratures well below I ilK in copper nuclear occasionally used at ultralow temperatures,
demagnetization cryostats, heat leaks have not such as those based on the temperature depen-
allowed external samples to be cooled to this dent dielectric constant of certain materials (a
very low temperature. Thermal conductivities scheme virtually immune to even very large
vanish as the temperature approaches absolute magnetic fields), the thermal noise power (John-
zero, and even heat leaks as small as 10-9 watts son noise) of a suitable resistor, and the 3He
become important in determining sample tem- melting pressure. In the range from about 100
peratures. External metallic samples have not mK to just above I mK a simple 3He melting
been cooled much below SO ilK with copper pressure thermometer can be produced with a
demagnetization refrigerators, and 3He, which resolution of somewhat better than .0001 mK
is harder to cool, has not been below about or 100 nano Kelvin.
120 ilK. The major difficulty associated with ther-
Thermometry There are a wide variety of mometry schemes below 0.3 K is the absence
thermometry schemes for measuring cryogenic of suitable calibration or fixed points whose
temperatures. Most are calibrated against certain temperatures have been established by thermo-
easily reproducible fixed points whose absolute dynamic means. A series of superconducting
temperatures are established by some thermo- transition temperatures has been established by
dynamic means such as the ideal gas law ther- the U.S. National Bureau of Standards recently,
mometer. Down to 90 K the accepted inter- but the lowest is at about IS mK. Ultimately
national thermometry scale is based on the elec- the super fluid 3He transition temperatures at
trical resistivity of platinum wire, which is nearly various sample pressures should provide a highly
proportional to temperature in this range due reproducible temperature scale from 1.0 mK to
to electron-phonon interactions. The vapor 2.7 mK, and almost all provisional temperature
pressures of certain gases in equilibrium with scales in this range are currently being referenced
their liquid phases provide highly reproducible to superfluid 3He transition temperatures.
thermometry scales over limited temperature The engineering aspect of cryogenics is itself
intervals. In particular, the vapor pressures of a very challenging area. The properties of ma-
4He from 5 K to I K and 3He from about 3 K terials near absolute zero are generally quite
to 0.5 K are internationally accepted tempera- different from those exhibited at room tem-
ture scales. Thermocouple devices, particularly perature. In particular, thermal contraction of
gold doped with iron vs chromel, can provide materials being cooled to 4.2 K can amount to
useful thermometry down to I K and below, changes in length as large as I % or as small as
although their use .in the laboratory has waned 0.1 %. Materials which fit together well at room
in recent years. temperature may bind or leak badly at low
By far the most convenient and prevalent temperatures. Some become too brittle to be of
secondary thermometry schemes are those based any use, while the mechanical properties of
on the temperature dependent resistance of others improve dramatically. The problems of
some stable device. Pure metal wires are com- providing adequate thermal contact between
monly employed down to liquid nitrogen the different parts of a cryostat and adequate
temperatures. Commercially available carbon heat exchange between warm and cold cryogenic
resistors are the cheapest and most prevalent fluids are enormous. Often the low temperature
thermometers, being useful from close to room physicist or cryogenic engineer must exhibit
temperature down to temperatures as low as considerable resourcefulness in utilizing the
10 mK. These thermometers do not tend to be unusual properties of materials at low tempera-
very reproducible, however, and for high preci- tures to overcome technical problems. Consid-
sion work doped germanium resistors are gen- erable progress is being made, however, and the
erally used down to about SO mK. Virtually facility with which one can now produce and
253 CRYSTAL STRUCTURE ANALYSIS

maintain very low temperature environments is reciprocal lattice), and if the x-rays are mono-
vastly expanding the applications of the cryo- chromatic, each spectrum (or "reflection") ap-
genic science. pears as a spot-image of the crystal on the
photographic film, or a sharp peak on the dif-
D.D.OSHEROFF fractometer trace. Within the limitations of
Friedel's Law (IF(hkl)i2 = IF(hkl)12, so that a
References center of symmetry is always present in the dif-
fraction effects when the scattering is purely
Wilks, 1., "The Properties of Liquid and Solid Helium," elastic), such observations show the crystal sys-
London, Oxford Univ. Press, 1967. tem, point symmetry, and dimensions of the
White, G. K., "Experimental Techniques in Low Tem- unit cell of the crystal. Further information
perature Physics," second edition, London, Oxford about glide and screw symmetries may lead to a
Univ. Press, 1968. unique assignment to one of the 230 space
Lounasmaa, O. V., "Experimental Principles and Meth-
ods Below 1 K," New York, Academic Press, 1974. groupS (such as the common P2 1 /c, Pcab or
Bennemann, K. H., and Ketterson, J. B., "The Physics P2 1 21 21 ), but may leave a 2- to 4-fold ambi-
of Liquid and Solid Helium Part II," New York, guity. The space group assignment and unit cell
John Wiley & Sons, 1978. dimensions place severe, often crucial restric-
tions on the atomic arrangement in the crystal,
and are a necessary prerequisite for any crystal
Cross-references: CONSER VATION LAWS AND analysis.
SYMMETRY, DE HAAS-VAN ALPHEN EFFECT, The intensities of the spectra, or reflections,
ENTROPY, HEAT TRANSFER, LIQUEFACTION OF form the basis of the determination of the
GASES, HEAT, SUPERCONDUCTIVITY, SUPER- atomic arrangement. The determination (using
FLUIDITY. x-rays) is based on the development of the elec-
tron density distribution in the unit cell by
means of the relationship:
CRYST AL STRUCTURE ANALYSIS
p(x,y, z) =
Diffraction of radiation by the ordered, peri-
odic arrangement of atoms in a crystal yields
mainly information about: (1) the nature of the
1- L L L F(hkl) e-21ri (hx+ky+lz) (1)
radiation (spectrum); (2) the geometry of the V h k 1
crystal (repeat unit and symmetry); and (3) the
distribution of the scattering material in the re- in which V is the volume of the unit cell. The
peat unit of the crystal. In the last category lies structure amplitudes F(hkl) are obtained exper-
the powerful technique of crystal structure imentally from
analysis as revealed primarily by x-ray diffrac- K2
tion, and also by neutron and electron diffrac- IF(hkl)1 2 = -I(hkl) (2)
tion. With x-rays the electron density distribu- ALp
tion is determined, and from this rather precise
information about atomic locations (±O.OOS A), where A is an absorption factor, L is a geomet-
as well as thermal motions, polarization effects, ric correction (Lorentz) factor and p is the
and disorder phenomena, is obtained. Neutrons polarization factor; I(hkl) is the total intensity
yield information about positions and thermal in the reflection peak (integrated intensity).
motions of atomic nuclei, and also about dis- The scale factor K is not usually determined,
tributions of magnetic and spin vectors in the and so the observed data set consists of the rela-
crystal. tive structure amplitudes I F(obs) I = IFlfK,
Experimental Aspects When atomic proper- each indexed and identified by the Miller in-
ties are of primary importance, the radiation dices h, k, and I.
used is monochromatic, usually generated as Though the number of terms in (1) is theoret-
characteristic x-radiation from a specified target ically infinite, the actual number is always lim-
material in a Coolidge tube (typically CuKQ<, ited by the wavelength and the angular range
A = 1.5418 A, or MoKo:, A = 0.7101 A). The available. With x-rays (but not with neutrons)
diffraction effects are observed by various pho- the average I FI values decrease rapidly with
tographic methods, or by means of a gonio- Bragg angle, so that often no intensities can be
meter or diffractometer fitted with a pulse measured at higher angles. Within these limita-
counter. tions, an effort is generally made to collect all
The geometry of a crystal can be determined of the independent relative amplitudes that can
by routine methods, in which the diffraction be produced by a crystal. Such a data set may
effects observed are interpreted in terms of the contain about 1000 (for simple inorganic struc-
reciprocal lattice (see CRYSTALLOGRAPHY). tures) or as many as 30 000 (for proteins)
Various single-crystal film cameras are arranged observations.
to show separately the x-ray spectra produced The Crystal Structure Problem A computa-
by as many gratings in the crystal as possible tion of the electron density distribution in the
(each spectrum corresponds to a node in the unit cell by Eq. (I) will clearlv provide all the
CRYSTAL STRUCTURE ANALYSIS 254

crystal structure information that can be ex- LIiF(obs)l- IF(calc) II


tracted from the measured data. Unfortunately,
there is one link missing between this desired R = (6)
result and the experimental observations, LIF(obs)1
namely the relative phases of the terms in Eq. known as the "R factor," "reliability index,"
(1). Th~se phase angles cannot be determined etc. For analytical purposes a so-called
experimentally and must be derived indirectly. "weighted R factor" is also used:
This situation constitutes the notorious "phase
problem" of crystal structure analysis, a major Lw(IF(obs)l- IF(calc) 1)2
stumbling block in the procedure for the first
50 years following the discovery by Laue of Rw or R2 = - - - - - - - - - - - (7)
x-ray diffraction in 1912, and one which LW
IF(obs)12
strongly affected the character of crystal struc-
ture research during all that time. In the 1960s
where ..;wis the weight of the observation
IF(obs)1. In structure analysis where the ob-
powerful theoretical arguments showed how servations often outnumber the parameters by
the phase information was contained in and more than 10 to 1, the two R indices follow
could be extracted from the intensity distribu- each other closely.
tions within the data set itself. Thus, the chasm If R for the initial model is < 0.50, the model
of the phase problem was finally bridged, so may be at least partly correct, and is then tested
that there is no longer any limit on the com- by forming an electron density synthesis from
plexity of structures that can be determined, Eq. (1) or (5), using only F(obs) terms whose
and crystal structure analysis has become a phases are strongly indicated by Eq. (3). This
standard and exceedingly powerful tool in the map will show the starting partial model plus
service of chemistry and physics. other features in the structure not included in
Trial and Error Methods Formerly, an em- that model (additional atoms, shifts in assumed
pirical trial structure model served as a starting atom positions, etc.), from which an improved
point of an iterative procedure that was used model can be derived. This model is then used
to try to converge on a true set of phases for to calculate new values of F(ca1c), leading fa-
equation (1). From such a model, theoretical vorably to substantially reduced R, and thus
amplitudes and phases can be directly calcu- initiating an iterative process of structure
lated by: refinement.
F(hkl) = L tnlne21ri(hxn+kYn+1Zn) (3) Vector Maps Considerable information about
the structure can be gained from another type
n
of Fourier synthesis that does not depend on
in which In is the form (scattering) factor of knowledge of phase angles, the so-called "Pat-
the atom n (a tabulated function of the Bragg terson function":
angle), tn is a thermal parameter of the form
exp [- (21l'un sin (})2/A2) (Un is the root-mean- P(X,y, z) =
square amplitude of vibration of atom n), and
Xn, Yn, and Zn are the coordinates of the atom -LLL
1
1F(hkl) 12 cos21T(hx+ky+lz) (8)
n. F is thus a complex quantity V h k I

F(hkl) =A (hkl) + iB(hkl) (4) It was shown by A. L. Patterson in 1935 that


this function contains maxima at the points in
where A = F(hkl) cos Q and B = F(hkl) sin Q. In crystal space which represent interatomic vec-
general, the phase angle Q may have any value, tors in the crystal structure standing at the ori-
but if the origin of coordinates is a center of gin, and that the height of these maxima is pro-
symmetry, F must be an even function and portional to the product of the electron densi-
therefore B = O. In that case the phase problem ties of the two atoms forming these vectors.
reduces to a choice of sign for each term in the Thus, if there are n atoms in the unit cell, there
equation will be n(n - 1) peaks in the Patterson map, the
remaining n peaks being superposed at the ori-
p(X,y,Z)=
gin representing self-vectors for each atom. The
Patterson map is a convolution of the electron
!.. L L L ± IF(hkl)1 cos 21T(hx + ky +lz) (5) density over every point in space, or essentially
every atom in the structure, and m~ny highly
V h k I
ingenious techniques have been deVIsed to ef-
This condition enormously simplifies the phase fect a deconvolution. Nevertheless, when n be-
problem. comes very large, interpretation becomes im-
The quality of a given model is judged by possible. The Patterson map and its various
comparison of IF(obs)1 with IF(calc)1 obtained convolutions are called generally "vector maps."
from equation (3). The traditional figure of Obviously, the task of interpretation of a vec-
merit commonly used in crystal structure analy- tor map is greatly simplified if there is one or a
sis is a relative first moment coefficient very few relatively heavy atoms in the structure
255 CRYSTALLIZATION

(with large f in Eq. (3», because these will be Data compilations and treatments of tech-
identified easily in the mapping, and will tend niques of crystal structure analysis are given in
to dominate the distribution of phases in Eq. a list of selected references below.
(1). This is the basis of the so-called "heavy-
atom method." Where it is applicable, it can HOWARD T. EVANS, JR.
avoid much of the labor involved with more di-
rect methods, and it is still of primary impor-
tance in protein structure analysis. References
Statistical Methods of Phase Determination 1. Bacon, G. E., "Neutron Diffraction," 3rd ed.,
D. Sayre in 1952 first noted that when large F Oxford, Clarendon Press, 1975.
terms are concerned, there is a strong tendency 2. Buerger, M. J., "Vector Space," New York, John
for Wiley & Sons, Inc., 1959.
S[F(h j ±h 2 ,k j ±k 2 ,ll ±l2») ~ 3. Buerger, M. J., "Crystal-Structure Analysis," New
York, John Wiley & Sons, Inc., 1960.
S[F(hjkjld)·S[F(h2k2l2») (9) 4. Donnay, J. D. H., and Ondik, H. M., eds., "Crystal
where S signifies +1 or - 1 according to the cen- Data," 3rd ed., 4 vols., Nat. Bur. of Stand. and
trosymmetric phase of F. This relationship, Jt. Comm. on Power Diffr. Stand., 1973-78 (com-
which depends on the required positivity of the pilation of unit cell data, references to structure
function (1), is the root of the most powerful determinations).
phase-determining procedures in modern use, 5. Ewald, P. P., and Hermann, C., eds., "Struktur-
notably the "symbolic addition procedure" bericht," vol. 1-7, Leipzig, Akademische Verlags-
evolved by Karle and Karle (see ref. 6). These gesellschaft M.B.H., 1931-43 (detailed compila-
techniques, consisting of the analysis of vector tion of crystal structure determina tions,
combinations of the strongest F terms (usually 1913-1939).
normalized to emphasize the trigonometric part 6. Karle, J., and Karle, I. L., Acta Cryst., 21, 849
of equation (3», have been extended to non- (1966) (symbolic addition procedure).
centrosymmetric cases with considerable gen- 7. Lipson, H., and Cochran, W., "The Determination
eral success, so that the phase problem may of Crystal Structures," 3rd ed., Ithaca, N.Y., Cor-
now confidently be said to be for the most part nell Univ. Press, 1966.
overcome. 8. Stout, G. H., and Jensen, L. H., "X-ray Structure
Determination," London, The Macmillan Co.,
Refinement of the Crystal Structure Once a
1968.
satisfactory model structure has been evolved, 9. Warren, B. E., "X-Ray Diffraction," Reading,
defined by a given set of parameters (which Mass., Addison-Wesley Publ. Co., 1969.
may amount to hundreds or even thousands), 10. Wilson, A. J. C., and Pearson, W. B., eds. "Struc-
the refinement of these parameters is carried ture Reports," vol. 8-47, Utrecht, N.V.A. Oos-
out by differential methods. The least squares thock's Vitgenvers Mij. 1956-1980 (continuation
analysis of the F(obs) data set (in which a func- of "Strukturbericht" (ref. 5), covering literature
tion based on Eq. (7) is minimized) is the most 1940-1976).
easily adapted to computers and is most widely 11. Woolfson, M. M. "Direct Methods in Crystallog-
used. Methods based on minimization of !::.p raphy," Oxford, Clarendon Press, 1961.
(using "difference maps" calculated by Eq. (1) 12. Wyckoff, R. W. G., "Crystal Structures," 2nd ed.,
with !::.F as coefficients instead of F) are also vol. 1-6, New York, John Wiley & Sons, Inc.
commonly used. Convergence when R ~ 0.15 is 1963-1969. (compilation of results of crystal
considered satisfactory (with standard error of structure determinations).
bond lengths ~ 0.03 A), but as the quality of
data measurement and techniques of refinement
have improved, R often nowadays reaches 0.05 Cross-references: CRYSTALLIZATION, CRYSTAL-
or lower (standard error of bond lengths LOGRAPHY, DIFFRACTION BY MATTER AND
< 0.005 A). DIFFRACTION GRATINGS, LATTICE DEFECTS,
NEUTRON DIFFRACTION, PARACRYSTALS,
Crystal Structure Data The most important
information yielded by crystal structure analy- X-RAY DIFFRACTION, X-RAYS.
sis, namely, the configuration of atoms in mole-
cules and the solid state, and the distances be-
tween the atoms, forms a vast body of critical CRYSTALLIZA TION
data on the properties of substances which is
expanding at a rapid rate. Compilations of these The forms of natural crystals have been studied
and of unit cell geometry data are published by mineralogists for many years and have been
with a lag usually of 10 years or more, so that classified by symmetry, interfacial angles, per-
reference to the journal literature is usually nec- fection of shape, and more detailed criteria.
essary to find reliable and recent data concern- These forms are often related to the molecular
ing a particular substance. Reference to original structure and growth of the crystals. The equi-
papers is also very desirable to answer the im- librium shape of a crystal is that for which the
portant question of the reliability and distribu- surface energy is a minimum. Since atomic
tion of errors in a given structure determination. planes of densest packing usually have the low-
CRYSTALLIZA TION 256

est surface energy, these planes predominate in of steps must exist. For this source F. C. Frank
the surface facets of equilibrium crystals, postulated a screw dislocation in the crystal
resulting in a correspondence between the that emerges at the crystal surface. This emer-
atomic structure and shape of the crystal. gent dislocation provides a step pinned at one
However, natural and even synthetic crystals end, so that as it propagates it winds up into
rarely have the equilibrium shape, because spiral and is always available for incorporation
for crystals larger than about 10 J.l. in dimension, of molecules. Many spirals have been observed
the differences in surface energy between faces on crystal surfaces; one is shown in Fig. 1. The
are too small to transport enough material surface nucleation and screw dislocation mech-
over the distances required. Therefore the anisms for crystal growth were definitely con-
morphology of crystals is usually determined firmed by the elegent experiments of G. W.
by the rate of crystal growth, rather than by Sears on the growth of perfect metallic fila-
the equilibrium shape. ments (whiskers), metallic platelets, and para-
A crystal is bounded by those faces whose toluidine crystals. 10
rate of growth is slowest, since fast-growing Impurity molecules can modify the mor-
faces grow out of existence. Close-packed phology and growth rate of crystals by their ef-
planes frequently grow most slowly, so even fects on the relative surface energies and
when kinetic factors control the crystal shape growth rates of different crystal faces. These
there is usually a relation between the faces of molecules can poison growth on certain planes
a crystal and its molecular structure. by adsorption at kinks in steps on these planes,
lInusual crystalline morphologies result from slowing the growth of these steps. Impurities
particular conditions. Dendritic (treelike) shapes can also change the rate of adsorption and sur-
result when crystals grow with high driving face diffusion of incorporating molecules.
force, or with rapid transfer of heat. The rate of pillbox nucleation can be increased
Since crystallization proceeds by propaga- by the lowering of surface tension by impurity
tion of the nucleus into the parent phase, the adsorption. Therefore impurities can produce
surface separating these phases is the site of different crystal habits and either faster or
incorporation of molecules into the crystal. slower growth rates.
Thus the structure of this interface between Crystal growth from dilute solution is similar
phases is critical in determining the mechanism to growth from the vapor, but impurity ef-
of growth. A crystal-vapor interface becomes fects are more marked and morphologies more
molecularly "rough" above a critical tempera- varied. The growth rate is often controlled by
ture, so that molecules can be incorporated the rate of diffusion of solute up to the crystal
anywhere on it. For low-index faces this criti- surface rather than interfacial processes, es-
cal temperature is close to the melting tempera- pecially for crystals larger than a few microns
ture, and at lower temperatures the surface is in size. Convection in the liquid solution can
molecularly smooth at equilibrium. If there is change crystal morphologies and growth rates
a monomolecular step on the surface, incorpora- and make compositional and size control more
tion of molecules into the crystal will occur difficult. 11
preferentially at the step. Such a step con- The rate of crystallization from a pure liquid,
tains kinks or jogs, which are the final sites often called solidification or freezing, is usually
for incorporation. Thus the progress of a controlled by the rate at which the heat of
molecule from the vapor into the crystal is: fusion is removed from the interface. Only for
(I) transport through the vapor to the crystal slowly growing crystals in viscous melts do
surface, (2) adsorption onto the crystal sur- interface processes control the rate of crystal-
face, (3) movement on the surface to a step lization. Nevertheless the structure of the
(surface diffusion), (4) adsorption onto a step, liquid-solid interface can influence crystal
(5) transport along the step to a kink, and morphologies. Many solid-liquid interfaces are
(6) incorporation at the kink. Steps (2), (4), "rough," but others appear to be similar to
and (6) can involve reorientation and desolva- vapor-crystal and solution-crystal interfaces, as
tion of the molecules. The rates at which evidenced by growth facets in such materials as
crystals grow can be controlled by anyone or silicon, germanium, P2 Os, and many organics.
several of these steps. In growth from the Growth in concentrated solutions (e.g., metallic
vapor it is usually step 3 involving surface dif- alloys) is influenced by convective instabilities
fusion that controls the rate of growth of the in the melt as well as instabilities of the inter-
crystal. face (dendritic growth).
If the crystal surface is molecularly perfect, Crystal growth and liquid-crystal interfaces
a pillbox of material must be nucleated on it are being simulated in computers.12
to create a step, which then grows to another To make crystals for laboratory and industrial
perfect surface. Under these circumstances, use a great variety of techniques have been used.
continued surface nucleation is required, and Growth by either condensation or chemical
growth occurs only below a certain under- reaction from the vapor phase can give crystals
cooling. However, experimentally, crystals with high purity and special structures and
often grow at much smaller undercoolings than form. For large-scale industrial use, this method
this calculated one, so that a continuous source is too costly, although it is valuable for certain
257 CRYSTALLIZA nON

FIG. I. Growth spiral on a paraffin crystal, observed by C. M. Heck.


From Doremus, Roberts and Turnbull "Growth and Perfection of
Crystals," by permission of John Wiley & Sons, Inc., New York.4

special applications. Luminescent crystals of through a temperature gradient in a furnace,


zinc and cadmium sulfides are grown from so that crystallization starts at one point in
vapor for industrial use. Metallic crystals with the melt and propagates through it relatively
few impurities and defects, and in the special slowly. In the Vernueil method, powder is
forms of thin films or whiskers, can readily added to the molten surface of a crystal so that
be grown from vapor. Other crystals made in a crucible of other material is not needed. This
this way are silicon, germanium, iodine, sele- method is used for materials with high melting
nium, phosphorus, and a variety of organic temperatures, such as alumina, spinels, rutile,
crystals. The study of the growth of ice crystals mullite, ferrites, and yttrium-iron garnet. A
from water vapor has special importance in new method of growing sapphire crystals
meteorology.13 from the melt, called edge-defined film-fed
The most common method of growing me- growth, has been announced by Tyco Labora-
tallic and semiconducting crystals is by solidi- tories. The crystal is pulled rapidly from a die,
fication of their melts. Special techniques to which molten alumina is transported as a
have been developed to grow single crystals film. Solid crystals are often purified by zone
of these materials and many others. In the melting, in which a molten zone is moved
Czochralski method, a seed crystal is touched through the crystal. Segregation of impurities
to the melt, and the crystal is "pulled" from it into the melt purifies the crystal.
by slowly withdrawing the seed. In the Bridg- Precipitation from liquid solution is a com-
man technique, the melt is slowly moved mon method of growing crystals. Ionic salts
CRYSTALLIZATION 258

are grown from aqueous solutions both in- State Physics," Vol. 25, New York, Academic
dustrially and in the laboratory. Sugar is Press, 1970, p. 152.
crystallized from water solution. Other organic
crystals, including polymers, are grown from a Research Papers
variety of solvents. Quartz crystals are grown 10. Sears, G. W., J. Chern. Phys. 24, 868 (1956)
from aqueous solution at elevated temperatures and other references by the same author.
and pressures (hydrothermal growth). Various 11. Pimputkar, S. M., and Ostrach, S., J. Cryst.
crystals have been grown from more exotic Growth 55, 614 (1981). Convective effects in
solvents, for example: garnets, titanates, and crystals grown from the melt.
ferrites from molten salts (fluxes); tin, iron, 12. Gilmer, G. H., "Computer Models of Crystal
and phosphorus from mercury; and diamond Growth," Science 208.355 (1980).
from a molten metal under pressure. 13. Kuroda, T., and La em ann, R., "Growth Kinetics
Crystallization from the solid phase is also of Ice from the Vapor Phase and its Growth
possible. Growth of grain size in a single-phase Forms," J. Cryst. Growth 56, 189 (1982).
solid, called recrystallization, is often used to
improve the properties of poly crystalline ma- Cross-references: CONDENSATION, CRYSTAL
terials, particularly metals. Crystalline com- STRUCTURE ANALYSIS, CRYSTALLOGRAPHY,
pounds can be made from high-melting-point LATTICE DEFECTS, PARACRYSTALS, VAPOR
materials by pressing together mixtures of their PRESSURE AND EVAPORATION.
powders and diffusing them together at high
temperature (sintering). Crystals can be grown
from a solid solution. This type of precipita-
tion is frequently used to improve the properties CRYSTALLOGRAPHY
of metals, for example, to harden them.
Crystallography is the science of the geometric
R. H. DOREMUS properties of matter in the ordered solid state.
When atoms or molecules condense into a solid
References phase from a liquid or gaseous phase, the lowest
energy state is achieved if they become ar-
1. Holden, A., and Singer, P., "Crystals and Crystal ranged in as regular a way as possible, usually
Growing," Garden City, New York, Doubleday by forming a small basic unit of structure which
and Co., 1960. A simple, nonmathematical dis- is repeated indefinitely in three dimensions
cussion of crystal growth and structure. throughout the solid to form a crystal. The geo-
2. Strickland-Constable, R. F., "Kinetics and Mech- metric properties of this unit and its manner of
anism of Crystallization," London and New York, regular repetition are highly characteristic of
Academic Press, 1968. Emphasis on nucleation, the substance in question, and constitute an
and crystallization from the vapor. exceedingly useful subject of study in connec-
3. Laudise, R. A., "The Growth of Single Crystals," tion with any field of science involving the solid
Englewood Cliffs, N.J., Prentice-Hall, 1970. state. Occasionally, no extended, regular repeti-
A guide for persons wanting to grow crystals, tion of structure is present in the solid phase,
with accent on practical methods together with but this glassy state has many properties of a
some background material. liquid and lies outside the realm of crystallog-
raphy. In other cases, extended order may oc-
Papers from Symposia cur in one direction only (as in fibres) or in two
4. Doremus, R. H., Roberts, B. W., and Turnbull, directions (as in some clays), but by far the
D. (Eds.), "Growth and Perfection of Crystals," most common condition of the solid state is
New York, John Wiley & Sons, 1958. full three-dimensional order, and it is with this
5. Frank, F. C., Mullin, J. B., and Peiser, H. S. type of order that crystallography is primarily
(Eds.), "Crystal Growth in 1968," J. Crystal concerned.
Growth , 3 and 4 (1968). The familiar outward manifestation of the
6. Laudise, R. A., Mullin, J. B., and Mutaftschiev, three-dimensional order of the atomic structure
B., "Crystal Growth 1971," Amsterdam, North- of the solid is the polyhedral shape commonly
Holland, 1972. exhibited by crystals. These remarkable shapes
were admired for centuries (see, for example,
Special Techniques Albrecht Durer's engraving "Melancholia,"
7. Bockris, J. 0., and Razumney, G. A., "Funda- 1514), but the underlying principle governing
mental Aspects of Electrocrystallization," New them was first discovered by Steno in 1669 .
York, Plenum Press, 1967. This principle is expressed as the Law of the
8. Henisch, H. K., "Crystal Growth in Gels," Uni- Constancy of Interfacial Angles, according to
versity Park, PA, The Pennsylvania State Univ. which the dihedral angles between the faces of
Press, 1970. all crystals of a given substance remain un-
changed regardless of how the relative sizes and
Review Article shapes of the faces may vary. Rene Just Hauy
9. Parker, R. 1., "Crystal Growth Mechanisms: in the late eighteenth century was the first to
Energetics, Kinetics, and Transport," in "Solid present a systematic account of the character-
259 CRYSTALLOGRAPHY

ization of substances by the measurement of me try operations must also apply to the entire
interfacial angles, that is, crystallography, and crystal. Thus, the morphology of the crystal
thus establish it as a science. Haiiy was a min- and all its other properties must obey this sym-
eralogist, and through his influence crystallog- metry. The detection and definition of the sym-
raphy was subsequently developed and applied metry also serve to characterize the substance
by workers mainly in mineralogy, and to a small and are equally as important in crystallography
extent in chemistry. The link between external as the measurement of the lattice parameters.
form and internal structure was dramatically The way in which symmetry operations can
completed by M. von Laue's discovery of x-ray interact consistently with each other is strictly
diffraction in 1912, and from then on crystal- limited by the geometry of coincidence and can
lography was rapidly developed and advanced be rigorously analyzed by the mathematical
in physics laboratories, and later more and more methods of group theory, both as to what sym-
in chemistry. metry operations are possible and how they
A crystal mayor may not exhibit external may be combined. The problem is usually ap-
faces, but if it does, these may be studied in proached by constraining all symmetry opera-
terms of their distribution and development, tions to pass through a single point in space, but
which constitutes the morphology of the crys- special restrictions are introduced by the re-
tal, by special techniques of crystallo me try , quirement that this point must be consistent
usually making use of an instrument that re- with any point in the crystal lattice, that is, the
flects beams of light from the crystal faces into symmetry groups must be consistent with the
a telescope, the two-circle goniometer. If the translational operations of the lattice. An im-
crystal has no faces, its internal geometric prop- portant symmetry operation is the axis of rota-
erties may be studied by its interaction with tion by which any motif is reproduced by a
radiation, by the methods of optical crystallog- rotation around an axis of 360/n degrees, where
raphy if refraction of infrared, visible or ultra- n is the order of the axis; n successive opera-
violet light is involved, or by x-ray crystallog- tions then superimpose the object on itself. In
raphy if diffraction of x-rays (also neutrons or crystals, because of the requirements of the lat-
electrons) is studied. tice, n can only have the values 1, 2, 3,4 and 6.
The crystal can be defined completely (except A 5-fold axis is not possible, for example, for
for chance irregularities and defects) in terms the same reason that it is not possible to fit reg-
of the arrangement of the atoms within a finite ular pentagons into a regular two-dimensional
unit of volume called the unit cell (whose size is pattern which will fill all space. Further, there
usually of the order of loA on an edge) and the are only 11 ways to combine the 5 axes to-
way this unit is repeated in three dimensions to gether at a point; these are called the 11 axial
fill up the volume of the crystal. The shape and point groups. These form a convenient basis for
dimensions of the unit cell provide parameters classifying all crystals into 6 crystal systems,
characteristic of the substance and constitute which, while not strictly rational in their defini-
the first primary geometrical property of crys- tion, provide a fundamental link between the
tals. The most general unit cell (triclinic case) is symmetry of the crystal and its dimensional
a parallelopiped which can be defined by six properties. Reference axes are generally chosen
constants, three edge lengths (a, b and c) and parallel to lattice translation directions, of
three interedge angles (Ct, ~ and r). This unit cell course, but further, they are customarily taken
is repeated by translation, a shift along each of parallel to prominent symmetry axes. Four of
the cell edges by an integral number of edge the axial point groups, for example, have a
lengths. If the unit cell and its contents are rep- single 3-fold axis or 6-fold axis, with or without
resented by a point in space, the crystal consists a number of 2-fold axes at right angles to them.
of a regular array of such points called a lattice, In all these, the reference c axis is customarily
in which each point is related to every other by set parallel to the unique 3- or 6-fold axis, and
an integral number of vectorial translations cor- the other two axes ai, and a2, which are equiv-
responding to the unit cell edges. The lattice alent by symmetry, are taken normai to the c
should be distinguished from the crystal struc- axis along lattice directions 1200 apart, coinci-
ture, which refers to the arrangement of atoms dent with the 2-fold axes if present. These
within the unit cell, although the term "lattice" groups are all included in the hexagonal system.
is sometimes loosely used in reference to the The axial symmetry operations are operations
structure. of the first kind, that is, they reproduce left-
The atoms within the unit cell may be related hand motifs as left-hand motifs. Other sym-
to each other by a number of geometric oper- metry elements of the second kind, that is,
ations called symmetry, and this phenomenon which reproduce left-hand motifs as right-hand
constitutes the second chief geometric property motifs, are the center of symmetry and the mir-
of crystals. The unit cell must embrace all of ror plane of symmetry. When these operations
the different types of atoms related by sym- are added to the axial groups, 32 point groups
metry that are not related by simple translation. are produced.
On the other hand, the symmetry operations Referring to the lattice, the introduction of
which apply to one unit cell are also operated symmetry gives rise to a number of lattice
on by the lattice translations, so that these sym- groups in which various rational relationships
CRYSTALLOGRAPHY 260

TABLE 1. THE SIX CRYSTAL SYSTEMS

Number of Symmetry Groups


Independent Axial Axial Point Bravais Space
System Lattice Parameters Relationships Point Groups Groups Lattices Groups

Triclinic a, b, c, ex, (3, 'Y none 1 2 1 2


Monoclinic a, b, c, (3 ex = 'Y = 90° 1 3 2 13
Orthorhombic a, b, c ex = (3 = 'Y = 90° 1 3 4 59
Tetragonal a,c b=a
ex = (3 = 'Y = 90° 2 7 2 68
Hexagonal a,c b=a
ex = (3 = 90° 4 12 2 52
'Y = 120°
Cubic a b=c=a
(isometric) ex= (3 = 'Y =90° 2 5 3 36

exist between the lattice parameters. One im- where h, k, and I may now be continuously vari-
portant result of this interaction is the appear- able (nonintegral). The potential f(x, y, z) at
ance of centered lattices, in which the lattice any point x, y, z in direct space (e.g., electron
unit cell chosen according to the rules used to density) is thus a synthesis of all the reciprocal
set up the 6 crystal systems contains additional potential g(h, k, l) (e.g., x-ray scattering) in re-
lattice points on body or face diagonals. There ciprocal space (and vice versa). In a crystal, the
are 14 such "Bravais lattices." The symmetry reciprocal potential is sampled at the reciprocal
groups so far mentioned can in favorable cir- lattice points (vanishing for nonintegral h, k, l)
cumstances all be detected from the external and the above integral becomes a triple summa-
morphology of the crystal. tion. The concept is used also in the interpreta-
When the combinations of lattice translations tion of electron momentum space in a crystal,
and symmetry operations, that is, the symmetry in which the Brillouin zones are defined by the
properties of the crystal structure, are analyzed, reciprocal lattice (see FERMI SURFACE).
new symmetry operations are evolved (screw
axes and glide planes) and each of the point HOWARD T. EVANS, JR.
groups contains many such combinations, add-
ing up to a total of 230 space groups. These are
detected by diffraction methods. Table 1 sum- References
marizes the relations between the crystal sys-
tems and the symmetry groups. deJong, W. F., "General Crystallography," San Fran-
A fundamental concept of great importance cisco, W. H. Freeman and Co., 1959.
to all aspects of crystallography is that of the Phillips, F. c., "An introduction to Crystallography,"
reciprocal lattice. If the unit vectors of the di- New York, Longmans, Green and Co., 1946.
rect lattice are a, b, and c, then a reciprocallat- Terpstra, P., and Codd, L. W., "Crystallometry," New
tice exists whose unit vectors are a* = b X c, York, Academic Press, 1961.
b* = c X a, c* = a X b, with lengths given by Buerger, M. J., "Elementary Crystallography," New
a* = (ltv) bc sin 0:, b* = (lIV) ca sin {3, c* = York, John Wiley & Sons, Inc., 1956.
(l IV) ab sin A; the reciprocal unit cell volume Hilton, H., "Mathematical Crystallography and the
V* = IIV. Because the Bragg diffraction angle Theory of Groups of Movements," Oxford, Claren-
28 is a function of the reciprocal of spacings in don Press, 1903; New York, Dover Publications, Inc.,
the direct lattice (2 sin 8 = AId), the various or- 1963.
ders of diffraction are associated with nodes in
the reciprocal lattice. These nodes are desig- Cross-references: CRYSTALLIZATION, D)FFRAC-
nated by their integral coordinates in the lattice TION BY MATTER AND DIFFRACTION GRAT-
h, k, and I, known as Miller indices. Each node INGS, CRYSTAL STRUCTURE ANALYSIS, LAT-
in the reciprocal lattice corresponds to a set of TICE DEFECTS, X-RAY DIFFRACTION.
net planes in the direct lattice, and vice versa;
the duality is complete. Crystal faces, which are
parallel to the direct lattice net planes are there- CURRENT ALGEBRA
fore designated by Miller indices. This duality
of direct and reciprocal space is exactly parallel Current algebra is a study of hadronic matter,
to that represented by the Fourier transform: (i.e., strongly interacting particles like protons,
neutrons, and 7T mesons), through their electro-
magnetic and weak interaction properties. This
f(x,y, z) =ffJg(h, k, l)e- 2rri (hx+ky+lz) dxdydz study leads to a degree of unification of the
three different forms of elementary particle
261 CURRENT ALGEBRA

interactions: strong, weak, and electromagnetic. and


The electromagnetic four current density
J e.n1.f.l(X) == (p(x), l(X», with land p respec- J± = cos 0 (F l ± i Fz ) + sin 0 (F4 ± i Fs)
tively the densities of electric current and -cosO (F1S ±i F2 S)-sinO (F4 S ± iFsS). (5)
charge, is long familiar from Maxwell's equa-
tions. The conservation of this current, of Here 0 is a parameter called the Cabibbo angle,
course, leads to charge conservation in nuclear and experimentally 0 ~ 15°.
reactions. Mathematically this is expressed by Gell-Mann suggested that the current algebra
be used in the same way as the familiar quan-
saying Q = !d 3 XP(X) is a time-independent tum condition, i.e., Iq, p] = i1!, to derive sum
rules as in atomic physics. His proposal re-
constant. In 1958 it was established that the mained dormant until use was made of another
weak decays of hadrons are also described by concept called the partial conservation of axial
currents. This led to the highly successful (V -A) current hypothesis (PCAC). This postulate con-
theory of weak interactions, proposed by Mar- nects the pseudoscalar pion field to the diver-
shak and Sudarshan and independently by gence of the axial current. This hypothesis was
Feynman and Gell-Mann. (Here V is a vector used to obtain a value for the weak decay of
current and A is an axial vector current.) Soon the 1T meson from the known strong interaction
after the successful proposal of SU(3) sym- pion-nucleon coupling constant (Goldberger-
metry for classification of hadrons (suggested Treiman relation) and is known to be accurate
by Gell-Mann and Ne'eman in 1961) Gell-Mann to about 10 percent.
laid down the foundations of current algebra. Among the successful applications of current
In attempting to interpret in precise terms the algebra are prediction of weak decay rates of
notion of symmetry violated by the strong in- hyperons [e.g., L- -* n + e- + iI]. Such decays
teractions, Gell-Mann suggested that SU(3) sym- involve vector and axial parts; the vector parts
metry operators be identified with charges as- are predicted to be just Clebsch-Gordon coeffi-
sociated with the weak and electromagnetic cients of the group SU(3), while the axial parts
currents. In particular he emphasized the equal are also predicted in terms of two unknowns
time commutation rules of these charges, and (f and d type couplings). As for the meson de-
suggested that they may remain unchanged cay rates, the K -+ 1T e v decay is predicted ac-
even in the presence of SU(3) breaking inter- curately, although in meson systems symmetry-
actions, showing how this leads to a precise no- breaking corrections have to be made in general
tion of universality of weak and electromag- before predictions can be compared.
netic interactions. Application of current algebra leads to sum
In the SU(3) scheme, one can define eight rules between different observables, as was the
vector currents, with the charges corresponding case in atomic physics. To illustrate the use of
to these operators being the generators of the commutators in atomic physics, we give a deri-
group. These charges obey the equal time com- vation of the Thomas-Reiche-Kuhn sum rule.
mutator algebra. One starts with the fundamental quantum-
mechanical assumption
i,j,k=I···g (I)
I€·
-> ->
qi, €. Pi]
-> ->
= zlluij
. " (6)
where fijk are antisymmetric structure con-
stants of the group SU(3). One may also define where i and j label different electrons and € is
an octet of axial currents, and the charges asso- an arbitrary unit vector. Assuming that forces
ciated with these currents, Fi s , obey the algebra are velocity-independent, we may write

i,j,k=I···g . (2) Pi = im [H, qil n (7)


Using Eqs. (l) and (2) we may take the expec-
Gell-Mann further postulated that axial gener- tation value between the ground state 10) of an
ators among themselves obey a similar algebra, atom with Z electrons.
i, j, k =I ... 8. (3) L(OI [t'ilI, IH,€.qjllI0)=~1i2 (8)
ij
Such an algebra leads to an abstract group
SU(3) X SU(3). Although such a group is in Now inserting a complete set of intermediate
fact not a symmetry group of particle. states, states one gets
nevertheless the generators at equal time are as-
sumed to obey the commutator algebra. Thus Z Z1i2
the axial charges are not constants of motion,
nor are all the vector charges constants of mo-
L 1<01 L €. qi In >12(En - Eo) = 2m (9)
n i=l
tion. The charges that playa role in electromag-
netic and weak interactions are respectively The usefulness of the sum rule arises from the
fact that the left-hand side of the equation in-
(4) volves matrix elements of dipole operators that
CURRENT ALGEBRA 262

are observable in the transition spectrum of an In recent years tremendous progress has been
atom. Very similar methods can be used with made in understanding the strong interactions.
current algebra to obtain sum rules. Quarks have emerged as the basic constituents
A sum rule obtained by Adler and Weisberger of hadrons, and their interactions are described
for the axial vector coupling in (3-decay is by a gauge theory called quantum chromody-
among the triumphs of the axial-charge algebra. namics. In such a theory current algebra is
Use was made of the commutation relation exact. The great utility of current algebra arises
from the smalleness of quark masses, especially
[FIS + iF2s, Fis - iF2S] = 2 F3 (10) the "up" and "down" quarks. The notion of
PCAC can then be made very precise. Thus all
to derive the basic hypotheses of current algebra, which
were arrived at mostly through intuition, are
GA] 2 Frr
1 = [G +-
JOO dv~ [Orr-p (v)
now on very firm ground. The problem of how
2 quarks form hadrons has not yet been solved
V 1T Il V in quantitative detail. Current algebra continues
to provide a strong tool for the study of ha-
- Orr+p (V)] (11) dronic properties.
N. G. DESHPANDE
Here GA is the axial vector coupling, Gv is the
vector coupling, F rr is the pion decay constant,
v is the laboratory pion energy, and a is the References
pion-nucleon total cross~ection. GA calculated 1. Adler, S. L., and Dashen, R. F., "Current Algebras
from this relation is in good agreement with and Applications to Particle Physics," New York,
experiment. Other results include: Adler's neu- Benjamin, 1968.
trino sum rule, which relates neutrino to anti- 2. Bernstein, J., "Elementary Particles and Their Cur-
neutrino scattering on protons at high energy; rents," San Francisco, W. H. Freeman, 1968.
Cabibbo-Radicati sum rule, which relates anom- 3. Marciano, W., and Pagels, H., "Quantum Chromo-
alous magnetic moments of nucleons to cross- dynamics," Physics Reports 36C(3), 137-276
sections for absorption of isovector photons on (1978).
nucleons; and photoproduction sum rules. These
sum rules, wherever tested, are in good agree- Cross-references: CONSERVATION LAWS AND
ment with experiment. SYMMETRY, ELECTROMAGNETIC THEORY,
Another class of applications leads to deriva- ELECTROWEAK THEORY, ELEMENTARY PAR-
tion of low-energy theorems. These theorems TICLES, GAUGE THEORIES, GRAND UNIFIED
especially relate to processes where pions are THEORY, PARITY, QUANTUM CHROMODYNAM-
emitted. It was shown that the Adler-Weisberger ICS, QUANTUM THEORY, QUARKS, STRONG
sum rule could be obtained as a low-energy INTERACTIONS, WEAK INTERACTIONS.
theorem relating (GAIG v ) to s-wave pion-nu-
cleon scattering lengths. Other theorems deal
with leptonic decays of K-mesons. A theorem
due to Callan and Treiman for the decay CYBERNETICS
K ~ 1T e v states that the two form factors that
enter this decay are related to the K and 1T de- Cybernetics was defined by Norbert Wiener in
cay constants. 1948 1 as "control and communication in the
animal and the machine." Communication
f+(mK 2 )+f- (mK2)=FKIFrr (12) clearly implies the communication of informa-
tion; and purposeful control requires informa-
Other theorems due to Weinberg relate form tion about the current position and about the
factors in the K ~ 1T 1T e v decay and obtain end being sought. Thus cybernetics has a strong
1T 1T ~ 1T 1T scattering lengths. Low-energy the- "information" connotation and may be equated
orems that combine PCAC and current algebra to "information handling." Note that it is in-
are not tested as well by experiment and some formation handling, rather than communica-
corrections may have to be made. tion, for two reasons: first, the information
Other applications of the current algebra lead may be used to modify the system (rather than
to a value of the electromagnetic mass differ- merely passing through it), and second, it may
ence of 1T+ - 1To mesons. Current algebra has be necessary to marshal information from
been further extended by assuming that the various sources and process it (e.g., by a com-
currents are proportional to vector fields, like puter) before using it to effect control to a
p, w, K* mesons. This hypothesis, called the desired end. Wiener's definition can then be
current-field identity, has led to predictions for rewritten as "the handling of information in
a large number of strong interaction decay rates animals and machines in order to achieve de-
of mesons. Such applications, however, involve sired effects." This includes the idea of "goal-
a large number of free parameters. Neverthe- seeking" systems which adapt themselves, to
less this enables calculations to be made where a greater or less extent, to the requirement of
no method existed before. moving from the current situation to the de-
263 CYBERNETICS

sired goal. The least extent of this is found in signal-to-noise ratio and the theorem that any
the constant-parameter closed-loop system, the waveform can be represented by a number of
design of which may nonetheless involve con- discrete signals equal to two per cycle of the
siderable sophistication in order to achieve maximum Fourier frequency (the sampling
optimum performance (see FEEDBACK, SERVO- theorem*).
MECHANISM). A further component in the idea Spectrum of a Stochastic Process The fre-
of cybernetics is that the system in question quency spectrum corresponding to a specified
should be stochastic, i.e., that it should show time function can be found by Fourier trans-
random fluctuations, usually about a defined form, but this is not applicable to a stochastic
average trend. Wiener was concerned with this time function of which the amplitude at any
aspect and contributed to the theories both of given time can only be stated as a probability.
the frequency spectra of stochastic time series In that case one proceeds via the autocorrela-
and of the filtering of stochastic signals in order tion function RXX(tI, t2) which is defined 4
to get the best estimate of the underlying as the expectation (in the statistical sense)
trend. The three elements in cybernetics, infor- that the variable x has the values XI, X2 at
mation, spectra of stochastic processes, and times tl, t2' If the stochastic function is sta-
filtering of stochastic signals, can be summarized tionary one can replace tl and t2 by the dif-
as follows. ference r = t2 - tl and average over all times:

IT
Information Control systems respond to sig-
nals which carry information. For example,
when a call is made for an elevator car in a Rxx(r) = lim - I f(t) f(t - r) dt.
modern multiple installation, the computer- T-+OO 2T -T
like controller takes account of all calls which
have not yet been answered and the relative Having determined the autocorrelation function
positions and directions of travel of all cars for all values of r one obtains the power (squared
before deciding which car shall answer each amplitude) spectrum of frequencies via the
call. This is an example of information handling Wiener-Khintchine transform,

f
rather than communication. Another example
is that in the CAD/CAM (Computer-Aided
Design and Computer-Aided Manufacture) W(f) = 00 Rxx(r) cos 2rrfr dr
organization of an engineering plant, informa- -00

tion fed to a computer during design of com-


ponents may be used to control the machine Wiener Filters The classic idea of filtering
tools which make the components. An objective is to separate frequency components, but
definition of information, independent of the Wiener posed a more fundamental question:
observer and of the medium in which the in- Given the past history of a stochastic time
formation exists, was provided by Shannon in series, how can one make a least-squared-error
his mathematical theory of communication. 2 estimate of its value at any time? An exact
(Information is intimately related to communi- answer requires the solution of an integral
cation, first because it is what is communicated equation, which is not always feasible. More-
and second because it can be observed only over, if "any time" includes the future, this
when it is communicated to the observer.) implies prediction of the future, which is not
In human terms the information possessed consistent with a physically realizable filter.
by an individual may be thought of as a One method of avoiding prediction is to intro-
weighted sum of probabilities. (many of which duce delay, leading to the infinite-lag filter.
may be practical certainties). This accords By shifting to the frequency domain it is pos-
with Shannon's measure of information in sible to design a filter of characteristic
physical communication systems, although this
measure was derived from entirely different and W(f)
specific considerations. Shannon's measure of YCf) = W(f) + N(f) e- j (j 21rf
the information conveyed by each unit signal
is numerically equal to the negative of the where W(f) is the power spectrum of the signal
entropy H, where (the underlying trend) and N(f) that of the noise
(the superimposed fluctuation). The exponen-
n tial term represents a delay which tends to
H =- L Pi log Pi· infinity as the filter is elaborated to make Y(f)
i= 1 exact, and this form of filter is physically realiz-
able. In order to obtain a physically realizable
Here, Pi is the probability of the ith state of the filter without delay (the zero-lag filter) one
receiver after receiving the signal, e.g., the print- must return to the time domain and factor the
ing of the ith character out of a set of n in a
teletype system. This assumes that there is a
finite number n of possible states. See Shannon's *Shannon assumed a low-pass frequency character-
paper for a derivation of the number of possible istic, but bandpass characteristics have subsequently
states of a communication channel of given been treated. 3
CYBERNETICS 264

filter function into two components, having under the heading of economic cybennetics
zero value for t < 0 and t > 0, respectively. while others fall within biocybernetics. 15 The
The former corresponds to a physically realiz- whole field of cybernetics is reviewed from time
able filter since it has no output before its to time in a "progress" volume. 16
input. This factor can be translated into the
frequency domain to combine with a function D. A. BELL
of the signal and noise spectra to specify the
optimum zero-lag filter. This is inevitably less References
effective than the infinite-lag filter, but delay
cannot be tolerated in closed-loop systems.* f 1. Wiener, N., "Cybernetics," New York, Wiley,
Adaptive and optimizing control can be ap- 1948.
plied to chemical process plant to maintain 2. Shannon, C. E., "A Mathematical Theory of Com-
specified conditions of temperature, pressure, munication," Bell Syst. Tech. J. 27,379-423 and
flow etc., in spite of changes in external circum- 623-656 (1948).
stances which may include changes in the char- 3. Linden, D. A., "A Discussion of Sampling Theo-
acteristics of input materials. One can go fur- rems," Proc.I.R.E., 47, 1219-1266 (1959).
ther and, for example, make a fractional 4. Korn, G. A., and Korn, T. M., "Mathematical
distillation plant optimize the output of any de- Handbook for Scientists and Engineers," New
sired fraction. York, McGraw Hill, 1961.
The first generation of industrial robots were 5. Anon., "Robots Feel their Way into Tight Situ a-
not adaptive but merely followed a fixed cycle tions," New Scientist 87, 5.91 (21st August 1980).
of operations, which was programmed by 6. Marsh, P., "Robots See the Light," New Scientist
"leading" the robot through the cycle which it 86,238-240 (12th June 1980).
could then repeat exactly as often as required. 7. Belenkov, V. D., Gusov, S. V., Zotov, Yu. K.,
But present development is concerned with Rushanskiy, V. I., Timofeyev, A. V., Frolov,
achieving adaptive behavior by adding sensors V. M., and Yakubovich, V. A., "Adaptive System
to the robot, e.g., tactile sensors 5 or solid-state for Control of Autonomous Mobile Robot,"
television cameras to provide vision. 6 For Eng. Cyb. (U.S.A.) pp. 37-45, translation of
example, an arc welder may be made to follow Tekh. Kibern. (U.S.S.R.) no. 6, pp. 52-63 (Nov.-
the joint to be welded, in spite of variations Dec. 1978).
in positions of component parts; or a mobile 8. Walter, W. G., "An Electromechanical 'Animal,'''
robot can avoid obstacles. 7 Discovery 11,90 (March 1950).
The application of cybernetics to robots is 9. Various authors, "Cybernetics and General
very close to artificial intelligence, i.e., making Systems-A Unitary Science?" Kybernetes 8,
machines behave as though they were intelli- 7-15; 17-23; 25-32; 33-37; 39-43; and 45-49
gent living creatures. This was first demon- (1979).
strated in the form of "mechanical animals,,,g 10. Landa, L. N., "Cybernetic Methods in Education,"
but now includes such things as chess-playing Educ. Technology 17, 7-13 (1977).
computers. 11. Berlin, V. N., and Weiss, R. G., "The Role of
The recent tendency is to apply the term Evaluation Systems in the Government Policy
"cybernetic" to more abstract systems which and Programme Change," Proc. Int. Conf. on
exhibit feedback plus adaptation and often Cybernetics & Society, Washington D.C. 19-21
some degree of stochastic behavior. There is Sept. 1977, New York, IEEE, 1977.
then a relationship between "cybernetics" and 12. Sukhanov, O. A. and Kristov, Kh. K. "Synthesis
"general systems theory" which is discussed of Stable Cybernetic Models for Investigating
in a group of papers.9 One application of this the Dynamics of Electrical Systems," Power
type is to education,10 where there may be Engineering (U.S.A.) IS, 20-25 (1977), transla-
various forms of feedback from taught to tion of Izv. Akad. Nauk SSSR Energy and Transp.
teachers, and a random element is introduced (USSR) 15,22-27 (1977).
by the varied abilities and motivation of stu- 13. Stepanenko, V. E., "Group Identification of Sub-
dents. Other applications have been to govern- stances based on Cybernetic Models," J. Anal.
ment policy, 1 power systems,12 chemical Chem. 35,404 (1980).
analysis,13 and the development and manu- 14. Garte, D. "Cybernetic Model for the Process of
facture of silicon chips.14 The point in the last Product Development of Solid-state Circuits
is that there are two coupled loops, for process considering the Semiconductor Process Develop-
and for product development, and only the use ment," Nachrichten Tech. Elektron. 30,312-317
of an adaptive controller overall can ensure (1980).
stability. Some of these topics are often grouped 15. Nalecz, M., "Some Problems in Modern Biocy-
bernetics and Biomedical Engineering," Proc.
*Those who want to explore the mathematics in 2nd. Int. Symposium on the Theory and Practice
detail are referred to Wiener's book, "The Extrapola- of Robots and Manipulators, Warsaw 14-17
tion, Interpolation and Smoothing of Stationary Sept. 1976, Amsterdam, Elsevier, 1977.
Time Series," New York, Wiley, 1949. It is advisable 16. "Progress in Cybernetics and Systems Research,"
to start with a paper by Levinson which is reprinted edited by R. Trappl and others. Vols. 1-5, pub-
as Appendix C in the book. lished by Halsted Press, New York; vol. 6 and
265 CYCLOTRON

subsequent by Hemisphere Publications, Washing-


ton, D.C.; vol. 9,1981.
Introductory and General Books.
Rothman, Milton A. "The Cybernetic Revolution:
Thought and Control in Man and Machine," New
York, Franklin Watts, 1972.
George, F. H., "Foundations of Cybernetics," London,
Gordon & Breach, 1977.
Pask, Gordon, "Cybernetics of Human Learning and
Performance" New York, Crane-Russak, 1975.
Trappl, Robert (Ed.), "Cybernetics: A Source Book,"
Washington, D.C., Hemisphere Publications, 1982.

Cross-references: BIONICS, FEEDBACK, COM-


PUTERS.

FIG. 1

CYCLOTRON * been inspired by Rutherford's success in disin-


tegrating atoms with alpha particles from nat-
The cyclotron is an accelerator of ions widely ural sources to seek a means of producing a
used to study the nucleus, to produce radio- controlled beam of high energy particles. The
active substances, and to stud~ the interactions practicability of the idea was demonstrated,
of ionizing radiation with livmg systems and and most of the essential features of the Cyclo-
with inert matter. It is equally important as the tron were developed by Lawrence, M. Stanley
first of a class-Magnetic Resonance Accelera- Livingston, N. E. Edlefsen, and others during
tors-which includes the various kinds of syn- the next few years.
chrotron (see SYNCHROTRON) as well as syn- Figure 1 is a schematic diagram showing the
chrocyclotrons and sector focused cyclotrons. principle components of a cyclotron. ~he dees
The essential feature of this type of accelerator are two hollow semicircular electrodes m a vac-
is that acceleration of charged particles to high uum tank located between the poles of an elec-
~nergies is achieved by. a successiv~ applicati?n
tromagnet which provides an approximately
of small accelerations m synchromzatlon wIth uniform magnetic field over the entire region.
the rotational period of the particles in a mag- The dees are part of an electrical resonant cir-
netic field. The condition for synchronization is cuit which may be excited by an oscillator
simple and can be derived as follows: A charged whose frequency is adjusted to the rotational
particle moving p~rpe!ldicul:rrly to t.he line.s of frequency given by Eq. (1). Ions are produced
force in a magnetic fIeld will descnbe a cucle
which is defined by the equilibrium between by an electric discharge in a source located at
the Lorentz force FQ = eBv and the centrifugal the center. They are drawn from the source and
accelerated into a dee while it is negative, they
force Fc = mv 2lr. Equating these, one may follow a semicircular path in the (electrostatic)
solve for the rotational frequency of the par- field free interior of the dee and again arrive at
ticles which is set equal to the frequency of the gap between the dees where, by that time,
the accelerating field. This is the Cyclotron the voltages are reversed in sign. and t~e:y are
Resonance Condition: accelerated again. The ions descnbe semlCucles
eB of increasing radius as their velocity and e~ergy
fa =fo 2rrm
(1) increase as a result of repeated acceleratlOns.
When they reach the maximum radius of t~e
where dee, they enter a channel between a septum m
one of the dees and the deflector. The deflector
fa = frequency of accelerating field is charged negatively and draws the particles
fo = rotational frequency out where they may strike a target in the target
e = charge of ion chamber or they may travel some distance as a
m = mass of ion beam outside the cyclotron before they are
B = magnetic field strength. used.
The kinetic energy of the accelerated particles
The important fact is that the rotational fre- is given by:
quency is independent of the energy of the par-
ticle and depends only on quantities which. a!e 1 B2R 2 e 2
(approximately) constant. In 1929 the pOSSibil- T=--- (2)
2 m
ity of using this relationship as the basIs for an
accelerator occurred to Ernest O. Lawrence, where T is the kinetic energy and R is the ra-
who like many other physicists at the time had dius of ion path at point of extraction. For pro-
CYCLOTRON 266

tons, Eq. (1) reduces to I = 1.52B MHz and no further acceleration. The energy limit of the
Eq. (2) to T = 0.484 B2R2 MeV with B in kilo- conventional cyclotron imposed by this phase
gauss and R in meters. The usual values of Bare error can be shown to be proportional to the
from IS to 22 kilogauss. square root of the accelerating potential and to
In addition to the resonance condition, a suc- be about 30 MeV for protons with a dee-to-
cessful cyclotron requires that the orbits be dee potential of 200 kV. It has not been pos-
stable, i.e., they must remain in the median sible to reach the theoretical maximum energy
plane and at the appropriate radius. The first is in practice, and for reasons made clear in
achieved by introducing in the magnetic field a the next sections, the incentive to do so has
small negative gradient with respect to radius. disappeared. The maximum energy which
The field lines are then bowed as shown in Fig. has been attained with protons is 22 MeV and
2, and the Lorentz force on a particle off the that required about 500 kV on the dees. Cur-
median plane has a vertical focusing compo- rents in cyclotrons are usually of the order of
nent. Radial stability results from the fact that 100 p.A but up to 1 rnA has been attained. The
the orbit of the particle is an equilibrium orbit most commonly used ions are protons, deu-
with the inward Lorentz force predominating at terons, and alpha particles, although heavier
radii larger than the equilibrium orbit and the ions such as carbon, nitrogen and oxygen ion-
centrifugal force predominating at smaller radii. ized to +3 or +4 have also been accelerated.
Ions which are displaced either vertically or A possibility of achieving higher energies with
radially then execute oscillations about the cyclotrons was opened up in 1945 when V.
equilibrium orbit. If the magnetic field is de- Veksler and E. M. McMillan independently
scribed by the index, pointed out the phase stable characteristic of
the cyclotron resonance condition [Eq. (1)1
r aB which may be explained if the equation is re-
n =-- - (3)
B ar written in terms of particle energy:
Be e 2
where r is the radius, then it can be shown that 10 = 2rr(Eo + T) (5)
for 0 < n < I stable oscillations occur with fre-
quencies: where Eo is the rest energy of the particle.
Consider a particle rotating in a cyclotron at
fz = fo Vn the resonant frequency and crossing the acceler-
Ir =/0 ~ (4) ating gap at a phase such that it gains no energy
and that a later arrival causes it to lose energy.
where Iz is the frequency of vertical oscillations If this particle is perturbed by an excess of en-
and Ir is the frequency of radial oscillations. ergy, 10 decreases and the particle loses energy.
The negative gradient in the magnetic field If the particle is perturbed in phase so that it
results, however, in the situation where the ro- arrives at the accelerating gap too early, it gains
tational frequency, Eq. (1), is not exactly the energy, 10 decreases, and the phase slips back.
same at all radii. In addition, it must be noted Perturbations in energy or phase thus result in
that the mass in Eq. (1) is the relativistic mass, oscillations about the equilibrium phase. Under
m = mo + Tle 2 and increases with energy. The these conditions, if the accelerating frequency
result of these two discrepancies is that the ro- of a cyclotron is slowly decreased, the ions will
tational frequency of the ion decreases as it is execute stable oscillations about that phase
accelerated and there is an accumulated phase which will give sufficient energy gain so that
lag between the ion and the accelerating field the radius and energy are matched as the orbits
which when it approaches rr radians, results in expand. This is the Principle 01 Phase Stability
as applied to the Synchrocyclotron. It com-
pletely removes the energy limitation of the
cyclotron previously discussed. This principle
FOCUS ING LORENTZ was immediately exploited and synchrocyclo-
COMPONENT FORCE trons (also sometimes called Frequency Modu-
lated or FM Cyclotrons, and in the U.S.S.R.,
Phasotrons) have been built which give protons
I f I \ , up to 1000 Me V. The only limit is the economic
I
I
I I
I I I
I I I
I
\
\
I
\
\
one due to the large size of the magnet.
\ I I I II I I I I The important structural difference between
~ \ \ \ 1\ I J a synchrocyclotron and a conventional cyclo-
7}: /
f I

~~V////ZJ1
~\ \ I I I tron is in the provision for a variable frequency.
This is accomplished by placing a variable ca-
pacitor in the resonant dee circuit. Rotary blade
MAGNETIC capacitors have been in common use for this
FORCE purpose, but in more recent designs, vibrating
LINES
blade capacitors have been preferred. The re-
FIG. 2 quired frequency swings are about two to one,
267 CYCLOTRON

and the usual modulation frequencies are about of the conventional cyclotron and, at the same
60 to 100 Hz. The ions are accelerated in pulses time, are capable of high average currents be-
as the accelerating frequency sweeps through cause they operate at a constant frequency. Pro-
its modulation cycle in contrast to the contin- vision of auxiliary magnet coils on the pole tips,
uous acceleration in a conventional cyclotron. to trim the field shape over a range of values of
The result is that average currents in synchro- average field, and adjustable frequency oscilla-
cyclotrons are about 1 per cent of cyclotron tors, to provide for different ions and a varia-
currents, thus removal of the energy limit has tion in maximum energy, have made the mod-
been accomplished at the, expense of a current ern cyclotron of this type very flexible.
limitation. In a further application of the Thomas prin-
Another method of circumventing the energy ciple, the magnet assembly is made up of indi-
limit of the cyclotron was proposed by L. H. vidual sectors with field-free spaces between
Thomas in 1938, seven years before the principle them. Fixed-frequency accelerating cavities are
of phase stability was enunciated. In ,the located between the sectors. This design varia-
Thomas proposal, the average magnetic field in- tion of the sector-focused cyclotron makes it
creases with radius so that the resonance con- economically possible to go to higher energies,
dition may be exactly matched by a constant but at a sacrifice of some of the features per-
accelerating frequency as the ion gains energy. mitting easy variation of energy and ion species.
The axial focusing force is supplied by an Cyclotrons may also be used two or even three
azimuthal variation of the magnetic field which in tandem. The extracted beam from a cyclo-
may be obtained by using sectored magnet tron designed for optimum acceleration of cer-
poles, Fig. 3a. The ion orbits are then no longer tain ions in a low energy range is used to pro-
circular, and the radial component of velocity vide an intense beam for injection into a larger
interacting with the azimuthal component of cyclotron designed for a higher energy. Super-
magnetic field produces an axial focusing force. conducting magnets have been used in some of
This is the "edge focusing" which occurs when the most recent cyclotron designs to produce
an ion crosses a fringe field obliquely, and it magnetic fields more than twice that of con-
has long been used in mass spectrometers and ventional magnets, resulting in more compact
other devices. installations and saving in magnet power.
This idea was well in advance of the theory The flexibility of cyclotron design, resulting
and practice of the cyclotron art at the time from the application of the Thomas principle
and was not immediately exploited. Develop- and concurrent technological developments, has
ment beginning in 1949 and extending to re- rendered the classical cyclotron and synchro-
cent years has resulted in a whole subclass of cyclotron obsolescent. A compilation of cyclo-
cyclotrons characterized by a fixed rotational trons made in 1978 in connection with the
frequency and focusing forces derived from Eighth International Conference on Cyclotrons
spatial variation in the magnetic field. For ex- and their Applications (see last reference) lists
ample, if the sectors are spiral shaped as in Fig. a total of 108 cyclotrons world-wide, in opera-
3 b, additional focusing forces of alternating gra- tion or under construction, of which 14 are
dient type are developed. These cyclotrons are synchrocyclotrons and only 4 classical. The re-
variously called Sector Focused, Isochronous, mainder are variations of the sector-focused
and AVF (azimuthally varying field) cyclotrons. design . Two of these use superconducting
They have energies well beyond the energy limit magnets.

HIGH FIELD

(a) (b)
LOW FIELD
FIG. 3
CYCLOTRON 268

TABLE I

SECTOR FOCUSED SECTOR FOCUSED


CYCLOTRON SYNCHROCYCLOTRON CYCLOTRON CYCLOTRON
60-inch Cyclotron, 184-inch Cyclotron 88-inch Cyclotron, (Two in tandem)
University of Lawrence Berkeley Lawrence Berkeley SIN, Swiss Institute for
Washington, Seattle Laboratory, Berkeley Laboratory, Berkeley Nuclear Research

First 1951 1946 rebuilt 1957 1962 1974


Operation

Magnet 1.52 m diameter, 4.8 m diameter, 2.24 m diameter, 9.2 m diameter,


19 kG max. field, 23.4 kG max. field, 3 spiral sectors 8 separated sectors,
197,000 kg 3,900,000 kg 20 kG max. field, 21 kG max. field,
272,000 kg 2,000,000 kg

RF Fixed frequency, Variable frequency, Fixed frequency, Fixed frequency,


11.6 MHz, 18-36 MHz for protons, adjustable 5.5-16.5 50 MHz,
two dees, Modulation frequency MHz for various 4 cavities, 600 kV/cavity
250 kV 64 Hz, particles,
single dee, single dee,
11 kV 75 kV

Beam Protons 11 MeV, Protons 740 MeV, Protons 60 MeV, Protons 588 MeV,
deuterons 21 MeV, deuterons 460 MeV, alpha particles 140 110 IJ.A design current.
alpha particles 42 alpha particles 910 MeV,
MeV, MeV, 12C4+ 193 MeV
150 /LA max. 1 /LA max. 16 0 6+ 315 MeV
current current. 3 rnA max. current.

Source of data, "Eighth International Conference on Cyclotrons and Their Applications," IEEE Transactions
NS-26, Number 2, 1979.

Table I gives a comparison of the salient Kolomensky, A. A., and Lebedev, A. N., "Theory of
design features and performance of typical Cyclic Accelerators," New York, John Wiley & Sons,
examples of a classical cyclotron, a synchro- 1966.
cyclotron, a sector focused cyclotron designed Burgerjon, J. J., and Strathdee, A. (Eds.), "Cyclo-
for high intensity and variable ion and energy, trons-1972" (Proceedings of the Sixth International
and a sector focused cyclotron designed for Cyclotron Conference), New York, American Insti-
high intensity and energy. The latter, giving tute of Physics, 1972.
copious secondary beams of pi mesons, are Joho, W. (Ed.), "Seventh International Conference on
sometimes called meson factories. Cyclotrons and their Applications," Basel and Stutt-
The early and continuing impetus for cyclo- gart, Birkhauser Verlag, 1975.
tron development has come from its use in re- Hicks, J. W., (Ed.), "Eighth International Conference
search in nuclear physics and chemistry and on Cyclotrons and their Applications," IEEE Trans-
particle physics. But applications in medicine actions NS-26, Number 2, 1979.
where cyclotrons have provided radioisotopes
for diagnosis and neutron and charged particle Cross-references: ACCELERATORS, LINEAR; AC-
beams for therapy have been not far behind. CELERATOR, PARTICLE; BETATRON; SYNCHRO-
Some· of the other important applications have TRON; ACCELERATOR, VAN DE GRAAFF.
been trace elements analysis by activation of a
sample and characterization of the resultant
activity; solid state studies by creation of ir-
radiation damage and by implantation of im-
purities; and production of many different CYCLOTRON RESONANCE
radioisotopes for use as tracers in many fields (DIAMAGNETIC RESONANCE)t
of science and technology.
The term cyclotron resonance is used to desig-
EDWARD J. LOFGREN
nate the resonant couplirlg of electromagnetic
References power into a system of charged particles under-
going periodic orbital motion in a uniform static
Livingston, M. Stanley, and Blewett, John P., "Particle magnetic field. The frequency of the electric
Accelerators," New York, McGraw-Hill Book Co.,
1962. tSupport of U.S. Air Force is acknowledged.
269 CYCLOTRON RESONANCE (DIAMAGNETIC RESONANCE)

field at resonance is simply related to the orbital


frequency of the electron in the magnetic field. (2)
The effect has been observed and studied ex-
tensively in gases and in solids. In this case, the constant energy surfaces in
One important application of the cyclotron wave vector space are spheres, and the cyclotron
resonance principle is made in the acceleration mass of Eq. (I) is just the effective mass, m *.
of charged particles, as in a cyclotron. In a uni- For energy extrema located at general points in
form magnetic field, H, a charged particle of wave vector space, Eq. (2) becomes
mass, me, undergoes orbital motion with an
angular velocity 1i
E(k)=-
2(kx2 y 2 kz2)
- +k- +- (3)
eH 2 mx my mz
We =--, (1)
me C in which the extremal point is taken as the
origin, and mx, my, and mz are three com-
in which e is the charge and C the velocity of ponents of an effective mass tensor. This gener-
light. Energy from the electromagnetic fields, alization is also necessary in describing the
i.e., from the alternating electric and magnetic energy bands for crystals with symmetry lower
fields, is transferred into kinetic energy of the than cubic.
particle, and the radius of the particle orbit is An expression for the cyclotron effective mass
increased with no change in angular velocity. which is valid for an electron orbiting on a
Particle acceleration takes place in vacuo in constant energy surface of energy E for an
order to prevent energy transfer to the gas by arbitrary E(k) is
means of collisions.
In solids, cyclotron resonance has been suc-
cessfully applied to studies of electronic energy
band structure. The perfectly periodic array of
H 2(:;tH
m e (E,k )=2rrfz (4)

atoms in an ideal solid scatters electrons co- in which kH is the wave vector component
herently. An electron experiencing such coher- parallel to the magnetic field, A is the area of
ent scattering can be described by the same the electron orbit in wave vector space, and
equations of motion as the free electron, except
(aA/aE)kH is the derivative of this area with
that the free electron mass is replaced by an
effective mass, m *. Incoherent scattering from respect to energy evaluated at constant kH. For
crystalline imperfections causes electronic col- spherical or ellipsoidal constant energy surfaces,
lisions which limit the number of completed the cyclotron mass is iridependent of both
electron orbits, thus giving rise to a frequency energy and kH' and for these two simple cases,
bandwidth for the cyclotron resonance absorp- me is given, respectively, by me = m *, and
tion. The observation of cyclotron resonance

(~J = m:~z + m~~z + m:~y


requires that the charged particle execute about
one complete cyclotron orbit without collisions, 2 (5)
or WeT;::: I, in which the collision time, T, is the
mean time between incoherent scatterings. A in which a, ~, 'Yare the direction cosines of the
long collision time is achieved by using samples magnetic field with respect to the axes of the
of the highest possible purity and lattice per- ellipsoidal constant energy surface.
fection and by cooling to very low temperature For solids which have relatively low carrier
(usually liquid He temperature, 4 K) to elimi- density (e.g., insulators, semiconductors, and
nate the thermal motion of the atoms. The con- semimetals) the electronic states which are im-
dition for cyclotron resonance can also be portant in the transport properties are located
satisfied by increasing We through the use of near energy band extrema. For nondegenerate
high magnetic fields, e.g., I DO-kilogauss static extremal points in wave vector space, E(k) can
fields are currently available which for free be expanded in a Taylor's expansion. The lead-
electrons results in We ~ 1.5 X 10 12 rad/sec or ing term of such an expansion would be given
an electromagnetic wave length of about I mm. by Eq. (3). For degenerate points (positions
Electrons moving in the periodic lattice of a where two or more levels have the same energy),
solid occupy energy levels which are specified a simple generalization of a .Taylor's expansion
by the wave vector quantum number, k, or by must be used. In solids with relatively high
the crystal momentum, 11k Since the number of carrier density (e.g., metals), the transport prop-
electrons is very large, the wave vectors assume erties are determined by electronic states which
an almost continuous range of values. A knowl- are far from the energy extrema and the E(k)
edge of the functional form of the dependence relation is not adequately described by a Taylor's
of the energy on wave vector is necessary for a expansion.
complete description of the behavior of elec- Cyclotron resonance experiments have been
trons in solids. The simplest form of the relation particularly successful in the quantitative deter-
between energy and wave vector valid for energy mination of the band parameters of the semi-
bands in cubic crystals is conductors silicon and germanium. The success-
CYCLOTRON RESONANCE (DIAMAGNETIC RESONANCE) 270

ful application of this technique in these semi- energy given by Eq. (4), which is, in general,
conductors is attributed to the high quality of dependent on the wave vector component
the available material, and to the complete parallel to the magnetic field. The interpretation
classification of the possible forms of the of these experiments is not simple but when
theoretical band structure model. Since in these coupled with experiments which measure the
materials the intrinsic carrier concentration is shape of the Fermi surface, such as DE HAAS-
extremely small at low temperatures, electrons VAN ALPHEN EXPERIMENTS, a fairly complete
are optically excited out of filled valence levels determination of the electronic band structure
in the crystal in order to produce sufficient is possible. These techniques have been success-
carriers to obtain a measurable signal. Reso- fully applied in the study of copper.
nances are observed both for the excited elec- Cyclotron resonance in ionic crystals allows
trons and for the holes left behind in the empty the measurement of polaron effects. The POL-
levels in the valence band. ARON denotes the charge carrier together with
In metals, the high carrier density requires its local lattice distortion. Cyclotron resonance
modification of the conventional cyclotron observed in AgBr has been interpreted as a
resonance experiment. Two important conse- polaron orbiting in the applied magnetic field .
quences of this high carrier density are the non-
uniform penetration of the electromagnetic G. DRESSELHAUS
field in the skin depth and the inapplicability of
the simple effective mass theory to describe the
electronic states. To overcome the problem of References
the small electromagnetic penetration depth, Kittel, C., "Introduction to Solid State Physics," sec.
the geometrical arrangement suggested by Azbel ed. p. 371, New York, John Wiley & Sons, Inc.,
and Kaner is used. The static magnetic field is 1956.
applied in the plane of a flat sample, so that the Lax, B., and Mavroides, J. G., "Solid State Physics,"
electrons near the surface can be accelerated by Vol. XI, p. 261, New York, Academic Press Inc.,
the electromagnetic fields, and the orbits de- 1960.
scribed by a cyclotron radius which is large
compared with the skin depth. In this way,
whenever the applied frequency is a multiple of
the cyclotron frequency, a resonant condition Cross-references: CYCLOTRON, DE HAAS-VAN AL-
is satisfied. This type of cyclotron resonance PHEN EFFECT, DIAMAGNETISM, FERMI SUR-
experiment yields an effective mass at the Fermi FACE, RESONANCE.
D
DE HAAS-VAN ALPHEN EFFECT* cylinders grow in size, and at a critical field
given by
The de Haas-van Alphen effect is the periodic
oscillation of the magnetization with inverse IIH = 211(n + r)elhcAo (2)
magnetic field. These oscillations were dis-
covered in 1930 in Bi at very low temperature the nth cylinder becomes too large to be con-
by W. J. de Haas and P. M. van Alphen. For tained in the surface. Here Ao is the maximum
some time the effect was thought to be unique cross-sectional area of the sphere. This gives
to Bi but it was observed in zinc in 1947 and rise to oscillations in the free energy F periodic
since then in a very large number of metals in IIH. This same result occurs for more irregu-
and intermetallic compounds. Related magneto- lar shaped Fermi surfaces, with the oscillations
oscillatory behavior in the electrical resistance corresponding to extremal cross sections.
has even been observed recently in organic The effect is usually observable only below
metals. 4 K because the Fermi surface is blurred out
Peierls first explained the effect in 1933 as over an energy range ~kT at finite temperatures
a consequence of the quantitization of the mo- T. If we define an effective carrier mass m'" by
tion of the conduction electrons in a magnetic
field. In zero field, electrons have a nearly con- m'" =-fl 2/211(dAldE)kz (3)
tinuous range of energies up to a maximum then the oscillatory term in the free energy F is
called the Fermi energy EF. In momentum or given l by
wave number space, all states are occupied up
to the Fermi surface. The Fermi surface is thus Fo: TH312 exp [-211 2 k(T+X)m"'cleHfz]
the boundary between filled and unfilled states
in momentum space. . cos [cfzA 0 leH) + (1114) - 0] (4)
When a magnetic field Hz is applied, only
discrete energy levels En are available for a given for the case when the argument of the exponen-
kz, and the separation between these levels tial is much greater than unity. Here k is the
increases with field. If the energy surfaces are Boltzmann constant, 0 is a phase factor, and X
ellipsoidal so that E is proportional to k 2 , is a term that takes into account the impurity
En is exactly soluble from the SchrOdinger collision broadening of the energy levels. By
equation. Typically the energy surfaces are varying the temperature as the field is held
much more complex but an approximate ap- constant m'" can be determined, and then by
proach due to Onsager can be used. By applica- varying field at constant temperature the scat-
tion of the Bohr-Sommerfield quantization con- tering factor X can be measured.
dition to orbits normal to the applied field Historically the de Haas-van Alphen effect
direction, Onsager showed that the area A en- was measured using torsion balance techniques,
closed in an orbit satisfies usually at slowly varying fields considerably
below 40 kG or using pulsed field techniques
A = 211(n + r) eHtfic (1) to a maximum of ~200 kG. In the former
method the torque exerted on a sample as the
where n is an integer, r is a constant near! (ex- field is changed is measured with the advantages
actly ! for quadratic energy surfaces), 11 is of high sensitivity and accuracy. The disad-
Planck's constant divided by 211, and c is the ve- vantages are insensitivity to spherical segments
locity of light. For a free-electron metal (E 0: of the Fermi surface and practical limitation to
k 2 ) at T = 0, the electrons are contained in a moderate fields, which make large cross sec-
sphere in k-space of maximum energy EF, while tions difficult or impossible to observe. The
in a field Hz they are found in a series of pulsed field technique employs 100-200 kG
cylinders with axes parallel to Hz as shown in fields with rise times of the order of 10 msec .
Fig. 1. As the field magnitude increases the These high fields permit observation of very
high de Haas-van Alphen frequencies but ac-
"'This work performed at Sandia National Labora- curacy is limited by the necessity of making
tories supported by the U.S. Department of Energy measurements of the susceptibility and field
under Contract Number DE-AC04-76DPO0789. magnitude in a few msec.

271
DE HAAS-VAN ALPHEN EFFECT 272

I
II

[100]
FIG. 2. Fermi surface of thorium as calculated by
Gupta and Loucks.

i I
In the past decade, the bulk of the detailed
de Hass-van Alphen measurements have em-
ployed what is known as the field modulation
FIG. 1. Quantitized orbits for a free-electron metal technique. 2 Here a small balanced pickup coil
in a magnetic field Hz. The dashed line is the boundary detects the oscillations in the magnetization of
of the Fermi surface. the sample as the applied field is slowly varied

I
6 I I
2. 08 X 10 7 G (0 ) y
f- [ OO l ] [OlO J
I II
Itil!'I!I
f-- I-

IA I
I
Iti I
..\ I v\ !" .......\ I
lit'''''' .,...-..,.I' ~
..\'.i' I ~,u._\
,I • •

I~

FIG. 3. De Haas-van Alphen oscillations obtained by rotation of the magnetic field about a single crystal
of o:-U at -100 kG, 9 kbar and 1 K.
273 DENSITY AND SPECIFIC GRAVITY

while being modulated sinusoidally. The tech- erty of substances is called density. Thus, den-
nique lends itself to use with superconducting sity is a measure of the amount of matter that
coils, which can now be obtained in the 100-150 occupies a given amount of space. Precisely
kG range. The flexibility to vary the modula- defined, density is the mass of a substance per
tion amplitude and direction when coupled unit volume. In the international system of units
with detection of various harmonics of the (Systeme International) the density of a sub-
modulation allows spectrometer action in sort- stance is given in kilograms per cubic metre
ing out complicated de Haas-van Alphen spectra. (kg/m 3 ). In the centimetre/gram/second (cgs)
Fourier analysis of the spectra is also routinely system the density of water is I g per cubic
employed. These techniques have resulted in an centimetre. It is possible to remove the system
enormous amount of very accurate, and in of units from the numerical value of density
many cases complete, experimental mappings by defining a property called specific gravity.
of the Fermi surfaces of a great many metals. The specific gravity is the ratio of the density
With high-quality, long-mean-free-path crystals, of the given substance to the density of a stan-
the limitation on the accuracy of the cross- dard substance. It has been customary to define
sectional area data is reduced to the limitations the specific gravity of a substance as the ratio
in counting the oscillations and determining of the density of the substance to the density
the absolute value of the magnetic field (which of water. As a result, since the density of water
can be monitored by in situ nuclear magnetic in the cgs system is I g/cm 3 , the specific gravity
resonance). of an object has the same numerical value as its
The calculated Fermi surface of thorium is density in cgs units.
shown in Fig. 2 and all of the extremal orbits Density is the ratio of two independent ex-
shown on each of the three sheets have been trinsic properties of a given amount of a sub-
observed. The detailed angular dependence of stance, its mass divided by its volume. In com-
the sheets can be determined by rotation of mon use age the concept of density is frequently
the magnetic field (at constant magnitude) confused with the weight of an object. In addi-
with respect to the crystallographic axes as tion it is not uncommon to find weight and
shown in Fig. 3 for a-U. These data were volume confused in common discussions of
taken at I K at 100 kG at a pressure of 9 kbar objects. The common confusion among weight,
(~9000 atmospheres) in order to avoid low- volume, and density is displayed in the child's
temperature phase changes below 43 K. This is trick question, "Which is heavier-a ton of bricks
an indication of the versatility of present-day or a ton of feathers?" When trying to convey
experimental de Haas-van Alphen techni<iues. scientific information about the density of an
De Haas-van Alphen data have been important object to the public, care must be taken to
in the development of band calculations of elec- explain this intrinsic property of matter. It may
tronic structure of metals. Because of its great be best to contrast density to the daily experi-
sensitivity and the fact that it is nearly a dc ences of heft or heaviness.
experiment, the field modulation technique Density is an intrinsic property of an object,
also lends itself to measurements of the pres- i.e., it will not vary from one part of a homoge-
sure and strain dependence of the Fermi sur- neous object to another, but it does depend
face. This type of experiment provides critical upon the volume of the object . Hence, density
and direct tests for the theoretical model is a function of those variables that can change
descriptions which can be calculated easily at the volume of an object. In general, the volume
various interatomic spacings. of an object is a function of both temperature
and pressure. This functional dependence of
J. E. SCHIRBER volume on temperature and pressure is most
striking for gaseous materials (see GAS LAWS)
References but temperature is also an important considera-
1. Lifshitz, I. M., and Kosevich, A. M., J. Exper. tion for all three states of matter (see EXPAN-
Theor. Phys. 29,730 (1955). SION, THERMAL).
2. Stark, R. W., and Windmiller, L. R., Cryogenics 8, The concept of density as mass per unit
272 (1968). volume seems to imply a continuous medium.
3. Boyle, D. J., and Gold, A. V., Phys. Rev. Lett. 22, According to the modern atomic theory of
461 (1969). matter (see ATOMIC PHYSICS) all matter is
4. Schirber, J. E., and Arko, A. J., Phys. Rev. 821, made up of discrete entities called atoms.
2175 (1980). Hence, the density of a substance is a micro-
scopic measure of how tightly the atoms of the
Cross-references: FERMI SURFACE, MAGNETISM. substance are packed together. If a solid sphere
model is adopted for an atom, then the density
of individual atoms of various elements can be
DENSITY AND SPECIFIC GRAVITY calculated and compared to the densities of
normal materials (see the table at the end of
Definitions The amount of a substance that this article). Similarly according to the nuclear
occupies a given amount of space is an intrinsic model of atoms (see ATOMIC PHYSICS), an
property of the substance. This intrinsic prop- atom consists of a massive nucleus situated in
DENSITY AND SPECIFIC GRAVITY 274

the middle of an electronic cloud. Once again, Liquids A standard object and the hydro-
if a solid spherical model is adopted for a static method can be used to find the density
nucleus, then one can take typical nuclear prop- of a liquid. This process for determining the
erties and calculate nuclear densities to com- density of a liquid calls for measuring the
pare with the densities of other kinds of matter weight of the object in air Wa and in water,
(see the table at the end of this article). Ww , and in the liquid, WI, then the density of
Solids Since the compressibility of solids is the liquid is given by
very small, their density is essentially indepen-
dent of pressure and only shows a small tem- PI = (Wa - WI)Pw/(W a - Ww ) (kg/m 3 ).
perature dependence. The direct measurement
method of density involves measurement of Accurate density measurements must mini-
both mass and volume under the same condi- mize temperature effects and are usually made
tions. For example the mass of 1 cubic meter of at specified temperatures (e.g., 15°C).
aluminum is 2700 kg. The hydrostatic method A mechanical system for the rapid measure-
of measuring densities is based on the applica- ment of the density of liquids with extremely
tion of Archimedes' principle, which states that high accuracy has been developed. 2 This system
the buoyant force on an object immersed in a measures the period of oscillation of aU-tube
fluid is proportional to the mass of the volume containing the liquid of unknown density. The
of the displaced fluid. The hydrostatic method period of oscillation is directly related to the
for measuring the density of a solid weighs the density of the liquid in the U-tube.
solid object in air and in water and uses the loss The operation of this system is based on the
of weight in water as a way to determine the physics of a damped harmonic oscillator. The
density of the object. If the solid has a weight instrument provides an external electromagnetic
in air Wa and a weight in water Ww then the force to balance the damping forces on the
density of the solid Ps, neglecting the buoyancy oscillating U-tube. While this force balance is
of air, is given by maintained the resonant oscillations of the sys-
tem are established. The period of the resonance
Ps = WaPw/(W a - Ww ) (kg/m 3 ) oscillations is then measured and related to the
density of the liquid as follows:
where Pw is the density of water. The total mass of the vibrating objects is
The determination of the density of a solid given by
by the hydrostatic method can be corrected for
the buoyant force of the air. Then the density M=Mt+Vp
of the solid is given by
where Mt is the mass of the empty U-tube, V is
Ps = Wa (Pw - Pa)/(Wa - Ww ) + Pa the volume of the tube, and P is the density of
the liquid in the tube.
where Pa is the density of air. The period at resonance is given by
Sources of error in the hydrostatic method
are largely due to surface tension effects and T = 2rr(M/k)1/2 = 2rr(Mt + Vp)/k)1/2
trapped air on the surface of the solid. As a rule
of thumb a minimum volume of 5 cm 3 is re- where k is the restoring force constant of the
quired for an accuracy of one part in 10 4 in oscillator.
the corrected density. 1 The density of the liquid can be computed
For solids less dense than water, a sinker from the period of oscillation and the values of
(denser than water) is used. The sinker is the instrument constants. In practice the instru-
weighed in air and in water as described above ment COllMants are determined by measuring
and then weighed with the unknown. The den- the periods of oscillations of fluids of known
sity of the unknown sample Pu is then found density such as dry air and pure water.3
in terms of: The commercially available instrument is
designed to measure total time for a fixed num-
Wua = weight of unknown in air; ber of oscillations in time ranges from 0.7 to
480 seconds. * During the time of measurement
Wsa = weight of sinker in air the temperature of the sample must be con-
trolled to within ± 10- 2 °c for absolute errors
Wusw = weight of unknown and sinker in the density of the fluid less than 10-3 kg/m 3 .
in water; In practice this temperature control is difficult
and leads to the use of small sample volumes
Wsw = weight of sinker in water i.e., seven milliliter volumes are the most com~
Pu = WuaPw/(W ua + Wsa - Wusw ) mono The instrument specifications call for a
maximum precision of 1.5 X 10-3 kg/m 3 for a
- (Wsa - Wsw)) measuring range of 0-3 X 10 3 kg/m 3 when
density differences are measured. In comparing
Pu = WuaPw/((Wua - (W usw - Wsw))
. (kg/m 3 ) *The commercial instrument is manufactured by the
A. Paar Company of Graz, Austria and is marketed by
where Pw is the density of water. the Mettler Instrument Corp. in the U.S.A.
275 DIAMAGNETISM

the measurements using this mechanical oscilla- follows :


tion instrument to other sources of density
measurement of standard sodium chloride solu- p=MjV=MPo(l +0.00366(T- To»/PV
tions, the precision of this instrument was
found to be better than 5 parts in a million for where M is the mass of the gas, V is the volume
all cases. of gas at a pressure P and temperature T(K),
Gases The densities of a gas are strongly de- and Po and To are the standard values of pres-
pendent on both temperature and pressure. It is sure and temperature. The experimental method
the custom to give density values at the standard used is the direct measurement of a known
temperature and pressure (STP) values of oDe volume V of a gas at ambient temperature and
and I atmosphere, respectively. Measurements pressure. A standard flask of air is evacuated
under other conditions may be reduced to STP then filled with the gas to ambient pressure'
and its mass is measured. '
values by using the appropriate equation of Applications Since density is an intrinsic
state. In many cases, the ideal gas law can be pr<?perty of a substance and is, in general, a
used to give the STP density p from measure- umque value for each substance, density can
ments at any pressure and temperature as be used as a way to distinguish one su bstance
from another. In systems where the force on
o.bjects .is proportional to mass, as in a gravita-
t1on~l.fleld (Se~ gravity) differences among the
TABLE OF TYPICAL DENSITIES densIties of flUIds can be used to separate fluids
from one another.
Density Typical Values of Densities The materials we
Substance (kg/m 3 ) commonly encounter have densities that vary
only over a few orders of magnitude from air
Smooth density of galactic v:ith a density of 1.3 kg/m 3 to gold with a den-
material throughout the SIty of 19 X 103. However, as shown in the
universe 2 X 10-28 Table of Typical Densities, in the whole uni-
Mean density of interstellar gas 3 X 10-21 verse the densities of matter vary over 45 orders
Mean density of the moon 3.3 X 10 3 of magnitude.
Mean planet densities
ROBERT G. FULLER
Saturn 0.70 X 10 3 RICHARD M. FULLER
Jupiter 1.33 X 10 3
Mars 3.93 X 10 3
Venus 5.24 X 10 3 References
Earth 5.515 X 10 3 1. Marton, L., "Classical Methods," Volume 1 of
Typical densities of earth materials "Methods of Experimental Physics" (I. Estermann,
Ed.), Academic Press, New York, 1959.
Air 1.293 2. Stabinger, H., Kratky, 0., and Leopold, H., Mon-
Teak wood 0.58 X 10 3 tash. Chem 98,436 (1967).
Alcohol 0.80 X 10 3 3. Elder, J. P., Volume 61 of "Methods of Enzymol-
Ice (O°C) 0.917 X 10 3 ogy" (C. H. W. Hirs and S. N. Timasheff, Eds.),
Water (O°C) 0.999841 X 103 Academic Press,.New York, 1979.
Water (4°C) 0.999973 X 103 4. Fuller, H. Q., Fuller, R. M., and Fuller, R. G.
Water (20°C) 0.998203 X 10 3 "Physics Including Human Applications," Harpe;
Aluminum 2.70X 10 3 and Row, New York, 1978. See pp. 182 and
Copper 8.96 X 10 3 690-691.
Silver 10.5 X 103 5. Anderson, H. L. (Ed.), "Physics Vade Mecum"
Mercury 13.6 X 10 3 American Institute of Physics, New York, 1981. '
Gold 19.3 X 103 6. "Handbook of Chemistry and Physics," 64th Edi-
Platinum 21.5 X 10 3
tion, The Chemical Rubber Company, Boca Raton,
Mean densities of atoms FL,1982.
(using solid sphere model)
Cross-references: ATOMIC PHYISCS; COMPRESS-
Hydrogen 4.4 X 10 1
IBILITY, GAS; EXPANSION, THERMAL; GAS
Oxygen 2.0 X 10 3
1.7 X lOs LAWS; MASS AND INERTIA.
Uranium
Densities of stars
The sun 1.41 X 10 3
White dwarf 1 X 10 9 DIAMAGNETISM
Neutron star 2 X 10 17
Magnetic sl!sceptibility is defined as Xm =M/H,
Density of nuclear matter where M IS the magnetic moment per gram
(using solid sphere model) (gram susceptibility) or per mole (mole suscep-
tibility) that is induced by an external magnetic
DIAMAGNETISM 276

field strength H. If M > 0, the susceptibility is els within an energy band of conducting states,
paramagnetic; if M < 0, the susceptibility is and at large H these "bunches" can be resolved.
diamagnetic. Whereas most magnetic phenom- Ther are known as Landau levels because Lan-
ena, including PARAMAGNETISM, are manifesta- dau first presented the quantum mechanical
tions of ELECTRON SPIN, diamagnetism reflects theory of conduction-electron diamagnetism,
electron angular momentum. which for single parabolic energy bands gives
If an external field strength H is applied to a
conductor so as to change the number of lines 2
of flux that thread through it, there is induced Xm cond = - "3 IlB2N(EF),
in the conductor an electric current whose asso-
ciated magnetic field opposes the change (Fara- where IlB is the Bohr magneton and N(EF) is
day's Law of Induction and Lenz's Law). In the density of energy levels at the Fermi energy
most conductors the current I that is thus in- EF. Since N(EF) oscillates with H as successive
duced is rapidly dissipated as heat through the Landau levels pass through EF, Xm cond shows
12 R loss, where R is the electrical resistance. oscillations in large H (DE HAAS-VAN ALPHEN
These currents are known as eddy currents, and EFFECT). Transitions between Landau levels,
they are of great practical interest in ac applica- which are split by an energy hwp = eHlm*c,
tions. However, such transients do not influence may be induced by an electromagnetic field of
the dc measurement of Xm. There are three angular frequency wp. This gives rise to reso-
other classes of electron-momentum change in- nance power absorption as w passes through
duced by H that are not dissipated: electron wp (cyclotron or diamagnetic resonance). These
currents in SUPERCONDUCTORS, where the re- two effects are used to map out the contours in
sistance is R = 0; currents of atomic dimension momentum space of the Fermi energies in
induced in atoms or molecules or the atomic metals.
"core" electrons of solids; and microscopic con-
duction-electron helical currents having quan- JOHN B. GOODENOUGH
tized helical radii.
In a superconductor, switching on of an H Reference
induces eddy currents that permanently shield
the inside of the conductor from penetration 1. Landau, L. D., Z. Physik, 64,629 (1930).
by the magnetic-field lines. Therefore the super-
conductor is an ideal diamagnet, except for a Cross-references: DE HAAS-VAN ALPHEN EFFECT;
small skin depth at the surface. If a supercon- MAGNETISM; SUPERCONDUCTIVITY.
ductor is cooled through the normal-conducting
~superconducting transition temperature in the
presence of H and after the eddy currents in-
duced in the normal-conducting state have been DIELECTRIC THEORY
dissipated, the field lines are rapidly expelled A dielectric is a material having electrical con-
from the superconductor (Meissner effect). This ductivity low in comparison to that of a metal.
proves that the ideally diamagnetic state is It is characterized by its dielectric constant and
thermodynamically stable. dielectric loss, both of which are functions of
An external field H superposes on the motion frequency and temperature. The dielectric con-
of atomic or molecular electrons (or the atomic stant is the ratio of the strength of an electric
core electrons in solids) a common circular mo- field in a vacuum to that in the dielectric for
tion about H of angular frequency WL = the same distribution of charge. It may also be
eHl2mc, where elm is the electronic charge-to- defined and measured as the ratio of the capaci-
mass ratio (Larmor's theorem). This atomic tance C of an electrical condenser filled with
current produces an atomic moment that is the dielectric to the capacitance Co of the evac-
proportional to the square of the distance of a uated condenser:
classical electron from the nucleus, rj ~ lAo
Therefore the diamagnetic contribution from f = CICo
electrons localized about an atomic nucleus is
The increase in the capacitance of the con-
Xm core = - (Ne 2 /6mc 2 ) L rj2, denser is due to the polarization of the dielec-
tric material by the applied electric field. Since
j
the dielectric constant is not a constant, it is
where N is the number of atoms per gram (or frequently called the "dielectric permittivity."
mole). The relative permittivity or dielectric constant
In addition to macroscopic eddy currents, is the ratio fIfo, where fo is the permittivity or
conduction electrons tend to move in micro- dielectric constant of free space. In the mks sys-
scopic helical paths in the presence of an H . The tem of units, the dielectric constant of free
contribution to Xm from this helical motion is a space is 8.854 X 10- 12 farad/m, while in the esu
purely quantum mechanical effect. The radii of system the relative and the absolute dielectric
the H-induced helical paths are quantized, constants are the same. The relative dielectric
which leads to a "bunching" of the energy lev- constant, which is dimensionless, is the one
277 DIELECTRIC THEORY

commonly used. When variation of the dielec- A much better representation of the dielectric
tric constant with frequency may occur, the behavior of polar liquids is given by the Onsager
symbol is commonly primed. When a condenser equation
is charged with an alternating current, loss may
occur because of dissipation of part of the en- Edc - 1 _ €o<> - 1
ergy as heat. In vector notation, the angle 0 be- Edc +2 €oo +2
tween the vector for the amplitude of the charg-
ing current and that for the amplitude of the = + 2)
3Edc(Eo<> 4rrN 1Jl 2 (4)
total current is the loss angle, and the loss tan- (2Edc + Eoo)(edc + 2) 9kT
gent, or dissipation factor, is
Kirkwood developed an equation differing
Loss current €" from that of Onsager (Eq. 4) in that Jl2 is mul-
tan 0 = - - - - - - tiplied by a correlation parameter g in an at-
Charging current €'
tempt to account for the hindrance of the rota-
where e" is the loss factor, or dielectric loss, of tional orientation of dipolar molecules by their
the dielectric in the condenser and e' is the neighbors. The departure of the value of g from
measured dielectric constant of the material. unity is a measure of the hindrance to molecular
At low frequencies of the alternating field, rotation by short-range intermolecular forces.
the dielectric loss is normally zero and e' is in- Positive deviations of g from unity occur when
distinguishable from the dielectric constant €dc short-range hindering torques favor parallel
measured with a static field. Debye has shown orientation of neighboring dipolar molecules,
that while negative deviations occur when these
edc - 1 = 4rrNJ (ao+ Jl2 ) (1)
torques favor antiparallel orientation. Empiri-
cally determined values of g are not far from
€dc + 2 3 3kT unity for normal or unassociated liquids, but
may depart considerably from unity for abnor-
where N J is the number of molecules or ions mal or associated liquids. When g = 1, the Kirk-
per cubic centimeter; ao is the molecular or wood and the Onsager equations are identical.
ionic polarizability, i.e., the dipole moment in- Both equations contain the approximation in-
duced per molecule or ion by unit electric field
volved in treating the dipolar molecules as
(1 esu = 300 volts/cm); Jl is the permanent di- spheres. 10hari has been successful in treating
pole moment possessed by the molecule; k is the water in certain clathrate hydrates, in which
the molecular gas constant, 1.38 X 10- 16 , and he finds the dipole moment of H2 0 increased
T is the absolute temperature. An electric di- by nearly 30% over its vapor phase value.
pole is a pair of electric charges, equal in size, An equation derived by Debye for the change
opposite in sign, and very close together. The
dipole moment is the product of one of the two of dielectric constant or permittivity ~E pro-
charges by the distance between them. duced by application of an electric field of
In Eq. (1) Jl2/3kT is the average component intensity E to a liquid or gas
in the direction of the field of the permanent -4rrN 1 Jl 4 E 2
dipole moment of the molecule. In order that ~E=--..!.:...-- (5)
this average contribution should exist, the mole- 45 (kT)3
cules must be able to rotate into equilibrium
with the field. When the frequency of the alter- showed that any departure from linear depen-
nating electric field used in the measurement is dence of polarization upon the intensity of the
so high that dipolar molecules cannot respond applied field was too small to be evident at the
to it, the second term on the right of the above field intensities « I 00 volts cm- 1 ) normally
equation decreases to zero and we have what used in measurement of e, but at higher applied
may be termed the optical dielectric constant fields in the kilowatt region and, especially with
€o<>' defined by the expression molecules of large dipole moments, departure
from linearity was sufficient to give significant
Eo<> - 1 _ 4rrN 1 a values of ~E. The important work of A. H.
(2) Piekara and his coworkers in this area has been
€o<> + 2 - 3 0
reviewed and discussed by Davies (Hill, Vaughan,
Eo<>differs from n 2 , the square of the optical re- Price, and Davies, 1969) and more recently by
fractive index for visible light, only by the small Davies and by Piekara himself (Neel, L., Ed.).
amount due to mfrared absorption and to the The Onsager treatment has also been introduced
small dependence of n on frequency, as given to change the factor (e + 2)4/81 in Eq. (5).
by dispersion formulas. It is usually not a bad Several simple liquids, such as ethyl ether, chlo-
approximation to use eo<> =n 2 . The general Max- roform, and chlorobenzene, the molecules of
well relation €' = n 2 holds when e' and n are which have moderate-sized dipole moments,
measured at the same frequency. The Debye were found to show the predicted decreases in
equation may be written in the form dielectric constant when a strong electric field
was applied. However, in the case of the more
edc - 1 _ eo<> - I = 4rrNJ Jl2 (3) polar nitrobenzene and 1,2- and 1,3-nitrotolu-
Edc + 2 eo<> + 2 9kT ene, positive values of ~e were observed, chang-
DIELECTRIC THEORY 278

ing to negative on dilution with a nonpolar it follows that


solvent, which presumably reduced or destroyed
pairwise orientation of the dipolar molecules. Em" = (Edc - E",,)/2 (10)
These and more or less similar changes in dielec-
tric constant or permittivity produced by appli- and that the corresponding values of wand
cation of a high electric field are called nonlinear E' are
dielectric or saturation effects, although the (11)
degree of saturation attainable before dielectric Wm = l/T
breakdown occurs is normally small. An excep- and
tion cited by Davies is a dioxan solution of
polY('Y-benzyl-L-glutamate), which has an enor- (12)
mous molecular dipole moment of about 2000
D. At a field intensity of about 40 kV cm- 1 , the The symmetrical loss-frequency curve predicted
dielectric constant of the solution is reduced by by this simple theory is commonly observed for
an amount LlE = -0.10 to equal that of the pure simple substances, but its maximum is usually
solvent, which means that virtually complete lower and broader because of the existence of
dielectric saturation has occurred. Liszi, Mes- more than one relaxation time. Various func-
zaros, and Ruff have developed an equation for tions have been proposed to represent the dis-
the field dependence of the dielectric constant tribution of relaxation times. A convenient
or permittivity of liquids taking into account representation of dielectric behavior is obtained,
a field dependence of the Kirkwood correlation according to the method of Cole and Cole, by
parameter g. This equation predicts both nor- writing the complex dielectric constant as
mal and anomalous dielectric saturation and the
structure making and breaking effects of the
field. E* = E + - - - -Edc
' = --- -t""
'-"-- (13)
1 + (jWTo)l-a
Anomalous dielectric dispersion occurs when
00

the frequency of the field is so high that the where To is the most probable relaxation time
molecules do not have time to attain equilib- and 0: is an empirical constant with a value be-
rium with it. One may then use a complex di- tween 0 and 1, usually less than 0.2. When the
electric constant values of E" are plotted as ordinates against
those of E' as abscissas, a semicircular arc is
E* = E' - iE" (6) obtained intersecting the abscissa axis at E' =
Eoo and E' = Edc. The center of the circle of
where j =..;=t. Debye's theory of dielectric be- which this arc is a part lies below the abscissa
havior gives axis, and the diameter of the circle drawn
through the center from the intersection at Eoo
* _ + Edc - E""
(7) makes an angle o:n/2 with the abscissa axis.
E - E"" 1 + jWT When 0: is zero, the diameter lies in the abs::issa
axis, there is but one relaxation time, and the
where W is the angular frequency (2n times the behavior of the material conforms to the simple
number of cycles per second) and T is the di- Debye theory. When, as may arise from intra-
electric relaxation time. Dielectric relaxation is molecular rotation, a substance has more than
the decay with time of the polarization when one relaxation mechanism, or, when the mate-
the applied field is removed. The relaxation rial is a mixture, the observed loss-frequency
time is the time in which the polarization is re- curve is the resultant of two or more different
duced to l/e times its value at the instant the curves and, therefore, departs from the simple
field is removed, e being the natural logarithmic Debye or Cole-Cole curve.
base. The behavior of a good many materials which
Combination of the two equations for the have been found, mostly at low temperatures,
complex dielectric constant and separation of to depart from the symmetrical arc given by
real and imaginary parts gives Eq. (12) has been well represented by the em-
pirical equation of Cole and Davidson, in which
E' = E"" Edc - E""
+ --==-----".:.... (8) the exponent 1 - 0: in Eq. (I 3) is replaced by /3.
1 + W 2 T2 The corresponding curve is a skewed arc and
the empirical parameter /3, which has values be-
E
"
= (Edc1 +- WE""2 T2)WT (9) tween 0 and 1, measures the degree of skewness.
Glarum has accounted for skewed-arc behavior
in terms of the diffusion of lattice defects.
These equations require that the dielectric con- Anderson and Ullman have treated the reorien-
stant decrease from the static to the optical di- tation probability of a molecule as a function of
electric constant with increasing frequency, the free volume, which fluctuates as the result
while the dielectric loss changes from zero to of random thermal motion. If the rate of free-
a maximum value E'~ and back to zero. These volume fluctuation is slow compared with that
changes are the phenomenon of anomalous di- of molecular reorientation, the Cole-Cole plot is
electric dispersion. From the above equations, symmetrical, but flatter, the greater the depen-
279 DIFFERENTIAL EQUATIONS IN PHYSICS

dence of the relaxation rate on the free volume. e"(observed) = edc" + e"(Maxwell-Wagner)
When the rate of change of free volume is much
greater than that of reorientation, all of the + e"(Debye) (17)
molecules have the same environment and a
single relaxation time should be observed. For CHARLES P. SMYTH
an intermediate situation a skewed-arc plot is to
be expected. These and other approaches to the References
problems of dielectric behavior have been de-
scribed by Hill, Vaughan , Price, and Davies. Nee Bottcher, C. J. F., "Theory of Electric Polarization,"
and Zwanzig have formulated a theory of di- Second edition, revised by O. C. Van Belle, P.
electric relaxation involving dielectric friction Bordewijk and A. Rip, Vol. I, New York, Elsevier,
on the rotating dipole, which leads to a fre- 1973.
quency-dependent relaxation time. Frequency- Debye, P., "Polar Molecules," reprinted by Dover, New
dependent dynamic viscosity has been used by York, 1945.
10hari and Smyth to explain an apparent wide Frohlich, H., "Theory of Dielectrics," Second edition,
distribution of the relaxation times of super- London, Oxford University Press, 1958.
cooled solutions of rigid polar molecules as well Hill, N. E., Vaughan, W. E., Price, A. H., and Davies,
as the seemingly low dielectric relaxation times M., "Dielectric Properties and Molecular Behavior,"
found for polar molecules at high frequencies New York, Van Nostrand Reinhold Co., 1969.
or high viscosities. The complexity of molecular Johari, G. P. and Smyth, C. P.,l. Am. Chem. Soc. 91,
behavior in condensed matter is such that no 5168 (1969);1. Chem. Phys. 56,4411 (1972).
molecular theory of dielectrics is completely Johari, G. P., 1. Chem. Phys. 74, 1326 (1981).
satisfactory at the present time. Kirkwood, J. G., 1. Chem. Phys. 7,911 (1939).
If the dielectric material is not a perfect di- Liszi, J., Meszaros, 1., and Ruff, I., 1. Chem. Phys.
electric and has a specific dc conductance k' 74,6896 (1981).
(ohms-t' cm- 1 ), there is an additional dielectric Nee, T. W. and Zwanzig, R.,l. Chem. Phys. 52,6353
loss (1970).
Neel, 1. (Ed.), "Nonlinear Behavior of Molecules,
Atoms and Ions in Electric, Magnetic or Electromag-
" 3.6 X 10 12 rrk' netic Fields" (Proc. Soc. Chim. Phys.), Amsterdam,
edc = (14)
w Elsevier Scientific Publishing Company, 1979, pp.
301-353.
The effective specific conductance is given by Onsager, 1.,1. Am. Chem. Soc. 58, 1486 (1936).
Scaife, B. K. P. (Compiler), "Complex Permittivity,"
London, The English Universities Press Ltd., 1971.
Smyth, C. P. , "Dielectric Behavior and Structure,"
New York, McGraw-Hill Book Co .• 1955;Ann. Rev.
Phys. Chem. 17,433-456 (1966),
It is evident from this equation that k' increases Smyth, C. P., "Molecular Interactions" (Ratajczak, H.
with w, approaching a limiting value, koo, the and Orville-Thomas, W. J., Eds.), Vol. 2, Chap. 7,
infinite-frequency conductivity, which is attained John Wiley and Sons, Ltd., London, 1981.
when 1 can be neglected in comparison with
w 2 7 2 , so that Cross-references: CAPACITANCE, DIPOLE MO-
MENTS, POLAR MOLECULES, REFRACTION ,
RELAXATION.
(16)

DIFFERENTIAL EQUATIONS IN PHYSICS


In a heterogeneous material, interfacial polar-
ization may arise from the accumulation of Because of their high frequency of occurrence
charge at the interfaces between phases. This and importance in the physical sciences it is fit-
occurs only when two phases differ considerably ting to introduce some of the more common
from each other in dielectric constant and con- differential equations that arise and to use these
ductivity. It is usually observed only at very low as a basis for discussion and development.
frequencies, but, if one phase has a much higher (i) When a given mass of a radioactive sub-
conductivity than the other, the effect may in- stance disintegrates it is well known that if at
crease the measured dielectric constant and loss any time t the mass remaining is m, then the
at frequencies as high as those of the radio rate of decay of mass, -dm/dt, is proportional
region. This so-called Maxwell-Wagner effect de- to the amount remaining. This implies that
pends on the form and distribution of the phases
as well as upon their real dielectric constants _ dm = km (k = positive constant). (1)
and conductances. Each type of form and dis- dt
tribution requires special treatment. For a com-
mercial rubber, for example, the observed loss (ij) Suppose that an alternating voltage E cos wt
may be is applied to an electrical circuit consisting of a
DIFFERENTIAL EQUATIONS IN PHYSICS 280

resistance R, inductance L, and capacitance C or L(D)y = f(x)


in series connection, t specifying time. If i de-
notes the current flowing in the circuit at time t (5)
and q the charge on the capacitance plate into
which it flows, then iF dq/dt and so the poten- where L(D) =- anDn + an_IDn-1 + ... + aiD +
tial differences across the three components are ao and Dr =- d r /dx r . The coefficients ao ,a I , ..• ,
respectively Ri, Ldi/dt, q/C. Equating their sum an may be either constants or functions of x
to the applied voltage gives only. Note that y and all its derivatives in (5)
occur only to the power unity and that there are
dq d 2q q no products of these quantities. Such an equa-
R dt + L dt 2 +(7 =E cos wt. (2) tion is said to be linear. Suppose that (5) is
satisfied by the particular value y = Y(x) so that
(iii) When heat is conducted along a metal L(D) Y(x) = f(x). Then Y(x) is called a particu-
bar the temperature u(x, t) at time t at a dis- lar integral of (5). Further suppose that the
equation L(D)y = 0 has n linearly independent
tance x from one end is known to satisfy
solutions YI(X),Y2(X),'" ,Yn(X). Then it is
ou = k 02U
easily seen that
(3)
ot ox 2 ' L(D)(A IYI + A 2Y2 + ... + AnYn) = 0,
in the case when there are no radiation losses. where AI, ... ,An are arbitrary constants, and
For homogeneous material and constant cross so Y = A lY 1 + ...... AnYn satisfies the homoge-
section, k is a positive constant. neous equation L(D)y = O. This solution of the
(iv) If Y (x, t) denotes the lateral displacement homogeneous equation is called the comple-
at time t of a point distant x from one of the mentary junction of the nonhomogeneous equa-
fixed ends of a string set in vibration, then it tion (5). The number of constants in it is equal
can be shown that to the order of the differential equation. Since
o2y o2y L(D){Y(x)+A1YI +A 2 Y2 + ... +AnYn}
at2 = c 2 ox 2 ' (4)
= L(D)Y + 0 = f(x),
where c is a constant for a uniform string of Eq. (5) is satisfied by
constant cross section.
The equations (I) through (4) are examples of Y = Y(x) + A IYl (x) + ... + AnYn(x). (6)
differential equations. In each case it is seen
that a relation exists between a quantity or func- This form can be shown to be the most general
tion whose value is sought, i.e., the dependent form of solution to (5) and it is called the com-
variable, one or more independen t variables, plete primitiVe. Thus the task of finding the
and the derivatives of the dependent variables complete primitive to (5) consists of finding the
with respect to the independent ones. Equations complementary function, obtained by taking
(I), (2) are examples of ordinary differential f(x) = 0, and adding to this a particular integral.
equations and they involve only total derivatives For the special case when the coefficients
as there is but one independent variable. Equa- ao, al,' .. ,an in (5) are all constants-and this
tions (3), (4), however, involve partial deriva- assumption will be made from now on-the gen-
tives and are called partial differential equations. eral solution to the equation L(D)y = 0 is ob-
Partial differential equations involve two or tained by making the trial substitution Y = emx •
more independent variables. In both cases the where m is an undetermined constant. We find
order of the differential equation is that of the
highest-order derivative it contains. (I) is of anmn+···+alm+ao=O. (7)
first order, but (2), (3), (4) are all of second
order. Let m =ml,m2,'" ,mn be the n roots of the
auxiliary equation (7). Then L(D)y = 0 has the
In many physical problems which are formu- n solutions em,x, em2x, ... ,emnx and it can be
lated as differential equations auxiliary condi- shown that its most general solution is
tions are imposed upon the dependent variable
and possibly also on certain of its derivatives. Y =A1em,x +A 2 em2x + ... +Anemnx , (8)
These conditions compounded with the partial
differential equation constitute a boundary value where AI, ... ,An are constants and the m's
problem. If, as often happens, such conditions are all distinct. As an example the differential
are prescribed at t = 0, where t is an indepen- equation (I) is a first-order homogeneous equa-
dent variable specifying time, the compound tion, and by putting m =eAt we find A =- k.
problem is called an initial value problem. Thus the general solution is m = A e -kt, involv-
(1) Ordinary Differential Equations We now ing but one arbitrary constant. If it is given that
consider the ordinary differential equation of m = mo when t = 0, we find A = mo and so the
order n solution of this initial value problem is
dny dn-Iy dy m = moe- kt .
an -+an-I - - + ... +al -+a y =f(x) By way of further example, consider the
dxn dxn-I dx 0 second-order homogeneous equation
281 DIFFERENTIAL EQUATIONS IN PHYSICS

d2 y dy and so integrating these,


- + (a + b) - +ab y =0
dx 2 dx
(a, b unequal constants). A(t) = - _1_ sin 2 wt
2W2 '
Putting y = emx gives the auxiliary equation
m 2 + (a + b)m +ab = 0 having roots m = -a, -b.
Hence the general solution is
B(t) = 2~ (t + 2~ sin 2wt).
y = Ae-ax + Be- bx (a =1= b). Thus on substituting these into (10) and sim-
plifying we obtain the particular integral y =
This form is suggested immediately if the differ- (t/2w) sin wt. Hence the complete primitive is
ential equation is written in the operator form
. t.
(D + a)(D + b)y = 0 (a =1= b), Y = A cos wt + B sm wt + 2w sm wt
since y = Ae-ax and y = Be- bx satisfy (D + a)y =
o and (D + b)y = 0 respectively. In the case (A, B constants).
when a = b it can easily be verified that xe-ax As a second method we introduce the Laplace
is a second' solution to (D + a)2 y = 0 and so its transform. This device is eminently suited to
general solution is solving initial value problems and it is conse-
y = Ae-ax + Bxe-ax • quently favored by electrical and control engi-
neers. The Laplace trangorm of f(t) is denoted
For the general solution of (5), it follows that
the main task is the finding of a particular inte-
.c
either by (f(t)} or by f(s) and it is defined to
be

1
gral. Various techniques are available. Here we
describe two methods using suitable examples. 00

The first is called the method of variation of .c{f(t)} = f(s) = e-st f(t) dt.
parameters . Suppose it is required to solve for o
y (t) the differential equation
By elementary integration one can compile the
d2y following useful table of Laplace transforms for
- + w 2y = cos wt. (9)
dt 2 different forms of f(t):
This is the resonance equation and we observe
that when R =0 and w 2 = IjLe, Eq. (2) is of f(t) cosat sin at
this form. The complementary function, found
from the auxiliary equation m 2 + w 2 = 0 is san!
clearly y = ae iwt + be-iwt . This may be more [(s)
s2+a2 s2+a2 sn+l
simply expressed in the form 1. = A cos wt + s s-a
B sin wt using the relations e ±Iwt = cos wt ±
i sin wt the constants A, B being given by A = Here a is constant and n = I, 2, 3, .... Also it
a + b, B= i(a - b). To obtain a particular inte- is easy to show by integration by parts that
gral of (9) take
yet) = A (t) cos wt + B(t) sin wt; (10)
£ { --;tt
df(t)}
=s
_
f(s) - f(O),
then
d 2 f(t)} _ •
y(t) = -wA(t) sin wt + wB(t) cos wt, (11) .c { ~ = S2 f(s) - s f(O) - f(O).
provided A and B are so chosen that
Let us use this method to solve again equation
0= A(t) cos wt + B(t) sin wt. (12) (9) subject to the initial conditions yeO) = 0,
Differentiating y in (11),
y((l) = o.Lety(s) = £ {y(t)}. '!.hen, using the ini-
tial conditions, £{dy jdt)] = s yes), .cfd 2 y/dt 2 } =
ji (t) = -w 2 A(t) cos wt - w 2 B(t) sin wt, (13) s2Y(S). Also the table shows that .t{cos wt} =
S/(S2 + w 2 ). Hence on taking the Laplace trans-
provided A and B are so chosen that form of both sides of (9),

cos wt = -wA(t) sin wt + whet) cos wt. (14)


(s2 + w 2 )y(s) =S/(S2 +W2)
and so
Solving equations (12), (14) for A and h we
find
_l sin wt cos wt, l
J
A(t) = h(t) = cos 2 wt Now
w w
1 s _ e-st cos wt dt,
= - (1 + cos 2wt)
00
S2 + w 2 - 0
2w
DIFFERENTIAL EQUATIONS IN PHYSICS 282

and so differentiating both sides of this partially


with respect to w,
Since, for all t, au/ax = °when x = °and when
J
x =Q,
Fm exp(-m 2 kt)=0,
_ 2ws _ 00 ~ -Sf
m exp (-m 2 kt)(-E sin mQ + F cos mQ) = 0.
(S2+W2)2- aw(e coswt)dt,
o
°
= 1
o
00
(-t e-Sf sin wt) dt.
The first of these equations implies that F =
and the second that mQ = mr, where n is an inte-
ger. Thus a solution satisfying the end condi-
tions is
Dividing through by -2w, we have u(x, t) = E exp (-n 2 rr 2 kt/Q2) cos (nrrx/Q),

(S2 /w2 )2 = i 00 e-Sf (2~ sin wt) dt.


where n is an integer. A more general solution
may be obtained by superposition (as (3) is
linear) in the form
Thus
L
00

u(x, t) = En exp (-n 2 rr 2 kt/Q2) cos (nrrx/Q)


y(s) = .c {2~ sin wt } , n=O

Using the initial condition u(x, 0) = uox, the


and so last form leads to the half-range Fourier series
representation
y(t) = 2~ sin wt.

(2) Partial Differential Equations The sim-


plest methods of solution are based on the tech-
L
00

n=o
En cos (nrrx/Q) = UoX for °< x < Q.
nique of separation of variables and on the use
of an in tegral transform such as the Laplace Determining the coefficients by the usual
transform. These are illustrated by means of method,
examples.
Suppose it is desired to solve the one-dimen-
sional heat conduction equation (3) holding Eo =
1
Q III UoX dx = "2I uoQ,
along a uniform rod of length Q, being given o

° °
that when t = 0, u = UoX and, for all t ~ 0,
au/ax = at x = and x = Q (i.e., both ends are andforn=I,2,3,"',
thermally lagged). Making the trial solution u =
X(x) T(t), we find
X"(x)/X(x) = T'(t)/kT(t).
En = % i Q uox cos (n~x) dx

As the left side of this equation is a function of 2u oQ


= - n 2 rr2 [I - (-l)n].
x only and the right one of t only, each is con-
stant. The physical nature of the problem im-
plies that as t increases, u decreases for any Thus the solution to the boundary value prob-
particular value of x. Hence T decreases as t in- lem is
creases and so T' < 0. Thus we take the con-
stant to be negative, say - m 2, and obtain the
ordinary differential equations u(x, t) ="2I uoQ
{
XII(X) + m 2 X(x) = 0,
T'(t) + m 2 kT(t) = 0.
_ 4u o Q '£ exp {-(2n - 1)2 k7r 2 t/Q2}
rr2 n=l (2n - 1)2

These have the general solutions (2n - l)rrx


. cos Q
X(X) = A cos mx + B sin mx,
{ where t ~ 0 and 0 ~ x ~ Q. It can be shown that
T(t) = C exp (-m 2 kt).
the solution is unique.
Thus, writing E = A C, F = BC, a solution for To illustrate the Laplace transform technique,
u(x,t)is consider the boundary value problem of solving
for f(x, y) the partial differential equation
u(x, t) = XT
a f =f(x,y)
axay
2
= exp (-m 2 kt)(E cos mx + F sin mx).
283 DIFFRACTION BY MATTER AND DIFFRACTION GRATINGS

in the region x ;;;. 0, y ;;;.


boundary conditions
° and subject to the 5. Forsyth, A. R., "Differential Equations," London,
Macmillan, 1921.
6. Ince, E. L., "Ordinary Differential Equations,"
[(x, 0) = 0, New York, Dover, 1956.
{ 7. Piaggio, H. T. H., "Differential Equations," London,
[(0, y) = a (= const.).
Bell, 1942.
The first boundary condition suggests taking the 8. Sneddon, I. N., "Elements of Partial Differential
Laplace transform with respect to y. To this Equations," New York, McGraw-Hill, 1957.
end write

J
Cross-references: CALCULUS IN PHYSICS, FOURIER
ANALYSIS, LAPLACE TRANSFORM, MATHE-
l(x,s) = 00 e-sy [(x,y)dy. MATICAL PHYSICS, MATHEMATICAL PRINCI·
o PLES OF QUANTUM MECHANICS.
Then
£ {a[lay } =s lex, s) - [(x, 0) = s J(x, s),
DIFFRACTION BY MATTER AND
and DIFFRACTION GRATINGS
£{ a 2
axay
[} =~£{a[}=sdl(x,s).
ax ay dx
According to the principle of Huygens (1629-
1695) each point in the space which is touched
by a wave gives rise to a spherical secondary
Thus the partial differential equation transfor wave, which again produces tertiary waves, and
into so on. Every wave interferes with the next one
and quite generally gives rise to diffraction phe-
---ax-
dJ(x, s) 1_
= -; [(x, s)
nomena (see OPTICS, PHYSICAL). Such phe-
nomena in the case of visible light first were ob-
served by F. M. Grimaldi (1618-1663) and
and this ordinary differential equation has the mathematically explained by J. Fresnel (1788-
general solution 1827). G. R. Kirchhoff (1824-1887) gave the
first exact mathematical solution of the scalar
lex,s) =J(o,s) exp (xis). wave differential equation in terms of a bound-
ary integral. If both the primary and the dif-
Since [(0, y) =a,J(O, s) =a/s and so fracted rays are parallel (e.g., small source, large

-
[ex,s) = -a
,v
exp (x)
-
s
distances between diffracting sample and source
and detector), we have the experimental condi-
tions of Fraunhofer (1787-1826). With two
lenses LI and L 2 , a collimator pinhole P in the
Ix x2 x3 ] focal plane of LI , and a photographic plate F in
= a [ -; + S2 + 2! s3 + 3! s4 +. .. . the focal plane of L 2 , this condition is fulfilled
even for short distances (Fig. 1). Monochro-
From the table, £ {yn} =n !/sn+1 for positive j matic light is produced by a Hg lamp with a
tegral n, and so Schott filter S and an aqueous solution of
CUS04' If 80 and 8 are unit vectors in the direc-
x2y2 X3y3 ] tion of the primary and the diffracted beam,
[(x,y) =a [ I +xy + (2!)2 + (3!)2 + .... and A is the wavelength of the source, then the
diffracted intensity f is proportional to
In terms of the modified Bessel function this feb) ~ [e 2 [e 2 IR 12 ; R(b) =F(p) (1)
may also be written as[(x,y) =afo(2.J;Y).
where
FRANK CHORL TON 8 - So
b=-- (2)
A
References
1. Carslaw, H. S., and Jaeger, J. C., "Operational P L, L2 F
Methods in Applied Mathematics," London, Oxford

/~*~
University Press, 1947.
2. Choriton, F., "Boundary Value Problems in Physics
and Engineering," New York, Van Nostrand Rein-
hold,1969.
3. Courant, R., and Hilbert, D., "Methods of Applied
Mathematics," New York, Interscience, 1962. COS04 -F- -F-
4. Churchill, R. V., "Fourier Series and Boundary S ' 2
Value Problems," New York, McGraw-Hill, 1941. FIG. 1. Equipment for Fraunhofer diffraction.
DIFFRACTION BY MATTER AND DIFFRACTION GRATINGS 284

and Fraunhofer used assemblies of N parallel ori-

F= f e- 21Ti (bx) dux (3)


ented metal wires with an intermediate distance
d and the distance a (from center to center), in
the direction S 1 • Hence

d d
is the symbol of the Fourier transform, x is a
vector in physical space, p2 (x) is the transpar- p(xd = {
1 for na - -
2
<
= Xl <
= na +-2 (5)
ency of the object M at the end point of the o for all other Xl
vector x, which lies in the plane of the object,
dux is a surface element of the object, to 2 and and
te 2 are explained in Table I and b is given by
Eq. (2). Figure 2(a) shows an object p(x) in the (6)
form of a parallelogram with the edge vectors
Ll and L2 , and Fig. 2(b) give its Fraunhofer
pattern:
IRI2 = 181 2 ;

_ /\ sin 1T(bL l ) . sin 1T(bL2 )


8(b)-I L l L21 1T(bLd 1T(bL2 ) (4) (7)

TABLE 1. FACTORS te 2 AND fe 2 FOR DIFFERENT RADIATIONS

te 2 fe 2

1 + cos 2 2e
Visible light I/A2 2e scattering angle
2

2 X-rays (~t
moc 2
1 + cos 2 2e
2
e electric charge of an electron
mo rest mass of an electron
moe2~ c velocity of light
3 Electrons 1/sin4 e h Planck's constant
2h2
4 Neutrons Cross section Polarization factor A (de Broglie) wavelength

(a) (b)

FIG. 2. (a) Parallelogram as diaphragm. (b) Fraunhofer pattern . Secondary maxima of the shape factor S2
(Eq. (4».
285 DIFFRACTION BY MATTER AND DIFFRACTION GRATINGS

f is the Fourier transform of a single slit, S that Fig. 3(a) shows a model of a two-dimensional
of the shape of the whole lattice (length Na). ideal periodic "point lattice," and Fig. 3(b) rep-
Z is the "reciprocal" lattice point function (lat- resents its Fourier transform.
tice factor) of the centers of the wire, since
PCb I - 0) is a normalized point function at
b l = O. The symbol of convolution_is defined p(x) =L P(x - Xr); Xr = PI al + P2a2 (10)
for both functions G I (b) and G 2 (b) in Fourier r
space, and gl (x) and g2 (x) in physical space by

g;Cx)g;(x) = f gl (y)g2(X - y)duy (9)


where al and a2 are the vectors of a lattice cell
of the model, and A I and A2 are those of the
In the one-dimensional case of Eq. (8) c is to be "reciprocal" lattice cell (in b-space)
replaced by the scalar quantity CI and dUe by
dc I . Two-dimensional gratings are of high inter- A - a2 1\ a3 . A _ a3 1\ al
est, since their Fraunhofer pattern gives all the I - al (a2 1\ a3)' 2 - a2(a3 1\ al) (I2)
information we need to understand the more
complicated structural theories of three-dimen-
sional matter. This will be shown below. a3 is an arbitrary vector orthogonal to al and
The technique of preparing such models is a2, and PI, P2 , h I and h2 are integers. Since the
lattice of Fig. 3(a) is bounded if one multiplies
quite easy: in the examples of Figs. 4,5,6,7,8, p(x) by the shape function sex) of the lattice
10 and 12, the objects were painted with india (which is I inside the lattice, and zero out of it)
ink on paper 15 X 15 inches and then photo-
graphed on fine-grained films 0.5 X 0.5 inch the p-summation can be extended to infinity.
From the convolution theorem one obtains
with steep gradation characteristics (for instance generally
Peruline film FlO). The black ink points now
become transparent on a black background. In
the models of Figs. 3, 9 and 11, steel balls of 2
to 3 mm diameter were placed into the focal
plane of a lens system and photographed with a Hence the Fourier transform of the bounded
linear reduction 1 : 15 on the same film material lattice p(x) . sex) is given again by the convolu-
mentioned above. tion product of Eq. (6), whereS(b) is the Fourier
By defocusing the system, one obtains dia- transform of sex) and Z(b) is the Fourier trans-
phragms, where the single balls do not touch form of Eq. (11). [2 = 1 applies only for "point-
each other (Figs. 3 and 9, but not Fig. II). This like" atoms, otherwise f is the Fourier trans-
is quite advantageous, since the width of the form of the shape of each "point."
atom-factor [2 [see Eq. (6)] is larger and more M. v. Laue in 1911 prepared an article "Wellen-
reflections are visible. optik" for the "Encyclopedie der Math. Wissen-

(a) (b)
FIG. 3. (a) Steel balls in a crystalline lattice. (b) Bragg reflections with shape factor S2.
DIFFRACTION BY MATTER AND DIFFRACTION GRATINGS 286

schaften" and found that Eq. (11) can be easily


developed to the Fourier transform of three-
dimensional point lattice, where a3 is a third
lattice vector, non-coplanar to a1 and a2' Then
bh in Eq. (II) must be replaced by
bh =h1A1 +h 2 A 2 +h3A3;

A - a1 A. a2 ()
3 - a3(a1 A. a2) 14

Together with W. Friedrich and P. Knipping


using x-rays of a wavelength of the same order
of magnitude as that of the atomic distances, (a)
M. v. Laue (1912) found three-dimensional dif-
fraction effects in single crystals. feb) then is
the atom form amplitude
feb ) =F(Po) (15)
where Po (x) is the electron density distribution
of one atom, whose center lies at x = o.
C. J. Davisson and L. H. Germer (1927) ob-
served the same diffraction phenomena using
electron beams. p(x) must then be understood
as density distribution of both the electrons
(negative) and protons (positive). If single dif-
fraction processes occur, Eq. (6) remains un-
changed.
D. P. Mitchell, P. N. Powers, H. v. Halban and
P. Preiswerk (1936) found the same diffraction
phenomena using thermal neutrons. In this case
p(x) is the density distribution of the nuclei,
each one weighted by the mean of the square
root of its respective cross section. The propor-
tional factors fe 2 , f6 2 of Eq. (1) for the differ-
ent radiations are given in Table 1. (b)
The vector b defined by the integral of Eq. (3)
expands the three-dimensional Fotlrier space FIG. 4. (a) Linear thermal oscillations without cor-
and is connected with the unit vectors s, So of relations. (b) Debye factor and thermodiffuse back-
the diffracted and primary beam by Eq. (2). ground.
This is the construction of Ewald (1914). For a
fixed A and So all values R(b) are in reflection
positions, whiCh lie on a sphere with radius I/A center of an atom being at the distance x from
and the center at b o = so/A. its ideal position and if the atoms oscillate inde-
P. Debye (1915) found that the molecules in pendently from each other, then R 2 is given by
the gaseous state give rise to diffraction patterns
depending on the structure of the single mole-
cules, without any intermolecular interferences. R2(b)=N[2(1-D2)+;f 2D2 Z1S1 2 ;
In 1927, F. Zernike and I. A. Prins discussed
quantitatively diffraction phenomena of liquids D(b) = F(H) (16)
and laid down the fundamentals of the structure
analysis of amorphous matter. Moreover, P. (Ur = volume of a lattice cell). In Fig. 4(b) can
Debye (1913) and I. Waller (1927) found that be clearly recognized the first term of Eq. (16)
real crystals never have a periodic lattice similar as a diffuse background. It has a structure in
to that of Fig. 3(a), since the atoms show ther- the horizontal direction. The "Bragg reflections"
mal oscillations around their ideal positions. All are weakened by the "Debye-Waller factor" D2,
these different phenomena can be studied quite the more, the stronger is the diffuse background
easily with the help of two-dimensional statisti- (1 - D2). In nature there exist correlations be-
cal models and their Fraunhofer patterns, since tween the different oscillations and "elastic
Eq. (I) holds for all diffraction phenomena. In waves" with an "acoustical" and "optical" fre-
Fig. 4(a) we have the "frozen" structure of a quency creep through the lattice.
point lattice with thermal oscillations. They are Fig. 4(a) gives a single undamped longitudinal
quite anisotropic and occur only in the horizon- and sinusoidal wave with a horizontal wave vec-
tal direction. If H(x) is the frequency of the tor bt and a wavelength At and amplitude at
287 DIFFRACTION BY MATTER AND DIFFRACTION GRATINGS

I I In Fig. 6(a) we have again a horizontal wave


At = Tb;I = 8a; !at 1= 4" a (17) with the same wave vector [Eq. (17)], which
now is transverse. From Laue's theory it fol-
According to the theories of M. v. Laue (1927) lows, that the intensity of a Laue spot of the
and Laval (1941) in Fig. 5 (b) at the reciprocal mth order is proportional to the square of the
lattice points Bessel function
b = bh + mbt (18)
"Extra Laue spots" ("Laval spots") occur. Ac- where m is the order of the Bessel function and
cording to conventional theories, m = I is called at is the amplitude vector of the elastic wave.
one-phonon scattering, m = 2 is two-phonon Hence in Fig. 6(b), strong Laval spots occur
scattering and so on; and it is said that the fre- only in the vertical bh direction; in Fig. 5(b), in
quency Vo of the incident radiation is changed the horizontal bh direction. Since I 1(12 ) has its
into Vo +mVt ("inelastic scattering," vt-frequency maximum at bat = 0.25 (0.5), in Figs. 5(b) and
of the phonons). 6(b), the Laval spot m = I at the reflection hi,

:: :.. .. .. ...: .. : ..
::
... : ..
:
:
:: :
..
: : : .. : :: :
.. : :
: : : :: ::
::
::: :
: :: : : :
.. : : :
.. ..... :: .'
: .. : :: : : : :::
..
::: : : ... : : .. ::
.. ..
:: : :
.. : :: ..
: : '. .. , .. :: : : : : ::: :
:: :.. . .. :
:: : :: : : ..
..
:
.. : :: : : ::
: : ..
::
.. .. : : : .. ::
.. .. ::;::
:: : :: ::::
: :: :: : :: : .. : .. : : : :::: : :::: : : :: : :~ : ::::
.. : .... : :: : .. .....
ii!
: : :: :
: : .. : :: : : ...: : '" : :: :
..
: :: : : : ::
: ::::
.. : ::
: : ..
:: :
.. : : : : : : ::
: :: : : : : :: : :
: : : ::
:: : : : ::: .. : ... .. :: : : .. : : .. :: ..
: .. : : : : : ::: :
::: :: : :: :: : .. :
.. :: : : : :: : : .. ..
: :: :: ~:: ~
: :: .. :: : :..... : : : : : : : :: : : ..
.. : :
:: :
-- : : : .. : :: :
:: " ..:..: : : : : : :: :
..
:: : .. : :: .. ....
: : ...
... .. .. ..... : ....
.' ..
.. .. ..
::
: : .. : : ..
:
: ::
...
.. .
." : :
: .. :
:: : .. .. .. .. :

(a) (a)

(b) (b)
FIG. 5. (a) A single longitudinal wave. (b) Extra FIG. 6. (a) A single transversal wave. (b) Extra Laue
Laue spots. spots.
DIFFRACTION BY MATTER AND DIFFRACTION GRATINGS 288

h2 = 1.0 (0.1) is much stronger and in the 2.0 diffuse scattering. According to Debye's theory
(0.2) reflection, much weaker, than the Laval of heat capacity, every wave has a different am-
spot m = 2. plitude at, following a Boltzmann statistic. Then
In nature, one never finds such sharp Laval in certain regions of a single crystal, the atoms
spots, but a completely smooth "thermodiffuse" oscillate statistically with different amplitudes
scattering, and it is said that a "white" spectrum than in others. As a result, the lattice cells ex-
of undamped elastic waves exists. hibit different sizes and para crystalline distor-
In Fig. 7(a) strongly damped transverse waves tions occur.
according to Eq. (17) consisting only of two Close to the melting point, the amplitudes at
maxima and minima are introduced. Now in of the elastic waves become so large that the
Fig. 7(b) quite diffuse spots appear. Hence, in electron clouds of the atoms suffer large defor-
nature, such damped waves could also occur, mations, which damp these waves more and
and if they had a spectral distribution, they more.
could give rise to the same observable thermo- Another physical reason for paracrystalline
distortions below the melting point exists if the
"motives" in the single lattice cells have differ-
:
.. ...
. ...,
.. ..
ent shapes. Such phenomena have been observed
in the "macrolattices" of natural and synthetic
. ... high polymers .
.. :
.. :: : :: ; :
Figure 8(a) shows a para crystalline model,
where both coordination statistics Hk are hori-
:; :: zontalline functions. Figure 8(b) represents its
: : ::
: Fraunhofer pattern and Fig. 8(c) the x-ray small-
: :: :: angle pattern of the {3-keratin of the quill of a
:: :: :::: sea gull (Bear and Rugo (19S1). The cell edge
:
: :: : : a2 in the vertical direction parallel to the fiber
: axis has a constant length of 18SA but statisti-
: cally changes its direction (with respect to the
macroscopic fiber axis) within ±SO, while the
orthogonal edge length al has an average value
:: : of 34A and changes its length statistically within
: ±2.sA. As a result of the van der Waals forces,
which allow a variation of the length al orthog-
:: :: : onal to the fiber axis, the homopolar forces
along a2 have freedom to change their direction
.... of the molecular chain, within one paracrystal.
.. ....
.... . .: This paracrystal itself, therefore, shows a flexi-
ble character in atomic dimensions.
(a) Equation (6) holds again, if one replaces the
crystalline lattice factor Z of Eqs. (7) and (11)
by the paracrystalline lattice factor

Z(b) = -
I
Ur k= 1
n Re 1+
3 Fk(b)
1 _ F (b)
k
(19)

Fk(b) = F(Hk) (20)

Hk is a so-called coordination statistic. Hk(X) is


the "a priori" probability of finding a cell edge
vector ak = x in a certain paracrystalline lattice
cell.
For the same reason, in mixed crystals con-
sisting of atoms of different sizes, paracrystal-
line distortions can also occur.
Figure 3(a) showed a model of steel balls of
the same size, building up a crystalline lattice.
Figure 3(b) represents its Fourier transform. In
Fig. 9(a) steel balls of different sizes built up a
paracrystalline lattice, whose Fraunhofer pat-
(b)
tern [Fig. 9(b)] exhibits characteristic features
of a paracrystalline lattice [cf. Fig. 8(b)].
FIG. 7. (a) Twelve damped transversal waves. (b) In Fig. 10(a) another model of a mixed crys-
Diffuse extra Laue spots. tal is given without paracrystalline lattice distor-
289 DIFFRACTION BY MATTER AND DIFFRACTION GRATINGS

..........
.' .'
. .. .. . . .
: : ,' ,' : " .'

:' :' ,::':':' ,: :' ,: .:


~~ :..::.::..~. ~. :., =='.:'..:
............
.. , ..... .. . .
.... . . ...
........
.. . . ......
...... .. .....
..... ....
.. .. .. ..
. .. .
·· .........
. ......
..........
.....
.
... ... ..... ..
.....
...........
..... ...... .....
.
.. ......
.. ... .....
'
.
..
"
.... ................
, .. . ....
· ..........
. ... ...
.......
........ ..
·· .......
.......
... .. ... .... ··· ................
.. ....
... .... ......
........ .. .. ...
.. ··.. .....
....-......
,

.....
........... ·.::.................
... .... .....
::::::,::::::::::
................. ..... .
...
.. ...
."" . .........
........
..........
.. .. .·. ... .............. .....
.................
...................
(a)

,
'\.

(b)

FIG. 8. (a) Monoparacrystal with linear horizontal


coordination statistics. (b) Its Fraunhofer pattern. The
reflections (0, h 2 ) are crystalline, all others are more or
(c) less diffuse. (c) Small-angle x-ray pattern from f3-
feather·keratin (Bear·Rugo, 1951).

tions. Now the two kinds of atoms are not dis- to draw statistical sequences. Unfortunately,
tributed totally randomly to the lattice points. after he painted a thick atom he tended to
Hence the Fraunhofer pattern of Fig. lO(b) choose a thin one, etc. Unconsciously, he intro-
shows a diffuse background, which is not given duced "cooperative forces."
by Similar diffuse walls are observed in ferroelec-
tric NaN0 2 above the Curie point: In this case,
(21) rows of about six NaN0 2 molecules have paral-
lel oriented polar axes, and have built up micro-
and which presents "diffuse walls" in the verti- ferroelectric domains in a statistically paraelec-
cal direction at hi = ±! of a width 8b 1 - AA 1 . tric matrix (M. Canut and R. Hosemann, 1964).
This means that in the horizontal direction, In Fig. 11 (a), similar to Fig. 9(a), steel balls of
rows of about 8 atoms show a kind of "super- different sizes which now built up a structure of
structure" (Nahordnung, "cooperative order"). single small paracrystallites were used. The
This "superstructure" has a psychological back- Fraunhofer pattern, Fig. ll(b), shows the typi-
ground: The technician, painting the models cal features of a liquid or melt or "amorphous"
row by row in the horizontal direction was solid. Hosemann and Lemm (1964) have proved,
anxious to choose thick and thin atoms quite that in molten gold and lead such para crystals
arbitrarily. However, he did not use a Monte can be found with average diameters of 12A to
Carlo Method, but tried as arbitrarily as possible 40A.
DIFFRACTION BY MATTER AND DIFFRACTION GRATI,NGS 290

....................................... ...•
•••••••••• "I •••••••• , . . . . . . . . . . . . . . . . . . . . . . . . . . . .

e+ ••••• • • • . -"

... ............ ............ ".•.. ".. .............. .


...
I ••••••• " ••• ". _ •• " ••••••• " ••• " ••••••••• "

-.", ...
" " ' '."

....... ........... ........................... .......... .


••• II . . . . . . . . . . . . . . . . . . . . . . "." ••••••••••• ' . ' • • • • • • • • .•. •

•••• I ................... , •••••••••••• "." ••• 'I I · ' . . . . . . .

...... ...•...•.... ..... ........... ........... ... .........


• ' . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . '.1

................................................. "
'. .... .
e . ...... .

.. ' ... ..............................................


... ............. ..... . .... ...... .. .•. ..................
'

'

. . .. .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . e,.· .•
(a) ................................................... .. . ,.
e· e . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

......................................................
II' .'

.... .............................
.........• .....................
,.,.,., : ..................... .
....... ..... ..... .•. ... ... .
.. ..•..•.•.. ......... .................. ....... .
.. ...•... .........
................. ...............
........ ..... ...
.... ......... ........ ..... .
...... . ...... ... ...... .
.............. .•... .. ..................................
...... ...........•. .............. ..... .............. ..... . .
"

· ..................................................... .
...•.....................•......... ....•.......... •...
.. . .. . .. .. -......................................... .
..... ... ........ ......•..... .......... ..•. ........... .
........................................... .....
............ ................ . ..... ...... . . ... .
..........
..................................................
·

..... .............. .•... .................. .•.. ...•.. .•..


·
,

· ............... ............................. .
· ....................................................... .
~

· ............................... e·.· ............... .


..............................................
..... ........................ .......... ......... ..... .
.
· .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . "!' ••••••• • . . . . . . . . . . . .

.. ..•.. .•. ... .. .•. ......• .•..... ..... ....... ..


,

......
'.....................................................
(a)

(b)

FIG. 9. (a) Mixed single paracrystal (steel balls).


(b) Intensity function (Eq. (6) and (19)).

After having completed the step from crystals


to amorphous matter, Fig. 12(a) gives an ex-
ample of a special gas and Fig. 12(b) at larger
b-values, the gas-interferences. Lord Rayleigh
(1842-1919) proved that here phase relations
between the single scattering centers are de-
stroyed in the average as a consequence of their
irregular positions. Since in Fig. 12 only 5200
centers were used, "ghosts" remain in the
Fraunhofer pattern. Besides this fluctuation the
intensity at large b-values is given by the shape
of the single points. Moreover in Fig. 12(a)
every 40 points cluster together. The clusters
are arranged in a crystalline lattice. Hence, at
small angles in Fig. 12(b), "Bragg-reflections"
whose intensity is proportional to the squared
transform Eq. (15) do occur. Po is now a forty-
point function different for each cluster, and in
Eq. (6), f must be replaced by the average 1. (b)
The diffuse background of Fig. 12(b), which
now is the statistical fluctuation of the density FIG. 10. (a) Mixed single crystal with "cooperative
distribution Po is given again by Eq. (21). If we forces." (b) Background with diffuse walls (Nahord-
replace the word "cluster" by zircon atom and nung) .
291 DIFFRACTION BY MATTER AND DIFFRACTION GRATINGS

"point" by electron, Fig. 12(a) gives an instan-


taneous picture of the electron configuration in
a zirconium crystal. The Bragg reflections give
in this case information about Schrodinger's
wave functions for electrons
Po(x) = 1/11/1*
and the fluctuation term of Eq. (21) gives some
information about the structure of a single elec-
tron. In reality, Compton processes disturb the

(a)

(a)

(b)
(b)
FIG. 11. (a) Polymicroparacrystalline assembly of
steel balls. (b) Intensity function of "amorphous" FIG. 12. (a) Zirconium crystal with discrete elec-
matter. trons. (b) Rayleigh scattering of the electrons.
DIFFRACTION BY MATTER AND DIFFRACTION GRATINGS 292

diffraction. There is some hope for further de- p(\l.,t)


tailed studies. C (1; t)

ROLF HOSEMANN

Cross-references: ABERRATION, DIFFRACfION THE-


ORY OF; CRYSTALLOGRAPHY; ELECTRON DIF-
FRACTION; NEUTRON DIFFRACTION; OPTICS,
PHYSICAL; PARACRYSTAL POLYMER PHYSICS;
SCHRODINGER EQUATION ; X-RAY DIFFRACTION.

DIFFUSION IN LIQUIDS
Diffusion, in a macroscopic sense, is a universal FIG. 1. Concentration or probability profIles for
process that leads to the eliminating of con- one-dimensional diffusion for times fl and f2.
centration gradients in gases, solids, or liquids.
At the molecular level, it arises because atoms
or molecules undergo small, essentially random Both equations have the form of the well-
displacive movements as a result of their known Gaussian error curve, which gives the
thermal energy. It is a property having great distribution of random errors to be expected in
fundamental importance, because all useful a large set of measurements. Figure I illustrates
theoretical efforts to understand the dynamical these equations, showing the probability of
behavior of liquids at the molecular level must finding a single particle at a distance from its
lead to agreement with accurate macroscopic origin (or the concentration distribution of a
measurements. It is one of four transport finite quantity of a substance) after times, tl
properties of matter: viscosity, diffusion, ther- and t 2 •
mal conductivity, and (in electrically conduct- Equation (2) is a particular solution of a pair
ing media) electrical conductivity. Each of of more general differential equations known
these transport properties measures the flux as Fick's laws. Consider a plane of unit cross-
of some quantity in a gradient: viscosity, sectional area in a system, across which a con-
the flow of momentum in a velocity gradient; centration gradient, ac/ax, exists. There will
diffusion, the flow of mass in a concentration be a net flux of matter, Jx , through the plane
gradient; thermal conductivity, the flow of heat from the region of higher to lower concentra-
in a temperature gradient; electrical conduc- tion given by Fick's first law:
tivity, the flow of electric charge in a gradient
of electrical potential. Since their formal math-
ematical structures are the same, it is not sur- J = -D ac (3)
prising that there is often a quantitative correla-
x ax
tion among them. In principle, it is possible to determine the co-
The diffusive motions of individual molecules efficient of diffusion on the basis of Eq. (3) by
may be likened to a kind of aimless three- measuring the net quantity of matter that
dimensional random walk. If we limit our crosses through unit area of the plane in unit
attention to the displacement of a single par- time. This proves to be difficult in liquids or
ticle in a given direction from its position at solids, however, and an additional complication
some arbitrary zero time, the probability that is that the concentration gradient aC/ax is not
it will be found at a distance, ±x, from its origin constant but decreases with time.
after a time, t, is given by: For this reason, it is more feasible to measure
the accumulation of matter in a small volume
I element after a measured period of time. We
P(x, t) = 2(rrDt)I/2 exp (-x2/4Dt) (1) may consider such a volume element to be
bounded by parallel planes of unit area, sepa-
In this equation, the parameter D, called the rated from one another by a distance dx, as
diffusion coefficient, is a measure of the average shown in Fig. 2. The rate of change of concen-
rate with which the displacement of the particle tration within the volume element is expressed
occurs. by Fick's second law:
If, instead of attempting to follow the ran-
dom motion of an individual particle, we intro-
duce a large number of particles Co at a point
_~ [Jx - (Jx + ax
dx
aJ . dX)] =dC
dt
=Da C
ax
2
2
within the system, their concentration C(x, t)
will vary with time and distance in a given (4)
direction according to:
This is the fundamental equation, upon which
c all experimental studies of diffusion depend.
C(x, t) = 2(rrD~)1/2 exp (-x 2 /4Dt) (2)
One of the most widely used of the absolute
293 DIFFUSION IN LIQUIDS

current carried by the appropriate ions as they


diffuse through the depleted zone to the

J.
....:...--~---
,
I electrode. The Ilkovic equation gives the rela-
tion between the diffusion current and param-
eters of the system when the electrode is a flow-
I
I ing liquid metal, such as mercury:

-- --
.1----
it=4C31T )1!2 nFDI/2 C(:1T:)2/3 tl/6

ttl'-
(6)
FIG. 2. Diffusive flux across parallel planes.
where n is the number of equivalents involved
in the electrode reaction,
F is the Faraday,
methods for measuring diffusion coefficients of C is the concentration of reducible or
liquids is the "open capillary" technique, de- oxidizable ions in the bulk solution,
vised by Anderson and Saddington. 1 A capil- m is the mass flow rate of metal (mer-
lary tube, usually less than I mm in diameter cury),
and 2-3 cm in length, is filled with an iso- p is the density of metal, and
topically labeled substance and immersed in a D is the ionic diffusion coefficient.
thermostatted bath of the unlabeled liquid. Optical methods 4 are applicable to solutions
Interdiffusion of the labeled and unlabeled that are transparent, and have been used ex-
molecules occurs across the open end of the tensively to measure the diffusion coefficients
capillary, and at the end of the experiment, of polymers, proteins, and other biologically
the average concentration of labeled substance important compounds. They depend upon the
remaining in the capillary is determined by variation of the index of refraction of a solu-
suitable radiochemical or mass spectrometric tion with the concentration of the dissolved
techniques. The relation between the diffusion substance. Schlieren images of a slit or inter-
coefficient and the initial and final concentra- ferometric fringes of a collimated beam of light
tion of labeled substance in the capillary is that is deviated by refraction as it passes
given by a series solution of Fick's second law: through a concentration gradient of the solution
permit measurement of the time-dependent
displacement of the solute.
Eo
C 8 00 I
~ =(1T)2 (2n + 1)2
A completely different method for the mea-
surement of diffusion coefficients in the liquid
state is based upon nuclear magnetic resonance
exp (-(2n + I)21T2Dt/4L2) (5) (NMR). An assembly of atomic nuclei which
have been excited to some non equilibrium spin
In this equation, for which Land t are, respec- distribution will return to thermal equilibrium
tively, the capillary length and time, the series by a mechanism that involves the coupling of
converges rapidly and as a rule may be termi- the nuclear spins with their local molecular
nated after the first or second term with environment. The rate of the return to equi-
negligible error. librium is characterized by a relaxation time
Other methods have been developed for the (T1 , the spin-lattice relaxation time), and is
measurement of diffusion coefficients, usually governed by the variations in the local magnetic
limited to the liquids or solutions of a particular field at nuclei which are induced by the transla-
kind. One of the most accurate, limited to elec- tional motions of neighboring molecules. In
trolytic solutions,2 is based upon the measure- favorable circumstances the spin-lattice relaxa-
ment of the change in the electrical conductance tion time may be obtained from the width of
as a function of time in a direction transverse the NMR resonance line. Bloembergen, Purcell,
to a concentration gradient in the solution. and Pound 2 developed the basic theory which
Such a method leads to the mean diffusion relates the spin-lattice relaxation time to the
coefficient of ion pairs of a I: 1 salt, rather diffusion coefficient of molecules, and made
than the individual ionic diffusion coefficients the first measurements on water and a series of
if the concentration gradient is initially es- hydrocarbons to test its validity. A more direct
tablished between the electrolyte and pure determination of the self-diffusion coefficient
water. of liquids is provided by NMR spin-echo ex-
The diffusion coefficients of electroreducible periments. These techniques employ rf pulses
or electrooxidizable ions in polar solvents mar and either steady or pulsed magnetic field
be determined by the method of polarography gradients to determine the rate of decay of the
or related techniques. Here the principle is induced nuclear spin polarization. Spin echo
the establishment of an ion-depleted zone of measurements of the self-diffusion coefficients
the solution next to the cathode or anode of a of liquid lithium and liquid sodium, as well as
polarographic cell, and the measurement of the of H2 0 and D 2 0, are in excellent agreement
DIFFUSION IN LIQUIDS 294

with the results of isotope labeled diffusion relaxation by spin-rotation interactions. The
experiments. 5 relative magnitudes of TI and T2 distinguish
A great deal of information 6 about the me- between small and large step angular motions,
chanisms of molecular motion is contained in and TJ depends upon the moments of inertia of
the NMR signal and its time evolution. The the molecule.
processes of nuclear spin relaxation involve A comRletely different approach to the study
coupling to magnetic fields and electric field of the dynamics of atom motion in liquids is
gradients (when I ~ 1) both internal and ex- based upon slow neutron scattering. The angu-
ternal to a molecule, and are modulated by lar and energy dependence of the cross section
their thermally driven fluctuations. Various for inelastic scattering of neutrons by the nuclei
theoretical models? have been devised to infer of atoms in a liquid metal, for example, may
the details of translational and rotational mo- be used to deduce the Van Hove space-time
tions of molecules from several characteristic correlation function. s This function contains
correlation times that are observable by spin information about the diffusive motions of
echo NMR methods. (A correlation time is the nuclei, and experiments on liquid metals indi-
average time required for a molecule to lose cate that a transition from the gas-like free
the memory of its orientation in a prepared motion of particles to solid-like diffusive dis-
spin state.) It is often possible, for example, placements occurs on a time scale of 10- 13
to deduce the rates with which molecules of sec.
low symmetry rotate about their axes, to ascer- Studies of the temperature dependence of
tain whether the angular motion is through a the diffusion coefficient of liquids generally
small or large angle, and to characterize such lead to an empirical relationship of the form:
motions by rotational diffusion coefficients or
equivalently, correlation times. D =Do exp (-(Q/RT) (0)
Efforts to relate observable transport proper-
ties of a system of motions at the molecular where Do and Q are experimental parameters
level are based upon the so-called fluctuation that are essentially temperature-independent
dissipation theorem, which links the linear and characterize the diffusion process in the
transport coefficients to time correlation func- system at hand. Because of the exponential
tions of locally fluctuating dynamical variables form of Eq. (0), the mechanism of diffusion in
(e.g., force, position, velocity, momentum). liquids is often assumed to be an activated
For translational self-diffusion, the self-diffusion process, by analogy with other kinetic pro-
coefficient is givenS by the velocity autocor- cesses to which the absolute reaction rate
relation function: theory has been applied with much success.

i
According to this theory, the rate-limiting step
of a kinetic process is determined by the fre-
D= t oo
dt ( v(O) . v(t) (7) quency with which atoms or molecules acquire
sufficient energy through thermal fluctuations
to surmount an energy barrier identified by the
where (v(O) . v(t) is the average of the particle parameter Q. It has not yet proved possible to
velocity over a short time !1t, which is never- make accurate a priori calculations of Do and
theless long compared with the period between Q on the basis of the activated state theory,
"collisions." Similar time correlation functions however, and the exponential form of Eq. (0)
are defined for other transport properties, is no proof that the mechanism of atom trans-
such as viscosity and electrical conductivity, port in liquids is a thermally activated process
and for molecular rotation. The correlation in any simple sense.
times for molecular rotation are: In recent years high speed computers have
made possible the simulation of self-diffusion
experiments by molecular dynamics calcula-
tions. In such experiments the classical equa-
tions of the motion are solved by difference
methods for a set of particles of given mass
(8) and assumed pair interaction potential at cer-
tain temperatures and particle densities. The
and time intervals are on the order of 10- 14 seconds,

1
and D is calculated from the Einstein equation
00 (1(0) . J(t) with the mean square displacements (R2):
TJ = dt -'--'--'--":"":"':'" (9)
o <J(J+ 1) (R2 )au = 6Dt. (1)
where, in Eq. (8), I = I for dielectric relaxation Calculations have been made for (noninteract-
and I = 2 for nuclear spin relaxation by intra- ing) hard spheres, liquid argon,9 liquid sodium
molecular dipolar or quadrupolar interactions, metal,lo and molten sodium chloride,l1 and the
and in Eq. (9), J(t) is the rotational angular agreement is generally very good. They show
momentum operator that governs nuclear clearly that diffusion occurs by small steps of
295 DIFFUSION IN SOLIDS

less than molecular diameter. This rules out the DIFFUSION IN SOLIDS
once favored "hole theory" of transport and
corroborates other evidence that diffusion is a The term "diffusion" refers to the random
highly cooperative process, involving the motion, generally activated by local fluctuations
p~rticipation of many particles. This, together
of thermal energy, of particles through a medi-
wIth other evidence I2 based upon the small um . The particles with which we shall be con-
pressure-dependence of self-diffusion in liquid cerned are atoms and molecules' the medium
mercury, effectively disproves all models that can be various types of solids. Of special interest
appeal to the activated state theory of transport will be crystalline solids- these include all
processes for the elementary diffusive step as metals, most ionic substances, and many cova-
a single particle jump over an intermolecular lent ones-in which the atoms occupy periodic
distance. The concept of an activated state also and well-defined sites.
loses its meaning when the number of particles The migrating particles may themselves be
needed to define it becomes large and ill-de- uniformly distributed constituents of the host
fined. The characterization of the exponential solid; this is called self-diffusion. When the dif-
temperature-dependence of self-diffusion by an fusing system contains chemical inhomo-
"activation energy" should therefore be re- ~eneities or when a foreign substance diffuses
garded as an experimental parameter, and not m from the surface, we speak of chemical or of
as a barrier that a single molecule must sur- impurity diffusion . Diffusion processes are tech-
mount. The same must be said of other pseudo- nologically important in the oxidation and
thermodynamic quantities (enthalpy, entropy, tarnishing of metals, where one reactant must
and free energy) that have been adduced to the migr~te through t.he layer of reaction product,
transport properties of liquids. and m the annealing of deformed or radiation-
damaged materials. Self-diffusion is an essential
NORMAN H. NACHTRIEB step in the photographic process in silver halides
and impurity diffusion is widely used in th~
fabrication of semiconductor devices such as
References transistors. Many metals such as steel and
1. Anderson, J. S., and Saddington, K. J. , J. Chern.
du~alumin, are hardened by solid-state precipi-
Soc. 381 (1949) .
tatI0!1 and reaction, in which diffusion plays a
dommant role. It is also significant in the pow-
2. Harned and Owen "The Physical Chemistry of
Electrolytic Solutions," 2nd Ed., p. 595, Rein- der metallurgy technique of fabrication of parts
from high-melting metals.
hold, New York, 1950.
3. Kolthoff, 1. M., and Lingane, J. J. , "Polarogra- If, in the medium, there are variations in the
conce.ntratio~ of the migrating atoms-perhaps
phy," Vol. I, p. 40, Interscience Publishers, New chemIcally dIfferent atoms or in the case of
York, 1952. self-diffusion, radioactive trace~ isotopes-then
4. Geddes, A. 1., and Pontius, R. B., in "Physical there oc~urs a net. drift of the diffusing species
Methods of Organic Chemistry," Vol. I, Part II, from regIOns of hIgh concentration to those of
pp. 895-1005, Interscience Publishers, New York, lower concentration. This flow takes place even
1960. though each individual atom may migrate com-
5. Murday, J. S., and Cotts, R. M., J. Chern. Phys. pletely at random . It is a statistical result of the
53,4274 (1970) . fact that if there are more atoms per unit
6. Jonas, J., Science, 216, 1179-1184 (1982). volume of, say, A to the left of a given plane
7. Boden, N., in "Nuclear Magnetic Resonance," than to the right, then even with random non-
Vol. 1, p. 121, The Chemical Society, Burlington directed motion, more A atoms will cro~s the
House, London, 1972). plane from the left than from the right. We can
8. Zwanzig, R., Ann. Rev. Phys. Chern. 16, 67-102 define the flux of A as the net excess of A
(1965) . atoms crossing a plane of unit area in unit
9. Rahman, A ., Phys. Rev. 136, A405 (1964). time. Experimentally, this flux is found to
10. Takeuchi, S., Tanaka, M., Fukui, Y. , Watabe, M., depend on the chemical natures of the medium
and Hasegawa, M., in "The Properties of Liquid and the diffusing species. If the medium is
Metals," Proc. 2nd. Int. Conf. Tokyo, Japan isotropic or is a crystal of cubic symmetry, the
(1972), p. 143, Taylor & Francis Ltd. , London, flux is along the direction of the concentration
1973 . gradient. Moreover, if the system is not too
11. Woodcock, 1. V., in "Advances in Molten Salt therI?odynamically nonideal, the flux is pro-
Chemistry," (Braunstein, J. , Mamantov, G., and portional to the concentration gradient; the
Smith , G. P., Eds.), Vol. 3, p. 40, Plenum Press,
constant of proportionality is called the diffu-
New York, 1975.
sion coefficient, D. Thus, we write the flux
12. Nachtrieb, N. H., and Petit, J., J. Chern. Phys.
J =- D dc/dx, where the negative sign indicates
24,746 (1956).
that the net flow is toward the region of lower
concentration. This statement is known as
Cross-references: BROWNIAN MOTION, DIFFUSION Fick's law. If lengths are measured in centi-
IN SOLIDS, KINETIC THEORY, LIQUID STATE, meters and time in seconds, D is in units of
MAGNETIC RESONANCE, RELAXATION. square centimeters per second. It follows from
DIFFUSION IN SOLIDS 296

Fick's law that at any given point in the medi- excessive activation energy that would be re-
um, the concentration of the diffusing entity A quired to push aside the mutual neighbors of the
will change with time at a rate governed by the pair. A dramatic demonstration that diffusion
variation of the flux with distance lac/at must proceed by a mechanism which allows
= a/ax(Dac/ax); for the particularly simple independent motion of individual atoms is the
case where D is independent of distance, as in Kirkendall effect. Two mutually soluble speci-
self-diffusion, then ac/at =Da 2 c/ax 2 ]. Also, mens of differing composition, say A and B, are
from the theory of random flights, it can be welded together with inert markers imbedded
shown that the root-mean-square displacement at the interface. Subsequent interdiffusion of
of atoms resulting from diffusion for a time t A and B results in a drift of the markers relative
increases as the sguare root of the product to the ends of the specimen, indicating that
Dt; Rrms = (6Dt) 112. more atoms have left one side of the couple
It is observed that for any given system, D in- than have entered it from the other. Clearly a
creases rapidly with increasing temperature, pair exchange mechanism cannot be operative
almost invariably following the Arrhenius rela- here.
tion D = Do exp( - H/R D. Here, Do and Hare Extensive evidence is now available that in
positive constants for a given system and R is most cases of self-diffusion or of substitutionally
the universal gas constant. The parameter H is dissolved impurities, migration proceeds as a
called the activation energy and generally in- result of the presence and mobility of vacant
creases as the melting point of the host crystal lattice sites. These vacancies exist in the crystal
increases. Typically, in crystals which melt at in thermodynamic equilibrium, at concentra-
400 to sooce, H is about 20 000 to 2S 000 tions which increase with temperature as exp
cal/mole, or 1 eV/atom, for self-diffusion or (- Hf/RT), where Hr is the energy required to
diffusion of substitutionally dissolved impurities. form a vacancy (about I eV in the noble
In crystals which melt near 1000cC, such as the metals). At temperatures near the melting
noble metals, the activation energy is approxi- point, the fraction of sites vacant is typically
mately 2 eV/atom. The value of the parameter 0.01 to 0.1 per cent. Vacancies move by the
Do is usually in the range 0.01 to 100 cm 2 /sec. jumping of adjacent atoms, at a rate which
It is interesting that for a large number of varies as exp (- Hm/RD. Hm is the activation
metals and simple ionic crystals, the diffusion energy for the migration process. The average
coefficients for self-diffusion and for most im- jump frequency of an atom must then be the
purities lie near to 10-8 cm 2 /sec at tempera- product of the jump frequency of a vacancy
tures approaching the melting point. Thus, after and the fraction of atomic sites that are vacant.
diffusing for one day at such a temperature, the Comparing the temperature dependence of the
value of Rrms is about I mm, rather a large dis- diffusion coefficient with that of these two
tance when compared to the spacing between factors, it follows that H must equal the sum of
atomF in a crystal. Hm and Hf. Quantitative experimental verifi-
Because of the three-dimensional regularity cation of this equality in a number of substances
of atomic positions in a crystalline solid, the has firmly established the vacancy mechanism
unit step in diffusion must be the jump of an for diffusion in such crystals.
atom from one site to a neighboring, crystal- Other mechanisms, however, are also known
lographically equivalent site. Large-scale diffu- to operate. For example, linear or planar
sion is the result of random superposition of defects within the crystal can provide paths for
many such jumps, all of the same length A but easier and more rapid diffusion. Thus, along
distributed among the various jump directions boundaries between crystal grains, on the ex-
allowed by the crystal. It is readily shown that ternal surfaces, and along the linear defects
the relation between the macroscopic diffusion known as crystal dislocations, the atomic regu-
coefficient D and the microscopic atomic jump larity is interrupted and binding energies are
frequency r is D = 1/6A 2 r. This equation may correspondingly decreased. As a result, the
be compared with that given above for Rrms by activation energy for diffusion is locally smaller
noting that for random jumps Rrms = A(rt) 1/2. and the diffusion coefficients are therefore
Now A will depend on the details of the larger; one speaks of a "short-circuiting" effect.
mechanism of diffusion but it must be of the Along dislocations and internal boundaries, the
order of the interatomic spacing, about 3 diffusion activation energy is typically only
X 10-8 cm. Then the typical high-temperature about half that for diffusion within the bulk
diffusion coefficient of 10-8 cm 2 /sec requires crystal, and short-circuiting contributions be-
each atom to make 10 7 to 10 8 jumps each come especially important at temperatures be-
second. low about half of the melting point. Diffusion
In most crystals the atoms are rather densely along external surfaces appears to depend
packed; thus the means whereby such a high strongly on the nature of the ambient atmo-
frequency of jumps can be accomplished is not sphere, and the details of the process are not
obvious. Conceptually, the simplest possibility well understood at present.
is the simultaneous exchange of sites between Another mechanism which is found in some
two atoms, but this is ruled out because of the systems involves the diffusing atom (or ion)
297 DIFFUSION IN SOLIDS

migrating from one interstitial position (i.e., Apart from the technological interest in these
squeezed in between proper atom sites) to fast ion conductors, there is also a great sci-
another. This process is not surprising in those entific challenge in understanding the micro-
cases in which a very small impurity ion is scopic mechanisms and interactions involved.
already dissolved interstitially, such as for In such substances, one sublattice retains the
carbon in iron; its occurrence is perhaps unex- normal crystalline order, but the other one in-
pected, however, in cases where the migrating variably displays an extraordinarily high density
ion is present primarily substitutionally (i.e., in of point defects-often in the range of several
normal atomic sites of the crystal). Thus, in percent-which gives rise to the high mobility
the silver halides, self-diffusion of silver and of the corresponding ionic component. For ex-
the diffusion of several cationic solutes proceed ample, in a-AgI and RbAg4Is, the anion sub-
primarily via the interstices of the lattice; lattice is relatively perfect, while the silver
perhaps the "softness" of the silver halides ion distribution is highly disordered. In many
encourages such a mechanism. Also, in the lead and alkaline earth halides, on the other
loosely packed crystals of germanium and hand, it is the anion sublattice which becomes
silicon, some solutes (such as copper or gold) disordered. In the several beta-alumina-type
diffuse interstitially, although they are dis- materials, blocks of perfectly ordered ions are
solved mainly substitutionally. And more re- separated by parallel planes containing a dis-
cently, it has come to be appreciated that in ordered distribution of mobile, monovalent
many polyvalent metals such as lead and tin, cations.
the noble metals and their divalent neighbors One of the interesting questions posed by
diffuse by some sort of interstitial process. In such materials is the origin, extent, configura-
all such cases the diffusion coefficients at high tion, and temperature-dependence of the dis-
temperatures are much larger, by several factors order in just one of the sublattices. Another
of ten, than would be expected for ordinary problem involves the very low activation
substitutional diffusion by means of vacancies. energies of migration of the mobile ions and
Within the last decade, numerical procedures defects. Still a third point of interest hinges
for calculating the energies of formation and on the nature of cooperative interactions be-
migration of point defects have been developed tween the mobile ions. This is especially fasci-
to such a point that the interpretation of experi- nating, since fast ion conductors are known in
mental data can often proceed with considerable which the mobile ions can move either in all
confidence, especially in the case of diffusion three dimensions (as in a-AgI) , in only two di-
and ionic conductivity in such relatively simple mensions (as in the conducting planes of the
materials as halides and oxides. Thus, experi- beta-aluminas), or along only one-dimensional
mental deviations from the simple Arrhenius channels (as in the mineral hollandite). It seems
relation can be analyzed in terms of the de- likely that these substances may well yield as
pendence on temperature (through the thermal rich a reward in scientific understanding as they
expansion of the lattice) of the parameters Do promise to do in technical applications.
and H, and in terms of the operation of more LAWRENCE SLlFKIN
than one mechanism of atom migration. In
other cases, in which no unambiguous ion trans-
port mechanism could otherwise be identified,
the results of these calculations now serve as
an invaluable guide. The most successful of References
these schemes has been the HADES code de- The fIrst reference describes experimental techniques
veloped at AERE Harwell, in which point de- for determining diffusion coeffJcients; those following
fect parameters are calculated from inter- are recent brief reviews in order of increasing sophisti-
atomic potentials which, in turn, are obtained cation or date of publication. References to more
by fitting the known dielectric and lattice me- detailed discussions are given in these.
chanical properties of the substance. Examples
of the use and versatility of this procedure Tomizuka, C. T., in Lark-Horovitz, K., and Johnson,
may be found in the volume of Journal de V., Eds., "Methods of Experimental Physics," Vol.
Physique cited in the references. 6A, p. 364, New York, Academic Press, 1959.
Another recent development has been the Girifalco, L. A., "Atomic Migration in Crystals," New
rapid growth of interest in those ionic ma- York, Blaisdell, 1964.
terials in which one component displays an un- Shewmon, P. G., "Diffusion in Solids," New York,
usually high mobility , giving rise to diffusivi- McGraw-Hill Book Co., 1963.
ties and ionic conductivities in the crystalline Lazarus, D., "Diffusion in Metals," in Seitz, F., and
solid that are comparable to those typically Turnbull, D., Eds., Solid StatePhys., 10, 71 (1960).
found only in liquids. These substances are Peterson, N. L., "Diffusion in Metals," in Seitz, F.,
referred to as fast ionic conductors, "super- and Turnbull, D., Eds., Solid State Phys., 22, 409
ionic" conductors, or solid electrolytes-this (1968).
last term reflecting their potential application Neumann, G., and Neumann, G. M., "Surface Self-
in electrolytic cells. Diffusion of Metals," in Wohlbier, F. H., Ed., Diffu-
DIFFUSION IN SOLIDS 298

sion Monograph Series, No.1; Solothurn (Switzer- Consider Bernoulli's equation for steady, non-
land), The Diffusion Information Center, 1972. viscous, incompressible flow in a gravity field.
Various chapters in: Crawford, I. H., Jr., and Slifkin, It may be written as
L., Eds. "Point Defects in Solids," New York,
Plenum Press, 1972 and 1973. pip + v2 /2 + gz = const.
Adda, Y., and Philibert, J., "La Diffusion dans les
Solides," Paris, Presses Universitaires de France, where p is the pressure, p the mass density, v
1966. the fluid velocity, g the acceleration of gravity,
Nowick, A. and Burton, I., Eds., "Diffusion in Solids: and z the vertical distance measured from some
Recent Developments," New York, Academic Press, arbitrary level. When mass M, length L, and
1975 . time T are selected as primary quantities, the
Examples of the calculation of atomic jump parameters dimensions of a force can be derived from New-
may be found in "Lattice Defects in Ionic Crystals," ton's second law as that of mass times acceler-
J. Physique, 41 Colloq. No.6 (1980). ation: IF] = MLT-2. Therefore the dimensions
For recent experimental and theoretical research on of pressure, being force per unit area, are Ip] =
fast ion conductors, see Solid State Ionics 5 (1981), ML -I T-2. Further
and other issues of this journal. Ip] =ML-3, Iv] =LT-I, Ig] =LT-2, Iz] =L.
Murch, G. E., "Atomic Diffusion Theory in Highly
Defective Solids," Aedermannsdorf (Switzerland), Ail terms in Bernoulli's equation have the di-
Trans Tech, 1980. mensions L2 T-2, which are those of the square
of a velocity, but also of energy per unit mass.
Cross-references: CRYSTALLIZATION, CRYSTAL· Dimensional homogeneity can be used as a
LOGRAPHY, DIFFUSION IN LIQUIDS, LATTICE check against errors.
DEFECTS, SOLID·STATE PHYSICS, SOLID-STATE Dimensional Analysis Mathematical relations,
THEORY. with which one attempts to describe nature's
order, cannot depend on arbitrary units of
DIMENSIONS measurement (meter, second, etc.). These rela-
tions must be expressible in dimensionless form
When describing natural phenomena, certain by means of dimensionless parameters. Dimen-
physical attributes or characteristics, such as sional analysis, making it possible to find such
mass or length, are distinguished. To these are parameters and to say something about the re-
assigned numerical magnitudes by prescribed lation between them, is useful particularly in
measuring procedures, in which selected scales dealing with complex problems not amenable
are employed. The length of a Foucault pendu- to direct analysis. It provides the rationale for
lum might be measured with a meter stick and all model experiments.
found to equal 64 units of measurement, namely To perform dimensional analysis, one must
64 meters. The length is called a physical have an insight into the quantities which are
quantity. relevant to the problem in question. Typically,
Customarily, certain quantities, such as length one lists these relevant quantities and selects
and time, are chosen to be primary or funda- the primary quantities in which their dimen-
mental. Other quantities, like velocity, can then sions are to be expressed.
be expressed in terms of these. They are termed For example, assume that a homogeneous
secondary or derived quantities. When the meter body of arbitrary shape and having a character-
and the second are taken as units of length and istic dimension Q, having been kept at uniform
time, the magnitude of a velocity can be ex- temperature, is suddenly subjected, over a por-
pressed in meters per second. The exponent of tion of its surface, to a temperature which is
the power of any primary quantity is called the greater by the amount 60. The remainder of the
dimension of the secondary quantity in that surface is to be insulated. What can be said
primary quantity. The dimensions of a secondary about the temperature rise 6 at any point, des-
quantity are then written in terms of the primary ignated by some radius vector r, after a time t
quantities from which they are constituted and has elapsed?
the powers to which they are raised. Thus the Heat conduction being a geometric phenome-
dimensions of a velocity v, here written as Iv 1, non, the heat-absorbing capacity per unit volume
with length L and time T as primary dimen- is judged to be relevant. With the specific heat c
sions, are L T-I . of the material being defined as the heat energy
Most physical statements, such as those relat- which raises the temperature of a unit mass by
ing to the conservation of energy or mass, or to one degree, and designating the mass density by
equilibrium of forces, can be considered as ac- p, the heat capacity per unit volume is cpo An-
counting statements, certifying that there is to other relevant quantity must be the thermal
be neither a gain nor a loss of a certain quantity. conductivity k of the material, defined as the
The terms in the equations, in which such state- heat transmitted per unit time per unit cross
ments are expressed, then must all refer to the section per unit temperature gradient.
same quantity: The equations must be dimen- For this problem heat energy IE], length
sionally homogeneous. IL l, time I Tl, and temperature [6] are con-
299 DIMENSIONS

TABLE 1.

Exponents of Dimensions
Quantities E L T ()

Q
r
()o 1
() 1
cp -3 -1
k -1 -1 -1
1

venient primary quantities. The quantities general significance. The Mach number is the
judged to be significant, with their exponents ratio of local fluid velocity to local velocity of
in terms of these primary dimensions are listed sound. The Reynolds number is vQp/p., where v
in Table 1. The number of dimensionless pa- is a characteristic velocity, Q a characteristic di-
rameters to be obtained is determined by Buck- mension, p the fluid density, and p. the viscosity
ingham's PI theorem. Formal ways to determine of the fluid.
them are given in references I through 5. In this Dimensions and Physical Constants Physical
case, inspection shows that there are three di- constants appear when relations are established
mensionless parameters, which can be written as: among quantities which are already dimen-
sionally connected. For example, Newton's sec-
1Tl =8/80, 1T2 =r/Q, 1T3 =kt/CpQ2 ond law when written as F = mdv/dt fixes the
dimensions of force as [F] =MLT-2. But New-
The result can be stated as: 8/80 = ftr/Q, kt/ ton's law of gravitation also relates a force (that
cpQ2). The temperature rise 8 at any point and of gravitation) with masses and their distance.
at any time is proportional to 80. The time t With ML T dimensions taken to be independent,
needed to attain a temperature rise which is a the gravitational law is written F = Gm 1 m2 /r2
specified fraction of 80 is proportional to the where G is a dimensional gravitational constant.
square of the characteristic dimension Q (the It is possible to write the gravitational law F =
size of the body). The material is fully described m 1 m2 /r2, but then M, L, T can no longer be in-
by the quantity k/cp, known as the thermal dependent. For example, the mass dimensions
diffusivity. could be expressed in terms of Land T: [M] =
Particularly in fluid mechanics and heat trans- L3 T-2. Since physical constants are dimen-
fer, some dimensionless parameters have gained sional, they can be expected to be constant

TABLE 2. SOME QUANTITIES AND THEIR TYPICAL DIMENSIONS

Primary Quantities
Physical Quantity MLT MLT() MLTQ

Velocity LT-l
Accelera tion LT-2
Force MLT-2
Energy, Torque ML2T-2
Momentum MLT-l
Density ML-3
Viscosity ML-lT-l

Heat, Energy ML2T-2


Specific Heat L2T-2()-l
Conductivity MLT-3()-l
Entropy ML2T-2()-l

Electric Current T-lQ


Electric Potential ML2T-2Q-l
Impedance ML2T-lQ-2
Capacitance M-lL-2T2Q2
DIMENSIONS 300

only when the operational methods with which IV of the periodic table (see PERIODIC LAW AND
the fundamental units are defined are properly PERIODIC TABLE). Both Ge and Si are tetrava-
correlated. They then take on the character of lent elements, i.e., they have 4 valence electrons.
conversion factors. The velocity of light, having Elements in their pure state are said to be
the dimensions L T-l, can be universally invari- intrinsic.
ant only when the operational definitions of L Each element under column III of the periodic
and T make it so, as can be done by making use table has 3 valence electrons and is referred to as
of radiation phenomena. trivalent. Examples of trivalent elements include
In thermal problems E, L, T, and 0 can be indium (In) and gallium (Ga). Elements in
selected as primary quantities. But, as [E] = column V of the table have 5 valence electrons
ML2T-2, obviously M, L, T, 0 can also be and are called pentavalent. Arsenic (As) and
chosen where convenient. When physical con- antimony (Sb) are examples of pentavalent ele-
stants are relevant to a particular problem with ments.
the dimensions selected, they must be listed The process of introducing one of the ele-
among the relevant quantities. When the gas ments from column III or V into intrinsic Ge or
law p = pRO applies to the problem, the gas Si is called doping. The doped material becomes
constant R must thus be listed. Alternatively, R impure or extrinsic. If a trivalent impurity is
can be taken to be dimensionless, making [0] = introduced in Ge or Si (trivalent elements have
L 2 T-2. Then 0 would no longer appear as a one less valence electron than Ge or Si) holes
primary quantity. are created and the material is said to be p-type.
In electricity, a variety of primary quantities Introduction of a pentavalent impurity (penta-
can be used, based on the manner in which the valent elements have one more valence electron
electrostatic and electromagnetic laws are stated. than Ge or Si) creates free electrons and the
Often M, L, T, and Q (electric charge) are material is n-type.
employed. Because of thermal effects, free electrons and
holes are always being produced in Ge and Si
REINOUT P. KROON
(intrinsic generation of electron-hole pairs).
Consequently, there will be some electrons in
References the p-type material and some holes in the n-
type material. These carriers are referred to as
1. Buckingham, E., "Dimensional Analysis," Philo- minority carriers. Electrons in n-type material
sophicalMagazine, 48,141 (1924). and holes in p-type material are termed majority
2. Bridgman, P. W., "Dimensional Analysis," Cam- carriers. The most widely used diode is the p-n
bridge, Mass., Harvard University Press, 1931. junction diode. Imagine a single crystal of Ge
3. Huntley, H. E., "Dimensional Analysis," London, (or Si) doped so half the material is p-type and
MacDonald and Co., Ltd., 1958. the other half, n-type. The internal boundary
4. Sedov, L. 1., "Similarity and Dimensional Methods between the two extrinsic regions is a p-n junc-
in Mechanics," New York, Academic Press, 1959. tion, and the resulting device is a junction diode
5. Kline, S. J., "Similitude and Approximation The- (Fig. I). The electrical symbol for the junction
ory," New York, McGraw-Hill, 1965. diode is illustrated in Fig. 2.
6. Kroon, R. P., "Dimensions," I. Franklin [nSf., 292
(July 1971).

Cross-references: CONSTANTS, FUNDAMENTAL;


SYMBOLS, UNITS, AND NOMENCLATURE IN
PHYSICS. p-n
'junction

DIODE (SEMICONDUCTOR)
There exists a class of two-terminal devices
which have the property of permitting current FIG. l.
to flow with practically no resistance in one
direction and offer nearly infinite resistance to
current flowing in the opposite direction. These (p- side)
devices are called diodes.' The applications of
diodes to electronic circuits are numerous. To
mention a few, they include rectification of
alternating current to a unidirectional current,
detection of radio waves, and gating circuits ( n-side)
used in digital computers.
The basic materials utilized for making semi-
conductor diodes are germanium (Ge) and sili-
con (Si). These elements are included in column FIG. 2.
301 DIODE (SEMICONDUCTOR)

What are the characteristics of the p-n junc- I


tion? To answer this question, three possible
conditions are considered. Referring to Fig. 3,
these are: Unbfased Forward biased
(1) Unbiased: p- and n-sides are connected
by a wire. \
(2) Reverse biased: the p-side is connected -E '-Is
o E
to the negative terminal of battery E, and the
n-side connected to the positive terminal.
(3) Forward biased: the p-side goes to the
positive terminal of E, and the n-side to the Reverse biased -1
negative terminal.
Simple energy diagrams for the three condi- FIG. 4.
tions are shown in Fig. 3 for the electron. Simi-
lar diagrams can be generated for holes. When
the diode is unbiased, no net flow of electrons At room temperature (300 K) and E> 0.1 volt:
takes place across the junction. Assuming that
some electrons on the n-side have sufficient en- 1== I s e 39E
ergy to overcome the potential hill, electrons on
the p-side (minority carriers) "slide down" the When E is more negative than 0.1 volt:
hill making the net current flow zero. For the 1==- Is
reverse biased case, the potential hill is raised
and only the few minority carriers from the p- An example of a simple rectifier employing a
side "slide down." This results in a minute re- p-n junction diode is given in Fig. 5. During the
verse saturation current. When the diode is for- positive half-cycle (0° to 180°) of the ac sinu-
ward biased, the potential hill is lowered. This soidal waveform Us, the diode is forward-biased
enables electrons to climb over the hill and cur- and conducts. The voltage UL across load resis-
rent flow occurs. The same considerations apply tor RL is therefore nearly identical to that of Us
to holes. In fact, the total diode current is equal for the positive half-cycle. For the negative half-
to the sum of the electrons and holes flowing cycle (180° to 360°) the diode is reverse-biased
across the junction. and does not conduct. No current flows inRL,
The characteristic curve of a semiconductor and UL = 0 during the negative half-cycle. Be-
diode is shown in Fig. 4. An equation for this cause the diode conducts for only one-half
curve, called the rectifier equation, is expressed cycle, the circuit of Fig. 5 is called a half-wave
as: rectifier. .
1= Is(e-11600E/T - 1) The waveform of UL is only unidirectional.
To obtain steady dc, like that from a battery, a
where filter is required. An example of an elementary
I =diode current, amperes filter is a large-valued capacitor placed across the
Is = reverse saturated current (which is tem- load resistor.
perature dependent), amperes The circuit of Fig. 5 can also be used as a de-
E = diodebiasing voltage (+ E for forward tector of amplitude-modulated (AM) radio
bias;-E for reverse bias), volts waves. Figure 6(a) illustrates the components of
T = absolute temperature (OC + 273°), de- an AM wave. If this is applied to the input of
grees Kelvin. Fig. 5, the wave is rectified and the output ap-

~ I

Unbiased
~
y;~
.E
Reverse biased
~ IE
Forward biased
I
I
I ~ (:-) Minority !
electron I
I
I r (~) ,lnOtri y
M't I

Mojor~ity:
electron
-' e ec ron

e........ :
ee~ \Ener I
gy
I I
I I

FIG. 3.
DIODE (SEMICONDUCTOR) 302

Vs Input

o~ ..

FIG . 5.

n-~nr
(b) (c)

Carrier
frequency
(a)
FIG. 6.

pears as shown in Fig.6(b). Placing a small- Hunter, L. P., "Handbook of Semiconductor Elec-
valued capacitor across RL filters out the carrier tronics," 3rd Ed., New York, McGraw-Hill Book
frequency and the desired modulating signal is Company, 1970.
obtained (Fig. 6(c). Kaufman, M., and Seidman, A. H., "Handbook for
Besides the p-n junction diode, there are a Electronics Engineering Technicians," New York,
number of other diode types which find use in McGraw-Hill Book Company, 1976.
specialized applications. These include the fol- Millman, J., "Microelectronics," New York, McGraw-
lowing diodes: Hill Book Company, 1979.
Gunn : used for the generation of microwave Seidman, A. H., and Waintraub, J. L., "Electronics:
power. Devices, Discrete and Integrated Circuits," Colum-
Hot carrier (Schottky barrier): used for fast bus, Ohio, Charles E. Merrill Publishing Company,
switching of waveforms, such as found in 1977 .
computers. Streetman, B. G., "Solid State Electronic Devices,"
IMP ATT: used for the generation of microwave 2nd Ed., Englewood Cliffs, N.J., Prentice-Hall, Inc.,
power. 1980.
Injection: used for the generation of laser fre-
quencies. Cross-references: ENERGY LEVELS, POTENTIAL,
Light-emitting (LED) : used for alpha-numeric SEMICONDUCTOR, SEMICONDUCTOR DEVICES,
displays. SOLID·STATE PHYSICS, SOLID-STATE THEORY,
Varactor: a reverse-biased junction diode that TRANSISTOR.
behaves like a variable capacitor, as a function
of the applied voltage across the device.
Zener (avalanche): a reverse-biased junction
diode that exhibits a dc voltage which is nearly DIPOLE MOMENTS (ELECTRICAL AND
independent of a specified range of current MAGNETIC)
flowing in the device. The Zener diode finds
wide use as a voltage reference in regulated Uncharged molecules can be classified as non-
power supplies. polar or polar dependently on whether, in the
absence of an electric field , the centers of
ARTHUR H. SEIDMAN gravity of their constituent positive and nega-
tive charges are coincident or not. A body con-
References taining two opposite charges, ±Q, separated by
a distance d, is characterized by an electric
Ghaznavi, C., and Seidman, A. H., "Electronic Circuit dipole moment, Qd = p.; P. is a vector quantity,
Analysis," New York, The Macmillan Company, expressed conveniently in debye (D) units:
1972. 1 debye = 10- 18 esu = 3.33 X 10-30 [coul mJ.
303 DIPOLE MOMENTS (ELECTRICAL AND MAGNETIC)

In the presence of an applied field a nor- ble also to solutes in nonpolar solvents, and by
mally nonpolar molecule becomes dipolar by using alligation formulas, values of TP for a
induction, i.e., by deformation of its electronic dissolved species can be obtained at infinite
and atomic arrangements: m = (ae + aa)E, dilution; such estimates are usually close to,
where the coefficients of proportionality ae but not identical with, the true TP's directly
and aa are the electronic and atomic polariza- observed on the vaporized solutes. Over the
bilities, respectively. In the general case of an past thirty years, much effort has been devoted
anisotropic ally polarizable molecule, ae and aa to theoretical or empirical treatments of "sol-
are tensors, the components of which may be vent effects." Since distortion polarizations are
evaluated from observations of electric bire- almost invariant with temperature, the tempera-
fringence (Kerr effect), the depolarization of ture dependence of TP follows as (TPh =
(Rayleigh) scattered light, refractive index dis- A + BIT; the constants A and B, when fitted
persion, etc. An estimate of the mean of the to experimental data by least squares, give
three principal polarizabilities is given by A = EP + AP and B = 41TNIl 2 /9k, whence 11 =
3R/41TN, where R is a molecular refraction by 0.012812Bo.s esu; results for about 350 gases
the Lorenz-Lorentz formula; when R is extrap- are listed by Marryott and Buckley.
olated to infinite wavelength the mean polar- Practical details concerned with the measure-
izability obtained refers to the electronic defor- ment of dielectric constants, and other proper-
mations alone. Polarizabilities are expressed in ties, necessary for the deduction of Il's of
volume units (cubic centimeters) (N = the solutes or vapors, are described fully in the
Avogadro number). books (cited below) by Le Fevre, Smith, and
A field E exercises a torque on an electric di- Smyth, wherein also references are made to
pole 11, tending to align it in the field direction other, but less simple, techniques by which di-
in opposition to the randomness caused by pole moments can be determined (e.g., Stark
thermal agitation. In a large assembly of mol- splitting in microwave spectra of gases at low
ecules, therefore, a statistical and temperature- pressures, the dielectric losses or power factors
dependent equilibrium is achieved which corre- of dilute solutions, molecular beam studies,
sponds to a slight excess of molecules having etc.); the first two of these are useful since
their permanent dipoles oriented antiparallel they can detect very small moments which the
to the field so that the average moment m of ordinary dielectric constant methods cannot
one molecule is apparently proportional to the reveal accurately; the third technique-involving
field intensity, i.e., an orientation polarizability the deviation undergone by a thin ribbon of
ao is exhibited. gaseous molecules in passing through an in-
The electric dipole moment per unit volume tense nonhomogeneous electric field-is applica-
of a dielectric material is the polarization vector ble to substances, such as metal salts, which
P, understandable in magnitude as the charge through insolubility or low volatility would be
density bound at the electrodes by a polarized otherwise unexaminable.
dielectric. Based on the arguments of Mossotti
(1850) and Clausius (1879), the polarization By the end of 1961, some 7000 dipole mo-
per mole is related to the dielectric constant e ment values for more than 6000 substances had
by M(e - l)/d(e + 2) = 41TNa/3, where Mid is been recorded (see McClellan's Tables); they
the molecular volume, N is the Avogadro num- fall mostly in the range 0 to 5 debyes.
ber, and a is the over-all polarizability. Debye Chemical interest is largely due to the rela-
(1912) showed ao to be 1l 2 /3kT (k = Boltz- tionships between polarity and molecular struc-
mann's constant, T = absolute temperature) so ture. Monatomic molecules, diatomic molecules
that a = ae + aa + ao, and the total polarization of the type AA, and centrosymmetric poly-
per mole TP is the sum of the electronic, atomic molecules, are nonpolar; a linear tri-
atomic, and orientation polarizations: TP = atomic molecule ABA is nonpolar, but if bent
EP + AP + oP. A p'ossible fourth polarization or constructed as AAB it is polar; pyramidal
mechanism, the blocking or trapping of migrat- tetratomic molecules AB3 are polar, etc. A
ing charge carriers in a dielectric, although more quantitative approach supposes that char-
ignored in the classical molecular theory, may acteristic polarities are associated with covalent
also contribute to the apparent € of solid or chemical bonds, e.g., that two bonds, having
macromolecule-containing systems. "bond moments" III and 112, mutua1l1inclined
The commonest method for the determina- at (Jo, produce a resultant of (111 2 + 112 + 2111112
tion of dipole moments involves the dispersion cos (J)O.5. On this basis, bond moments de-
of e: TP is measured at radio and optical wave- duced from the resultant moments of molecules
lengths (the second of these is a molecular with known structures, often permit the dis-
refraction since the square of the index of covery or testing of stereo specifications of fur-
refraction of a nonabsorbing, nonmagnetic ma- ther molecules. However, caution is necessary
terial equals the dielectric constant at the same since bond moments are not independent of
frequency), then approximately R = EP + AP,. bond environments, but may be modified by
and 112 = 9kT(TP - R)/41TN. Although strictly induced moments-determined by the fields of
valid only for gaseous dielectrics the Mossotti- neighboring polar bonds or centers and the
Clausius-De bye equations have proved applica- (anisotropic) polarizabilities of the bonds under
DIPOLE MOMENTS (ELECTRICAL AND MAGNETIC) 304

consideration-or by other internal electronic quantity; X/N by experiment is an average of


effects (c.f. resonance, mesomerism, hybridiza- three principal magnetic susceptibilities directed
tion, etc.). Completely successful calculations along three mutually perpendicular principal
of dipole moments from a priori theory have axes of magnetic susceptibility; these can be in-
yet to be made. vestigated through torsional movements of crys-
Some concepts developed for electrostatic tals in magnetic fields (Krishnan's method) or
fields have magnetic counterparts; thus in place from magnetic birefringence measurements
of polarization P there is magnetization I, the (Cotton-Mouton effects) in conjunction with
magnetic dipole moment per unit volume data for Xmean secured with a Gouy balance.
caused in a material by an externally applied A permanent magnetic dipole will experience
field H; internally the magnetic flux density a torque in a magnetic field. Langevin (1905)
(the magnetic induction) is B = H + 41TI; the showed that the mean moment mof a gaseous
ratio I/H is the volume susceptibility K ordi- molecule in the field direction (provided that H
narily measured. Individual magnetic monopoles is not too large) is m= (mp2/3knH, where mp
are not known to exist in nature, but move- is the actual moment of each molecule; there-
ments and spins of electrons in atoms and mole- fore the molar paramagnetic susceptibility Xp is
cules-if viewed classically as direct currents Nm/H = NQp = N mp2/3kT. In practice Xp is ex-
flowing in closed circuits-can create fields tracted from the observed X by treating this
identical with those expected from magnetic as the algebraic sum of a negative diamagnetic
dipoles having moments dimensionally equiva- susceptibility (estimated from Pascal's con-
lent to products of pole strengths and distances. stants) and a positive paramagnetic susceptibil-
The elementary magnetic moment is the "Bohr ity; thus mp follows as (3kTXp/N)°·s [erg
magneton," 9.273 X 10-21 [erg gauss-I], gauss-I] or as 2.84(TXp)0.s [Bohr magnetons].
assumed to be the magnetic moment of an Molar diamagnetic susceptibilities are indepen-
electron "spinning" on its own axis. Atoms dent of temperature, while molar paramagnetic
may possess orbital moments (due to mechani- susceptibilities in general vary as liT or l/(T-
cal angular movements of electrons) and spin Tc). The small paramagnetisms of alkali metals,
moments (one for each electron). Magnetic mo- Cu, Ag, etc., or of certain salts (e.g., KMn04
ments can be induced or permanent. A unit or K2Cr207), attributable respectively to un-
volume containing v particles each of magne- compensated spins of conduction electrons, or
tizability Qm, subjected to a field H, displays to uncompensated paramagnetisms of complex
a magnetization I = VQmH = vm, where mis the ions, are temperature invariant.
average magnetic dipole moment per particle; Normally any atom or molecule with unpaired
thus K = VQm, and the molar susceptibility electrons shows paramagnetism and possesses a
X = K V = NQm (where V is the molar volume magnetic moment. Magnetic properties can
and N the Avogadro number); Qm can be split therefore provide important information on
into Qi + Qp, to correspond with the contribu- valency states in free radicals, molecules con-
tions to I made by the induced and permanent taining first period elements with unpaired p
moments respectively. electrons, transition elements having unpaired
An electron in an orbit of radius r represents d electrons, lanthanides with unpaired 4f and
a current loop; application of a magnetic field H actinides with unpaired Sf electrons. Theoreti-
perpendicularly to the loop plane will induce a cal expressions exist to calculate paramagnetic
voltage tending to create a field opposing that moments in terms of atomic structures and
applied; the effect will be manifest as an ap- spin and orbital angular momenta of unpaired
parent induced moment antiparallel to H electrons. Simple examples are the ions of
and-by classical calculations-of the value transition metals where, if n is the number of
-e 2r2H/6mc 2 (here e is the electronic charge, m unpaired electrons, mp is approximately pre-
is the electronic mass, c is the velocity of light); dicted as [n(n + 2)] 0.5 B9hr magnetons (for a
hence Qj = - e 2r2 /6mc 2, and for a monatomic full discussion of such relations see Nyholm's
substance with spherical atoms the molar dia- review). Determination of mp thus gives n,
magnetic susceptibility X = -(Ne 2/6mc 2)Lnn2, which is often of value in deciding the three-
where rj2 is the mean value of r2 for the ith dimensional arrangements and bond types in-
electron and the sum is taken over n electrons. volved in molecules, especially those built
around a central metal atom.
The X's observed for the inert gases, the C For most substances X is independent of field
atoms in diamond, the Cl atoms in C1 2, etc. strength, but a few paramagnetic compounds
have agreed with reasonable magnitudes of Lr2. can, below the characteristic temperature Tc
Pascal (1910) showed diamagnetic susceptibility (see above), show "ferromagnetism" due to
to be an "additive-constitutive" property, so spontaneous parallel alignments of spins of
that the X's of polyatomic molecules can be atomic magnets. Materials which are ferromag-
approximately predicted by summing "atom" netic at ordinary temperatures (e.g., soft iron)
and "bond" susceptibilities in numbers and have nonlinear magnetization-field characteris-
kinds appropriate to the molecular structure tics, develop large magnetizations in weak fields,
under consideration. The diamagnetic suscepti- rapidly approach saturation conditions, exhibit
bility of an individual molecule is a tensor hysteresis, etc.; whole domains about 0.01 mm
305 DOPPLER EFFECT

in diameter and magnetically saturated are or observer on one of the newly developed
thought to be undergoing orientation during railroad trains.
such processes. Ferromagnetism-and related In his original article on the special theory of
phenomena such as "antiferromagnetism" and relativity (see RELATIVITY), Einstein! devel-
"ferrimagnetism" -have at present few applica- oped the expression for the Doppler shift of
tions in chemistry; in electronics (e.g., ferrites light waves which was dependent upon the
in antennas and in magnetic tape), they are velocity of the source relative to the observer.
frequently important. From the photon hypothesis for light,Schrod-
inger 2 ,3 obtained the same results. Thus, the
R. J. W. LE FEVRE Doppler effect provides one of the illustrations
of the equivalence of the wave and particle
References descriptions of light.
Debye, P., "Polar Molecules," New York, The Chem- Classroom demonstrations of the Doppler
ical Catalog Co., Inc., 1929. effect for water waves are made in shallow glass-
Hippel, A. R. von, "Dielectrics and Waves," New York, bottom ripple tanks. Instead of giving the vi-
J. Wiley & Sons, Inc., 1954. brating source a constant velocity, one lets
Le Fevre, R. 1. W., "Dipole Moments," Third edition, the sheet of water as medium flow continuously
London, Methuen and Co., Ltd., 1953. by the source.
Le Fevre, R. J. W., "Molecular Refractivity and The circles of Fig. I are snapshots of the
Polarizability" in "Advances in Physical Organic crests of a water wave or compressions in a
Chemistry," London and New York, Academic sound wave observed when the source is moving
Press, 1965. at constant velocity v to the right relative to
Le Fevre, R. J. W., "Polarization and Polarizability in the medium. Points I and 2 are positions of the
Chemistry," Rev. Pure and Applied Chern. (Aust.), source one and two periods after passing O. The
20,67 (1970). largest circular crest originated at 0, the next
Le Fevre, C. G., and Le Fevre, R. J. W., in "Physical at I and the smallest at 2. A crest is about to
Methods of Chemistry," Vol. I, Weissberger, A. and leave point 3 at the time the snapshot is taken.
Rossiter, B., Eds., New York, Wiley-Interscience, If the position P of the observer is a large dis-
1972 (part IIIc, pp. 399-452). tance from the source compared to the distance
McClellan, A. 1., "Tables of Experimental Dipole the source moves in one period, then with good
Moments," San Francisco and London, Freeman approximation we may assume that two succes-
and Co., 1963. sive crests are moving in the same direction as
Maryott, A. A., and Buckley, F., "Table of Dielectric they pass P. If v is the velocity of the source in
Constants and Electric Dipole Moments of Sub- the direction OA, the source moves a distance
stances in the Gaseous State," Natl. Bur. Std. Cire., vT in one period T. In one period, the source
537 (1953). e,
comes closer to P by the amount vT cos where
Nyholm, R. S., Quart. Rev. London, 7, 377 (1953). e is the angie between the direction of the
Selwood, P. W., "Magnetochemistry," New York, velocity of the source and the line from the
Interscience Publishing, 1956.
Smith, J. W., "Electric Dipole Moments," London,
Butterworth's Scientific Publications, 1955.
Smyth, C. P., "Dielectric Behavior and Structure,"
New York, Toronto, London, McGraw-Hill Book
Co., Inc., 1955.
Van Vleck, J. H., "The Theory of Electric and Mag-
netic Susceptibilities," Oxford, Clarendon Press,
1932.

Cross-references: BOND, CHEMICAL; DIELECTRIC


THEORY; FERROMAGNETISM; MAGNETISM. -r-------r------~~~~~~---+---+-A

DOPPLER EFFECT
The wave effect by which astronomers measure
the radial velocities of galaxies, and policemen
determine the speeds of approaching automo-
biles, was in spite of its simplicity not discovered
until the nineteenth century. In 1842, Christian
Doppler predicted that the frequencies of FIG. 1. The source is moving relative to the medium
received waves were dependent on the motion along the line BA. The circles represent crest of the
of the source or observer relative to the prop- wave at an instant. (Andrews, C. 1., "Optics of the
agating medium. His predictions were promptly Electro-Magnetic Spectrum", Englewood Cliffs, N.J.,
checked for sound waves by placing the source Prentice-Hall, Inc., 1960).
DOPPLER EFFECT 306

source to the observer at P. Now Ao is the wave- For an observer at rest Vo = ciAo. Substituting
length and Vo the frequency when the source is for Ao, we obtain
at rest; A is the observed wavelength and v the v v
observed frequency when the source is in -= 1 + - cos () (2)
motion. Because of the motion of the source, Vo c
the wavelength received at P is reduced by when the observer is in motion relative to the
vT cos (). medium.
A = Ao-vT cos () By a postulate of relativity, the velocity of
light is the same relative to all observers. The
theory of relativity yields the frequency
If c is the velocity of the wave, T = Ao/c and
A= Ao[1-(vlc) cos() I, but A = clv and Ao = clvo. v
Therefore, 1 + - cos () 0
v c
v
Vo v
1- - cos ()
(1)
Vo
g 1- -
c2
(3)

c
in which v cos ()o is the component of the
velocity of the source toward the observer. The
when the source is in motion relative to the angle ()o is measured in the source system. If ()
medium. is the angle measured in the observer's system,
In Fig. 2 the source is at rest, but the observer then
at P has a velocity v with respect to the medium. v
--cos ()
The velocity of the wave relative to the observer c
is equal to the vector sum of the velocity of the cos ()o = - - - - (4)
wave relative to the medium and the velocity of v
the medium relative to the observer. In Fig. 2, - cos () -1
c
c is the velocity of the wave and v the velocity
of the observer relative to the medium. Let Ao F~gure 3 is a graphical plot of vivo against vic
be the wavelength, Vo the frequency of the for the radial motion in the three cases we have
source, and v the frequency received by the treated. (1) The linear relation is that for the
moving observer. The radial velocity of the observer in motion relative to the medium that
wave relative to the observer is c + v cos () so propagates sound or other mechanical waves.
that
3.-----,.---~~

VAo=c+vcos()

2~----~--~-~

OL------L----~
-I o
vic
FIG. 3. Graphical plots of the ratio of the observed
frequency to the frequency at the source against the
FIG. 2. The source is at rest at point 0 and the ratio of radial velocity to the velocity of the wave for
observer at point P is moving with velocity v relative three cases: (1) sound waves from a moving source,
to the medium. (Andrews, C. L., "Optics of the Elec- (2) sound waves to a moving receiver, (3) electro-
tro-Magnetic Spectrum," Englewood Cliffs, N.J., Pren- magnetic waves. The circle represents the red shift of
tice-Hall, Inc., 1960). light received from the most distant galaxies observed.
307 DYNAMICS

(2) The other solid curve is for the source of bra ted in miles per hour. The precision of such
sound in motion. (3) The broken curve repre- a speed detector depends upon the frequency
sents the Doppler effect for electromagnetic of the source being so stable that it varies less
waves such as x-rays, light, and radio waves. than the Doppler frequency shift during the
By comparing several spectral lines of elements time that the wave travels from the source to
observed in a star with a laboratory spectrum of the mirror and back. The same phenomena of
the same elements, astronomers use the Doppler beats between the direct wave from the source
effect to measure the radial components of and the wave reflected from a moving mirror
velocity of astronomical bodies toward or away may be observed with light. If one of the mirrors
from the earth. Spectra of the edges of the sun's of Michelson's interferometer is moved at con-
disk are measured to determine the velocities stant speed, the frequency with which dark
toward and away from the earth. The radial bands pass the cross hair is the difference in
velocities of the principal stars of our galaxy frequency of the two waves.
have been recorded. The spectral lines of some The numerator of Eq. (3) contains a term for
of the stars are doublets which periodically come the radial component of velocity. However, the
together and separate again indicating that the second-order term in the denominator is inde-
light comes from two stars revolving about a pendent of direction. Thus, as vic approaches
common center of gravity (see ASTROMETRY). unity, one may expect to detect a tangential
In the expanding universe, the radial velocities Doppler effect. Ives and Stilwe1l 6 have measured
of other galaxies away from our galaxy are pro- the predicted value of the Doppler shift in
portional to their distances from the observer. frequency due to a stream of radiating molecules
Thus, the Doppler red shift provides a means of for which vic was 10- 2 • This experiment was a
determining the dimensions of the observed direct proof of time dilatation (see RELATIVITY)
universe. In 1964, some of the most intense for the transverse case. In order to separate the
radio sources (see RADIO ASTRONOMY) were tangential from the radial effect, Ives and Stil-
located with high precision by observing these well produced a sharply collimated beam of
sources when the moon passed in front of them. molecules. In order that e be precisely 90°,
With this knowledge of position, the same they set a mirror accurately normal to the line
sources were located with a light telescope. 4 of observation and altered the line of observa-
The measured red shift was surprisingly high. tion until nearly the same wavelengths were
The sources were not stars as previously thought given by direct and reflected light.
but the most distant galaxies known. In a At relativistic speeds the transverse Doppler
period of eighteen years the Doppler red shifts effect will be an important consideration in the
were measured for more than a thousand quasi- final determination of the nature and positions
stellar objects. Quasar OQI72 in the Catalogue of the quasars.7
of Quasi-Stellar Objects S had the largest red
shift. If this shift were due solely to the Doppler C. L. ANDREWS
effect, the astronomers had to conclude that
this quasar was moving away from us with 91 % References
of the speed of light. The Doppler frequency
and velocity of this source are indicated by a 1. Einstein, A.,Ann. Physik, 17,891 (1905).
circle on the broken curve of Fig. 3. 2. Schrodinger, E., Physik. Z., 23, 301 (1922).
If a microwave beam is reflected from a 3. Michels, W. C., Am. 1. Phys., 15,449 (1947).
moving microwave mirror, such as a person, 4. Burbage, G., and Burbage, M., "Quasi-Stellar Ob-
an automobile or a man-made satellite, the jects," San Francisco, W. H. Freeman Co., 1967.
image of the primary source may be considered 5. Hewitt, A., and Burbidge, G., "Catalogue of Quasi-
as another source moving with twice the veloc- Stellar Objects," Astrophysical J. Supplement
ity of the mirror. Since the speed is small com- 43, 57, 14-Al (1980); Shipman, H. 1., "Black
pared with the speed of light, the squared terms Holes, Quasars and the Universe," Boston, Hough-
of Eq. (3) may be neglected. Thus ton Mifflin, 1980.
6. rves, H. E., and Stilwell, A. R., J. Opt. Soc. Am.
v = vo (1 + ~v cose) 31,369 (1941).
7. Gordon, K. L,Am. J. Phys. 48,514 (1980).

Direct frequency measurements cannot be made Cross-references: ASTROMETRY, RADAR, RADIO


to enough significant figures to distinguish v ASTRONOMY, RELATIVITY, WAVE MOTION.
from Vo. However, if the two frequencies are
combined they give beats or the difference
frequency
DYNAMICS
2v
LlV = Vo - cos
c
e Introduction Dynamics, a branch of mechan-
ics, is often defined in two basic parts. First,
Since the beat frequency is porportional to the kinematics is the study of the motion of bodies
radial velocity, a frequency meter may be cali- with no consideration of what has caused the
DYNAMICS 308

motion, and second, kinetics is the study which considered in this article. (See FLUID DYNAM-
relates the action of forces on the bodies to the ICS.)
resulting motion. The kinematic analysis of the motion of a
This order of presentation is often utilized in particle may be approached through the estab-
textbooks and other treatises on the subject. lishment of a position VECTOR r, directed from
With various interpretations of the word bodies, the origin of a specified fixed coordinate system
the subject has also been divided into the areas to the point representing the position of the par-
of particle dynamics, rigid body dynamics, and ticle, to give
fluid dynamics. As needs have developed, vari-
ous specializations have been created and new r=xi+yj+zk (I)
theories have been formulated. These special where i, j, and k are unit vectors along the x, y,
areas include mechanical vibrations, flight dy- and z axes, respectively. Differentiating Eq. (I)
namics, space dynamics, gas dynamics, mag- with respect to time yields a velocity equation
netohydrodynamics, dynamic systems,1 and of the form
relativistic dynamics, to mention a few.
The present state of knowledge recognizes all dr dx dy dz
motion as relative, since no fixed reference is v= -= - i + - j + - k (2)
known for finding the absolute motion of any dt dt dt dt
body. The earth, which is frequently used as a
frame of reference is rotating about its own axis where the instantaneous velocity, v, of the par-
and is also revolving about the sun. The solar ticle at any position on its path is the instanta-
system, which consists of the sun and its planets, neous time rate of change of displacement. A
is a minute part of the Milky Way galaxy that is second differentiation with respect to time
known to be revolving in space 2 . Beyond this, yields the acceleration of the particle in the
there is limited knowledge of the nature of mo- form
tion that exists.
In the field of astronomy, measurements are dv d2 r d2 x d2y d2 z
evaluated in a coordinate system that is located a=-=-=-i+-j+-k (3)
dt dt 2 dt 2 dt 2 dt 2
relative to the fixed stars. These stars are located
at such a vast distance from the earth that they
appear as points of light that are almost motion- where the instantaneous acceleration, a, of the
less in space. In this frame of reference, the mo- particle at any position on its path is the in-
tions of celestial bodies are described with ex- stantaneous time rate of change of velocity.
tremely great precision, and the motions of If x, y, and z are scalar functions of time, then
bodies within the solar system can be predicted Eqs. 0), (2), and (3) may be used to trace the
accurately over periods of hundreds of years. path of the particle and determine the velocity
Some applied areas of dynamics, exemplified and acceleration at any instant. These equations
by the space exploration program, also require may be easily adapted to the cases of rectilinear
the degree of extreme accuracy that is possible translation and curvilinear translation of a par-
with a celestial frame of reference. In many ticle in plane motion.
other areas, this extreme accuracy is not essen- The kinematic analysis of a rigid body moving
tial and measurements based upon this frame of in a plane often involves the trace of two points,
reference would be tedious and impractical. In which may be called A and B, that are located
such cases, motion may often be adequately on the body. These points move with the body
described in a coordinate system located relative and remain a fixed distance apart. Two coordi-
to the earth.3 nate systems may be used to define the position
Kinematics A particle is a body having di- of the body in the plane. An X- Y coordinate
mensions that are small, relative to other dimen- system, with origin 0, is a fixed reference, and
sions of the system, so that its motion may be an x-y coordinate system, with origin 0 located
considered equivalent to the motion of a point at A, is attached to the body so that it moves
at its mass center with rotational effects ne- and rotates with the body. In the fixed refer-
glected. Thus, particles may be either small or
large. In the solar system, it would be possible from °
ence system, a position vector R is directed
to point A on the body and a second
position vector p is directed from to point B. °
to assume the earth to be a particle but, in a
terrestrial system, this assumption could be In the moving coordinate system, a vector r is
totally unjustifiable. directed from A to B. An equation relating the
A rigid body is a group of particles having un- position of the two points may be written as
varying external and internal configuration. The p= R +r (4)
size of the rigid body would be appreciable in
comparison to the other dimensions of the Using the I, J, K unit vectors for the X- Y-Z co-
system, so that the rotational effect would have ordinate system and the i, j, k unit vectors for
to be considered. the x-y-z coordinate system, Eq. (4) may be re-
A fluid body is a group of particles with written as
varying external and or internal configuration.
The analysis of this type of system will not be p = XI + YJ + xi + yj (5)
309 DYNAMICS

Differentiating Eq. (5) with respect to time body 2, and VB3/2 is the velocity of point B on
yields the velocity of B in the form body 3 relative to body 2. The term 2W2 X
vB3/2 is often referred to as the CORIOLIS com-
dp dX dY ponent of acceleration.
vB=-=-I+-J Equations 7 and 8 may be adapted to the case
dt dt dt
of rotation of a rigid body about a fixed axis at
dx di dy dj point A by considering point A to be fixed.
+-i+x - + - j +y - (6) Thus, vA and aA are both zero and
dt dt dt dt
vB = vB/A (12)
Noting that dx/dt = dy/dt = 0, Eq. (6) simplifies
to aB = aB/A (13)
(7) For a body rotating about a fixed axis, analy-
sis of the rotational motion yields
where vB and vA are the velocities of points B
and A, respectively, and vB/ A is the velocity of dO
point B relative to point A. -=w (14)
A second differentiation of Eq. (5) with re- dt
spect to time yields the acceleration of B in the dw
form -=a (15)
(8) dt

where aB and aA are the accelerations of points where 0 is the angular displacement in radians,
B and A, respectively, and aB/A is the accelera- w is the angular velocity in radians per second,
tion of point B relative to point A. and a is the angular acceleration in radians per
A second important case of rigid body motion second per second. It should be noted that time
exists when point B is not attached to the same may be expressed in other units.
body as point A but is moving along a con- Kinetics Newton's Laws of Motion The laws
strained path on this body. For the analysis of of Newton are based upon the motion of a parti-
this motion, it is convenient to designate the cle relative to a fixed frame of reference in
fixed reference as body I, the body to which which the particle can be made completely free
point A is attached as body 2, and the body to of all outside influences. Under such a condi-
which point B is attached as body 3. The same tion, the particle at rest will remain at rest and a
general arrangements of coordinate systems are particle in motion will continue to move at a
used but in this case, the fixed X- Y coordinate constant velocity. This ideal frame of reference
system may be considered as attached to body I is often referred to as a Newtonian or as an iner-
while the moving x-y coordinate system is at- tial frame of reference. 4
tached to body 2. In the general case, vector r Since it is not possible to actually establish
within the x-y coordinate system is varying in the Newtonian frame of reference, Newton's
both magnitude and direction. Differentiating laws of motion are used in a celestial or a ter-
the position expression restrial frame of reference. The gyroscopic in-
struments and the stabilized platforms represent
p= R +r (9) attempts to achieve a fixed or stabilized frame
of reference for aircraft or space vehicles.
once with respect to time and simplifying yields In modern terminology, Newton's laws of
motion for a particle may be interpreted as
vB3 = vA2 + vB2/A2 + vB3/2 (10) ( I) A particle tends to remain at rest or con-
tinues to move at a constant velocity if there is
where vB3 is the velocity of point B on body 3, no unbalanced force acting upon it.
vA2 is the velocity of point A on body 2, (2) An unbalanced force acting on a particle
vB2/A2 is the velocity of point B on body 2 rel- will produce a time rate of change of momen-
ative to point A on body 2, and vB3/2 is the tum, d(mv)/dt, which, at any instant, will be
velocity of point B on body 3 relative to body 2. proportional to the force and will be in the
A second differentiation with respect to time same direction as the force.
gives (3) The forces that exist between two con-
tacting particles are equal in magnitude, are op-
aB3 = aA2 + aB2/A2 + aB3/2 + 2W2 X vB3/2 posite in direction, and are collinear.s
(11) The concept of the first law is the fundamen-
tal principle used for the analysis of forces act-
where aB3 is the acceleration of point B on ing on stationary particles and also for the anal-
body 3, aA2 is the acceleration of point A on ysis of forces acting on particles moving with a
body 2, aB2/A2 is the acceleration of point B constant velocity . The concept has been ex-
on body 2 relative to point A on body 2, panded to include rigid bodies and fluid bodies.
aB3/2 is the acceleration of point B on body 3 The second law is the foundation of the anal-
relative to body 2, W2 is the angular velocity of ysis of forces acting on particles moving with
DYNAMICS 310

accelerations and, again it has been extended to celeration of the center of mass of the body;
include rigid bodies which involve rotary mo- and a is the angular acceleration of the body.
tion and to include fluid bodies. Again, with proper choice of units, k 1 and k2
The· third law is fundamental to the force can be made equal to unity.
analysis of interconnecting systems of particles The concepts and equations presented herein
under both static and dynamic conditions. It can be applied for the solution of a wide variety
has also been extended to include simple contact of problems which involve systems of particles,
between any pair of bodies and, with some systems of bodies, or combinations thereof. The
modification, to include any type of interaction usefulness of this approach may be further
between bodies. broadened by the introduction of the closely
Force and Acceleration In general, Newton's related concepts of energy and impulse-
second law is stated as momentum.
It should be noted that, in Newtonian me-
LF = k d(mv) (16)
chanics, the fundamental property of the par-
dt ticle is an unvarying mass and time is absolute.
It should also be noted that when the speed of
where L F is the net unbalanced force on the the particle approaches the speed of light, this
particle; k is a constant of proportionality, con- theory becomes inaccurate in compariosn to a
sistent with the units used, that is determined theory based upon a more exact mathematical
experimentally; m is the mass of the particle; model attained through the application of the
and v is the instantaneous velocity of the par- principles of RELATIVITy.6
ticle. GLENN L. DOWNEY
For a particle that is not shedding or accu-
mulating mass, Newton's second law reduces to
References
LF =kma (17) 1. Cannon, Robert H., Jr., "Dynamics of Physical
where L F, k, and m are as previously defined
Systems," New York, McGraw-Hill, 1967.
2. Robertson, H. P., "The Universe," Sci. Am., 195
and a is the instantaneous acceleration of the (3),73-81 (September 1956).
particle. By the proper choice of units in Eq. 3. Kane, Thomas R., "Dynamics," New York, Holt,
(16) and (17), the constants of proportionality Rinehart and Winston, Inc., 1968.
can be made equal to unity. 4. Goodman, L. E., and Warner, W. H., "Dynamics,"
For a rigid body in plane motion, both trans- Belmont, California, Wadsworth, 1964.
lational and rotational acceleration must be con- 5. Smith, G. M., and Downey, G. L., "Advanced En-
sidered. By extending the concept of Newton's gineering Dynamics," 2nd ed., Scranton, Pa., Inter-
second law, it may be stated that national Textbook, 1968.
6. Synge, J. L., and Griffith, B. A., "Principles of
LF=klma (18) Mechanics," 3rd ed., New York, McGraw-Hill,
1959.
LT=k 2Ja (19) 7. Goldenstein, Herbert, "Oassical Mechanics," 2nd
where L F and LT are the unbalanced force
Ed., Reading, MA, Addison-Wesley, 1980.
8. Shigley, Joseph E., and Uicker, John J., "Theory
and unbalanced torque, respectively, acting on of Machines and Mechanisms," New York, McGraw-
the body; k 1 and k2 are constants of propor- Hill, 1980.
tionally, consistent with the units used, that are
determined experimentally; m is the mass of the
body; J is the mass moment of inertia of the Cross-references: ASTRONAUTICS, CORIOLIS EF-
body about a centroidal axis that is perpendicu- FECT, FLUID DYNAMICS, IMPULSE AND MO-
lar to the plane of the motion; a is the linear ac- MENTUM, MECHANICS, STATISTICS.
E
ELASTICITY* 4
where WM = ('V X s)M is an infinitesimal
Elasticity is the part of mechanics dealing with vector. This means that the neighborhood of
deformations that vanish entirely once the forces point M undergoes an infinitesimal rigid rota-
that have caused them are removed. Most solid tion, without any change in shape or size. Con-
bodies behave elastically for sufficiently small sequently, the deformation is represented by
deformations, and we will be concerned here the symmetric part S, which is called the
with the infinitesimal theory of elasticity. Also strain dyadic.
we will consider only isotropic bodies that is In a Cartesian orthonormal basis, in which
bodies whose elastic properties are the' same i~ 3 3

all directions. we have r =L Xiai, we write s =L Si3i, and


The fundamental quantities in elasticity are i=1 i=1
second-order tensors, or dyadics: the deforma- obtain
tion is represented by the strain dyadic and the
internal forces are represented by the stress
S= L 3i3j S ij
i, j= I
dyadic . The physical constitution of the de-
J..2 [~ ~ Si]. The diagonal
formable body determines the relation between
the strain dyadic and the stress dyadic which where Sij = Sj +
relatio~ is, in the infinitesimal theory, a~sumed aXi aXj
to be lInear and homogeneous. While for aniso- components SII, S22, and S33 are the coeffi-
tropic bodies this relation may involve as much cients of linear extension in the directions 81
as 21 independent constants, in the case of iso- a2, and 83, respectively, while the non diagonai
tropic bodies, the number of elastic constants components SI2 = S21, SI3 = S31, and S23 =
is reduced to two. S32 are called shear strains. For instance, 2S12
Let s(r) be the displacement vector due to is the change in the angle of the dihedron
the deformation, of a particle that b~fore the formed by the planes that before the deforma-
tion were respectively normal to the directions
def?r.mation was ~ituated at point P having r as
po.sl!lOn vec~or with respect to some arbitrary 31 and 32. The shear strains are not essential for
the complete representation of a deformation
ongm. A neighboring point Q, whose position
si~ce they can be made to vanish by expressing
vector was r + dr before the deformation will S III the basis of its principal axes.
suffer a displacement s(r + dr) which will differ .If an infinitesimal element of the body occu-
from s(r) by the quantity
pies the volume d V before the deformation and
ds = dr • 'Vs the volume d V' after, the relative increase of
The hypothesis of small deformations means volume, or volumetric dilatation, is given by
that ds, the change in the displacement vector dV' - dV
~hen we go from P to the neighboring point Q, = SII + S22 + S33 = I S I = 'V •s
IS very small compared to dr, the position vector dV
of Q relative to P. Consequently, the scalar
components of the dyadic 'Vs are all very small Th~ forces applied to a finite deformable body
compared to unity. The geometrical meaning are either body. forces acting on every volume
of the dyadic 'Vs is obtained by separating it element d V and represented by the notation
4
into its symmetric part S = ('Vs + s'V) and its
dVF = dVpK, where F is the force per unit
volume, !' is the force per, unit mass, and p is
antisymmetric part R = -4
1 X ('V X s), where 1 the density, or surface forces acting on every
is the unity dyadic. The antisymmetric part is element dS of the bounding surface and repre-
interpreted as follows: if at some point M the sented by dST, where T is the surface stress or
symmetric part vanishes, then we have for the surface force per unit area. The effect of these
neighborhood of M the relation applied forces is transmitted throughout the
ds = dr' RM = WM X dr body, so that through any surface element in-
side the body, there is a force exerted by the
*See MECHANICAL PROPERTIES OF SOLIDS for matter on one side of the element upon the
a less mathematical introduction to elasticity. matter on the other side. Such forces are called

311
ELASTICITY 312

internal stresses and are defined as follows: let distributed on the end sections; then we have
dS be a surface element completely inside the
body, and let us choose arbitrarily the positive Young's modulus =
sense of the normal n to this surface element; Normal traction force/unit cross sectional area
this defines for dS a positive side, the one con-
tainmg n, and a negative side. Then Tn, the Relative longitudinal extension
stress vector on the positive side of dS is de-
fined as a vector such that dST n is the surface and
force on the positive side of dS-i.e., the re- Relative lateral contraction
sultant of all the forces exerted through dS by Poisson's ratio = - - - - - - - - - - - -
the matter on the positive side of dS upon the Relative longitudinal extension
matter on the negative side. In general there is a
normal component Tn - nn, which is a pressure We can also write
or a traction depending upon whether the sign
of Tn - n is negative or positive, and a tangent T = 2f.l,8 + XIS 11
component n X Tn X n called the shear stress. where J1 = E /2(1 + v) and X = vE /(1 + v)(I - 2v)
The value of stress vector Tn depends upon are Lame's constants. J1 is the rigidity modulus,
the orientation of the normal n, so that we can the only constant necessary when the volumetric
characterize the state of stress at a point by de- dilatation vanishes everywhere.
fining the stress dyadic T through the relation Substituting the preceding relation into the
Tn = n - T equilibrium equations, we transform them into

The mechanical equilibrium conditions applied 2J1V - S + XV 1S 1+ F = 0 inside the body


to an arbitrary volume V, bounded by the closed and
surface S, and completely inside the deformable
body give 2J1n - S + Xn 1S 1= T on the bounding surface.

Iv dVF+ Is dSn-T=O
These vector relations are not sufficient for the
complete determination of the symmetric dyadic
S. To insure that a solution of the above equa-
tions corresponds to a possible displacement
and vector s, we must be able to integrate the relation

Iv dVr X F + 1 dSr X (n - T) =0
S = ~ (Vs + sV)
i.e., from a given expression for S, obtain the
value of s. From the vanishing of the curl of a
By the use of the divergence theorem, the first gradient, it is easily seen that this integrability
condition gives the equation condition, also called the compatibility equa-
tion, is
V-T+F=O
vxSxV=O
at any point inside the body, and the second
condition implies that T is a symmetric dyadic. By elimination of the vector products, we ob-
On the external surface of the body, we have tain the equivalent form
usually to fulfill the boundary condition
VV - S + V - SV - VV 1S 1- V - VS = 0
n- T= T
Using the stress-strain relation and the equi-
where T is the applied external force per unit librium conditions, we obtain the Beltrami-
area. Other boundary conditions can also be Michell form of the compatibility equation:
met, such that the value of the displacement
be prescribed. I v
V- VT + - VV 1T 1= - - V - F1
For infinitesimal deformations, we assume l+v I-v
that the relation between strain and stress is
expressed by Hooke's law: the deformation is - (VF + FV)
proportional to the applied force. For isotropic Finally, by expressing the strain dyadic in
bodies, this linear relation is terms of the displacement vector, we obtain
Navier's form of the equilibrium equations:
1
S=-[(I+v)T-vITI1] J1V - Vs + (X + J1) VV - s + F = 0 inside the body
E
where E is Young's modllius and v is Poisson's and
ratio. These two elastic constants can be de- 'An V - s + 2J1n - Vs + J1n X (V X s) = T
fined by considering the stretching of a cylin-
drical bar by normal traction forces uniformly on the bounding surface.
313 ELECTRICAL POWER GENERATION

Dealing here directly with the displacement ELECTRICAL POWER GENERATION


vector, there is no need of considering the com-
patibility equation. About 98% of the electric power produced in
The propagation equation for elastic distur- this country is by three-phase generators. It is
bances is obtained by adding the inertia force transmitted and distributed this way. Advantages
to the body force. We get then of three-phase generators lie in economy of
apparatus, lower transmission losses inherent
a2 starting torque for polyphase motors' and con-
stant running torque for balanced l~ading. A
J.I.'V' 'Vs + (X + J.I.) 'V 'V • s + pK = P - s
at 2 generator is built with axial slots for armature
coils in a stationary hollow cylindrical iron core
inside the body. called the stator. The windings are placed in the
slots so that when carrying current they produce
The stress-strain relation and the boundary con- a chosen even number of alternate magnetic
ditions are not affected, but we generally have poles. The coils over each magnetic pole are
to take into account initial conditions. grouped in three equal bands to give a three-
The energy density u, or energy per unit phase balanced system of terminal voltages.
volume, is given by An inner rotor has coils which carry direct
current to give the same number of alternate
l I a s as magnetic poles as on the stator. Rotor current
u = -8: T + - p - • -
2 2 at at strength is controlled by a rheostat or voltage
from a dc generator. Voltages are produced in
where the first term is potential, or strain the stator windings by flux cutting as the rotor
energy, and the second term is kinetic energy. magnetic flux sweeps by them, currents flow
The energy flux density vector when the generator terminals are connected to
a three-phase load impedance. The three-phase
as stator line voltages are equal in magnitude and
s = -_. T 120 electrical degrees apart in time sequence.
at So also are the line currents for a balanced
is a vector such that dSn . S gives the quantity three-phase load. Generator voltages are of the
of energy that flows per unit time through the order of 12,000 to 30,000 volts for large
surface element dS in the positive direction of machines.
n, the normal to dS. At any point the energy Generator frequency is the product of the
continuity equation pairs of magnetic poles and the speed in revolu-
tions per second. At 60 Hz (cycles per sec), a
t:-V0-pole generator runs at 3600 rpm and a
au as SIx-pole generator at 1200 rpm. The maximum
-+'V'S-p-'K=O
at at speed of 3600 rpm has been increasingly adopted
even for very large machines because high speed
expresses the conservation of mechanical energy. ~eans decr~ased size and weight for a given
GERARD NADEAU
kIlowatt ratmg and better steam-turbine perfor-
mance. Waterwheels and water turbines show
best characteristics at much lower speeds-
References roughly a range of 100 to 600 rpm. A frequency
Godfrey, D. E. R., "Theoretical Elasticity and Plas- of.?O Hz prevails i~ this country for public
ticity," London, Thames and Hudson Co., 1959. utilIty power generatlOn. Because of weight and
Green, A. E., and Zerna, W., "Theoretical Elasticity," space limitation, 400 Hz is found in the aircraft
New York, Oxford University Press, 1954. industry. Europe is basically on 50 Hz.
Jaunzemis, W., "Continuum Mechanics," New York . In the large central station steam power plants,
Macmillan, 1967. ' smgle generators (units) reach, or go somewhat
Lai, W. M., Rubin, D., and Krempl, E., "Introduction beyond, 1,250,000 kW in rating. Some units
to Continuum Mechanics," New York, Pergamon have ratings up to 1.5 million kilowatts.
Direct-current generators are built with their
Press, 1978. dc magnetic poles in the stator. Armature con-
Nadeau, G., "Introduction to Elasticity," New York, ductors in the rotor have ac voltages induced in
Holt, Rinehart and Winston, Inc., 1964. them as t~ey are rotated; the same principle of
Pearson, C. E., "Theoretical Elasticity," Cambridge, flux cuttmg holds as before. An automatic
Mass., Harvard University Press, 1959. mechanical switching device, called a com-
Sokolnikoff, I. S., "Mathematical Theory of Elas- mutator, is placed on the shaft. It carries fixed
ticity," New York, McGraw-Hill Book Co., Inc.
1956. ' brushes, and with its many insulated copper
bars connected to the armature coils, it inverts
every other alternation of the voltage at the
Cross-references: MECHANICAL PROPERTIES OF two armature terminals. It is the commutator
SOLIDS, POLYMER PHYSICS, VECTOR PHYSICS, that requires the rotor to be the armature so
VISCOELASTICITY. that coils and their switching arrangement
ELECTRICAL POWER GENERATION 314

always move exactly together. Direct-current When the exhaust gas from the MHD genera-
generators are generally limited to several thou- tor is used to heat steam for a conventional
sand kilowatts and their application lies mainly generator, a larger portion of the thermal spec-
in industrial plants. trum is utilized and the system efficiency may
The majority of central station generating be raised from the present 40% to possibly 50
plants in the USA are steam turbine generators. or 55%. Heat for the system may come from
Coal, oil, or gas is used to heat water to around the use of fossil fuel, nuclear reactors, or as
1000°F at pressures of 2000-3500 psi. This expected in the future, fusion reactors.
superheated steam is successively expanded Geothermal production of electric power uses
through high pressure, medium pressure, and natural steam obtained from the earth through
low pressure turbines, all mounted on a com- steam wells and piped to turbines. Italy pro-
mon shaft with the generator. Steam is ex- duces about one-third of a million kW in this
tracted at perhaps five to seven points through manner and New Zealand has slightly less in-
the turbines. This extracted steam is used to stalled capacity of this kind. The only US
reheat the water from the condenser in the installation is about 175,000 kW on the West
boiler feed water heaters. The effect is to "square- Coast. Geothermal power is limited. Tempera-
off" the Rankine cycle and bring the system tures and pressures are low, but there is a lower
efficiency closer to Carnot efficiency. Actual capital investment and absence of fuel cost.
generating plant efficiencies approach 40%. A serious problem lies in elimination of con-
Nuclear power plants utilize the same Rankine tamination in the steam. These systems remain
steam cycle as do fossil fuel plants, but, the in prototype stages.
heat is provided by nuclear fission. Tempera- Probable Future Trends Considerable work
tures and pressures are not as high as in fossil is being done on boiler materials to increase the
fuel plants, and therefore operating efficiencies operating temperatures and pressures of steam
are not quite as high. Nuclear power generation plants. Ceramics look promising. Nuclear fusion
has suffered some growing pains that have research will provide the answers in clean nu-
served to increase the wariness of the populace. clear power. Fluidized bed combustion tech-
Such wariness has slowed the growth of nuclear niques will reduce fossil fuel emissions.
power.
Other methods of generating electricity in- FREDERICK C. BROCKHURST
clude gasoline- and diesel-driven generator sets,
gas turbine-driven generator sets, fuel cells, References
solar cells, wind driven generators, magneto-
hydrodynamic (MHD), generators, and geo- US Department of Energy Reports, "Electric Power
thermal generators. The engine- and gas-driven Supply and Demand for the Contiguous United
generators are normally used as backup sytems States."
or peaking systems for high demand periods. Mablekos, Van E., "Electric Machine Theory for
They are also used for prime power in remote, Power Engineers," New York, Harper & Row Pub-
isolated installations. Fuel cells and solar cells lishers, 1980.
are being used in very small power demand ap-
plications such as microwave relay sites and Cross-references: ELECTRICITY; ENERGY STOR-
spacecraft. Both have yet to demonstrate the AGE, ELECTROCHEMICAL; ENERGY STORAGE,
efficiencies necessary for economic feasibility. THERMAL-MECHANICAL; HALL EFFECT AND
Wind driven generators up to a few hundred RELATED PHENOMENA; MAGNETO-FLUID-
kw are being developed to relieve the burden MECHANICS; NUCLEAR REACTORS; PHOTO-
on fossil fuels. Relability and economic feasi- ELECTRICITY; THERMOELECTRICITY; WORK,
bility have not yet been demonstrated. MHD POWER AND ENERGY.
generators received much attention during the
1970s, but many technological challenges have
not yet been met. MHD units have had .fair
success in Britain when combined with con- ELECTRIC PROPULSION
ventional steam turbine-generator plants to
raise the overall plant efficiency. Electric propulsion is a form of rocket propul-
The limited efficiency of steam turbines im- sion in which electric power, generated on board
posed by the thermodynamic properties of the propelled vehicle, is used to eject propellant
steam has stimulated the development of meth- rearward at high velocity to produce thrust.
ods to convert heat directly into electricity. Electric propulsion systems can be considered
The MHD generator is one in which a thermally to be made up of two major components: (1)
ionized gas is forced at high temperature, pres- the electric power generation system, which
sure, and velocity through a duct situated in a converts power from a basic power source (such
transverse magnetic field. An induced voltage as a nuclear reactor or the sun) into electric
appears in the third mutually perpendicular power, and (2) the thruster, which uses this
direction (the Hall effect), and this voltage may electric power to produce thrust by ejecting the
be tapped by electrodes within the duct (see propellant.
MAGNETOFLUID-MECHANICS). The primary potential advantage of electric
315 ELECTRIC PROPULSION

rockets over chemical rockets (or hypothetical where mp is the mass flow rate of propellant
solid-core nuclear rockets) is that much higher and go is the acceleration of gravity at the
propellant ejection velocities can be attained. earth's surface (9.8 m/sec 2 ) which relates mass
Higher ejection velocities, in accordance with to weight. The relation of thrust, specific im-
Newton's law, produce higher thrust per unit pulse, and propellant ejection velocity is
mass of propellant, so that the total mass of
propellant needed for space missions can be newtons (3)
greatly reduced. The mass of required electric
power generation equipment is appreciable, where Vj is the mean propellant ejection velocity
however, so that some of the saving in propel- (more commonly called effective jet velocity).
lant mass is offset by the mass of the power The first and last terms in Eq. (3) express New-
generation system. The net mass saving possible ton's law that force is equal to the time rate of
using electric propulsion therefore depends change of momentum. The last two terms show
strongly on the performance parameters of the that specific impulse is directly proportional to
system. effective jet velocity.
One of the most important of these perfor- The jet power is the time rate of change of jet
kinetic energy, or
mance parameters is the propulsion-system spe-
cific mass a, which is defined as Pj =t mpUj2 = ! FVj = ! golF newton-m/sec
a= mps ~ (1)
or, in kilowatts,
Pj kW
Pi golF kW (4)
where mps is the total propulsion system mass = 2000
(in kilogram~) and Pj is the jet power produced
(in kilowatts). If this parameter is less than For constant thrust and jet velocity, the total
about 30 kg/kW, electric propulsion systems can propellant mass mp needed for a mission can be
be employed to advantage over nuclear or chem- written [from Eq. (3») as
ical rockets for many unmanned interplanetary
exploration missions. For such missions, typical Ft
required power levels range from about 25 kilo- mp = - kg (5)
gol
watts to several hundred kilowatts, to propel
vehicles having initial mass in earth orbit in the where t is the total propulsion time and Ft is the
range of 1000 to 10 000 kg. total impulse required for the mission. From
If a is less than about 5 kg/kW, mission studies Eq. (1) and (4), the propulsion system mass can
indicate that electric propulsion is superior to be written:
nuclear rockets, with regard to required initial
weight and trip time, for manned expeditions ago IF
to the near planets. l For these missions, power mps = aPj = - - kg (6)
levels of several megawatts would be needed for 2000
vehicle weights (in orbit) of the range of 100 000
to I 000 000 kg. These equations show that, although propel-
In other possible applications, such as provid- lant mass can be reduced indefinitely by increas-
ing small amounts of thrust for attitude control ing the specific impulse [Eq. (5)], the power
or orbit control of satellites, the specific mass is required (and therefore the power-plant mass) is
less important, since the required electric power increased when this is done [Eq. (6)]. It is,
is small compared to that used by the other therefore, desirable to use that value of specific
on-board equipment. impulse for which the sum of the masses of pro-
Most of the mass of an electric propulsion pellant and propulsion system is lowest. This
system resides in the electric power generation optimum specific impulse will yield the least
system; however, the performance of the other total mass for the mission, or the highest pay-
major component, the thruster, is of equal im- load mass for a given total mass. For lunar and
portance in determining the over-all specific interplanetary missions and for specific mass
mass. The most important parameter for the likely to be obtained, calculations show that the
thruster is the efficiency 17 with which the elec- optimum specific impulses range from about
tric power is converted into jet power. If this 1500 to 15000 seconds (corresponding to jet
efficiency is low, the required electric power, velocities of about 15 to 150 km/sec). These
and therefore the power-plant mass, is corre- specific impulses compare with values of about
spondingly high. 450 seconds that are typical for high-energy
Another important parameter for the thruster chemical rockets and about 900 seconds that
(as for all rockets) is the specific impulse I. This may be possible with solid-core nuclear rockets.
parameter is defined as the thrust F produced Another characteristic feature of eleCtric pro-
per unit weight flow of propellant: pulsion systems is the very low thrust generated
in comparison with chemical or nuclear rockets.
F This can be seen from Eq. (6) which can be
1= -.-- sec (2)
mpgo written:
ELECTRIC PROPULSION 316

F 2000 and voltages. Also, to produce useful power


---=-- (7) densities, emitter temperatures must be in the
mpsgo algo 2 range 1500 to 2000 K. Conversion efficiencies
For a specific mass a of 10 kg/kW, and a (heat into electric power) of IS to 30 per cent
specific impulse I of 5000 seconds, Eq. (7) are possible. The remaining 70 to 85 per cent of
yields a thrust-to-weight ratio of about 4 X the thermal power must be radiated into space.
10-4 . This very low value results partly from the The collector electrodes, where this waste heat
higher specific impulse typical of electric rock- appears, must be adequately cooled by a heat-
ets, but mostly from the specific mass, which is transfer fluid that is pumped past the collector
of the order of 1000 or more times higher than to pick up the waste heat and carry it to a radi-
that obtainable with solid-core nuclear rockets ator. In order that the radiator be of adequately
Or chemical rockets. The low thrust-weight ratio low size and weight, it must operate at tempera-
means that electric propulsion systems cannot tures of about 900 K or higher. Analyses for a
be used for launching from planetary surfaces. complete nuclear thermionic system yield spe-
They are best suited for propelling vehicles be- cific masses of the order of 4 to 10 kg/kW, but
tween orbits about the planets or between orbits numerous severe performance, design, and engi-
about the earth and the moon. neering problems remain to be solved before
Because the thrust-weight ratio is so low, elec- such systems can be developed to mission
tric rockets must operate for much longer pe- status 4
riods of time (of the order of 1000 times longer) Still more indirect, in the conversion of en-
than chemical or nuclear rockets to produce the ergy, are the turboelectric systems
same totalimpulse. Typically, for interplanetary
missions to the near and far planets, these re-
quired operating times range from many months
(~~~[ear =: heat ~ mechanical ~ electric)
to several years. The removal of limitations on For these, as well as the thermionic systems, the
jet velocity, therefore, is obtained at the ex- nuclear reactor appears to be a better basic en-
pense of greatly increased propulsion system ergy source than the sun, because it provides a
mass and required operating lifetime. more compact and versatile system, suitable for
Power Generation Systems The need for low operation in shaded regions and at any distance
specific mass dominates the selection of suitable from the sun.
methods for generating electric power for pri- A nuclear turboelectric system for electric
mary propulsion of space vehicles. The require- propulsion (as illustrated in Fig. 1) is basically
ment that power be generated with very little a lightweight adaptation to space conditions of
consumption of mass dictates that either nuclear small ground-based nuclear power stations. s
or solar energy must be used as the basic energy The chief differences result (as for the therm-
source. ionic systems) from the lack of means other
Among the possible methods of converting than radiation to eliminate the waste heat re-
this energy into electric power, the most direct sulting from inescapable conversion inefficien-
are photovoltaic solar cells. Considerable prog- cies. To produce specific mass below 10 kg/kW,
ress has been made in reducing the thickness, the waste-heat radiator must operate at temper-
and hence the weight of photovoltaic solar atures above 900 K, which in turn requires that
cells? eventual achievement of a specific mass the nuclear reactor operate at temperatures in
near 5 kg/kW appears possible. A lightweight excess of 1200 K.
radioisotope cell, in the range of I kg/kW, has The most suitable working fluid, at these tem-
been proposed and analyzed 3 but not yet dem- peratures, is potassium, if a liquid-vapor thermo-
onstrated. This cell is basically a very high- dynamic cycle (Rankine cycle) is used. In a
voltage, low-current device, which converts a single·loop version of this cycle, the liquid
fraction of the kinetic energy of the isotope metal is vaporized in the nuclear reactor; the
decay particles directly into electric power. resulting vapor drives the turbine, which in
Somewhat less direct in energy conversion are turn drives the generator to produce electric
systems that use thermionic cells power. The vapor passes from the turbine
through the radiator, where it is recondensed,
nuclear ~ heat
( solar ~ electriCity) and the liquid is then recirculated through the
~ reactor. A major problem is to develop mate-
rials with adequate corrosion resistance during
A nuclear reactor or solar concentrator is used long periods of high-temperature operation with
to heat a suitable material (such as tungsten) to alkali liquid metals. As illustrated in Fig. I, the
temperatures high enough to produce thermal radiator is the largest and heaviest part of the
emission of electrons. These electrons traverse a system.
gap to a cooled collector electrode, thereby pro- A possible alternative to the turboelectric sys-
ducing electric power at a potential of the order tem is an MHD (magnetohydrodynamic) gener-
of I volt. Many thousands of these thermionic ator, which replaces the turbogenerator with a
cells must be connected in series-parallel com- duct through which a hot, electrically conduct-
binations to achieve the required power levels ing fluid is passed. The duct is embedded in a
317 ELECTRIC PROPULSION

...... Shield
\ _ - Propellant tank

C-49478

FIG. 1. Conceptual design of space vehicle for manned Mars mission. Nuclear turboelectric propulsion system.

strong magnetic field which must be produced tained at the proper voltage difference (usually
by superconducting coils to minimize the power several thousand volts) to produce the desired
consumption. The most suitable fluid seems to ejection velocity (specific impulse). A second
be a noble gas seeded with cesium to make it electron emitter (not shown) is placed adjacent
electrically conducting. 6 When this fluid, hea ted to the ion beam, downstream of the accelerator,
by the nuclear reactor, is forced through the to neutralize both the ion space charge and the
magnetic field, electric currents are induced,. and net current leaving the thruster. Experimental
power can be extracted by electrodes embedded efficiencies in converting electric power into jet
in the duct. Such a system can tolerate higher power range from 60 to 80 per cent at specific
temperatures than turbines, and should be par- impulses in the range 2500 to 9000 seconds.
ticularly attractive for required power levels in Thrusters in sizes up to 150 cm in diameter,
the multi-megawatt range. with jet powers near 180 kW have been success-
Thrusters A large number of methods are fully operated. 7
possible to eject propellant by use of electric Other ion thrusters, using contact ionization
power. These are generally divided into three of cesium atoms on hot tungsten to produce the
categories: (l) electrostatic thrusters, in which ions (rather than electron bombardment), have
atoms (or heavier particles) are electrically achieved somewhat lower performance. In these
charged and then accelerated rearward by means thrusters, cesium vapor is passed through porous
of an electrostatic field; (2) plasma thrusters, in tungsten, which must be heated to about 1500
which the propellant is made into an electrically K to evaporate enough cesium ions from the
conducting gas and accelerated rearward by ap- ionizer surface. The high work function of tung-
plication of electromagnetic forces; (3) electro- sten and the low ionization p(>tential of cesium
thermal thrusters, which use the electric power make these two substances the most promising
to heat the propellant, and then accelerate it for contact ionization thrusters.
rearward by thermal expansion through a noz- Atomic-ion thrusters tend to become less ef-
zle. ficient at low ejection velocities (low specific
Electrostatic thrusters that accelerate atomic impulse), because a certain fixed amount of
ions (ion rockets) have received the most re- energy is needed to ionize the propellant atoms.
search and development attention, as a result of As the ejection velocity decreases, the jet power
early demonstrations of good efficiencies in the approaches the power required for ionization,
range of specific impulses needed for major and the efficiency decreases. A possible way to
space missions. Typical of these ion rockets is increase the efficiency is to increase the mass of
an electron-bombardment thruster (such as that each charged particle so that its kinetic energy,
shown in Fig. 2) which uses mercury vapor as at a given jet velocity, is higher. This approach
propellant. 7 The propellant atoms are ionized leads to use of colloidal particles in place of
by collision with electrons emitted by the cath- atomic ions. Because of the much higher mass
ode and attracted toward the anode. A weak per unit charge, voltages in the hundreds of
axial magnetic field is maintained in the ioniza- kilovolts are needed to prodllce the desired jet
tion chamber to make the electrons spiral velocities.
around on their way to the anode, thereby in- Although the efficiencies attainable with elec-
creasing their path length and their probability trostatic thrusters are high, there remains a limi-
of colliding with propellant atoms. The resulting tation which, although not crucial, is undesir-
positive ions are extracted through a screen grid able, namely, a low thrust (Or power) per unit
by means of an accelerating grid that is main- beam area, due to limitations on ion beam cur-
ELECTRIC PROPULSION 318

,
Magnetic-field coil"",\

CS-25138
C-57341

FIG. 2. Cutaway photograph of electron-bombardment ion thruster. With about I kW of power, this unit pro-
duces a thrust of 0.025 Newtons grams at a specific impulse of 5000 sec.

rent density. These limitations result from two powered hydrogen heater (also called resistojet).
sources: (1) space charge and (2) accelerator The hydrogen heater is limited to specific im-
electrode erosion. The space-charge limited cur- pulses less than about 1000 seconds, because of
rent is determined by the accelerating voltage the limitation on the wall temperature of the
and the distance between accelerator electrode heater. High efficiencies, however, have been
and ion source. The voltage, in turn, is approxi- achieved.1 o The arc jet, which heats the propel-
mately fixed by the desired specific impulse, lant by means of continuous electric discharge
and the accelerator spacing is limited by electri- as the propellant flows by, can achieve some-
cal breakdown and thermal warping. The erosion what higher specific impulses (up to about 2000
limitation appears to be even more restrictive on seconds), but the efficiency is generally less than
thrust per unit area than space charge. As the 50 per cent, due to losses involved in dissocia-
current density is increased, there is greater im- tion and ionization of the propellant atoms, and
pingement of ions on the accelerator electrode. losses to the walls of the arc chamber and noz-
For an accelerator lifetime of the order of 1 zle. Because of the lower specific impulse range,
year, estimates indicate a limit for thrust per electrothermal thrusters are not useful for inter-
unit beam area of about 2 newtons (0.2 kg) per planetary missions, but may be used for more
square meter (about 50 kW 1m2) at a specific limited applications such as satellite orientation
impulse of 5000 seconds. Higher values are al- control and orbit correction.
lowable as specific impulse increases. History and Status The possibility of reduc-
Plasma thrusters, which operate on the prin- ing propellant consumption by ejecting the pro-
ciple of accelerating an electrically conducting pellant electrically at high velocities was recog-
gas (plasma) are not subject to the space charge nized by early space flight and rocket pioneers,
limitation, and require no accelerator grid. Con- such as Goddard ll and Oberth,12 but the prac-
sequently, a higher thrust per unit area with tical feasibility of such propulsion systems was
adequate lifetime may be achievable. A variety not demonstrated . With the advertt of nuclear-
of plasma thruster types have been investigated electric power and large rockets during and after
during the past decade,S but so far the effi- World War II, more interest in electric propul-
ciencies have been much lower than those of ion sion was aroused, and between 1946 and 1956,
thrusters. Sizable effort has been devoted to a number of preliminary analyses of nuclear and
the magnetoplasmadynamic (MPD) arc jet, solar electric systems were published. 13 ,14
which consists of a coaxial discharge between a Comparative studies of the applicability of
central cathode and a surrounding anode in the electric, nuclear, and chemical propulsion to
presence of a magnetic field. 9 future space missions, together with an engi-
Electrothermal thrusters are primarily of two neering study of large electric power systems for
types-the electric-arc jet and the electrically space use were completed in 1957. 15 These and
319 ELECTRIC PROPULSION

similar studies led to the initiation of major re-


search programs in electric propulsion and
power generation at U.S. government and indus-
trial laboratories. Low-thrust trajectory studies
and mission analyses 1, 16,17 have further clarified
the role of electric propulsion in future space
missions.
The use of cesium-tungsten contact ioniza-
tion for electrostatic thrusters was proposed by
Stuhlinger,14 and early experimental work in
the United States, beginning in 1957, was con-
centrated on this approach. 18 - 20 However, the
invention and development of the electron
bombardment ion thruster 21 showed that it
could achieve higher efficiencies with less sensi-
tivity to fabrication techniques and materials.
In 1964, the first successful space flight test of
ion thrusters was accomplished with the launch-
ing of SERT-I (Space Electric Rocket Test 1).22
One cesium-tungsten and one electron bombard-
ment thruster were launched into a 20-minute
ballistic space trajectory. The cesium-tungsten
thruster developed a high-voltage breakdown,
but the electron bombardment thruster operated
successfully. The test demonstrated that ion
beam neutralization in space was no problem,
and that the thrust level was the same as in
ground test vacuum facilities. After further re-
search and development of the electron bom-
bardment thruster, a long-duration satellite or- FIG. 3. Artist's drawing of SERT-II Spacecraft.
bital test (SERT II) was launched in February
1970 (Fig. 3).23 A one-year operating period
was planned. One of the thrusters operated con-
tinuously for five months and the other for
three months before high-voltage short circuits
occurred. Data analysis indicated that ion-beam
erosion of the accelerator grid probably pro-
duced metal chips which shorted across to the
screen grid. A long period of partial shading
of the spacecraft occurred from 1972 through
1978, so that further testing was not possible.
Almost continuous sun exposure again occurred
for several periods from 1979 to May 1981.
Early in 1979, the short circuit of one of the
thrusters was cleared by maneuvering the space-
craft with the attitude control rockets. There-
after, this thruster was operated normally during
sun exposure periods until the mercury propel-
lant was exhausted. Many startup and compo-
nent performance tests were accomplished
during these periods. 24 These remarkably suc-
cessful tests of the long-time survivability of a
solar-electric spacecraft and its thruster system
provided evidence that solar electric propulsion
was feasible for use in planetary missions.
Following the initial tests of SERT II (1971),
further development of thrusters and solar-cell
arrays was carried out by NASA. A Solar Elec-
tric Propulsion System (SEPS) program was
initiated to produce a complete, reliable mis-
sion-ready system. Initial application was to be
a Halley's Comet rendezvous mission in 1985,
but budgetary restraints eliminated this pro-
gram and other missions for SEPS.25 FIG. 4. Artist's representation of SEP (Solar Electric
The performance parameters achieved for a Propulsion System) Spacecraft. 25 KW power level.
ELECTRIC PROPULSION 320

25 killowatt version of SEPS are as follows 11. Lehman, Milton, "This High Man," New York,
(Fig. 4): Farrar, Straus, and Co., 1963.
12. Oberth, H., "Wege zur Raumschiffahrt," Munich
Solar Array Mass: 375 kg and Berlin, Verlag von Oldenbourg, 1929 (re-
Power Processing Subsystem Mass: 375 kg printed by Edwards Bros. Inc., 1945).
Thruster system mass: 312 kg 13. Shepherd, L. R., and Cleaver, A. V., "The Atomic
Thruster system efficiency: 60% Rocket," Pt. I, J. Brit. Interpianet. Soc., 7, 185-
189 (1948); Pt. II, ibid., 7, 234-241 (1948);
Thus the overall specific mass was about 40 Pt. III, ibid., 8,23-37 (1949); Pt. IV, ibid., 8, 59-
kg/kW. Operating the thrusters directly from 70 (1949).
the solar array (to eliminate power processing 14. Stuhlinger, E., "Electrical Propulsion System for
mass) is considered feasible in future systems Space Ships with Nuclear Power Source," J. Astro-
so that specific mass below 30 kg/kW should be nautics, 2 (4), 149-152 (1955); 3 (1), 11-14
readily achievable. (1956); 3 (2),33-36 (1956).
Other research and development programs 15. Moeckel, W. E., Baldwin, L. V., English, R. E.,
with more modest intended applications (atti- Lubarsky, B., and Maslen, S. H., "Satellite and
tude control, station-keeping and orbit control Space Propulsion Systems," NASA TN D-285,
of satellites) have been carried out for many 1960. (Unclassified versions of material presented
years in Europe 26 and Japan,27 as well as the at NACA Flight Propulsion Conference, Novem-
United States. 28 ,29 These programs have studied ber 22, 1957.)
a variety of electric thruster types from pulsed 16. Irving, J. H., and Blum, E. K., "Comparative Per-
plasma ejectors to smaller ion thrusters. In addi- formance of Ballistic and Low-Thrust Vehicles for
tion, research is underway to improve the per- Flight to Mars," Vistas Astron., 2, 191-218
formance of magneto-plasmadynamic (MPD) (1959).
thrusters to make them more competitive with 17. Sauer, C. G., and Melbourne, W. G., "Optimum
ion propulsion. Some research is also continuing Earth-to-Mars Trip Trajectories Using Low-Thrust,
on nuclear-thermionic power generation for Power-Limited Propulsion Systems," Rep. TR 32-
electric propulsion. 29 376, Jet Prop. Lab., C.l.T., 1963.
18. Forrester, A. T., and Spenser, R. C.: "Cesium-Ion
W. E. MOECKEL
Propulsion," Astronautics, 4 (10), 34-35 (October
References 1959).
19. Childs, J. H., "Design of Ion Rockets and Test
1. Moeckel, W. E., "Comparison of Advanced Facilities," Paper 59-103, Inst. Aero. Sci., Inc.,
Propulsion Concepts for Deep Space Explora- 1959.
tion," J. Spacecraft and Rockets, 9 (12), 863-868 20. Brewer, G. R., Etter, 1. R., and Anderson, J. R.,
(December, 1972). "Design and Performance of Small Model Ion
2. Rappaport, P., "Photovoltaic Power," J. Space- Engines," Paper 1125-60, ARS, 1960.
craft and Rockets, 4 (7), 838-841 (July 1967). 21. Kaufman, Harold R., "An Ion Rocket with an
3. Mickelsen, W. R., and Low, C. A., Jr., "Potentials Electric Bombardment Ion Source," NASA TN D-
of Radioisotope Electrostatic Propulsion," Astro- 585, 1961.
nautics Aerospace Eng., 1 (9), 52-57 (October 22. Cybulski, R. J., Shellbauer, D. M., Lovell R. R.,
1963). Domino, E. J., and Kutnik, 1. J., "Results from
4. Becker, R. A., "Thermionic Space Power Systems SERT-I Ion Rocket Flight Test," NASA TN D-
Review," J. Spacecraft and Rockets, 4 (7), 847- 2718, March 1965.
851 (July 1967). 23. Kerslake, W. R., Goldman, R. G., and Neiberding,
5. Zipkin, Morris A., "Alkali-Metal Rankine-Cycle W. C., "SERT-II: Mission, Thruster, and In-Flight
Power Systems for Electric Propulsion," J. Space- Measurements," 1. Spacecraft and Rockets, 8 (3),
craft and Rockets, 4 (7) 852-858 (July 1967). 223-224 (March 1971).
6. Nichols, Lester, D., "Comparison of Brayton and 24. Kerslake, W. R., "SERT II Thrusters-Still Ticking
Rankine Cycle Magnetogasdynamic Space Power After Eleven Years," AIAA paper 81·1539, AIAA/
Generation Systems," NASA TN D-5085, 1969. SAEI ASME 17th Joint Propulsion Conference,
7. Richley, Edward A., and Kerslake, William R., 1981.
"Bombardment Thruster Investigations at the 25. Austin, R. E. and Kesteu, W., "Solar Electric
Lewis Research Center," J. Spacecraft and Rock- Propulsion Systems (SEPS) Program Plans and
ets,6 (3), 289-295 (March 1969). Systems Development," AIAA paper 79-2119,
8. Seikel, George R., "Generation of Thrust-Elec- 14th International Electric Propulsion Conference,
tromagnetic Thrusters," NASA SP-11, 1962, pp. 1979.
171-176. 26. Loeb, H. W., et aI., "European Electric PropUlsion
9. Connolly, D. J., Sovie, R. J., Michels, C. J., and Activities," AIAA Paper 79-2120, 14th Interna-
Burkhart, J. A., "Low Environmental Pressure tional Electric Propulsion Conference, 1979.
MPD Arc Tests," AIAA J., 6 (7), 1271-1276 27. Azuma, H., et al., "Experimental Plan for Electron
(July 1968). Bombardment Ion Thruster on Engineering Test
10. Jack, John R., "NASA Research on Resistance- Satellite III," AIAA paper 81-0662, 15th Interna-
Heated Hydrogen Jets," "Advanced Propulsion tional Electric PropUlsion Conference, 1981.
Concepts," Vol. I, Gordon and Breach Science 28. Vondra, R. T., "U.S. Air Force Programs in Elec-
Pub., Inc., 1963 (pp. 75-89). tric Propulsion," AIAA paper 79-2123, 14th
321 ELECTRICAL DISCHARGES IN GASES

International Electric Propulsion Conference, monatomic gas (He) and a diatomic gas (N 2 ).
1979. Here k = €/€g, where €g is the mean kinetic en-
29. Hudson, W. R., "NASA Electric Propulsion Pro- ergy of a gas molecule at 15°C (0.037 eV). It is
gram," AIAA paper 79-2118, 14th International seen that the mean electron energy greatly ex-
Electric Propulsion Conference, 1979 . ceeds the mean energy of a gas molecule even
when X /Po is small. This is due to the inefficient
Bibliography energy exchange in collisions between electrons
and gas molecules. If the collisions are elastic, it
Angrist, S. W., "Direct Energy Conversion," Boston, is readily shown that f, the mean fractional en-
MA, Allyn and Bacon, Inc., 1965. ergy lost by an electron in a collision, is ~2m/M
Jahn, Robert G., "Physics of Electric Propulsion," where m is electron mass and M is molecular
New York, McGraw-Hill Book Co., 1968. mass. Clearly fel ~ 1. At a given X/Po, € is gen-
Stuhlinger, Ernst, "Ion Propulsion for Space Flight," erally lower in poly atomic than in monatomic
New York, McGraw-Hill Book Co., 1964. gases. Owing to the possibility of inelastic colli-
Journal of Spacecraft and Rockets, Space Power Issue, sions involving vibrational or rotational excita-
Vol. 4, No.7, July, 1967. tion of the molecule, f ~ 2m/M in the former
case. In the latter case, only electronic excita-
Cross-references: ASTRONAUTICS, PHYSICS OF; tion of the atom can occur and this requires
DYNAMICS; FLIGHT PROPULSION FUNDAMEN- much higher energies in general.
TALS; IMPULSE AND MOMENTUM; MAGNETO- In addition to their random motion , the elec-
MECHANICS; PHOTOELECTRICITY ; PLASMA. trons must obviously possess a superimposed
drift motion in the direction of the applied field.
Figure 2 shows the variation of the drift velocity
We with X/Po; normally We is small compared
to the mean random speed of the electrons.
ELECTRICAL DISCHARGES IN GASES Ionization by Electron Collision When the
Motion of Slow Electrons in Gases Suppose energy of an electron exceeds a certain critical
that a swarm of electrons traverses a gas in value €j, ionization can occur at a collision with
which a uniform electric field X exists. In gen- a gas molecule. As X/Po, and hence €, is in-
eral the distribution of energy among the elec- creased, an increasing fraction of electrons in
trons will depend on the distance x which they the swarm will have energies exceeding €j. The
have traveled in the field . However, provided x size of the electron swarm will then increase
is sufficiently large, the energy distribution at- with the distance x traveled in the field direction.
tains a steady value independent of x . In this This growth is most conveniently studied un-
steady state, the average rate of supply of energy der conditions where € is kept constant. This can
to an electron from the field is equal to the be done by releasing electrons from the cathode
average rate of loss of energy in collisions with of a plane-parallel system and varying the elec-
gas molecules. trode gap d and potential difference V in such
Many important quantities in this subject are a way that the electric field X(= V/d) is fixed. It
related to eXA, the energy gained by an electron is then found that the electron current at the
of charge e in traveling the mean distance Abe- anode, i, increases exponentially with d or V.
That is
tween two successive collisions with gas mole-
cules. Since" is inversely proportional to the gas
density , the above quantity can be expressed in i = io exp (T/V) (I)
the form X/Po where Po is the gas pressure re-
duced to some standard temperature . where io is the electron current released from
The mean energy of an electron in the swarm, the cathode and T/ is the electron ionization co-
€ , is a function only of X/Po for a particular gas. efficient: this is defined as the average number
Figure I shows the form of this variation for a of ionizing collisions made by an electron in
moving through a I-volt potential difference.

i'
200

150
T
.."
CII 15

~
E
100
N2 " 10

I
.>t:.

50
~ It)
5
0 I
6 (0
...
2 4 8 Q 0

xPo _ (v em-I torr-I)


5 10 15 20

~ X --(vcm-Itorr-I)
Po
FIG. 1. Mean electron energy as afunction of X/Po FIG. 2. Electron drift velocity as a function of X/Po
for He and N 2 • for He, A and N2 .
ELECTRICAL DISCHARGES IN GASES 322

where 'Y is a generalized secondary ionization


coefficient. This is defined as the probability of
a secondary electron being released from the
cathode per positive ion arriving at the cathode.
Included in 'Yare contributions to the secondary
I
emission arising from radiation quanta and meta-
> stable molecules.
Since 'Y depends largely on the mean energies
of the electrons and ions, it is, like f/, a function
only of X/Po though the function now depends
on the nature of the cathode as well as on the
+-__-+-_ _--+___+-_--1
5X 10- 5 gas.
0.5 5 50 500 5000 Spark Breakdown It is clear from the above
xPo _ (v em-I torr-I). equation that the ionization current tends to
become very large as the potential difference
FIG. 3. Electron ionization coefficient as a function across the gap approaches the value Vs given by:
of X/Po for air, Ne, A, and Ne + 10-2 per cent A.
(3)
f/ is also a function only of X/Po for a given gas
(Fig. 3).
It is important to note that the curve of it This is the condition for spark breakdown and
against X/Po passes through a maximum. The can be best explained in the following manner.
decrease in f/ at low X/Po is due to the increas- Suppose that a primary electron current io is
ing importance of excitation compared with released from the cathode when V = Vs. The
ionization as X/Po decreases; since the excita- electron current reaching the anode is then io
tion energy losses are larger in polyatomic than exp (f/ V s). Hence, the positive ion current reach-
in monatomic gases, as remarked earlier, the de- ing the cathode due to the current io is io [exp
crease in f/ occurs more rapidly in air than in (f/Vs) - 11. This will give rise to a secondary
neon. The decrease in f/ at high X/Po (300 volts electron current of value 'Yio [exp (f/ Vs ) - 11. If
cm -1 torr-I), where excitation losses are com- Vs is given by Eq. (3), then 'Y [exp (f/V s ) - 11 =
paratively unimportant, is due to the fact that 1 and the secondary current is equal to the orig-
an increasingly large fraction of the energy sup- inal primary current io . Hence it is clear that
plied from the field is used in maintaining the the process can continue even if the initiating
kinetic energy of the swarm. current ceases. When V is less than Vs , however,
The curve for the gas mixture Ne + 10-2 per the discharge current i is proportional to io
cent A is of great interest. Since the excitation [Eq. (2)1. Thus i = 0 when io = O. It follows
potential of the most important metastable state that V = Vs marks the transition from a non-self-
of Ne (16.5 volts) exceeds the ionization poten- maintained to a self-maintained discharge. Vs is
tial of A (15.8 volts), the process Ne* + A-+ best defined as the potential difference required
Ne + A+ + e can occur. This reaction has a very to maintain a small discharge current i when the
high probability, of the order unity per collision, primary current io = O. Vs is independent of i
and causes a great increase in f/ at low X/Po provided this is sufficiently small to avoid space
above the value for pure neon, since the effective charge distortion of the field.
excitation energy losses are now considerably Since f/ and 'Yare both functions only of X /P0
reduced. The double maximum in the curve of and X = Vs/d s at breakdown, it follows from
f/ vs X/Po arises from the fact that the direct and Eq. (3) that
indirect ionization processes have their maxi-
mum efficiencies at different X/Po values (~70 (4)
and 2 volts cm- 1 torr-I, respectively). Thus, for a given gas and cathode material, the
Secondary Ionization Processes It is found breakdown potential between large plane-paral-
that Eq. (l) no longer holds at larger values of lel electrodes depends only on the product of
V; i now increases more rapidly leading ulti-
mately to spark breakdown. This is due to the the reduced gas pressure and electrode separa-
occurrence of secondary ionization processes, tion. This result, which is known as Paschen's
in addition to ionization by collision between law, has been confirmed experimentally over a
electrons and gas molecules. In general, the most wide range of Po and d s.
important secondary process is the release of The variation of Vs with Pods for a number of
electrons from the cathode surface. If various gases and cathode materials is shown in Fig. 4.
simplifying assumptions are made, it can be It should be noted that the curves all exhibit a
shown that the ionization current is now given minimum; this corresponds to the maximum in
by: the curve of f/ vs X/Po. It will be seen that the
rise of Vs at high values of Pods is most marked
. io exp (f/ V) in air, less in pure Ne, and less still in the Ne + A
1= 1 - 'Y[exp (f/V) - 11 (2) mixture. This is readily understood by reference
323 ELECTRICAL DISCHARGES IN GASES

10,000 Alr/
5,000
50
2,000 Ne

~_4
1,000 40
V, 500 Ne+S.IO "loA
200 30
100 tF
0.1 10 100 1000 20

~50.8
Pods - - (torr em) 98.4
10
FIG. 4. Breakdown potential as a function of the
product of the reduced gas pressure and electrode sep- 0
aration, Pods for air, Ne, and Ne + 0.0005 per cent A 0 2 3 4 5
with an iron cathode.
100 A V _
V.
to the decrease in 17 at low X/Po (=Vs/Pod s ) in
these gases (Fig. 3). FIG. 5. Variation of formative time lag with over-
Time Lag of Spark Breakdown If a potential voltage for various fixed values of X/Po in H2 with
difference ;;;:. Vs is suddenly applied to a dis- copper electrodes.
charge gap, a finite time elapses before the ini-
tial current io has in~reased to a self-maiIl:tai!led total 'Y. This ranges from 0.75 at X/Po = 50 to
discharge current ~ 10- 7 ampere/ cm 2 . ThiS time 0.50 at X/Po = 300 volts cm- I torr-I.
lag consists of two parts. First of all, there is a Glow Discharge We have seen that any small
statistical lag which arises from the fact that the current i can be maintained even in the absence
primary and secondary ionization processes are of initiatory electrons when the potential differ-
both subject to statistical fluctuations. Thus,. al- ence between the electrodes reaches a value Vs
though V> Vs where 'Y [exp (17 V s) - 11 = I ~m­ given by Eq. (3). V is only independent of i
plies that on the average one electron leavmg when the latter is less than ~l /.lA. At higher
the cathode will give rise to one secondary elec- currents the space charge concentration becomes
tron, this may not happen in any part~cular case. sufficient to cause X and hence 17 to vary across
Clearly the mean statistical lag ts will decrease the gap, and 17 Vs in Eq. (3) must be replaced by
as the initial current is increased and it may be J17 dV. The static V-i characteristic is normally
shown that negative since the field redistribution produced
1 by the space charge effects increases the over-all
ts =PNo (5) ionization efficiency. Once breakdown has taken
place, the current increases to a value deter-
where No is the number of primary electrons mined by the impedance of the voltage supply.
leaving the cathode per second and P is the prob- If the current density is sufficiently small
ability that any particular electron leads to « 0.1 ampere/cm 2 ), the cathode is not heated
breakdown. The latter quantity is zero at the to a high enough temperature for thermionic
sparking threshold Vs but increases rapidly for emission to be a significant factor in the main-
V > Vs. P!:::<. I provided V> 1.25 Vs. tenance of the discharge. This regime is termed
The second component of the total time lag a glow discharge and the field variation across
is the formative lag tF' This can be regarded as the gap in a long cylindrical tube is indicated in
the time that must elapse after the appearance Fig. 6. We can distinguish five main regions here:
(I) The cathode fall, in which the field de-
of a suitable initiatory electron before the vari- creases from a high value at the cathode to
ous ionization processes generate a self-main- approximately zero.
tained current of any given magnitude. This cur- (2) The negative glow, in which ionization
rent can be chosen arbitrarily to specify break- and excitation are due largely to fast electrons
down of the gap and is gen~rally taken to be arriving from the cathode fall. The length of this
-10- 7 ampere/cm 2 . Clearly tF will depend on region is normally controlled by the distance
the relative importance of the various secondary traveled by the electrons before their energy is
mechanisms mentioned earlier; positive ion reduced below the minimum required for exci-
transit times are typically ~ 10-6 second, while tation.
the time lags involved in the contribution of (3) Faraday dark space. In many cases the
radiation ~uanta and metastable mo~ecules to ionization in the negative glow is so intense that
'Y are ~1O- and 10-3 second, respectively. The the electron current here exceeds the total dis-
observed variation of tF with overvoltage <l V charge current. A region is therefore required
(=V- Vs ) for various fixed values of X/Po in H2 where electrons are lost by diffusion and not
is shown in Fig. 5. Comparison with theory en- replenished by ioniZation; usually X ~ 0 here.
ables an estimate to be made of k, the relative (4) The positive column, where X has a small
contribution of photons at the cathode to the constant value such that the corresponding elec-
ELECTRICAL DISCHARGES IN GASES 324

current and pressure. This is termed the normal


cathode fall. The abnormal cathode fall occurs
t when i > inS, where S is the total cathode area.
Vc now increases with current.
X
Arc and High Current Discharges If i is in-
creased sufficiently, a stage will eventually be
reached where the cathode temperature is high
enough for thermionic emission to be important.
d- Vc now decreases with further increase in i
FIG. 6. Variation of axial field with distance from (Fig. 7), and we are in the region of the arc dis-
the cathode for a glow discharge in a long cylindrical ,charge. The transition current clearly depends
tube. on the rate of loss of heat from the cathode and
only has a definite value when the surface is
uni!o~m. In so~e a~c discharges (e.g., Hg), the
tron energy distribution gives an ionization rate emISSion mechaDlSmls probably not thermionic;
which just balances the loss of electrons and ions these are not fully understood however.
by radial diffusion to the walls. The current density is much higher in the arc
(5) The anode fall, where X again increases. than in the glow discharge. The charged particle
Regions (I) and (2) are the most important density is typically in the range 10 14 to 10 18
regions of the discharge; the primary and second- electrons cm- in the core of the arc when the
3

ary ionization by which the discharge is main- pressure is approximately atmospheric. Because
tained take place here. The fall of potential of the very high frequency of collisions between
across region (1), usually termed the cathode the electrons, positive ions, and neutral mole-
fall (Vc) is clearly an important parameter of cules, thermal eqUilibrium is often established
the discharge. for all the groups of particles present in the arc
It should be noted that the section extending positive column. The temperature at the axis of
from the negative glow to the anode has only a the arc is typically in the range 4 000 to 10 000
small field strength- and small resultant space K. At high pressures the temperature diminishes
charge with I nj - ne I ~ ne, where nj and ne are laterally very quickly whereas at low pressures it
the ion and electron concentrations. This region remains constant over a large portion of the
is generally called a plasma. In many cases, the cross section.
electrons here have a random motion which is We have assumed hitherto that the current is
large compared to their drift motion in the always sufficiently low for the magnetic field
field direction. Our earlier discussion on electron produced by the current to play an unimportant
swarms is valid here. On the other hand, the re- role in the discharge mechanism. At high currents
gions which occur near the cathode and the this is no longer true, and the interaction of the
walls have a high field strength and resultant self-magnetic field of the discharge and the cur-
space charge with nj ~ ne. The electrons and ions rent produces forces on the ionized gas compa-
behave here as a beam rather than as a swarm. rable to the other forces acting. The required
Ambipolar Diffusion The radial diffusion of currents increase with the gas pressure p; at nor-
ions and electrons to the wall in the plasma re- mal temperatures, i > 10 3 amperes and p < 1
gion (4), above, does not occur at the same rate t?rr are required. The force due to the magnetic
as when only one type of carrier is present. field tends to constrict the discharge, and a col-
Clearly, the electrons will tend to diffuse to the u!lIn so constricted is said to be pinched. This
walls much more rapidly than the ions leaving an plOch effect offers a possible method of con-
excess of positive charge. A space charge field fining the hot gas to a channel remote from the
is set up which retards electrons and accelerates
positive ions so that their effective diffusion
rates are equalized. This process can be described
in terms of the am bipolar diffusion coefficient
Da which is given approximately by:
200

Da ~Dj [1 + ~:] (6)

where Dj js the normal ion diffusion coefficient


and Te and Ti are the effective electron and ion
temperatures, respectively. o+-----~--~~--~
Cathode Fall When the current is sufficiently 50 100 150
small « I 0 rnA for a cathode of area'" 1 cm 2 )
i- (rna)
the discharge does not occupy the entire cathod~
area. The current density in the covered portion FIG. 7. Variation of cathode fall of potential with
in is approximately constant, and the cathode current for discharge in A at 30 torr pressure with
fall of potential Vc is nearly independent of spherical tungsten electrodes, 1.8 mm diameter.
325 ELECTRICAL MEA.SUREMENTS

walls of the containing vessel. However, a major current may be utilized; or some effect as-
obstacle to the achievement of a steady pinched sociated with quantum dynamics may be
discharge is the inherent instability of such a employed.
channel to lateral perturbations. This causes the The basis of any meaningful measurement of
pinched column to leave the axis of the con- an electrical quantity must ultimately be the
taining tube and take up a helical path in con- national reference standards maintained by the
tact with the walls. It is, however, possible that National Bureau of Standards (NBS), and as-
a suitable arrangement of magnetic fields may signed by means of absolute measurements in
help to stabilize the discharge, leading to the which certain electrical quantities are de-
prospect of continuous operation of a pinched termined in terms of appropriate mechanical
discharge. quantities. The electrical units are related to
the metric system of mechanical units in such
J. D. SWIFT a way that the units of power and energy are
identical in both systems. In 1960 the name
References Systeme International (SI), l now in use
throughout the world , was assigned to the mea-
Craggs, J. D., and Meek, J. M., "Electrical Breakdown surement system based on the meter, kilogram,
of Gases," London, Oxford University Press, 1953. second, ampere.
Loeb, L. B., "Basic Processes of Gaseous Electronics," Because four independent relations tie the
Berkeley, Cal., University of California Press, 1955. six electrical quantities together, only two abso-
Jones, F. Llewellyn, "Ionisation and Breakdown in lute measurements are needed to fix the na-
Gases," London, Methuen & Co., Ltd., 1957. tional reference standards. Historically, these
Acton, J. R., and Swift, J. D., "Cold Cathode Dis- have been an ohm and an ampere deter min a-
charge Tubes," London, Heywood and Co., Ltd.,
tion .2 In the ohm determination, a resistance
1963.
is compared to the reactance at a known fre-
von Engel, A., "Ionized Gases," London, Oxford Uni- quency of an inductor or capacitor whose
versi ty Press, 1965. magnitude is calculated from its measured di-
Somerville, J. M., "The Electric Arc," London, mensions together with the assigned magnetic
Methuen & Co., Ltd., 1959. constant (41T X 10- 7 Him in SI units), or in
Nasser, E., "Fundamentals of Gaseous Ionization and the case of a calculable capacitor, the corre-
Plasma Electronics," New York, John Wiley & Sons, sponding electric constant derived from the
1971. magnetic constant and the speed of light in
Llewellyn-Jones, F., "The Glow Discharge and an In- vacuum. Thus the ohm is assigned in terms of
troduction to Plasma Physics," London, Methuen & the meter and the second. In the ampere de-
Co., Ltd., 1966. termination , a current carried by two coils is
Raether, H. , "Electron Avalanches and Breakdown in evaluated in terms of their measured dimen-
Gases," London, Thornton Butterworth, 1964. sions, the magnetic constant, and the force with
Hoyaux, M., "Arc Physics," Berlin, Springer-Verlag, which they interact . This force is opposed by
1968. gravity acting on a known mass. The ampere is
Brown, S. C., "Introduction to Electrical Discharges in assigned in terms of the meter, kilogram, and
Gases," New York, John Wiley & Sons, 1966. second. The measured current is passed through
Franklin, R. N., "Plasma Phenomena in Gas Dis- a known resistor, and the volt is assigned using
charges," Oarendon, Oxford Univ. Press, 1979. Ohm 's law.
Howatson, A. M., "An Introduction to Gas Dis- The National Reference Standards, in terms
charges," London, Pergamon, 1976. of which the legal electrical units are main-
Meek, J. M., and Craggs, J. D., (Eds.), "Electrical tained, are groups of stable resistors, capacitors,
Breakdown of Gases," New York, John Wiley & and standard cells (Weston saturated cells)
Sons, 1978. whose values have been assigned by absolute
measurements. At NBS the reference standard
Cross-references: ELECTRICITY, IONIZATION, PO- of resistance is a group of I-U resistors, fully
TENTIAL, STATIC ELECTRICITY. annealed and mounted strain-free out of con-
tact with the air, in sealed containers. The
reference standard of capacitance is a group of
ELECTRICAL MEASUREMENTS 10 pF fused-silica-dielectric capacitors whose
values are assigned in terms of the calculable
In an electrical measurement, one is concerned capacitor used in the ohm determination.
with the evaluation of an electrical quantity- The reference standard of voltage is a group of
resistance, capacitance, inductance, charge, cur- standard cells maintained at a constant temper-
rent, voltage-or of a quantity that depends on ature. Their values are periodically reassigned
some combination of them. The measurement by means of a "Josephson experiment" to be
means may be a ratio device , such as a potenti- described below . The individuals within each
ometer or bridge in which similar quantities reference group are intercom pared routinely
are compared ; or it may be an electrome- to detect any drift with respect to the group
chanical system in which a force or torque is mean. Representatives of the group are com-
developed; or the heating effect of an electric pared with the national standards maintained
ELECTRICAL MEASUREMENTS 326

by the national laboratories of other countries tinues to be maintained by a group of l-n re-
to detect any differences that could develop sistors on the basis of their 1948 assignment.
in their "as-maintained" units. These inter- The voltage divider is the basic element of
national comparisons are carried out by the many measurement networks. In general it
Bureau International des Poids et Mesures consists of a group of series-connected resistors
(BIPM) in Sevres, France. Stability of the (or impedors using resistance, inductance,
national reference standards has also been capacitance, or some combination of them). Its
monitored in recent years by experiments operating principle is the following: when the
which relate an electrical unit directly to series circuit is tapped at an intermediate
some atomic constant-a natural invariant. In point but no current is drawn from the tap,
one such experiment, the ampere is related to the ratio of the voltage between the tap point
proton gyromagnetic ratio. 3 A characteristic and a divider terminal to the voltage between
magnetic moment and spin are associated with terminals equals the ratio of tapped resistance
the proton-it behaves both as a magnet and a (or impedance) to the total divider resistance
gyroscope-tending to align itself in a magnetic (or impedance). Modern dc potentiometers are
field and to precess about the field direction dividers that may achieve an accuracy of a few
with an angular velocity proportional to field parts-per-million (ppm) in comparing direct
strength if its alignment is disturbed. In fact, voltages and, with appropriate range-extending
proton precession frequency has become a dividers, can be used to measure voltages to
widely used method of measuring magnetic 1500 V or more. Factors limiting accuracy
field strength. Thus the repeatability of pro- in high-voltage dividers may be the heating
ton precession frequency in the field of a di- effect of power dissipated in the divider and
mensionally stable solenoid, excited by a cur- leakage currents across its insulating structure.
rent established in terms of the legal ohm and Standard cells may be intercompared to a few
volt, is a measure of the stability of the legal parts in I 0 8 , using special potentiometers de-
ampere. In another experiment use is made of signed to minimize parasitic voltages. Direct
the voltage appearing across a Josephson currents can be measured to a few ppm by com-
junction (made of two superconductors sep- paring the voltage drop the current produces
arated by a barrier), irradiated with microwave in a known resistance, with a known reference
power and simultaneously biased with direct voltage.
current. 4 The voltage across the junction in- Two voltage dividers may be connected in
creases with increasing bias current, in discrete parallel to the same source to form a bridge.
quantum jumps related to the frequency of the Equality of divider ratios-indicated by zero
microwave irradiation. This relation is Nhv = potential difference at their tap points-permits
2eVN, where N is step number, h is Planck's accurate comparison of impedances and is
constant, v is the microwave frequency, e is relatively insensitive to minor variations in the
elementary electron charge, and VN is the junc- level of supply voltage. For dc resistance mea-
tion voltage. Thus the stability of the legal volt surement, the Wheatstone network is used. One
is monitored in terms of frequency and the of its two dividers provides a known ratio, and
ratio of Planck's constant to electron charge by the other includes the unknown and a known
comparing the Josephson voltage to that of resistor with which it is compared. In this
the cells in the national reference group. simple form the Wheatstone bridge is used for
The legal volt was reassigned on January I, resistors (usually greater than 1 n) which have
1969, based on recent ampere determinations. only two terminals, Le. whose potential and
This assignment was such that the number current connections coincide. For 4-terminal re-
stating the emf of a standard cell was increased sistors, whose potential and current terminals
by 8.6 JJ.V over its previous (1948-68) assign- are separate, the Kelvin bridge is used, with two
ment. Other national laboratories also made dividers having known identical ratios in addi-
adjustments such that all national reference tion to the divider that incorporates the cur-
standards of emf (including the standard at rent circuits of the unknown and reference re-
BIPM) were in agreement at that time . It was sistor. Using a direct substitution method with
determined through Josephson volt experi- one of these bridges, nominally equal resistors
ments that the emf of the cells in the national can be compared at the ppm level or better, in
reference group was slowly drifting (as much as the range 10-4 -10 4 n.
0.3 JJ.V /year), and it was decided that the emf The more general impedance bridge is a 4-
of these cells would be periodically reassigned arm network, similar to the Wheatstone bridge-
to keep the legal volt at a constant value. s or a more complicated network that can be
Thus, since July 1, 1972 the legal volt has been reduced to an equivalent 4-arm array by ap-
maintained in terms of an assigned frequency / propriate de1ta-wye transformations? -in which,
voltage ratio through the Josephson volt experi- by proper choice of components, inductances
ment. Recent ohm determinations,6 based on may be intercom pared or measured in terms of
a calculable capacitor, indicate that the 1948 capacitance and resistance, or capacitances may
assignment of the national reference standard be intercom pared. The accuracy of such bridges
of resistance (USA) was correct within a is usually limited by the stability of ratios and
microohm, and the value of the legal ohm con- reference components. Additional limits may
327 ELECTRICAL MEASUREMENTS

be imposed by coupling between bridge ele- vices (SQUIDs) are very sensitive magnetic
ments or to nearby objects or ground. Such flux sensors. A SQUID comprising a supercon-
coupling may be an ambient magnetic field in- ductor formed into a loop interrupted by a
ducing a parasitic emf in a bridge element or in Josephson junction can be used as a flux de-
an open loop between elements, or by capac- tector in a current comparator for the measure-
itance or leakage between elements or to a ment of cryogenic resistance ratios. 9 Com-
source or to ground. Such effects can be re- parator sensitivity can amount to 0.5 nA-
duced or eliminated by choice and arrange- turns. Such an arrangement must be magnetically
ment of components and by the use of shields shielded and is inherently limited by the J ohn-
maintained at appropriate potentials to eliminate son noise of the resistors at their operating
the effect on bridge balance of stray capacitance temperature.
and leakage. In any measurement network (such as a
The inductive voltage divider, used extensively bridge or potentiometer) using voltage divider
as a bridge element at power and audio-fre- or current comparator techniques, a sensitive
quencies, is in principle a multi-decade, select- detector must be included to show the ab-
able tap-point auto-transformer whose decades sence of current (or voltage difference) in an
may be wound on separate magnetic cores. appropriate network branch. In a dc network
With proper construction this type of divider the simplest detector is the D' Arsonval gal-
can be operated at ppm or better accuracy. vanometer, consisting of a coil of fine wire sus-
Completely shielded bridges whose ratio arms pended by conducting filaments in a radial
are closely-coupled secondary windings of a magnetic field. A light-beam reflected from a
transformer with a high-permeability core, mirror attached to the moving system indicates
(known as transformer-ratio-arm bridges) are rotation arising from current in the coil. The
capable of high accuracy and are widely used light beam may be focused on a scale for direct
for capacitance measurements. For example, observation, or may be shared by differentially
stable 3-terminal 10-pF capacitors can be rou- connected photocells whose output is supplied
tinely compared to one part in 10 8 in such a to a second galvanometer, producing an ampli-
bridge. The voltage ratio supplied by the fied deflection. If the galvanometer is isolated
coupled secondaries may be 1/1, and is rarely from mechanical disturbances, unbalance detec-
more than 10/ l. The conjugate of the trans- tion at the nanovolt level is possible in circuits
former ratio-arm bridge is based on the current having resistance up to several hundred ohms.
comparator and has much greater ratio flexi- Electronic instruments which have been de-
bility.8 In this arrangement, two currents are veloped for low-level dc signal detection are
compared by equality of their opposed, uni- more convenient, more rugged, and less sus-
formly distributed linkages with a common ceptible to mechanical disturbances than is a
core, following Ampere's circuital law, D'Arsonval galvanometer. However, the mea-

f
suring circuits of electronic detectors must use
considerable filtering, shielding, and guarding
Hdl=LI, to minimize electrical interference and noise.
The galvanometer is an extremely efficient
low-pass filter, and (when operated to make
i.e., the line integral of magnetizing force along optimum use of its design characteristics) is
a closed loop is determined by the sum of the still the most sensitive low-level dc detector.
currents passing through the loop. The balance Electronic detectors generally use either a me-
point, corresponding to zero flux density in chanical or transistor chopper driven by an
the high-permeability core, is sensed by zero oscillator whose frequency is chosen to avoid
induced voltage in a detector winding around the local power frequency and its harmonics.
the core. There is generally an auxiliary winding This modulation converts the dc input signal
into which a current can be injected to compen- to ac which is then amplified, demodulated,
sate the difference in ampere-turns of the ratio and displayed on an indicating instrument,
windings, and so bring the system to balance. fed to a recording device, or subjected to fur-
The whole arrangement must be shielded from ther processing.
external magnetic fields. Current ratios as high Adjustable-frequency amplifier detectors are
as 1000/ I are feasible, with correspondingly used with ac bridges and basically incorporate
high voltage ratios impressed on the elements a low-noise preamplifier followed by a high-
being compared. This range of ratios makes gain amplifier around which is a tunable feed-
possible a relatively simple bridge for the evalu- back loop whose circuit has zero transmission
ation of high-voltage power cables, insulating at the selected frequency, so that the negative
structures, and energy-storage capacitors. DC feedback circuit controls the overall transfer
versions of the current comparator have been function of the detector to eliminate passage of
adapted to use in potentiometers and bridges signals at other than the chosen frequency. The
for the comparison of resistance standards display element which accepts the detector out-
and for the evaluation of current-carrying put may be a cathode-ray oscillograph (CRO)
shunts. or a rectifier-type moving-coil instrument.
Superconducting Quantum Interference De- Frequently a phase-selective element is inter-
ELECTRICAL MEASUREMENTS 328

posed between the amplifier and display ele- coilY In a milliammeter, the coil may directly
ments so that in-phase and quadrature bridge carry the current to be measured; in an am-
adjustments are independently displayed. If a meter the bulk of the current is carried by a
CRO is used, the unbalance bridge signal may shunt; in a voltmeter a series resistance limits
be impressed on the vertical deflection plates the current and the scale is marked in terms of
and a phase-adjustable signal derived from the the voltage drop between the instrument
bridge supply impressed on the horizontal terminals.
plates. The result is an elliptical screen pat- An ac instrument requires an arrangement for
tern (Lissajou figure), the slope of whose major which the direction of torque does not reverse
axis (after appropriate phase adjustment) with the direction of current at the instrument
represents one of the quadrature unbalance terminals. This requirement may be provided
signals, and whose minor axis represents the by four rectifier elements arranged in a square
other. Balance is indicated when the ellipse is with the input across one diagonal and a
collapsed to a straight line and the line is D'Arsonval meter across the other, the recti-
brought to the horizontal. Nanovolt sensitivity fier arrangement being such that the current in
is possible also in some types of ac detectors. the dc meter is the same for either direction
Indicating instruments are used to measure of input current. Response is to average value
current, voltage, phase, and power. With spe- of the rectified current, but because one is
cial circuit arrangements they can be used for usually concerned with effective (rms) value,
other electrical quantities or for nonelectrical the scale is marked in terms of rms value for an
quantities for which transducers can be devised assumed sine-wave input. Thus, a waveform
that will convert the measurand into an elec- error is present for a nonsinusoidal input.
trical signal. Indicating instruments fall into Rectifier instruments of this type are used as
two categories, analog and digital. Analog voltmeters and milliammeters in the power and
instruments generally have a pivoted rotating audiofrequency range.
element driven by the electrical signal, the turn- Thermocouple instruments use the heating
ing motion being opposed by springs so that effect of current in a fine wire or thin-walled
equilibrium position of the system under driv- tube to which a thermocouple is attached .
ing and restoring torques is indicated on a scale (These structures and related devices are re-
by a pointer attached to the moving element, ferred to as thermoelements, and they are also
or by an optical system employing a mirror. employed in thermal voltage and current con-
The accuracy of such a system is limited by verters as transfer elements.) In the analog-
how well the pointer position can be read, type instrument the thermocouple output
generally a percent or so, a tenth percent in goes to a low-range dc millivoltmeter. Since
the best of pointer/scale instruments, and a temperature rise of the heater elements is pro-
hundredth percent for some mirror/light- portional to the square of the current, there is
beam readouts. Some analog instruments also no waveform error in the indication of effec-
have front-end electronics which can improve tive (rms) value. This type of instrument may
sensitivity and input impedance of electro- be used for current measurement from dc to
mechanically-based instruments. Digital instru- rf range (200 MHz or more with some con-
ments convert an electrical signal into a digital structions). As a voltmeter, thermal converter-
output which may be either displayed as a based instruments are usually restricted to
numeric readout or further processed in a com- audio frequencies, but if a multiplier having
puter. The accuracy of such a system is limited low distributed capacitance is used, the range
by its internal reference and by its critical may be extended to a megahertz or more with-
network components such as the signal con- out serious error.
ditioner, sample-and-hold amplifier, and analog- In moving-iron instruments a soft-iron piece
to-digital converter. Additional limitations in forms the moving element. It is immersed in
many cases may be offset, gain, nonlinearity, the field of a coil that carries the current to be
quantization "noise," or other sources of error measured, and its motion is such as to increase
attributable to the analog-to-digital converter. the inductance of the system with increasing
Even though a digital instrument may have a current. Energy storage, and hence torque,
5-digit numeric display, its overall accuracy depend on current squared, and there is no
may be the equivalent of only 3 or 4 digits waveform error for rms indication. By suitably
(0.1 or 0.01%), depending on the critical net- shaping fixed and movable iron pieces, the scale
work components. A calibration service for can be made nearly linear over the upper 80%
precision analog-to-digital and digital-to-analog of its range. In other arrangements, part of the
converters has been available since 1981 at NBS scale can be compressed and a small portion of
based on a reference digital-to-analog converter total range expanded to cover much of the scale.
standard developed for this purpose. 10 This scheme is useful in a voltmeter that moni-
DC analog instruments usually have a perma- tors a nearly constant voltage.
nent-magnet moving-coil system whose opera- Electrodynamic instruments have a moving
tion is identical with that of the D' Arsonval coil immersed in the field of a fixed-coil system.
galvanometer, i.e., a response proportional to The interaction of their fields produces a
the average value of current in the moving torque proportional to the square of the cur-
329 ELECTRICAL MEASUREMENTS

rent for a series (voltmeter) connection, or to version. For ac measurements the DVM is pre-
the product of the currents in a shunt (ammeter) ceded by an ac/dc signal converter. If this con-
connection, so that the scale indication is of verter is a rectifier followed by a filter, to ob-
effective (rms) value as in a moving-iron instru- tain an average value of the rectified signal, the
ment. However, eddy currents can be much display of an equivalent rms value (for an as-
less, and electrodynamic instruments are sumed sine-wave input) will be subject to error
generally useful over an extended low-frequency for other than sinusoidal waveforms. Alterna-
range. Also, since there is no magnetic polariza- tively, thermoelements can be employed to pro-
tion (present on dc in a soft-iron instrument), vide accurate measurement of the true rms
the dc response can be error-free if the coil value of ac voltages, irrespective of wave
arrangement is astatic or is shielded from ambi- shape. Typically, the thermoelement output
ent magnetic fields. Electrodynamic instru- is part of a feedback network which provides
ments are also used as wattmeters to measure a dc output voltage equal to (or proportional
power at frequencies up to 1 KHz. The moving to) the true rms value of the ac input voltage.
coil with a series resistance is connected across The thermo element may be switched alter-
the supply as the voltage circuit of the watt- nately to the ac input and then to the dc out-
meter. The fixed coils are connected to carry put as shown schematically in Fig. 1. With
the load current. At any instant, the driving either input applied to the thermoelement, the
torque is proportional to the instantaneous output emf of the thermocouple is balanced
product of line voltage and load current; the against an opposing emf (produced automat-
moving coil, by reason of its inertia, takes up ically by the feedback loop). Equivalency be-
a position that represents average power over a tween the input rms ac voltage and the dc
cycle. output voltage is thus obtained by virtue of
A digital voltmeter (DVM) compares a di- detecting a null voltage under both input
rect voltage input to an internally generated conditions.
reference voltage. This comparison may be The reference voltage of a DVM is usually
done by successive approximations in an auto- based on the reverse breakdown of a Zener
matic self-balancing potentiometer; the refer- diode, and (depending on selection, aging, and
ence voltage may be used to generate a ramp compensation of temperature coefficient) may
function that opens a counting gate when the achieve 5-place accuracy. In fact, portable di-
ramp equals the input signal and closes the gate rect voltage standards of this type can, with
when the ramp voltage is zero, resulting in a temperature control, achieve short-time ac-
voltage-to-time conversion; a ramp voltage may curacy of a ppm and stability of 2ppm/year or
be generated at a rate proportional to the input better.t 2 Such a voltage standard can replace
until it equals a reference value, returned to a standard cell for many applications, although
zero, and repeated. The number of repetitions an inherently higher noise level is to be
in a fixed time (proportional to the input expected.
voltage) is counted and displayed, the time in- The ac-dc transfer function is an important
terval being chosen so that the number of one, since the basic standards of resistance and
pulses makes the meter direct-reading. In a emf, and the potentiometer techniques for ac-
dual-slope instrument, the input signal may be curately measuring current and voltage, are
made to charge a capacitor at a proportional available only on dc. Electrodynamic instru-
rate, switch to a reference voltage of opposite ments have been used as ac-dc transfer stan-
polarity, and discharge the capacitor at a con- dards in the range below a kilohertz.13 Thermo-
stant rate, the discharge time being used to gate elements with appropriate shunt and series
clock pulses to a counter for voltage-time con- resistors are used over an extended frequency

SCALING
DC
OUTPUT
AMPLIFIER

AC
INPUT

DIGITAL LOGIC
CONTROL CI RCU ITS

FIG. I. Automatic thermoelement-based rms ac voltmeter.


ELECTRICAL MEASUREMENTS 330

range, and are both portable and convenient and rotating coil indicators are still in use (with
to use. accuracies on the order of O.S %), digital phase-
A recent electrical measuring instrument de- angle meters are rapidly displacing them, where
velopment is that of the transient digitizer or resolutions of 0.0 I degree and O.OS degree
waveform recorder. These kinds of instruments accuracy are possible. is As with a DVM how-
have grown out of a need for measurements ever, there are certain limitations with respect
with the speed of a CRO and the accuracy of to input signal levels, zero offset, and fuIl-
a DVM. The waveform recorder typicaIly con- scale (gain) corrections, as weIl as nonlinearities
sists of a wide-band amplifier, high-speed which must be taken into account to achieve
sample-and-hold and analog-to-digital converter the best accuracy possible with digital phase
circuits, and a temporary (or buffer storage) meters . A precision audio-frequency phase-
memory. Similar to most oscilloscopes, the po- angle calibration standard has been developed
sition and duration of the captured segment of at NBS, which uses digital waveform synthesis
the input waveform can be selected. Both digital techniques to produce a reference and variable
and reconstructed analog output interfaces pair of sinusoidal waveforms whose relative
are also incorporated. Some of the best wave- phase displacement can be accurately con-
form recorders claim 20-100 MHz sampling trolled to within O.OOS degree.
rates with 0 .2S-0.1 % resolution 14 Accuracy The induction watthourmeter, for measure-
specification for these instruments is a subject ment of electrical energy, is probably the most
that is receiving considerable attention and one familiar and widely used of all electrical instru-
in which physical as weIl as performance stan- ments. More than 9 X 10 7 are in continuous
dards must be developed. use to meter electrical energy consumed in the
Instrument transformers are used at power USA, representing in excess of 30 billion dol-
frequencies to extend current and voltage mea- lars per year. The essential features of an induc-
surement capability beyond the nominal input tion watthourmeter are indicated schematicaIly
range of indicating instruments. Current trans- in Fig. 2. The voltage winding has many turns
formers rated at 20 kA are used in some instal- of fine wire and carries a current that is almost
lations, as are voltage transformers rated at in quadrature with the line voltage ; special lag
3S0 kV. Low-range current transformers of arrangements ensure that the magnetic field of
special construction are capable of accurate the voltage pole V lags the line voltage by 90
operation throughout the audio-frequency degrees. The current coils (a few turns of coarse
range . Instrument transformers, consisting of a wire) carry the load current, and the field of
primary and secondary winding coupled by a the current poles C - C is in phase with the load
magnetic core, are designed to accurately re- current. The eddy currents produced in the disk
produce the primary current (or voltage) on a by the field from the current poles are in
reduced scale in the secondary circuit. They are quadrature with this field, and hence have a
quite different from power transformers in component in phase with the field from the
details of design and operation, although voltage pole, corresponding to the in-phase
their basic operating principle is the same. (power) component of the load current. The
The usual current transformer, designed for use reaction of this component of current-pole-
with a S-A ammeter and to supply the current induced eddy currents in the disk, with the
circuit of a wattmeter or watthourmeter, is voltage-pole field, produces a driving torque
capable of delivering only the small amount of on the disk. Similarly, eddy currents in the
power required by these instruments. Current disk, induced by the voltage-pole, are in phase
transformers operate under nearly short-circuit with the field associated with the power com-
conditions with their magnetic core at low flux ponent of the load current in the current-
density. The usual voltage transformer, de- poles; this also produces a driving torque
signed for use with a 120-V voltmeter and the proportional to power through the meter. A
voltage circuit of a wattmeter or watthourmeter, braking torque proportional to disk speed
also has a low power rating, and operates under
nearly open-circuit condition with its magnetic
core approaching saturation. Instrument trans-
formers of good design may be expected to
have errors of only a few hundredths percent
when operated within their design limits.
Line i Load
Performance of a current transformer may be
evaluated in terms of a current comparator,
whose magnetic core operates at zero flux
density and is therefore free from the major
sources of error that arise from core excitation
in a transformer.
In power measurements and ac impedance
measurements in general, audio phase angle
is an important electrical quantity. Although
analog phase-angle meters using precision bridges FIG . 2.
331 ELECTRICITY

results from interaction of the brake magnet 10. Souders, T. M., Flach, D. R., and Bell, B. A., "A
field with the eddy currents it induces in the Calibration Service for Analog-to-Digital and
moving disk. As a result, disk speed is propor- Digital-to-Analog Converters," NBS Tech Note
tional to load power, and the number of disk 1145, U.S. Govt. Printing Office, Washington,
revolutions (read from a counter geared to the D.C., 1981.
disk) is a measure of the total energy consumed 11. Harris, F. K., "Electrical Measurements," Wiley,
by the load. New York, 1952.
Electric power and energy can also be mea- 12. Spreadbury, P. J., and Everhart, T. E., "Ultra-
sured by processing and combining current and Stable Portable Voltage Sources," lEE Conference
voltage signals electrically. This can be accom- Publication 174, pp. 117-120, 1979.
plished either by some form of analog multipli- 13. Hermach, F. L., "AC-DC Comparators for Audio-
cation, e.g., using a time-division multiplier,16 frequency Current and Voltage Measurements of
or by sampling the instantaneous currents and high Accuracy," Trans. IEEE IM-25, 489-494
voltages periodically, i.e., digitizing the samples (1976).
so obtained and computing the power from the 14. Lawton, R., "Proceedings of the Waveform
digital values obtained. In general, higher ac- Seminar," NBS Special Publication, U.S. Govt.
curacies can be obtained than with the methods Printing Office, Washington, D.C., 1982.
previously described, but the instrumentation 15. Turgel, R. S., Oldham, N. M., Stenbekken, G. N.,
required is quite complex. A portable system and Kibalo, T. H., "NBS Phase Angle Calibration
for measuring ac voltage, current, power, and Standard," NBS Tech Note 1144, U.S. Govt.
energy,17 based on a thermal converter which Printing Office, Washington, D.C., 1981.
compares a current derived from the quantity 16. Tomoto, M., Sugiyama, T., and Yamaguchi, K.,
being measured with an equivalent direct cur- "An Electronic Multiplier for Accurate Power
rent, has been found to have an accuracy of 20 Measurements," Trans. IEEE IM-17, 245-251
ppm in an international comparison of power (1968). Turgel, R. S., "Digital Wattmeter using a
measurements at unity and 50% power factor. Sampling Method," Trans. IEEE IM-23, 337-341
(1974 ).
FOREST K. HARRIS 17. Schuster, G., "Thermal Instrument for Measure-
BARRY A. BELL ment of Voltage, Current, Power, and Energy at
Power Frequencies," Trans. IEEE IM-29, 153-157
References (1980).
McAuliff, R., Lentner, K. J., Moore, W. J. M., and
1. "The International System of Units (SI)," NBS Schuster, G., "An International Comparison of
Special Publication 330, U.S. Govt. Printing Of- Power Measurements at 120 V, 5 A, 60 Hz,"
fice, Washington, D.C., 1971. Trans. IEEE IM-27, 445-449 (1978).
2. "Precision Measurement and Calibration-Elec-
tricity (Low Frequency)" NBS Special Publica- Cross-references: ALTERNATING CURRENTS; CA-
tion 300 (Vol. 3), U.S. Govt. Printing Office, PACITANCE; ELECTRICITY; INDUCTANCE; MAG-
Washington, D.C. 1968. NETOMETRY; MEASUREMENTS, PRINCIPLES OF;
3. Driscoll, R. L., and Olsen, P. T., "Application of SEMICONDUCTOR DEVICES; SYMBOLS, UNITS,
Nuclear Resonance to the Monitoring of Electrical AND NOMENCLATURE.
Standards" (Precision Measurements and Funda-
mental Constants), NBS Special Publication 343,
117-121, U.S. Govt. Printing Office, Washington, ELECTRICITY
D.C., 1971.
4. Harris, F. K., Fowler, H. A., and Olsen, P. T., Electricity is the material interaction that arises
"Accurate Hamon-pair Potentiometer for Joseph- from electromagnetic forces between electric
son Frequency/Voltage Measurements," Metrolo- charges at rest and in motion. The properties
gia 6, 134-142 (1970). of charge that are responsible for the main
5. Field, B. F. et aI., "Volt Maintenance at NBS via features of electrical phenomena are:
2e/h: A New Definition of the NBS Volt," Met- 1. Superposition. The effects of a group of
trologia 11,155-166 (1973). charges are the sum of the independent effects
6. Cutkosky, R. D., "New Measurement of the Abso- of each one, both in producing fields of force
lute Farad and Ohm," Trans. IEEE IM-23, 305- and in being acted on by them. The principle
309 (1974). of superposition gives significance to the total
7. Hague, B., and Foord, T. R., "AC Bridge Meth- amount of charge q or Q on an object or in a
ods," Pitman, New York, 1971. system, and also to the concepts of volume
8. Kusters, N. L., "Precise Measurement of Cur- charge density p, surface charge density a, and
rent Ratios," Trans. IEEE IM-13, 197-209 linear charge density X. The current density
(1974 ). vector J gives the direction and magnitude of
9. Gallop, J. C., and Petley, B. W., "SQUIDs and the rate per unit area at which charge moves
their Applications-A Review Article," Jour. through a surface perpendicular to its velocity.
Phys.-E (Sci. Insf.) 9,417-429 (1976). The total current through a surface S is then
Dzuiba, R. F., and Sullivan, D. B., "Cryogenic DC
Comparators and their Applications," Trans.
IEEE MAG-11, 716-719 (1975).
I = is J . d S taken over the surface.
ELECTRICITY 332

2. Conservation. Positive and negative charges surface S by


have opposite effects in the superposition sums,
and tend to be present in nearly equal numbers
in most circumstances so that electric neutrality
is widely prevalent in matter. The algebraic sum
<PE = Is E • dS, Is
<PB = B . dS. (3)
of the charges present in any closed system is
strictly constant, a fact which is represented by The variations of E and B in a region may be
the differential equation represented by lines of force which are tangent
at every point to the respective vector at that
(1) point.
6. Potential. When the charges responsible
A current of positive charges in the positive
for E are stationary, and in cases where they
direction is equivalent in the absence of mag- move so that the currents are steady, the elec-
netic fields to a current of negative charges in tric field is conservative. That is, E (x, y, z) is
the opposite direction. derivable from a single-valued scalar potential
3. Quantization. In ordinary matter, not
subject to high energy interactions (300 MeV or V(x,y, z):
more), negative charge is carried exclusively by E(x,y,z)=-VV(x,y,z). (4a)
electrons, and positive charge by protons, in-
cluding those in nuclei. Electrons and protons Equivalently, V can be found from E in a re-
have quantum-mechanical properties: gion by integration along an arbitrary line in
(a) Charge is quantized so that each electron space:
has the same charge - e and each proton has +e,
where e = 1.6092 ... X 10- 19 coulomb. This
value is so small that the quantization of charge
is not noticeable in macroscopic electromagnetic
effects. Nevertheless a quantity of charge q may
V(x,y, z) = -lP Po
E' dl (4b)

be defined in principle as an integer multiple


of ±e. where P is the point (x, y, z) and Po is an arbi-
(b) Wherever situated, electrons and protons trary reference point. V(x, y, z) is the potential
have available distinct quantized values or levels energy per unit charge of a small test charge
of energy, and each may generally be considered placed at P, considering its energy to be zero
to occupy a single level at a time. at Po. A line of electric force can be seen to
(c) The wave properties of electrons mean that proceed always from higher to lower potential
and can never cross or come back to itself. The
on a microscopic scale electrons can not be lo- variation
calized at points of space but are represented by scribed byofconstructing
potential in a region can be de-
a set of equipotential
a quantum-mechanical charge density. These surfaces, which are everywhere perpendicular
properties allow electrons to move freely to the electric lines of force. (See Fig. 1.)
through condensed matter provided they can be 7. Coulomb's Inverse Square Law. When a
excited into appropriate energy levels. charge is at rest or moving at a speed much
4. Mobility. Charge can move through solid lower than the speed of light c, it produces a
matter in electron waves, through liquids in the field E at a distant point P given by Coulomb's
form of ions in solution, and through gases as Law:
both electrons and ions. Without external forc-
ing, charge mobility always tends to neutralize
the containing matter. (5)
5. Force. A small piece of matter or particle
with a charge q moving at a speed v in some
frame of reference will experience a force given where r is the unit vector from q to P and r is
by the Lorentz force law: the distance. The constant €o is found to have
the value €o = 8.845 ... X 10- 12 in SI units.
F = q(E + v X B) (2) The field produced at P by many qk's at dif-
ferent places is given by a vector sum of (5)
where the vectors E (x, y, z) and B(x, y, z) are, over all the q k 'so
respectively, the electric and magnetic field From Coulomb's Law may be deduced the
vectors at the position (x, y, z) of q, and are Gauss Flux Law for the electric flux through an
determined by the magnitudes, positions, and arbitrary closed surface:

f
velocities of all the other charges in the neigh-
borhood. The magnetic force qv X B is respon-
sible for the motor and generator effects (see
ELECTRIC MOTORS, ELECTRIC POWER GEN-
~
E . dS = q;/€o (6)
ERA TION). It is readily seen that E is an ordinary
or polar vector, whereas B is a pseudovector or where the flux is calculated through a closed
axial vector (see VECTORS). For each vector arbitrary surface and the q i are the charges
there is defined the flux through an arbitrary within the surface and no others. The Maxwell
333 ELECTRICITY

I
I

\\
,, \
,
,, \
,, \
,
\, \.
,,
·,, ,
·,
+s +1
i

--- --

FIG. 1. Equipotentials and lines of force for +4q and -q. The light dashed line through
P separates flux that passes from +4q to -q from flux that passes from +4q to infinity.
(From Scott, p. 52; originally from Garrett, Milan Wayne, "Electricity and Magnetism,"
Ypsilanti, MI, University Litho-Printers, 1941, p. 48, Fig. 4, 7.)

differential-equation form of Gauss' Law is where r is the distance from the volume element
dr to the point (x, y, z).
V' E = p/eo. (7) 8. The Poisson and Laplace Equations. From
(4) and (7) we find Poisson's Equation:
A tube of flux in free space, the sides of which
are composed of flux lines, contains the same V2V(x,y,z)=p/eo (9)
flux across any section; the flux passes from
positive charge at one end of the tube to nega- which in charge-free space becomes Laplace's
tive charge at the other. Gauss' Law can be used Equation:
to find the electric field in situations of high
symmetry. The intensity of the field outside an (l0)
infinite uniform plane distribution of charge A unique solution of (9) exists for a set of fixed
density a is E = a/2eo regardless of distance; conductors with each conductor having either a
outside a uniform line of charge density X it is given value of V or a given value of total charge
E = X/2rreor where r is the cylindrical radius Q, so that the fields between the conductors
coordinate; ~nd outside a spherically symmetric and the charge distributions on them can be
distribution of total charge Q it is E = Q /4rreor2 , determined.
with r being the spherical radius coordinate. 9. Capacitance. For a bundle of flux leaving
Just outside a conductor with charges at rest, a portion of the surface of one conductor and
at a point where the surface density is a, E is ending on another, the ratio of the charge at
normal to the surface with intensity E = a/eo. the positive end to the potential difference be-
In the general case in which all charges can tween the two conductors is the capacitance of
be treated as volume distributions and are at that pair of conductor areas. In general, the
rest or slowly moving, the scalar potential can charges on the conductors are linear functions
be found from Coulomb's Law by of the potentials:
I
V(x,y, z) = - -
Jp dr
-- (8) Qj = L
n
Cjk V k (11)
4rreo r k=l
ELECTRICITY 334

where the Cjk are coefficients of capacitance.


When only two conductors are present, and
Ql - Q2 = Q, we call the pair a capacitor and
set
Q=CV (12)

where V is the potential difference between the


B
two, C is the capacitance, and Q is the charge
on the positively charged conductor.
10. Energy in Electrostatic Systems. The
energy in a capacitor is
U= t CV2 = t Q2 IC (13)
FIG. 2. Ampere-Biot·Savart law, Eq. (17).

and for a set of n conductors is


n n
U= t L: L: CjjVjVj
j= 1 j = 1
(14) The quantity eod<I>Eldt is called the displace-
=t L:QjVj
n ment current, where <I>E is the electric flux
from q through 1'. For a finite segment of cur-
j= 1 rent using (17) we find
which can be converted into a volume integral
i B. dl' = 1101 LlQ (20)
over the intervening space
J;, 41T

U= teo JE 2 dr. (15)where LlQ is the difference between the solid


angles subtended by I' at the ends of the cur-
11. The Ampere-Biot-Savart Law. A charge rent segment. (See Fig. 3.) LlQ is 41T for a closed
q moving at a nonrelativistic velocity v creates loop. For a combination of closed current loops
a magnetic field B at a distant point P given by h and moving or variable charge elements qj,
we have Ampere's generalized circuital law :
Iloq VXf
B=--- (16)
41T r2
where r is the unit vector from the charge q to
the point P and r is the distance. The constant where the sum is over all the current loops that
110 is given the value 41T X 10- 7 in SI units. An thread through I' and <I>E is the total electric
element of current of strength I and length dl flux produced by the net charges in the region.
makes a field dB given by replacing qv by Idl, Segments of current that are not closed, as in
so that for a finite length 1 of current we have

1
the case of a capacitor being charged, may be
counted either in the first term as currents or in
B = 110 1 dl X f . (17) the second as changing charges at the ends of
41T I r2 current segments.
The corresponding Maxwell differential equa-
For a closed loop of current, it can be shown
from (17) that
Iloi
B=--VQ=-lloVVm (18)
1T
where Q is the solid angle subtended by the
current loop at the point where B is sought.
The quantity Vm is the 'occasionally used multi-
---
q
dl

ple-valued magnetic scalar potential. Equations

f
(17) and (18) can be used to obtain the magnetic
fields of a number of shapes of wires and coils,
and when applied to microscopic current loops FIG. 3. Calculation of B . dl' around a closed
can account for the phenomena of magnetism.
If B in (16) is integrated around an arbitrary path in the neighborhood of a moving charge q sub-
close loop I' as in Fig. 2, it can be shown that tending a solid angle n'.
335 ELECTRICITY

tion is inductance L is defined by the analogous relation


(22) 8,. =-L dI/dt. (30)
12. Vector potential. From (17) it also fol- It may be shown that the magnetic energy in
lows that B can be found from a vector poten- a collection of circuits is
tial A:
U=tl:Lklk2+tl:MklkIj (31)
k k,j
B = V X A; " h Jdl
A=/J.oL.J- - (23)
41T ,2 which can be written as an integral over the
k
volumes of all the wires

1.
where the sum is over all the closed or open
loops into which the current sources of B can
be analyzed. The flux through an arbitrary loop U= t J . A dr (32)
I' can be rewritten in two ways: wIres

1
cI>B =
s
B . dS =j
ft'
A' dl' (24)
and over all space as

U= _1_ J B2 dr. (33)


From (23) the third Maxwell equation follows: 2J10 space

V'B=O (25) 14. Relative Motion. From (2) it is easy to


see that if the source of B moves at -v and q
remains at rest, the moving source must create

f
which asserts that the flux through any closed an electric field at the position of q:
surface is zero, B . dS = O. The lines of B E'=VXB. (34)
have no beginnings or ends; they generally cir- From (16) it is easy to see that if a collection
culate indefinitely without closing. of charges which creates a field E at a point is
13. Faraday's Law of Induction. For any all set into motion at a common speed v, the
closed loop in space, whether moving or sta- moving source of E creates a B' according to
tionary, Faraday's Law asserts that

i
B' = J1o€OV X E. (35)

8,. = E . dl = -dcI>B/dt = -d[f A . dl]ldt Motion of Charges in Matter


1. Solids. (a) Levels for the electrons in solids
that take part in binding lie in continuous bands
(26) separated by finite gaps. Because electrons are
fermions with spin !, each level can hold at
where € is the electromotive force (emf) around most two electrons; at equilibrium each band is
the loop 1 and <PB is the magnetic flux through filled up from the bottom. The average energy
I. The loop may be stationary or moving. From of a group of electrons added to or removed
(26) it follows that the electric field of magnetic from a solid is called the Fermi level (F .1.) and
origin is not conservative and is given by E = lies essentially at the top of the filled group of
-aAjat, so that the complete expression for levels. A change in the electric potential of a
electric field in terms of potentials is region in the piece of matter by .b. V will change
the F.!. by - e.b. V.
E = -VV - aAjat. (27) (b) A conductor has its F.!. below the top of
The corresponding Maxwell equation is a partly filled band called the conduction band.
When a gradient Vv = - E is established in the
V X E = -aB/at. (28) conductor, the gradient in F.1. causes electrons
to jump freely into slightly higher levels cor-
From (17) and (26) it follows that any circuit responding to motion in the direction of - E,
with a changing current will induce an emf in thus producing a current density
any neighboring circuit, or in itself. In the latter
case, the negative sign in (26) guarantees that (36)
the induced emf will oppose the current changes where ac is the conductivity. Eq. (34) is a form
producing it (Lenz' Law). We have of Ohm's Law. Applied to a wire of length 1
(29) and cross-section S, (36) becomes

for the emf in circuit j produced by a changing


current in circuit k, where Mjk is the mutual
inductance and obeys the rule Mjk = Mkj. Self-
V= 11 11
o
E . dl = I
0
dl
- = IR
as
(37)
ELECTRICITY 336

where R is the resistance and V is the potential (f) An insulator has all the levels of one band
drop along the wire. The power dissipated into filled and a substantial gap below the next,
electron transitions at crystal imperfections is empty band. The presence of a small field E
1 V = 12 R. This power can be proved (Poynting will shift electrons only within molecular dis-
theorem) to enter the segment of wire through tances, creating dipoles in the phenomenon
the sides, not the ends. of dielectric polarization. (See DIELECTRIC
No current in a conductor implies J = aE = 0 THEORY.) A sufficiently large E may transfer
in the interior of the conductor, which becomes some electrons to the empty band and provide
an equipotential volume. By (7), p also becomes them enough energy of motion to cause dielec-
zero in the interior; any net charge on the con- tric breakdown.
ductor must reside on its surface. Application (g) A semiconductor is an insulator with a
of Gauss' law to a surface in the material sur- narrow interband gap, which contains either
rounding an empty cavity in the conductor donor levels from atoms with excess electrons
shows that there can be no charge on the inside not covalently bound, or acceptor levels requir-
surface and no field in the cavity. The potential ing the addition of electrons to satisfy their
in such a cavity must therefore be the same as covalent bonds. In the former case (n-type) the
the potential of the conductor found by use of donated electrons are readily lifted into con-
(8), and will be called the cavity potential. ductor levels, above the gap, and in the latter
(c) A thin wire carrying steady current must (p-type) the residual "holes" become transferred
have J and E within it parallel to it sides. The down to the filled band and function as posi-
surface charges of its distributed capacitance tive carriers. (See SEMICONDUCTORS.)
must move with J.oThey are locally respon- 2. Aqueous liquids are like insulators in hav-
sible for a normal component of E outside the ing no accessible levels corresponding to macro-
wire, and over the whole circuit are the main scopic electron transfer. However, electrons can
sources of the field both inside and outside the transfer between molecules or atoms, and the
conductors. resulting positive and negative ions are free to
(d) The energy difference per unit charge be- move under the influence of an electric field.
tween the F.l. and a point in a field-free cavity The conductivity of a positive or negative ionic
in the conductor is called the work function I/> species is expressed by
of the solid. When two conductors of the same
cavity potential and differing I/>'s are placed in v± =±u±E and J± =neZ±v± (39)
electrical contact, electrons will transfer toward
the lower of the two F.l. 's until the F .l.'s come where v± is the ionic velocity, u± the mobility,
to equality with a difference Ll V = Lll/> between E the local field intensity, Z the valence, n the
the two potentials, which is the contact poten- ionic density, and J ± the partial current density
tial difference. (See Fig. 4.) due to one positive or negative species.
(e) When a temperature gradient exists in a In a liquid with a particular ionic constitution
conductor, the varying extent of thermal elec- there is always a highest occupied electron level
tron jumps will tend to cause a current. Ohm's and a lowest empty one. If a metallic electrode
law becomes is inserted, equilibrium will generally be estab-
lished between one of these levels and the Fermi
J=a(-VV+S*VT) (38) level in the metal; the electron transfer between
phases often results in deposition of neutral
where S*, the entropy transport per unit charge, atoms on the metal or removal from the metal.
is responsible for thermoelectricity. (See The F.l. of the electrode thus takes up a defi-
THERMOELECTRICITY.) nite position in relation to the cavity potential

Un = - eVF

Fig. 4. Energy levels for adjacent metals in contact when equilibrium has been estab-
lished (Fermi levels UF coincident).
337 ELECTROACOUSTICS

of the solution. A cell is an arrangement with ELECTRICAL DISCHARGES IN GASES; ELEC-


two electrodes in which a solution has two TRICAL MEASUREMENTS; ELECTROCHEMISTR Y;
different constitutions at the electrodes that are ELECTROMAGNETIC THEORY; ELECTRON;
capable of reacting by electron transfer but FERROMAGNETISM; HIGH VOLTAGE RESEARCH;
cannot react directly with each other because INDUCED ELECTROMOTIVE FORCE; INDUC-
of physical separation. The two electrode F.l.'s TANCE; MOTORS, ELECTRIC; POTENTIAL; SEMI-
came to different positions in relation to the CONDUCTOR; STATIC ELECTRICITY.
solution cavity potential, the difference being
the electomotive force (emf) 8, of the cell.
When current passes through a cell, the sup-
pressed reaction can proceed via electron trans- ELECTROACOUSTICS
fer through the outer circuit. Owing to internal
resistance r in the cell, the external potential or Introduction Electroacoustics is concerned
F.l. difference becomes V = 8, - Ir. The amount with the transduction of acoustical to electrical
of reaction at either electrode is such that one energy and vice versa. Devices which convert
mole of reactant of valence Z will be deposited acoustical signals into electrical signals are re-
or dissolved when the charge transferred is ferred to as microphones or hydrophones de-
Q =It =N A Ze, an expression of Faraday's laws pending on whether the acoustic medium is air
of electrochemistry. The transition between or water. Devices which convert electrical sig-
two or more types of current carriers in the nals into acoustical waves are referred to as
solution and one type only at the electrode is loudspeakers (or earphones) in air and projectors
accomplished by the occurrence of diffusion in water.
processes. Transduction Mechanisms Piezoelectricity
3. In gases, electrons can move freely between Certain crystals produce charge on their surfaces
molecules once knocked loose by molecular when strained and conversely become strained
collision or by energetic charged particles. Local when placed in an electric field. Important
fields will accelerate electrons and ions in be- piezoelectric crystals include quartz, ADP,
tween collisions. A field strength of about lithium sulfate, rochelle salt, and tourmaline.
3 X 10 6 V jm is enough to initiate a breakdown Lithium sulfate and tourmaline are volume
in air at S.T.P. by allowing electrons to gain expanders, that is, their volume changes when
enough energy between collisions to ionize subjected to an electric field in the proper di-
more molecules. Somewhat lesser fields can rection. Such crystals can detect hydrostatic
render a gas partially conducting. In a highly pressure directly. Crystals which are not volume
ionized condition, the gas becomes a plasma expanders must have one or more surfaces
(PLASMA). shielded from the pressure field in order to
Electrostatic phenomena of charge transfer convert the pressure to a uniaxial strain which
make use of gaseous conduction, of charge can be detected. Tourmaline is relatively insen-
redistribution in conductors under the influ- sitive and used primarily in blast gauges. Quartz
ence of fields, and of forces exerted on the is used principally in high-Q ultrasonic trans-
charges that reside on the surfaces of conduc- ducers.
tors. For the production of fields by charges Certain ceramics such as lead zirconate titanate
moving at relativistic velocities, see PROPAGA- (PZT), barium titanate, and lead metaniobate
TION OF ELECTROMAGNETIC WAVES. become piezoelectric when polarized. They
exhibit relatively high electromechanical cou-
WILLIAM T. SCOTT pling, are capable of producing very large forces,
and are used extensively as sources and receivers
References for underwater sound. PZT and barium titanate
have only a small volume sensitivity hence must
Scott, W. T., "The Physics of Electricity and Mag- have one or more surfaces shielded in order to
netism," Second Edition, New York, John Wiley, detect sound efficiently. Piezoelectric ceramics
1966; Huntington, N.Y., Robert E. Krieger, 1976. have extraordinarily high dielectric coefficients
Reitz, J. R., Milford, F. J., and Christy, R. W., "Foun- and hence high capacitance and are thus capable
dations of Electromagnetic Theory," Third Edition, of driving long cables without preamplifiers.
Reading, Mass., Addison-Wesley, 1979. Recently, it has been discovered that certain
Lorrain, P., and Corson, D., "Electromagnetic Fields polymers, notably polyvinylidene fluoride, are
and Waves," Second Edition, San Francisco, W. H. piezoelectric when stretched. Such piezoelectric
Freeman, 1970. polymers are finding use in directional micro-
Moore, A. D. (Ed.), "Electrostatics and Its Applica- phones and ultrasonic hydrophones.
tions," New York, John Wiley, 1973. Magnetostriction Some ferromagnetic mate-
Portis, A. M., "Electromagnetic Fields: Sources and rials become strained when subjected to a mag-
Media," New York, John Wiley, 1978. netic field. The effect is quadratic in the field
so a bias field or dc current is required for linear
Cross-references: CAPACITANCE; CIRCUITRY; operation. Important magnetostrictive metals
CONDUCTIVITY, ELECTRICAL; DIELECTRIC and alloys include nickel and permendur. At
THEORY; ELECTRIC POWER GENERATION; one time, magnetostrictive transducers were
ELECTROACOUSTICS 338

used extensively in active sonars but have now exposed to an acoustic field as one leg of an
been largely replaced by ceramic transducers. optical interferometer. Path length changes of
Magnetostrictive transducers are rugged and the order of 10- 6 optical wavelengths can be
reliable but inefficient and configurationally detected. The principal advantages of such
awkward. Recently, it has been discovered that sensors are their configurational flexibility, their
certain rare earth iron alloys such as terbium- sensitivity, and their suitability for use with
dysprosium-iron possess extremely large mag- fiber optic cables. Fiber optic sensors which
netostrictions (as much as 100 times that of utilize amplitude modulation of the light (mi-
nickel). They have relatively low eddy current crobend transducers) are also being developed.
losses but require large bias fields, are fragile, Parametric Transducers The nonlinear inter-
and have yet to find significant applications. action of sound waves can be used to produce
Metallic glasses have also recently been con- highly directional sound sources with no side
sidered for magnetostrictive transducers. lobes and small physical aperture. In spite of
Electrodynamics Electrodynamic transducers their inherent inefficiency, substantial source
exploit the forces produced on a current-carry- levels can be achieved and such "parametric
ing conductor in a magnetic field and, con- sonars" have found a number of underwater
versely, the currents produced by a conductor applications. Parametric receivers have also
moving in a magnetic field. Direct-radiation been investigated but practical applications
moving-coil transducers dominate the loud- have yet to be found.
speaker field. Prototypes of high power under- Carbon Microphones Carbon microphones
water projectors have been constructed using utilize a change in electrical resistance with
superconducting magnets. Electrodynamic mi- pressure and are used extensively in telephones.
crophones, particularly the directional ribbon Sensitivty and Source Level A microphone
microphones, are also common. or hydrophone is characterized by its free-field
Electrostatics Electrostatic sources utilize voltage sensitivity M, which is defined as the
the force of attraction between charged capaci- ratio of the output voltage E to the free-field
tor plates. The force is independent of the sign amplitude of an incident plane acoustic wave.
of the voltage so a bias voltage is necessary for That is, for an incident wave which in the ab-
linear operation. The forces are relatively weak sence of the transducer is given by
so a large area is needed to obtain significant
acoustic output. The effect is reciprocal, with P=Pocos(k'R-wt), (1)
the change in the separation of the plates (i.e., M is defined by
the capacitance) produced by an incident
acoustic pressure generating a voltage. The im- M =E/Po . (2)
pedance of a condenser microphone, however,
is high, so a preamplifier located close to the In general M will be a function of frequency
sensor is required. Condenser microphones are and the orientation of the transducer with re-
very flat and extremely sensitive. The change spect to the wave vector k (i.e., the direction of
in capacitance induced by an acoustic field can incidence of the wave). Thus, for a given fre-
also be detected by making the capacitor a part quency, M is proportional to the directivity of
of a bridge circuit, or alternatively part of an the transducer. It is usually desirable for a mi-
oscillator circuit. The acoustic signal will then crophone or hydrophone to have a flat (i.e.,
appear as either an amplitude or frequency frequency independent) free field voltage sensi-
modulation of some ac carrier. The charge stor- tivity over the broadest possible range of fre-
age properties of electrets have been exploited quencies to assure fidelity of the output elec-
to produce electrostatic microphones which do trical signal.
not require a bias voltage. A loudspeaker or projector is characterized in
Magnetism Magnetic transducers utilize the a similar manner by its transmitting current
force of attraction between magnetic poles and, response S which is defined as the ratio of the
reciprocally , the voltages produced when the acoustic source level to the driving current I. In
reluctance of a magnetic circuit is changed. the farfield of a transducer the acoustic pressure
Magnetic speakers are used extensively in tele- is a spherical wave which can be expressed as
phone receivers.
Hydraulics Nonreversible, low frequency, P(R) =Ps (fJ , cf» (RoIR) cos (kR - wt) (3)
high power underwater projectors can be con-
structed utilizing hydraulic forces acting to where () and cf> are elevation and azimuth angles
move large pistons. Electroacoustic transduc- and Ro an arbitrary reference distance (usually
tion is achieved by modulating the hydraulic 1 meter). Ps «(), cf» is defined as the source level.
pressure with a spool valve actuated by an elec- Thus S is given by
trostrictive (PZT) stack. (4)
Fiber Optics An acoustic field acting on an
optical fiber will change the optical path length which is a function of () and cf> and the frequency
by changing the length and index of refraction w. For high fidelity sound reproduction S should
of the fiber. Extremely sensitive hydrophones be as flat as possible over the broadest possible
and microphones can be made by using a fiber bandwidth. For some purposes, however, such
339 ELECTROACOUSTICS

as ultrasonic cleaning or long range underwater Tem =-Tme =BL (8)


acoustic propagation, fidelity is unnecessary
and high-Q resonant transducers are employed where L is the length of the wire in the coil and
to produce high intensity sound over a narrow the electrical impedence Ze is largely inductive.
bandwidth. For ~ piston transducer with a piezoelectric
Reciprocity Most conventional transducers "mofor"
are reversible, that is, they can be used as
either sources or receivers of sound (a carbon T me = Tem = -id 33 /(e T sw) (9)
microphone and a fiber optic hydrophone are where d 33 is the piezoelectric strain coefficient,
examples of transducers which are not revers- s is the compliance, eT is the permittivity at
ible). A transducer is said to be linear if the constant stress, and the electrical impedance Ze
input and output variables are linearly propor- is largely capacitive.
tional (hot-wire microphones and unbiased If a piston transducer is placed in an acoustic
magnetostrictive transducers are examples of field such that the average pressure over the
nonlinear transducers). A transducer is said to surface of the piston is PB then F =PB A, where
be passive if the only source of energy is the A is the area of the piston, and for a receiver
input electrical or acoustical signal (a micro- 1= 0, so
phone with a built in preamplifier and a para-
metric receiver are examples of nonpassive E=(TemA/Zm)PB . (0)
transducers). Most transducers which are linear,
passive and reversible exhibit a remarkable If the transducer is small compared with an
property called reciprocity. For a reciprocal acoustic wave length PB ~ Po (in general PB =
transducer of any kind (moving coil, piezoelec- DPo, where D is the diffraction constant) and
tric, magnetostrictive, electrostatic, magnetic the free field voltage sensitivity is given by
etc.) the ratio of the free field voltage sensitivity
to the transmitting current response is equal to M = TemA/Zm. (11)
the reciprocity factor J, which is independent From Eq. (5) the transmitting current response
of the geometry and construction of the trans-
ducer. That is: is

M(w,e,rp) 41TRo PowTem A


S=':""::"'--'=.!!.!-
---'---'--'--'- = J (w) = - - (12)
(5)
Sew, e, rp) Pow 41TROZm

where Po is the density of the medium and Ro From these simple considerations a number
is the reference distance used in defining the of principles of practical transducer design can
source level. Equation (5) has a number of use- be deduced. The mechanical impedance Zm is
ful consequences: 0) the receiving and trans- in general given by
mitting beam patterns of a reciprocal transducer
are identical, (2) a transducer cannot be simul- Km .
Zm =-.-+zwM+R m (3)
taneously flat as a receiver and transmitter since zw
S has an additional factor of w, and (3) Eq. (5)
provides the basis for the three transducer re- where Km is an effective spring constant, M the
ciprocity calibration technique whereby an mass, and Rm the mechanical resistance. For a
absolute calibration of a hydrophone or micro- piezoelectric transducer (Eq. (9» Tem is in-
phone can be obtained from purely electrical versely proportional to frequency, hence from
measurements. Eqs. (10) and (11) we see that a piezoelectric
Canonical Equations and Electroacoustic transducer will have a flat receiving sensitivity
Coupling Simple acoustic transducers can be below resonance (i.e., where its behavior is
characterized by the following canonical equa- controlled by stiffness). On the other hand a
tions: moving coil microphone must have a resistive
mechanical impedance to have a flat response.
E = ZeI+ Tem V (6) From Eq. (12) we derive the fundamental tenet
of loudspeaker design, that a moving coil loud-
F= TmeI+Zm V (7) speaker will have a flat transmitting current
response above resonance (i.e., where it is mass
where V is the velocity of the radiating or re- controlled). Accordingly, moving coil loud-
ceiving surface, F is the total force acting on speakers are designed to have the lowest pos-
the surface (including acoustic reaction forces), sible resonant frequency (by means of a high
Ze is the blocked (V = 0) electrical impedance, compliance, since the output is inversely pro-
Zm is the open circuit mechanical impedance portional to the mass) and piezoelectric hydro-
and Tem and T me are the electromechanical phones are designed to have the highest possible
coupling coefficients. For reciprocal trans- resonant frequency.
ducers Tem = ± T me. For example, for a moving An interesting and important consequence of
coil transducer where the "motor" is a coil in electromechanical coupling is the effect of the
a radial magnetic field B, motion of the transducer on the electrical im-
ELECTROACOUSTICS 340

pedance. In the absence of external forces (in- tional microphones are useful to reduce un-
cluding radiation reactance) from Eqs. (6) and wanted noise (e.g., to pick up the voice of a
(7) speaker and not the audience), directional hy-
drophones or hydrophone arrays increase
E=(Ze- Te~:me)I. (14)
signal-to-noise and aid in target localization.
One way to achieve directionality is to make
the radiating surface large. A baffled circular
That is, the electrical impedance has a "mo- piston has a directivity given by
tional" component given by TemTme/Zm. The
motional component can be quite significant Do = 2J1 (ka sin 8)/ka sin 8 (19)
near resonance where Zm is small. This effect
is the basis of crystal controlled oscillators. Do equals unity for 8 = 0 and ~ when ka sin 8 =
2.2. For small values of ka, Do is near unity for
Radiation Impedance An oscillating surface all angles.
produces a reaction force FR on its surface Some transducers respond to the gradient of
given by the acoustic pressure rather than pressure, for
(15) example, the ribbon microphone which works
by detecting the motion of a thin conducting
where ZR is the radiation impedance. We can strip orthogonal to a magnetic field. Such trans-
thus rewrite Eq. (7) as ducers have a directivity which is dipole in na-
~ure, Le.,
Fext=TemI+(ZR+Zm)V (16)
DO = cos 8. (20)
where F ext now includes only external forces.
For an acoustically small baffled circular piston Note that since the force in this case is propor-
of radius a, tional not to Po but to kPo , a ribbon micro-
phone (which like a moving coil microphone is
ZR = rra 4 Pow 2/2c - i(8/3)wpoa 3 • (17) electrodynamic) will have flat receiving sensitiv-
ity when its impedance is mass controlled. By
The radiation impedance thus has a masslike combining a dipole receiver with a monopole
reactance with an equivalent "radiation mass" receiver one obtains a unidirectional cardioid
of (8/3 )poa 3 and a small resistive component receiver with
proportional to w 2 responsible for the radiated
power. A transducer will thus have a lower Do =(1 +cos8) (21)
resonant frequency when operated underwater
than when operated in air or vacuum. The total P. H. ROGERS
radiated power of the piston transducer is given
by References

-:-=ReZRIVI2 = (rra 4 pow2j2c)V2. (18) Hunt, F. V., "Electroacoustics," Cambridge, MA, Har-
vard Univ. Press, and New York, John Wiley & Sons,
Most transducers are displacement limited so 1954.
for a direct radiating transducer V in Eq. (18) is Bobber, R. J., "Underwater Electroacoustic Measure-
limited. To obtain the most output power the ments," U.S. Gov. Printing Office, Washington, D.C.,
piston should have the largest possible surface 1969.
area consistent with keeping the transducer Bouyoucos, "Hydroacoustic Transduction," J. Acoust.
omnidirectional (the transducer will become Soc. Am. 57, 1341 (1975).
directional when a ;C A). This is easy to do in air Meeks, S. W., and Timme, R. W., "Rare Earth Iron
but difficult in water, since it is hard to make Magnetostrictive Underwater Sound Transducer,"
pistons which are both lightweight and stiff J. Acoust. Soc. Am. 62, 1158 (1977).
enough to hold their shape in water. Alterna- Bacaro, 1. A., Dardy, H. D., and Carome, E. F., "Fiber
tively, the driver can be placed at the apex of a Optic Hydrophone," J. Acoust. Soc. Am. 62, 1302
horn. For a conical horn the fluid velocity at (1977).
end of the horn (where the radius is ae) will be Moffett, M. B., and Mellon, R. M., "Model for Para-
reduced to V(a/ae) but the radiating piston will metric Acoustic Sources," J. Acoust. Soc. Am. 61,
now have an effective radius of ae so the radi- 325 (1977).
ated power will increase by a factor of (ae/a)2 Ricketts, D., "Electroacoustic Sensitivity of Piezoelec-
For high power operation at a single frequency tric Polymer Cylinders," J. Acoust. Soc. Am. 68,
the driver can be placed at the end of a quarter 1025 (1980).
wave resonator. Sessler, G. M., and West, J. E., "Applications," in
Directivity It is often desirable for trans- "Electrets" (G. M. Sessler, Ed.), New York, Springer-
ducers to be directional. Directional sound Verlag, 1980.
sources are needed in diagnostic and thera- Bobber, R. 1., "New Types of Transducer," in "Under-
peutic medical ultrasonics, for acoustic depth water Acoustics and Signal Processing (1. Bjorno,
sounders and to reduce the power requirements Ed.), Dordrecht, Holland, D. Riedel Publishing
and reverberation in active sonars, ·etc. Direc- Company, 1981.
341 ELECTROCHEMISTRY

Cross-references: ACOUSTICS,
CRYSTALLOGRA- for example, nonaqueous solutions usually have
PHY, DIELECTRIC THEORY, LASER, MAGNE- much lower electrical conductivity than aqueous
TISM, RESONANCE, SOLID-STATE PHYSICS, UL- systems and as a consequence find less use
TRASONICS. in electrochemical applications. Molten salts,
on the other hand, are ionic fluids, freely mix,
and have high electrical conductivity but are
ELECTROCHEMISTRY restricted to electrochemical uses at high
temperatures.
Electrochemistry is that branch of science which A current of electricity in an electronic con-
deals with the interconversion of chemical and ductor is due to a stream of electrons, particles
electrical energies, i.e., with chemical changes of subatomic size, and the current causes no net
produced by electricity as in electrolysis or with transfer of matter. The flow is, therefore, in a
the production of electricity by chemical action direction contrary to what is conventionally
as in electric cells or batteries. The science of known as the "direction of the current." In
electrochemistry began about the turn of the electrolytic conductors, the carriers are charged
eighteenth century. In 1796 Alessandro Volta particles of atomic or molecular size called ions,
observed that an electric current was produced and under a potential gradient, a transfer of
if unlike metals separated by paper or hide matter occurs.
moistened with water or a salt solution were An electrolytic solution contains an equivalent
brought into contact. Volta used the sensation quantity of positively and negatively charged
of pain to detect the electric current. His obser- ions whereby electroneutrality prevails. Under a
vation was similar to that observed ten years potential gradient, the positive and negative ions
earlier by Luigi Galvani who noted that a frog's move in opposite directions with their own
leg could be made to twitch if copper and iron, characteristic velocities and each accordingly
attached respectively to a nerve and a muscle, carries a different fraction of the total current
were brought into contact. through anyone solution. Each fraction is refer-
In his original design Volta stacked couples of red to as the ionic transference number. Further-
unlike metals one upon another in order to in- more, the velocity increases with temperature
crease the intensity of the current. This arrange- causing a corresponding increase in electrolytic
ment became known as the "voltaic pile." He conductivity. This characteristic is opposite to
studied many metallic combinations and was that observed for most electronic conductors
able to arrange the metals in an "electromotive which show less conductivity as their tempera-
series" in which each metal was positive when ture is increased.
connected to the one below it in the series. The concept that charged particles are respon-
Volta's pile was the precursor of modern bat- sible for the transport of electric charges through
teries (see BATTERIES). electrolytic solutions was accepted early in the
In 1800 William Nicholson and Anthony history of electrochemistry. The existence of
Carlisle decomposed water into hydrogen and ions was first postulated by Michael Faraday in
oxygen by an electric current supplied by a 1834; he called negative ions "anions" and posi-
voltaic pile. Whereas Volta had produced elec- tive ones "cations." In 1853, Hittorf showed
tricity from chemical action these experimen- that ions move with different velocities and ex-
ters reversed the process and utilized electricity ist as separate entities and not momentarily as
to produce chemical changes. In 1807 Sir Hum- believed by Faraday. In 1887, Svante Arrhenius
phry Davy discovered two new elements, potas- postulated that solute molecules dissociated
sium and sodium, by the electrolysis of the re- spontaneously into free ions having no influence
spective solid hydroxides, utilizing a voltaic pile on each other. However, it is known that ions
as the source of electric power. These electro- are subject to coulombic forces, and only at in-
lytic processes were the forerunners of the many finite dilution do ions behave ideally, i.e., inde-
industrial electrolytic processes used today to pendently of other ions in the solution. Ioniza-
obtain aluminum, chlorine, hydrogen, or oxy- tion is influenced by the nature of the solvent
gen, for example, or in the electroplating of and solute, the ion size, and solute-solvent inter-
metals such as silver or chromium. action. The dielectric constant and viscosity of
Since in the inter conversion of electrical and the solvent play dominant roles in conductivity.
chemical energies, electrical energy flows to or The higher the dielectric constant, the less are
from the system in which chemical changes take the electrostatic forces between ions and the
place, it is essential that the system be, in large greater is the conductivity. The higher the vis-
part, conducting or consist of electrical conduc- cosity of the solvent, the greater are the fric-
tors. These are of two general types-electronic tional forces between ions and solvent molecules
and electrolytic-though some materials exhibit and the lower is the electrolytic conductivity.
both types of conduction. Metals are the most In 1923 Debye and Ht)ckel presented a theory
common electronic conductors. Typical electro- which took into account the effect of coulombic
lytic conductors are molten salts and aqueous forces between ions. They introduced the con-
solutions of acids, bases, and salts. Owing, in cept of the ion atmosphere, or continuous
general, to the low solubility and ionizatiol1 of charge distribution, which is a continuous func-
solutes in nonaqueous solvents, ethyl alcohol tion of r, the radial distance from a central or
ELECTROCHEMISTRY 342

reference ion rather than a discrete or discon- current and time, i.e., the total charge (cou-
tinuous charge distribution. The ion atmosphere lombs) passed through a solution. These laws
acts electrostatically somewhat like a sphere of are: (1) the amount of chemical change, e.g.,
charge - € at some average distance from a cen- chemical decomposition, dissolution, deposition,
tral or reference ion of charge +€, with the value oxidation, or reduction, produced by an electric
of the average distance approximating that of current is directly proportional to the quantity
the ionic radii of ionic crystals. This. interionic of electricity passed through the solution; (2) the
attraction leads to two effects on the electro- amounts of different substances decomposed,
lytic conductivity. Under a potential gradient, dissolved, deposited, oxidized, or reduced are
an ion moves in a certain direction. However, proportional to their chemical equivalent weights.
the ion cloud, being of opposite sign will tend A chemical equivalent weight of an element or
to move in the opposite direction, and because a radical is given by the atomic or molecular
of its attraction for the central ion, will have a weight of the element or radical divided by its
retarding effect on the ion velocity and thereby valence; the valence used depends on the elec-
lead to a lowering in the electrolytic conductiv- trochemical reaction involved. The electric
ity. On the other hand, the central ion will tend charge on an ion is equal to the electronic
to pull the ion cloud with it to a new location. charge or some integral multiple of it. Accord-
The ion atmosphere will adjust to its new loca- ingly, a univalent negative ion has a charge
tion in time, but not instantaneously, and the equal in magnitude and of the same sign as a
delay results in a dissymmetry in the potential single electron, and its chemical equivalent
field around the ion. This also causes a lowering weight is equal to its atomic weight, if an ele-
in the conductance of the solution. These ef- ment, or to its molecular weight, if a radical. A
fects become more pronounced as the concen- trivalent ion has +3 or - 3 electronic charges,
tration of the solution is increased; for dilute depending on whether it is a positive or a nega-
solutions, below about 0.1 molal, the equivalent tive trivalent ion. For trivalent ions, then, the
conductance decreases with the square root of equivalent weight would be equal to its atomic
the concentration. For more concentrated solu- weight, if an element, or to its molecular weight,
tions, the relation between conductivity and if a radical, divided by three.
concentration is much more complex and de- The quantity of electricity required to pro-
pends more specifically on individual solute duce a gram-equivalent weight of chemical
properties. change is known as the faraday. A faraday cor-
lii:terionic attraction in dilute solutions also responds, then, to an Avogadro number of
leads to an effective ionic concentration or ac- charges. The most accurate determination of the
tivity which is less than the stoichiometric value. faraday has been made by a silver-perchloric
The activity of an ion species is its thermody- acid coulometer in which the amount of silver
namic concentration, i.e., the ion concentration electrolytically dissolved in an aqueous solution
corrected for the deviation from ideal behavior. of perchloric acid is measured. This method
For dilute solutions the activity of ions is less gives 96487 coulombs (or ampere-seconds) per
than one, for concentrated solutions it may be gram-equivalent for the faraday on the unified
greater than one. It is the ionic activity that is e l2 scale of atomic weights.
used in expressing the variation of electrode po- The electrochemical equivalent or, preferably,
tentials, and other electrochemical phenomena, the coulomb equivalent of an element or radical
with composition. is that weight in grams which is equivalent to
When electricity passes through a circuit con- one coulomb of electricity and is given by the
sisting of both types of electrical conductors, a gram-equivalent weight divided by the faraday
chemical reaction always occurs at their inter- (96487 coulombs per gram-equivalent); for ex-
face. These reactions are electrochemical. When ample, the electrochemical equivalent of silver
electrons flow from the electrolytic conductor, is given by 107.870/96487 or 0.00111797 g/cou-
oxidation occurs at the interface while reduc- lomb where 107.870 is the atomic weight of
tion occurs if electrons flow in the opposite di- silver based on the unified e l2 scale. The elec-
rection. These electronic-electrolytic interfaces trochemical equivalents of other elements may
are referred to as electrodes; those at which be calculated in like fashion.
oxidation occurs are known as anodes and those In electrolysis and in any electric cell or bat-
at which reduction occurs, as cathodes. An anode tery, there is an electromotive force (emf) or
is also defined as that electrode by which "con- voltage across the terminals. This emf is ex-
ventional" current enters an electrolytic solu- pressed in the practical unit, the volt, which is
tion, a cathode as that electrode by which "con- equal to the electromagnetic unit in the meter-
ventional" current leaves. Positive ions, for kilogram-second system. In anyone cell, the
example, ions of hydrogen and the metals, are emf is the sum of the potentials of the two elec-
called cations while negative ions, for example, trodes and of any liquid-junction potentials that
acid radicals and ions of nonmetals are called may be present. Neither of the individual elec-
anions. trode potentials can be evaluated without refer-
In 1833, Michael Faraday enunciated two ence to a chosen reference electrode of assigned
laws of electrolysis which give the relation be- value. For this purpose, the hydrogen electrode
tween chemical changes and the product of the has been universally adopted and is arbitrarily
343 ELECTROCHEMISTR Y

assigned a zero potential for all temperatures surface may differ from that in the bulk of the
when the hydrogen ion is at unit activity and solution). Also for high current densities, espe-
the hydrogen gas is at atmospheric pressure. A cially for the generation of gases such as hydro-
hydrogen electrode consists of a stream of hy- gen, oxygen or chlorine, the voltage required
drogen gas bubbling over platinized platinum or exceeds the reversible voltage; the excess voltage
gold foil and immersed in a solution containing is known as overvoltage, or overpotential for a
hydrogen ions; the electrochemical reaction is: single electrode, and arises from energy barriers
1/2H2 (gas) = H+(solution) + €, where € repre- at the electrode. Overpotential, in general, in-
sents the electron. The potential of the hydro- creases logarithmically with an increase in cur-
gen electrode, EH , as a function of hydrogen- rent density.
ion concentration and hydrogen-gas pressure is The electrochemistry of the solid state is
given by similar in many ways to that of aqueous solu-
tions. The basic problems of ionic crystals
EH =E H o - (RT/nF) In (aH+/PH,1/2) can be treated in many cases by analogy to
aqueous solutions. Differences do occur. In
=EHo - (RT/nF) In ( cH+fH+/PH, 112). silver halides, for example, practically only
cations are mobile, while in lead chloride and
where E H o is the standard quantity assigned a bromide only anionic conduction occurs. Mixed
value of zero R is the gas constant, T the abso- conductance, in that both anions and cations
lute temperature, n the number of equivalents, migrate, is exhibited by alkali halides and lead
F the faraday, PH, the pressure of hydrogen, iodide. Other solids at temperatures below
and aH+, cH+ and fH+, respectively, the activity, about 600°C, cuprous oxide for example, show
concentration, and activity coefficient of hy- only electronic migration, while above this
drogen ions. When aH+ and PH,1I2 equal one, temperature they exhibit in addition some
cationic conduction. Ionic conduction in the
EH =EHo. For very dilute solutions below 0.01 solid state is attributed superordinately to lattice
molal fH+ may be taken as unity without ap- defects. Even for pure substances lattice defects
preciable error. which may contribute to the electrical con-
The standard potentials, EO, of other elec- ductivity may be due to interstitial cations,
trodes are obtained by direct or indirect com- cation vacancies, interstitial anions, anion
parison with the hydrogen electrode. Values vacancies, excess electrons, or electron holes.
thus obtained at 25°C for some typical elements Conduction occurs, for instance, when a cation
are listed in Table 1. in one interstitial site jumps to an adjacent
The reducing power of the elements decreases interstitial site. Lattice defects in pure sub-
on going down the column. These values are for stances may be artificially produced by trace
the ions at unit activity, and reversible or ther- doping with an unlike substance of proper
modynamic values as a function of metal or characteristics. When, for example, silicon, a
radical concentration are given by equations semiconductor with four valence electrons, is
similar to the one above. For the general reac- doped with a five-valence-electron element,
tion: M = Mn+ + ne, the potential is given by such as antimony, arsenic, or phosphorus,
EM =EMo - (RT/nF) In aMn+. loosely bound electrons are introduced into the
In electrolysis, at very low current densities, lattice of silicon which are free to migrate lead-
the potentials of the electrodes approximate in ing to electrical conduction. Such a semicon-
magnitude their reversible values and deviate ductor is known as an n-type. A p-type
somewhat from these values because of an IR semiconductor is produced if silicon is doped
drop in the solution and possible concentration with a three-valence-electron element, such as
polarization (the concentration at the electrode boron, indium, or gallium, whereby a hole in

TABLE 1. SOME STANDARD ELECTRODE POTENTIALS AT 25°C

Potential Potential
Electrode (V) Electrode (V)

Li = Li+ +E -3.045 Cu = Cu++ + 2E +0.337


Ca = Ca++ + 2E -2.87 Cu = Cu+ + IE +0.521
Na = Na+ +f -2.714 2I- = I2 + 2E +0.536
Mg = Mg++ +2 e -2.37 2Hg = Hg 2++ + 2e +0.789
AI = AI+++ + 3e -1.66 Ag= Ag+ +e +0.799
Mn = Mn++ + 2e -1.18 Pd = Pd++ + 2e +0.987
Zn = Zn++ + 2e -0.763 Pt = Pt++ + 2E +1.20
Fe = Fe++ + 2e -0.440 2CI- = el 2 + 2e +1.36
Ni = Ni++ + 2e -0.250 Au = Au+ +E +1.68
H2 = 2H+ + 2e 0.000 2F- = F2 + 2E +2.87
ELECTROCHEMISTRY 344

the silicon lattice is produced which can move Cross-references: BATTERIES; CDNDUCTIVITY, ELEC-
about and impart conductivity. TRICAL; DIELECTRIC THEORY; ELECTRICAL
The passage of electricity through gases is DISCHARGES IN GASES; IONIZATION; MOLECU-
sometimes included under electrochemistry. LAR WEIGHT; POTENTIAL; VISCOSITY.
However, in electrical discharges in gases, the
principles are entirely different from what they
are in the electrolysis of electrolytic solutions. ELECTROLUMINESCENCE
Whereas in the latter, ionic dissociation occurs
spontaneously as a result of forces between sol- Electroluminescence is a process which generates
vent and solute and without the application of light in crystals by conversion of energy supplied
an external field, for gases relatively high volt- by electric contacts, in the absence of incan-
ages must be applied to accelerate the electrons descence, cathodo- or photoluminescence.
from the electrode to a velocity at which they It occurs in several forms. The first observa-
can ionize the gas molecules they strike. In this tion of the presently most important form,
case, the resulting chemical reaction taking place "radiative recombination in p-n junctions," was
between ions, free radicals, and molecules oc- made in 1907 by Round, then more thoroughly
curs in the gas phase and not at the electrodes by Lossev from 1923 on, when point electrodes
as in the electrolysis of solutions. Studies of the were placed on certain silicon carbide crystals
electrical conduction of gases, accordingly, are and current passed through them. Explanation
generally considered under the physics of gases. and improvement of this effect became possible
In addition to the above topics, it is frequently only after the development of modern solid-
customary to include under electrochemistry: state science since 1947.
(1) processes for which the net reaction is If minority carriers are injected into a semi-
physical transfer, e.g., concentration cells; (2) conductor, i.e., electrons are injected into a p-
electrokinetic phenomena, e.g., electrophoresis, type material, or "holes" into n-type material,
electroosmosis, streaming potential; (3) proper- they recombine with the majority carriers
ties of electrolytic solutions if determined by either directly via the bandgap, or through ex~
electrochemical or other means, e.g., activity citon states, or via impurity levels within the
coefficients and hydrogen-ion concentration; bandgap, thereby emitting the recombination
(4) processes in which electrical energy is first energy as photons. Part of the recombinations
converted to heat which in turn causes a chemi- occur nonradiatively, producing only heat.
cal reaction to occur that would not do so spon- Exploitation of the effect was strongly de-
taneously at ordinary temperature. The first pendent on progress in compound semicon-
three are frequently considered a portion of ductor crystal preparation and solid-state elec-
physical chemistry, and the last one is a part of tronics, since crystal perfection (absence of
electrothermics or electrometallurgy. defects) is of prime importance. At present,
Electrochemistry finds wide application. In single-crystalline dome-shaped p-n diodes made
addition to industrial electrolytic processes, of gallium arsenide, GaAs, a "III-V compound"
electroplating, and the manufacture and use of (from groups III and V of the periodic system),
batteries already mentioned, the principles of yield the highest efficiencies (40 percent of the
electrochemistry are used in chemical analysis, electrical power input converted into optical
e.g., polarography, and electrometric or con- power output) in the near-infrared, and diodes
ductometric titrations; in chemical synthesis, made of gallium phosphide, GaP, doped with
e.g., dyestuffs, fertilizers, plastics, insecticides; oxygen and zinc, yield red and infrared light,
in biology and medicine, e.g., electrophoretic efficiency peaked at 10 percent. GaP diodes
separation of proteins, membrane potentials; in doped with nitrogen emit green light, with
metallurgy, e.g., corrosion prevention, electrore- .5 percent efficiency. Very important are alloys
fining; and in electricity, e.g., electrolytic recti- suchasInxGal-XP, AlxGal-xAs, and especially
fiers, electrolytic capacitors, Josephson junc- GaAsXPI-X (red-emitting), where the color of
tions. luminescence can be changed by changing the
WALTER J. HAMER composition. For high efficiency, it is important
that the material have a "direct" bandgap, allow-
References
Dubpernell, George, and Westbrook, J. H. (Eds.),
"Selected Topics in the History of Electrochem-
istry," Proceedings Volume 78-6, Princeton, N.J.,
The Electrochemical Society, 1978.
Semiconductor
Bard, A. J., (Ed.), "Encyclopedia of Electrochemistry
of the Elements," Vol. 14, New York, Marcel
Dekker Inc., 1980. Junction
Bard, A. J., and Faulker, 1. R., "Electrochemical
Methods: Fundamentals and Applications," New
York, John Wiley & Sons, 1980.
Rand, D. A. J., Power, G. P., and Ritchie, I. M. (Eds.),
"Progress in Electrochemistry," New York, Elsevier Contacts
Scientific Publishing Co., 1981. FIG. 1. Light-emitting diode.
345 ELECTROLUMINESCENCE

ing electron-hole recombination without pho-


non participation. Wavelengths of light emitted
by III-V crystals range from 6300 A to 30 f.L.
An important phenomenon, "injection laser
action," was discovered in GaAs diodes in 1962.
The crystal faces at the ends of a: p-n junction
are made optically parallel so as to form a
Fabry-Perot optical cavity. Beyond a certain in-
jection current density, (the "threshold cur- FIG. 3. AC-driven thin-film electroluminescent de-
rent") the individual recombination processes vice. (1) Glass substrate; (2) transp . condo In203-Sn
no longer occur randomly and independently of front electrode; (3) insuL fJ.!m of high permittivity;
each other, but in phase, so that a near-parallel (4) ZnS : Mn EL film, ca. 500 nm thick; (5) black back
beam of coherent light ("'9000 A) of enormous electrode.
intensity (l0 7 W/cm 2 in pulsed operation) is
emitted. The efficiency has been improved by
using graded bandgap AlxGal-X As heterojunc-
tions and special doping profiles so that the inhomogeneously, in the form of two sets of
lasing region near the p-n junction acts as a comet-like striations which light up alternat-
"light pipe," preventing light straying out side- ingly, each set once per cycle. These comets
ways. AlAs has the same lattice constant as coincide with long, thin conducting copper
GaAs, yet higher bandgap. Therefore, these sulfide precipitates which form along crystal
junctions are free of strain-generated imper- imperfections. The applied field relaxes in these
fections. The current threshold for lasing is now needles and concentrates at the tips, so that
reduced to 200 A/cm 2 at room temperature, electrons and holes are alternatingly field-
allowing continuous operation at low total emitted into the surrounding insulating lumines-
power, to prevent heating. cent ZnS. The holes are trapped there until
These coherent or incoherent electrolumines- they recombine with the more mobile electrons,
cent p-n diodes are small point sources used emitting the typical luminescent spectra. The
for pilot lights, alpha-numeric readouts, opto·· local brightness can be as high as lOS foot-
electronic processing, ranging systems, direct- lamberts.
sight communication, and as IR-lamps for night Instead of these powder particles embedded
vision devices. in an insulating resin one also uses contiguous,
Another kind of electroluminescence, dis- vacuum-evaporated or sputtered or epitaxially
covered by Destriau in 1936, uses inexpensive grown ZnS: Mn films sandwiched between
powders consisting of small particles of essen- similarly prepared inorganic insulator films
tially copper-doped zinc sulfide, ZnS, a II-VI (Fig. 3). With 150 volt , 5000 cps applied, yel-
compound, embedded in an insulating resin and low light emission (brightness up to 3000 fL)
formed into a large flat plate capacitor with one takes place twice per cycle in this ZnS: Mn
plate transparent (e.g., Sn02-coated glass). If film , due to multiple avalanche breakdown in
an ac voltage is applied, light is emitted (blue, close-spaced channels. The current is capac-
green, red, depending on the exact material itively limited by the insulating films. The
composition) twice per cycle, with brightnesses manganese centers are excited by impact with
up to thousands of foot-lamberts. Brightness field-accelerated avalanche electrons. The op-
increases linearly with drive frequency, and ex- tical contrast of these transparent film struc-
ponentially with voltage, until saturation occurs. tures is increased by using black back electrodes
The efficiency is about 1 percent but it de- which absorb the ambient light so that the
creases with increasing brightness. addressed segments stand out bright against
Microscopic examination of the interior of an the black background even in sunlight. The
individual particle reveals that the light is emitted operational longevity of these film structures is

G al ss Coated Phosphor Aluminum


with SnOZ Mixed with Baek E el et rode
Resin
FIG. 2. Destriau-type EL paneL
ELECTROLUMINESCENCE 346

very good, probably owing to the absence of


copper doping.
Low voltage dc-driven-electroluminescence of
polycrystalline films of ZnS on glass, doped
with copper and manganese, has also been
achieved. The mechanism involves high-field-
aided hole injection. The operation with DC
simplifies the addressing of multi-element dis-
play panels. Longevity is still not satisfactory. FIG. 5. DC-driven ftlm EL cell. (1) Glass substrate;
However, operation with low-voltage DC in- (2) transp. condo front electrode; (3) n-type conduct-
stead of high-voltage AC would be very desir- ing II-VI material ftlm; (4) black resistive layer, high
able to simplify the addressing of multi-element work function; (5) metallic back electrode.
displays.
Simpler to produce in large areas are DC-EL
layers consisting of ZnS: Mn particles coated
p-type cermet layer which provides multiple
with conducting copper sulfide. They are held
together by a minimum of organic binder so hole-injecting contacts, current limitation, and
that there are many random interparticle con- black background. Such EL area sources are of
tacts. During a "forming" process, the copper great interest for use in luminous dials, instru-
ment faces, alphanumeric computer readouts,
sulfide diffuses away from the ZnS particle
and ultimately in flat image display panels
surfaces that get the warmest, creating a high
voltage drop there, with the concomitant (Fig. 5).
appearance of electroluminescence owing to A. G. FISCHER
avalanche injection at conducting spikes. The
copper-sulfide-coated particles before and be-
hind the light-emitting ones act as current- References
limiting series resistors to prevent local break- Fischer, A. G., "Electroluminescence in II-VI Com-
down. This is a requirement for all DC-EL pounds," in "Luminescence of Inorganic Solids,"
devices, just as the capacitive current limitation P. Goldberg, Ed., New York, Academic Press, 1966
is a requirement for all AC-EL devices. By using (pp. 541-602).
rare earth doping instead of manganese, and Dean, P. J., "Junction Electroluminescence," in "Ap-
CaS host material instead of ZnS, blue, green plied Solid State Science," R. Wolfe and C. J. Kriess-
and red emission has been achieved also (Fig. man, Eds., New York, Academic Press, 1969 (pp.
4). 1-151.
The latest advance is a combination of the Bergh, A. A., and Dean, P. J., Proc. IEEE, 60, 156-
most desirable features such as low DC voltage 223 (1972) (LEOs).
operation, and high contrast in bright ambient
due to black back electrodes. This requires Cross-references: LIGHT:
LUMINESCENCE; OP-
film structures, not powder layers. It consists TICS, GEOMETRICAL; OPTICS, PHYSICAL; VI-
of an n-type conducting, luminescent film of SION AND THE EYE; COLOR.
II-VI material on a transparent conducting
front glass substrate, backed by a resistive

ELECTROMAGNETIC THEORY
The task of electromagnetic theory is to account
for the effects of electrical charges in various
states of motion. Although historically electro-
magnetic theory was developed from Coulomb's
celebrated law, it is at present more ,economic
to develop it differently,11 ,12 The macroscopic
effects are described with remarkable accuracy
by the following set of equations (rationalized
mks system of units)

F = qE +qv X B (1)
op
FIG. 4. DC-driven powder EL layer. (1) Glass sub- V·J+-=O (2)
strate; (2) condo transp. front electrode; (3) sparsely at
applied organic binder; (4) densely packed small
ZnS: Mn,Cu powder particles, Cuz -x S coated; (5) alu- VXH=-+J
aD (3)
minum or silver paint back electrode. at
347 ELECTROMAGNETIC THEORY

aB Magnetic polarization effects are handled simi-


'VXE=-- (4) larly by defining the field vector B so that it in-
at cludes the magnetic dipole moment density M,
D=/I(E) (5) B = Mo (H + M). The material permeability is then
introduced so that it depends upon the magnetic
B = /2(H) (6) susceptibility analogously, and B = MH . Effects
of conductors are represented by a material
J =h(E, H) (7) conductivity a, such that J e = aE. With these
simple forms for Eq. (5), (6) and (7), Eq. (3)
provided the functional relationships indicated
in Eqs. (5), (6), and (7) are known explicitly. and (4) take on the useful form
With these equations and the laws of mechanics,
classical electromagnetic theory becomes essen- aE
'V X H = € - + aE + J 1 (8)
tially a branch of applied mathematics. at
Equation (I), sometimes known as the Lorentz
force equation, defines the field quantities, E, aH
'V X E =- M- (9)
the electric field intensity, and B, the magnetic ;jt
induction, in terms of an observable, the force
F on a charge q. In Eq. (I), v is the velocity of provided Mand € are constant in time. The term
the charge relative to the observer. Equation (2) J 1 here includes currents arising from charges in
is a statement of the law of conservation of free space plus any (source) currents which are
electric charge in terms of the charge density p independent of E and H. If there are no free
and the total current density J. Equation (3) is charges in the region, J 1 includes only the source
the differential form of Ampere's law, currents; these latter are known, so Eq. (8) and
(9) may be solved for E and H. Since the equa-
£ H ° dl =fo r J ° dS = I
tions are partial differential equations, boundary
conditions over closed surfaces are required for
Ie ofs Js unique solutions. Boundary conditions on the
field quantities, which must hold 'at any bound-
which relates the magnetic field intensity H to ary between two regions, may be derived from
the current, but includes also the displace- these equations. The conditions are: across a
ment current density term aD/at, which was boundary (a) tangential E must be continuous,
added by Maxwell to make the law applicable (b) tangential H must be continuous, (c) normal
to time-varying fields. The term J represents the D and normal B must be continuous. Idealiza-
total current density. Equation (4) is the differ- tions of material properties are sometimes help-
ential form of Faraday's law of electromagnetic ful. For example, a perfect conductor has no
induction. Equations (5), (6), and (7) are func- non-static fields inside it, and at its surface,
tional relationships, for the most part de- tangential E and normal B are zero, tangential
termined experimentally, by means of which H is equal and perpendicular to any surface
the effects of different materials are accounted current density, and normal D is equal to any
for. Mathematically, these equations are em- surface charge density.
ployed to reduce Eq. (3) and (4) to a pair of Two additional equations, especially useful in
equations in only two unknowns. In free space, static problems, may be deduced from Eq. (2),
Eq. (5), (6) and (7) take their simplest form, (3) and (4):
respectively, D = €o E, B = MoH, J = 0 (or J =J s ,
a source current independent of E and H), 'V°D=p (10)
where €o and Mo are constants whose value
depends on the system of units (in the mks 'V ° B =0 (11)
system €o = 8.854 X 10- 12 farad/meter, Mo =
47T X 10- 7 henry/meter). Since matter itself is a Solutions to the field equations are most
relatively dilute collection of charged particles, readily obtained by imposing a restriction on
it is always theoretically possible to define the time dependence. If the fields are assumed
terms so that the theory is a description of the to be independent of time (static), then Eq. (3)
effects and interactions of charges in free space, and (4) or (8) and (9) decouple. One of the
equations becomes 'V X E = O. This means that
with consequently no essential distinction .be- E is irrotational and may be represented by a
tween D and E or between, Band H, as indi- scalar potential function cp, E = - 'Vcp. Combining
cated above. In practice however effects of this with Eq. (10) gives the fundamental equa-
materials are usually best handled in another tion of electrostatics,
way.3,8,9,lO,1l,12 Dielectric polarization effects
are accounted for by making the D vector in- 'V 2 cp=-p/€ (2)
clude the electric dipole moment density P, D =
€o E + P, and then introducing a material con- Poisson's equation. This equation for the elec-
stant, the permittivity €, such that D = €E. The trostatic potential is solved by the standard
relative permittivity of a dielectric material is methods of partial differential equations. The
then equal to one plus the electric susceptibility. boundary conditions on the potential may be
ELECTROMAGNETIC THEORY 348

found from the boundary conditions on the many magnetosratic problems. Unfortunately,
fields. 1l ,12 In practice, it is frequently necessary however, in practice many of the systems de-
to solve for the potential and electric field in a signed to establish a given magnetic field in-
restricted region in which the charge density is corporate ferromagnetic materials. For such
zero, but the potential at the boundary is held materials, the magnetic susceptibility (and hence
at some particular value(s). The problem then the permeability) is not independent of the
is to solve Laplace's equation, 11 2 l/> = 0, subject field intensity and the field equations become
to the stated boundary conditions. The stan- nonlinear. Present mathematical techniques for
dard techniques for solving boundary value handling nonlinear problems are severely limited.
problems are employed. However, if the region Practical magnetostatic problems are, therefore,
of interest is partially open, known analytical frequently solved by some approximation. One
techniques are sometimes inadequate to solve of the simplest and most useful approximations
the problem. In two-dimensional problems of is a representation by a magnetic circuit. 8 ,10,12
such a difficult type, the method of conformal Series and parallel branches of the magnetic
transformations (conjugate functions) is often circuit may be recognized, and the techniques
helpfu1. 10 ,12 The widespread availability of high of linear and nonlinear circuit analysis can
speed digital computers having large memories be applied to obtain a solution. When more
has made it feasible to obtain acceptable solu- accurate solutions are required, digital compu-
tions to a myriad of problems which previously tational techniques of the types mentioned
could not be handled. The most useful general above for electrostatic problems are employed
methods are called the finite difference method widely.1,2
and the finite element method. 1 ,2 The finite Magnetostatic theory is applicable to a myriad
difference method essentially determines the of magnetic devices, including deflection sys-
potential at a finite number of points (the grid) tems, motors, generators, relays, magnetic
by means of a numerical solution of the ap- pickup devices, permanent magnets, memories,
propriate differential equation. The finite ele- transducers, coils, magnetic containment and
ment method divides the region into element suspension systems. Although, historically, the
spaces and finds a solution for each element need for particular solutions arose before sound
space via a matrix solution of an integral computational techniques were available, earlier
equation obtained from an energy or a varia- empirical techniques are being supplemented
tional constraint. and replaced by numerical solutions.
The main applications of electrostatic theory Energy is required to establish electric and
are in (a) the theory of material properties, magnetic fields, and such energy is associated
(b) the calculation of charged particle trajec- with the fields. The field energy in a given
tories in electron guns, deflection systems, and volume may be computed in most cases from a
accelerators (here in conjunction with mag- volume integral of one or both of tfe following
netostatic theory), (c) the calculation of circuit energy density expressions We = '2eE2, Wm =
component values, such as capacitance, and tpH-", respectively the electrostatic and mag-
(d) the determination of potentials, poten- netostatic values.
tial gradients, and induced currents in con- When the fields are time varying, Eq. (8) and
nection with insulation breakdowns and safety (9) are coupled and must be solved simulta-
questions. 14 neously. Almost invariably, a potential function
Magnetostatic theory is developed from Eq. such as a vector potential or a Hertz potential
(11) and (8). Since B is divergenceless, it can be is introduced. 1l ,12 For example, Eq. (11) im-
represented by the curl of a vector A, which is plies that B may be replaced by a vector po-
known as the magnetic vector potential. Equa- tential such that B = 11 X A. Equation (9)
tion (8) can usually be written in terms of this then suggests the convenience of thinking of
potential as follows: the electric field as being the sum of two parts:
112 A = -Ill (13) aA
E=-IIl/>- Tt· (15)
Taken one rectangular component at a time,
this equation is of the same form as Poisson's Then Hand E may be replaced in Eq. (8), and
equation [Eq. (12)] and may be solved in the with the condition on A, 11 . A = Il€al/>/at, the
same way. The boundary conditions on A may foltowing equations may be obtained for A and
be found from those on Band H. In regions l/> (0 assumed zero here)
with no current, Eq. (8) becomes 11 X H = 0 so
that H may be represented-by a scalar potential a2 A = -Ill
112 A - Il€ - (16)
function H = -11 l/>m. In such regions then, in
view of Eq. (11), the magnetic scalar potential, at 2
l/>m, must satisfy Laplace's equation a2 l/>
11 2 l/> - Il€ at 2 = -pte (17)
(14)
That is, both the vector potential A and the
provided 1111 = 0 in the region. The techniques scalar potential l/> satisfy a differential equation
and solutions of electrostatics are applicable to known as the inhomogeneous wave equation.
349 ELECTROMAGNETIC THEORY

Because of their simplicity and practical im- where I!:1z, the current moment, is equal to
portance, solutions for those sources and fields wq!:1z, the maximum dipole moment of the
which simply oscillate at a single frequency oscillating dipole. The factor (t - ...;p:;eor)
have been studied extensively.7,1l,12 In this exhibits the time delay required for the effects
case, the time is eliminated as an independent of the oscillating charges to propagate to distant
variable, as if by a transform operation. (In fact, points. The electric and magnetic fields may be
transform methods are often the best means of computed from Eq. (20) as indicated above.
obtaining transient field solutions.) In the equa- The magnetic field strength produced by an
tions, the time derivatives are replaced by fre- oscillating dipole (point current) is, for example,
quency multipliers so that the resulting equa- in the spherical coordinate system (r, !p) e,
tions are functions of the space variables only.
The vector potential may then be found by w(t - VJ.1.oEor)
standard techniques of partial differential equa- H<{J =-I!:1z.sm e [cos
----'---2---'-.;......:.----'--
tions and boundary value problems. Having A, 4rr r
the field quantity B is found from B = 'i/ X A
and E is found from Eq. (8). In practice, a
theorem which can be derived from the field - w~
r . w(t - V. ~]
sm J.1.oEor)
equations, called the reciprocity theorem/,12
is often helpful. The theorem relates the fields
Ea and Eb produced respectively by a pair of This form, like Eq. (20), shows that the crests
current distributions Ja and Jb. The theorem is and valleys of the field oscillations are propa-
gated in spherical waves at the speed of light
v = (J.1.oEo )-1/2. The solution for a point current
may be employed in an integral similar to Eq.
(19) to find the vector potential of a more
complicated distribution of current. Such solu-
tions may also be employed to find the radiation
For example, if Jb is selected to be a point patterns and input impedances of antennas. 13
current at point P, directed along x (represented In practice, however, the distribution of currents
mathematically by a Dirac delta function), then on antenna structures is not known a priori, i.e.,
Equation (8), E ax (P) = JJJ Eb . Jadv, gives a the antenna currents are not truly source cur-
formula for the computation of the field due rents. High speed digital computers have made
to la which is equivalent to a superposition it possible to employ a numerical technique
integral involving a Greens function. related to the finite element method, known as
Perhaps the most fundamental problem of the method of moments,S for the determina-
electromagnetic theory is the determination of tion of the currents on antennas and scattering
the fields of a point charge, at rest, in oscilla- bodies of rather arbitrary shapes, given the
tion, or in some general state of motion. For a boundary and source conditions. Once these
point charge q, at rest in free space, the solu- currents have been determined, the fields and
tion may be obtained by solving Eq. (2) in impedances are then calculated as indicated
spherical coordinates. With the point charge at above.
the origin, symmetry conditions may be em- The potentials and fields produced by a
ployed to eliminate the angular variation, and charge moving in an arbitrary way may also be
the remaining differential equation in r can be obtained. 6,11 The results may be found in
solved subject to Eq. (0) to give I/>G = qJ4rrEor Stratton. 11 ,pp.47S-476
for the potential associated with the point In regions free of source currents and charges,
charge. A superposition integral the fields and potential satisfy the homogeneous
wave equation [for example Eq. (6) with J =

1/>= fff ::::r (9)


0]. Then one of the simpler solutions which
can be obtained is that of the plane electro-
magnetic wave. With appropriate orientation of
the rectangular coordinate system, the solutions
may then be employed to find the potentials show that plane waves may progress along z,
associated with more complicated distributions. with components as follows:
The field of an oscillating dipole, which is equi-
valent to a point alternating current, is also of
great interest. This solution may be obtained Ex = Eo cos w(t - VJ.1.oEoZ)
from Eq. (6) (single frequency version). If the
point current is directed along z, the z-com-
ponent of the vector potential may be found
by a procedure similar to that employed for a
point charge. The final result is
Hy = Eo -
V~
J.1.o
cos w(t - VJ.1.oEoZ)

where Eo is an arbitrary constant amplitude.


Note that E, H and the direction of propagation
I!:1z .~ are -all perpendicular to one another. The
AzG = - - cos w(t - VJ.1.oEor) (20)
4rrJ.1.or Poynting vector, S = E X H, points in the direc-
ELECTROMAGNETIC THEORY 350

tion of propagation. Moreover, the power carried 5. Harrington, R. F., "Field Computation by Mo-
through a closed surface by an electromagnetic ment Methods," New York, Macmillan, 1968.
field may be computed from a surface integral 6. Jackson, J. D., Oassical Electrodynamics," New
of the Poynting vector. York, John Wiley & Sons, 1962.
With single frequency fields in source free re- 7. Kong, J. A., "Theory of Electromagnetic Waves,"
gions, both Hand E can be represented by New York, John Wiley & Sons, 1975.
vector potentials,1l,12 HI = IJ X AI, E2 = IJ X 8. Panofsky, W., and Phillips, M., "Classical Elec-
A 2 , and moreover the coordinate systems may tricity and Magnetism," Reading, Mass., Addison-
be oriented so that Al and A2 each have a Wesley, 1955.
single component .12 In cylindrical systems, this 9. Peck, E. R., "Electricity and Magnetism," New
single component is commonly along z. HI is York, McGraw-Hill Book Co., 1953.
then transverse to z (TM) and the set of fields, 10. Smythe, W. R., "Static and Dynamic Electricity,"
E I , HI, derivable from AI, are called TM fields. Second edition, New York, McGraw-Hill Book
E2 is likewise transverse to z and the set of Co., 1950.
fields, E 2 , H 2 , derivable from A 2 , are called TE 11. Stratton, J. A., "Electromagnetic Theory," New
fields. This procedure is particularly helpful York, McGraw-Hill Book Co., 1941.
in problems involving transmission lines and 12. Weeks, W. L., "Electromagnetic Theory for Engi-
WAVEGUIDES and is developed in detail in neering Applications," New York, John Wiley &
Weeks.12, Ch. 4-6 Sons,1964.
Some of the most interesting and fundamental 13. Weeks, W. L., "Antenna Engineering," New York,
pro blems of electromagnetic theory are con- McGraw-Hill Book Co., 1968.
cerned with the scattering and diffraction of 14. Weeks, W. L., "Transmission and Distribution of
electromagnetic waves. 6 ,7,10-12 For example, Electric Energy," New York, Harper & Row,
exact solutions are availabl-e for the scattering 1981.
by cylinders and spheres, as w.ell as an infinitely
long slit. Approximate solutions are available Cross-references: ELECTRICITY, PLASMAS, PO-
for many other shapes. The methods are the TENTIAL, PROPAGATION OF ELECTROMAG-
analytical and numerical ones outlined above, ~ETIC WAVES, QUANTUM ELECTRODYNAMICS,
supplemented by generalizations of the prin- QUANTUM THEORY, RELATIVITY, STATIC
ciples of Huygens and Babinet. ELECTRICITY.
Another topic of wide interest is the nature
of fields in ionized gases or plasmas. The appli-
cations range from ionospheric propagation to
microwave devices to nuclear apparatus to
magneto-hydrodynamics to satellite re-entry ELECTRON
problems. The simplest theory for these effects The electron is the smallest known electrically
is developed from Eq. (3) and (4) (single fre- charged particle. Its existence and characteristics
quency version) by separating the ion current were inferred from many experiments clustered
term J e = pv from J, and employing Newton's in and around the last decade of the nineteenth
law to eliminate v in favor of E, H and what- century. In the 1830's, Faraday had tentatively
ever mechanical constraints are applicable 3 ,11,12 suggested that his experiments in ELECTRO-
(see PLASMAS). CHEMISTRY could be interpreted in terms of a
Effects peculiar to charges moving with very small unit of charge attached to ions. This
high velocities have not been included in this notion of individual "atoms of charge" was
discussion (see RELATIVITY THEORY). Quan- somewhat eclipsed, however, by the enormous
tum effects are also discussed elsewhere (see success of Maxwell's theory of electromagne-
QUANTUM ELECTRODYNAMICS and QUAN- tism, which was generally interpreted, by 1880,
TUM THEORY). as favoring a view that electrical phenomena
W. L. WEEKS were due to continuous charge distributions and
motions. G. Johnstone Stoney, in 1874, and
Helmholtz, in 1881, had suggested again an
atomic interpretation of electricity, but it was
References not until the brilliant experiments of Perrin, J.
1. Chari, M. K. V., and Silvester, P. P., "Finite Ele- J. Thomson, Zeeman, and others in the 1890's
ments in Electric and Magnetic Field Problems," that the concept of the electron received firm ex-
New York, John Wiley & Sons, 1980. perimental foundation. Later experiments and
2. Finlayson, B. A., "The Method of Weighted Re- theory (Millikan, Bohr, etc.) established the con-
siduals and Variational Principles," New York, stancy of the electronic charge and interwove
Academic Press, 1972. the concept of an electron of definite charge
3. Good, R. H., and Nelson, T. J., "Qassical Theory and mass into the basic structure of the atom.
of Electric and Magnetic Fields," New York, The Cathode Ray Controversy After the dis-
Academic Press, 1971. covery of the cathode ray in high-vacuum dis-
4. Hayt, W. H., "Engineering Electromagnetics," charge tubes by Plucker in 1858, there de-
New York, McGraw-Hill Book Company, 1981. veloped, with the experiments of Goldstein,
351 ELECl .ON

Crookes, Hertz, Lenard, and Schuster, a con- very precise, but later experiments of a similar
troversy over the nature of the rays. A pre- type gave values close to 1.76 X 1011 coulomb/
dominately German school held that the rays kg. More recent evaluations, drawing on mea-
were a peculiar form of electromagnetic rays. surements of many kinds, give elm = 1.7588 X
The British physicists thought they were nega- 1011 coulombs/kg as a 1982 value calculated
tively charged particles. The controversy pro- from values for e and m given under CON·
vides a classic "case history" of the typical STANTS, FUNDAMENTAL, in this book.
scientific controversy in which two quite differ- The Zeeman Effect In 1896 Zeeman discov-
ent models both explain most, but not all, of ered the broadening of spectral lines when a light
the observable facts. The proponents of each source was in a strong magnetic field. Experi-
model designed ingenious experiments, but in mental refinements of Zeeman and others, and
some cases were so trapped in their precon- theoretical work by Lorentz and Zeeman, per-
ceptions that they badly misinterpreted their mitted the interpretation of this effect as due to
observations. The Germans were especially im- the influence of the magnetic field on oscillating
pressed by the fact that the rays could go or orbiting negatively charged particles within
through thin foils-something no known parti- the light-emitting or absorbing atoms. From the
cles could do. The British were firm in pointing spectroscopic data, the ratio of charge to mass
out that the rays could be deflected by magnetic of these hypothetical particles, and the sign of
fields-something not possible with electro- their charge, could be shown to be equal to that
magnetic waves. Hertz, in what he thought was of cathode rays. The Zeeman effect thus pro-
a crucial experiment, was unable to detect de- vided the first experimental evidence that the
flection of the rays by electric fields, but this negative particles emitted by atoms when heated
very phenomenon was demonstrated by J. J. (Edison effect) or subject to high fields and/or
Thomson and made the basis for his conclusive ionic bombardment (cathode rays) or bom-
experiments that the rays had velocities less than barded by short-wavelength light (photo-
that of light. Thomson showed, further, that if electric effect) were, indeed, actual constituents
one assumed that the rays were composed of of the atoms and were probably responsible for
particles, then the particles had the same ratio of the emission and absorption of light.
charge to mass regardless of the cathode material The Charge on the Electron In the decade
or the nature of the residual gas. Perrin's classic following 1897, many different methods were
experiment, meanwhile, proved that the rays evolved for determination of ionic charges. Some
did indeed convey negative charge. In the decade methods depended upon measuring the total
between 1896 and 1906, Thomson and others charge of a number of ions used as nuclei for
showed that negatively charged particles from cloud droplet formation. Other methods were
sources other than cathode rays had the same more indirect-experiments, for example, which,
ratio of charge to mass: the negative particles combined with the kinetic theory of gases, could
emitted by hot filaments in the Edison effect, give crude values for avogadro's number, N (see
the beta rays emitted by some radioactive mater- MOLE CONCEPT). By dividing the Faraday con-
ials, and the negative particles emitted in the stant (the charge carried in electrolysis by ions
photoelectric effect that had so ironically been formed from one gram-atom of a univalent ele-
discovered by Heinrich Hertz in his great experi- ment) by N, one could determine the average
ment which demonstrated the electromagnetic charge per ion. Similarly, the constants in
rays predicted by Maxwell's equations. Planck's theory of blackbody radiation, when
Thomson's Determination o~ elm In 1897 evaluated experimentally, could provide a nu-
Thomson devised an apparatus in which he merical value for N, as could certain experiments
could deflect a beam of cathode rays with a in radioactivity. All such methods gave values
magnetic field of induction B and also with an of N of the order of 6 X 10 23 , and hence 1.6
electric field of strength E. If the fields are per- X 10- 19 coulomb for the ionic charge. None of
pendicular to each other, and to the original these methods measured individual charges;
path of the beam, and if they occupy the same strictly speaking, the value for the ionic charge
region, then (with proper polarities and mag- could be thought of only as an average value.
nitudes of fields) the electric force on the beam Millikan's experiments with single oil drops,
can equal the magnetic force, so that the beam beginning in 1906, provided a method for mea-
hits the same point on a fluorescent screen as suring extremely small charges with precision.
when no fields are applied. If e is the charge of a He was able to show that the charge on his
given particle, m its mass, and v its velocity, drops was always ne, with e = 1.60 X 10- 19
v = E/ B. Thus, velocities of typical cathode ray coulomb (modern value) and n a positive or
beams could be measured. If the magnetic field negative integer.
is used alone, and the radius of curvature R of He observed the motions of very small charged
the beam is measured, then one can equate oil drops in uniform vertical electric fields. The
centripetal and magnetic field forces: mv 2 /R = drops were so small that they moved with con-
Beu, and then deduce elm = v/BR. With v stant velocity (except for Brownian fluctuations)
known from the previous experiment, elm can for a given force. The force in each case was due
be calculated . Thomson's early values were not to gravity acting on the mass of the drop and to
ELECTRON 352

the electric field (if any) acting on the charge, q, One may speak of the "classical radius of the
on the drop. The charge on a given drop could electron," a = e 2 /me 2 , derived by setting the
be changed by shining x-rays upon it. Using self-energy of the coulomb field of a charge e
Stokes' Law, in a form modified to correct for contained at a radius a equal to the relativistic
the fact that the drops were not large in com- rest energy, me 2 of the electron. This a = 2.82
parison to the inhomogenieties of the surround- X 10- 13 cm, comfortably smaller than any atom,
ing air, and the velocity of a drop in free (gravi- but larger than the usual estimates of sizes of
tational )fall, Millikan could infer the diameter protons and neutrons.
and mass of a given drop, and then calculate its Positive Electrons Dirac's paper in 1928
charge. The charge q always equaled ne. (See could be interpreted as predicting the existence
reference 1 or 2 for experimental details.) A few of electrons that are positive. But until such
other physicists, in similar experiments, thought particles were found experimentally by C. D.
they had detected electric charges smaller than Anderson in 1932 in cloud chamber pictures of
Millikan's e, but their experimental techniques cosmic ray particle tracks, most physicists pre-
were probably faulty. ferred other interpretations of Dirac's paper.
Millikan's experiment did not prove, of course, Positive electrons, or POSITRONS are now known
that the charge on the cathode ray, beta ray, (1) to occur as decay products from cer-
photoelectric, or Zeeman particle was e. But if tain radioactive isotopes, (2) to be produced
we call all such particles electrons, and assume (paired with a negative electron) in certain inter-
that they have elm = 1.76 X lOll coulomb/kg, actions of high-energy gamma rays with intense
and e = 1.60 X 10- 19 coulomb (and hence m electric fields near nuclei, and (3) to be the pro-
= 9.1 X 10-31 kg), we find that they fit very well duct of certain decays of certain mesons. In
into Bohr's theory of the hydrogen atom and principle, positrons could form anti-atoms with
successive, more comprehensive atomic theories, nuclei made from anti-protons and anti-neu-
into Richardson's equations for thermionic emis- trons, but in practice almost all positrons pro-
sion, into Fermi's theory of beta decay, and so duced in ordinary matter quickly meet their end
on. In other words, a whole web of modern by annihilating themselves together with some
theory and experiment defines the electron. (The hapless negative electron. The end product of a
best current value of e (June, 1982) = 1.6028 positron-electron annihilation is a pair of gam-
X 10- 19 coulomb (see CONSTANTS, FUNDA- ma rays.
MENTAL). Recent Theoretical Developments The rela-
The Wave Nature of the Electron In 1924, tivistic quantum mechanical theory of the elec-
L. DeBroglie suggested that the behavior of tron, in its earlier forms, led to embarrassing
electrons within atoms could be better under- predictions of infinite electronic mass and
stood if it were assumed that the motion of an charge. Schwinger and others have developed
electron depends upon some sort of accompany- methods for coping with these infinities, so the
ing wave, the length of which would be hIp (h theory, in general, now satisfactorily agrees with
= Planck's constant and p the momentum of the
observations. Further developments are not un-
electron). This suggestion led to the develop- likely. (See QUANTUM ELECTRODYNAMICS).
ment of QUANTUM MECHANICS by Schr6dinger, Experiments and theory, first in radioactive
decay, and more recently in elementary particle
Heisenberg, and others. The concept of electron physics, have made it clear that electrons can be
waves provided an explanation for experiments thought of as members of a class of particles
on reflection of electron beams by metallic called "leptons." These particles, which include
crystals, carried out from 1921 onward by neutrinos (see NEUTRINO) and muons (see
Davisson and others, and provided an impetus ELEMENTARY PARTICLES) interact with each
for the experiments of G. P. Thomson on the other and with other particles in so-called "weak
diffraction of electron beams by thin films (see interactions," apart from whatever other reac-
ELECTRON DIFFRACTION). tions they may share, such as gravitational,
Other Characteristics of Electrons In apply- magnetic, and electrical interactions.
ing quantum mechanics to certain problems in Applications of Electrons Aside Trom -their
atomic spectroscopy, in 1925 and 1926, Pauli, inherent usefulness in physical theories of mag-
and Goudsmit and Uhlenbeck found that elec- netic, electrical, optical, and mechanical proper-
trons must possess angular momentum of amount ties of matter, electrons either in beams or in
± t(h/21T). Dirac's work on a generalized quan- conductors can be made to do all sorts of use-
tum theory of the electron showed that it ful things. Cathode ray oscilloscopes, electron
possessed a related magnetic dipole moment of microscopes, image converters, certain memory
magnitude eh/41Tme (see ELECTRON SPIN). The devices for computers, television picture tubes,
ratio of the dipole moment to the angular mo- and most "radio tubes" depend upon beams of
mentum (e/me) is larger than can be accounted electrons controlled by electric or magnetic
for in classical terms with any homogeneous fields (see ELECTRON OPTICS). In ordinary
wholly negative model. The concept of elec- metallic conductors, electricity is carried primar-
tronic dipole magnetic moment is essential not ily by electrons. The behavior of electrons in
only in spectroscopy but in theories of ferro- SEMICONDUCTORS and in superconductors (see
magnetism (see MAGNETISM). SUPERCONDUCTIVITY) has in recent years been
353 ELECTRON DIFFRACTION

the basis both of intense theoretical interest and and analytical techniques that have evolved
of interesting and useful devices. since 1927. These classic experiments were
performed with electrons having energies in the
DAVID L. ANDERSON vicinity of ISO eV and 15 keY, respectively.
De Broglie's relationship A = h/mv, where h is
References Planck's constant and A is the wavelength as-
sociated with a mass m traveling with a group
1. Millikan, R. A., "The Electron," edited with an
introduction by 1. W. M. DuMond, Pheonix, Science velocity v, reduces to A = Vi 50/ V for electrons
Series, PSS523, University of Chicago Press, 1963. in the nonrelativistic limit, where V is the ac-
2. Anderson, D. L., "The Discovery of the Electron," celerating voltage and A is expressed in Ang-
New York, Arno Press, 1981. stroms. The two experiments thus used elec-
3. Shankland, R. S., "Atomic and Nuclear Physics," trons having a wavelength of approximately
Second edition, New York, The Macmillian Com- 1 and 0.1 A. The longer wavelength is compa-
pany, 1960. rable to the spacing between atoms in crystals.
4. Condon, E. U., and Odishaw, H., "Handbook of While 50-100 keV electrons, which are used
Physics," p. 7-169, New York, McGraw-Hill Book in commercial electron diffraction instruments,
Co., Inc. 1958. penetrate to a depth of about 10 3 A into a
5. Borowitz, S., and Bornstein, 1. A., "A Contem- crystal, I SO eV electrons penetrate only about
porary View of Physics," Chapter 12, New York, loA. Since the higher-energy electrons are
McGraw-Hill Book Co., 1968. capable of passing through the several layers of
adsorbed foreign material that normally are
6. An excellent (although somewhat disputatious)
discussion of the electron will be found in "Evolu- present on the surface of a crystal, surface
tion of the Concept of the Elementary Charge," by cleanness and therefore the vacuum require-
L. Marton and C. Marton in Advances in Electronics
ments for 100 keY electron diffraction are not
and Electron Physics, 50,449-472 (1980). as stringent as those for low-energy electron
diffraction. This fact, in addition to the relative
ease in focusing intense high-voltage beams, and
Cross-references: ELECTROCHEMISTRY, ELEC- individual interests, resulted in the wide appli-
TRON DIFFRACTION, ELECTRON OPTICS, ELEC- cation of high-energy electron diffraction for
TRON SPIN, ELEMENTARY PARTICLES, MAG-
structure studies. A typical instrument of this
NETISM, MOLE CONCEPT, NEUTRINO, PHOTO-
ELECTRICITY, POSITRON, QUANTUM ELECTRO- type operates at about SO keY, has provisions
DYNAMICS, QUANTUM MECHANICS, SUPERCON- for producing and focusing the electrons, con-
DUCTIVITY, ZEEMAN AND STARK EFFECTS. tains specimen manipulators, means to record
the diffraction patterns and typically is con-
tained in a chamber capable of being evacuated
to 10-6 torr. Diffraction patterns are obtained
ELECTRON DIFFRACTION either by transmission of the beam through
very thin specimens or by working at grazing
The discovery of electron diffraction indepen- incidence and reflection.
dently by C. J. Davisson and L. H. Germer One common use of high-energy electron dif-
(I927) and G. P. Thomson (1927) verified fraction by transmission or reflection is the
L. de Broglie's earlier hypothesis (1924) that study of films on amorphous, polycrystalline,
matter exhibits both corpuscular and wavelike and single crystal substrates. This includes films
characteristics. This hypothesis served as a that have been formed by the oxidation or
stimulus for the formal development of quan- corrosion of a surface as well as those formed
tum mechanics by E. Schrodinger, M. Born, by the deposition of material on a substrate.
W. Heisenberg, and others. Following this In many instances these films consist of crystal-
momentous discovery, which eventually re- lites having an orientation that is related to the
sulted in the award of a Nobel Prize to Davisson structure and orientation of the substrate
and Thomson, electron diffraction was immedi- material.
ately utilized as a tool for the study of the The transmission ELECTRON MICROSCOPE
structure of matter. has been used widely for the study of atomic
Electron, x-ray and neutron diffraction are all arrangements at structural imperfections, such
used for structure studies. Electron diffraction as dislocations and stacking faults. Image con-
is used particularly for those structural studies trast is obtained by local differences in the
that involve small numbers of atoms. This is intensities of diffracted beams. In addition,
due to the strong interaction of electrons with many electron microscopes are constructed in
matter. Thus the principal area of application such a way that it is possible to obtain the dif-
of electron diffraction is for the study of thin fraction pattern associated with the material in
films, surfaces, gases, crystalline defects, and the area being studied. Considerable informa-
small samples. tion on the crystallography of domain struc-
The different energy ranges that were used in tures, order-disorder phenomena, and phase
the Davisson-Germer and Thomson experiments transitions has been obtained by this selective
provide a natural division for a description of area diffraction technique.
the types of equipment, areas of application, Electron diffraction in this high-energy range
ELECTRON DIFFRACTION 354

is also useful for the determination of the atomic cations of low-energy electron diffraction have
arrangements, bond distances, bond angles, and been made in studying crystal surface defects
mean square atomic vibrational amplitudes in such as steps, islands, and domain structures. It
gaseous molecules. is expected that work of this type will result in
Low-energy electron diffraction was used detailed understanding of surface phenomena
sparingly until approximately 1960. Improved such as bonding, catalysis, and corrosion.
diffraction equipment, which enabled the direct
display of the diffraction pattern on a fluores- ALFRED U. MAC RAE
cent screen by accelerating the diffracted elec-
trons after they had passed through grids, and References
the commercial availability of ultrahigh (10- 10
torr) vacuum equipment, resulted in a resurgence Cowley, John M., "Diffraction Physics," Amsterdam,
of interest in this field. The structure of clean North-Holland Publishing Co., 1975.
surfaces, the arrangements of foreign atoms on Amelinckx, S., Gevers, R., Van Landuyt, 1., "Diffrac-
these surfaces at a monolayer or less coverage, tion and Imaging Techniques in Material Science,"
and many aspects of the initial stages in the Amsterdam, North-Holland Publishing Co., 1978.
oriented overgrowth of thin films have been Van Hove, M. A., and Tong, S. Y., "Surface Crystal-
lography by LEED," New York, Springer-Verlag,
studied with electrons having energies in the
1979.
range of 2 to 10 3 eV. It has been revealed that
the atomic arrangement at the clean surfaces of
semiconductors such as germanium and silicon Cross-references: DIFFRACTION BY MATTER AND
is quite unlike that found in the bulk of these DIFFRACTION GRATINGS, ELECTRON, ELEC-
materials. At low coverages, foreign atoms are TRON MICROSCOPE, ELECTRON OPTICS.
normally adsorbed in structures that have a
symmetry and dimensions that are simply re-
lated to the orientation of the substrate plane.
A multitude of such structures has been found
on semiconductors and metals. Their atomic
array is dependent on many parameters, such as ELECTRON MICROSCOPE
the amount of adsorbed material, the tempera-
ture, orientation and cleanness of the substrate. Introduction The standard light microscope
This same equipment is also used to determine uses light in the visible range to produce a mag-
the presence of impurity atoms on the surface nified image. The electron microscope (EM)
by the technique of Auger spectroscopy. The uses electrons for magnification. Since electrons
energy of the secondary electrons is character- are easily absorbed in air at atmospheric pres-
istic of the surface atoms and can be used to sure the EM has to operate in a vacuum. The
make a quantitative identification of surface imaging electrons are usually accelerated to
impurities, which can assume an important role two between 30 and 100 kV. The EM magnifies in
in affecting surface structures. or three stages by means of electromagnetic
While electron diffraction techniques are or electrostatic lenses.
widely used, interpretation of much of the data Resolution In 1978 Ernst Abbe and, inde-
can be challenging. This is due to the strong pendently, H. von Helmholtz proved that in the
interaction of the incident and scattered elec- light microscope the resolution is limited by the
trons with the substrate atoms. It is often how wavelength of the illuminating light. No matter
necessary to apply dynamical scattering theory system perfect and free of aberrations the optical
to explain observations. While interpretation of not a pointis, the image of a geometrical point is
but a disc, the "Airy disc." Regard-
results has been very successful at the high ener- less of any further
gies of 100 ke V, it has been less successful in points cannot be resolvedmagnification, two separate
the low energy regime of 100 eV. Here, the as separate unless their
dependence of the scattered beams' intensity centers are the distance d apart, whereby d is
on incident electron energy is complex and is the radius of the Airy disc referred to the object
indicative of large and energy dependent scat- plane:
tering cross sections, multiple scattering, strong d = 0.5 A
absorption of incident and scattered electrons, (1)
and effects that are substrate dependent. How- n sin 0:
ever, considerable progress has been made in where
recent years in the understanding of these inter-
actions and in the development of computa- A =wavelength of the illuminating light
tional tools. This has led to the interpretation n = index of refraction of medium be-
of data to obtain surface atom spacings, surface tween object and lens (in air, n = 1)
atom vibrational amplitudes and relatively 0: = aperture of lens, i.e., half-angle of
simple surface structures. This has provided an collected light beam
improved understanding of chemical bonding at n sin 0: =NA = numerical aperture of lens (in
surfaces. Other important and successful appli- air, NAmax = 0.95).
355 ELECTRON MICROSCOPE

The wavelength of green light for which the eye step by step. They added a condenser lens and
is most sensitive is: built an iron shield with a narrow center gap
around their magnetic lens. Ruska, working
Agreen ~ 500 nm* (5000 A). from 1932 by himself, equipped the magnetic
lenses with narrow pole pieces and was able to
The resolution of the light microscope is, there- demonstrate in 1933 a resolution of 50 nm
fore: better than the best resolution obtainable with
d ~ 250 nm (2500 A). the light microscope (magnification 12,000X).4
Figure 1 shows a functional diagram of his
Even if we go to the extreme of using the ultra- supermicroscope. Figure 2 shows the details of
violet line of mercury (A = 253.7 nm) oil im- one of his lenses. t
mersion optics (NA = 1.4), quartz lenses and The first commercially available EM was de-
microphotography, the best resolution obtain- veloped by Ruska together with von Borries at
able with the light microscope is still: Siemens & Halske, Berlin, in 1939. The first
American commercial EM came out in 1941 by
d ~ 100 nm (1000 A) RCA.
Parallel to the development of the magnetic
The electron microscope makes use of the electron microscope went the development of
wave properties of the moving electron. Its "de an electrostatic one. In 1931, Bruche and Jo-
Broglie" wavelength is: hannson of the Research Institute of A.E.G. in
Berlin imaged the emitting surface of the cath-
A = ~ = 1.23 [nm) (2a) ode with an electrostatic immersion objective.
mu Vl/2 In 1932, they employed unipotential or Einzel-
for nonrelativistic electrons (below 100 kV) lenses.
Electron Lenses Magnetic Lenses. A charged
and particle entering a uniform magnetic field mov-
ing parallel to the lines of force will not be
deflected. It will move in a straight line. Moving
perpendicular to them it will describe a circle.
The radius of this circle, the "cyclotron radius,"
for relativistic electrons, where is:
h = Planck's constant m v
m = mass of electron P =-- (3)
e B.
v = velocity of electron
V = accelerating voltage in volts. where
For electrons of V = 50 kV the wavelength is m = mass of particle
e = charge of particle
ASOkV = 0.00535 nm v = velocity
B = magnetic field intensity (gauss).
Objective lenses for electrons, unlike those for
light optics, cannot be made free of spherical The circle described by an electron is
aberrations. They have to operate with numeri-
cal apertures that are 500 to 1000 times lower. -IV [cm]
Pe = 3.372 - (4a)
The best theoretical resolution for magnetic B
lenses working with electrons of 50 kV energy for nonrelativistic electrons,
is: 1
1
/)' = 0.21 nm Pe=-Yll.3V+1.11X IO- SV2[cm] (4b)
B
The best practical resolution is between 0.5 and for relativistic electrons,
I nm (5-10 A).
Early History The first electron microscope where V is in volts and B is in gauss.
was developed by M. Knoll and E. Ruska at the The time it takes a particle to describe a cy-
Technical University of Berlin early in 1931. 2 clotron circle is:
Ruska obtained the first magnification in two
stages by means of electromagnetic lenses on 21TP m 1
T = - = 21T - - [sec) (5)
7 April 1931. 3 The EM celebrated, therefore, v e B
its 50th anniversary on that date in 1981. The
greatest magnification obtained with this first
somewhat crude instrument was 17X. Knoll tThe two fIrst EM were destroyed during WW II.
and Ruska improved the electron microscope Ruska has recently constructed exact replicas of them
which were shown at various EM Congresses in Europe
*1 nanometer (nm) = 10-9 meter (m) = 1 millimi- and are scheduled to be permanently exhibited at
cron (mil) = 10 angstrom units (A). Deutsches Museum, Munich.
ELECTRON MICROSCOPE 356

CATHOOE

GAS OISCHARGE TUBE

CONOENSER LENS

OBJECT PLANE
OBJECTIVE LENS

PLANE OF INTERMEOIATE PICTURE

PROJECTOR LENS

VACUUM PUMP OUTLET


OBSERVATION WINOOW
FLUORESCENT SCREEN

FIG. 1. First Superm icroscope (Ruska 1934, Ref. 4).

IRON ARMOR

I
I
I
I
I
I
I
I
I em
..............-
FIG. 2. Magnetic Lens (Ruska 1934, Ref. 4).

A charged particle entering a uniform mag- point P' at a distance


netic field at an angle will describe a cycloid.
While its velocity component normal to the d = Vz (6)
T
field lines causes it to describe a circle, the .ve-
locity component parallel to the field lines re- with angles to the axis identical to those they
mains unchanged. Since the time required to had at point P (see Fig. 3).
describe a circle is independent of the normal A "long" magnetic lens, having a unitorm
velocity, all circles, large or small, are traversed magnetic field extending from the object to the
in the same time interval. In an electron beam image point, will form an upright picture of the
with low divergence where, therefore, the ve- object with an image to object ratio of I to 1.
locities parallel to the magnetic field (vz) are A "thin" magnetic lens is a lens with a mag-
identical, all electrons leaving one point P on netic field short compared to the object-to-
the axis will meet downstream at another image distance. A "weak" thin lens is a lens
357 ELECTRON MICROSCOPE

different potentials. They are important as


lenses for television or oscillograph cathode-ray
A. PIIOJFcrlON IN OII1FeT/ON
tubes. The lenses used in electrostatic transmis-
OF IIAr;NHIC FlEtO sion electron microscopes are usually unipoten-
tial or Einzel-lenses. They have the same poten-
tial on either side of the lens. They consist of
three apertures: the two outer apertures are at
ground or anode potential, and the center elec-
trode can have either a positive or negative po-

ClRECTIOfI OF
.AGNET IC
'no
I tential. Regardless of whether a positive or a
negative potential is applied, the lens will al-
ways be convergent. Figure 4(a) shows the equi-
potential lines of such a lens; Fig. 4(b) shows
the focal length vs VL/V o for this lens, where
VL is the voltage applied to the center electrode
8. PIIOJFCTIDN P£IIP£NOICfJLAR
TO IIA6N£TIC FlFLD
and Vo the cathode potential. Figure 5 shows
the design of a typical electrostatic lens.
The Standard Transmission Microscope (TEM)
The best known electron microscope is the
magnetic transmission microscope where the
image is formed by electrons which have passed
through the specimen. It is composed of the
following major sections: electron gun, conden-
p' ser lens, specimen chamber, objective lens,
projector lens, viewing and photographing
FIG. 3. Cycloids, described by electrons in uniform chamber. It has the following ancillary equip-
magnetic field. ment: (1) power supplies for high voltage,
cathode heater voltage, and focusing currents;
(2) vacuum systems.
with the focal length large compared to the
axial length of the magnetic field. The refrac-
tive power, i.e., the reciprocal of the focal
length, of such a lens is determined by:

(7)
It images according to the general optical equa-
tion:
1 1 1
-=-+- (8)
f a b
where f is the focal length, a is the object dis-
tance and b is the image distance. The picture (a)
is turned around from the position of the ob- 50
ject. The angle it is turned is determined by:

81 = OJ;! f +00

Bzdz [radians] (9)


z

~
~ 40

30 \

\
-00

--
The objective and projector lenses of the elec- ~
tron microscope require extremely short focal
lengths in order to obtain high magnifications
1 :
20

10
\

without going to extremely long microscopes. V "--


These lenses have, therefore, pole pieces which
limit the extent of the magnetic field, both in
-2
----I

the axial and radial dimensions. The treatment


of strong, thin magnetic lenses can be found in (b)
references 5 and 6.
Electrostatic Lenses Immersion lenses con- FIG. 4. Unipotential or Einzel-lens: (a) equipotential
sist of two apertures or two coaxial cylinders at lines (Ref. 7); (b) focaiiength vs VL/VO (Ref. 8).
ELECTRON MICROSCOPE 358

OUTER
LECTRODES

CEN R
ELECTRODE
, I I

2 3 4 em

FIG. 5. Typical electrostatic lens (see ref. 9).

The electron gun generates the electron beam . of interest. In many cases a number of dif-
which illuminates the object. It has to provide ferent specimens can be moved under the
the required electron density within a certain electron beam by rotating or sliding specimen
limited divergence. It consists of the cathode, holders. Some models have special facilities to
a hairpin tungsten filament, enclosed in a cup- keep the specimen at certain high or low
shaped electrode at cathode or a more negative temperatures.
potential called the grid, shield or Wehnelt The viewing chamber at the bottom of the
cylinder. The fine emitting point of the cathode microscope column contains the fine-grain fluo-
must be accurately aligned with the aperture rescent screen that can be observed through
of the grid. The negative grid bias against the glass windows either directly or through a tele-
cathode determines the beam current that scope. If a permanent record is desired, the flu-
passes through the grid. The "crossover point" orescent screen is moved aside and a photo-
which is located near the grid aperture has the graphic plate inside a plate holder is exposed.
smallest beam diameter inside the gun. While The electrons produce a latent image directly.
the anode facing the grid is at ground potential, Each plate can be removed from the vacuum
the cathode is maintained at a high negative separately. A film pack may be used instead of
potential, usually between 50 and 100 kV. plates. Rollfilm is used when a whole series of
The condenser lens or lenses increase the elec- pictures are to be taken.
tron density reaching the specimen by concen- In some cases when an extremely high magni-
trating the beam. An aperture in the condenser fication, for instance 100 000 times, is desired,
lens of 0.25 to 0.5 mm diameter reduces the it may pay to magnify electron-optically to a
amount of stray electrons reaching the somewhat lower magnification and to add a
specimen. final photographic enlargement later; the adjust-
The objective lens provides the first magnifi- ment of the electron mi~roscope is then much
cation. It is a strong, thin lens with a high re- easier, since the field of VIsion is so much larger.
fractive power. It has, therefore, narrow pre- The vacuum ~stem, generally maintaining a
cision pole pieces. An aperture of 25 to 1001l vacuum of 10- to 10- 5 torr (I torr = I mm
diameter is usually inserted in the gap of this Hg), consists usually of an oil diffusion pump
lens to limit the beam divergence. The speci- backed by a mechanical forepump. A second
men is mounted either close above or inside the mechanical pump is used to purge the specimen
magnetic gap. chamber and the photographic chamber before
The projector or image lens selects a small they are opened to the main vacuum column.
portion of the intermediate image produced by A water or liquid-nitrogen-cooled baffle is used
the objective lens and magnifies it again. A third to reduce backstreaming of oil vapor into the
electron-optical magnification is in many cases chamber.
produced by a second projector lens. The mag- While this type of vacuum system may be
netic lenses are usually wirewound coils. In satisfactory for a great number of applications,
some cases permanent magnets are used with or it proved to be unsatisfactory for more de-
without additional coils for fine adjustment. manding investigations. It is impossible to pre-
The specimen chamber is located above the vent completely the backstreaming of diffusion
objective lens. It can be opened to the outside pump oil into the system. The electron beam
for inserting new specimens. It is pumped out interacts with the oil molecules, causing hydro-
before reconnecting it to the main column. carbon "varnish" to be deposited on the speci-
Adjustments by means of micrometer screws men and on critical apertures. This limits the
permit shifting the specimen in the object exposure time and requires frequent cleaning of
plane in order to locate accurately the area apertures. Heating or cooling the specimen stage
359 ELECTRON MICROSCOPE

will improve the situation, but the vacuum may scattered electrons. The unscattered electrons
still not be satisfactory, especially in cases are those that did not interact with the atoms
where film depositions are studied. of the specimen. They traversed it without
When a dry vacuum is desired, getter ion deviating from their trajectory. The elastically
pumps or turbomolecular pumps are used, scattered electrons interacted with the nuclei of
often in conjunction with titanium sublimation atoms in their path. The interaction is especially
pumps. To bring the system down to the pres- strong with heavy nuclei. The number of elas-
sure where these pumps take over, rotary oil tically scattered electrons is proportional to the
pumps or liquid-nitrogen-cooled zeolite molec- j power of Z, the atomic number of the atoms.
ular sieve traps are used. The elastically scattered electrons did not lose
Scientists who converted their system from kinetic energy but changed their direction sig-
oil diffusion pumps to dry pumps report a nificantly. They are removed, to a large extent,
longer cathode life, reduced exposure time, by the limiting aperture in the gap of the ob-
sharper pictures, and practically the elimination jective lens. Inelastically scattered electrons did
of the varnish problem. They report a reduction interact with the electrons of the atoms in their
in varnish build-up from .5 to I nm sec- 1 down path. They emerge within a narrow angle but
to.l to .5 nm min-I. *) have given up some of their energy. Their num-
To reduce the contamination from elastomer ber is proportional to the j power of Z. Inelas-
o-rings, they are sometimes replaced in critical tically scattered electrons cause chromatic aber-
locations, for instance in the specimen chamber, rations. They are focused in a plane other than
by metal gaskets-usually oxygen-free copper the image plane of the unscattered electrons.
rings. Viton A o-rings are used for less critical They degrade the image to some extent.
seals. These o-rings have to be outgassed for Picture Enhancement. The microscope pic-
many hours in vacuum at a temperature of ture can be improved by means independent of
150 c C before they can be installed in the sys- the microscope. It can be intensified by using
tem. Photographic plates must be outgassed too closed-circuit television. In this case it is pro-
before they can be placed in the chamber. jected from the fluorescent screen onto the face
Rotating or sliding motion which is usually of a television pick-up tube and is, after amplifi-
accomplished by rods fed through O-ring seals cation, displayed on the face of a television pic-
or through greased glands can in critical cases ture tube. This can be accomplished, for in-
be done by means of metal bellows. stance, by using a fiber-optic window at the
Power Supplies. The voltage regulation re- end of the viewing chamber. Such a window
quired for the power supplies of the electro- makes it possible also to take contact photo-
static microscope is not very critical. As long as graphs of the fluorescent screen without bring-
the lens voltage and cathode potential maintain ing the photographic material into the vacuum.
the same linear ratio-very often they are iden- It is also feasible to feed the fluorescent image
tical-a good image is obtained. directly into a high-voltage image intensifier.
The refractive power of the magnetic lens de- The picture contrast and resolution can be im-
pends directly on the square of the magnetic proved using data correlation techniques (see,
flux, which is proportional to the lens current-if for example, Ref. 10). It can also be sharpened
the magnetic circuit is not saturated-and varies considerably by means of holographic technol-
inversely with the cathode potential [see Eq. ogy by using optical deburring filters.ll
(7) J. It is, therefore, paramount that all power Other Types of Transmission Electron Micro-
supplies be extremely well-regulated. scopes Electrostatic EM. Its development is as
Image Formation. In the light microscope old as the development of the magnetic electron
the image is formed by the difference in the microscope.1 2 It uses mostly unipotential
absorption of light in the various sections of the lenses. Electrostatic transmission electron mi-
object. There is very little absorption of elec- croscopes are used where a high resolution is
trons in the TEM. The image is formed essen- not required. Since electrostatic lenses do not
tially by the scattering of electrons during their requ.ire highly regulated power supplies, they
passage through the specimen. Those sections are III general less costly than magnetic micro-
that are denser or thicker or are composed of scopes.
heavier atoms will scatter more electrons and Million Volt Electron Microscope. The first
will, therefore, appear darker in the final image extra high voltage microscope went into opera-
on the fluorescent screen. They will be lighter tion in 1960. It was built by Dupouy and Per-
on the photographic plate. rier at the Laboratoire d'Optique Electronique,
Three types of electrons emerge after transit Toulouse, France. 13 It was designed for 1.5 MV
through the specimen: unscattered electrons, operation. The first 1 MV microscope in the
elastically scattered electrons, and inelastically United States started operation in 1967 at the
U. S. Steel Corp. Research Center at Monroe-
ville, Pennsylvania.
*The build-up is measured by observing the decrease The ever increasing number of these rather
in the radius of a small hole, approximately 1 J.I in expensive instruments indicates that they are
diameter, in a carbon film deposited on a thin able to provide information that is not obtain-
substrate. able in any other way. The MV microscope has
ELECTRON MICROSCOPE 360

the advantage of a higher resolution due to the while the second set deflects it in the opposite
shorter wavelength of the electrons at the direction towards the axis, so that the beam
higher voltage. It has a five to eight times crosses the axis in the exit aperture of the lens.
greater penetration power than the conven- Stigma tors are used to correct imperfections of
tional transmission microscope. It permits, the lens.
therefore, the investigation of specimens of The specimen is held several millimeters be-
greater thickness. Biological specimens can be low the exit aperture of the final lens at a tilt-
examined that could not be penetrated and that ing angle between 30° and 60°. The distance
had to be sectioned before. Bulk properties of has to be great enough to prevent the magnetic
various materials, for instance of crystalline field of the lens from interfering with the mea-
materials, can be investigated that may be dif- surement. Backscattered electrons are collected
ferent from those of thin foils. Materials can be by a ring electrode mounted below the exit
studied that could not be sliced thin enough or aperture of the lens. Secondary electrons are
that would be affected or contaminated by the collected by a collector mounted on the side,
slicing process. Another advantage is that, at the facing the specimen. The collector is at a high
higher voltage, the electrons have a lower cross enough positive potential to attract the second-
section for inelastic scattering. This leads to ary electrons that leave the specimen with an
lower radiation damage in materials such as energy of a few volts. Inside the collector box
polymers that are easily damaged at voltages is a scintillation crystal e.g., europium activated
between 15 and 100 kV. The reduced inelastic calcium fluoride [CaFe (Eu) 1 at a still higher
scattering leads also to reduced chromatic positive potential. The light flashes produced
aberration. by the electrons impinging on the crystal are
Scanning Electron Microscope (SEM) While conducted through a light pipe to a photomul-
in the standard transmission electron micro- tiplier tube located outside the vacuum chamber.
scope the total area under observation is irradi- The signal from the photomultiplier is amplified
ated at the same time by the electron beam, and used to modulate the beam of the display
only a single element of this area is irradiated cathode ray tube. The display tube has, usually,
at anyone time by the "probe" in the scanning a long-persistent phosphor screen. The micro-
electron microscope (SEM). This probe is a very
fine electron beam that has been demagnified
by magnetic lenses. It scans the specimen in a

--
television-type raster. The finer the probe, the
higher is the resolution of the microscope. The Electron Gun
signal originating from each element while it is
under irradiation modulates the electron beam
of a display tube. The beam of the display tube
is deflected in synchronism with the probe scan.
A system of this type without probe demag-
nification, and therefore, with low resolution
was first used by M. Knoll in 1935. 14 A demag-
nified probe was used by M. von Ardenne in
1938. The SEM has been developed since that
time into an extremely useful tool through the
work of many researchers. Many models are ~FirstLens
now commercially available.
The SEM can operate in a number of differ-
ent modes that will supply different informa-
tion about the specimen. The SEM can make - " 1S t Set
use of: Deflection
Coils
(a) Secondary electrons from the surface of a

~[J f51--

thick specimen. nd
(b) Backscattered electrons from the surface. 2 Set Lens
101 .:x:x'S<'S<'SI!III~- Second
(c) Transmitted electrons.
(d) Cathodo-Iuminescence and x-rays.
Stigmator
The SEM operates in its most common form
at a voltage between 5 kV and 50 kV. It has a ~ .____Secondary Electron
tungsten hairpin cathode and two demagnifying
: ~ Collector
lenses that are built like the lenses of the stan-
dard transmission microscope but have a short
image distance compared to a long object dis-
tance (see Fig. 6). The final lens has built inside
Specimen
its inner cylinder two sets of vertical and hori-
zontal scanning coils. One set near the top of
the lens deflects the beam away from the axis, FIG. 6. Typical scanning electron microscope (SEM).
361 ELECTRON MICROSCOPE

graph is taken photographically from the face microscopes. This accounts for the three-dimen-
of the display tube. It is possible to display dif- sional qualities of its pictures.
ferent signals side by side that are obtained In the secondary emission mode the contrast
from different types of collectors. Storage tubes in the picture is generated by a change in the
may be used in place of ordinary display tubes. composition of the specimen, causing a change
The signal can also be stored on magnetic tape in the secondary emission ratio. Where the com-
or on a magnetic drum and can be fed to a position of the specimen surface is so uniform
computer for image enhancement. The magnifi- that the secondary emission ratio varies very
cation of the SEM is determined by the ratio of little, the contrast may be produced by the top-
the size of the raster on the display tube to the ography of the surface. The reason for this is
size of the raster on the specimen. The num ber that the secondary emission ratio increases as
of scanning lines per frame can be varied be- the angle of incidence of the primary electron
tween 250 and 1000 lines or more. The time increases.
per scan can be varied from a fraction of a The resolution of the SEM described above is
second, for observation, to several minutes for limited to 10 to 25 nm (l00 to 250 A). This
recording. Since the probe currents are very limitation is determined by the size of the elec-
small, on the order of 10- 12 to 10- 10 A, long tron probe. This in turn is determined by the
exposure times are necessary in order to obtain diameter of the crossover near the aperture of
a good signal-to-noise ratio. The exposure time the grid, by the de magnification of the electron
is limited by the stability of the voltage supplies optical system, and by the increase in spot size
and the difficulty of eliminating mechanical due to the range in kinetic energy of the elec-
vibrations completely. trons emerging from the cathode. The hot tung-
The SEM has a very small beam-convergence sten cathode has a low emission density even
angle-between .005 and .01 radian. This means when operated at the highest temperature com-
that it has a great depth of focus, much greater patible with a useful life (approx. 2900 0 K).
than that of optical or standard transmission Owing to its high temperature, the emitted elec-

Field Emission Cathode

~ ~Second Anode

Magnetic Lens

~-----Specimen Holder

:40''------- Magnetic Energy Analyzer


Detector for Energy-
Loss Electrons

Detector for
No-Loss Electrons

FIG. 7. Crewe-type scanning transmission electron microscope (STEM).


ELECTRON OPTICS 362

trons have a great range in kinetic energy (ap- 2. Freundlich, M. M., "Origin of the Electron Micro-
proximately 0.9 V). A different cathode at a scope," Science, 142 (3589), 185-188 (1963).
lower temperature and higher emission density 3. Ruska, E., "Die friihe Entwicklung der Elek-
would permit a smaller limiting aperture for the tronenlinsen und der Elektronenmikroskopie,"
same optical system and the same probe current. Acta Historica Leopoldina #12, 1979, Deutsche
The lanthanum hexaboride [LaB61 cathode in- Akademie der Naturforscher Leopoldina, Hallel
troduced by A. N. Broers 15 is a great improve- Saale, East-Germany. See on page 32 a copy of
ment over the tungsten filament. the page of Ruska's notebook for this date.
Scanning Transmission Microscope (STEM). 4. Ruska, E., "Uber Fortschritte im Bau und in der
Another major improvement was achieved by Leistung des Magnetischen Elektronenmikro-
A. V. Crewe of the University of Chicago by skops," Z. f. Physik, 87 (9 & 10), 580-602
introducing a field emission electron source. 16 (1934).
The field emission technique was invented in 5. Ruska, E., Arch. Elektrotechn., 38, 102-130
1937 by E. W. MUller (see FIELD EMISSION). (1944).
In Crewe's STEM (see Fig. 7) the field emission 6. Hall, C. E., "Introduction to Electron Micros-
cathode, which requires a vacuum of 10- 9 to copy," Chapter 5, New York, McGraw-Hill Book
10- 10 torr, consists of a fine tungsten point. Co., 1953.
A negative potential of 3 kV is applied to the 7. Mahl, H., and Pendzich, A., Z. Tech. Physik, 24,
cathode against a spherical first anode. A sec- 38-42 (1943).
ohd anode at ground potential is at a potential 8. Johannson, H., and Scherzer, 0., Z. f. Physik, 80,
of 30 kV against the cathode. The two anodes 183-202 (1933).
together form an immersion lens. Though the 9. Mahl, H., Jahrb. AEG Forsch., 7, 43-56 (1940).
tunsten emitting point has a diameter of 100 nm 10. Andrews, H. C., Tescher, A. G., and Kruger,
the electrons seem to emanate from a virtual R. P., "Image Processing by Digital Computers,"
electron source that has a diameter of only 3 to IEEE Spectrum, 9 (7), 20-32,(July 1972).
10 nm (30 to 100 A) due to the very strong 11. Stroke, G. W., "Sharpening Images by Holog-
electric field close to the cathode. Emission raphy," New Scientist 23 Sept. 1971; "Optical
currents of several microamperes are obtainable. Computing," IEEE Spectrum, 9 (12), 24-41
Since the tungsten tip is at room temperature, (December 1972).
the range of kinetic energy of the emitted elec- 12. Briiche, E., "Elektronenmikroskop," Naturw., 20,
trons is less than 0.2 V. With a demagnifying 49 (1932).
magnetic lens, a probe diameter of 0.5 nm Briiche, E., and Johannson, H., "Elektronenoptik
(5 A) is possible with a probe current of 10- 11 und Elektronenmikroskop," Naturw., 20, 353-
to 10-10 A. 358 (1932).
Crewe collects separately the three types of 13. Dupouy, G., and Perrier, F., J. Microscopie, 1,
electrons that emerge after passing through the 167-192 (1962).
specimen, namely: unscattered, elastically scat- 14. Knoll, M., "Aufladepotential und Sekundaremis-
tered, and inelastically scattered electrons. A sian Elektronenbestrahlter Karper," Z. Techn.
ring electrode mounted underneath the speci- Physik, 16 (11), 467-475 (1935); "Aenderung
men collects essentially all elastically scattered der Sekundaren Elektronenemission von Isola-
electrons that have changed their direction sig- toren und Halbleitern durch Elektronenbestrah-
nificantly. The opening in the ring permits the lung," Naturw., 24 (22), 345 (1936).
unscattered and the inelastically scattered elec- 15. Broers, A. N.,Rev. Sci. Instr., 40,1040 (1969).
trons to pass. A magnetic- or electrostatic-an- 16. Crewe, A. V., "Scanning Electron Microscopes:
alyzer separates the two types. In a homoge- Is High Resolution Possible?" Science, 154 (3750),
neous magnetic field perpendicular to the 729-738 (1966).
electron path, the inelastically scattered elec- Crewe, A. V., "A High Resolution Scanning Elec-
trons will describe a circle of smaller radius than tron Microscope," Sci. Amer., 224 (4), 26-35
the unscattered electrons due to their lower en- (1971).
ergy. Two different detectors can then collect
the two types separately. Since the number of Cross-references: ELECTRON OPTICS; FIELD EMIS-
elastically scattered electrons increases much SION; LENS; OPTICS, GEOMETRICAL.
faster than the number of inelastically scattered
electrons with increasing atomic number, the
signals can be used to determine the atomic
number of the atoms at the spot under
investigation. ELECTRON OPTICS
MARTIN M. FREUNDLICH The invention of wave mechanics in 1926 by
Heisenberg and Schrodinger saw a revolution in
physics. It became apparent that the ideas of
References classical dynamics could be formally replaced
1. Ruska, E., "Fifth International Congress for Elec- when a stream of particles was considered.
tron Microscopy," New York, Academic Press, According to the well-known De Broglie hy-
1962. pothesis, a wavelength A can be assigned to any
363 ELECTRON OPTICS

material particle such that is a constant, and k is Boltzmann's constant.


J is enhanced by increasing T or reducing
A = h/mv (1) cp. Typical cathode materials used are tungsten,
tantalum, and the oxides of barium and
where h is Planck's constant, m is the particle strontium.
mass, and v is the particle velocity. A voltage is applied to accelerate the emitted
As one of the many consequences of these electrons, and the current density is given by
ideas, the new science of electron optics J = AT2 exp (-cp/kT) if they are removed to
emerged. In the same year, Busch demonstrated the anode. However, many electrons stay near
that the action of a short axially symmetrical the cathode and repel other emitted electrons
magnetic field on a beam of electrons was back to the cathode. Most devices operate in
similar to that of a glass lens on light. Terms this "space-charge limited" way. For any elec-
used before then in optics found their use in trode configuration, the current density is then
describing electron devices. Electron "lenses" J = G V where V is the applied voltage and G
and "mirrors" having "focal lengths" and "res- is a constant for the particular electrode con-
olutions and aberrations" were described. This figuration. G is roughly equivalent to electrical
analogy has proved useful in the study of the conductance and is called the "perveance." The
behavior of electrons in electronic valves, mag- constant is fundamentally important. The higher
netrons and klystrons, traveling wave tubes, G is in value, the greater is the efficiency of
cathode ray tubes, and electron microscopes, the beam system.
to name only a few devices. The original con- In simple deSigns, the axial portion of the
cept of the electron microscope was evolved
by direct analogy with the light microscope. cathode ~s overloaded, producing excessive emis-
An electron which has fallen through a po- sion and cathode burn-out. To avoid this situa-
tential V has a kinetic energy r = ~mv2 = eV. tion, carefully shaped electrodes must be used,
Hence v = (2eV)I/2/m l/2, and using Eq. (1), we but the acceptable design depends upon the
obtain the useful formula expressing electron application. A bent hairpin point cathode is
wavelength in terms of volts used in, for example, the electron microscope.
It produces low perveance but nigh emission
density. Since G is low, the field at the cathode
A= c~Or2 (2) can be high, and this tends to reduce cp, giving
emission at lower temperature. The electron
where A is in angstroms. guns for klystrons, traveling wave tubes, and
In light optics, the least resolvable distance metallurgical applications such as vacuum melt-
S between two objects is given by the Abbe ing and welding require higher efficiency and
expression S = A/(2n sin i), where n is the re- current density. Hence they are high-perveance
fractive index of the material between object guns.
and lens and 2i is the angle subtended at the Beam Control Electrostatic and magnetic
lens by the object; n sin i is called the numerical fields control the motion of an electron ac-
aperture. In the case of white light of equiva- cording to the following equation:
lent wavelength A= 5600 A, -
F =m -dv =e (E- + v X B)
-
(3')
S(minimum, light) = 1800 A, i.e., 1.8 X 10-5 cm. dt
Using the same expression and Eq . (2), which where
gives A for an electron = 0.04 A for V= 100kV,
F = force
S(minimum, electrons) ~ 0.04 A E = electrostatic field
li = magnetic field
Le., small fractions of an angstrom can be re- v = electron velocity
solved. However, the theory does not take into e = electronic charge
account the fact that lenses are imperfect.
Spherical aberrations drastically affect the situa- The electron must travel in a vacuum if it is
tion. In addition, certain diffraction defects not to be scattered and lose its kinetic energy
and scattering impose limitations. These facts by collision with relatively massive gas mole-
and others make it impossible to obtain resolu- cules. Nuclear particles such as protons and
tions near the theoretical maximum. Instead, neutrons are each about a837 times larger in
2.0 A is a better theoretical value. Practical mass than the electron.
limiting resolutions obtained are of the order _Electrostat~ Ele~tron Lenses If in Eq. (3),
of 2.5 A to 3.0 A. B = 0, then F = eE..:.. The equation says tha..!.. an
Source of Electrons The basic source of electron in a field E experiences a force F in
electrons in electron tubes is the heated fila- the direction of the field. Thus in the system
ment or disk. The well-known equation J=AT2 shown in Fig. I where there is a voltage VI on
exp (-<I>/kT) describes the emission, where J is cylinder 1 and V2 on cylinder 2, and where
the current density, T is the temperature, cp is V2 > VI, the field and the path of an electron
the work function of the emitting material, A are as shown. The electron moving from left to
ELECTRON OPTICS 364

I---~--~

CD I @
(a) (b)

FIG. 1. Electrostatic focusing of electron beam by cylinder lenses CD and @. (a), voltage V2 > VI;
(b), voltage V2 < VI·

right increases its velocity and is deflected _____ Cathode


towards the axis. After passing the median, the OkV - ..----
force is away from the axis, but since velocity -.1 kV - L ~Control
is increased, the electron spends less time in
this part of the field. Therefore, deflection +40 Kv- r ~Anode
away from the axis is less than it was towards
it, and there is a net convergence. If V2 < VI,
as in Fig. 1 (b), then for an electron beam
traveling from left to right the lens will still be
convergent because maximum deflection will
occur after the electrons have slowed down. Object
All such lenses are convergent for E = 0 on
both sides of the lens.
Mai!!..etic Electro.!!. Lenses If in Eq. (3) E = 0,
then F = eli X B. This says that the force
Intermediate
F = e willil sin () € where tvl and llil are the Image
numerical values of the vectors and () is the
angle between them; € is a unit vector per-
pendicular to both,~nd indicates the direction Projector
of the force. For F to be greater than 0, Ii Lens
must be greater than O. The force constrains
the electron to move in a circle of radius Final
p = (Ii sin ())/(li elm). At the same time it Image
moves in a perpendicular direction with velocity
Ii cos (), and therefore, it traces the path of a
helix and returns to the axis in a time
T = 21TI(li elm) which is independent of both
Ii and e. The net effect is that all electrons FIG. 2. Schematic diagram illustrating the general
are focused by li to produce an image of the arrangement of a transmission electron microscope.
source from which they diverge.
The simple magnetic lens consists of a short Magnetic Deflection See Fig. 3(b). If B is a
coil of wire contained in a surrounding shield sharply defined field, then the deflection on
of magnetic material. A small gap in the case leaving the field is given by y = Q2 /2p. To-
material concentrates the escaping field when tal deflection D = L tan a + y. To a good
the. coil is energized. The electron microscope approximation

D=liLQ~ (1 +..!...).
(shown in Fig. 2) illustrates the use of magnetic
lenses. The coils are wound in opposite sense
to cancel the spiral distortion of the electron v m 2L
beam which is introduced by the individual
lenses. It should be emphasized that these considera-
Electrostatic Deflection If in Fig. 3(a) a tions are only approximate. In practice, factors
voltage V d exists between the plates x I X2 and such as the inability to sharply define a field
an electron enters along the axis with a con- edge in space cause added complications.
stant velocity_v = (2 Velm)I/2, then because Cathode Ray Tubes A device worthy of
of thll.. field E y caused by Vd, a transverse brief consideration is the cathode ray display
force Fy causes a motion in this direction to be tube. Its basic components are as follows: an
impinged on the electron, i.e., m (d 2Yldt 2) = electron gun, an acceleration and focus system,
Vdeld, from which dy{dt = (Vd/d) (elm) t and a deflection, and a display system which is
y = ! (Vd/d) (elm) t . The force acts for time normally a phosphor screen. Depending on the
t = Qlv; thus deflection y = ! (Vd/d) (elm) type of tube the applied voltage causes electro-
(QIV)2 = i (Vllld) (Q2fV). The additional de- static or magnetic deflection. In the oscillo-
flection, after F has ceased to act on the elec- scope tube electrostatic deflection is used while
tron, isyl = (Vdlv) (QLI2d). in the large display tube, such as the television
365 ELECTRON OPTICS

r---~ --~I----L --~


Coil llll
I
lll lll llll llll illI
Y= I I
(2Ve/ m) Y. I 1

x, ~
y=
I
(2Ve/ m) Y' I
I
I Y
I P = B elm
I I
I /
I I
1/
t'
(a) (b)

FIG. 3. (a) Deflection of electron beam by electric field between parallel plates XjX 2 • (b) Deflection of elec-
tron beam by magnetic field . Electron path is an arc of a circle only while it is influenced by field. After leaving
the field the path is linear.

CATHODE RAY TUBE


Display
assembly shields the deflection area from the
effect of the high acceleration field required
for high brightness and high writing speed.
The electrons pass through the grid and those
which collide with it cause low energy emission
of secondary electrons which can be collected
readily by placing a small positive voltage on a
neighbouring electrode such as A4 , and there-
fore they do not cause background illumina-
tion of the image produced by the primary
Beam beam .
Formaiion
Table I illustrates the operating features and
performance of the tube. The display envelope
is commonly rectangular and an external mag-
netic coil is needed to align the beam axes with
those of the display face. Plate capacitances
are reduced by the use of side pins. Never-
theless these capacitances and transit time
phenomena contribute to the main limitations
Magnet ic
for use at higher frequencies. However, other
K - Cathode tubes operating at frequencies greater than 500
C - Cont ro l E l etrode Display MHz have been developed.
Color Television Tube In the television tube
FIG. 4. Schematic diagram of cathode ray tubes. the beam must be deflected to trace each part
of the screen in sequence while it is simul-
tube, magnetic deflection is used because of the taneously intensity modulated. The resulting
large scan angles which are required. The tubes picture is a mosaic of dark and light elements.
are illustrated in Fig. 4 . Three primary red, green and blue pictures must
Advanced Oscilloscope Tubes The instru- be displayed in exact registration to obtain a
ment tube shown in Fig. 4 is of low cost and a high-quality color picture. Color mixing to
has limited frequency performance to 5 to 10 produce the wide range of colors required is
MHz. If voltage on A4 is increased, then it obtained subjectively and depends on the rela-
becomes increasingly difficult to deflect the tive brightnesses of the three primary light
beam in such a tube. In recent years perfor- yields at a particular point on the screen.
mance of the comparatively low-cost tube has There are numerous ways of achieving the
been increased to greater than 50 MHz by the color picture, but the "shadow mask tube"
introduction of post-deflection acceleration sys- has been developed to provide the most eco-
tems, the most successful of which is shown in nomical method. There are three primary guns
the tube illustrated in Fig. 5. Here a grid of in the tube, each tilted to converge to a
fine wires placed over the end of the gun central point near the screen. A single de-
ELECTRON OPTICS 366

External
Aluminized
Tw ist Co il
Screen
Final PDA Voltage
A5
Ca

High-
Connector

FIG. 5. Advanced Instrument Cathode Ray Tube. Mesh screen, held near mean plate potential, prevents PDA
field from reducing deflection plate sensitivity. Note other features such as side pin connection to plates to reduce
capacitance; also the blanking arrangement for blanking the beam during time-base flyback. (The method has
advantages. over using grid G1.) See Table 1 for other information.

TABLE 1. PERFORMANCE OF MESH PDA TUBE (See Reference 5)

Beam deflection plate potential 1250 ya 1500Y


First acceleration plate potential 1250 Y 1500 Y
Focusing plate potential 160-80 Y 200-100Y
PDA voltage 12.5 KY 12.5 KY
Typical screen curren t 10 J.LA 10 J.LA
Line width 10 J.LA 0.36 mm 0.32 mm
Linearity 2 per cent 2 per cent
Y sensitivity 4.8 Ycrn- 1 5.8 Ycrn- 1
X sensitivity (time base) 9 Ycm- 1 11 Ycm- 1
Y scan 80 mm
X scan 100 mm
Tube length 350 mm
aVoltages w.r.t. cathode.

App"rent flection coil is used to scan the beams in


Deflection registration (as shown in Fig. 6. Additional
Centers deflection yokes carry currents derived from
the main deflection waveform to introduce con-
vergence corrections as needed to maintain
registration over the complete screen.
Undeflected The screen must emit red light when bom-
Posit ion barded by electrons from the red gun, green
from the green gun, and blue from the blue
gun. To enable this to occur a perforated metal
mask is positioned about 12 mm from the
G screen and the beams in fact converge to its
central point rather than that of the screen.
Guns Electrons must pass through the mask holes to
reach the screen as shown in Fig. 7. The beams
FIG. 6. Simultaneous deflection of three electron appear to originate from three apparent deflec-
beams in shadow mask color tube. tion centers in the deflection field. Thus, they
367 ELECTRON SPIN

'h.dow M"" ~ Bed Phosphor Dot

Green Beam

Green
Phosphor Dot

(0
(0 (0
Red Beam
Mask /
Aperture
FIG. 7. Operation of shadow mask in color television tube.

will strike the face in three points of a triangle Science and Technology," New York, John Wiley
when passing through a particular hole. Red, & Sons, 1965.
green, or blue emitting phosphor "dots" are 4 . Pierce, J. R., "Theory and Design of Electron
deposited at these points as appropriate. The Beams, New York, Van Nostrand Reinhold, 1949.
plate is called a shadow mask because when 5. "Performance data relates to modern German
one gun is switched on , the unwanted dots are CRT," AEG Telefunken 014-131, Table l.
not energized since they lie in its shadow.
Typical mask holes are 0.5 mm in diameter and Cross-references: ELECTRON; ELECTRON MICRO-
about 0.7 mm apart. The total number of holes SCOPE; ELECTRON TUBES; OPTICS, GEOMETRI-
is about 400 000 for a 25-inch tube. CAL; OSCILLOSCOPE; THERMIONICS.
During manufacture the mask itself is used
to process the screen. The phosphor is de-
posited with a photosensitive lacquer and a UV ELECTRON SPIN
light source is placed at the appropriate deflec- The electron, as one of the stable fundamental
tion center. The screen is exposed through the particles (a lepton) of physics, has a small rest
mask, developed, and the process repeated for mass m and a classical unit of charge -e, but also
the other two colors. The deflection centers possesses the attribute of spin, a strictly quan-
move slightly towards the screen for large de- tum mechanical property. Associated with this
flection angles, and a correction lens has to be spin is an angular momentum of magnitude
used to compensate during exposure. The ac- .!t /2 (Ii is Planck 's constant) and a magnetic
curate positioning of the million or more phos-
phor dots is achieved in this way. moment given approximately by ell/2mc,
where eIl/2mc = MB is known as the Bohr
BARRY A. GEORGE magneton (MB = 0.9273 X 10- 20 ergs/gauss).
On the other hand, orbital angular momentum,
References such as for electrons in atoms, is quantized in
integral units of fl . Thus, the electron spin quan-
1. Klemperer, 0., "Electron Optics," Cambridge, The tum number of ! is different than that for
University Press, 1953 (for a rigorous treatment orbital quantum numbers. Furthermore, the
of electron optics). electron spin gyromagnetic ratio (the spin mag-
2. Bakish, R., "Introduction to Electron Beam Tech- netic moment divided by the spin angular mo-
nology," New York, John Wiley & Sons, 1962 mentum) is twice the classical value for orbital
(for the practical applications of electron optics). angular momentum. This feature plays an im-
3. Bakish, R., Ed., "Electron and Ion Beams in portant role in the Zeeman effect of optical
ELECTRON SPIN 368

spectra of atoms. Electron spin and its mag- other sources (Le., hyperfine and exchange
netic moment are predicted by the relativistically interactions) besides the external field. For H =
invariant Dirac equation. Whereas quantum 3000 gauss, v ~ 9 X 10 9 Hz for g ~ 2, a fre-
mechanical wave functions for orbital mo- quency conveniently in the microwave range,
tion are invariant under 21T rotations, electron suggesting microwave spectroscopy methods for
spin wave functions are not invariant under 21T EPR. A block diagram of a typical EPR micro-
rotations, but instead change sign. The vector wave spectrometer is shown in Fig. 1. Rapid ad-
components of the electron spin operator can vances in microwave- and radio-frequency tech-
be conveniently described in terms of the niques before and during World War II greatly
2 X 2 Pauli spin matrices. These two-component benefitted the rapid growth of both EPR and
quantities are termed spinors. NMR.
The first direct experimental evidence for The power of the EPR technique results not
quantized electron spin S (having a z-component only from the enhanced sensitivity, but from
Sz = msll, with ms = ±!) was obtained from the the microscopic information obtained from the
Stern-Gerlach experiment in which a collimated EPR spectra about the environment of the un-
beam of neutral silver atoms was split into just paired electron spins. This microscopic informa-
two separate beams with opposite deflections tion consists of g-shifts [flg = g - gf.e. (f.e. =
when the silver atoms passed through an in- free electron), gr.e. = 2( 1 + 0l./21T - • . .) ~
homogeneous magnetic field. This experi- 2.0023 since the fine structure constant Ol. ~
mental approach developed into the molecular 1/137] resulting from the spin-orbit and orbital
beam approach, which was so fruitfully pursued Zeeman interactions, fine structure splittings
by Rabi, Kusch, and colleagues in the study of (for spin S ~ 1 only) resulting from crystalline
free atoms and molecules. These workers first fields, hyperfine spectra resulting from electron
used the magnetic resonance technique to spin-nuclear spin interactions, line widths and
change atoms from one spin orientation (with shapes resulting from various spin-spin inter-
respect to a space quantization axis) to another, actions, the motion and diffusion of spins, and
thus altering the populations to be deflected by spin-lattice relaxation (spin-phonon interac-
the inhomogeneous field. Another important tions). Pulsed EPR (the spin-echo technique)
ramification of electron spin was in the Zeeman and dynamic (field swept adiabatic fast passage)
effect of optical spectra of atoms in an external measurements have yielded information on
magnetic field. Uhlenbeck and Goudsmit in spin-spin interactions, spin diffusion, and spin-
1925 proposed the concept of spin in order to lattice relaxation. The detailed microscopic
explain the spectroscopic fine structure of information obtained from the spin Hamil-
atoms. The different gyromagnetic ratios for tonian parameters tells us much about the sym-
orbital and spin motion produce a state-de- metry properties and spatial extent of the elec-
pendent Zeeman splitting (flE = gJ.l.BH), where tron (hole) wave function of a particular
the g-factor is given by the famous Lande result paramagnetic species. Figure 2 shows the EPR
which depends on the spin (S), orbital (L), and spectrum of atomic hydrogen situated at a cubic
total (J) angular momentum of the atom. This interstitial site in CaF 2 for H II [100 J. The
result explained the anomalous Zeeman effect. principal splitting into two patterns results
Early studies of spin-dependent effects in from the interaction of the unpaired electron
solids and liquids were made utilizing suscepti- with its hydrogen nucleus with spin I = ! (two
bility and magnetization techniques, techniques hyperfine fields for m[ = ±!). The two sym-'
determining the bulk magnetic moment (in- metric patterns of 9 lines each (intensity ratios
cluding orbital contributions) of a macroscopic 1: 8: 28: 56: 70: 56: 28: 8: 1) result from the
sample. In 1945 Zavoisky reported the first interaction with eight equivalent 19 F (I = !)
observation of electron paramagnetic resonance nuclei. The interaction with the eight 19 F neigh-
[EPR, sometimes also called electron spin reso- bors is termed the superhyperfine or ligand
nance (ESR)] in a solid. Nuclear magnetic hyperfine structure. For each 19 F nucleus the
resonance (NMR) was reported independently hyperfine interaction has axial symmetry and is
a year later by two groups. These developments characterized by tensor components All and
utilizing the magnetic resonance approach, a Ai. For other external field orientations the
much more sensitive and accurate method for eight 19 F nuclei are not all equivalent and ad-
detecting the presence of spins, represented a ditional splittings result. A detailed theory of
natural extension of the magnetic resonance the embedded H-atom wave function, taking
technique first used in molecular beam methods. account of orthogonalization to the core elec-
The detection of magnetic resonance, de- trons of the neighboring 19 F ions is required to
scribed in quantum mechanical terms, repre- explain the hyperfine tensor components.
sents the absorption of power in a radio-fre- A vast amount of work has been done on
quency (RF) coil or microwave cavity in the transition metal ions and rare earth ions in a
form of photons of frequency v and energy wide variety of hosts, including ionic crystals,
2rrfzv such that 21Tlfv = gJ.l.BH. gJ.l.BH is the Zee- covalent crystals, and certain metals. Here we
man splitting resulting from the local magnetic limit our discussion to S-state (L = 0) ions such
field H. H may contain a contribution from as Fe3 + or Mn 2+ [(3d)S configuration for free
369 ELECTRON SPIN

I'Ft!
Ir~TO"1
IL--_ TO lOCK-Il<t
A....OIOO'
R[COROE~

I
LOCA L OSC ILLATOR MICROWAVE CAVITY AND RECEIVER

FIG.!. Block diagram of a high-sensitivity superheterodyne X-band EPR spectrometer. [After Holton, W. C.,
and Blum, H., Phys. Rev. 125,89 (1962).)

ion). For the S = ~ ground state ions the crystal g-value and no hyperfine interaction the EPR
field can play an important role in determining spectrum can be very anisotropic and depends
the EPR spectrum. In the strong crystal field critically on the symmetry and strength of the
case zero field spin splittings are larger than local crystal field. Certain rare earth ions in
typical Zeeman energies. Even for an iostropic appropriate symmetry hosts have been utilized

Ho II [ 100 J

2.9 3.0 3.1 3.2 3.3 3.4 3.5 3.7 Kg

FIG. 2. EPR absorption derivative spectrum of CaF2: H for H II [100). The sharp line
at 3.35 kG is a g-marker while the doublets between the major lines of the two g line
patterns are forbidden transitions. [After Hall, J. L., and Schumacher, R. T., Phys. Rev.
127,1892 (1962).)
ELECTRON SPIN 370

to make a 3- or 4-level maser (microwave ampli- th~ unpaired 2prrg orbital is aligned along the
fication by stimulated emission of radiation). [ 110) axis for potassium and rubidium halides,
Such solid state microwave amplifiers, first but is along the (001) axis in the sodium
proposed by Bloembergen, have been of con- halides.
siderable technological importance. EPR line shapes in solids often are inhomo-
It frequently happens that the paramagnetic geneously broadened by residual hyperfine
ion of interest has an orbital degeneracy in a interactions with nearby nuclei. In these cir-
particular crystalline field. This leads to the cumstances the unresolved lines observed in
interesting Iahn-Teller effect. A static or EPR spectra do not allow the determination of
dynamic distortion (the latter represents a the hyperfine tensors of these residual hyper-
tunnelling between equivalent static distor- fine interactions. Feher developed an ingenious
tions) occurs, lowering the total energy of para- electron-nuclear double resonance technique
magnetic ion plus environs. This distortion can (ENDOR) that permits the resolution of these
have a profound effect on the EPR spectrum. otherwise unresolved hyperfine interactions.
Analysis of these spectra yields information on The idea depends on the much narrower line
the symmetry and magnitude of the distortion. widths of the NMR transitions. An EPR transi-
Non-S-state ions or impurities frequently have tion is partially saturated at fixed field while
large anisotropic g-shifts resulting from the spin- a RF source is swept through the NMR transi-
orbit interaction which depend critically on the tions of the nuclei coupled through the resid-
position of excited orbital states. Analysis of ual hyperfine interactions with the unpaired
the g-tensor thereby provides useful informa- electron (hole). Strong saturation of these
tion on the excited state energies of an impurity NMR transitions alters the populations of
ion or defect. One exam!Jle is the O2 - molecu- the electron (hole) spin energy levels and al-
lar ion, frequently observed as a substitutional ters the magnitude of the EPR signal. The
impurity in ionic crystals. Consider an O2 - ion, EPR transitions are sensitive probes of weak
oriented along a (110) axis, replacing a halogen NMR transitions from a small number of nuclei.
ion in an alkali halide. The axially symmetric ENDOR has been utilized to map the compli-
free O2 - would have a g-tensor with components cated oscillatory wave function of shallow
gil and gl for H parallel and perpendicular to the donors in silicon and accurately determine the
molecular axis, respectively. In the alkali halide position in momentum space of the conduction
the 2prrg orbitals are split by both the spin-orbit band minima. Another interesting ENDOR
interaction and the larger crystalline field pro- example in Fig. 3 shows part of the ENDOR
ducing a correction to gil and a splitting of n spectrum of the F-center and M-center (excited
into two components g(110) and g[OOl). The sign triplet S = 1 state) in KCl at T ~ 90 K. (a) gives
of g[l10) - g[OOl) determines whether the un- the spectrum of the first shell of K-nuclei for
paLred electron in the 2prrg orbital is along the the F-center without optical excitation. (b)
[ 110) or (001) axes. Experimental results show shows additional ENDOR transitions from the

I I I I I i ii i Y/7/////7/////M

b) Light on

I ! ! I I I I I I ! I I
10 11 18 19 20 21

FIG. 3. ENDOR spectra of a KO crystal containing F-centers


and M-centers. Bo = 3345 gauss II [100], T = 90 K. (a) Without
light excitation; first shell (K-nuclei) of the F-center; (b) during
3650-A light excitation, additional ENDOR lines of K-nuclei of
M-centers in the triplet state. [After Seidel, H., Phys. Letters
7,27 (1963).]
371 ELECTRON SPIN

nearest neighbor shells of K-nuclei of the ex- stress. This data is valuable in testing various
cited S = 1 state of the M-center after irradia- theoretical models of these defects.
tion with 3650 A light. The ENDOR technique Many other variations and applications of
provided direct experimental proof of the EPR should be mentioned. These include spin
models of these centers-the F-center consist- echoes, a novel, powerful technique developed
ing of an electron trapped at a halogen ion by Hahn for NMR, but which has also been use-
vacancy and the M-center composed of two ad- ful in EPR. In concentrated spin systems in
jacent F-centers. crystalline solids the isotropic Heisenberg ex-
Another important type of double resonance change interaction, in addition to smaller aniso-
technique is the optical detection of magnetic tropic exchange interactions, exist between pairs
resonance (ODMR). The optical-RF double of neighboring spins. These systems frequently
resonance technique, first proposed by Brossel show spontaneous magnetization (below a
and Kastler, was demonstrated by Brossel and critical temperature), long range spin order, and
Bitter and has been extensively used in study- ferro-, ferri-, or antiferromagnetic behavior.
ing excited states in gases. The ODMR approach The spin modes of these systems can be studied
is particularly useful when the excited state using ferromagnetic resonance (FMR) or
population is very small and direct observation antiferromagnetic resonance (AFMR) tech-
of the excited state EPR spectrum would be niques. The magnetic resonance of conduc-
impossible. The sensitivity is enhanced by de- tion electrons (CESR) has not only been studied
tecting optical photons from the fluorescing in certain high purity metals and heavily doped
excited states while the magnetic field is swept semiconductors, but also in metals containing
through the excited state EPR spectrum. Ab- magnetic impurities. EPR has given useful in-
sorption of microwave photons induces changes formation on unpaired spins in amorphous semi-
in the optical photon emission spectrum. The conductors, on lunar samples, on low-dimen-
principle is similar to the ENDOR technique sional systems (I and 2 dimensions), on systems
in that the sensitivity is enhanced by detecting subjected to hydrostatic pressure or uniaxial
higher energy photons. Figure 4 shows the stress, and on a wide range of unpaired spins
ODMR spectrum of a deep defect in silicon ob- in the form of molecular free radicals. Further-
served at T = 1.7 K for two cases-with and more, spin systems have afforded an ideal sys-
without the application of a compressional tem to study phenomena in non equilibrium
statistical mechanics.
For several decades EPR has been utilized as
a powerful tool, which has been broadly applied
to chemical and biophysical applications. While
al 0 MPa it is impossible to address the many important
applications of the EPR technique to chemistry
and biology, a few illustrative examples should
be mentioned. Paramagnetic molecules or
j::
ions in solution undergo translation and a ro-
z
:::> tational tumbling motion. In these cases the
>-
a:: EPR spectra, line widths, and shapes are strongly
<l:
a:: affected by the magnitude of the anisotropic
I-
eo
terms (anisotropic g-shifts and hyperfine
a:: interactions) and the correlation times char-
~
...J
acteristic of the molecular tumbling rates .
<l: bl 30 MPa The EPR spectra, along with associated theory,
Z
t!J yield useful information on the molecular rota-
u; tional tumbling of the paramagnetic molecules.
a::
:;; The EPR of numerous free radicals, produced
o as intermediates in chemical reactions or by
o
radiation damage, is another topic studied in
chemistry and biology. One most important
example of EPR radiation damage studies is
of the individual amino acids and the DNA
helix. However, many biologically important
molecules are not paramagnetic. A vast field
1.20 1.25 1.30 of growing importance has arisen, termed spin
B (teslal labeling, in which an appropriate paramagnetic
molecule is added to a given system to act as a
FIG. 4. ODMR spectrum the 0.97-eV luminescence "neutral" (noninterfering) probe of the dia-
in neutron-irradiated silicon. B II (011), T = 1.7 K, magnetic biological molecule. Certain parts of
VEPR = 35.0 GHz. (a) Zeros stress spectrum; (b) Spec- important proteins, such as the heme group
trum with 30 MPa compressional uniaxial stress, Til (with Fe 3+) and other porphyrins, have been
[011]. [After Lee, K. M., et al., Phys. Rev. Lett. 48, extensively studied with EPR and ENDOR tech-
37 (1982).] niques. Figure 5 shows the ENDOR spectra of
ELECTRON SPIN 372

2.0 4.0 6 .0 B.O 10.0


~
..J

•l
I

-FREQUENCY [MHz]
FIG. 5. ENDOR spectra due to 14N interactions in hemoglobin. Top trace: Normal human methe-
moglobin A, conc. 6 mM in O.IM phosphate buffer, pH = 7; Bottom trace: Hemoglobin MHydePark.
Washed whole oxygenated red cells were used. Note the absence of the high frequency peaks which
had been assigned to the proximal histidine. [After Scholes, C. P., Isaacson, R. A., and Feher, G.,
Biochim. Biophys. Acta 263,448 (1972).)

normal human methemoglobin A and also that Ramsey, N. E., "Molecular Beams," Oxford, Claren-
of the mutant hemoglobin MHydePark. The dif- don Press, 1956.
ferences in the two spectra give information on Dirac, P. A. M., "The Principles of Quantum Mechan-
the binding of histidine nitro gens to the iron ics," 4th Ed., Oxford, Clarendon Press, 1958, Ch.ll.
and on the nature of the mutant. Along with
NMR, EPR, ENDOR, and ODMR have all be- Electron Paramagnetic Resonance and Applications
come valuable tools utilizing spins in the study Abragam, A., and Bleaney, B., "Electron Paramag-
of biological systems. netic Resonance of Transition Ions," Oxford, Clar-
endon Press, 1970.
THEODORE G. CASTNER Pake, G. E., and Estle, T. 1., "The Physical Principles
of Electron Paramagnetic Resonance," Reading,
References Mass., W. A. Benjamin, 1973.
Slichter, C. P., "Principles of Magnetic Resonance,"
Electron Spin 2nd Ed., New York, Springer-Verlag, 1978.
Stern, 0., and Gerlach, W. Ann. Physik. 74, 673 Wertz, J., and Bolton, J., "Electron Spin Resonance:
(1924); Z. Physik 41, 563 (1927); 8, 110; 9, 349 Elementary Theory and Practical Applications,"
(1922). New York, McGraw-Hill, 1972.
Uhlenbeck, G. H., and Goudsmit, S. A., Physica 5,266 Carrington, A., and McLachlan, A. D., "Introduction
(1925);Nature 117,264 (1926). to Magnetic Resonance with Applications to Chem-
Van Vleck, J. H., "Electric and Magnetic Suscepti- istry and Chemical Physics," New York, Harper &
bilities," Oxford, Clarendon Press, 1932. Row, 1967.
373 ELECTRON TUBES

Poole, D. P., Ir., "Electron Spin Resonance: A Com- peres is given approximately by Child's law:
prehensive Treatise in Experimental Techniques,"
New York, Wiley-Interscience, 1967. 1= 2.33 X 10- 6 A V 3/2 /d 2
Manenkov, A. A., and Orbach, R. (Eds.), "Spin-
Lattice Relaxation in Ionic Solids," New York, where A is the area of the cathode in square
Harper & Row, 1966. meters d is the separation of the cathode and
Geschwind, S. (Ed.), "Electron Paramagnetic Reso- anode in meters and V is the voltage difference
nance," New York, Plenum Press, 1972. between the an~de and cathode. For large posi-
Symonds, M., "Chemical and Biochemical Aspects of tive voltages the electric field is strong enough
Electron-Spin Resonance Spectroscopy," New York, to collect all the electrons emitted by the cath-
Iohn Wiley & Sons, 1978. ode, and the current is independent of voltage
Berliner, L. I. (Ed.), "Spin Labeling: Theory and but depends strongly on the absolute tempera-
Applications," Vols. I and II, New York, Academic ture T as described by Richardson's equation:
Press, 1976. 1= 1.3 X 10 6 AT 2 e- ecf>/kT
Berliner, L. I., and Berliner, I. (Eds.), "Biological
Magnetic Resonance," Vols. I and II, New York, where k is Boltzmann's constant (1.38 X 10- 23
Plenum Press, 1978. joules/K), ¢ is the work function of the cath?de
Clark, R. H. (Ed.), "Triplet State ODMR Spectros- (typically a few volts depending on the matenal),
copy: Techniques and Applications to Biophysical e is the electronic charge (1.6 X 10- 19 cou-
Systems," New York, Wiley-Interscience, 1982. lombs), and A is the area of the cathode in
square meters. This is called the emission-limited
Cross-references: ELECTRON, MAGNETIC RESO- region.
NANCE, ZEEMAN AND STARK EFFECTS. Triodes If one inserts a transparent conduct-
ing grid between the cathode and anode, as in-
dicated schematically in Fig. 3, the device is
called a vacuum triode (three electrodes). If the
grid is made positive relative to the cathode,
ELECTRON TUBES but not as positive as the plate, it will accelerate
the electrons and increase the cathode current,
Although electron tubes have been replaced provided the tu be is operating in the space-
with solid state electronic devices in most appli- charge-limited region. Some of the electrons are
cations tubes will retain their importance in collected by the grid, but if the grid is trans-
applications involving high Voltage, high power, parent, many of the electrons will pass through
high frequency, and visual display. the grid and will be collected by the even more
The basic electron tube consists of an evacu- positive plate. A large grid current is usually
ated glass envelope containing a cathode that undesirable because it requires that the source
emits electrons by thermionic emission and a connected to the grid provide power, and this
positive anode that collects these electrons. The power must be dissipated by the grid, which
tube is evacuated so that electrons can travel in extreme cases may overheat the grid. Conse-
without colliding with gas molecules. Other quently, the vacuum triode is norI?ally operated
forms of electron tubes operate in a high pres- with its grid negative relative to Its cathode. In
sure regime where gas ionization is important such a case it is energetically impossible for
(Le., thyratrons, ignitrons, voltage regulator electrons td reach the grid, and so the grid
tubes, etc.) but high pressure tubes will not be current is always zero. On the other hand, if the
discussed. In addition, one or more transparent grid is sufficiently close to the cathode, and not
wire grids are typically inserted between the too negative, some of the electrons feel the
cathode and anode to control the flow of electric field from the positive plate and pass
electrons. through the grid and are collected by the plate.
Diodes The simplest such tube is called a In this way, a small voltage applied to the grid
diode (two electrodes) and is shown schemati- can be used to control the flow of current be-
cally in Fig. 1. If the anode (or plate) voltage is
positive relative to the cathode, the electrons
that boil off the cathode will be drawn to the
anode and collected. Hence an electrical current
flows from anode to cathode (opposite to the
electron flow) in a diode. On the other hand, if EVACUATED
~ TUBE ~
the anode is negative relative to the cathode,
the electrons are repelled, and no current flows.
The relationship between current and voltage
in the vacuum diode is shown in Fig. 2. For
small positive voltages the electric field that
draws electrons away from the cathode is par-
(0) (b)
tially shielded by the cloud of electrons that
surrounds the cathode. This is called the space- FIG. 1. Vacuum diodes: (a) heated cathode; (b)
charge-limited region, and the current in am- heated filament.
ELECTRON TUBES 374

held at a constant voltage, positive relative to


the cathode, but negative relative to the plate.
(b) Such a tube is called a tetrode and has a plate
T3 current that is approximately independent of
,..---T 2
the plate-to-cathode voltage.
Pentodes One drawback of the tetrode is
,-----T.
(0) that, whenever the plate becomes negative rela-
tive to the screen, secondary electrons are
v knocked off the plate by the incident primary
electrons. These secondary electrons are at-
tracted to the screen. This causes an undesirably
high screen current and can cause the plate
current to reverse direction. To eliminate this
effect, a third grid, called the suppressor grid,
is inserted between the screen and plate. Such
a tube is called a pentode. The suppressor grid
FIG. 2. I vs V characteristics of a vacuum diode for is usually held at a constant voltage near that
various cathode temperatures: (a) space-charge-limited of the cathode and, in fact, is often connected
region; (b) emission-limited region (T3 > T2 > Tl). internally to the cathode.
Cathode Ray Tubes When the electrons
strike the plate of a vacuum tube, their kinetic
tween the plate and the cathode. The important energy is normally transferred to heat, resulting
property of the vacuum tube is that the source in a plate dissipation power limit that should
which controls the grid voltage supplies no not be exceeded without risking damage to the
power, since the grid current is zero, and yet it tube. However, it is also possible to have the
can significantly alter the power delivered by electrons strike a fluorescent screen, in which
a source connected between the plate and case some of the electron energy is transformed
cathode, and thus can be used as an amplifier. to visible light. If the electrons are focused into
The incremental change in plate current ip a narrow beam by the use of apertures, and
can be calculated from the incremental change deflected in two dimensions by means of de-
in grid-to-cathode voltage UGC and plate-to- flection plates or magnetic fields, a visible spot
cathode voltage UPC according to can be produced and moved rapidly across the
face of the tube to produce a visual image. Such
ip =gm uGC + upc/rp a tube is called a cathode ray tube (CRT) and
is shown schematically in Fig. 4. Cathode ray
where gm is called the grid-plate transconduc- tubes find application as visual display devices
tance and rp is called the plate resistance. The in instruments such as oscilloscopes, television
plate-to-cathode of a vacuum tube can thus be receivers, and computer displays.
considered as a voltage source of value - f.J.uGC Klystrons Above a few hundred MHz, the
(where f.J. = gm rp) in series with a resistance rp. plate voltage and plate current exhibit a signifi-
The fact that the amplification factor f.J. is cant phase difference generally due to electron
typically 100-1000 means that a vacuum tube transit time effects, and conventional tube
can be used in a circuit to provide voltage am- designs become ineffective. At microwave fre-
plification. Since the input resistance of a vac- quencies, special tubes are used which employ
uum tube circuit usually exceeds 1 megohm velocity-modulation of an electron beam. An
whereas the plate resistance rp is much smaller external magnetic field may be used to confine
(kilohms), a vacuum tube amplifier provides the electron beam. In the klystron the electron
a very large power gain. beam is modulated by traveling through a pair
Tetrodes In order to raise the amplification of closely spaced grids connected across a reso-
factor and reduce the grid-to-plate capacitance nant cavity. The reflex klystron shown in Fig. 5

ff
for high frequency operation, a second grid, is used as an oscillator. It consists of a single
called the screen grid, is often placed between
the control grid and plate. This grid is usually
e)

II __ ------
(d)
(b) (:)

=?}Trr~ ! 7
(f)

FIG. 4. Cathode ray tube: (a) mament; (b) cathode;


(c) control grid; (d) focusing electrodes; (e) deflection
FIG. 3. Vacuum triode. plates; (f) fluorescent screen.
375 ELECTRON TUBES

FIG. 5. Reflex klystron: (a) filament; (b) grid space


showing bunched electrons; (c) resonant cavity; (d)
reflector; (e) coupling loop.
FIG. 7. Magnetron: (a) cathode; (b) path of elec-
tron in magnetic field; (c) resonant structure.
cavity followed by a drift space and an electrode
known as a reflector. The streaming bunches of
electrons are reflected back into the grid/cavity frequency. This energy is extracted from the
area in the proper phase to sustain oscillation. cavities by coupling loops or a waveguide.
The amplifier klystron shown in Fig. 6 consists Magnetrons are used as fixed-frequency oscil-
of many cavities in series, with the input ap- lators in high power cw service (such as micro-
plied to the first cavity and the output taken wave ovens) and short pulse radars.
from the last. Power is applied or extracted Light-Sensitive Tubes Electron tubes can
from these cavities usually by means of a also be used as sensitive detectors of light and
coupling loop. other electromagnetic radiation in the near
Traveling Wave Tubes In a traveling wave infrared and ultraviolet. A common form of
tube (TWT) the electron beam is focused and such a detector is the photomultiplier (PM)
confined by an external solenoidal magnetic tube. As shown in Fig. 8, such a device is a vac-
field or by a periodic permanent magnet struc- uum tube consisting of a cold, photosensitive
ture in a relatively long cylindrical cavity. The cathode that emits electrons when struck by
electron beam is bunched by means of an rf light. The electrons are attracted to a nearby
current applied to a helical coil which sur- positive electrode (called a dynode), where
rounds the beam. An electromagnetic wave is they typically release 3-6 secondary electrons
thus produced which travels down the tube at which are attracted to the next, even more
a velocity slower than the velocity of the elec- positive dynode, and the whole process repeats
trons in the beam. The electrons are thus able through many stages, producing a large current
to transfer some of their energy to the wave, at the anode. Such a device is so sensitive that
resulting in an amplified rf current in the helix. it can detect a single photon of light, in which
Because no resonant cavities are employed, the case the output consists of a negative voltage
TWT can be used as a broadband amplifier or pulse with a size determined by the capacitance
tunable oscillator at frequencies above about of the measuring instrument.
1000 MHz. An alternative and more compact configura-
Magnetrons In the magnetron-type tube, tion is the channeltron electron multiplier
electrons are emitted from a filament and (CEM) in which a long, thin, evacuated glass
move radially outward in two dimensions tube is coated on the inside with a low-work-
toward a coaxial anode as shown in Fig. 7. function conducting material that takes the
However, an external magnetic field along the
axis of the cylinder causes the electrons to
move in roughly circular orbits around the DYNODE
cathode. The anode consists of a periodic
sequence of resonant cavities. The electrons
are bunched by the electric field across the
entrance to these cavities and thus give up
energy to the cavity field at the resonant

FIG. 6. Klystron amplifier: (a) filament; (b) input -v


cavity; (c) intermediate cavity; (d) output cavity;
(e) anode. FIG. 8. Photomultiplier tube.
ELECTRON TUBES 376

place of the individual dynodes in the PM tube. to measurement and control in manufacturing
A high voltage is applied between the ends of and science.
the tube, and an avalanche of electrons is Understanding of electronic circuits implies
formed whenever light is incident on the more knowledge of the currents in all parts of the
negative end of the coating. By combining many circuit. An electric current accompanies the
of the characteristics of the photomultiplier drift motion of free electrons in a conductor
tube and the cathode ray tube, various types of in response to an electric field, and the energy
image-sensitive tubes, such as the image orthicon required to effect successive electron accelera-
and vidicon tube have been created. These tubes tions between collisions within the solid is
are used in television cameras. called the electric potential difference, or volt-
age (in honor of the early Italian worker in
J. C. SPROTT electricity, Alessandro Volta). The resistance of
a conductor to electron flow involves both
References material parameters and geometrical factors so
that, for example, the resistance of a long thin
Cobine, J. D., "Gaseous Conductors," New York, wire is larger than that of a short thick wire of
Dover Publications, Inc., 1958. the same material. An electric circuit compo-
Langford-Smith (Ed.), "Radiotron Designers Hand- nent having a specific value of resistance is
book," 4th Ed., RCA, 1960. called a resistor; according to Ohm's Law (after
Hutter, R. G. E., "Beam and Wave Electronics in Georg Simon Ohm), the constant of propor-
Microwave Tubes," New York, Van Nostrand Rein- tionality between the current I, carried by a
hold, 1961. resistor, and the potential difference V is just
Gewartowski, 1. W., and Watson, H. A., "Principles of the resistance:
Electron Tubes," New York, Van Nostrand Rein-
hold, 1966. V=RI.
Anderson, L. W., and Beeman, W. W., "Electric Cir-
cuits and Modern Electronics," New York, Holt, This relation is a fundamental to all electronic
Rinehart, and Winston, 1973. circuit analysis and design.
Brophy, 1. J., "Basic Electronics for Scientists," New The energy source that sustains steady cur-
York, McGraw-Hill, 1977. rents in circuits is often a chemical battery.
Sprott, J. C., "Introduction to Modern Electronics," Calculating the currents in various branches of
New York, Wiley, 1981. a network of resistors and batteries can be
achieved by successive applications of Ohm's
Cross-references: CIRCUITRY, DIODE (SEMICON- Law, but is greatly facilitated by Kirchhoff's
DUCTOR), POTENTIAL, THERMIONIC EMISSION, Rules (after Gustav Kirchhoff). These are: (1)
TRANSISTOR. the algebraic sum of all currents at every junc-
tion in a network is equal to zero; and (2) the
sum of voltages around any complete loop of
a network is equal to zero. The first rule is
ELECTRONICS essentially a statement of the conservation' of
electric charge, and the second is a consequence
The field of electronics comprises the design, of the conservation of energy. It often proves
analysis, and application of electric circuits and possible to replace all or a portion of a network
the devices which control electric currents in by an equivalent circuit in order to simplify the
such circuits. Since electric currents arise from analysis. Thus, for example, the Thevenin equiv-
the flow of electrons in conductors and the alent circuit (named after M. L. Thevenin)
properties of control devices depend upon the which can represent the properties of an entire
motion of electrons in them, electric circuits, network is simply a single voltage source in
electronic devices, and their combinations de- series with a single resistor. Thus, very compli-
pend upon the properties of the fundamental cated systems can be analyzed and designed.
unit of electric charge, the electron. The age of The currents in practical electronic circuits
electronics is reckoned from the discovery that are time dependent, and the simplest time-
the electron current in a vacuum triode can be varying current alternates direction sinusoidally.
controlled by electric signals applied to one of Since even the most complex current or voltage
the triode's three terminals, the grid. Invention waveforms may be represented by a Fourier
of the transistor in 1948 expanded the possi- series consisting of harmonically related sine
bilities of electronic circuits to such an extent waves, ac circuit analysis is reduced to simple
that semiconductor devices have to a large sinusoidal currents and voltages. Capacitors and
extent supplanted vacuum tubes, except for inductors impede the varying currents in ac
special applications. The expansion of semi- circuits basically because of the energy that
conductor technology to integrated circuits must be stored and dissipated in the electric
now extends electronic techniques to nearly all field between the electrodes of a capacitor and
aspects of contemporary society, ranging from in the magnetic field associated with the cur-
communications and entertainment to informa- rent in an inductor. This effect is conveniently
tion processing for business and military, and expressed in complex number notation by
377 ELECTRONICS

writing Ohm's Law describing, for example, high operating speeds, which is important in
an ac circuit consisting of a resistor, capacitor extensive systems.
and inductor in series as follows: Electronic circuits, be they composed of dis-
crete transistors or integrated circuits, operate
v = [R + j(wL - l/wC)]I in two distinct modes. In one case, the output
where j =yCl, w is the angular frequency, and signal is a linear function of the input signal
the current and voltage are sinusoidal. The and covers the range from zero to a saturation
quantity in brackets is called the complex im- value determined by device or circuit param-
pedance of the circuit, and its imaginary part eters. Digital circuits, on the other hand, operate
is termed the reactance. If the frequency is such such that the output is either zero or some
that the inductive reactance equals the capaci- finite voltage and no intermediate values are
tative reactance (wL = l/wC) the circuit is said important. These two techniques are compli-
to be in resonance; the current is a maximum mentary, but, although linear circuits take
precedence historically, the world of electronics
and is in phase with the voltage. An important is becoming ever more digital. Turning first to
and familiar application of resonance is to select linear amplifiers, the gain a of the two types of
different channels in radio and TV receivers by single transistor amplifier is given by
adjusting the circuit to resonance at the given
channel frequency with a variable capacitor. (FET)
The key elements in all electronic circuits are
the active devices which can control strong cur-
rents between two terminals of the device in (Bipolar)
response to weak electric signals applied to a
third, thus amplifying input signals almost with-
out bound. Semiconductor devices, most nota- where Vo is the output signal, Vi is the input
bly the bipolar transistor and the field effect signal, Gm , hfe, hie are device parameters, and
transistor, are the principal active components RL is a resistor carrying the output current.
in modern electronic systems, but the great Clearly, the output is a linear function of the
versatility of semiconductor materials, prin- input; the minus sign indicates that the output
cipally silicon, has spawned an amazing variety signal polarity is inverted with respect to the
of other semiconductor devices. Of particular input. Gains exceeding 100 are easily possible
significance is the field effect transistor design in a single stage and greater values are attained
which is fabricated with a metal control elec- simply by cascading several stages. Practical
trode deposited upon the insulating oxide layer amplifiers can be designed to amplify minute
of a small silicon crystal carrying the two out- dc signals and others, particularly those employ-
put terminals; the descriptive terminology of ing resonant circuits, to amplify high frequency
this metal-oxide-silicon structure is MOSFET. signals up to 50 gigahertz.
Voltage signals applied to the metal electrode The performance of amplifiers is enhanced in
change the conductivity of the underlying sili- several respects by returning a fraction {3 V0, of
con and thus the current between the output the output voltage to the input, a process called
terminals, leading to amplification of the input feedback. The output signal of a feedback am-
signal. Ingenious fabrication techniques can plifier is then Vo =a(Vi + (3Vo ), or
produce such structures having dimensions of
the order of 10 micrometers, so that many aVo I
V = _ _I-s:-V· if -a(.l»l
hundreds or thousands of transistors can be o I - a{3 {3 I fJ •
placed on the same single crystal. Suitable
metallic interconnections between devices yield The inequality means that the polarity of the
an entire electronic circuit completely con- feedback signal is negative. It does not prove
tained within one solid wafer of silicon. Ex- difficult to achieve sufficient gain for the in-
tensions of this integrated circuit technology equality to hold, so that, as this expression
lead to VLSI, very large-scale integration, in shows, the amplifier gain depends only upon
which networks of several hundred thousand the feedback network and not at all upon tran-
transistors are included in a silicon chip a few sistor parameters. This achieves greater stability
millimeters square and less than a millimeter and linearity. If, on the other hand, {3 is some
thick. Thus, an extremely complex electronic nonlinear factor, the output signal can be a
circuit can be produced in a practical size, and, complicated function of the input. In this case,
because of the extreme miniaturization, require the feedback amplifier may be looked upon as
minimal electric power. Integrated circuit de- operating upon the input to produce a modified
signs based upon complementary symmetry, output signal. Such operational feedback am-
that is, which use transistors that employ elec- plifiers were first used in analog computers to
tronic conduction in combination with those add, multiply, integrate or differentiate input
that employ hole conduction, so-called CMOS signals. Opamps have subsequently found broad
technology, are particularly appropriate because application and in integrated circuit form are
of very low power consumption. On the other widely used as complete amplifiers in preference
hand, bipolar integrated circuits tend to have to systems built of discrete transistors.
ELECTRONICS 378

Positive feedback in an amplifier, together and accuracy of digital circuits were first real-
with the condition ized in digital computers but now find applica-
tion in all fields of electronics. Digital circuits
a~ = 1 tend to be much more complex than linear
circuits, however, so it remained for the advent
suggests an infinite output signal, according to of integrated circuit technology to make digital
the above expression, which in practice means electronics viable.
that an output is obtained in the absence of an Notation and logic in the binary number sys-
input signal, that is, the circuit oscillates. A tem are analogous to the more familiar decimal
resonant circuit feedback network assures that system. The binary number 10110, for example,
the equality is satisfied only at the resonant stands for increasing powers of two, or

10110
16 + o + 4. + 2 + o
22

frequency, and such oscillators are secondary That is, the binary number 10110 represents
time standards as well as generators of high the same quantity as the decimal number 22.
frequency signals for radio and TV broadcast- Voltage pulses in digital electronics represent
ing, microwave transmission, satellite communi- digital numbers in two ways. The series repre-
cations, etc. In this connection, the high fre- sentation has regularly spaced voltage pulses
quency carrier signal radiated from the sending corresponding to increasing powers of 2 appear-
to the receiving station is modulated with the ing in time sequence beginning with 2°. In the
audio, video, or data signal by varying the carrier alternative method, parallel representation,
amplitude, frequency, or amplitude and fre- voltage pulses for the increasing powers of 2°
quency in combination. At the receiving end, appear simultaneously on separate transmission
the signal is amplified and demodulated to paths. In either case, each digit, be it a 0 or aI,
recover the initial signal. Widespread radio and is termed a bit, a contraction for binary digit.
television communication networks represented Parallel representation is extensively used in
the first major application of electronics and digital circuits because all bits of a number are
have subsequently been joined by many diverse available in a time equal to the voltage pulse for
control and processing applications. In fact, the one bit. Conversely, a time equal to the number
versatility of electronics is so great that it is of bits in the number is needed for serial repre-
common practice to develop electric signals sentation, but, since only one signal path is
analogous to phenomena of interest so that all required, this method is conventionally used for
processing may be carried out electronically. long-distance transmission. Digital transistor
Very strong positive feedback may cause an circuits are considered to operate upon the
amplifier to oscillate between zero output and pulse waveforms representing digital numbers
a finite saturation value, becoming, in effect, according to the logic of Boolean algebra. Thus,
a digital oscillator. One form of such a multi- for example, a single transistor amplifier with
vibrator circuit can be caused to remain in one two inputs, A and B, performs the logical NOT-
or the other state until triggered into the op- AND, or NAND, operation, since only when
posite state by an appropriate external signal. both A and B inputs are present is the output
That is, the circuit remembers. Very large arrays equal to zero (note the signal inversion in a
of such binary circuits make up the memory single-stage amplifier). A NOT-OR, or NOR,
devices needed in digital computers. The inte- logic gate is equally possible, and it is interest-
grated circuit form of a semiconductor random- ing to note that, according to Boolean algebra,
access memory, or RAM, can store as many as all possible logic operations (among the simpler
262,144 bits (see below) of information in a are addition and multiplication) can be achieved
single silicon chip. by suitable arrays of NAND gates alone, or
In digital electronics, voltage signals are used NOR gates alone. This is important for design
to represent quantities in the binary number and maintenance of extensive logic arrays.
system. Since the binary number system has The synergism of digital electronics and inte-
only two digits, 0 and I, these can be repre- grated circuits reaches its ultimate in sophistica-
sented by signal waveforms which have only tion in the microprocessor, which is essentially
two values, zero and some finite saturation an entire digital computer contained in a single
value set by circuit or transistor parameters. silicon chip. The microprocessor combines small
Digital circuits are, therefore, inherently more size and low cost with sophisticated logic pro-
reliable than linear circuits, which must handle cessing that permits a seemingly unending parade
a continuous range of signals. Furthermore, of important and ingenious applications. A digi-
digital signals can be amplified indefinitely and tal computer is a complex array of logic gates
stored accurately. The advantages of reliability organized into five main parts: input, output,
379 ELECTRONICS

memory, arithmetic, and control units. The The power and flexibility of microprocessors
digital circuits are designed to carry out logic is engendering an electronic revolution equiva-
calculations of all kinds and, therefore, the lent to that in communications permitted by
computer is furnished a set of specific instruc- linear circuits decades ago. In manufacturing
tions, or a program, pertaining to any desired and process control, the logic adapts to the job
calculation or process. The input and output at hand and may respond by driving a robot
units present digital numbers and the digital to a given task. The business office of the pres-
words of the program to the computer and subse- ent, and future, contains word processors that
quently retrieve results. The memory stores correct spelling, type at hundreds of words per
each data number and program word at a spe- minute, and edit text effortlessly; data storage
cific address in memory until it is needed during and retrieval systems to describe the instanta-
execution of the program. The arithmetic unit neous state of the business; and electronic mail·
contains the logic gates that perform the logic that eliminates memos and filing. Personal com-
operations called for by the program, and the puters balance bank accounts, will soon permit
function of the control unit is to interpret each electronic banking, and also play intelligent
instruction and set the circuits of the computer games. Smart home appliances talk and listen,
accordingly. The control unit also regulates the and automobiles are made more fuel efficient
basic speed at which the computer operates. and less polluting through adaptive micropro-
Because the functions of the arithmetic unit and cessor control. Even communication is revolu-
the control unit are central to performance of tionized as computers communicate and digital
a microprocessor, they are often referred to as techniques replace linear circuits. This extends
the central processing unit, or CPU, and fabri- to phonograph and TV records as well. Truly it
cated together on one silicon chip. In this case, may be said that electronics supplements human
peripheral chips for memory and input/output intellectual activities not unlike the way the
operations must be associated with the micro- steam engine supplemented human muscle.
processor CPU chip. This permits flexibility in There is also a branch of electronics, more
design, but complete 5-unit, single-chip micro- properly termed physical electronics, which
computers are also widely used. studies physical phenomena of electrons per se.
After the program and data are placed in Very often this focuses on the motion and con-
memory and the microprocessor started, the trol of electrons in a vacuum and from such
control unit reads the first instruction, pre- studies have developed the familiar cathode ray
pares the circuits accordingly and causes the tube used in TVs and comp.uter terminals; the
appropriate number to be read from memory electron microscope and its several derivatives
as specified by the address in the instruction. which provide enormous visual magnifications;
After completion of the indicated operation, photomultiplier tubes and image intensifiers
the result is returned to memory and the con- which detect and display nuclear particles, star
trol unit passes on to the next instruction. The images, and TV shows. Another aspect treats
CPU proceeds sequentially through the program features of electrons in solids, particularly semi-
until the last instruction is reached and the end conductors, transistors, and the host of other
result transferred to the output unit. It proves useful semiconductor devices such as solar
possible to write programs in such a way that batteries, photocells, and light emitting diodes.
at a certain instruction the CPU can jump to Even the large nuclear particle accelerators,
one or the other of two different program con- kilometers in circumference, can be considered
tinuations, depending upon the value of an part of this branch of science.
intermediate result. In this fashion, the micro-
processor adapts to its activity in a way that JAMES J. BROPHY
cannot be predicted by the human programmer.
Note also that by simply changing the program, References
a microprocessor can undertake any task, which
is an extremely powerful feature. On the other Brophy, J. J., "Basic Electronics for Scientists," 4th
hand, often a dedicated microprocessor has a Ed., New York, McGraw-Hill Book Co., 1982.
specific program for its one task embodied in a Horowitz, P., and Winfield, H., "The Art of Electron-
read-only-memory, or ROM, which is fixed ics," New York, Cambridge Univ. Press, 1980.
during fabrication of the integrated circuit. In Abelson, P. H., and Dorfman, M. (Eds.), "Computers
either case, the digital words of the program are and Electronics," Science 21S (Feb. 1982).
various arrays of the digits 0 and I and this
makes it extremely difficult to prepare a pro- Cross-references: ANTENNAS, CAPACITANCE, CIR-
gram in machine language. Many ingenious CUITRY, DIELECTRIC THEORY, DIODE (SEMI-
schemes have been devised which permit pro- CONDUCTOR), ELECTRICITY, ELECTROMAG-
gramming in stylized algebra or rudimentary NETIC THEORY, INDUCTANCE, MICROWAVE
English (or other human language, of course); TRANSMISSION, OSCILLOSCOPES, RECTIFIERS,
in this case, a special, previously prepared pro- SEMICONDUCTOR DEVICES, SEMICONDUCTORS,
gram translates the high-level language program SERVOMECHANISMS, THERMIONICS, TRANSIS-
into machine language. TOR.
ELECTROWEAK THEORY 380

ELECTROWEAK THEORY* gauge theory for objects that are doublets (or
higher multiplets of some group) was provided
By electroweak theory is meant the unified by Yang and Mills.3 It is a generalization of the
field theory that describes both weak and elec- field concept that underlies quantum electro-
tromagnetic interactions. The development of dynamics. In the latter, particles have scalar
a unified electroweak theory is certainly the charges and do not transform to new states in
most dramatic achievement in theoretical an interaction; the carrier of the force, the elec-
physics to occur in the second half of this tromagnetic field, is itself neutral. In this gen-
century. It puts weak interactions on the same eralization the carriers of the force (like W ±)
sound theoretical footing as quantum electro- themselves can carry this more complicated
dynamics. Many theorists have contributed to charge and can have self-interactions. The
this development, which culminated in the field carriers would be massless like photons in
works of Glashow, Weinberg and Salam,! who the symmetry limit, so a new mechanism to
were jointly awarded the 1979 Nobel Prize in make some of them massive had to be discovered.
physics. Some of the important ideas that con- We begin the discussion of the theory in the so-
tributed to this development are the theory of called symmetry limit, a limit that could be
beta decay formulated by Fermi and parity realized at very high energies (energy larger
violation suggested by Lee and Yang and in- than 100 Ge V). In this limit the gauge group is
corporated into the immensely successful V-A SU(2) X U(1). The group SU(2) is similar to the
theory of weak interactions by Sudarshan and isospin group and the left-handed quarks and
Marshak. At the same time ideas of gauge in- leptons form doublets under this group. U(1) is
variance were applied 2 to weak interaction by like electromagnetic interactions, and the charge
Schwinger, Bludman, and Glashow. Weinberg associated with this group is called hyper-
and Salam then went one step further and wrote charge (Y). There is a fundamental relation be-
a theory that is renormalizable, i.e., all higher tween the charge Q, the third component of
order corrections are finite, no mean feat for weak isospin 13 , and hypercharge Y:
a quantum field theory! The theory had to await
the development of the quark model of hadrons (3)
for its completion.
To understand the basic content of the The assignment of quantum numbers for quarks
theory, a simple beta decay of neutron into and leptons are:
proton, electron and antineutrino ~an !>e ex-
pressed by the reaction n ~ p + e + v. The Left-handed doublets (
u v 13 = t
theory is much better behaved if this reaction
is divided into two reactions
e
d e 13 = - t
n ~p + W-, (1)
Hypercharge Y =! - t
(4)
The W particle is called an intermediate W
boson because it is an intermediary in this weak There are interaction strengths g and g' as-
process. The W particle is very heavy, and reac- sociated with the groups SU(2) and U(1)
tions in Eq. (1) occur only "virtually." The respectively. If a and b are members of any
strength of the W interaction (or the coupling isodoublet in Eq. (4), the basic processes that
constant) is comparable to the electromagnetic occur with strength g are
interactions, but the large mass of the W makes
the effective strength of the interaction both a ~b + W+
short ranged and weak. (The effective strength b ~ a + W-
is approximately (coupling constant)2jmass of (5)
W)2). We now know that neutron and proton a~a+WO
are respectively made of (udd) and (uud)
quarks, where the u quark has charge ~ e and b~b+Wo
the d quark has charge - ~ e. The transitions in
Eq. (1) then arise at the quark level from the and those with strength g' are
process a ~ a +Bo
d~u + W-, (2) b ~ b +Bo (6)
Reactions in Eq. (2) can be viewed as transitions where W±, WO form a triplet and are carriers of
between two doublets (u, d) and (v, e-). These the weak SU(2) force, while BO is the carrier
doublets are referred to as weak isospin dou- of the U(1) force. Former process occur with
blets. The crucial idea on how to construct a the strength g, and the latter with g' times the
h ypercharge.
*This work is supported in part by a grant from the This picture needs modification at low en-
Department of Energy under contract No. DE-AT06- ergies. The mass of the W should show up, and
76ER70004. the photon should remain massless and inter-
381 ELEMENTARY PARTICLES

act with matter in the usual manner. How is Except for the t quark, the others have been
this accomplished? This is where Weinberg seen. No one has the slightest idea of why these
and Salam introduced the idea of spontaneous additional doublets occur in nature. This prob-
breaking of symmetry. It is caused by a new iso- lem is referred to as the Generation Puzzle. A
doublet-The Higgs field. It has the following refinement of the theory suggests that the
remarkable property: its minimum of energy actual doublets involve d', s/ and b' which
occurs not when the field strength is zero, but are mixtures of d, s, and b. This mixing, called
when it takes some finite value. This in turn im- Cabibbo mixing, permits decays that alter
plies a prefeded direction in weak isospace, strangeness like s -+ u + W- to occur with a
and this breaks symmetry. The effect of the de- small pro ba bility .
struction of symmetry is as follows: Is the theory complete? Most researchers
(I) The hypercharge field B and the third suspect that strong, weak and electromagnetic
component of the W triplet, WO, mix to form forces should ultimately be united into one
new states unified theory (called grand unified theory).
Many attempts to do so have been made, lead-
A = cos 0 w B - sin 0 w WO ing to new predictions which would signal this
kind of theoretical structure. Some theories
Z = sin Ow B + cos Ow wo. (7) predict the decay of the proton4 although with
Here A is the usual photon, and Z a new mas- a large but measurable lifetime, and others
sive state. predict additional Z bosons. 5 The next few
(2) The Higgs fields interact with W± and Z, years should prove an exciting period in physics
making them massive. A prediction of the theory where many of these new ideas will be tested.
is that N. G. DESHPANDE
(8)
References
where Mz and M ware masses of the Z and W
boson, respectively. 1. Glashow, S. L., Nucl. Phys. 22,579 (1961); Wein-
(3) The coupling of the field A to quarks and berg, S., Phys. Rev. Letts. 19, 1264 (1967); and
leptons is identical to the coupling in quantum Salam, A., in "Elementary Particle Theory" (N.
electrodynamics. Svartholm, Ed.), Stockholm, Almquist and Wiksell,
(4) The bosons W± carry charge, and their 1968.
interactions lead to V-A theory of beta decay. 2. Schwinger, J., Ann Phys. (N. Y.) 2, 407 (1957);
(The name V-A derives from Vector-Axial Bludman, S., Nuovo Cimento 9, 433 (1958);
vector structure of currents in this theory.) Salam and Ward,Phys. Lett. 13,168 (1964).
(5) The gauge boson Z leads to new interac- 3. Yang, C. N., and Mills, R. L., Phys. Rev. 96,191
tions that involve "neutral currents" (Le., (1954).
processes like u -+ u + Z, with Z -+ v + jj). This 4. Pati, J. C., and Salam, A., Phys. Rev. DIO, 275
is a new, completely unexpected prediction. (1974); George, H., and Glashow, S. L., Phys.
The test of this prediction had to await the ex- Rev. Lett. 32,438 (1974).
perimental development of high energy neutrino 5. Deshpande, N. G., and Iskandar, D., Nucl. Phys.
beams. This prediction has now been verified B167, 223 (1980).
and found to yield agreement with the experi-
ment to the level of 5-10%. The Z boson in- Cross-references: CONSER V ATION
LAWS AND
teractions depend on the value of the mixing SYMMETRY, CURRENT ALGEBRA, ELEMEN-
angle 0 w. Several different experiments lead TARY PARTICLES, GAUGE THEORY, GRAND
to the value sin 2 Ow = 0.23 ± 0.02. UNIFIED THEORIES, QUANTUM CHROMODY-
So much for the description of electro weak NAMICS, QUANTUM ELECTRODYNAMICS,
theory. The most important experimental QUARKS, STRONG INTERACTIONS, WEAK IN-
verification, the observations of W± and Z TERACTIONS.
bosons has been announced. Their masses were
predicted to be Mw ~ 80 GeV and Mz ~ 90
GeV. In the summer of 1983 the discovery of ELEMENTARY PARTICLES
both W+ and Z bosons was reported by C.
Rubia using the proton-antiproton collider at The search for the elementary constituents of
CERN. The masses are in excellent agreement matter is as old as physics itself, but any quan-
with the theoretical prediction. The observa- titative attempt at such a theory had to await
tion of the Higgs particle is much more diffi- the experimental discoveries of this century.
cult to carry out experimentally, but is none- Such a search is prompted by two considera-
theless an important test of the theory. tions: the identification of the basic building
Where do new quarks and leptons fit into the blocks of nature and the hope that their laws
scheme? They have to enter as doublets. Two of interaction would be essentially simple.
more lepton doublets have been identified The atom had to yield its claim to indivisi-
(/J.-, vJL) and (7-, vr ), while two quark dou- bility when it was found that electrons were
blets are also expected to occur, (c, s) and (t, b). constituents of all atoms; moreover, the elec-
ELEMENTARY PARTICLES 382

trons from various species of atoms were provided such a successful systematization of
identical. Light, with its particle properties, beta decay data for several nuclei that the
seemed another universal entity connected existence of the neutrino was "well established"
with matter, since the photons (light quanta) more than twenty years before its experimental
which were emitted in atomic transitions ap- discovery. The beta decay interaction was very
peared identical apart from their momenta. weak even compared to the electron-photon
The electron and the photon were the first two interaction.
elementary particles to be discovered, and a Meanwhile, the electron was found to have a
quantitative theory of the emission and ab- positively charged counterpart called the posi-
sorption of photons by the electrons in an atom tron; the electron and positron could annihi-
was possible only after the invention of quan- late each other, with the emission of light
tum mechanics. The corresponding picture of quanta. The theory of the electron did in fact
the atom regarded the electrons in an atom as "predict" the existence of such a particle. It
being subject to electrostatic attraction of the has, since then, been found that the existence
positively charged nucleus (and the mutual of such "opposite" particles (antiparticles) is
repulsion of other electrons), the photons hav- a much more general phenomenon (see below
ing only a transistory existence being either and also see ANTIPARTICLES).
emitted or absorbed in the transitions between Our present catalog of elementary particles
the atomic states. The search for the structure and decay modes contains many more entries.
of matter now became a search for the con- These particles fall into families: the photon
stituents of the nucleus. family, the lepton family, and meson family,
A quantum theory of the nucleus (or rather and the baryon family, as well as a number of
nuclei) was made possible by the discovery of postulated particles like quarks and gluons.
the proton and the neutron. The nuclear Most of these particles are unstable and decay
interaction which was responsible for holding within a time which is often very small by
the nucleus together (against the disruptive elec- normal standards but which is many orders
trostatic repulsion of the protons) was found to of magnitude larger than the time required
be of an entirely new kind, much stronger than for any of these particles to traverse a typical
the electric interaction at short distances but nuclear dimension. There is a wide variety of
decreasing very much more rapidly with dis- reactions between them, but they could be un-
tance. The various complex nuclei differ in derstood in terms of three basic interactions-
the number of protons and neutrons they the strong (or nuclear) interactions, the electro-
contain. magnetic interactions, and the weak interac-
By that time, the theory of the interaction tions. The nuclear forces and the interaction
between electrons and photons had developed between pions and nucleons belong to the first
to the point where the electrostatic repulsion type; the electron-electron and electron-photon
or attraction between electrically charged interactions to the second; and the beta decay
particles could be understood in terms of the interaction to the third. The present theoretical
exchange of photons between them. In the low- framework enables us to handle more or less
est nontrivial approximation, it gave the Cou- quantitatively the electromagnetic and weak
lomb law for small velocities. The basic inter- interactions and certain aspects of all inter-
action was the emission and absorption of actions. Despite this, it is possible to under-
"virtual" photons by charged particles. A stand many aspects of strong (as well as the
similar mechanism could be invoked to explain other) interactions in terms of conservation
the short-range nuclear interaction, and to pic- laws and invariance principles (see CONSER-
ture the nuclear interaction as due to the ex- VATION LAWS AND SYMMETRY).
change of particles which have nonzero masses The classical conservation laws of energy,
which are a fraction of nuclear mass. (The momentum, and angular momentum are valid
approximation procedure used for deducing the in the relativistic quantum theory of elemen-
static Coulomb force from the electron-photon tary particles also. The particles may possess
interaction is no longer valid here; and the intrinsic angular momentum or spin, which,
nuclear force has a rather complicated form. expressed in natural units h/2rr of angular mo-
However, these theoretical considerations did mentum, is restricted to an integer or a half-
predict the existence of a set of three particles odd integer. Angular momentum conservation
called pions, which have since been discovered.) holds only when this spin angular momentum
Another kind of particle and another kind of is included. But one finds that to every particle
interaction were discovered from a detailed there corresponds an antiparticle with the same
study of beta radioactivity in which electrons mass, same spin, and same lifetime. (In the
with a continuous spectrum of energies are case of the photon and the neutral pion, they
emitted by an unstable nucleus. The correspond- are their own antiparticles; they are strictly
ing interactions could be viewed as due to the neutral particles.) The particle and antiparticle
virtual transmutation of a neutron into a pro- have equal and opposite electric charges, and
ton, an electron, and a new neutral particle of the antiparticle of the antiparticle is the original
vanishing mass called the neutrino. The theory particle. Conservation of electric charge is
383 ELEMENTARY PARTICLES

another familiar (although nonclassical) conser- One notices that the various particles be-
vation law satisfied by all known interactions. longing to a family have the same spin and the
In addition to additive conservation laws same values of the additive quantum numbers
which arise from continuous symmetries, there except the electric charge. The photon has a
is a set of "multiplicative" conservation laws universal interaction with all charged particles;
which are associated with discrete symmetries. it has been found possible to connect the con-
It is possible to examine the invariance of the servation of electric charge and this universal
physical laws under space inversion, i.e., using a interaction structure, on the one hand, to the
left-handed coordinate system or vice versa; vanishing mass and unit spin of the photon,
if the statement of the law is unaffected by this on the other. The electron and muon partake
interchange, it is possible to show that a quan- of both electromagnetic and weak interactions,
tum number having the two values ± I can be but do not exhibit any strong interaction.
assigned to classify the quantum-mechanical In fact the muon family appears to be simply
states so that a state with the label +1 will not a duplicate of the electron family except for
change to one with the label -1 due to any a change in the unit of mass. These light par-
interaction. This quantum number is called ticles are collectively known as leptons. This
parity. Just as particles may possess intrinsic fascinating puzzle has now been made more
angular momentum (spin), particles may also intriguing by the discovery of the T-Iepton with
have intrinsic parity. Table I lists the particles possibly its own neutrino. Why the repetition
(and their corresponding antiparticles) with of this pattern, this recurrence?
their respective additive quantum numbers, For a long time people had assumed that
intrinsic parities, and lifetimes. General prin- "the" neutrino had zero mass like the photon.
ciples of relativistic quantum theory imply In weak interaction physics since the left-handed
that antiparticles of integral spin particles have and right-handed fermions behave differently
the same parity as the particles; for half-odd- (see V-A chiral interactions, below) it is not
integral spin particles the antiparticle has the necessary that the mass be zero. Contemporary
parity opposite that of the particle. All experi- thinking puts neutrino masses not necessarily
mental checks are in accordance with this zero, not even diagonal in the species: so that
prediction. we could have neutrino oscillations in which
In addition to invariance under space in- one species slowly transmutes itself into another
version, we may consider particle conjugation in a reversible manner very much like the oscil-
(replacement of particles by antiparticles) lations of a pair of weakly coupled pendulums.
invariance and time reversal in variance , or com- No definite conclusions can be stated about
binations of these transformations. It turns neutrino masses at the present time except for
out that strong and electromagnetic interac- certain numerical limits.
tions are invariant under each of these three The meson family consists of eight members
transformations (and hence any product of which fall into a triplet of pions, a singlet eta,
these), but weak interactions are invariant a doublet of kaons, and a doublet of anti-
only under combined inversion (product of kaons. They are all pseudoscalar (spin zero and
particle conjugation and space inversion) and odd parity) and exhibit strong interactions.
under time reversal. It can be shown that all The charged particles are of course coupled
interactions are invariant under the product of to the photon, but even the neutral members
the three transformations of space inversion, can participate in electromagnetic interaction
particle conjugation, and time reversal if some by virtue of the large probability of virtual
very general principles of the relativistic quan- dissociation into charged particles. They partici-
tum theory of these particles are valid. pate in a variety of weak interactions including
Even the statement that weak interactions are the nuclear beta decay interaction.
invariant under combined inversion has turned It is found that the kaons, the hyperons (bar-
out not to be strictly true. In the decay of the yons other than the neutron and proton), and
neutral K meson we should have expected a their antiparticles, collectively known as strange
short-lived particle (even under combined particles, can decay by weak interactions not
inversion) called K 1 0 and a longer-lived particle involving leptons or photons with a lifetime
(odd under combined inversion) called K 2 0 , which is large compared to the natural periods
provided combined inversion were strictly appropriate to strong interactions. On the other
valid. We do observe such short- and long- hand, these particles are produced copiously
lived components, but we also expect that the in high-energy nuclear collisions. These two
long-lived component K 2 0 cannot decay into circumstances can be understood in terms of
two pions. Yet experimentally we find a small the existence of another additive quantum
amount of decay into two pions. This viola- number, called hypercharge, which is conserved
tion of combined inversion (and, hence, of time in strong and electromagnetic interactions but
reversal invariance) is only two-tenths of a per- violated in weak interactions.
cent, but it is definitely present. Thus, none of The meson-baryon system exhibits further
the discrete symmetries (except the product regularities as far as strong interactions are
of the three) seems to be strictly valid. concerned. The neutron and the proton have
~
t""
TABLE 1. CATALOG OF ELEMENTARY PARTICLES ~
f!::
~
Hyper- Baryon Lepton z
Particle Familya Spin Mass (MeV) Lifetime b (sec) Antiparticle ParityC Charge charge Number Number Charm ;!>
'""
0<
"
Photon, 'Y Photon 1 0 Stable Photon, 'Y 0 0 0 0 0 ~
;!>
1
Electron neutrino, Ve \ 2" 0 Stable Antielectron Undefined 0 0 0 0
neutrino, ve
";:;'""
t""
Electron, e 0.51100 Stable Positron, e+ + -1 0 0 1 0 ~
!1 rJl
Muon neutrino, ViJ 2" 0 Stable Antimuon neutrino, viJ Undefined 0 0 0 0
up!o.
Muon, iJ - 105.659 2.20 X 10-6 Positive muon, iJ+ + -1 0 0 1 0
Tau, 'T -
J !
! 1784 < 2.3 X 10-12 Positive tau, 'T + + -1 0 0 1 0
Neutral pion, 1T0 0 134.96 0.83 X 10- 16 Neutral pion, nO 0 0 0 0 0
Positive pion, 1T+ 0 135 .56 2.60 X 10-8 Negative pion, 1T - +1 0 0 0 0
Eta, Tl 0 549 7 X 10-19 Eta 0 0 0 0 0
Neutral kaon, KO Meson 0 497.7 0.86 X 10-10 Neutral antikaon, 1(0 0 +1 0 0 0
Positive kaon, K+ 0 493.67 1.24 X 10-8 Negative antikaon, j(- +1 +1 0 0 0
CharmedD+ 0 1868 2.5 X 10-13 CharmedD- +1 0 0 0 +1
Charmed DO 0 1863 3.5 X 10-13 CharmedjjO 0 0 0 0 +1
Proton, p ! 938.28 Stable Antiproton, p + +1 +1 0 0
Neutron, n ! 939.57 0.93 X 10+3 Antineutron, ii + 0 +1 1 0 0
Lambda, A ! 1115.6 2.6 X 10-10 Antilambda, .:\ + 0 0 1 0 0
Positive sigma, 1;+ ! 1189.4 0.8 X 10-10 Negative antisigma, l;- + +1 0 0 0
Neutral sigma, 2;0 ! 1192.5 5.8 X 10-20 Neutral antisigma, i O + 0 0 0 0
Bayron
Negative sigma, 2;- !1 1197.3 1.48 X 10-10 Positive antisigma, l;+ + -1 0 0 0
Neutral xi, :E: 0 2" 1315 2.9 X 10-10 Neutral antixi, :E: 0 + 0 -1 0 0
1 -1 -1
Negative xi, :E:- 2" 1321 1.6 X 10-10 Positive antixi, :E:+ + 1 0 0
1 -1 -2
Omega, n- 2" 1672 0.8 X 10-10 Antiomega, ii+ + 1 0 0
Charmed A C 1 7 X 10-13
2" 2273 Charmed .:\c- + +1 0 1 0 +1

aElectron, muon, and tau families are collectively known as the lepton family. The proton and neutron are both nucleons ; other members of the baryon family are
the hyperons.
bThe neutral kaon has a long-lived component K2 0 and a short-lived component, which are quantum mechanical superpositions of the neutral kaon and the neutral
anitkaon.
cElectron, muon, tau, proton, neutron, and lambda parities are defined by convention. Antifermions have opposite parity from fermions, Antibosons have same
parity as bosons. 00
......
385 ELEMENTARY PARTICLES

very nearly the same mass and similar nuclear the moment of emission. With the mesons tak-
interactions, although their electromagnetic ing part in strong interactions, however, such
properties are quite different. The three pions distinctions are no longer obvious, and the ques-
have different electric charges, but again tion of whether a particle is elementary or is
they have approximately equal masses and composed of several other particles cannot be
similar nuclear interactions. This kind of multi- answered except perhaps within the context
plet structure is evident for other strongly in- of a more quantitative but limited model. A
teracting particles: the kaons form a doublet, point of view that has gained some acceptance
the sigma hyperons form a triplet, the xi hy- is that none of these particles are elementary
perons form a doublet, and the lambda hyperon and that each is a composite of several particles.
remains a singlet. In view of the relative weak- This view, while by no means inevitable or
ness of the electromagnetic interaction, it is even well-established, is a possible picture,
tempting to ascribe all deviations from exact because in the realm of elementary particles
equality of the masses to the indirect action of we can not only add particles together to con-
the electromagnetic interaction. In this frame- struct a composite system, we can also "sub-
work, it is possible to consider the members tract" particles by adding antiparticles. The
of a multiplet to be different states of the same claim that particles A and B go to make up
particle corresponding to the values of a new the particle C is different to distinguish from
quantum number. What is remarkable is that if the claim that particles Jj (antiparticle to B)
one takes this point of view, it is possible to and C go to make up the particle A. Further,
show that the strong interactions exhibit a particles play a dual role. On the one hand,
remarkable invariance under a group of con- they are constituents of a composite system;
tinuous transformations which may be viewed on the other hand, they are the objects which
as the group of rotations in a fictitious three- are exchanged to generate forces between the
dimensional space (or more correctly as the constituents. In any case, in view of the very
special unitary group SU(2) of transformations large number of entries in Table I, it is not de-
on two variables). The transformations act as sirable to accept all of them as the ultimate
follows: the singlet is unchanged, the doublet constituents of matter.
components transform like the components of This is even more forcefully brought to our
a spinor, and the triplet components transform attention by the recent discovery of a very large
like the components of a vector. This property number of ultra-short-lived particles. They ap-
of strong interactions is called charge inde- pear as sharp resonances in multiparticle sys-
pendence, and the corresponding conserved tems. Since these "resonances" disintegrate
dynamical variable (with three components) within a short time (even on the nuclear scale!),
is called the isotopic spin. It then turns out that it is difficult to view them as elementary par-
hypercharge conservation is a consequence of ticles, but they seem to play an important
isotopic spin conservation and electric charge role in interaction phenomena and are pro-
conservation. While the conservation of isotopic duced as often as the more stable (and familiar)
spin is violated by the electromagnetic (and mesons and baryons included in Table 1. It
weak) interactions, the charge independence appears at the present time that they ought to
of nuclear interactions is still expected to be be included on more or less the same footing.
satisfied to within a few per cent and experi- A list of the better established resonances is
mental tests confirm this. Since the symmetry given in Table 2 along with the mesons and bar-
associated with invariance under isospin trans- yons from Table 1. Since an unstable particle
formations is not directly related to space-time lives only for a very short time its energy and
properties, one often refers to it as an internal consequently its mass cannot be sharp, and
symmetry. from elementary quantum mechanical consider-
One might now raise the question: Which of ations we should. expect this "width" in the
these particles are basic constituents of matter? mass of a resonance to be inversely propor-
For the case of the atom, say the simplest of tional to its lifetime. Since the width is what
them all, the hydrogen atom, it seems easy to is measured experimentally the width (rather
say that it is a composite system made up of than the lifetime) is usually quoted in connec-
an electron and proton bound together by an tion with resonances.
electrostatic force. However, this answer is Since these particles are coupled in the strong
not completely satisfactory, since the electro- interactions, one would expect them to occur
static force itself is due to the exchange of light in isospin multiplets. This is in fact observed.
quanta, and in the process of atomic transitions It turns out that since strong interactions are
photons are emitted or absorbed. Yet we do invariant under particle conjugation and are
not include them as constituents of the atom. charge independent, we could define a multipli-
In beta radioactivity, electrons and neutrinos cative quantum number called G-parity which
emerge from the nucleus, yet the nucleus is has definite values ± 1 for mesons and meson
not pictured as containing either of these vari- resonances. These values are also included in
eties of particles but rather as made up of Table 2.
protons and neutrons. The beta electron and With these resonances included among the
neutrino are rather assumed to be created at "elementary particles" we have a situation
ELEMENTARY PARTICLES 386

TABLE 2. SOME STRONGLY INTERACTING PARTICLES AND RESONANCES

Mass Width Electric Hyper- Isotopic


Particle or Resonance Spin (MeV)a (MeV)b Parity Charge charge Spin G-parity

Pion,1T 0 138.5 0 0,+1,-1 0


1
Kaon,K 0 495.8 0 0,+1 +1 ~ Undefined
Antikaon, K 0 495.8 0 0,-1 -1 ! Undefmed
Eta, 1/ 0 548.8 0 0 0 0 +
Rho resonance, p 776 158 0,+1,-1 0 +
1
Kaon resonance, K * 893 50 0, +1 +1 ~ Undefmed
Antikaon resonance, K * 893 50 0,-1 -1 1
~ Undefined
Phi resonance, I/> 1020 4 0 0 0 Undefined
Omega resonance, w 782 10 0 0 0
JII/I 3097 0.063 0 0 0
1 1
Nuc1eon,N ~ 938.9 0 + 0,+1 +1 ~ Undefined
1
Lambda, II. ~ 1115.6 0 + 0 0 0 Undefined
1
Sigma, ~ ~ 1193.4 0 + 0,+1,-1 0 1 Undefined
1 1
Xi, :=: ~ 1318.4 0 + 0,-1 -1 ~ Undefined
3
Nucleon resonance, N* ~ 1236 120 + 0,+2,+1,-1 +1 ~ Undefined
Y resonance, y* ~ 1385 36 + 0,+1,-1 0 1 Undefined
1
Xi resonance, :=:* ~ 1530 7 + 0,-1 -1 ~ Undefined
Omega minus resonance, .n ~ 1672 0 -1 -2 0 Undefined
aThe average mass of the members of the isotopic multiplet is tabulated.
bSince the unstable particles of Table 1 live "practically forever" on the nuclear time scale, the corresponding
widths are several orders of magnitude smaller than 1 MeV; these are quoted here as O.

somewhat parallel to atomic spectroscopy responsible for the violation of this higher
before the discovery of quantum mechanics. symmetry must itself be a part of the strong
The catalogue of the strongly interacting interaction. A scheme in which invariance un-
particles (collectively known as hadrons) now der the special unitary group on three variables
contains well over a hundred entries, and it 8 U(3) holds approximately has been success-
would be difficult to consider a hundred plus full in correlating and predicting the spectrum
"elementary" constituents. Yet how are we to of particles and their interactions. The isospin
select the genuine subset of elementary con- group 8U(2) is a subgroup of this unitary
stituents? We have already remarked about the group. Just as for charge independence, no
picture in which every hadron is a composite basic reason has been found for the origin of
system. We should then look for regularities this "unitary symmetry." Still other symmetry
among them, including groupings into families, groups, even wider than 8U(3) and generally
multiplets, etc., as well as for systematic rela- incorporating it, and which are even more
tions between masses, spins, multiplet sizes, etc. significantly violated, are being studied. It
There are also practical questions regarding appears that the complete understanding of
the identification and interpretation of reso- these higher internal symmetries would involve
nances . When a number of reaction chaimels not only their origin, but also the origin of their
are open a resonance may not be easily visible violation.
as a pronounced peaking in crosssection or mass The hadron multiplets appear to have other
plot. Fortunately resonances like the ground- regularities. We can discern, by analogy with
state hadrons seem to fall into multiplets, and atomic physics, subfamilies consisting of a
this together with other systematics aid us in lowest-spin "excited states." The states listed
identifying resonances: observation is intimately in Table 2 may be viewed as the ground states.
tied in to the theoretical framework employed. If we plot the masses squared versus the spin
One notes also that the meson and baryon for several of these subfamilies, we find them to
multiplets seem to fall into further super- lie approximately on straight lines with a uni-
multiplets. Following the analogy of the iso- versal slope of about I (GeV)-2. These may be
spin group, we may now ask what internal thought of as the generalization of the bound
symmetry group is responsible for this interac- state energy versus spin relation for potentials
tion. We must also remember that whatever is to the domain of resonances, and thus, as or-
387 ELEMENTARY PARTICLES

bital excitations; and in this context they are generalizing the electromagnetic coupling to
known as Regge families. charged particle fields to more general non-
The ground states themselves may be under- commutative groups. Identifying the color of
stood in terms of a generalization of the "in- quarks as the realization of an SU(3) color
ternal symmetry" to include a spin aspect also, group leads to the introduction of an eightfold
so that instead of SU(3) we consider SU(6). vector meson field of zero mass coupled to the
The spin ! even-parity baryon octet and spin quarks as well as self-coupled, which leads to
~ even-parity baryon decuplet resonances to- forces with precisely the properties required.
gether then form a single 56-dimensional repre- The quanta of these octets of fields are called
sentation of SU(6). The nine spin 0 odd-parity gluons. They are neutral with regard to iso-
mesons and the nine spin I odd-parity vector spin and its generalization, which in contrast to
meson resonances form a mixture of the 35- color is called flavor. The fundamental strong
dimensional and the I-dimensional representa- interaction is then the gluon interaction. The
tion of SU(6). We can combine this SU(6) hadron-hadron interaction by contrast is the
structure together with the orbital excitations residual interaction between quark compounds,
mentioned above to bring about a phenomeno- somewhat analogous to the van der Waal's
logical SU(6) X 0(3) classification for hadrons. forces between atoms. No wonder nuclear
It is very tempting to think of this SU(6) X forces are so complicated!
0(3) structure as pointing to a substructure of At the present time most experts seem to be-
the hadrons in terms of 3 hypothetical entities lieve that the quantum theory of coupled col-
with spin! and even parity which transform as ored quarks and their Yang-Mills gluons is the
a 3 X 2-dimensional representation of this group theory of strong interactions. It is called quan-
called quarks. The simplest baryon resonances tum chromodynamics (QeD). The difficulties
are then to be viewed as three-quark com- of principle that plagued the theory of photons
pounds and the meson resonances as quark-anti- and electrons, quantum electrodynamics (QED),
quark compounds. The spin, parity, and SU(3) are still there; but in addition we have new dif-
quantum numbers of most hadrons are consis- ficulties in calculation. As if in compensation
tent with this picture and some quantitative we have many more phenomena to which QeD
understanding of the resonance masses and de- can be applied. Nonperturbative methods and
cay parameters can be obtained within this semiphenomenological methods have been de-
framework. sometimes referred to as the quark veloped to cope with these difficulties. Par-
model. It should be pointed out that so far no ticularly important have been the observation
quarks have been discovered and physicists are that the effective coupling strength is energy
not sure if they even expect them to exist as dependent and in QeD it decreases as the mo-
ordinary particles. mentum transfer increases, leading to "asymp-
The baryons and baryon resonances identified totic freedom." The precise manner in which
as three-quark compounds correspond to a this takes place is described by the renormaliza-
completely symmetric spin-internal symmetry tion group equations.
wave function. The spin i and spin ~ baryons The elementary particle scene has been liv-
may be viewed as realizing a third rank sym- ened by the discovery of extremely narrow
metric tensor realization. But quarks, being resonances and their identification as quark-
spinorial entities, should behave as Fermions antiquark bound states. This has led to the
and hence should furnish a totally antisym- identification of a new flavor quantum number
metric third rank tensor. It is therefore ap- called charm. Together with charge and strange-
propriate to endow quarks with an as yet un- ness this suggests that the internal symmetry
identified property called color with at least (flavor) group is SU(4) rather than SU(3) or
three linearly independent states. If for economy SU(2). It is only the breaking of the symmetry
we use only three colors, all known hadrons are by the large mass differences between the
color singlets: one may require "color confine- various quarks that tended to hide these sym-
ment" as an essential ingredient of hadron metries. The discovery of charm has also pro-
structure. vided a means for accounting for the absence of
What about the binding of the quarks and strangeness-changing neutral hadronic currents
more generally of the quark-quark and quark- in weak interactions.
anti-quark forces? We can expect them to be Strong interactions are thus described in
quite strong since at low energies and momen- terms of a color triplet of four flavors of quarks
tum transfers the hadrons act as indissoluble. which are called u, d, s, c (forup, down, strange,
But we need a special feature for these forces: and charm). It is a question of some interest
unlike usual forces in atomic and nuclear physics whether this is the end. It is generally accepted
these forces must be "confining," and hence in- that one of the very narrow resonances should
crease/ with distance. In contrast, in many be identified as a new quark pair. If this is called
high momentum transfer reactions which probe b (for beauty or bottom) and a sixth called t
small distances these quark binding forces are (for truth or top) is also postulated, we would
relatively weak. It turns out that these features have these pairs: u, d; s, c; t, b. This would be
result from an interaction structure invented by then very similar to the leptons, where we seem
Yang and Mills on the basis of gauge invariance to have e, Ve; /1, vf.l; T, VT: three families of
ELEMENTARY PARTICLES 388

quark pairs and three of lepton pairs. This If we really pursue the goal of unification and
family recurrence suggests even more strongly are emboldened by the close similarity be-
that the leptons and quarks are intimately con- tween quarks and leptons attempt to put them
nected; and that we have not exhausted levels together as a grand multiplet, we would be led
of possible unification. What was an electron- to a grand unified theory (GUT), or rather
muon puzzle is now a full-fledged "family" several theories. Symmetry is broken in these
affair! theories so that at ordinary energies they do
As particles get organized into multiplets, not appear.
so do interactions exhibit systematic prop- Particularly simple forms of GUT are based
erties. During the last century Maxwell unified on the groups SU(5) and 0(10); these theories
electric and magnetic interactions into the elec- do predict a quark-lepton transition induced by
tromagnetic theory. With the reinterpretation a very heavy boson. This would lead to the
of the coupling of the charge flow to the elec- decay of the proton, possibly into a positron
tromagnetic field to be implemented by a gauge and neutral pion or other such modes. The
invariant interaction in quantum field theory lifetime for such a decay would depend on the
we got quantum electrodynamics (QED). In heavy boson mass, varying roughly as the
the middle of this century, Sudarshan and fourth power of the mass. Using renormaliza-
Marshak started from Fermi's theory of beta tion group equations and the observed ratio of
decay of the neutron to unify weak interactions the couplings one can estimate the mass and
into a chiral ("handed") V-A interaction. In hence the proton lifetime. This yields a life-
this theory for the first time we identified the time on the order I 031± 2 years. While this
left-handed fermions to be the only ones means an extremely small transition rate there
coupled; in contrast to electromagnetism, in are a great number of protons in the world,
which the left and right chiral components were and detection, if they decay, is imminent.
both coupled but the neutrino was not. The We should also remark that the particles so
adaptation of this theory to strange and charmed far discovered all have either finite mass (Class
particles has now been carried out satisfactorily. I particles or bradyons), or zero mass (Class II
Is it possible to weld together the Maxwell particles or luxons). It is an interesting ques-
theory of electromagnetism and the Sudarshan- tion to ask if particles of imaginary mass (Class
Marshak theory of chiral weak interactions into III particles or tachyons) can and do exist. It
a unified electroweak theory? Clearly the weak used to be thought that such particles could not
interactions are weak and if seen as due to exist, since their existence would violate the
charged counterparts to the photon these inter- principle of relativity, but now we know that
mediate vector mesons should be about 40 such is not the case. If hadronic tachyons exist,
times the proton mass. It turns out that Glas- the quantum theory of tachyons predicts that
how, Weinberg, and Salam constructed pre- they will show up as fixed resonances in mo-
cisely such a theory by starting with a Yang- mentum transfer. Leptonic tachyons (and
Mills theory of a group SU(2) X U(I) involving photonlike tachyons) could probably be best
the left-handed doublet comprising the electron detected in astronomical phenomena. If they
and neutrino on the one hand and the right- exist they would provide for a substantial pres-
handed electron on the other. This symmetry sure in the interior of hot stars and thus provide
must break spontaneously and in the process a balancing force to alleviate gravitational col-
generate substantial masses for the vector lapse. Whether the concept of particles of
mesons. Such a theory is "renormalizable," so imaginary mass, which is the relativistic counter-
that higher order field theory corrections can be part of geometric size used in simpler non-
calculated in an unambiguous manner. It is not relativistic physics, is useful in elementary
fully unified since there are two distinct coup- particle physics is not yet clear.
ling constants which can be related to the ratio One of the remarkable new developments is
of the charged to neutral vector bosons. It pre- the close relationship between cosmology and
dicts a level of neutral current interactions in elementary particle physics. This goes beyond
agreement with experiments. The mechanism the application of standard methods of nuclear
of generation of masses involves the use of cer- synthesis and stellar astrophysics in dealing
tain auxiliary Higgs fields which also serve to with the change in physical laws under cosmo-
give the electron its mass. Once the electron- logical conditions. The spontaneous sym-
neutrino interactions are formulated we could metry breaking that makes weak interactions
use it for the two other lepton pairs and the weaker than electromagnetism no longer oper-
three quark pairs. ates when energy density and temperatures
The existence of three families rather than are very high. Similarly, masses and thresholds
two throws some light on the puzzling small are dependent on the presence of radiation
time reversal violation as "natural," in that the and other particles in high concentration.
complex phases appearing in a mass matrix Finally the baryon to photon abundance ratio
coupling left- and right-handed objects cannot can be related to the violation of time reversal
all be "defined away" as soon as three families invariance.
appear. Despite the breathtaking unifications we are
389 ELEMENTS, CHEMICAL

still short of two unifications. One is the uni- ELEMENTS, CHEMICAL


fication of gravitation with particles inter-
actions. While the emphasis on geometry char- The idea of a basic simplicity underlying the
acteristic of general relativistic gravitation bewildering complexity of nature has always
theory is now permeating particle physics via been a conceptual thread woven into man's
gauge theories, the combination of gravitation view of the world. The Greek philosophers of
with particle theories is still far from satis- antiquity were among the first to record their
factorily envisaged, much less carried out. speculations, and we are to this day influenced
The other unification is of bosons and fer- in an unconscious way by their thoughts, con-
mions by a supersymmetry. Such theories are cerning the elements, which they supposed to
now being actively pursued, since they seem to be the ultimate components of matter and
have some effect of diluting the infinities in chemical change. Thus we speak of "man's
quantum field theory. Supersymmetry is re- battle with the elements" and "the raging ele-
luctant to "break" in a simple manner, yet ments" in unconscious reflection of the ideas
break it must if experiment is our guide: bosons of Thales, Anaximenes, Heraclitus and Em-
and fermions are not degenerate in their masses. pedocles of the fifth century B.C. who believed
To sum up: we have a large number of "ele- that all matter was made of one or more of
mentary" particles which are not so elementary. the elemental substances: earth, air, fire and
We have a number of theories which aid us in water. These ideas did not prove particularly
understanding the regularities observed in their fruitful in advancing our understanding of the
spectrum and in their interactions. We have nature of matter and chemical change. Never-
discovered a large number of invariance groups theless, it was not until van Helmont (1648)
and the extent to which they break. And we that they were challenged on a rational basis.
have found that coupling to the flow of electric In 1662, Robert Boyle, in the Sceptical
charge and its noncommutative generalizations Chymist gave a reasonably clear definition of a
as well as the flow of energy and momentum chemical element with an operational basis
lead to a class of gauge theories which may which we can accept today. "I mean by ele-
underlie all the physical interactions. However, ments, ... certain Primitive and Simple, or per-
we still do not understand the multiplicity of fectly unmingled bodies; which not being made
particles and interactions. The past decade has of any other bodies, or mingled bodies, are the
been a time of great discoveries. Particle physics Ingredients of which all those called perfectly
promise to be a rich area of physics in the fore- mixt bodies are immediately compounded, and
seeable future. Perhaps yet another level of into which they are ultimately resolved." He
discovery awaits us in our search for the consti- gave no list of elements, however, this being left
tution of matter. to Lavoisier who published a naturally incom-
plete, but remarkably accurate list in 1789, in
E. C. G. SUDARSHAN his justly famous, Traite de Chimie.
The Definition of "Element" In modern lan-
guage, Boyle can be paraphrased in the follow-
References ing way: an element is a chemical species that
cannot, by ordinary chemical manipulation be
Marshak, R. E., and Sudarshan, E. C. G., "Introduc- decomposed into a number of simpler chemical
tion to Elementary Particle Physics," New York, species. It is the entity that survives intact the
Intersicence Publishers, 1962. infinite variety of transformations that a sample
Bernstein, J., "Elementary Particles and Thier Cur- of matter can be caused to undergo. Every com-
rents," San Francisco, W. H. Freeman, 1968. pound is composed of two or more of these
Marshak, R. E. Riazzuddin, and Ryan, C. P., "Theory species and can be decomposed into them by
of Weak Interactions in Particle Physics," New York, suitable chemical procedures. This definition
John Wiley, 1969. provides an operational means of identifying an
Streater, R. F., and Wightman, A. S., "TCP Theorem element in terms of laboratory procedure, and
and All That," W. A. Benjamin, New York, 1964. by it, any species that defies decompositional
Weinberg, S., "The First Three Minutes," Basic Books efforts must be classified as an element. Such
Inc., New York, 1977. an assignment must of course be somewhat ten-
Particle Data Group, "Review of Particle Properties," tative, since in a number of cases, substances
Particle Properties Data Booklet, April 1980, CERN, that have stubbornly resisted decomposition
Geneva. and therefore carried the classification, have
ultimately yielded as new techniques developed.
Cross-references: ANTIPARTICLES, CONSERVA- With the recent growth in detailed knowledge
TION LAWS AND SYMMETRY, CURRENT ALGE- of atomic structure, it became possible to de-
BRA, ELECTROWEAK THEORY, GAUGE THE- fine an element in terms of the submicroscopic
ORIES, GRAND UNIFIED THEORY, QUANTUM structure of matter. Such a definition is relieved
CHROMODYNAMICS, QUANTUM ELECTRODY- of the ambiguities mentioned above. Thus an
NAMICS, QUARKS, STRONG INTERACTIONS, element is a sample of matter that consists of
WEAK INTERACTIONS. only one kind of atom, the atoms being identi-
ELEMENTS, CHEMICAL 390

fied in terms of their atomic number, or nuclear nuclear charge but differing in mass number or
charge. Each element is composed of atoms atomic weight are referred to as ISOTOPES. If
having characteristic nuclear charge and an all of these are considered, then there are ap-
equivalent number of electrons. This nuclear proximately 1000 different atomic species rep-
charge can be determined by the charged-par- resented in the list of chemical elements. Only
ticle scattering technique first employed by 18 of the elements existing in nature exhibit a
Rutherford or by the simpler and more precise single mass number, and some, tin being a good
method of Moseley which relates the frequency example, have as many as ten naturally occur-
of the characteristic x-rays produced by the ele- ring stable species differing only in their neutron
ment upon electron bombardment to the nu- number or mass. All of the elements exhibit a
clear charge z by means of the equation variety of mass modifications which are arti-
ficially produced, but these are unstable or
w=R(z-b)2 radioactive. These artificially produced species,
of course, make up the bulk of the previously
where w is the wave number of the x-ray, z is mentioned 1000 different entities. The natu-
the nuclear charge, and band R are constants. rally occuring radioactive isotopes, in addition
This definition removes the ambiguities to those beyond bismuth, include carbon 14,
created by such observations as the decompo- chlorine 36, vanadium 50, potassium 40, rubid-
sition of elemental molecular hydrogen or ni- ium 87, indium 115, lanthanum 138, neodym-
trogen by high temperatures into atomic spe- ium 144, samarium 147, lutetium 176, tan-
cies, or the decomposition of the rare gases into talum 180, rhenium 187, and platinum 190.
charged species in an electric discharge. This The Natural Distribution of Elements The
type of decomposition, which does not effect relative abundance of the elements is quite dif-
the underlying nuclear structure is thus ex- ferent for the earth's crust from what it is
cluded from the operational definition originat- thought to be for the universe as a whole. These
ing with Boyle. terrestrial abundances are listed in Table 1 from
Numbers and Kinds of Elements The num- which it can be seen that only thirteen elements
ber of substances recognized as chemical ele- comprise over 98 per cent of the earth's crust,
ments has steadily increased since the publica- the oceans and the atmosphere. It will be noted
tion of Lavoisier's list which included about that many of the common and important ele-
thirty of the true elements. Today it is recog- ments of commerce are not included in this
nized that there are some 90 naturally occuring list, but rather belong to the remaining 2 per
elements, the exact number depending upon the cent of the earth's crust. Copper, lead, and
level of abundance considered limiting. There nitrogen are especially conspicuous by their
are also about 15 artificial ones with this latter absence.
number possibly increasing as the techniques of If we turn our attention now to cosmic abun-
nuclear science improve. The artificial elements dances, the list has quite a different make up
include those with atomic numbers above 92 as (see Table 2). This list of course is known with
well as promethium and technetium. Techne- considerably less accuracy since it is our at-
tium is also observed in stars, and in that sense tempt to guess at the relative abundance of the
is naturally occuring. (See table in the article chemical elements in the entire universe: our
ISOTOPES). galaxy, all the other galaxies, and the vast, but
The 105 presently known elements represent
distinct chemical species differing by integral
units of positive charge (corresponding to the
charge of the proton) beginning with element TABLE 1. THE THIRTEEN MOST ABUNDANT
number 1, hydrogen, and progressing through ELEMENTS ON THE EARTH'S SURFACE
element number 105. Species having nuclear
charges between 1 and 105 have all been identi- Element Abundance (%)
fied, so it can be said that the list of elements is
complete, except for the possibility of adding Oxygen 49.52
new ones with atomic numbers greater than Silicon 25.75
105. Considerable effort, both experimental Aluminum 7.51
and theoretical, is currently being expended in Iron 4.70
the effort to find super-heavy elements of un- Calcium 3.39
usual stability. With the discovery of element Sodium 2.64
103, the actinide series was completed. Both Potassium 2.40
104 and 105 are quite unstable; however, is- Magnesium 1.94
lands of stability are predicted to occur be- Chlorine 1.88
tween atomic numbers 114 and 126 and again Hydrogen 0.88
between 164 and 194. Titanium 0.58
Most of the elements exhibit variations in Phosphorus 0.120
mass, due to the varying numbers of neutrons Carbon 0.087
present in their nuclei. Atoms having the same
391 ELEMENTS, CHEMICAL

TABLE 2. THE THIRTEEN MOST ABUNDANT not entirely empty spaces in between. This in-
ELEMENTS IN THE UNIVERSE formation has largely been gathered by spectro-
scopic studies of the light emitted by the lum-
Element Relative Abundancea inous bodies in these galaxies and by careful
analysis of the samples of off-planet material
Hydrogen 3.5 X 108 (meteorites) that constantly shower the earth,
Helium 3.5 X 10 7 as well as lunar-samples.
Oxygen 2.2 X 10 5 It can readily be seen that this list is quite dif-
Nitrogen 1.6 X lOs ferent from the terrestrial abundance list. For
Carbon 8 X 104 the universe as a whole, the elements hydrogen
Neon 2.4 X 104 and helium far outrank all others, while on the
Iron 1.8 X 104 earth, hydrogen is only tenth in abundance and
Silicon 1 X 104 helium doesn't even appear on the list. Oxygen
Magnesium 9 X 10 3 remains high, and nitrogen, absent from the
Sulfur 3.5 X 103 terrestrial list, is the fourth most abundant ele-
Nickel 1.3 X 10 3 ment when cosmic abundances are considered.
Aluminum 8.8 X 102 In Fig. I, some of the interesting variations in
Calcium 6.7 X 102 abundance are displayed. The relative abun-
dance is plotted against mass number.
aThese abundances are relative to silicon taken as It will be noted that elements with atomic
1 X 104 . weights that are multiples of four and two are

20 40 60 80 100 120 140 160 180 200

MASS NUMBER
FIG. 1. [From Cameron, A. G. W., Sky and Telescope, 254 (May, 1963).)
ELEMENTS, CHEMICAL 392

more abundant than nearby elements with rela- The heavier elements, up to and including
tively similar atomic weights. Examination iron, are generally the result of carbon fusion in
shows that those elements having proton or very massIve stars with interior temperatures of
neutron numbers 8, 50, 82, and 126 also ex- the order of one billion degrees. This produces
hibit maxima. These numbers are so-called a "soup" of nuclei, nucleons, alpha-particles,
magic numbers confering an especially high de- and photons in which reaction between the
gree of nuclear stability. lighter particles and heavier nuclei leads to ele-
The Origins of the Elements The origin of ments as heavy as iron.
the elements and their eventual appearance in To form elements up to the mass of bismuth
the observed abundances in the earth has been the s- (or slow) process is invoked. This involves
the subject of intense research and speculation the equilibrium addition of neutrons to the
for many years. The currently accepted ideas intermediate nuclei with intervening beta-decay.
involve a series of events, some dating back to This growth of · heavier and heavier nuclides is
the "beginning" and others continuing to occur arrested by the intervention of a number of
in appropriate locations throughout the uni- very short-lived species. The trans-bismuth ele-
verse even today, which result ultimately in the ments require an environment of even higher
production of all the known naturally occuring pressure and neutron abundance, as found in
elements. the cores of supernovae.
The first part of the story, one of remarkably Eventually stars eject matter containing this
short duration, involves a modified "big bang" array of elements into interstellar space. The
of the type postulated by Gamow. At the be- Earth, the Sun, and other planets are presumably
ginning of our universe all matter was thought the result of accretion of this cosmic dust, in a
to exist in a highly condensed sphere of neu- whirling cloud which centrifuges the heavier
trons (the ylem). This giant nucleus underwent nuclides outward, leaving the helium and hy-
a cataclysmic explosion and with a half-life of drogen to coalesce to form our central star.
around eleven minutes the neutrons were con- Synthetic Elements The first of the syn-
verted to protons and electrons. Because of the thetic elements was made in 1937 by the bom-
very high photon density in the ylem about bardment of molybdenum with deuterons. This
25% of the resulting expanding cloud was con- produced technetium, filling a long recognized
verted into helium by condensation of the gap in the periodic table for element number 43.
hydrogen nuclei, concomitantly filling the The other gap, at number 61, promethium, was
universe with black body radiation as the cloud filled in 1947 when 149Pm was isolated in small
cooled. The existence of such radiation, corre- yield from the products of fission of 235U.
sponding to a temperature of 2.7°K has been Since that time 13 additional synthetic ele-
confirmed experimentally. ments have been made, all lying beyond uranium
The remainder of the story is spread over a in the periodic table. These are formed by a
period of from 10 to IS billions of years and variety of processes involving neutron capture
involves the cycle of stellar evolution familiar or bombardment with heavy ions.
to the astronomer. The original cloud of inter-
stellar matter suffered local condensation into RUSSELL H. JOHNSEN
billions of stars. As these stars consume their
hydrogen by a fusion process they undergo a
series of changes in internal conditions acting References
as the factories of the chemical elements. At 1. Chaisson, E., "Cosmic Dawn-The Origins of Life
first only helium is formed, but as the hydrogen and Matter," Boston, Little, Brown and Co., 1981.
is consumed and helium formed the tempera- 2. Choppin, G. R., and Johnsen, R. H., "Introductory
ture of the core falls, allowing it to contract by Chemistry," Reading, Mass., Addison-Wesley Inc.,
gravitational attraction. This contraction offsets 1972.
the cooling trend and the core becomes even 3. Johnsen, R. H., and Grunwald, E., "Atoms, Mole-
hotter-heating the outer hydrogen mantle and cules and Chemical Change," Third Ed., Englewood
causing it to expand and cool, producing the Cliffs, N.J., Prentice-Hall, Inc., 1971.
so-called red giant star. Continued contraction 4. Pagel, B. E. J., and Edmunds, M. G., "Abundances
of the core raises its temperature until helium in Stellar Population and the Interstellar Medium in
fusion is possible, producing the lighter elements Galaxies," Ann. Rev. Astron. Astrophys. 19, 77
such as carbon, oxygen, neon, magnesium, and (1981).
silicon. Essential to the production of these 5. Wannier, P. G., "Nuclear Abundances and Evolu-
lighter elements is the possibility of utilizing tion of the Interstellar Medium," Ann. Rev. Astron.
the extremely short-lived product of the first Astrophys. 18, 399 (1980).
condensation of two helium nuclei, 8Be, (t 1/2 = 6. Keller, O. K., Jr., Burnett, J. L., Carlsen, T. A., and
2 X 10- 16 s). Fortunately the density of matter Nestor, C. W., Jr., "Predicted Properties of the
in the red giant cores is so great that the proba- Super Heavy Elements I., Elements 113 and 114,"
bility of subsequent reaction of beryllium with J. Phys. Chern. 74, 1127 (1970).
helium to produce the lighter elements is very 7. Seaborg, G. T., "From Mendeleev to Mendelevium
high. and Beyond," Chemistry 43, 6 (1970).
393 ENERGY LEVELS, ATOMIC

8. "Physics and Chemistry of the Earth, Vol. 2: Ori- orbit, and is constrained to have the values of
gin and Distribution of the Elements," Proceedings zero or positive integers less than n. The num-
of the 2nd Paris Symposium (L. H. Ahrens, Ed.), ber, ml, represents the component of I along a
Pergamon Press, Oxford, 1977. given axis, and must take on the values of zero
or positive and negative integers whose ab-
Cross-references: COSMOLOGY, ELECTRON, ISO- solute values are less than or equal to the value
TOPES, NEUTRON, PERIODIC LAW AND PERI- of I. The quantum number, ms, can be +~ or
ODIC TABLE, PROTON, RADIOACTIVITY. -!, and represents the component of the spin
along the axis.
Within a given atom, electrons having the
same value of the principal quantum number, n,
ENERGY LEVELS, ATOMIC form a definite group or "shell." Those elec-
The term "energy level" is used in referring to trons possessing the same value of I for a given
discrete amounts of energy which atoms and value of n are in the same subgroup or "sub-
molecules can have with respect to their elec- shell." The possible number of electrons in a
tron or nuclear structure. The concept of per- shell or subshell depends upon the possible
missible discrete energy levels was first intro- values of ml and ms. Whenever a subshell is
duced by Planck in explaining the physical filled, the total angular momentum of the elec-
basis for the spectral distribution of blackbody trons involved is zero. Electrons outside of
radiation. A second related principle due to filled subshells contribute additional angular
Planck was that the emission and absorption of momentum which is summed vectorially and
radiation are associated with transitions between assigned the numbers J, Land S. Here J is
these energy levels, the energy thereby lost or representative of the total angular momentum
gained being equal to the energy, hv, of the L the orbital angular momentum, and S the
quantum of radiation. Here h is Planck's con- spin angular momentum.
stant and v is the frequency of the radiation. An atom is stable only when it exists in the
The first application of energy levels in the state for which the quantum numbers of its
electron structure of atoms to explain optical electrons give the lowest total energy. The
spectra was made by Bohr. The original Bohr energy of the atom may be increased to a higher
atom had as its basis that the only allowable level by having an electron "excited" to another
states of an atom were those in which the elec- state represented by a different set of allowed
tronic angular momentum was an integral mul- quantum numbers. Transitions back again to
tiple of h/2rr. Circular orbits suggested by Bohr the "ground" state will be accompanied by the
were extended by Sommerfeld to include the emission of radiation. Wave mechanics indi-
quantization of momentum in elliptic orbits, cates, however, that only certain transitions
and to provide an improved explanation of from one quantum state to another can be
optical spectra. probable. These "selection rules" specify that
These early concepts were modified by the D.L = ±l and that D.J = 0 or fl.
development of the theory of wave mechanics, Following the principles just given, an elec-
in which it was shown that the allowable "sta- tron energy-level diagram can be constructed
tionary" states for the electrons in an atom for the excited states of the atoms of any
must represent solutions of the Schrodinger particular isotope. Discrete energy levels will
wave equation. These solutions are conveniently exist for the allowed quantum states of excited
represented by a set of "quantum numbers" for electrons. The spectrum of radiation which can
each electron. On this basis the electron struc- be emitted from the isotope will be determined
ture of an atom containing any number of by the energy differences between these states,
electrons can be built up. Two further concepts where the transitions involved are allowed by
that are essential to this picture, however, are selection rules. Figure 1 illustrates such an
electron spin proposed by Uhlenbeck and energy-level diagram for sodium.
Goudsmit, and the exclusion principle due to This diagram indicates that very little differ-
Pauli. In addition to the angular momentum of ence in energy level results from changes in the
the electron in its orbit, each electron possesses electron spin orientation. These orientations
angular momentum due to spin about an axis. correspond to different values of J for the same
The Pauli exclusion principle specifies that no value of L. No attempt is made to show a
two electrons in an atom can exist in the same separation in the diagram for the cases of L = 2
quantum state, corresponding to the same set and L = 3. Such closely spaced energy levels
of quantum numbers. due to the effect of the coupling of the electron
Each electron in an atom can be characterized spin give rise to "fine structure" in spectra.
by four quantum numbers, n, I, ml, ms. The Many spectral lines, when examined with high
energy of an electron depends principally upon resolving power instruments, are found to ex-
the positive integer n, and larger values of n hibit a still finer structure of several lines very
correspond to larger electronic orbits. The close together. This is termed "hyperfine struc-
quantum number I possesses physical signifi- ture" and has been found to be due to two
cance in terms of angular momentum in the causes. One is the isotope effect in which atoms
ENERGY LEVELS, ATOMIC 394

J=l
2
J=-23 J=-21 J=~OR.§.
2 2
J=lOR.§.
2 2
L=O L=I L= I L= 2 L=3
5.121-----=,..:--------.,~=_____,==_-____r--_____,_--___i
5.0
9
8
7
6

4.0

en
~ 3.0
o
>
z
o
g: 2.0
o
W
...J
W

1.0

o 3-..L....,/;......,j'-----..J GROUND LEVEL

FIG. 1. Energy-level diagram for sodium. The large numbers are wave
lengths in A of radiation emitted or absorbed during the indicated transi-
tions. Principal quantum numbers are shown as integers.

of different isotopes of the same element direction result in different energy levels. This
possess slightly different excited electron energy splitting of levels is referred to as the Zeeman
levels. The other cause of hyperfine structure effect in the case of an applied magnetic field,
has been determined to be due to the fact that and the Stark effect in the case of an applied
the atomic nucleus also possesses angular mo- electric field (see ZEEMAN AND STARK EF-
mentum, which is vectorially added to the elec- FECTS). The amount of splitting increases with
tronic angular momentum and quantitized. Dif- the intensity of the superposed field. In addi-
ferences in the resultant states of the atom tion, tl«( spectral structure can vary due to a
correspond to very small differences in the tendency for the orbital momentum, L, and
energy levels and hence in the observed the spin momentum, S, to become uncoupled
spectrum. and undergo space quantization independently
Another source of structure in spectra results in high fields. In weak fields, the space quantiza-
when the atoms emitting or absorbing the radia- tion is determined from the total angular mo-
tion are in a magnetic or an electric field. In a mentumJ.
field, space quantization in the direction of the The splitting of energy levels in electric or
field takes place. The values of the magnetic magnetic fields has become much more im-
moment or electric moment of the atom asso- portant in another type of phenomenon re-
ciated with the various possible components of ferred to as magnetic resonance (see MAGNETIC
angular momentum as quantitized in the field RESONANCE). Two examples are electron spin
395 ENERGY LEVELS, ATOMIC

resonance (esr) and nuclear magnetic resonance


(nmr). If a single level is split into its Zeeman
components by a magnetic field H, electromag-
netic radiation of frequency v will induce tran-
sitions between the Zeeman levels if the fol-
lowing condition is satisfied:

hv =gJAH

Here g is energy level "splitting factor" and JA


is the appropriate "magneton" (Bohr magneton
JAB for esr and nuclear magneton JAN for nmr).
(See reference 1.) For magnetic field strengths
conveniently available in the laboratory, the
resonance frequencies are in the short radio or
microwave regions.
Any electron of an atom may be excited to
some higher allowed energy level by absorption
of the amount of energy specified by the dif-
ference in the energy levels involved. By absorp-
tion of a sufficient amount of energy, any elec- FIG. 2. Mutual potential energy V and force F as a
tron can be removed from an atom, resulting in function of atomic separation in a diatomic molecule.
ionization. It is not necessary to consider only F is repulsive when positive and attractive when
the outermost loosely bound electrons. When negative.
electrons from inner shells are excited or re-
moved, the process of returning to the "ground"
or lowest energy state involves the emission of
"characteristic" x-rays. They are "characteris- tional" bands are observed in molecular spec-
tic" in that the x-ray spectrum produced is troscopy, depending upon what other transi-
typical of the particular atom producing the tions may be simultaneously involved, in the
radiation. Atoms of higher atomic number and ultraviolet region, the visible, the infrared, and
transitions involving electrons in innermost even the microwave region. See reference 3 for
shells produce higher-energy radiation. Absorp- further details on molecular energy levels and
tion, as well as emission, of characteristic x-ray associated spectra.
radiation is observed between allowed electronic An atom may not only combine with others
energy levels. For further information on atomic to form a molecule, but may be one of a large
spectra and energy levels, see reference 2.
In addition to the energy levels associated
with particular types bf atoms, wave mechanics
shows that discrete quantum states and energy
levels are associated with molecular structure. FREE
New energy levels arise from vibration and ro- SOUD ATOM
tation of molecules. To illustrate the allowable
vibrational states for a diatomic molecule, con-
sider Fig. 2, which shows the mutual force, F,
and the potential energy, V, plotted as a func-
tion of the separation, r, between the two
atoms. Possible energy levels are indicated by
the dotted lines. Even the lowest state cor-
responds to an energy greater than Va, where >-
<:>
cthe force between the atoms would be zero, a::
UJ
and hence has some associated kinetic energy. z
UJ
The rotation of a molecule has a quantitized
angular momentum which can be vectorially
combined with that of the electrons. Transi-
tions from one level to another usually involve
a change in the electronic state as well as in the
rotational state. Since energy differences due to
allowed changes in rotational motion are very
small compared to the energy differences in
electronic states or vibrational states, the effect
of transitions in rotational states is to produce
bands of very closely spaced frequencies in the FIG. 3. The electron levels of a free atom split into
emission and absorption spectra. Such "rota- bands when the atom enters a solid.
ENERGY LEVELS, ATOMIC 396

number of atoms forming a crystal, Here solu- to many gigawatt-hours (pumped hydroelectric
tions to the wave equation show that within storage). Energy may be stored in many ways,
the solid, the individual energy levels of the including storage as potential energy in the
free atom broaden into bands of overlapping earth's gravitational field (raising a weight),
levels, This is illustrated in Fig. 3. The bulk kinetic energy (a spinning flywheel), thermal
electrical and optical properties of solids are energy (heating a substance), chemical energy
determined by the nature of these energy-level (carrying out a reversible chemical reaction),
bands. Metals have the highest occupied energy or electrochemical energy (carrying out a
band unfilled, while satisfying the exclusion reversible electrochemical reaction). This article
principle that no two electrons occupy the deals with the storage of energy by means of
same state. This permits electrons to gain energy electrochemical reactions which can be carried
under the application of an electric field and out in both forward and reverse directions. The
flow freely through the solid. Insulating crys- devices which provide this capability are called
tals have the highest occupied energy band rechargeable (or secondary*) electrochemical
filled and appreciably separated from the next cells. Cells may be connected to series and/or
higher band. Semiconductors represent an in- parallel to form arrays called batteries.t,:j:
termediate situation where electrons can be in- There are many current applications for elec-
jected into an unfilled band to contribute to trochemical energy storage devices, varying
conduction. from aerospace (batteries for satellites' on-
If the solid crystal is not completely regular board power, recharged by solar cells; starting
but contains imperfections or impurity atoms, batteries for aircraft) to mnitary (communica-
specific electron energy levels will be associated tions equipment, torpedoes, submarines) to
with these sites. This condition is responsible industrial (electric forklift trucks, other electric
for luminescence and phosphorescence in cer- vehicles) to consumer (battery-powered appli-
tain solids. Additional information on energy ances, automobile batteries). Future applications
bands in solids is given in reference 1. that are providing incentive for current research
and development include energy storage for
WILLIAM E. PARKINS electric utility systems (megawatt hours), for
solar and wind-powered electric systems (kilo-
watt hours to megawatt hours), and for electric
References vehicles (tens of kilowatt hours).
1. Kittel, Charles, "Introduction to Solid State Phys- The characteristics of rechargeable batteries
ics," Fifth edition, New York, John Wiley & Sons that are important in inany applications include
Inc., 1976. the specific energy (energy stored per unit bat-
2. Herzberg, Gerhard, and Spinks, J. W. T., "Atomic tery mass, Wh/kg), the specific power (power
Spectra and Atomic Structure," Second edition, delivered per unit battery mass, W/kg), the cycle
New York, Dover Publications, 1944. life (number of discharge-charge cycles before
Condon, E. U., and Odishaw, Hugh, "Handbook of failure), the lifetime (time to failure), and the
Physics," Second edition, New York, McGraw-Hill cost (usually expressed as $/kWh of energy
Book Co., 1967. storage capability). Of course, it is desirable to
3. Herzberg, Gerhard, "Molecular Spectra and Molec- have batteries of high performance, long life,
ular Structure," Vol. 1, "Spectra of Diatomic Mole- and low cost. The scientific implications of
cules," Second edition, New York,D. VanNostrand these goals are shown in Table 1.
Co., 1950, and Vol. 3, "Electronic Structure of There are a number of fundamental character-
Polyatomic Molecules," New York, D. Van Nostrand istics that enable the achievement of the goals
Co., 1966. of Table 1. The desired characteristics for each
of the three essential parts of a cell (the nega-
Cross-references: ATOMIC PHYSICS, ATOMIC SPEC- tive electrode, the positive electrode, and elec-
TRA, ELECTRON SPIN, MAGNETIC RESONANCE, trolyte) are listed in Table 2. A large electro-
MOLECULES AND MOLECULAR STRUCTURE, negativity difference between the reactants, and
NUCLEAR STRUCTURE, SCHRODINGER EQUA- small equivalent weights yield a high theoretical
TION, SEMICONDUCTORS, SPECTROSCOPY, specific energy for a cell:
X-RAYS, ZEEMAN AND STARK EFFECTS.
-t:.G
Theoretical specific energy =-==---
L'YjMj
ENERGY STORAGE, ELECTROCHEMICAL
Introduction Energy storage is important in
alleviating discrepancies between energy supply *Primary electrochemical cells are based upon reac-
and energy demand. These discrepancies may tions that cannot be reversed; the cells are discarded
be either temporal or spatial or both. Energy after a single discharge.
storage is already being used in many applica- tOften the term battery is misused in place of cell.
tions, varying in size from a few milliwatt- :j:See ENERGY STORAGE, THERMAL-MECHANI-
hours (e.g., electric watches and calculators) CAL for other types of energy storage systems.
397 ENERGY STORAGE, ELECTROCHEMICAL

TABLE 1. GENERAL GOALS FOR which operate at high temperatures (for reasons
BATTERIES. given below).
Ambient-Temperature Rechargeable Cells
Goal Implication Rechargeable cells that operate at ambient
temperature in almost all cases make use of
High specific High cell voltage aqueous electrolytes. The use of aqueous
energy Low equivalent weight electrolytes limits the choice of electrode
High utilization of active material materials to those that do not react with the
water in the electrolyte at an unacceptably high
High specific High cell voltage rate. This consideration eliminates the alkali
power Rapid reactions metals (e.g., Li, Na, K) and the alkaline earth
Rapid mass transport metals (e.g., Mg, Ca, Ba) from consideration,
Low internal resistance even though they have low electronegativities
Long life Reversible reactions and low equivalent weights, and other desirable
Low corrosion rates properties called for in Table 2. Materials that
Negligible rates of side reactions are sufficiently stable with aqueous electrolytes,
have acceptably low electron egativities , take
Low cost Inexpensive, plentiful materials part in electrochemical reactions that can be
Simple manufacturing processes reversed, and have other desirable properties as
High efficiency negative electrodes include Pb, Cd, Fe, and Zn.
Little or no maintenance Elements that have high electronegativities are
found in groups V, VI, and VII of the periodic
chart of the elements. These have been used
both as elements (e.g., Cl z , Brz) and in com-
where 'Yi is the number of moles of reactant i, pounds (e.g., metal oxides such as PbO z ,
Mi is the molecular weight of reactant i, and NiOOH, AgO) in positive electrodes.
tlG is the Gibbs free energy change for the The lists of negative and positive electrode
overall cell reaction. The theoretical specific reactants given above can be used to select pairs
energy is a useful guide in the selection of cells of electrodes to make cells, along with suitable
for applications in which high specific energy is electrolytes. Aqueous electrolytes are usually
important (e.g., mobile uses). For cells with strongly acidic (e.g., H Z S0 4 ) or strongly alka-
solid reactants, it can be.expected that 20-25% line (e.g., KOH) in order to maximize the elec-
of the theoretical specific energy will be ob- trolytic conductivity, which contributes to
tained from well-designed cells. A plot of the achieving a low cell resistance and therefore a
theoretical specific energy for a number of cells high specific power capability. The cells that are
is shown in Fig. 1. currently available, or are under development
A careful examination of Fig. 1 reveals that include Pb/H zS0 4 /PbO z , Cd/KOH/NiOOH,
the cells with the highest theoretical specific Fe/KOH/NiOOH, Zn/KOH/NiOOH, Cd/KOHl
energy generally involve reactants that are not AgO, Zn/KOH/AgO, Zn/ZnBrz-KCI/Br2, Znl
stable in contact with an aqueous electrolyte ZnClz/Cl z .
(Li, Na), whereas those of lower specific energy The Pb/PbO z cell is the most widespread and
are compatible with aqueous electrolytes (Pb, least expensive of all those currently available.
Cd, Fe, NiOOH, etc.). This feature results in the Its most common use is for starting, lighting,
subdivision of rechargeable cells into two major and ignition purposes in automobiles. The auto-
categories: those with aqueous electrolytes, mobile battery stores about 1 kWh. With about
which operate at ambient temperature, and 10 8 automobiles in the U.S., nearly 108 kWh of
those with nonaqueous electrolytes, most of energy is stored in these batteries. For electric

TABLE 2. DESIRABLE CHARACTERISTICS OF SYSTEMS FOR


HIGH-PERFORMANCE ELECTROCHEMICAL CELLS.

Negative Positive
Electrode Electrode
Characteristic' Reactant Reactant Electrolyte

Electronegativity low (-1) high (> 1.5)


Equivalent weight (g/g-equiv.) low «30) low «30) low* «30)
Conductivity (ohm -1 cm -1) high (>104 ) high (> 104 ) high (> 1)
Electrochemical reaction rate (io, A/cm 2 ) high (>10 3 ) high (>103 ) high (>10 3 )
Solubility in electrolyte (mol %) low «0.1) low «0.1)
Mass transport rate (equiv.f(sec-cm 2 » high (>104 ) high (> 104 ) high (>104 )

• A more important criterion for the electrolyte is low density.


ENERGY STORAGE, ELECTROCHEMICAL 398

10,000~------------------------~

C>
~

"
C>
.r=.I 3000 ~

104 "....,
~2000 ~

>-
.
C> C>
~
~
Q) Q)
c c
w 1000 w

-
u
u
800
co 600
0.
-u
u
Q)
0.
00 00
~ 400 ~
0 0
Q)

.-.r=. 103 ~
.-
200

100
Equiva lent Weight, g/ equiv
FIG. 1. Theoretical specific energy of candidate cells. Lines are for cell voltages from 1 volt to 4 volts,
as labeled.

vehicle applications, the specific energy of efforts to develop a lighter-weight, lower-cost


PbjPb0 2 at 25-40 Whjkg is too low-70 Wh/kg FejNiOOH cell for electric vehicle propulsion,
is needed for an urban driving range of 150 km but its modest specific energy of 40-50 Whjkg,
(a common goal). For electric utility energy and its low efficiency (-60%) limit its applica-
storage, the cost of PbjPb0 2 (-$IOOjkWh) is bility for electric automobiles or stationary
too high, and the cycle life is too low. energy storage.
The CdjNiOOH cell, with its very long cycle The ZnjNiOOH cell has a very attractive spe-
life and ruggedness has found various military cific energy of 70-75 Whjkg, but the short
and aerospace applications that do not require cycle life of the Zn electrode «300 cycles) has
storage of much energy (there is a limited supply prevented it from achieving wide application.
of Cd, and it has a high cost). A significant con- Research and development efforts aimed at
sumer market for use in small battery-powered improving the Zn electrode continue.
appliances has also developed. The Cdj AgO and Znj AgO cells offer higher
The FejNiOOH cell, also known as the Edison specific energy than their NiOOH counterparts,
cell, has disappeared from the marketplace, but the high cost of Ag limits them to military,
largely because of its high cost. There are now aerospace, and specialty applications usually
399 ENERGY STORAGE, ELECTROCHEMICAL

requiring the storage of small amounts of tures above 180°C, and is therefore difficult to
energy. control in the cell. The same is true for solid
In recent years, Zn/halogen systems with FeS~ as compared to sulfur, which melts at
flowing electrolyte-plus-reactant streams have 118 C. The LiCI-KCI eutectic electrolyte melts
been under development. These systems may at 3S2°C, and the cell is operated at about
be of acceptably low cost (less than $1 OO/kWh) 4S0°C. The theoretical specific energy of this
because they rely heavily on such inexpensive cell is 960 Wh/kg, as shown in Fig, 1. Practical
materials as carbon and polymers. The specific values as high as 180 Wh/kg have been reported
energies are in an intermediate range (about for single laboratory cells. Only a few small
60 Wh/kg for Zn/Brz and 70 Wh/kg for Zn/CI 2 ). laboratory batteries have been built, many large
The safe storage of halogens is an important cells and some multi-kWh batteries of LiAI/
issue for these systems. Chlorine is stored as LiCI-KCI/FeS cells have been demonstrated.
an icelike material Cl 2 . 8H 20 at temperatures This system is quite similar to Li 4Si/LiCl-KCI/
below 9°C. Bromine is stored as a chemical FeS2, but it has a much lower specific energy
complex formed from quaternary ammonium (about 100 Wh/kg vs. f 80-200 Wh/kg).
organic salts. This complex is a dense, oil-like Molten-salt cells may find application for
liquid which spontaneously separates from the stationary energy storage, and for electric
electrolyte. These Zn/halogen flow batteries are vehicles with high duty cycles, so that thermal
more complex than other batteries, and probably cycling to ambient temperature is only infre-
will not be acceptable for mobile applications if quently required. In addition to high specific
alternative batteries of similar specific energy energy, these cells have inexpensive reactants.
are available. Stationary applications are good If the high-temperature corrosion problems can
possibilities, especially if efficiencies above 70% be overcome, molten-salt cells may be attractive.
are achieved. The ceramic-electrolyte cells are best exem-
There has been a great deal of research devoted plified by the Na/Na20' 8A1203/S cell. The
to nonaqueous ambient-temperature recharge- electrolyte is called beta alumina, and can
able cells, especially those using Li electrodes, conduct sodium ions in the crystallographic
with the hope of achieving 100-200 Wh/kg. planes perpendicular to the C-axis. The purity
This has been a difficult research area because and crystal size of the electrolyte are critical
all the electrolytes investigated have been to achieving long life and high performance.
thermodynamically unstable in contact with This cell operates at 3S0°C, and uses liquid Na
lithium. Consequently, the lithium electrode and liquid S as the reactants. A graphite felt
has not been capable of being recharged very material serves to collect the current in the
many times before failure, and there have been nonconductive sulfur. The electrolyte is used in
some explosions resulting from rapid reactions tube form, with one reactant inside, the other
of lithium with the electrolyte. In addition, outside.
only a few candidate positive-electrode materials The ceramic electrolyte is made from materials
have shown the ability to be recharged repeat- which are basically inexpensive, but purity,
edly. One of these is TiS 2 , a layered crystal that quality control, and manufacturing conditions
permits the entry of Li atoms between its crys- have made the process extremely expensive,
tal planes. Research continues on rechargeable and a major topic for continued development.
ambient-temperature lithium cells, but none are The electrolyte should be strong, very dense
available yet. (>97% of theoretical), of low resistivity « I 0
High·Temperature Systems It has been found n-cm), have controlled crystal size and struc-
that the alkali metals, which promise high spe- ture and be highly pure.
cific energy, can be used more readily at elevated Thousands of single cells of various sizes up
temperatures than at ambient temperature. This to 100 -200 Ah have been tested, as well as
is the case because elevated-temperature opera- several multi-kWh batteries, and one 7S-100
tion permits the use of stable molten-salt or kWh battery. With additional development of
ceramic electrolytes which have high conductiv- the electrolyte, work on sulfur corrosion, and
ities for alkali metal cations. For various chemi- cost reduction, the Na/S cell could find applica-
cal reasons, molten salt electrolytes are used tions similar to those mentioned above for
with the lithium electrode, and ceramic elec- Li4Si/ FeS2 .
trolytes are used with the sodium electrode. Summary and Outlook The several recharge-
The positive electrodes are sulfur or compounds able cells discussed above are listed with some
containing sulfur. These cells operate at 3S0- of their important characteristics in Table 3. It
47 SoC, and large batteries would be kept at is clear from this table that none of the systems
operating temperature by means of efficient at present meets the goals for the major energy
thermal insulation to contain the heat produced storage applications of electric automobiles
by the battery in its normal operation. (>70 Wh/kg, > I 00 W/kg, >300 cycles, <$100/
An example of the molten-salt cells is the kWh), or energy storage for electric utilities
Li4Si/LiCI-KCI/FeSz cell. The compound Li 4Si (>70% efficiency, 80 kWh/m2 floor space,
is used instead of lithium because it is a light- >2000 cycles, 10 years life, < $SO/kWh). There
weight solid, whereas Li is liquid at tempera- are other applications that may be filled by the
ENERGY STORAGE. ELECTROCHEMICAL 400

TABLE 3. STATUS OF.SOME RECHARGEABLE BATTERIES.

Theor. Actual Actual Cycle Cost


Cell Voltage Wh/kg Wh/kg W/kg Life $/kWh Comments

Ambient Temperature

Pb/H2 S0 4/ Pb02 2.1 175 25-41 50-100 300-800 100 available now
Cd/KOH/NiOOH 1.3 202 40-50 200+ 500-2000 >100 available. expensive
Fe/KOH/NiOOH 1.4 267 40-50 100 500+ >100 batteries on test. low efficiency
Zn/KOH/NiOOH 1.7 326 60-75 150 100-300 >100 cells and modules. short Zn life
Cd/KOH/AgO 1.4-1.1 257 50-65 200+ 500 »100 available, very expensive
Zn/KOH/AgO 1.8-1.5 432 80-110 200+ 100 »100 available, very expensive
Zn/ZnBr2-KCl/Br2 1.8 323t 60 75 400+ •• batteries on test, little life data,
projected 35$/kWh
Zn/ZnCl2/CI2 . 8H20 2.1 405 66 70 1400' complex system, batteries on test
High Temperature

LiAI/LiCl-KCI/ FeS
L4Si/LiCl-KC1/FeS2
Na/Na20·8AI203/S
1.3
1.8. I.3
2.0
458
944
758
60-100
120-180
100-160
60-100
100+
60-130
300+
700
250-1500 ..•••• small batteries on test
single cell status
S corrosion problems, expensive
ceramic, batteries on test

-1 kWh system, with electrolyte maintenance.


··Too early to predict; not available now.
tlncludes weight of com pie xing agent.

newer systems, once developed. Batteries are ENERGY STORAGE,


already ubiquitous for storage of small amounts THERMAL-MECHANICAL
of energy (less than a few kWh); they may ex-
pand their applicability to many MWh. Introduction * Energy storage systems serve to
lower the cost of delivering energy in a suitable
ELTON 1. CAIRNS form to a user at the required time. For ex-
ample, in electric utility systems the cost to
produce energy can change by as much as a
References factor of five over a twenty-four hour period.
Cairns, E. J., in "Materials for Advanced Batteries" By storing relatively cheap nighttime energy,
Murphy, D. W., and Broadhead, J., Eds.), Plenum the cost for supplying daytime energy can thus
Publishing Corporation, New York, 1982. be significantly reduced.
Birk, J. R., EPRI Journal, p. 6 (October, 1981). Thermal energy storage systems rely on stor-
Douglas, D. L., and Birk, J. R., Annual Review of ing energy in the form of heat; whereas me-
Energy S, 61 (1980). chanical energy storage systems rely on storing
Kalhammer, F. R., Scientific American 241 (6), 56 energy in the form of potential and/or kinetic
(December, 1979). energy. Other forms of energy storage systems
Cairns, E. J., in "Energy and Chemistry" Thompson, use chemical energy (e.g., batteries), electrical
R., Ed.), Royal Society of Chemistry, London, energy (e.g., capacitors) and electromagnetic
1981. energy (e.g., superconducting electromagnets).
Cairns, E. J., and Hietbrink, E. H., in "Comprehensive Physical Principles-Mechanical Energy Stor-
Treatise of Electrochemistry," Vol. 3 (Bockris, age Mechanical energy may be stored in a
J. O'M., Conway, B. E., Yeager, E., and White, R. E., number of forms:
Eds.), Plenum Publishing Corporation, New York,
1981, p. 421. • Potential energy
E. J. Cairns, ibid., p. 341. • Kinetic energy
E. J. Cairns, in "Proceedings of the 3rd International • Elastic (or strain) energy
Symposium on Molten Salts" (Mamantov, G., • Compressed gas energy
Blander, M., and Smith, G. P., Eds.), The Electro-
chemical Society, Pennington, New Jersey, 1981,
p.138. The international system of units (SI) denotes
the (1) as the unit to be used for energy. It is
equivalent to m 2 • kg . S-2 or N . m, where m is
Cross-references: ELECTROCHEMISTRY; ENERGY
STORAGE, THERMAL-MECHANICAL; MOLE CON·
CEPT;WORK, POWER, AND ENERGY. *See also ENERGY STORAGE, ELECTROCHEMICAL.
401 ENERGY STORAGE, THERMAL·MECHANICAL

meters, kg is kilograms, s is seconds, and N is


Newtons.
Potential energy is the energy associated with
the position of an object in a gravitational field
and is equal to MgAh, where M is the mass of
the object in kg, g is the gravitational constant
(at the earth's surface g = 9.8 ms- 2 ), and Ah is
the vertical distance (m) between two points
within the gravitation field.
Kinetic energy can be stored in an object
moving in a straight line where the stored en-
ergy equals MV2(2 and V is the object's velocity
(m S-I). Kinetic energy can also be stored in
a rotating object, with stored energy being
equal to Iwz /2, where I is the object's moment
of inertia (kg' m 2 ) about its axis of rotation
and w is it's angular velocity (s-I). A wheel- FIG. 1. Schematic diagram of pumped hydroelectric
shaped object in the form of a flat disc or radius energy storage power plant.
R (m), thickness T (m), and density p (kg/m 3 )
has a moment of inertia about its center of
1= TrrpR4 /2, which equals MR2(2.
Elastic (or strain) energy can be stored in the point, the heat of vaporization (J (kg) is the
stretching or compressing of a solid object, stored energy.
where the stored energy equals K(AX)2/2. Pumped Hydroelectric Energy Storage
K is the object's spring constant (kg S-2) and Pumped hydroelectric plants (see Fig. 1) rely
AX is the change of the object's length (m). on the principle of potential energy storage and
The above can be generalized to two or three have been in use for over SO years. The total
dimensions for solids and liquids. power capacity (energy delivered per unit of
Compressional gas energy can be stored in a time) all of such plants in the United States is
gas that has been transformed from one pres- currently approximately 13,000 megawatts
sure (p) and volume (V) to another pressure (MW). A typical plant is rated at 1000 MW. The
plant power rating is determined by the poten-

J
and volume. The expression for this stored
V!
tial energy of the water stored in an upper
reservoir (in relation to the vertical distance to
energy is p dV, which represents the math- the lower reservoir) divided by the time it takes
v, for the water to flow through the water piping
ematical integral (or continuous summation) of and pump-turbine generator separating the two
the parameter p from volume VI to volume V z . reservoirs. These power plants have proved to
Pressure is represented by Newtons per square be an economically attractive way for electric
meter (N m- 2 ) and volume is represented by utilities to store inexpensive evening energy
cubic meters (m 3 ). (used to pump the water into the upper reser-
Physical Principles-Thermal Energy Storage voir) and thus provide daytime energy at a lower
Thermal energy may be stored in two forms: cost by allowing the stored upper reservoir water
to flow through a turbine generator into the
• Sensible energy lower reservoir. Although these types of power
• Latent energy plants are very efficient (approximately 72%),
future construction efforts have been hampered
Sensible energy is stored when a substance at by their large cost, diminished topographical
one physical state (e.g., the solid state) under- opportunities, and their possibly negative en-
goes a temperature change. The expression for vironmental impact as perceived by nearby
this stored energy is CV(Tz - T 1 ), where Cis citizenry.
the specific heat capacity of a unit volume of A good example of a pumped storage plant
the substance in J/kg . °c, where °c is tempera- is the Northfield Mountain (Massachusetts)
ture in degrees Centigrade; V is the volume of facility (1000 MW). The water stored in the
the substance (m3); and T is temperature (OC). upper reservoir can produce power for a nomi-
Latent energy is stored when a substance nal six hours; thus the stored energy is 6000
undergoes a change in physical state at constant MWh which is equal to 2.16 X 10 13 J. For com-
temperature (e.g., from liquid state to gaseous parison purposes the amount of energy stored
state). The amount of stored energy depends on in this one facility is approximately 20% of the
the type of substance, its mass, and the type of storage capacity of all the automotive batteries
state change it undergoes. If the change of state in the United States.
occurs at the substance's melting point, the Flywheel Energy Storage Flywheel energy
heat of fusion (J(kg) is the stored energy. If the storage devices rely on the principle of kinetic
change of state occurs at the sUbstance's boiling energy storage and have likely been used by
ENERGY STORAGE, THERMAL-MECHANICAL 402

man longer than. any other type of energy ferent modes during different time periods.
storage device. They have smoothed or leveled During an off-peak period, relatively inexpen-
man's energy sources in such historic applica- sive energy from coal-fired and/or nuclear base-
tions as the potter's wheel, the grain mill, and load units powers the motor to compress air,
the water wheel. More recently, flywheel tech- which is then stored in an underground reser-
nology has been applied in the design of the voir. During the subsequent peak period, the
steam engine, automobile, pile driver, and gyro- compressed air is withdrawn from storage and
scope. The use of the potter's wheel permitting mixed with fuel. The combustion gases created
a craftsman with a heavy wheel to make uni- by burning this mixture are expanded through
formly shaped pots while he supplied leg energy turbines connected to the generator motor to
in unequal amounts and at irregular times, prob- produce electric power. Because the turbine is
ably dates back 5,000 years. not required to drive a compressor during the
As the flywheel spins, a tensile stress or load peak time period (which it must do in a conven-
is created in its structural material. The amount tional combustion turbine system) a CAES
of stress increases as the square of the flywheel plant reduces the use of petroleum- or gas-based
angular velocity. The ultimate strength of the fuels by more than 60%.
flywheel material to withstand this stress limits The underground caverns used to store the
the amount of energy that can be stored. The compressed air can be excavated in three types
relationship between the strength a (N /m 2 ) of of geologic formations: salt domes, hard rock,
the material, the stored energy E, the material and aquifers. Approximately three-fourths of
mass M, and the density p (kg/m 3 ) is a = KEp/M, the continental United States has suitable geol-
where K is a constant that is greater than or ogy for compressed air caverns.
equal to I and is determined from the geometry A 290 MW compressed air plant using under-
of the flywheel. The energy stored in a flywheel ground salt-dome caverns is the first such plant
per unit volume V is EIV =Ep/M =a/K. For a in the world and is currently operating in Hun-
flywheel with K = 2 (associated with a non- torf, Federal Republic of Germany, providing
tapered wheel rim) and a = 2 X 10 8 N/m 2 (a peak-leveling duty for the German utility Nord-
value corresponding to steel), the energy stor- westdeutsche Kraftwerke Ag. (Fig. 3). This
age per unit volume is 10 8 N/m2 = 108 1/m 3 . power plant was commissioned in 1978 and has
Compressed Air Energy Storage Compressed a storage capacity lasting 4 hours. The first USA
air energy storage (CAES) devices have recently application of this storage technology is cur-
become commercialized for electric utility ap- rently under consideration by a number of USA
plications. They incorporate modified state-of- utilities.
the-art combustion turbines and site-specific
underground reservoirs to store off-peak energy
for later use during peak demand periods (see
Fig. 2). The CAES power plants operate in dif-

COMPRESSED AIR STORAGE CYCLE

Unde,ground
AI, SIO,.
R..... ol'

FIG. 3. Turbomachinery of the world's ftrst com-


FIG. 2. Schematic diagram of compressed air energy pressed air energy storage power plant at Huntorf,
storage power plant. Federal Republic of Germany.
403 ENERGY STORAGE, THERMAL-MECHANICAL

TABLE 1. EXAMPLES OF SENSIBLE HEAT AND


LA TENT HEAT MATERIA LS.

Latent Heat
Sensible Heat, *
Specific Heat Heat of
Capacity Heat of Fusion Vaporization
Material (J/ktC) (J/kg) (J/kg)

Water, H2O 4.18 3.34 X 10 5 2.26 X 10 6


Mercury 1.39 X 10 2 1.18 X 104 2.95 X 10 5
Lead 1.28 X 10 2 2.29 X 104
Glauber's salt 1.93 X 10 3 2.51X 10 5

• At 1 atm., 20°C

Sensible Heat Energy Storage Sensible heat input energy is accomplished by the sensible
thermal energy systems have been used by man heat of the water. The specific heat capacity of
for thousands of years to keep dwellings at rela- water as well as some other substances is pro-
tively constant temperature. Thick adobe walls vided in Table 1.
on a dwelling store heat during the day and Latent Heat Energy Storage Latent heat
keep the inhabitants warm at night. This and thermal energy storage systems have not been
other ways to store solar energy utilizing sen- as widely used as sensible heat storage systems,
sible heat systems have recently been redis- although latent heat systems usually have a
covered because of increased interest in reducing greater energy storage potential than sensible
both fuel consumption and associated heating heat systems. This is due to the high energy
and cooling costs. storage density per unit volume which can
Other ways to store heat often used in the theoretically be achieved with latent heat sys-
past include the use of large stones, ceramic tems, as shown in Table 1. Latent heat systems
bricks, and even cast iron blocks near an early also have the advantage of operating over a
evening fire, which transmit their heat as radiant narrow temperature range, which limits the
energy to warm the user after the fire goes out. design specification requirements to be met
More recently, in steam- electric power plants, by the storage device materials,energy input,
the principle of sensible heat has been used to energy output, and control system components.
store energy from steam made available during However, other technical issues have proved
low electricity demand periods. The steam is to be severe enough to hamper the widespread
used to raise the temperature and pressure of use of latent heat storage systems. These include
water contained in tanks. When steam is needed the low thermal conductivity of candidate ma-
during high electricity demand periods, a valve terials and their volumetric decrease during
is opened on the upper portion of the tank to solidification, both of which inhibit the rate
allow the pressure to decrease and produce the of heat transport during the energy input and
needed steam, which then lowers the tempera- output time periods. Also, many candidate
ture of the water. Although the input and out- materials do not completely solidify at their
put of this process is steam, the storage of the freezing point; rather, portions of the materials

TABLE 2. ENERGY STORAGE DENSITIES.

Energy Per Energy Per


Energy Storage Unit Mass Unit Volume
Technology (J/kg) (J/m3) Comments

Pumped hydroelectric 3 X 103 3 X 106 300 m hydraulic head


Flywheel 2 X 10 5 2 X 10 8 Fiber-epoxy composite
material
Compressed air 6 X 10 5 10 7 Plant similar to Huntorf
Sensible head
magnesite brick 4 X 10 5 2 X 10 8
oil 10 5 10 8
water 4 X 10 5 4 X 10 8 In liquid phase at 1 atm,
100°C
Latent heat, salts 10 5 3 X 10 8
ENERGY STORAGE, THERMAL-MECHANICAL 404

stay in a supercooled liquid state below their 7. "Preliminary Design Study of Underground Pumped
theoretical freezing point. This is due to the Hydro and Compressed Air Energy Storage in Hard
slow nature of the propagation of solid crystals Rock," Electric Power Research Institute Research
in a liquid at its freezing point. Project 1081, Report EM-1589, Vol. 1-13, 1981.
One of the more promising materials which
exhibits the above problems to a lesser extent Cross-references: COMPRESSIBILITY, GAS; ELAS-
than other materials is sodium sulfate decahy- TICITY; HEAT; HEAT TRANSFER; MECHANICS;
drate (Glauber salt, Na2S04 . 10H 20). It has SOLAR ENERGY UTILIZATION; VAPOR PRES-
been successfully tested in two solar homes and SURE AND EVAPORATION; WORK, POWER, AND
has been used in heat pumps and air condition- ENERGY.
ing systems.
Summary A comparison of some of the
thermal-mechanical energy storage systems
available today is presented in Table 2. Each ENTROPY
of the storage systems described above has in-
herent characteristics that determine its size, The word "entropy" was coined from the Greek
weight, and associated cost. Taking these fac- by Rudolf Clausius in 1865 to mean "transfor-
tors together with user needs and the commer- mation." He defined the differential change in
cial maturity of the technology, each storage entropy dS by:
system has its preferred application. Some
operate in a mode where the form of the input
dS =dQ/T (reversible) (1)
and output energy is the same as that which is where dQ is the element of heat added reversibly
stored (e.g., ceramic brick thermal storage sys- to a system at absolute temperature T. Since
tems) whereas others operate in a mode where the values of dQ and T may be defined opera-
the input and output energy is different from tionally, there is no need, in the classical view
that which is stored (e.g., pumped hydroelec- of thermodynamics, to explain entropy in
tric energy storage systems). Thus, the compari- molecular terms. As Lewis and Randall have
son shown in Table 1 only gives a portion of written: "Thermodynamics exhibits no curios-
the many factors that need to be considered ity; certain things are poured into its hopper,
when choosing the "best" storage device for certain others emerge according to the well
a particular energy leveling or smoothing known laws of the machine; no cognizance
application. being taken of the mechanism of the process
or the nature and character of the various
ROBER T B. SCHAINKER molecular species concerned."
In 1948, Claude Shannon 3 demonstrated that
References the change in the function defined by
1. Schmidt, Frank W., and Willmott, John, "Thermal
Energy Storage and Regeneration," New York,
S[ = -kL Pi log Pi (2)
McGraw-Hill Book Company, 1981.
2. Silverman, Joseph (Ed.), "Energy Storage: A Vital uniquely measures the amount of information
Element in Mankind's Quest for Survival and Prog- in any message (Pi = probability that the re-
ress, "Transactions of the First International As- ceiver assigns to the receipt of the ith possible).
sembly held at Dubrovnik, Yugoslavia, 27 May- The sum is over all possible messages. Here is
1 June 1979, Conference jointly sponsored by the a simple example of the meaning of entropy.
National Academy of Sciences (USA) and the Suppose you are asked to guess which number
Council of Academies of Science (Yugoslavia), from 1 to 100 has been secretly written on a
Paragon Press, 1980. piece of paper. Taking each Pi equal to 1/100,
3. "Energy Storage User Needs and Technology Ap- it is found that in the above equation, S[ =
plications," An Engineering Foundation Confer- k log (100). Letting k equal 1 and taking the
ence, sponsored by the Electric Power Research logarithm to the base 2 gives S = 6.67 "bits."
Institute and Energy Research and Development This means if each question has only two pos-
Administration, Asylomar, Pacific Grove, Cali- sible answers (Le., "yes" or "no"), it will take
fornia, 8-13 February 1976, National Technical between 6 and 7 questions to find the number.
Information Service, CONF-760212, April 1977. The questioner begins by asking, "Is the num-
4. Hassenzahl, William V. (Ed.), "Mechanical, Ther- ber between 1 and 50?" Depending on the
mal and Chemical Storage of Energy," Strouds- answer, he continues by reducing the possible
burg, PA, Hutchinson Ross Publishing Co., 1981. numbers by a factor of 2 with each question.
5. Kaihammer, Fritz R., "Energy Storage Systems," Since 26 =64 and 27 = 128, the questioner
Scientific American, 241 (6), 56-65 (December, will surely find the number in 7 questions and
1979). 1/3 of the time he will find it in 6 questions.
6. "An Assessment of Energy Storage Systems Suit- Entropy may therefore be said to measure the
able for Use by Electric Utilities," Electric Power state of ignorance of a person relative to a well-
Research Institute Research Project 225, Report defined question, if the person only knows a
EM-264, Vol. 1-3, July 1976. probability distribution. It measures the ex-
405 ENTROPY

pected number of questions he will have to ask The resultant entropy, Sr,max, is, by Gibbs!
in order to go from his state of partial knowl- definition, the equilibrium entropy. The prob-
edge to a state in which he knows everything ability distribution is known as the Boltzmann
about the well-defined question. According to distribution. It is easy to demonstrate that the
the Shannon derivation,3 S[ represents the parameter (3 is equal to l/kT where k is the
amount of information in a message telling the Boltzmann constant and T is the absolute tem-
actual number written on the paper (for some- perature. If the observer is limited to macro-
one for whom Pi = 1/100). The interpretation scopic observations concerned with the energy
of entropy is used in information theory in the of a body, the above derivation leads to laws
design of codes and in the analysis of informa- which connect the observations to one another.
tion transmission systems. That is, the quantities 1/1, (3, (e), S and various
These two entropies would seem to be dif- combinations are related to one another by
ferent and to be connected only through having equations from which Pi and Ei have been elimi-
the same name. However, they have been shown nated. These relations are known as the "Laws
to be identical when applied to the analysis of of Classical Thermostatics".
how to interpret the messages an observer ob- For example, since, from Eq. (4)
tains from instruments which only can measure
the average properties of quantum systems.! d (e) = L eidpi +L PidEi
Consider the question: "In what quantum state
is this system?" Of course one can never say in
which quantum state a system resides but rather it is clear that the changes in the energy of a
can only give a probability for the system being body may be divided into two classes: (a) those
in a particular state. The probability distribution of the type Leidpi which necessarily change
for the states must be consistent with the ob- the probabilities and therefore the entropy
server's knowledge. The information theory (non-isentropic) and (b) those in which
principle of maximum entropy says to choose L eidpi =0 (i.e., isentropic). This division into
a set of probabilities which agrees with the
available data and maximizes the entropy, for,
in accordance with the meaning of entropy, this two classes of energy exchange gives rise to the
is the most noncommittal view. A state of equi- concepts of "heat" and "work." For a detailed
librium is, by the definition given by Gibbs,4 account of the derivation see reference 2.
a state of maximum entropy. From the infor- The information theory definition of entropy
mation theory point of view, it is a state in thus is shown to contain, as a special case, the
which all of the random motions which can entropy of Clausius. The two ways of approach-
take place are, in fact, taking place so that the ing entropy are seen to be consistent with one
observer knows as little about the systems as it another. In the classical case concepts such as
is possible to know beyond his knowledge of heat, work, equilibrium, temperature and state
the "constants of the motion." If we describe are taken as "operationally defined." From the
a system by giving only the pressure, tempera- information theory perspective, these concepts
ture and volume (or other gross properties), we are derived as necessary constructs in any con-
omit many details. Entropy measures how much sistent theory of molecular phenomena. As
more there is to be said before the quantum demonstrated in (1), (2) and (5) the concept
state is specified. of heat has to be invented to describe energy
As an illustration, consider a closed system of exchanges which take place on a scale too fine
particles. From quantum mechanics, we know to be observed directly whereas the work con-
the system is in some state, i, with energy ei. If cept describes energy transfers which are as-
we make an observation of e, the best we can do sociated with changes in macroscopic, observ-
is infer that it represents the expectation energy, able coordinates.
(e), for this is a repeatable quantity associated Entropy, as a logical device for generating
with the motion. To generate the appropriate probability distributions, has been applied in
probability distribution, we maximize Sr de- reliability engineering, decision theory, and the
fined in Eq. (2), with the following constraints theory of steady-state irreversible processes.
on the Pi: (See list of references with reference 5.) The
generalized approach to entropy usage was ini-
L Pi = 1 (the system is in some state) (3)
tiated by Jaynes. 6 Applications of the entropy
principle in engineering are given in reference 7.
A comprehensive review of the method of
LPiEi = (e) (the system has an expectation entropy maximization in many fields is given
energy) (4) in reference 8.
M. TRIBUS
Maximization of the entropy subject to the two References
equations given leads to the probability distri-
bution 1. Tribus, M., "Information Theory as the Basis for
Thermostatics and Thermodynamics," J. Appl.
Pi = e-I/I-(3€i (5) Mech., B (March 1961).
ENTROPY 406

2. Tribus, M., "Thermostatics and Thermodynamics," Generalization of equilibrium consideration


New York, Van Nostrand/Reinhold, 1961. by the second law of thermodynamics specifies
3. Shannon, Claude, "A Mathematical Theory of Com- that the state of thermodynamic equilibrium
munication," Bell System Tech. J. (July and Octo- of a system is characterized by the attainment
ber 1948). of the maximum of its ENTROPY. Thermody-
4. Gibbs, J. W., "Collected Works," Vol. I, p. 56, New namic coordinates are defined in terms of equi-
Haven, Conn., Yale University Press. librium states.
5. Tribus, M., and Evans, R., "The Probability Foun- Equilibrium between two phases of a system
dations of Thermodynamics," Appl. Mech. Rev., is reached when there is no net transfer of mass
765 (October 1963). or energy between the phases. Phase equilib-
6. Jaynes, E. T., "Information Theory and Statistical rium is determined by the equality of the Gibbs
Mechanics," Phys. Rev., 106,620; 108, 171 (1957). fun<;:tions (also called free enthalpy, free energy,
7. Tribus, M., "Rational Descriptions, Decisions and or chemical potential) of the phases in addition
Designs," Oxford, Pergamon Press, 1969. to equality of their temperatures and stresses
8. Levine, R., and Tribus, M., "The Maximum Entropy (such as pressure and/or field intensities-inten-
Formalism," Cambridge, MA, MIT Press, 1978. sive properties). Equilibrium of first-order phase
change requires continuity of slope or first de-
Cross-references: BOLTZMANN'SDISTRIBUTION rivative of the Gibbs function with respect to
LAW, IRREVERSIBILITY, PHYSICAL CHEMISTRY, an intensive property and is generalized as the
STATISTICAL MECHANICS, THERMODYNAMICS. Clapeyron relation. Second- and higher-order
phase changes are given by the condition of
continuity of curvature or second derivative of
EQUILIBRIUM the Gibbs function and so on.
Chemical or nuclear equilibrium of a reactive
In the elementary sense of the macroscopic (vis- system is reached when there is no net transfer
ible to the naked eye) system, eqUilibrium is of mass and/or energy between the components
obtained if the system does not tend to undergo of a system. At chemical or nuclear equilibrium,
any further change of its own accord. Any fur- the Gibbs function of the reactants and the
ther change must be produced by external products must be equal according to stoichio-
means. metric proportions, in addition to uniformity in
Mechanical and Electromagnetic Systems temperature and stresses. Chemical equilibrium
Equilibrium in mechanical and/or electromag- is summarized in the form of the Law of Mass
netic systems is reached when the vectorial Action. The trend for the displacement from an
summation of generalized forces applied to the equilibrium state is specified by Le Chatelier's
system is equal to zero. In any potential field, principle.
that is, gravitational or electric potential or Thermodynamic equilibrium is reached when
magnetic vector potential, force can be ex- the condition of mechanical, electromagnetic,
pressed as gradient of potential (magnetic force thermal, phase, and chemical and nuclear equi-
however, is a curl of a vector potential). The librium is reached.
potential energy therefore has an extremum at Stability of Equilibrium A process or change
the equilibrium configuration. For example, a of state carried out on a system such that it is
system such as a mass suspended by a string always near a state of equilibrium is called a
against the gravitational force (or its weight) is quasi-stationary equilibrium process. This re-
at mechanical equilibrium if the tensile force in quires that the process be carried out slowly. If
the string is equal to the weight of the mass it a mechanical system is initially at the equilib-
supports. The d' Alembert principle further rium position with zero initial velocity, then
states that the condition for equilibrium of a the system will continue at equilibrium indef-
system is that the virtual work of the applied initely. An equilibrium position is said to be
forces vanishes. stable if a small disturbance of the system from
Thermodynamic Systems When a hot body equilibrium results only in small, bounded mo-
and a cold body are brought into physical con- tion about the rest position. The equilibrium is
tact, they tend to achieve the same warmth unstable if an infinitesimal displacement pro-
after a long time. These two bodies are then duces unbounded motion. In the gravitational
said to be at thermal equilibrium with each field, a marble at rest in the bottom of a bowl is
other. The zeroth law of thermodynamics (R. H. in stable equilibrium, but an egg standing on its
Fowler) states that two bodies individually at end is in unstable equilibrium. When motion
equilibrium with a third are at equilibrium with can occur about an equilibrium position with-
each other. This led to the comparison of the out disturbing the equilibrium, the system is in
states of thermal equilibrium of two bodies in neutral (or labile, or indifferent) equilibrium,
terms of a third body called a thermometer. an example being a marble resting on a per-
The temperature scale is a measure of state of fectly flat plane normal to the direction of grav-
thermal equilibrium, and two systems at ther- ity. It is readily seen that stable eqUilibrium is
mal eqUilibrium must have the same tempera- the case when the extremum of potential is a
ture (see THERMODYNAMICS). minimum.
407 EQUILIBRIUM

When dealing with general thermodynamic ber of elements, the distributions are measured
systems, the fact that entropy tends to a maxi- by thermodynamic probability (W); the most
mum in the trend toward equilibrium of a nat- probable distribution is such that W is a maxi-
ural process generalizes the above mechanical mum. This optimal principle is consistent with
consideration with respect to stability. An equi- the condition of maximum entropy (S) given iIi
librium state can be characterized as a stable the above. The Boltzmann hypothesis states
equilibrium when the entropy is a maximum; that S = k In W, where k is the Boltzmann con-
neutral equilibrium when displacement from stant.
one equilibrium state to another does not in- Depending on the specifications of W, namely,
volve changing entropy; and unstable equilib- those of Maxwell-Boltzmann (for low concen-
rium when entropy is a minimum. Any slight tration of distinguishable particles, weak inter-
disturbance from an unstable equilibrium state action and high temperature, such as a dilute
of a system will lead to transition to another perfect gas), Fermi-Dirac (for elemental par-
state of equilibrium. ticles with antisymmetric wave functions at
Statistical Equilibrium In the microscopic high concentrations of indistinguishable par-
sense, that is, treating systems in terms of ele- ticles and low temperatures, such as electrons in
mental particles such as molecules, atoms, and metal), or Einstein-Bose (for elemental particles
other material or quasi-particles (such as pho- with symmetric wave functions, such as He 4 at
tons in radiation, phonons in solids and liquids, high concentration of indistinguishable particles
rotons in liquids), equilibrium states are recog- and low temperature), equilibrium distributions
nized as the most probable states. An equilib- take different forms (see BOSE-EINSTEIN STA-
rium state of a system is therefore defined in TISTICS and FERMI-DIRAC STATISTICS). The
terms of most probable distributions of its ele- Maxwellian speed distribution in a dilute per-
ments among microscopic states which may be fect gas is a distribution based on Maxwell-
defined in terms of energy states. In this sense, Boltzman statistics.
statistical equilibrium is a condition for macro- As a consequence of molecular considera-
scopic equilibrium and an equilibrium state of a tions, when two systems are connected for
system is one of its extremal states. In the transfer of mass without significant transfer of
methods of STATISTICAL MECHANICS, the proba- energy, such as two containers at different tem-
bility of distribution is expressed in terms of peratures connected by a capillary tube, we
the density of distributions in the phase space. have the relation of thermal transpiration.
Based on the Liouville theorem, if a system is in Trend toward Equilibrium The mechanism
statistical equilibrium, the number of the ele- by which equilibrium is attained can only be
ments in a given state must be constant in time; visualized in terms of micrpscopic theories. In
which is to say that the density of distribution the kinetic sense, equilibrium is reached in a gas
at a given location in phase space does not when collisions among molecules redistribute
change with time. For an isolated system, the the velocities (or kinetic energIes) of each mole-
distribution is represented by a microcanonical cule until a Maxwellian distribution is reached
ensemble. At equilibrium, no phase point can for the whole bulk. In the case of the trend to-
cross over a surface of constant energy, and the ward equilibrium for two solid bodies brought
density of distribution is preserved. In this case into physical contact, we visualize the transfer
individual molecules of a system can be repre- of energy by means of free electrons and pho-
sented by phase points. Any part of an isolated nons (lattice vibrations) .
system in statistical equilibrium can be repre- The Boltzmann H-theorem generalizes the
sented by a canonical ensemble. A subsystem of condition that with a state of a system repre-
a large system in thermal equilibrium also be- sented by its distribution function [, a quantity
haves like the average system of a canonical en- H, defined as the statistical average of In [, ap-
semble. A system and a constant temperature proaches a minimum when equilibrium is
bath together can be considered as an isolated reached. This conforms with the Boltzmann
system. A phase point in a canonical ensemble hypothesis of distribution in the above in that
can represent a large number of molecules, thus S = - kH accounts for equilibrium as a conse-
accounting for strong interactions. A canonical quence of collisions which change the distribu-
ensemble is characterized by its temperature tion toward that of equilibrium conditions. In
and is therefore pertinent to the concept of terms of quantum states and their probabilities,
thermal equilibrium. When applied to equilib- it is also readily shown that any value of S
rium of systems involving mass exchange, such below the equilibrium value will increase in
as a chemical system, we have a "particle bath" time. Recursion of an original state for a frac-
in addition to a constant temperature bath. The tion of one gmol of molecules may take a time
pertinent representation for equilibrium includ- period much longer than the estimated life of
ing mass exchange as well as energy exchange is the universe of 10 10 years.
known as a grand canonical ensemble, which ac- Consideration of perturbation from an equi-
counts for the chemical potentials of its librium state leads to methods for dealing with
elements. rate processes and methods of irreversible ther-
When applied to a system with a large num- modynamics in general.
EQUILIBRIUM 408

Fluctuation from Equilibrium A necessary 1. The coupling between different atoms in a


consequence of the random nature of elemental crystal is small compared with the forces
particles in a body is that the property of such holding the electron within the separate
a body is not at every instant equal to its aver- atoms.
age value but fluctuates about this average. A 2. The Born-Oppenheimer approximation is
precise meaning of equilibrium can only be at- valid.
tained from consideration of the nature of such 3. The total wave function is a product of
fluctuations. In the above, we have repeatedly one-electron functions.
considered a "large" number of particles. It is
important to know how large a number is The Frenkel exciton is a tight-binding descrip-
"large." When considering fluctuation of energy tion of an electron and hole bound at a single
from an average value in an isolated system, the site such that their separate identities are not
ratio of the two is given to be proportional to lost. This model is applicable to insulating crys-
1/..jN, where N is the total number of elements tals. In the case of semiconductors, nonequilib-
in the system. This is also the magnitude of the rium electrons and holes are bound in excitons
fluctuation of number of particles in a system at low temperatures by Coulomb attraction.
involving transformation of phases and chemi- Semiconducting crystals are characterized by
cal and nuclear species. An eqUilibrium state is large dielectric constants and small effective
one at which the longtime mean magnitude of masses, therefore the electrons and holes may
fluctuation from the average state is indepen- be treated in a good approximation as com-
dent of time and this magnitude has reached a pletely independent particles, despite the
minimum value. Coulomb interaction. This results from the
Large perturbation from a given state of fluc- fact that the dielectric constant reduces the
tuation leads to a relaxation process toward a Coulomb interaction between the hole and
state of equilibrium. The relaxation time, for electron to the extent that it produces a weakly
instance, measures the deviation from quasi- bound pair of particles which still retain much
stationary equilibrium of a process which is car- of their free character. The exciton forms be-
ried out at a finite rate. cause it represents a state of slightly lower
energy than the unbound hole-electron. The
S. L. Soo effective mass theory used to describe such
References weakly bound particles was developed by
Wannier.2 These weakly bound excitons are
Goldstein, H., "Classical Mechanics," Cambridge, most appropriately described when the band
Mass., Addison-Wesley Publishing Co., Inc., 1956. structure of the solid is considered. The in-
Soo, S. L., "Analytical Thermodynamics," Englewood trinsic fundamental-gap exciton in semicon-
Cliffs, N.J., Prentice-Hall, Inc., 1962. ductors is a hydrogenically bound hole-electron
Mayer, J. E. and Mayer, M. C., "Statistical Mechanics," pair, the hole being derived from the upper
New York, John Wiley & Sons, 1977, Chapter 6. valence band and the electron from the lowest
conduction band. It is a normal mode of the
Cross-references: BOSE-EINSTEIN STATISTICS, EN- crystal created by an optical excitation wave,
TROPY, FERMI-DIRAC STATISTICS, IRREVERSI- and its wave functions are analogous to those
BILITY, STATICS, THERMODYNAMICS, VECTOR of the Block wave states of free electrons and
ANALYSIS. holes. When most semiconductors are optically
excited at low temperatures, it is the intrinsic
excitons that are excited. The energies of the
ground and excited states of the exciton lie
EXCITONS below the band-gap energy of the semiconduc-
tor. Hence, the exciton structure must first be
Intrinsic Excitons The exciton is a quantum of determined in order to determine the band-gap
electronic excitation produced in a periodic energy. The exciton binding energy can be
structure such as an insulating or semiconduct- determined from spectral analysis of its hydro-
ing solid. This quantum of energy has motion genic ground and excited state transitions (this
and the motion is characterized by a wave vec- also gives central-cell corrections). Precise band-
tor. Frenkel l was the first to treat the theory gap energies can be determined by adding the
of optical absorption in a solid as a quantum exciton energy to the experimentally measured
process consisting of atomic excitations. The photon energy of the ground state transition.
excitation process implies that the excited elec- Both direct and indirect exciton formation
tron does not leave the cell from which it was occurs in semiconductors, depending on the
excited. In his attempt to gain insight into the band structure. The former is characterstic of
transformation of light into heat in solids he many of the II-VI and III-V compounds, and
was able to explain the transformation by first- the latter is characteristic of germanium and
order perturbation theory of a system of N silicon. For indirect optical transitions, momen-
atoms having one valence electron per atom tum is conserved by the emission or absorption
with the following properties: of phonons. The detailed nature of the band
409 ExCITONS

structure of semiconductors with degenerate or free excitons allowed the production of exci-
simple valence bands is elucidated by under- tonic molecules in appreciable numbers. In
standing the intrinsic-exciton structure of these examining the spatial extent of the excitonic
semiconductors. gas in a parabolic potential well, they treated
High Density Electron-Hole Pairs Exciton the exciton gas as an ideal fas. The equiparti-
complexes analogous to Hl , H2 + and H- were tion theorem <V) = a<r 2 ) = ~ KT predicts that
first described by Lampert. The exciton in real the radial extent of the gas is R::TI/2. They
crystals may be considered as a boson since it examined the spatial profile by slowly scanning
consists of an electron and a hole, each being a a sharply focused image of the crystal across a
fermion, bound together by Coulomb interac- narrow slit masking the Ge photodetector. In
tion. Two single excitons exhibit an attractive the temperature range 5-22 K, the TI/2 radial
covalent interaction which results in the forma- profile was reasonably well obeyed.
tion of an excitonic molecule if the electron Spatial Resonance Dispersion For those
and hole spins are anti parallel. The binding states in a crystal where the photon wave vector
energy of such a molecule has been calculated and the exciton wave vector are essentially
by Akimoto, et al. 4 They showed that exciton equal, the energy denominator for exciton
molecules should be stable for any electron- photon mixing is small and the mixing becomes
to-hole mass ratio. Therefore, at high excitation large. These states are not to be considered as
intensities and if the mean. time for the forma- pure photon states or pure exciton states but
tion of exciton molecules is less than the radia- rather mixed states. Such a mixed state has
tive lifetime of single excitons, then the exciton been called a polariton. When there is a disper-
molecule system is produced. sion of the dielectric constant, spatial dispersion
At high excitation intensities two states of has been invoked to explain certain optical
the excited system have been proposed: (1) the effects of crystals. It was originally thought
excitonic-molecule gas, (2) the electron-hole that it would introduce only small corrections
droplet. Calculations of the ground state ener- to such things as the index of refraction until
gies of the excitonic-molecule system and the Pekar6 demonstrated that, if there was more
electron-hole droplet system have been success- than one energy transport mechanism, as in the
ful in determining which system is active in the case of excitons, this was not true. Spatial dis-
particular material being studied. It was recog- persion addresses the possibility that two dif-
nized that the band structure of the material ferent kinds of waves of the same energy and
would have a major impact on the ground state same polarization can exist in a crystal differ-
of the system. ing only in wave vector. The one with an anom-
With high exciting intensities the density of alously large wave vector is an anomalous wave.
hole-electron pairs is increased and excitons are In the treatment of dispersion by exciton theory,
formed at a higher rate. At high concentrations Pekar showed that if the normal modes of the
the interaction among excitons becomes impor- system were allowed to depend on the wave
tant, and when a certain threshold density of vector, a much higher order equation for the
excitons is reached liquid droplets are formed. index of refraction would result. The new
These collective droplets consist of nonequilib- solutions occur whenever there is any curvature
rium electrons and holes and, when electron- of the ordinary exciton band in the region of
hole recombination occurs specific radiation is large exciton-photon coupling. These results ap-
emitted. A luminescence line appears at a slightly ply to the Lorentz model as well as to quantum-
lower energy than that of the free exciton in mechanical models whenever there is a depen-
both Si and Ge which has been identified as dence of frequency on wave number.
light from an electron-hole liquid. Bound Excitons The intrinsic exciton may
Motion of Excitons The motion of excitons bind to various impurities, defects, and com-
has been dramatically demonstrated by Gourley plexes and the subsequent decay from the bound
et al. 5 in stressed Si. The crystal was stressed state yields information concerning the center
along the (100) direction by a rounded (R R:: 5 to which it was bound. Bound exciton com-
cm) steel plunger. The indirect band edge is plexes are extrinsic properties of materials.
lowered in energy by strain, as a result the These complexes are observed as sharp-line
droplets, free excitons, and excitonic molecules optical transitions in both photoluminescence
are accelerated in a strain gradient toward the and absorption. The binding energy of the
point of maximum shear stress. The image of exciton to the impurity or defect is generally
the photoluminescence at 15 K, produced by weak compared to the free exciton binding
laser excitation at the side of the crystal was energy. The resulting complex is molecular-
displayed by vidicon detection and is shown like (analogous to the hydrogen molecule or
in Fig. 1. The parabolic strain well provides molecule-ion) and has spectral properties which
both spatial confinement and reduced electron- are analogous to those of simple diatomic mole-
hole liquid binding energy. This experimental cules. The emission or absorption energies of
arrangement allowed them to achieve large these bound exciton transitions are always
concentrations of free excitons with low-level below those of the corresponding free exciton
steady state excitation. The high density of transitions, due to the molecular binding energy.
EXCITONS 410

FIG. 1. Exciton luminescence emanating from a 1.5 X 1.5 X 4 mm crystal of ultrapure Si at


T "" 10 K. The top crystal surface, accurately flattened and polished, is pressed with a spherical
contacting surface of a steel plunger cut from a ball bearing with radius R = 3.8 cm. Excitons
produced at the left crystal surface by a CW Ar laser are drawn into a parabolic potential well
corresponding to a shear strain maximum beneath the plunger (Gourley et al., Ref. 5).

Bound excitons were first reported by Haynes 7 were added to silicon. A modified linear rela-
in the indirect semiconductor silicon. He found tionship has been found for donors and ac-
that when group V elements were added to sili- ceptors in compound semiconductors.
con sharp photoluminescent lines were produced The sharp spectral lines of bound exciton
and these lines were displaced in energy in a complexes can be very intense (large oscillator
regular way. The binding energies of exciton strength). The line intensities will, in general,
complexes produced by adding different group depend on the concentrations of impurities
V donors were described by the linear relation: and/or defects present in the sample.
The theory of "impurity" or defect absorp-
E = a.IE j tion intensities in semiconductors has been
studied by Rashba. 8 He found that, if the ab-
where E is the binding energy of the exciton sorption transition occurs at k = a and if the
and E j is the ionization energy of the donor. discrete level associated with the impurity
The small differences in ionization energies for approaches the conduction band, the intensity
different effective mass chemical donors result of the absorption line increases. The explana-
from central cell corrections. A similar relation- tion offered for this intensity behavior is that
ship was found when the group III acceptors the optical excitation is not localized in the
411 EXPANSION, THERMAL

impurity but encompasses a number of neigh- tion and its Lo-phonon sideband is less than
boring lattice points of the host crystal. Hence, the Lo-phonon energy hwo by as much as 30%
in the absorption process, light is absorbed by in some materials. The quasiparticle consists of
the entire region of the crystal consisting of the the exciton-phonon bound state, having im-
impurity and its surroundings. purity modes associated with dielectric effects
In an attack on the particular problem of of the center rather than local modes associated
excitons which are weakly bound to localized with the mass defects of the substituents.
"impurities," Rashba and Gurgenishvili 9 derived Resonant Raman scattering occurs with exci-
the following relation between the oscillator tons as intermediate states, spin-flip scattering
strength of the bound exciton F d and the oscil- from bound excitons is observed and the pro-
lator strength of the intrinsic excitons fex, duction of excitons by two photon processes
using the effective-mass approximation provides added information about a specific
material. Exciton mechanisms are active in
Fd = (Eo/IE i)3/2fex , lasing transitions in many materials.
The study of free and bound excitons has
where Eo = (2fi 2 Im)(rr/n o )2/3. E is the bind- contributed appreciably to the understanding
ing energy of the exciton to the impurity, m is of the basic intrinsic and extrinsic properties of
the effective mass of the intrinsic exciton and many materials.
no is the volume of the unit cell.
It has been shown in some materials that Fd DONALD C. REYNOLDS
exceeds fex by more than four orders of mag-
nitude. An inspection of the above relation References
reveals that, as the intrinsic exciton becomes
more tightly bound to the associated center, 1. Frenkel, J.,Phys. Rev. 37,17 (1931).
the oscillator strength, and hence the intensity 2. Wannier, G. H.,Phys. Rev. 52, 191 (1937).
of the exciton complex line, should decrease 3. Lampert, M. A., Phys. Rev. Lett. 1,450 (1958).
as (1 II E 1)3/2. 4. Akimoto, 0., and Hanamura, E., J. Phys. Soc.
In magnetic fields bound excitons have unique Japan 33, 1537 (1972).
Zeeman spectral characteristics, from which it 5. Gourley, P. L., and Wolfe, J. P., Phys. Rev. Lett.
is possible to identify the types of centers to 40,526 (1978);Phys. Rev. 20B, 3319 (1979).
which the free excitons are bound. Bound exci- 6. Pekar, S. I., Sov. Phys.-JETP 6, 785 (1958) [En-
ton spectroscopy is a very powerful analytical glish transl: Sov. Phys.-Solid State Phys. 4, 953
tool for the study and identification of im- (1962) ].
purities and defects in semiconductor materials. 7. Haynes, J. R.,Phys. Rev. Lett. 4,361 (1960).
It has been employed rather extensively over 8. Rashba, E. I., Opt. Spektrosk. 2,508 (1957).
the last few years for the characterization of 9. Rashba, E. I., and Gurgenishvili, G. E., Fiz. Tverd.
materials. Tela 4,1029 (1962) [Eng!. transl: Sov. Phys.-Solid
Multibound Excitons Sharp photolumines- State 4, 759 (1962)].
cent lines have been observed at energies less
than the energy of the line associated with an The following books, and references therein, present a
exciton bound to a neutral donor in silicon, broad coverage of the field of excitons:
germanium and silicon carbide. Similar lines
have also been observed which are associated Dexter, D. L., and Knox, R. S., "Excitons," New York,
with acceptors in silicon and gallium arsenide. Interscience, 1965.
The energies and widths of these lines were Knox, R. S., "Theory of Excitons," Suppl. 5 of Solid
such that they could not be explained in terms State Physics, New York, Academic Press, 1963.
of any recombination mechanism involving just Davydov, A. S., "The Theory of Molecular Excitons,"
a single exciton bound to a neutral shallow im- New York, McGraw-Hill Book Company, Inc., 1962.
purity center. A model involving a multiexciton Reynolds, D. C., and Collins, T. C., "Excitons, Their
complex bound to a donor (acceptor) was in- Properties and Uses," New York, Academic Press,
voked in which each line was associated with 1981.
radiative recombinations of an exciton in the
bound multiexciton complex. The behavior of Cross-references: PHONONS, PHOTON, SOLID·
these lines in the presence of magnetic and STATE PHYSICS, SOLID·STATE THEORY.
stress fields helped to establish the viability of
the bound multiexciton complex model.
Interaction of Excitons with Other Systems EXPANSION, THERMAL
The interaction of excitons with phonons to
form Stokes and anti-Stokes transitions are Definition and General Remarks All substances
commonly observed in crystals. Interaction change their shapes as a consequence of under-
with the Lo phonon is strong in most crys- going changes in temperature. A measure of this
tals. The exciton-bound-phonon quasiparticle change is the thermal coefficient of expansion.
is one form of interaction that has been observed In most cases the result is an increase in the
in several materials. In the case of the quasi- length, area, or volume of a sample. The effect
particle the energy separating the parent transi- is by no means negligible but for moderate
EXPANSION, THERMAL 412

changes in the temperature a first-order correc- heat at constant volume (which is most often
tion suffices. The prevention of thermal expan- obtained from theory). This relationshi~, based
sion requires very large mechanical stresses. For solely on the laws of thermodynamics is
example, a compressive stress of order 5 X 108
dynes/cm 2 is necessary to prevent a steel bar Cp - Cv = (~2 VT)/X (3)
from expanding when the temperature is in-
creased by 20°C. For solids in the form of a where T is the absolute temperature and X the
thin rod or cable, this change is confined (to compressibility defined by
first order) to a change in length, and a linear
coefficient of expansion is thus defined by X= _1.V (aapV)T (4)

ao = L(~~)p (1) To obtain this resuilt we write the first law of


thermodynamics (see THERMODYNAMICS):
where Lo is the length of the specimen at O°C
and the subscript P implies that the pressure is 8Q = dU+PdV (5)
kept constant. Correspondingly, for fluids and Since
for solids of arbitrary shape, one defines a cubi-
calor volume coefficient of expansion, ~o, by
the relation Cv = eQ)
8T v = (au)
aT v (6)

1 (av)
,Tt 'P (2)
av TdV + (au)
~o dU = (au) aT v dT
= Vo
(7)
with Vo being the volume at the reference tem-
perature (usually chosen to be OoC). It may and
readily be shown that f30 = 3ao. Thus for solids
ap TdP+(av)
one usually tabulates values of ao while, of dV=(av)
course, only ~ has meaning for fluids. Whereas aT pdT (8)
~o is simply 1/273 for all ideal gases (this fol-
lows from the equation PV
= nR T), there exists we obtain
a wide variation for f3 values among liquids.
8Q = [(~~)T + p]( ~;)T dP
Table 1 gives a and ~ values for several sub-
stances. The negative value of ~ for water below
4°C is anomalous and is caused by the compara-
tively open lattice structure of ice. In the case
of nonisotropic crystals, the coefficient of lin-
ear expansion differs for different directions in
+{cv + [(~~)T + p](~i)p}dT (9)
the crystals and may even have opposite signs
along different directions as is the case for Thus
Thermodynamic
CaC0
3 Relationships The cubical C 8Q C av P au (10)
·
coefficient of expansion plays an important p= (8T)p= V + [ (aT)p] [ + ( av)T]
role in relating the molar specific heat at con-
stant pressure, Cp, (which is usually measured By the second law of thermodynamics, we have
directly in the laboratory) to the molar specific
dS =dU P
- +-dV
T T
(11)
TABLE 1. LINEAR COEFFICIENTS OF
EXPANSION, a, FOR SOME SOLIDS AND On using Eq. (7) again this becomes
CUBICAL COEFFICIENTS OF EXPANSION,~,
FOR SOME LIQUIDS AT ROOM
TEMPERATURE dS = .!.rl(au)
TL av T + p] dV +.!.T(au)
aT VdT (12)

Since
Aluminum
Copper
Nickel
25.0
16.8
12.8
X 10-6
X 10-6
X 10-6
(as) (as)
dS = av T dV + aT V dT (13)
Sodium 77.0 X 10-6
Mercury 18.2 X 10-5
one obtains on comparing Eqs. (12) and (13)
Glycerin 48.5 X 10-5
Water (0-4°C) -3.2 X 10-5
(:~)T = ~ [p + (~~)T] (14)
413 EXPANSION, THERMAL

as) 1(au)
(aT (15)
N de = _ 3NkT d(log v) (23)
v = T aT v dV dV

Since To obtain the thermal expansion coefficient,


one expands de/dV:
a (as)
aT a (as)
av T = av aT v (16)
de = (de) + (V- Vo ) (d 2 e) (24)
dV dV v=vo dV2 v=vo
we get from Eqs. (14) and (15)

au (au) ] But the compressibility is given by

L
1 2 I [
"TavaT=-T2 p+ aVT
= NVo (:~:)v=vo (25)

1
+"T
[(ap) au]
aT v + aTav
2
(17) and thus
or V - Vo = _ 3NkTX d(log v) (26)

(au)
av
=T(ap) _p
T aT v
(18)
Vo 0 dV
Differentiating with respect to T and realizing
that 3NkT is the thermal energy, we obtain:
Inserting Eq. (18) into Eq. (10), yields

Cp - Cv = T(av) (ap\
aT p aiJv (19)
aVo
--=-
CvXo
d(log v)
d(log V)
(27)

Now More exact crystal models yield values for

(20) d(log Vi)


1; = - d(log V)
Thus on replacing (ap/aT)v, the final form of where Vi is the ith frequency of a set of normal
Eq. (19) becomes modes of vibration. 3 These and other refine-
ments give rise to modifications of the simple
T avV V2 Griineisen theory.
Cp - Cv = - (a aiJp
V) (21 ) Source of Thermal Expansion The dynam-

ap-
T
V2
ical basis for thermal expansion is the presence
of an anharmonic component for the interac-
tion potential. A qualitative argument for this
or Eq. (3) when the definition of compressibil- is based on the property of typical potential
ity [Eq. (4)] is employed. energy of atoms in a lattice. 4 Figure I repre-
Thus for a substance where £j and X are exper- sents such a curve and it will be observed that
imentally known Cv may be established from a the curve is not symmetrical about '0, the equi-
measurement of Cpo librium distance between atoms. As the internal
Griineisen Relation 2 Griineisen introduced energy E increases with an increase in tempera-
the parameter 1 = £jV/XCv and on the basis of ture, the average value of , shifts to larger val-
simple models reached the conclusion that 1 is ues. Clearly if the potential curve were per-
independent of temperature. This implies that fectly parabolic about '0, we would not have
the thermal expansion coefficient is propor- thermal expansion. This argument can be made
tional to the specific heat and has the same qualitative as follows. s Taking x as the displace-
type of temperature dependence. This is true ment of a lattice atom from its eqUilibrium
for many substances and has, in fact, been em- neighbor separation, the potential energy has
ployed as a means for predicting values for Cv the form
at low temperatures. To illustrate the physical (28)
basis of the Griineisen relation , we will work V(x) = cx 2 - gx 3
with a crystal model of N oscillators of identi-

1:
Then X, the average displacement using the
cal frequency, v, and each having equilibrium Boltzmann distribution function becomes
energy, e. In the region of hv <l!; kT, the free
energy is given by:

1:
xe-V(x)/kT dx
F = N (e+ 3kT log :;) (22) x= (29)

Since this must be a minimum at equilibrium, e-V(x)/kT dx


we obtain:
EXPANSION, THERMAL 414

V(l")

+
o

I
l"_I,
o I I
I
I

1"1 ·1

FIG. 1. Potential energy curve for atoms in crystalline solid as a function of interatomic separation.

Assuming that the anharmonic term is much constant, the Clausius-Mosotti formula is
less than the harmonic contribution, we expand
V(x) to yield £-I=AN (33)
£+2
Differentiating with respect to T, the tempera-
ture, and making use of the assumption that
£ > I, one finds that
and

On integrating between T and 8, the Curie tem-


Both integrals are readily evaluated to give perature, one obtains

= 3kTg £ = 3/{3 (35)


x=-- (32) T-8
4c 2
or a constant temperature coefficient for ther- which is a typical Weiss law and indicates the
mal expansion. This simple derivation may be Curie constant is of the order of the reciprocal
amplified to include specific interaction forces of the thermal expansion. The special electronic
for the atoms of a lattice. 6 structure of ferromagnetic materials also gives
Thermal Expansion and Curie Temperature rise to anomalous thermal expansion coeffi-
The Curie-Weiss law for both ferroelectrics and cients in the transition region. For some mate-
ferro magnets may be shown to be connected rials {3 values are depressed and for others {3
with the thermal expansion coefficient. 7 Con- values increase more rapidly with temperature.
sider a region where the dielectric constant, C, Both magnetostriction and the variation of the
is large. If we let N be the cell density and A a energy of magnetization with the atomic size
415 EXPANSION, THERMAL

account for the anomalous behavior of different 4. Halliday, D., and Resnick, R., "Physics," Third edi-
substances. tion Part I, New York, John Wiley & Sons, 1977
(p.506).
JACOB NEUBERGER 5. Kittel, C., "Introduction to Solid State Physics,"
New York, John Wiley & Sons, 1976.
References 6. Peieris, R. E., "Quantum Theory of Solids," New
York, Oxford Press, 1955. (Reprinted in 1964.)
1. Any text in thermodynamics such as:
Zemansky, M. W., "Heat and Thermodynamics," 7. Dekker, A., "Solid State Physics," Englewood
New York, McGraw-Hill Book Company, Inc., Cliffs, N.J., Prentice Hall, 1957; Sinnott, M., "The
1951. Solid State for Engineers," New York, John Wiley
2. Mott, N. F., and Jones, H., "The Theory of the & Sons, 1958.
Properties of Metals and Alloys," New York, Dover
Publications, 1936. Cross-references: BOLTZMANN'S DISTRIBUTION
3. Arenstein, M., Hatcher, R. D., and Neuberger, J., LAW; COMPRESSIBILITY, GAS; DIELECTRIC
"Equation of State of Certain Ideal Lattices," THEORY; HEAT; HEAT CAPACITY; MAGNETISM;
Phys. Rev., 13, No.5, 2087-2093 (1963). THERMODYNAMICS.
F
FALLOUT through the tropopause, primarily in certain
regions where mixing between the two layers is
The term fallout generally has been used to more probable.
refer to particulate matter that is thrown into The exact characteristics of the radiation
the atmosphere by a nuclear process of short associated with fallout depend on the nature of
time duration. Primary examples are nuclear the nuclear processes from which its radioactivity
weapon debris and effluents from a nuclear originates. Generally these radioactive nuclides
reactor excursion. The name fallout is applied are fission products formed from the fissioning
both to matter that is aloft and to matter that of uranium or plutonium, but, under appro-
has been deposited on the surface of the earth. priate circumstances, considerable quantities of
Depending on the conditions of formation this radioactivity can be formed through nuclear
material ranges in texture from an aerosbl to reactions induced by neutrons that are pro-
granules of considerable size. duced by the weapon or reactor. The radiation
The topographic distribution of fallout is problems associated with local fallout are us-
divided into three categories called local (or ually those of high-intensity gamma-ray radia-
close-in), tropospheric (or intermediate), and tion fields resulting from the relatively large
stratospheric (or world wide) fallout. No dis- quantities of radioactive material that fall back
tinct boundaries exist between these categories. to earth within a few tens of miles from the
The distinction between local and tropospheric point of origin. The important radioactive
fallout is a function of distance from source to materials consist in this case of short-lived fis-
point of deposit, while the primary distinction sion products and neutron-induced radioactive
between tropospheric and stratospheric fallout nuclides. The hazards of worldwide fallout come
is the place of injection of the debris into the more from the problems of the long-lived
atmosphere, above or below the tropopause. radionuclides, such as 13 4 Cs, 137CS, and 90Sr,
Whether radioactive debris from a nuclear that can enter the human food chain and
weapon becomes tropospheric or stratospheric ultimately be absorbed by the body.
fallout depends on yield, height, and latitude of For a nuclear weapon burst in air all materials
burst (the height of the tropopause is a function in the fireball are vaporized. Condensation of
of latitude). fission products and other bomb materials is
Because air acts as a viscous medium, a drag then governed by the saturation vapor pres-
force is developed to oppose the gravitational sures of the most abundant constituents. Pri-
force that acts on airborne particulate matter. mary debris can combine with naturally ocur-
This makes the velocity of fall dependent on ring aerosols, and almost all of the fallout
particle size. The larger particles (diameters becomes tropospheric or stratospheric. If the
greater than about 20J,Lm) have a higher rate of weapon detonation takes place within a few
settling and create local fallout. Smaller parti- hundred feet of (either above or below) a land
cles injected below the tropopause are carried or water surface, large quantities of surface
by prevailing winds over large regions of the materials are drawn up or thrown into the air
surface of the earth and create the tropospheric above the place of detonation. Condensation
fallout. Tropospheric fallout particles larger of radioactive nuclides in this material then
than about O.I-J,Lm diameter continually mix leads to considerable quantities of local fallout,
through the circulating air mass that is in con- but some of the radioactivity still goes into
tact with the surface of the earth and gradually tropospheric and stratospheric fallout. If the
settle to the ground, or are washed down by burst occurs sufficiently far underground the
rain or snow. Many smaller particles form surface is not broken and no fallout results:
nuclei for raindrops. Parts of the tropospheric The considerable significance of radioactive
fallout many remain in the atmosphere for a fallout became apparent on July 24, 1946, when
month or more, long enough to circle the earth a nuclear explosive of about the same size as the
several times. The mean residence time above Hiroshima and Nagasaki weapons was detonated
the tropopause of stratospheric fallout is from 30 meters below the surface of the lagoon at
5 to 30 months, during which time it com- Bikini atoll. This explosion produced a column
pletely encircles the earth. It gradually returns one-half mile in diameter consisting of about

416
417 FALLOUT

FIG. 1. Nuclear explosion at Bikini Atoll, July 24, 1946.

a million tons of water, as well as a base surge, Japanese fishing vessel Lucky Dragon. The
as illustrated in Fig. 1. Instead of the radio- people of these atolls have continued since that
active debris being lifted by the explosion into time to receive medical examinations on a regular
the troposphere and stratosphere, as had been basis. From the information obtained a reason-
the situation for prior nuclear detonations, ably thorough knowledge has accrued on the
almost all the radioactivity produced by this effects of fallout radiation on the health of one
underwater explosion was trapped in the water, small group of individuals.
spread out in a radioactive cloud, and fell as The aerodynamic principles governing deposi-
rain. The radioactive material was strongly tion of fallout are the same regardless of the
absorbed by the painted surfaces of a group origin of the material which is thrown into the
of ships which had been assembled as a target air. Therefore the deposition of any airborne
fleet. It also settled into joints, ventilating sys- matter should follow the same pattern as equiv-
tems, almost everywhere. It clung so tenaciously alent material produced by nuclear explosives.
that the paint sometimes had to be removed to Two reasonably contemporary but very dif-
reduce significantly the amount of radioactivity. ferent nonradioactive phenomena which can be
The results were completely unexpected, since used for such comparisons have been the depo-
most planners had predicted a stronger shock sition of ash from the eruptions of Mount St.
wave to hit the ships than had been experienced Helens and the deposition of acid rain in the
from the earlier air detonation, but they greatly northeastern United States and eastern Canada.
underestimated the amount of radioactive The eruption on May 18, 1980 of Mount St.
contamination that would occur. As a result of Helens produced huge quantities of observable
this test a large research effort was begun to try airborne particulate matter. This eruption has
to understand the nature of fallout from nuclear been determined to have released an energy
detonations and the procedures needed to be equivalent to the detonation of about 35 MT
followed to decontaminate the regions on of TNT, comparable to the largest nuclear
which the distributed radioactive material was explosions ever detonated. The basic difference
deposited. between these two types of explosions is that
Some type of radioactive fallout has been the Mount St. Helens eruption was strictly
associated with every near-surface detonation mechanical, carrying with it several cubic kilo-
of a nuclear explosive. On one occasion the meters of debris, considerably more than would
radiation from fallout debris endangered human be thrown into the air by even a very large
health. This was the detonation on March 1, nuclear explosion. However, the size distribu-
1954 at Bikini atoll, of a thermonuclear device tion of the individual ash particles thrown into
with a yield equal to that of about 17 mega- the air by the eruption was similar to particle-
tons (MT) of trinitrotoluene (TNT). The yield size distributions of airborne material from
was greater than expected and an unexpected surface detonations of high-yield nuclear explo-
shift in wind direction caused significant quan- sives. Also, the pattern of deposition of the ash
tities of radioactive fallout to be deposited was very similar to the expected deposition
over a region extending several hundred miles patterns for fallout from the surface detonation
to the east, over the inhabited atolls of Ailing- of an 8-MT nuclear weapon in the vicinity of
nae, Rongelap, Rongerik and Utirik and on the Portland, Oregon, as predicted in 1959 during
FALLOUT 418

hearings held by the Joint Committee on Atomic Danielson, E. F., "Trajectories of the Mount St. Helens
Energy of the Congress of the United States. Eruption Plume," Science, 211, 819-820 (1981).
The exact pattern of deposition of fallout Storeb¢>, P. B., "Prediction of Massive Wash-out of
depends on weather conditions. The eruption Nuclear Bomb Debris," Health Physics 11, 1203-
of May 18 happened to occur during a period 1211 (1965).
when weather conditions were similar to the Babich, H., Davis, B. L., and Statzky, G. "Acid Pre-
conditions assumed for the 1959 study. On the cipitation: Causes and Consequences," Environment
other hand, the eruption of May 25 occurred 22(4),6-13 (1980).
during a rainy period. Not as many ash particles
were thrown aloft during the. May 25 eruption Cross-references: ATOMIC ENERGY, FISSION, FU-
as during the May 18 eruption but the number SION, ISOTOPES, NUCLEAR REACTIONS, RADIO-
was still more than usually expected from a ACTIVITY, VOLCANOLOGY.
surface detonation of a nuclear weapon. On
May 25 rain mixed with the ash to form mud,
which fell locally and not uniformly. However
the fallout pattern of this ash was similar to F ARADA Y EFFECTS*
that predicted by Storeb<,h for "rainout" of debris In 1845, Michael Faraday discovered the first
from nuclear explosives, again making the event magnetooptical effect 1 when he observed the
worthy of study as a possible predictor of ef- rotation of the plane of polarization of light as
fects which could follow the detonation of a a result of its passage through lead borate glass
nuclear weapon. (See VOLCANOLOGY.) in a direction parallel to an applied magnetic
The series of events which result in acid rain field B. This is known as the Faraday effect or
are at the other extreme of the fallout picture Faraday rotation and was important historically
for, instead of consisting of particulate matter because it provided the first concrete evidence
of finite size, much of the material released for a connection between magnetism and light.
from chimneys, smokestacks and motor-vehicle Since then many other magnetooptical effects
exhausts is composed of individual molecules have been investigated-notably cyclotron reso-
or groups of molecules of the oxides of sulfur nance, magnetic dichroism, and the Voigt and
and nitrogen. In the atmosphere these sub- Hall effects-their common linkage being their
stances go through chemical reactions to form dependence on various components of the di-
sulfates and nitrates, which are acidic and which electric or conductivity tensor, as well as on the
dissolve in water and fall as precipitation at magnetic permeability tensor in the case of
distances of 100 to 1000 kilometers downwind magnetic materials. Faraday rotation has now
from their original source. The mechanical been shown to be a general property of matter
result is a type of fallout not unlike that from and has been observed in a variety of solids
an air-burst of a nuclear weapon, during which (especially semiconductors), liquids, and gases,
essentially all matter is vaporized. Although the over a wide range of frequencies. 2 It is often
particulate matter cannot be physically ob- a very useful technique 3 ,4 for the determina-
served, as in the case of the fall of ash from tion of various quantities such as effective mass
Mount St. Helens, the chemical effects of the rn, collision frequency v, and mobility Ji. In
acid rain can be observed, for example, in the fact, not only does it complement cyclotron
increased acidity of the lakes of the Adirondack resonance determinations of rn but it is espe-
Mountains, where the water of some lakes has cially useful in determining electron and hole
become too acidic to support fish life. Similar effective masses in solids in cases where cyclo-
nuclear-weapon fallout would be observed tron resonance is unobservable. s The latter
simply as an increase in radioactivity in the
water.
circumstance occurs when v» w, where W is
the angular frequency of the radiation, and
C. SHARP COOK corresponds to large damping (since T =V-I,
where T is the damping or relaxation time).
Also, since cyclotron resonance occurs when
W = we, where We = (eB/mc) is the cyclotron
References frequency with e being the magnitude of the
Brunner, H., and Pretre, S. (Eds.), "Radiological Pro- charge, it is clear that for a typical m value of
tection of the Public in a Nuclear Mass Disaster," 10- 1 rno (where rno is the free electron mass)
Proceedings of symposium at Interlaken, Switzer- and a maximum B value of 100 kG we have
land, 26 May-l June, 1968, Bern, Bundesamt fUr resonance at W = 1.77 X 10 13 S-I, correspond-
Zivilschutz, 1968. ing to a wavelength A = 326 Jim, i.e., for practi-
Cook, C. S., "Initial and Residual Ionizing Radiations cal purposes sharp cyclotron resonances typi-
from Nuclear Weapons," in Attix and Tochilin (Eds.), cally do not occur for wavelengths smaller than
"Radiation Dosimetry," Vol. III, New York, Aca- infrared, and in many cases actually no smaller
demic Press, 1969, pp. 361-399.
Freiling, E. C. (Ed.), "Radionuclides in the Environ· *Researeh for this article was partially supported by
ment," Washington, D.C., American Chemical So- the Department of Energy, Division of Materials
ciety, 1970. Science, under Contract No. DE-AS05·79ERI0459.
419 FARADA Y EFFECTS

than microwave. In other words, Faraday rota- refers to either the natural frequency of the
tion may be measured over a far wider range of bound charges (classical model) or the frequency
frequencies and in a far wider range of materials separation of spectral lines (quantum mechani-
than cyclotron resonance. However, it has the cal model). In a solid there are contributions to
disadvantage of measuring an average effective () from both free and bound electrons and also
mass in cases where the effective mass is aniso- the nuclei but, as a general rule, at optical and
tropic. It is also used to deduce interstellar infrared frequencies, the dominant contribu-
magnetic field values as well as providing sup- tion to () is from the free electrons. For low
port for the conclusion that there is little or no photon frequencies only transitions within the
antimatter in the galaxy. We use cgs units in same band (intraband) are of importance in a
this article . semiconductor but as we approach the optical
A linearly polarized wave can be decomposed region the band to band (interband) effects
into two waves of opposite circular polarization, must be included. A classical calculation of cij
called right and left (RCP and LCP), which is based on the Boltzmann equation or, more
propagate independently. In general, optical frequently although less rigorously, on the Drude
rotation occurs in a medium when its refractive model, which assumes that all charge carriers
indices for right- and left-circularly polarized act independently. The Drude model result is
radiation, n+ and n_, are unequal so that the often referred to as the cold-plasma limit since
phase velocities c/n+ and c/n_ are also unequal. it corresponds to the result obtained by use of
In the case of natural optical activity, this arises the Boltzmann equation in the limit of extreme
from asymmetry among the atomic layers degeneracy (zero temperature).
whereas Faraday rotation arises from the aniso- Consider now the case of electrons moving
tropy produced by the magnetic field. The freely (wo = 0) in a crystal-lattice background-
former (which will not concern us here) disap- while being cognizant of the fact that, except
pears on reflection back through a sample for a change in the sign of (), similar results hold
whereas the latter is doubled. In the case of an for positive carriers such as, for example, holes
absorbing medium, the difference in absorption in the valence band of semiconductors. Then
coefficients for the two components (referred the Drude model leads t0 3
to as dichroism) causes the emerging beam
to be elliptically polarized . c±=c/-[Wp 2 /W(w±We +iV)), (4)
The Faraday rotation () is defined to be one-
half the phase angle change between the RCP where cl = n 2 is the (real) dielectric constant of
and LCP waves and corresponds to the amount the lattice. It follows that, if W is much larger
of rotation of the major axis of the transmitted than the other three frequencies, the Drude
polariZation ellipse. 3 For radiation of frequency model leads to
W propagating a distance d through the medium
along the direction of the magnetic field, the (5)
rotation is given by
wd rrd Thus () is proportional to the magnetic field B
() = - (n+ - n_) = ~ (n+ - n_), (1) and also to m- 2 , making clear how a measure-
2c 1\ ment of () can determine the effective mass m.
This formula is also used to determine galactic
where A is the wavelength in vacuum. A theo- magnetic fields from observations, over a range
retical evaluation of n+ in the case of nonmag- of frequencies, of the Faraday rotation associ-
netic materials (magnetic materials will be ated with polarized radio waves from such
discussed below) starts with the relation objects as pulsars.6 Estimates of primordial
magnetic field values have also been made from
(2) observations of the () of a distant extragalactic
radio source. 7 Furthermore, a measurement of
where k, the imaginary part of the complex re- the Faraday rotation of radio waves emitted by
fractive index, arises from absorption and c± artificial satellites and transmitted through the
is related to the components of the dielectric ionosphere can be used to measure the electron
tensor Cij (i, j =x, y, z) by density along the path. 8 Turning to the ques-
(3) tion of how much antimatter there is in the
galaxy, we note that positrons and electrons
Thus the problem of calculating () is reduced to cause rotations in opposite directions. But po-
a calculation of cij. In general, rotation arises larized light traversing the interstellar medium
due to interaction of the radiation with either does suffer Faraday rotation, demonstrating
free or bound charge carriers. There are five that there are not comparable numbers of elec-
basic frequencies to be considered: the wave trons and positrons. In fact, when these results
frequency w, the plasma frequency wp = (which in essence give the difference in the
(4rrNe 2 /m)1/2, where N is the number of charge number of electrons and positrons) are com-
carries per unit volume, the collision frequency bined with dispersion measures (which depend
v, the cyclotron frequency We, and wo , which on the sum of the number of electrons and
FARADA Y EFFECTS 420

positrons), it is found that the number of Faraday effect is intensity dt;pendent, especially
positrons is negligibly small. 9 in a strong magnetic field. 1 A closely related
In the case where We is much greater than the phenomenon is the inverse Faraday effect, i.e.,
other three frequencies (high-field, low-fre- the magnetization of the medium by intense
quency approximation), polarized radiation,15 which has been suggested
as the basis of a nondemolition optical quantum
(6) counting measurement. 16
Faraday rotation has also been used as a diag-
i.e., () is now negative, and it is proportional to nostic tool to study and measure the large mag-
B- 1 and independent of v again and also w. netic fields which are produced both in con-
Also if wp «w then a zero in () occurs lO at trolled thermonuclear fusion plasma and in
a photon frequency W =(we 2 + v2 )1/2, which laser-produced plasma,17 as well as being one
can lead to a determination of v. If v» w, of the first phenomena to be studied by the
We »(wp/n) then we get the low-frequency megagauss magnetic fields which are being
Faraday rotation increasingly produced in many laboratories. IS
All of the effects discussed so far depend on
()=-dw e w p 2j2cnv 2 the dielectric tensor and arise from the inter-
(7) action of the charge carriers with the electric
= -21TduoJ.lBjc2n, field of the electromagnetic wave and, in addi-
tion, the spatial dependence of the electric
where J.l = erjm is the carrier mobility and Uo = field is generally neglected (electric dipole
ne 2 r/m is the static conductivity. Since the approximation). However, for magnetic mate-
latter expression for () does not contain m or rials the Faraday rotation depends on the
v explicitly, one can use known values of Uo and magnetic permeability tensor. In ferromagnetic
the other parameters to deduce the mobility of metals very large rotations occur which are
charge carriers from a measurement of (). proportional to the net magnetization and not
In strong magnetic fields, account must be to the external magnetic field. 19 ,20 Thus the
taken of the fact that electron energies in a rotation per cm in a magnetic field of 10 4 gauss
magnetic field are confined to discrete Landau is of the order of 2 degrees in quartz, 10 2 de-
levels and, in the case of interband transitions, grees in aluminum and 1.3 X 10 5 degrees in
this gives rise to oscillatory effects in the Fara- iron. The large value of the latter rotation arises
day rotation, while there is evidence for a con- from a spin-orbit interaction: the magnetic
tribution also from exciton transitions. In gen- moment of an electron, due to its spin, inter-
eral, there are other complications3 which are acts with the magnetic field which arises by
sometimes of importance. For example, the virtue of its motion through the electric field
collision frequency v can be frequency- and created by the nuclei and all the other electrons
magnetic-field dependent. Also, in polar semi- in the absence of radiation. This phenomenon is
conductors there is a contribution from optical often called the ferromagnetic Faraday effect
lattice vibrations to the dielectric tensor. and refers to the transmitted beam, whereas
In the case of thin samples, multiple reflec- effects associated with reflection from a ferro-
tions can play an important role with an atten- magnetic material are called polar Kerr mag-
dant increase in the complexity of the analy- netooptic effects. In a certain sense the spin-
sis.u , 12 An example of where such multiple orbit interaction can be looked on as the effect
reflections playa role 13 is the two-dimensional of a large internal magnetic field acting on the
electron space-charge layer (the motion being electrons. Also, the role of the external magnetic
quantized in one direction whose effective field is peripheral in that it serves only to mag-
width is negligible compared to the wavelength netize the sample in a certain direction. It should
of the transmitted radiation), which is formed also be mentioned that there is also a contribu-
in various modern microelectronics systems, tion from spin-orbit effects in nonferromagnetic
such as at the semiconductor surface in a metal- material, albeit small compared to the situation
oxide-semiconductor (MOS) system. There is for ferromagnetics.
also a contribution to () due to boundary ef- Absorption in the metallic ferromagnets is
fects in the transmission of radiation through very large except in the case of very thin films.
different material. In the case of metals, the On the other hand, ferrimagnetic substances are
usual method of observing () in transmission is particularly good magnetooptical materials be-
not convenient except for very thin films be- cause they combine the low absorption of a
cause metals are very good reflectors in the good insulator with high permeability. For
visible and infrared regions. As an alternative, example, yttrium iron garnet (YIG) is trans-
() is measured on reflection (the polar reflection parent in the optical region and also gives rise
Faraday effect) and gives information on the to a large Faraday rotation which makes it an
electron band structure of nonferromagnetic excellent material for the observation of mag-
metals. netic domains. In fact, measurements of () can
The advent of high-intensity laser radiation be used to measure several macroscopic magnetic
has motivated the inclusion of various nonlinear properties of thin rare earth garnet films which
terms into the laws of qptics. In particular, the are used for magnetic bubble devices. 21 Ferri-
421 FEEDBACK

magnetic materials are used extensively in 16. Braginskii, V. B., and Khalili, F. Ya., "Optico-
microwave technology, 2 2 their importance stem- Magnetic Effects in Nondestructive Quantum
ming from the fact that they can be used to Counting," Sov. Phys.-JETP 51, 859 (1980).
make Faraday isolators which permits a signal 17. Luhmann, N. C., Jr., "Instrumentation and Tech-
to be transmitted with low attenuation in one niques for Plasma Diagnostics: An Overview," and
direction but causes the reflected signal to be Vernon, D., "Submillimeter Interferometry of
highly attenuated. Thus, for example, this High-Density Plasmas," in Button (Ed.), "Infrared
permits the decoupling of an oscillator from a and Millimeter Waves," Vol. 2, Academic Press,
measuring system. Similar devices have also New York, 1979, pp. 1-135; Stamper, J. A.,
been used in laser systems.23 At higher fre- McLean, E. A., and Ripin, B. H., "Studies of
quencies it turns out that antiferromagnetic Spontaneous Magnetic Fields in Laser-Produced
materials perform better. Plasmas by Faraday Rotation," Phys. Rev. Lett.
40, 1177 (1978).
R. F. O'CONNELL 18. Fowler, C. M., Caird, R. S., Gam, W. B., Erickson,
D. J., and Freeman, B. L., "High Field Faraday
Rotation of Some Zn(VI) Compounds," Journal
References of Less-Common Metals 62, 397 (1978).
19. Argyres, P. N., "Theory of the Faraday and Kerr
1. Barr, E. S., "Men and Milestones in Optics V: Effects in Ferromagnetics," Phys. Rev. 97, 334
Michael Faraday," Appl. Optics 6, 631 (1967). (1955).
2. Palik, E. D., and Henvis, B. W., "A Bibliography 20. Bennett, H. S., and Stern, E. A., "Faraday Effect
of Magneto-Optics of Solids," Appl. Optics 6, 603 in Solids," Phys. Rev. 137, A448 (1965).
(1967). 21. Tanner, B. H., "Magneto-Optical Experiments on
3. Palik, E. D., and Furdyna, J. K., "Infrared and Rare Earth Garnet Films," Am. J. Phys. 48, 59
Microwave Magnetoplasma Effects in Semicon- (1980).
ductors," Rep. Prog. Phys. 33, 1193 (1970). 22. Button, K. J., and Hartwick, T. S., "Microwave
4. Piller, H., "Faraday Rotation," in Willardson and Devices," in Rado and Suhl (Eds.), "Magnetism,"
Beer (Eds.), "Semiconductors and Semimetals," Vol. I, Academic Press, New York, 1963, pp.
Vol. 8, Academic Press, New York, 1972, pp. 621-666.
103-179. 23. Wang, S., Shah, M., and Crow, J., "Studies of the
5. Lax, B., "Resonance Spectroscopy of Solids and Use of Gyrotropic and Anisotropic Materials for
Plasmas," J. Mag. and Mag. Materials 11, 1 (1979). Mode Conversion in Thin Film Optical Wave Guide
6. Manchester, R. N., and Taylor, J. H., "Pulsars," Application," J. Appl. Phys. 43, 1861 (1972).
W. H. Freeman and Co., San Francisco, 1977.
7. Shapiro, S. L., and Wasserman, I., "Massive Neu- Cross-references: HALL EFFECT AND RELATED
trinos, Helium Production, and the Primordial PHENOMENA, KERR EFFECT, LIGHT, MAGNE-
Magnetic Field," Nature 289,657 (1981). TISM, POLARIZED LIGHT, PROPAGATION OF
8. Ratcliffe, J. A., "An Introduction to the Iono- ELECTROMAGNETIC WAVES, SEMICONDUC-
sphere and Magnetosphere," Cambridge Univ. TORS.
Press, Cambridge, U.K., 1972, pp. 196-198.
9. G. Steigman, "Observational Tests of Antimatter
Cosmologies," Ann. Rev. Astron. Astrophys. 14,
339 (1976).
FEEDBACK
10. O'Connell, R. F., and Wallace, G. L., "Null Fara-
day Rotation-A Clean Method for Determination The concept of feedback lies at the heart of
of Relaxation Times and Effective Masses in MIS modern systems theory and control engineer-
and Other Systems," Solid State Commun. 38, ing. The term itself seems to have been used for
429 (1981). the first time in a technical sense in 1920, 1 and
11. Donovan, B., and Medcalf, T., "The Inclusion of refers to the return or feedback of system out-
Multiple Reflections in the Theory of the Faraday put signals to the inputs in order to improve
Effect in Semiconductors," Brit. J. Appl. Phys. or change the behavior of the system. A very
15, 1139 (1964). simple example is the control of a heating sys-
12. O'Connell, R. F., and Wallace, G. L., "Multiple tem by using a bimetallic strip (Fig. 1). The
Reflections in the Theory of the Faraday Effect," system output is heat, the input the voltage
Phys. Lett. 86A, 283 (1981). across the heating element. When the desired
13. O'Connell, R. F., and Wallace, G., "Ellipticity and temperature (the set-point) is reached the heat
Faraday Rotation due to a Two-Dimensional Elec- causes the bimetallic strip to deform sufficiently
tron Gas in a Metal-Oxide-Semiconductor (MOS) to switch off the heating element. As the tem-
System," Phys. Rev. 826,2231 (1982). perature falls back below the set-point the strip
14. Manakov, N. L., Ovsiannikov, V. D., and Kielich, returns to its original shape, contact is reestab-
S., "Nonlinear Variations in the Faraday Effect lished, and the heater switches on again. A
caused in Atomic Systems by a strong Magnetic second illustrative example may be found in the
Field," Phys. Rev. A21, 1589 (1980). fly ball governor (attributable to Sir James Watt)
15. van der Ziel, J. P., Pershan, P. S., and Malmstrem, and used to control steam engines (Fig. 2). As
L. D.,Phys. Rev. Lett. 15 190 (1965). more steam is fed to the engine the shaft
FEEDBACK 422

from an attack on the problem of feedback


amplifier stability. The three succeeding decades
"
I saw the emergence of what is now considered
the classical theory of control as a coherent
I
discipline which held and continues to hold a
[] heater p~votal positi~n in a world of increasing tech-
mcal compleXity and automation. The classical
results, including the stability criteria and de-
sign techniques of Nyquist, Bode, and Nichols
L-------~~r_--------~ are . conce~ned primaril~ with linear system~
havmg a smgle control mput and a single out-
supply put. The interested reader can do no better
FIG. 1. Simple bimetallic strip heating control. than consult Refs. 3, 4, or 5 or any other of
the excellent introductory textbooks available
on the subject. The historical perspective has
accelerates and the centrifugal forces generated been discussed by Macfarlane.
6

are used to reduce the amount of steam made From the early 1960s system complexity in-
available, notice here the importance of negative creased ~xponentially and various problems
feedback-as the output exceeds the set-point arose wh1ch were not amenable to the classical
the input signal is decreased and vice-versa. If methods of feedback design. Many practical
the governor had been incorrectly installed so systems have not one, but many inputs and out-
that ~he steam supply increased with increasing puts, and these exhibit varying degrees of inter-
veloc1ty (positive feedback) the resulting insta- action with a signal to one input causing re-
bility would have destroyed the steam source sp?nses on several outputs. For example, the
or engine or both. The governor acts as a regu- a~Justment of the wing flaps of an aeroplane
lator. If the load is increased the speed of the wlll c~use a change in the direction of flight,
shaft will initially decrease but with negative but wIll also cause the aircraft to bank and
feedback the governor will then increase the a. chang~ in the. set-point of one of the pr~pul­
ste~m supply pressure to restore the speed close
SlOn U~l1ts of a Jet foil will cause changes in the
to 1tS nominal value. This illustrates one of the roll, pitch, and yaw motions of the vessel. Im-
most important uses of feedback, namely to plementation of classical single-loop controllers
reduce the sensitivity of the controlled variable on such systems frequently leads to stability
to disturbances made to the system. Even earlier problems traceable to this interaction and
examples of such regulation are mechanical hence Rosenbrock, Macfarlane, and oth~rs7,8
cl?ck~ and lift tenters designed to regulate the
produced exten~ions of the classical techniques
grIndmg of corn. A detailed discussion of the to allow the des1gn of multi-input, multi-output
mechanisms used in centrifugal governors and (MIMO) feedback systems. Unlike the earlier
of the early history (up to 1930) of control methods these are in no sense "pencil and
engineering may be found in Ref. 1. paper" techniques; they usually require the use
The true birth of feedback control in the of computer graphics terminals and extensive
analytical as well as the practical sense can interactive, computer-aided design programs:
perhaps be pinpointed to the publication of They resemble earlier work however in that
Nyquist's seminal paper of 1932,2 which arose they ~res~nt infor~ation ~oncerning system
dynamiCs m a graphical form easily interpreted
by the practiced engineer.
Most analysis and design techniques require
that the system to be controlled be linear and
that ~ linear controller ~e used. However, ~any
prachcal systems are highly nonlinear and the
design of a nonlinear controller such as the
bimetallic strip in the heating co~trol can lead
to a particularly elegant solution to ~ control
problem. Analysis and design of nonlinear feed-
back systems can be difficult. However the
stabil~ty theorems of Liapunov9 have pro~ided
a bas1s for the development of techniques such
as the cricle criteria 8 which allow analysis and
design of a restricted set of nonlinear feedback
syste~s using traditional frequency domain
techmques for both single-input-single-output
and multivariable systems. Although closed
loop stability can be guaranteed by such meth-
ods, design tends to be conservative and in
FIG. 2. Simple governor. practice there has been some succes~ in the
423 FERMI-DIRAC STATISTICS AND FERMIONS

application of approximate methods such as particular, biological systems have very success-
the describing function 10 whereby a nonlinear fully used feedback for millions of years. At-
element is replaced by a set of equivalent linear tempts have also been made to model ecological
elements for a set of specified input signal and economic systems in order to gain further
amplitudes. understanding of these complex feedback sys-
Another area of increasing importance is that tems and to be able to apply appropriate control
of adaptive or "self tuning" feedback. This has to modify their behavior.
been an identifiable topic in the literature of
control engineering for at least the last two G.E.HAYTON
decades, although the majority of practical P. M. TAYLOR
applications are of relatively recent date. Adap-
tive techniques have been mainly concerned References
with classically structured feedback systems,
the word adaptive signifying that the controller 1. Bennett, S., "A History of Control Engineering
"learns" about the controlled process while at 1800-1930," Stevenage, U.K. and New York,
the same time controlling its behavior. It is thus Peter Peregrinus, 1979.
expected that control will improve with the 2. Nyquist, H., "Regeneration Theory," Bell Syst.
passage of time and further that the control Tech. 1. 11,126-147 (1932).
will be robust in the presence of minor varia- 3. D'Azzo, J. D., and Houpis, C. H., "Feedback Con-
tions in the parameters of the controlled pro- trol Systems Analysis and Synthesis," New York,
cess and its environment. In crude terms the McGraw-Hill,
controller may be described as one which 4. Di Stefano, J. J., Stubberud, A. R., and Williams,
"learns from its own mistakes" and may thus I. J., "Theory and Problems of Feedback and
be regarded as having some degree of intelli- Control Systems," Schaums Outline Series, New
gence. To date the majority of applications York, McGraw-Hill, 1967.
have been in nonlinear, stochastic systems. A 5. Zadeh, 1. A., and Desoer, C. A., "Linear System
good overview of both theory and application Theory: A State Space Approach," New York,
may be found in Ref. 11. McGraw-Hill, 1963.
The work described above is concerned with 6. Macfarlane, A. G. J. (Ed.), "Frequency Response
the design of feedbJlck systems; the hardware Methods in Control Systems," New York, I.E.E.E.
used to implement the controllers clearly varies Press, 1979.
from application to application, but it is of 7. Rosenbrock, H. H., "Computer Aided Control
interest to note that virtually every area has System Design," New York, Academic Press,
been affected by the availability of inexpensive 1974.
microprocessors. The availability of such com- 8. Patel, R., and Munro, N., "Multivariable System
puters as control elements allows much greater Theory and Design," Elmsford, N.Y., Pergamon,
flexibility in the design of controllers. The use 1981.
of a digital controller however causes additional 9. Gibson, J. E., "Nonlinear Automatic Control,"
delays around the feedback loop because of New York, McGraw-Hill, 1963.
finite sampling rates, computation times, and 10. Atherton, D. P., "Nonlinear Control Engineering,"
digital-to-analog conversion times. Designs are New York, Van Nostrand Reinhold, 1975.
generally undertaken either directly on the 11. Harris, C. J., and Billings, S. A., (Ed.), "Self-Tim-
discrete systems mode11 2 or via a design of a ing and Adaptive Control," Peter Peregrinus,
controller for a continuous system. 13 Sampling Stevenage, U.K. and New York, 1881.
rate, wordlength, and computer algorithm need 12. Kuo, B. C., "Digital Control Systems," New York,
to be carefully chosen for successful design. Holt-Saunders, 1980.
Currently one of the most exciting applica- 13. Katz, P., "Digital Control using Microprocessors,"
tions of feedback is in the rapidly growing area Englewood Cliffs, N.J., Prentice-Hall International,
of robotics. One method of increasing the in- 1981.
dustrial applicability of robotic devices is to 14. "Proceedings of the FirstInternationalConference
give them the capability of sensing their envi- on Robot Vision and Sensory Controls, April 1-3
ronment via simple vision and touch systems. 1981, Stratford-upon-Avon, U.K.," I.F.S. Con-
The robot gripper is frequently a useful loca- ferences, Kempston, Bedford, England, 1981.
tion for such sensors. Signals from the sensor
are processed to extract the relevant informa- Cross-references: BIONICS, CIRCUITRY, CYBER·
tion, which is then passed to the robot control NETlCS, MECHANICS.
system, which in turn uses it to determine an
optimal control strategy and thus obtain a
degree of closed loop feedback.14 FERMI-DIRAC STATISTICS AND
The impetus to understand and design feed- FERMIONS
back systems has come from all branches of
engineering, in particular electronic and me- Solid metals are good conductors of heat and
chanical engineering. However, the concept of electricity because about one electron per atom
feedback has much wider scope than this. In is free to migrate through the volume of the
FERMI-DIRAC STATISTICS AND FERMIONS 424

conductor. These electrons were once thought on E with most particles having low E and few
to behave like gas molecules which obey Max- having high E. Indeed for T = 0, the number of
well-Boltzmann statistics in which the number particles with E =1= 0 becomes zero-in a Max-
of particles at higher energies falls off exponen- well-Boltzmann gas all particles come to rest at
tially according to a relation of the form absolute zero. A Fermi-Dirac gas behaves very
differently at absolute zero. The exponent of e
I is plus or minus infinity depending upon
nE,T = eE / kT whether E is greater or less than Ei. The ex-
ponential term is either infinity or zero. The
where E is the energy, k the Boltzmann Con- denominator is either infinity or one. All energy
stant and T the absolute temperature. This elec- states below Hi are filled whereas all those above
tron gas theory was qualitatively useful in ex- Ei are empty. Thus, consistent with the assump-
plaining many metallic properties, but it was tions, at absolute zero the electrons do not
never quantitatively successful. One notable crowd into one state of zero. energy but are
failure was its prediction that electrons should uniformly distributed among those states which
contribute to the specific heats of metals. are below the critical energy Ei called the
Bohr had shown that the electron in hydrogen Fermi energy or the Fermi level. Fermi energies
is not free to assume any energy, but is re- depend on the kind of metals but they are of
stricted to certain permitted energies called the order of several electron volts. Thus, even
quantum states or energy levels. When this quan- at absolute zero, some electrons have energies
tum view of atomic electron structure was ex- which would be typical of a Maxwell-Boltzmann
tended to more complex atoms, it was found electron gas only if that gas were at several
that electrons obey the Pauli exclusion princi- thousand degrees.
ple-only two electrons in anyone atom having The contrast may be dramatized by the fol-
oppositely directed spin can occupy the same lowing analogy. If grains of sand are spilled on
energy state. Thus in an atom with many elec- an open floor, they will spread out so they are
trons, no more than two can have the lowest only one deep and each has zero potential en-
permitted energy, no more than two may have ergy. If the grains are poured into a drinking
the next higher permitted energy, etc. An unex- straw, the straw will fill to a certain height and
cited atom with all its electrons in their lowest some grains will have considerable potential
possible energy states includes many electrons energy.
whose energy is well above the energy of the Heating a metal from absolute zero to room
lowest two. The old electron gas theory of temperature adds only .025 eV to the average
metals recognized that the inner electrons asso- energy of its particles. Since the electrons al-
ciated with each atom were quantized but as- ready have a much greater average energy, heat-
sumed that the electrons that were not bound to ing a metal has but a slight effect on the energy
particular atoms were entirely free to migrate distribution of the electrons. This accounts for
through the metal with no a priori restrictions the fact that electrons make a negligible contri-
on their energy. Fermi-Dirac statistics describes bution to the specific heats of metals, and it
the behavior of the electron gas under the as- also explains why metals must be glowing hot
sumption that all electrons within the conductor before electrons acquire enough additional en-
have their energies quantized and obey the Pauli ergy to escape from the metal surface as in the
principle. This new viewpoint leads to a distribu- filaments of radio tubes. Since the quantum
tion of electron energies according to a relation view of electrons in a metal provides both a
of the form qualitative and quantitative picture of many
metallic properties, we know metallic electrons
are quantized Fermi particles rather than un-
nE,T e(E-Ej)jkT +I quantized Maxwell particles. The application of
Fermi-Dirac statistics to semi-conductors ac-
where the new symbol, Ei, is a critical energy counts for their special properties as demon-
characteristic of the metal more fully described strated by transistors.
below. If the metal is at high temperature, this From the standpoint of wave mechanics, all
function approaches the Maxwell-Boltzmann particles which are confined in any way are
distribution. We can see this by noting that if quantized. Those whose spin is integral have
T -+ 00, the exponent of e approaches zero re- symmetric wave functions and do not obey the
gardless of E. Thus, in both cases, the number of Pauli principle. If they are so numerous that
electrons of each energy tends to become uni- they must be treated statistically, they are
form. The high-temperature electrons have so called bosons and are described by Bose-Ein-
many states available to them that quantum re- stein statistics. Photons are the most common
strictions make little difference. If we let the bosons. Those particles whose spins are odd
temperature approach absolute zero, the dif- multiples of ! have antisymmetric wave func-
ference between these distributions becomes tions and obey the Pauli principle. They are
extreme. If T is very small, the Maxwell- called fermions and obey Fermi-Dirac statistics.
Boltzmann distribution is strongly dependent Although electrons are the most common ex-
425 FERMI SURF ACE

ample, protons, neutrons, and J..L-mesons are all surface exists simply because the electrons obey
fermions with spin!. At high temperatures, the Fermi-Dirac statistics.
quantum nature of both bosons and fermions Consider first an elementary model of a metal
becomes insignificant and both obey the classi- consisting of a lattice of fixed positive ions im-
cal statistics of Maxwell-Boltzmann. The tech- mersed in a sea of conduction electrons which
nique of deriving these distributions is called are free to move through the lattice. Every
statistical mechanics. direction of electron motion is equally probable.
To convey the over-all method of STATISTI- Since the electrons fill the available quantized
CAL MECHANICS, we note that it is a probability energy states starting with the lowest, a three-
theory in which the basic technique is to com- dimensional picture in momentum coordinates
pute the number of possible ways in which a will show a spherical distribution of electron
system can arrange itself subject to restrictions momenta and, hence, will yield a spherical
as to the number and total energy of the par- Fermi surface. In this free electron model, no
ticles. These ways are all assumed equally likely. account has been taken of the interaction
(There are 52 factorial, 52!, different arrange- between the fixed positive ions and the elec-
ments which might result from the shuffling of trons; indeed the only restriction on the move-
a deck of playing cards. Each is equally likely.) ment or "freedom" of the electrons is the
Then, depending on the nature of the particles, physical confines of the metal itself.
bosons or fermions, the number of distinguish- A short derivation starting with the Schro-
able ways is computed. (In the game of bridge, dinger equation shows that the total energy of
there are many fewer deals 52! /(13!)4, than an electron (and thus also its kinetic energy)
there are shuffles because the order in which a is given by
player receives his cards does not change his
"hand.") The probability of any particular dis- E=1i 2k 2/2m =p2/2m
tinguishable distribution is proportional to the where m is the mass of the electron. A plot of
number of ways in which it can be achieved. (If E against k is then a parabola, as shown in Fig.
we flip a coin five times, there are 2 5 = 32 lea). The Cartesian components of those values
orders in which the coin can fall. Of these, there of k which are possible solutions to the Schro-
are ten ways to get two heads and only one way dinger equation are ki = 21Tni/L, where the n;'s
to get five heads. We therefore find getting two are integers and L is a physical dimension of
heads ten times more probable than getting five the metaL Since for each energy value so de-
heads.) The actual expected distribution is the fined there are actually two states (one for an
one which can be achieved in the largest number electron with spin up, one with spin down), it
of ways. can be shown that the density of energy states
JAMES A. RICHARDS, JR . available to the electrons is

Reference
gee) = (2m?12 E1I2
21T 2 fj3
Leighton, Robert B., "Principles of Modern Physics," where geE) dE is the number of states in the
New York, McGraw-Hill Book Co., 1959. energy range E to E + dE. Then nee), the num-
ber of electrons per unit volume occupying
Cros&-references: BOLTZMANN'S DISTRIBUTION energy states in this energy range, is
LAW, BOSE-EINSTEIN STATISTICS AND BOSONS,
ELECTRON SPIN, STATISTICAL MECHANICS. neE) dE = g(E)[(E) dE
where fiE) = {exp [(E - Er)/bT] + 1}-1, a
function characteristic of particles which obey
FERMI SURF ACE Fermi-Dirac statistics. In this expression, T is
the absolute temperature, b is Boltzmann's
The Fermi surface of a metal, semi-metal, or constant, and Er is a parameter r1p.l)ending on
semiconductor is that surface of constant en- the number of electrons involved and indeed
ergy in momentum space which separates the turns out to be the Fermi energy. Er can be
energy shtes which are filled with free or evaluated by integrating neE) dE from E = 0 to
quasi-free electrons from those which are un- E = 00 and recognizing that the integral is equal
filled. [Momentum space is defined in terms of to N, the total number of electrons per unit
three orthogonal axes, the components of the volume. The result (at T = 0 K) is
momentum vector, Px, P;y, and pz (or alterna-
tively, the components of the wave vector, k x ,
ky, and kz: p = fik, where 11 = Planck's constant Er = 1T2112 (3N)2/3
divided by 21T). The components of momentum 2m 1T
of an electron at a given instant of time may be
thought of as the coordinates of a point in mo- <
At T = 0 K, for E Ef, fiE) = 1, while for
mentum space which then moves about as E> Ef, fiE) = O. Physically this means that the
various forces act on the electron.] The Fermi probability of a state below the Fermi level
FERMI SURFACE 426

E surface is well defined by the statement above.


It must, however, be noted that this is not
necessarily true for semiconductors where the
number of free electrons per unit volume may
be very much smaller.
The foregoing treatment gives a qualitative
insight into the physics of metals and, under
some circumstances, semi-metals and semicon-
ductors. A more detailed analysis requires that
the effects of the ions in the lattice be recog-
nized. This can be accomplished by introducing
the periodic potential duet to the lattice through
which the electrons must move. Then the elec-
trons are no longer "free," but, depending on
--------=--=--------k the strength and character of the potentials and
the approximations used in solving the Schro-
(a) dinger equation, act as "quasi-free" particles.
Another approach is the "tight-binding approx-
imation"; occasionally a combination of the
E two approaches is used. In any case, introduc-
tion of lattice effects changes the characteristics
of the model; the total energy and kinetic
energy of an electron are no longer equivalent.

\ J
The periodic lattice can be described con-
veniently in terms of Brillouin zones, each of
which is large enough (in momentum space) to
accommodate two electrons per atom. The
Brillouin zone boundaries appear to the elec-

\ / trons as Bragg reflection planes or energy dis-


continuities, resulting in an energy versus wave
number plot as shown in Fig. l(b).

"-
For many metals, the "nearly free" electron
description corresponds quite closely to the
~~~~~~~~--k physical situation. The Fermi surface remains
-l!! _!:!!. -It 0 .!. 2", 3rf nearly spherical in shape. However, it may now
a a a a a a be intersected by several Brillouin zone bound-
(b) aries which break the surface into a number of
separate sheets. It becomes useful to describe
FIG. 1. (a) Energy plotted against wave number for the Fermi surface in terms not only of zones
the free electron model. (b) Energy plotted against or sheets filled with electrons, but also of zones
wave number for the "quasi-free" electron model, or sheets of holes, that is, momentum space
showing energy discontinuities at Brillouin zone volumes which are empty of electrons. A con-
boundaries. ceptually simple method of constructing these
successive sheets, often also referred to as "first
zone," "second zone," etc., was demonstrated
being occupied is one; whereas for states with by Harrison. 1 An example of such a construc-
E > Ef the occupancy probability drops tion is shown in Fig. 2. This construction works
abruptly to zero. For temperatures greater than quite well, for example, for aluminum which
absolute zero, tl!e occupancy probability drops has three valence electrons per atom. Experi-
smoothly from 1 to 0 in a range of energy of ments, and indeed more elegant theoretical cal-
width approximately equal to bT. This shell of culations, show that the fourth zone is totally
partially filled states gives rise to the following unoccupied and that the third zone monster is
definition: The Fermi level is the energy level not multiple-connected in the manner shown.
at which the probability of a state being filled The recipe for constructing these figures, some
is just equal to one half. of which may even be pleasing to art con-
A numerical evaluation of the Fermi energy noisseurs, cannot be developed in the limited
for a simple metal having one or two conduc- space of this article but will be found in the
tion electrons per atom yields a value of approx- references. I ,2
imately 10- 11 erg, or a few electron volts. The The intense research effort of the last 15
equivalent temperature, Ef/b, is several tens of years on the Fermi surfaces of metals and semi-
thousands of degrees Kelvin. Thus, except in metals originated, to a great extent, with Pip-
extraordinary circumstances, when dealing with pard's ingenious deductions, based on anoma-
metals, bT < Ef; i.e., the energy range of lous skin-effect experiments, concerning the
partially filled states is small, and the Fermi Fermi surface of copper.3 Prior to Pip pard's
427 FERMI SURF ACE

,., BANO S,d BAHO

VALENCE I

VALENCE 2

VALENCE 3

VALENCE 4
FIG. 2. Fermi surfaces in several zones or bands, for face-centered cubic metals having various numbers of
"quasi-free" electrons per atom, as constructed by Harrison. 1

work, it was taken for granted that in copper, (1) The DE HAAS-VAN ALPHEN EFFECT4 is
with one quasi-free electron per atom, the first an oscillatory behavior of the magnetic suscep-
Brillouin zone would be only half filled and, tibility (or more generally, oscillatory behavior
hence, would have a nearly spherical Fermi sur- of any electronic property) due to the quantiza-
face. His work suggested that a series of eight tion of magnetic flux through an electron orbit
necks pull out and touch the Brillouin zone in units of hc/e (where h = Planck's constant,
boundaries in the [111] crystallographic direc- c = speed of light, and e = electron charge).
tions. This shape has now been confirmed and Measurements of the periods of the oscillations
precisely mapped, not only for copper but also can be directly related to extremal cross-
for silver and gold. sectional areas of the Fermi surface.
A variety of experimental techniques has been (2) Cyclotron resonances refers to oscilla-
developed, capable of yielding both overlap- tions in the magnetic field dependence of high-
ping and complementary information concern- frequency surface impedance arising when the
ing Fermi surfaces of metals. Some of these electron's cyclotron frequency is an integral
techniques are described briefly: multiple of the rf frequency. The periods of
FERMI SURFACE 428

these oscillations measure dA/dE, the rate of metallic behavior. Because the Pauli exclusion
change of cross-sectional area with energy at principle prevents more than one electron from
extremal cross sections. occupying a given quantum state, the electrons
(3) Magnetoresistance 6 refers to extra resis- in most of the filled states below the Fermi
tance in a metal in the presence of a magnetic energy are "frozen," that is, they cannot be
field. This magnetoresistance may be changed excited to a higher energy state by an external
by changing the direction of the magnetic electric or magnetic field because the higher
field and/or its magnitude. This direction and states are already occupied. Only the relatively
field dependence gives information about Fermi small number of electrons within about bT of
surface topology, particularly regions of con- the Fermi energy can respond to external in-
tact with Brillouin zone boundaries. fluences, and these "Fermi surface" electrons
(4) The magnetoacoustic effect 7 refers to determine all the basic metallic properties, e.g.,
oscillations in the magnetic field dependence of electrical conductivity, magnetic susceptibility,
ultrasonic attenuation. The oscillations occur heat capacity, catalytic activity, thermoelectric
when dimensions of cyclotron orbits of elec- effects, etc. Thus the experimental characteriza-
trons in extremal states on the Fermi surface tion and theoretical understanding of Fermi
are equal to integral multiples of the ultra- surface properties is one of the major activities
sonic wavelength so that measurements of these in solid state physics.
oscillations give extremal linear dimensions of Extensive application of the experimental
the Fermi surface. techniques described above has resulted in a
(5) The anomalous skin effect 3 occurs when great deal of Fermi surface information about
the electron mean free path is long compared to a large number of metals.12 Nearly all the ele-
the rf skin depth. In this circumstance, elec- mental metals (except those with severe sample
trons moving nearly parallel to the sample sur- preparation problems) have been investigated, 13
face will dominate the conductivity. A measure- and many are extremely well characterized. 14
ment of surface impedance then gives an The complicated Fermi surface geometries that
integral of the radius of curvature over that occur in the transition metals can be con-
part of the Fermi surface containing the effec- veniently described by any of several parametri-
tive electrons. zation schemes, the most successful of which is
(6) Positron annihilation in metals 8 gives based on the Koringa-Kohn-Rostoker (KKR)
directly the number of electron states in method of energy band calculation. 1s This tech-
various cross sections of the Fermi surface. nique allows even the most intricate Fermi sur-
Because a positron impinging upon a metal is faces to be accurately described by a small
quickly thermalized, when it annihilates with number of physically meaningful parameters
an electron the resulting gamma rays must (usually seven or less), so that the anisotropy
carry off the annihilated electron's momentum. of various fundamental quantities over the
Thus gamma ray angular correlation measure- Fermi surface can be studied and correlated
ments can be related to the distribution of in a simple way. These parametrization schemes
electron momenta. have been used in detailed studies of the
(7) The Gantmakher effect 9 (or radio fre- variation over the Fermi surface of the electronic
quency size effect) occurs in thin samples when Fermi velocity, 15 scattering lifetime due to
a magnetic field parallel to the sample surface impurities and defects,16 the electron-phonon
is adjusted so that extremal electron orbits just interaction,17 and superconducting energy gap.
fit in the sample thickness. RF energy can then Paralleling this experimental work has been a
be transported across the thickness of the great deal of theoretical effort devoted to first-
sample by the electrons. Observation of these principles calculation of energy bands and Fermi
Gantmakher resonances, like the magnetoacous- surface properties. 18 ,19 Like the experimental
tic resonances, gives extremal linear dimensions work, theoretical treatments of the Fermi sur-
of the Fermi surface. faces of elements have been highly developed,20
These techniques have been used primarily to to the extent that agreement with experiment
measure extremal properties of the Fermi sur- is usually within 10% and often much better.
face geometry. Considerable effort has also Increasingly, both theory and experiment are
been applied to the measurement of nonex- turning away from the elements to alloys21,22
tremal properties lO ,l1 which can do much to and intermetallic compounds,23 where a much
elucidate Fermi surface geometry. The preci- greater range of unusual metallic behavior is to
sion and applicability of each of these tech- be found. Of particular interest is the behavior
niques is dependent on the material under in- arising from various many-body effects: super-
vestigation. Among pertinent factors are the conductivity from the electron-phonon interac-
number of quasi-free electrons per atom, crystal- tion, spin fluctuations and itinerant mag-
lographic structure, magnetic properties, purity, netism from exchange interactions, and elec-
and practicality of sample preparation. tron localization and local moment formation
The fundamental interest in the measurement from electron correlation effects. Experimen-
and calculation of Fermi surface properties tally these many-body effects reduce the
derives from their central role in understanding Fermi velocity of electrons at the Fermi surface
429 FERRI MAGNETISM

and can be studied through effective mass mea- 19. Koelling, D. D., Rep. Prog. Phys. 44,139 (1981).
surements in the de Haas-van Alphen effect 23 20. Mackintosh, A. R., and Andersen, 0. K., in "Elec-
and cyclotron resonance. The theoretical de- trons at the Fermi Surface," (M. Springford, Ed.),
scription of these many-body effects is less well Cambridge, U.K., Cambridge Univ. Press, 1980,
developed, but is receiving increasing atten- p.149.
tion.24 Understanding these many-body effects 21. Saito, Y., and Maezawa, K., in "Proceedings of
and their influence on Fermi surface properties Twelfth International Conference on Low Tem-
and metallic behavior is likely to be a major perature Physics," (E. Kamda, Ed.), Kyoto, Aca-
theme of both theory and experiment in the demic Press of Japan, 1971, p. 583.
coming years. 22. Coleridge, P. T., in "Electrons at the Fermi Sur-
face," (M. Springford, Ed.), Cambridge, U.K.,
H. V.BOHM Cambridge Univ. Press, 1980, p 321.
NORMAN TEPLEY
23. Crabtree, G. W., Johanson, W. R., Campbell, S. A.,
GEORGE CRABTREE
Dye, D. H., Karim, D. P., and Ketterson, J. B.,
"Proceedings of the International Conference on
References Physics of Transition Metals," (P. Rhodes, Ed.),
Inst. Phys. Con! Ser. 55,79 (1981).
1. Harrison, W. A.,Phys. Rev., 1l8, 1190 (1960). 24. Wilkins, J. W., in "Electrons at the Fermi Sur-
2. Ziman, J. M., "Electrons in Metals; A Short Guide face," (M. Springford, Ed.), Cambridge, U.K.,
to the Fermi Surface," London, Taylor and Fran- Cambridge Univ. Press, 1980, p. 46.
cis, 1963.
3. Pippard, A. B., Phil. Trans. Roy. Soc. London Cross-references: CYCLOTRON RESONANCE; DE
Ser. A, 250, 323 (1957). HAAS-VAN ALPHEN EFFECT; ENERGY LEVELS;
4. Shoenberg, D., in "Proceedings of the Ninth In- SOLID STATE PHYSICS; SOLID STATE THEORY;
ternational Conference on Low Temperature Phys- TRANSPORT THEORY.
ics," J. G. Daunt, D. O. Edwards, F. 1. Milford,
and M. Yaqub editors, New York, Plenum Press,
1965 (p. 665).
5. Kip, A. F., in "The Fermi Surface," W. A. Harri- FERRIMAGNETISM
son, and M. B. Webb editors, New York, John Snoek's publication (1946) of his wartime work
Wiley & Sons, 1960 (p. 146). on ferrites established the existence of new
6. Pippard, A. B., "The Dynamics of Conduction ceramic magnetic materials capable of combin-
Electrons," New York, Gordon and Breach, 1965 ing the resistivity of a good insulator (10 12 ohm-
(p.90). cm) with high permeability. (see MAGNETISM.)
7. Tepley, N.,Proc. I.E.E.E., 53,1586 (1965). In 1948, Neel introduced the term ferrimag-
8. Stewart, A. T., in "Positron Annihilation," A. T. netism to describe the novel magnetic properties
Stewart and L. O. Roellig editors, New York, of these materials. A simple ferrite is composed
Academic Press, 1967 (p.17). of two interpenetrating FERROMAGNETIC sub-
9. Gantmakher, V. F., Zh. Eksperim, i Tear. Fiz., lattices with magnetizations Ma(D and Mb(D
43, 345 (1962). (English Trans!.: Soviet Physics which decrease with increasing temperature and
.TETP, 16, 247 (1962).) vanish at the Curie point, Te. In a ferromagnetic
10. Dooley, 1. W., and Tepley, N., Phys. Rev., 187, material, the resulting saturation magnetization,
781 (1969). M, would be Ma + Mb; however, in a ferrite,
11. Henrich, V. E., Phys. Rev. Letters, 26, 891 strong antiferromagnetic interaction between
(1971). sub lattices results in anti parallel alignment, and
12. Cracknell, A. P. and Wong, K. C., "The Fermi Sur- M = Ma - Mb. In general Ma(T) =1= Mb(D, and
face," Oxford, Clarendon Press, 1973. the material behaves in most respects like a fer-
13. Young, R. c., Rep. Prog. Phys. 40,1123 (1977). romagnet, exhibiting domains, a hysteresis loop,
14. See, for example, Karim, D. P., Ketterson, J. B., and saturation of the magnetization at relatively
and Crabtree, G. W.,.T. Low Temp. Phys. 30,389 low applied magnetic fields. Practical values for
(1978) and Dye, D. H., Campbell, S. A., Ketter- saturation magnetization and Curie temperature
son, J. B., and Vuillemin, J. J., Phys. Rev. B23, range from 250 to 5000 oersteds and from 100
462 (1981). to 600°C.
15. Crabtree, G. W., Dye, D. H., Karim, D. P., and Ferrimagnetic materials have spinel, garnet,
Ketterson, J. B., J. Magnetism and Magnetic Ma- and hexagonal structures. A typical spinel ferrite
terials II, 236 (1979). . is NiFe204' Other ferrites may be obtained by
16. "Proceedings of the International Conference on substituting magnetic (Co, Ni, Mn) or non-
Electron Lifetimes in Metals," (D. H. Lowndes magnetic (AI, Zn, Cu) ions for some of the Ni
and F. M. Meuller, Eds.), Phys. Condo Matter 19, or Fe ions, e.g., Nil-yCoyAlxFe2 -X04, where
1-423 (1975). x and y may be varied to modify M and Te.
17. Crabtree, G. W., Dye, D. H., Karim, D. P. Koelling, Yttrium iron garnet (YIG), Y 3FeS012, is the
D. D., and Ketterson, J. B., Phys Rev. Letters classical ferrimagnetic garnet which combines
42,390 (1979). very low magnetic loss with high resistivity.
18. Harrison, W. A., "Pseudopotentials in the Theory Substitution of magnetic RARE EARTH ions
of Metals," New York, W. A. Benjamin, 1966. (Gd, Yb, Ho, etc.) for Y and of nonmagnetic
FERRI MAGNETISM 430

ions (Ga, AI) for some of the Fe ions leads to the lower part of the microwave spectrum
many different ferrite compositions with a wide where M = 200 to 3000 G is adequate. In the
range of M and magnetic loss. The rare earth millimeterwave region, f= 30 to 100 GHz, one
ions form a third magnetic sub lattice with at- uses Ni-Zn ferrites (M = 5000 G) and hexagonal
tendant magnetization Me antiparallel to the re- ferrites of various compositions.
sultant magnetization Ma,b of the two Fe sub- All microwave ferrite devices such as isolators,
lattices. Since Me and Ma,b exhibit different circulators, switches, phase shifters, limiters,
variations with temperature, the net magnetiza- parametric amplifiers, and harmonic generators
tion may vanish twice, at Te and at an inter- are based on interactions of rf signals with the
mediate temperature called the compensation ferrite magnetization. Aligning M with an ex-
point, Tromp, where Me = Ma,b. ternal biasing magnetic field, Hde, and applying
A typical hexagonal ferrite is BaFe12019' a microwave signal in an orthogonal direction
Again, other magnetic ions such as Mn, Co, and leads to strong interaction and gyro magnetic
Ni may be introduced to produce wide varia- resonance. On a microscopic scale, this is ex-
tions in M and Te. Hexagonal ferrites are char- plained as application of a torque to the un-
acterized by large anisotropy fields with an axis paired ELECTRON SPINS of the magnetic ions
of symmetry which may be either a direction of which causes them to precess at the rf frequency
hard (planar ferrites) or easy (uniaxial ferrites) much like so many spinning tops. The preces-
magnetization. sional motion has a microwave RESONANCE fre-
To distinguish among major fields of applica- quency fr dependent upon Hde and the gyro-
tions, ferrites can be separated into five groups: magnetic splitting factor geff. In ferrimagnets
soft, square-loop, hard, microwave, and single- with spinel structure, geff is related to the g-
crystal ferrites. factors of the sub lattices as follows:
Soft ferrites have a slender, S-shaped hysteresis
geff =M/[(Ma/ga) - (Mb/gb)]
loop with low remanence and low coercive force
permitting easy magnetization and demagnetiza- On a macroscopic scale, this interaction modi-
tion with little magnetic loss. Mn-Zn and Ni-Zn fies the rf magnetic field in a manner which is
ferrites with spinel structure exhibit these prop- described by introducing an antisymmetric per-
erties and permit adjustment of M, and perme- meability tensor [J.1) whose complex compo-
ability, J.1i, over a wide range of values through nents depend on M, Hde, and frequency. When
variations in composition. Ni ferrite has J.1i = 15 the frequency approaches fr, one observes a
and M = 3000 G, whereas Ni-Zn ferrite may resonance absorption line whose width, D.H, is
have as much as M = 5000 G combined with a determined by the magnetic loss of the material.
permeability of several thousand. Mn-Zn ferrites Values for D.H cover a range from <1 oersted
have values of J.1i = 500 to 5000 depending on for single-crystal YIG to >1000 oersteds for
composition. These ferrites are uniquely suited some polycrystalline Ni-ferrites. The interaction
to low-loss inductor and transformer cores for of rf fields and electron spins becomes a maxi-
radio, television, and carrier telephony. mum if the rf field is circularly polarized in the
Square-loop ferr.ites are materials exhibiting an same sense as the precessional motion of the
almost rectangular hysteresis loop with two di~­ spins. Circular polarization in the opposite di-
tinct states of remanence and with a coerciv, · rection produces almost no interaction and no
force of a few Oe. All practical square-loop fer· gyro magnetic resonance. This permits design of
rites have a spinel structure. The Mg-Mn (Zn) nonreciprocal ferrite devices. At high levels of
system has retained its preeminent position in microwave power, nonlinear coupling between
computer memory applications two decades microwave signal and precessional spin motion
after its discovery in 1951. More recently, Li-Ni causes the parametric excitation of higher order
ferrites and more complex systems containing modes of spin motion (magnetostatic modes and
Li, Mn, and Al have become competitive in ap- spin waves). This effect has been exploited in
plications requiring stability and fast switching limiters and parametric amplifiers.
over a wide range of temperatures. Single-crystal ferrites of practical importance
Hard ferrites are characterized by hexagonal are rare-earth garnets grown in a flux of molten
structure, a hysteresis loop enclosing a large lead oxide. Some of these are optically transpar-
area, and a coercive force of several thousand ent permitting direct observation of magnetic
Oe. These ferrites can store a significant amount domains. Interaction of infrared and visible light
of magnetic energy, and have found widespread with the electron spins is called the magneto-
application as permanent magnets in hi-fi loud- optic effect. It permits electronic modulation
speakers, small motors, generators, measuring in- of a beam of light which propagates through a
struments, etc. single-crystal garnet. Devices of this type are of
Microwave ferrites have garnet, spinel, or hex- great potential interest in the rapidly developing
agonal crystal structure, and very low electric laser technology.
and magnetic loss factors. In general, the Magnetic Bubbles Single-crystal, rare-earth
required M increases with the frequency, f, of garnet sheets have been grown on a substrate
application. Substituted and pure garnets, Mg- with a preferred direction of magnetization per-
Mn-Al ferrites and Mg-Mn ferrites are used at pendicular to the plate. In these plates, tiny
431 FERRI MAGNETISM

round magnetic domains called bubbles can INERT SUBSTRATE


be formed by an applied magnetic field. These
bubbles can be propagated, erased, and ma-
nipulated to perform binary functions in
computers including logic, memory, counting,
and switching.
Bubbles are cylindrical magnetic domains
whose magnetization is reverse to that in the
remainder of the thin magnetic layer in which
they are present. If the magnetic layer is a
garnet then the process of bubble formation
can be observed with a polarizing microscope
as an external bias field applied perpendicular
to the surface is slowly increased until bubbles
develop from isolated island strip domains. This
GARNET EPI T AX I AL FILM
process is illustrated in Fig. 1. These single
domain configurations will only occur if the
magnetic material has a uniaxial anisotropy
with the easy axis of magnetization perpen-
dicular to the surface.
An external bias field is a necessary condi-
tion to support isolated bubble domains with
the allowable ranging of the bias field approxi-
mately one-tenth the saturation magnetic mo-
ment, about 50 Oe for the typical garnet film
supporting 2-J-Lm-diameter bubbles. The upper
limit of the bias field is set by bubble collapse,
the lower limit by a distortion instability in the
bubble shape leading to the conversion of a
bubble into a long meandering strip domain. MAG NETIC F IELD
This instability occurs at the domain strip
width at which the total energy is independent
of the strip length. These rules are applicable
for materials in which the uniaxial anisotropy
field Hk is much greater than 41TMs (Ms is the
saturation magnetization in gauss). The ratio APPROX 2/-1 IN
Hk/41TMs is defined as the material q and is, in DIAMETER
fact, a measure of the "stiffness" of the mag-
netization. Experience has shown that for most
useful bubble materials q > 3.
A material length 1 = uw/41TMs2 characterizes
materials for the bubble domain size they sup-
port. Here I is in centimeters and Uw is the do-
main wall energy in ergs/cm 2 . The optimum
magnetic film thickness h = 41 results in the
smallest stable domain diameter d = 1Tl at a bias
field HA = (0.3) 41TMs . (B IAS FIELD)
The interplay of the material parameters
Uw and Ms in determining bubble stability is BUBBLE FORMATION
shown in Fig. 2 and 3. Note that the bias field
acting directly on the bubble domain in the FIG. 1. The garnet epitaxial film is grown on a non-
classical sense and the energy of domain wall magnetic substrate. The serpentine nature of the mag-
both act to reduce the size of a bubble and are netic domains in garnet films arises from a preferred
opposed by the stray magnetostatic fields "easy" axis of magnetization perpendicular to the
which arise from the surface magnetic charges film surface. With no external magnetic field , the mag-
and which attempt to increase the bubble size. netic domains are arranged such that the sample is
The expression for the stray field HD shown magnetically neutral, half pointing up and half point-
in Fig. 2 is an approximation. Since the total ing down. As a small external magnetic field is applied,
energy h for the bubble has a minimum with the domains, whose polarity is opposite to that of
ah/ar = 0 and a2 h/ar 2 > 0 the bubble is the field, shrink. If the external magnetic field is fur-
statically stable. ther increased, the stripes shrink into cylinders-the
For a film of thickness h = 41 the nominal bubbles. The bubbles are free to move throughout the
bubble diameter d, i.e., the diameter when the film and can be viewed as tiny magnets afloat in a
bias field is set in the middle of the stable range, magnetic field sea of the opposite polarity.
FERRI MAGNETISM 432

tHeft
A) EXTERNAL BIAS
FIELD HA
o f------::-r_-

tHeft

B) EFFECTIVE r_
Of------
DOMAIN WALL
FIELD Ho-w

47TMs
C) MAGNETOSTATIC
(STRAY) FIELD HO
o I - - - - -r-_--
(0)

-M s

FIG. 2. The fields effective on the domain wall of the bubble domain which combined maintain
static stability are indicated. The bias field H A acts to diminish the volume of the bubble by exerting
a direct force on the domain wall directed radially inward. An effective domain wall field results from
a reaction by the domain wall energy density Gw to reduce the wall area to zero. The magneto static
(stray) field HD operates to equalize the overall magnetic surface charge by an increase in the bubble
volume. By resolving (a) into (b) and (c) the r-dependent part of the field is seen to be that of a
cylindrical dipole magnet.
433 FERRIMAGNETISM

t
INCREASES r

o
o /HTOTAL

r-r/----:=-=:-::::::=::=::s:a:
\ /b
r:--....:::;;--

DECREASES r

FIG. 3. There are two values of the bubble radius for which HT, the sum of
the fields illustrated in Fig. 2, is zero. Condition a is radially unstable and b is
stable, a conclusion that can be reasoned from this figure.

is predicted by theory to be S/. To store the tion rotates in a plane parallel to the wall as
greatest amount of information in a given area we move radially across the wall of the bubble.
is equivalent to packing the greatest number of Rotation within the wall can be either clock-
bubbles into that area, thus the highest bit wise (cw) or counterclockwise (ccw). In a hard
density will be realized in a material which bubble the domain wall is divided into cw and
supports the smallest-diameter bubbles. ccw Bloch segments linked to one another by
Bubble propagation circuits such as T-bar, Neel segments. In a Neel wall the magnetization
chevron, etc., provide a traveling local perturba- rotates cw or ccw in a plane perpendicular to
tion of the bias field. A bubble will move in a the wall. Neel and Bloch segments, really twists
small magnetic bias field gradient aHz/ax at a in the bubble wall, repel each other through the
velocity Vx = }lw(2raHz jax - SHe/rr)j2 where exchange energy which operates to keep
}lw is the planar wall mobility in cmjs - Oe, "adjacent" spins parallel. Thus the static prop-
He is the domain wall coercivity in oersted, and erties of hard bubbles differ from those of nor-
r is the bubble radius. At higher values of the mal bubbles. Examples of wall configurations
applied field gradient, often reached in device are shown in Fig. 4; bubbles (a) and (b) have
operation, the velocity reaches a maximum normal static properties. Bubble (c) is the only
value vp which in turn limits the maximum bubble that moves parallel to a field gradient,
data rate of a bubble device. A theoretical i.e., "normal" in the dynamic sense.
expression for this limiting velocity is vp = Three techniques can be used to suppress
24rA/h y'k;; where r is the gyromagnetic ratio, hard bubbles. They are multilayer garnet films,
A the exchange constant, Ku the uniaxial ion implantation, and a very thin (~300 A)
anisotropy constant (Hk = 2Ku/Ms), and h the Permalloy film directly on the garnet layer. Ion
magnetic film thickness. For most bubble film implantation is normally used. Figure 5 illus-
compositions in which the bubble thickness trates the effect of a hydrogen ion implant into
equals the bubble diameter, vp is in the range the outer 0.5 }lm surface of a nominally 6-}lm-
2000-3000 cm/s. thick (YGdTmh(GaFe)s012 garnet film on
It was an unexpected discovery that bubbles suppressing hard bubbles. The material param-
with differing static and dynamic properties eters of (YLuSmh(CaGeFe)s 0 12 garnet films
can exist in the same garnet film. Different are less sensitive to temperature variation and
domain wall states give rise to these unusual are used exclusively in devices with bubble
properties. The simplest bubble wall structure diameters 3 }lm or less. Garnet films can be
is a Bloch wall. In a Bloch wall the magnetiza- tailored to support 0.5-}lm bubbles, which im-
FERRIMAGNETISM 434

t t t Iii. ++1:1 t t t t t t 1:1 ••• Itl t t t

-------~ ----- -----~------


(0) (b)

t t t lil+ + +Iil t t t

(c) (dl
FIG. 4. (a) A simple Bloch wall. (b) a ccw sense for the Bloch wall. (c) A wall
structure partially (a) and (b). (d) A more complex example of wall structure.

plies that a storage density of 2 X 10 7 bits/em 2 ined as a constant amplitude rotating field in
can be realized with the garnet material system. the plane of the figure whereas the bias field
Garnet films are grown epitaxially on polished and the easy axis are normal to this plane. It
(Ill) wafers of Gd 3 Gas 0 12 . can be seen that the bubble moves from a fea-
Bubble operations used in chip designs are ture to the like feature one period away as the
propagation, generation, detection, transfer, drive field rotates by 360 0 •
replication, annihilation, and swap. It is beyond Propagation elements can be combined with
the scope of this section to discuss all of these bubble generators, detectors, etc. to build func-
in detail; however, we will dwell on the most tional chips. Many chip organizations are pos-
basic operation, that of propagation. Conductor sible. For example, Fig. 7 illustrates a 70 kbit
propagation was the first method used to move "endless loop" shift-register architecture.
bubbles but was replaced by feed access which Bubble memory chips are packaged in an as-
features inductive power field to produce bub- sembly of two mutually orthogonal wire-
ble motion via patterned permalloy features. wound solenoids which are driven to provide
Field access propagation by T-Bar permalloy a rotating in-plane drive field, and a permanent
features is illustrated in Fig. 6. The Permalloy magnet which provides the bias field necessary
pattern is generally not in direct contact with to maintain stable bubbles. Because the field
the bubble supporting garnet material to of the permanent magnet tracks the tempera-
guard against local spontaneous domain nuclea- ture variation of the bias field, the temperature
tion. Bubbles move to the right with a cw rota- range of satisfactory operation is wide. Data
tion of the in-plane field and to the left with a retention is completely nonvolatile.
ccw rotation. The drive field should be imag- Magnetic bubbles bridge the capacity-data
435 FERRI MAGNETISM

FIG. 7. Example of a 70-kbit bubble chip, approxi-


mate size 5 mm by 6 mm, processed at the Reading
Division of Western Electric Research Department.

retrieval time gap left vacant by magnetic core


and semiconductor devices on one side and the
electromechanical magnetic tape and disk on
the other. Improvements in bubble materials,
circuit processing, and device design have ad-
vanced bubble technology to where it is used in
FIG. 5. The effectiveness of a hydrogen ion implant applications requiring 10 6 -1 0 8 bits and re-
in suppressing hard bubbles is evident in this photo- trieval times less than 0.005 s. Bubble chips
graph. Only normal bubbles are found in the implanted with 10 million storage cells are currently under
area to the right, while a diverse collection of hard development . Magnetic bubbles utilize remanent
bubbles and strips are found to the left. magnetization to provide data storage. How-

IN-PLANE ROTATI NG FIELD

U BIAS FI ELD

FIG. 6. The Permalloy T-Bar pattern (Permalloy is a highly permeable NiFe alloy) becomes magne-
tized in the presence of the drive field Hxy. Magnetic bubbles couple to precise positions on the track
and move when H xy is rotated.
FERRI MAGNETISM 436

ever, unlike magnetic disk or tape storage tric, and terroelastic, but here only nonmag-
the operation is entirely nonmechanical. netic crystals are considered.
For more information the reader is referred When a given phase (phase I) has a structure
to a number of articles and books on the sub- that can be regarded as a slight distortion of
ject of magnetic bubbles that have recently another real or imaginary structure (structure
appeared. II), it is called a ferroic phase. (Distortion, un-
like modification, implies lowering in sym-
W. H. VON AULOCK metry.) If structure II cannot be regarded
A. H. BOBECK as a slight distortion of any other structure, it
is called the prototype of phase I. If there really
References exists a phase whose structure is an equisym-
General metric slight modification of the prototype, it
is called the prototypic phase. The interatomic
Standley, K. J., "Oxide Magnetic Materials," Oxford, distances in the prototypic phase, like those in
Clarendon Press, 1962. every phase, vary continuously with tempera-
Smit, J., and Wijn, H. P. 1., "Ferrites," 'New York, ture (and pressure, etc.), while those in the pro-
John Wiley & Sons, Inc., 1959. totype are fixed . The prototype is the zero
Lax, B., and Button, K. J., "Microwave Ferrites and point for all the order parameters, all the lattice
Ferrimagnetics," New York, McGraw-Hill Book Co., vibration coordinates, and all the components
Inc., 1962. of electric polarization vector and mechanical
von Aulock, W. H., "Handbook of Microwave Ferrite strain tensor. If we agree that the prototype
Materials," New York, Academic Press, 1965. is the same as the prototypic phase at a certain
von Aulock, W. H., and Fay, C. E., "Linear Ferrite De- temperature Tp, then the prototypic phase at
vices for Microwave Applications," New York, Aca- any other temperature is not the zero point.
demic Press, 1968. For example, let the point group of the proto-
Snelling, E. C., "Soft Ferrites," London, Iliffe Books, type be 4mm; the z component of spontaneous
Ltd., 1969. polarization vector and the (x, x) and (z, z)
Helszajn, J., "Principles of Microwave Ferrite Engi- components of spontaneous strain tensor of
neering," London, Wiley lnterscience, 1969. the prototypic phase are zero at Tp and nonzero
at any other temperature. In some cases, no
Magnetic Bubbles prototypic phase exists really (owing to melting,
1. Bobeck, A. H., Bell Syst. Tech. J. 46, 1901 dehydration, or the like); still it is possible to
(October 1967). imagine the prototype.
2. O'Dell, T. H., "Magnetic Bubbles," London, The term "a ferroic phase" is often abbre-
MacMillan, 1974. viated to "a ferroic," and "a prototypic phase"
3. Bobeck, A. H., and Della Torre, E., "Magnetic to "a prototypic," omitting "phase."
Bubbles," Amsterdam, The Netherlands, North- If one phase is a slight distortion of a second
Holland Publishing, 1975. phase, and this phase II is a slight distortion of
4. Chang, H., "Magnetic Bubble Technology," New a third phase, and this phase III cannot be re-
York, IEEE Press, 1975. garded as a slight distortion of any structure,
5. Bobeck, A. H., and Scovil, H. E. D., Scientific then phase III, not phase II, is recognized as the
American, p. 78, June 1971. prototypic of phase I. Both phase I and phase
6. Bobeck, A. H., Bonyhard, P. I., and Geusic, J. E., II are ferroics derived from phase III. Phase III
Proc. IEEE 63,1176 (August 1975). is the common prototypic of phases I and II. No
7. Nielsen,J. W., Licht, S. J., Brandle, C. D., IEEE phase can be both ferroic and prototypic. The
Trans. Magnetics MAG-10, 474 (1974). relationship between ferroic and prototpyic
8. Tabor, W. J., Bobeck, A. H., Vella-Coleiro, G. P., phases is not simply the relationship between
and Rosencwaig, A., "A new type of cylindrical lower- and higher-symmetry phases.
magnetic domain (hard bubble)," AlP Conf. The structure of a ferroic can be conceived
Proc. , 10,442-457 (1972). to result from modulating the prototype in a
9. Slonczewski, Malozemoff, J. C., and Voegeli, 0., wave. Let Q denote the quotient (~ 1) of the
"Statics and Dynamics of Bubble Containing Bloch wavelength by the unit-cell length of the proto-
Lines," AlP Conf. Proc. 10,458-477 (1972). type. According to whether Q is integral or de-
10. Eschenfelder, A. H., "Magnetic Bubble Tech- viates slightly from an integer, the ferroic is
nology," New York, Springer-Verlag, 1980. said to be commensurate or incommensurate.
(The deviation from an integer is not due to
Cross-references: FERROMAGNETISM, MAGNE- thermal expansion.) Only if it is commensurate
TISM, RESONANCE, TRANSFORMER. is the ferroic three-dimensionally periodic (as
the prototype is). We will consider below only
commensurate ferroics.
FERROICITY, FERROELECTRICITY, As the cell multiplicity of a ferroic we refer
AND FERROELASTICITY to the quotient (integral) of the number of
molecules in the primitive unit cell of the fer-
Originally the concept ferroic was a unification roic by that of the prototype. If the customary
and generalization of ferromagnetic, ferroelec- unit cell of the ferroic or prototype is not
437 FERROICITY, FERROELECTRICITY, AND FERROELASTICITY

primitive, it needs to be rendered into a primi- Different situations are regarded as the same
tive unit cell. oriate (short for orientational state) if they can
A ferroic has a definite number of situations. be changed to each other by pure translations.
Situation is a kind of state. The structure of The number of situations belonging to one oriate
the ferroic can be specified by assigning to equals the number of situations belonging to
every atom a displacement vector from its posi- any other oriate. This number equals the cell
tion in the prototype. When the ferroic is in multiplicity. The number of all situations of
a situation, the displacement vectors of all the the ferroic, therefore, equals the product of the
atoms are uniquely determinate. The displace- number of oriates and the cell multiplicity.
ment vectors in one situation are not all equal Furthermore, the number of oriates equals the
to the displacement vectors in another situ- quotient of the order of the point group of the
ation. The latter displacement vectors (or prototype by that of the ferroic.
the latter situation) can be obtained by per- Figure 1 shows a model ferroic. Points A,
forming a certain operation of the space group B, ... , F have z = Co /2. Their images across the
of the prototype upon the former displacement xy plane as a mirror are A', B', ... , F'. Points
vectors (or the former situation). The men- A, B, C, D, A', B', C ' , D' are the vertices of a
tioned operation is not uniquely determinate; primitive unit cell of the prototype. (Thus Co is
there are many operations each changing the a lattice constant of the prototype.) White
former situation to the latter. By performing atoms lie at the vertices. The unit cell, more-
every operation of the space group of the pro- over, constains two black atoms which lie at z =
totype upon one situation, all situations can be ±d/2 (0 < d < Co /2). The ferroic results from
obtained. The number of situations is finite. the displacement of every black atom by the
Regardless of the situation the ferroic is in, drawn vector which is perpendicular to the z
it has the same free energy. axis. (For simplicity it is assumed that no white

y y

(0) (b)
y y

(c) (d)
FIG. 1. A model ferroic; (a) situation S 1, oriate ~ 1; (b) situation S 2, oriate ~ 1; (c) situation S 3, oriate ~ 2;
(d) situation /)4, oriate O 2 ,
FERROCITY, FERROELECTRICITY AND FERROELASTICITY 438

atom is displaced.) Obviously, B, D, E, F, B', Since the spontaneous polarization vector


D', E', F' are the vertices of a primitive unit and spontaneous strain sensor are invariant
cell of the ferroic. Hence the cell multiplicity is under every translation, any two situations be-
2. The ferroic has, in all, four situations which longing to the same oriate are ferroelectrically
are represented by $ 1 , S 2 , $ 3, $ 4. The transla- and ferroelastically unrelated.
tion along the x axis by length ao (= AB), A specimen of a full ferroelectric (or ferro-
obviously, changesSI toS2, andS 3 tO$4' Thus elastic) can easily be changed from a multi-
S1 and S 2 are regarded as the same oriate, e 1 ; domain state to a unidomain state, and from a
S3 and 4 as the same oriate,e 2 . Obviously, no unidomain state to another unidomain state,
pure translation changes $ 1 to S 3 (the 7r /2 rota- by the application of an electric field (or a
tion about the z axis does so). Thus $ 1 and S 3 mechanical stress). On the other hand, a speci-
are regarded as different oriates. In other men of a partial ferroelectric (or ferroelastic)
words, (91 differs from (92. The number of situa- can hardly be affected in this fashion. Even for
tions 4, equals the product of the number of a full ferroelectric (ferroelastic), if its cell
oriates, 2, and the cell multiplicity, 2. multiplicity is greater than 1, a specimen of it
A specimen of a ferroic often consists of can hardly be changed from a multiterrain
several domains (or terrains); each domain (or state to a uniterrain state, and from a uniter-
terrain), as a whole, is in one oriate (or situa- rain state to another uniterrain state, by the
tion), and the oriates (or situations) of any two application of an electric field (or a mechanical
adjacent domains (or terrains) are different. stress).
Domain corresponds to oriate, and terrain to If a ferroic is both ferroelectric and ferro-
situation. But the number of domains (or ter- elastic, the ferroelectricity and ferroelasticity
rains) in the specimen has no connection with are not always coupled. Only when the con-
the number of oriates (or situations) of the dition is satisfied that ferroelectrically related
ferroic; the former can be any integer between oriates are all ferro elastically related and vice
1 and 00. Each domain comprises a number of versa are the ferroelectricity and ferro elastic-
terrains; this number may be any integer be- ity completely coupled; i.e., a turning of the
tween 1 and 00, having no connection with the spontaneous polarization vector by a field is
number of situations per oriate, i.e., the cell necessarily accompanied by a turning of the
multiplicity. spontaneous strain tensor and, conversely, a
When at least two of the oriates of a ferroic turning of spontaneous strain tensor by a stress
are unequal with respect to (the direction of) is necessarily accompanied by a turning of
the spontaneous polarization vector, the fer- spontaneous polarization vector. Any ferroic
roic is said to be ferroelectric and those two that is both fully ferroelectric and fully ferro-
oriates are said to be ferroelectrically related. elastic satisfies the above condition.
(Two oriates that are equal with respect to the Two oriates that can be changed to each
spontaneous polarization vector are said to other by a field (or stress) are not always ferro-
be ferroelectrically unrelated.) According to electrically (or ferro elastically) related. If two
whether all the oriates, or not all but some, are oriates are unequal with respect to electric sus-
ferroelectrically related, the ferroic is said to be ceptibility tensor or elastic compliance tensor
fully or partially (or a full or partial) ferro- or piezoelectric modulus tensor, it is never
electric. Analogously, when at least two of the impossible to change the oriates to each other
oriates of a ferroic are unequal with respect to by a field or a stress or a combination of
the spontaneous strain tensor, the ferroic is both, respectively.
said to be ferroelastic and those two oriates are
said to be ferro elastically related. According to KElTSIRO AIZU
whether all the oriates, or not all but some, are
ferro elasticity related, the ferroic is said to be References
fully or partially (or a full or partial) ferro-
elastic. The oriates of a partial ferroelectric Aizu, K., Phys. Rev. B 2, 754 (1970).
(or ferroelastic) can be divided into subsets Aizu, K., l. Phys. Soc. lpn. 44,334 and 683 (1978).
such that any two oriates belonging to the same
subset are ferroelectrically (or ferroelastically) Cross-references: CRYSTALLOGRAPHY, DIELEC-
unrelated and any two oriates belonging to dif- TRIC THEORY, MECHANICAL PROPERTIES OF
ferent subsets are related. The number of SOLIDS.
oriates in one subset equals that in another sub-
set. Hence this number is a divisor of N, the
number of all oriates of the ferroic. It indicates FERROMAGNETISM
the degree of partiality. In particular, if it is
1 the ferroic is fully ferroelectric (or ferro- Ferromagnetism is an example of cooperative
elastic); if N, not ferroelectric (or ferro elastic). phenomena in solids. It is characterized by a
Any ferroic with a prime N, especially with N = spontaneous macroscopic magnetization M
3, is neither partially ferroelectric nor partially (magnetic moment per unit volume) in the ab-
ferroelastic. sence of an applied magnetic field at tempera-
439 FERROMAGNETISM

tures below a critical value known as the Curie When Jij is positive, parallel ordering or align-
temperature, Tc. This property is exhibited by ment of the atomic spins in a common direc-
the transition metals, Fe, Co, and Ni; the rare tion is favored, and so there is a large spontane-
earth metals, Gd, Tb, Dy, Ho, Er, and Tm; and ous magnetization even in the absence of an
by a variety of alloys, compounds, and solid applied field. For negative Iij, antiparallel spins
solutions involving the transition, rare earth, result in lower energy (see ANTIFERROMAG-
and actinide elements. Ferromagnetic Curie NET ISM , FERRIMAGNETlSM).
temperatures range from a fraction of a degree Maximum ordering obtains at the absolute
to hundreds of degrees Kelvin. zero of temperature where the randomizing ef-
Cooperative magnetic behavior results from fect of thermal agitation disappears. The initial
the exchange interaction between electrons, decrease of the magnetization as the tempera-
which is qualitatively described as follows. ture is increased from zero is well represented
Electrostatic coulomb repulsion between like by a superposition of wavelike disturbances
electric charges acts to keep two electrons apart, known as spin waves. At the Curie temperature
a separation which is also favored by the Pauli the magnetic ordering is destroyed by thermal
exclusion principle if the electrons have parallel agitation and the spontaneous magnetization is
spins. Thus if two electrons are farther apart zero (Fig. I). Above the Curie temperature a
when their spins are parallel than they would be ferromagnetic material behaves paramagneti-
if their spins were antiparallel, the parallel state cally and has a net magnetization only in the
will have lower mutual electrostatic energy. presence of an applied field (see MAGNETISM
However, the kinetic energies increase if the and PARAMAGNETISM).
electrons are separated, and consideration of At temperatures below Tc a sample of ferro-
this energy may lead to lower total energy for magnetic material is usually divided into small
antiparallel spins. In other words, the exchange regions called domains, which vary in size and
interaction is electrostatic in nature, but is shape with a typical dimension from 0.1 to
modified by details of kinetic energy and the 1000 fJ.m. Within each domain the magnetiza-
exclusion principle, and is highly dependent on tion is uniform and has the maximum or satura-
the spatial distribution of the electrons. The tion value,Ms , characteristic of the temperature
exchange energy between two electrons, though of the material (Fig. 1), but the direction of
electrostatic, is usually expressed in the mathe- alignment of the individual moments in each
matically equivalent form, - 2Js 1 ·S2, where J is domain changes from one domain to the next.
the quantum mechanical exchange integral, re- The magnetization of the sample, which is the
lated to the overlap of the charge distributions resultant of the magnetization of all the do-
I, 2; and S1 and ~ are the spin angular momen- mains, may be much less than the saturation
tum vectors of the two electrons. The value of J value (of a single domain), or it may even be
is usually expressed in energy units, such as ergs, zero in a completely demagnetized state.
when the angular momentum vectors are as- The magnetization of a sample increases when
signed non dimensional values, i.e., spin quan- a magnetic field, H, is applied. The value of H
tum numbers. When J is positive, parallel spins required to saturate the magnetization may be
represent a lower energy state than antiparallel as small as 0.01 oersted or as large as several
spins. Since each electron has a magnetic mo- thousand oersteds, depending on the material.
ment proportional to its spin angular momen-
tum, a state of parallel spins corresponds to a
state of parallel magnetic moments.
The principal effect of the exchange interac-
tion is embodied in the empirical Hund's rules,
which describe the combination of electron
spins in an atom to form the atomic spin. The
principal interaction between magnetic atoms,
i.e., between atoms with magnetic moments as
a consequence of spin angular momenta, is
thought to be of exchange character also, and
there has been considerable success in describ-
ing magnetic properties by assuming this inter-
action to have the form,

J( = - 'LIijSi . S;.
i,j
Here Si and Sj are the spin angular momentum
vectors of atoms i and j; the exchange integral,
Jij, may vary for different pairs and is usually
regarded as a phenomenological parameter to FIG. 1. Typical variation of spontaneous domain
be evaluated by means of experimental data. magnetization,Ms, as a function of temperature, T.
FERROMAGNETISM 440

M
Ms

o~------~~~--------~

d
I
-I
-H o +H
FIG. 2. Typical variation of sample magnetization,
M, as a function of applied field strength, H; the mag-
EASY
AXES
+ MAGNETIC,
FIELD
netization curve depends on the magnetic history of FIG. 3. Schematic representation of the change of
the sample. domain structure and magnetization with applied
field.

The magnetization process for a previously directions; consequently its physical properties
demagnetized material is represented by the depend on direction . It is easier to magnetize a
O-to-a portion of the magnetization curve in magnetic material in some directions, called
Fig. 2. When the applied field is subsequently easy axes, than in other directions. When the
removed, the magnetization exhibits the phe- effect of anisotropy is superimposed on the ef-
nomenon of hysteresis, or lagging behind, trac- fects of exchange and dipolar coupling, the
ing the path a-b and retaining a finite value, ordered atomic moments break up into seg-
called the remanence, when the field is zero. ments or domains, so that in each domain the
The coercive force is the value of the field magnetization is uniform and lies along or near
which must be applied opposite to the direction one of the easy axes. There is a large change in
of the magnetic moment to trace the path b-c the direction of magnetization from one do-
and reduce the magnetization to zero. Further main to the next, with the reorientation oc-
increase of the field in this direction, followed curring gradually (on an atomic scale) in a nar-
by its removal, traces the path c-d-e. Repetition row transition region known as the domain
of the cycle then traces the curve e-f-a-b-c-d-e-, wall.
etc. Permanent magnets are characterized by a The initial part of the magnetization process,
large coercive force; i.e., a large reverse field O-a in Fig. 2, is represented schematically in
(typically thousands of oersteds) is necessary to Fig. 3. In the demagnetized state the domains
destroy the magnetization However, in soft are arranged to minimize the external field due
magnetic materials the coercive force may be to the magnetization and to give zero net mo-
less than one oersted. ment for the sample (Fig. 3a). In low fields the
Domains exist in a ferromagnetic material be- domain boundaries move so that the domains
cause their formation results in a lower total with magnetization direction near the direction
energy for the sample than it would have if the of the applied field grow in size while other
entire sample were a single domain. Total align- domains are depleted (Fig. 3b). As the field
ment of the magnetic moments is favored by strength is increased, the domain boundaries
the exchange forces, but these are usually short are swept out of the sample (Fig. 3c), and
range forces acting between an atom and its finally the (single) domain magnetization ro-
neighbors. The dipole-dipole forces, although tates into the direction of the applied field until
weaker, are long range and, alone, would orient saturation is reached (Fig. 3d). When the field
the atomic moments like bar magnets, north is removed, domains re-form in varied orienta-
pole to south pole, in closed chains to mini- tion along the easy axes, but there is a prefer-
mize the external field of the magnet (see ence for domains whose magnetization vectors
DIPOLE MOMENT and MAGNETISM). One addi- lie in the easy directions nearest the direction in
tional factor is required for the formation of which the field was applied, and consequently
domains. This factor is anisotropy, a result of the sample as a whole has a magnetic moment
the crystal structure of most solid materials. (remanent magnetization).
The structure of a crystal is not the same in all In some thin magnetic films the domains are
441 FEYNMAN DIAGRAMS

cylindrical. Both the cylinder axis and the mag- Morrish, A. H., "The Physical Principles of Magnetism,"
netization are perpendicular to the plane of Melbourne, FL, Krieger Publishing Co., 1980.
the film, whereas in the domains shown in Vonsovskii, S. V., "Magnetism," 2 Vols., New York,
Fig. 3 the magnetization is in the plane of the John Wiley & Sons, Inc., 1974.
diagram. Cylindrical domains-called bubbles
because of their appearance and mobility at the Cross-references: DIPOLE MOMENTS, FERRIMAG-
surface - are caused when perpendicular anisot- NETlSM, MAGNETISM, MAGNETOSTRICTION,
ropy built into the film during preparation is PARAMAGNETISM.
stronger than the demagnetizing effect of the
planar structure. Bubble domains are useful for
data storage and control applications.
Amorphous materials have no crystalline FEYNMAN DIAGRAMS
order. They are often referred to as glasslike
and have properties similar to those of liquids. I. Introduction Feynman diagrams provide
Despite the total atomic disorder, the magnetic one of the most powerful methods known for
properties of amorphous alloys are sharply de- finding the physical properties of systems of
fined: Domain patterns are observed, including interacting particles, i.e., particles interacting
bubble domains; sharp Curie points are de- with an external field and/or with each other.
tected; and large permeabilities exist, almost Since all physical systems-solids, liquids, gases,
as large as that of Permalloy. molecules, plasmas, atoms, nuclei, elementary
Except when rare earth atoms are present, particles-are composed of interacting particles,
the exchange interactions in amorphous alloys the Feynman diagram method is now used ex-
are positive and lead to ferromagnetism. Using tensively in all branches of modern physics. In
a variety of elements over a wide range of this article, the examples are from solid-state
composition, one can obtain great flexibility of physics; the reader is referred to the References
magnetic properties. When rare earth atoms are for examples from other branches. A very ele-
added, negative exchange interactions are in- mentary account of the subject is found in
duced, allowing further control of the magneti- Ref. 1.
zation and anisotropy energy to help stabilize The idea behind Feynman diagrams is this: In
domain formation and other factors affecting order to find the important physical properties
magnetic properties. of a system of interacting particles, it is not
Ferromagnetism is also characterized by many necessary to know the detailed behavior of the
particles, but only their average behavior. The
secondary effects. Among the more important quantities which describe this average behavior
are the specific heat of the ferromagnetic state, are called "propagators" or "Green's functions"
which peaks at the Curie temperature, confirm-
ing that a phase transition has occurred; the and physical properties may be calculated di-
magnetoresistance, which in metals can change rectly from them. For example, the single-par-
the electrical resistance by as much as five per- ticle propagator is defined thus: we put an extra
cent, depending on the direction of magnetiza- particle into the system at point rl at time tl
tion; and the magnetostriction, which causes and let it interact with the external field and
the sample dimensions to change during mag- with the other particles for a while (i.e., we let
netization. These secondary effects give addi- it "propagate" through the system). Then the
~ingle-particle propagator is the probability (or
tional insight into the nature of the ferromag-
netism and are useful in technical applications. m quantum systems, the probability amplitude)
that at later time t2, the particle will be ob-
serv~d at another point, r2. From the single-
R. 1. JOENK
T. R. MCGUIRE particle propagator, we can calculate directly
the energy and lifetime of certain excited states
of the system, the ground state energy etc.
References Similarly, t~ere are the two-particle propagator,
the no-particle propagator or "vacuum ampli-
Chen, C. W., "Magnetism and Metallurgy of Soft Mag- tude"), etc., which yield other physical proper-
netic Materials" ("Selected Topics in Solid State ties (see Ref. 1).
Physics," Vol. XV), Amsterdam, North-Holland Feynman diagrams give us a method of calcu-
Publishing Co., 1977. lating propagators by means of pictures. We
Chikazumi, S., and Charap, S. H., "Physics of Mag- shall first show how this is done in a simple
netism," Melbourne, FL, Krieger Publishing Co., ~lassical example from "liquid-state" physics,
1978. I.e., the drunken man propagator, then give a
Coey, J. M. D., "Amorphous Magnetic Order," Journal couple of illustrations from solid-state physics.
of Applied Physics 49,1648-1652 (1978). II. Classical Example: The Drunken Man
Craik, D. J., "Structure and Properties of Magnetic Propagator A man who has had too much to
Materials," London, Pion Ltd., 1971. drink (Fig. 1), leaves a party at point I and on
Malozemoff, A. P., and Slonczewski, J. C., "Magnetic the way to his home at point 2, he can stop off
Domain Walls in Bubble Materials," New York, Aca- at one or more bars-Alice's Bar (A), Bardot
demic Press, 1979. Bar (B), Club 6 Bar (C), ... etc. We ask for the
FEYNMAN DIAGRAMS 442

FIG. 1. Propagation of drunken man.

probability, P( I, 2), that he gets home. This The series may be evaluated approximately
probability, which is just the propagator here by selecting the most important terms in it and
(with time omitted for simplicity), is the sum summing them. This is called "partial summa-
of the probabilities for all the different ways he tion." For example, suppose the man's favorite
can propagate from 1 to 2 interacting with the bar is Alice's bar, so that P(A) is large and all
various bars. Assuming, for simplicity, that the other P(x),s are small. Then Fig. 2(a) becomes
various processes involved are independent, thi8 approximately Fig. 2(b). Assuming for simplic-
is just Po(1, 2) (= probability that he will go ity that all Po(r, s) = e (a constant) we have, us-
"freely" from I to 2, i.e., without stopping at ing Table I:
any bar), plus Po(1, A) X P(A) X Po(A, 2)
(= probability he will go freely from I to bar A, P(1, 2) ~ + Po(1, A)P(A)Po(A, 2) + ...
Po (1, 2)
times the probability that he stops off at A for = e + e2 P(A) + e 3 p2 (A) + .. .
a drink, times the probability that he then pro-
ceeds freely to 2), plus Po (1 , B) X P(B) X = c [1 + cP(A) + c 2 P2(A) + ... 1
Po(B, 2) (= probability for route I-bar B-2)
plus etc.··· plus Po (1 , A) X P(A) X Po(A, A) X = ef[ 1 - cP(A)] (2)
P(A) X Po(A, 2) (= probability for route I-A-
A-2) plus etc..... This gives us an infinite se- where we have used the fact that the expression
ries for the propagator: in brackets is a geometric series. This same tech-
nique is used in the quantum case.
P(1, 2) = Po (1, 2) + Po (1 , A)P(A)Po(A, 2) III. Single Electron Propagating in a Crystal
The example here is just like the previous one,
+ Po (1 , B)P(B)P(B, 2) + ... except that instead of a propagating drunken
man interacting with various bars, we have a
+P o(1, A)P(A)Po(A, A)P(A)Po(A, 2) + .... (1) propagating electron interacting with various
ions in a crystal. A crystal consists of a set of
Now this series is a complicated thing to look positively charged ions arranged so they form a
at. To make it easier to read, we draw a "pic- regular lattice, as in Fig. 3. An electron interacts
ture dictionary" to associate diagrams with the with these ions by means of the Coulomb force.
various probabilities as shown in Table I . Using The single-particle propagator here is the sum
this dictionary, series (1) can be drawn as in of the quantum-mechanical probability ampli-
Fig. 2(a). Since, by Table I, each diagram ele- tudes for all the possible ways the electron can
ment stands for a factor, Fig. 2(a) is completely propagate from point rl in the crystal, at time
equivalent to series (I). However, Fig. 2(a) has t 1 , to point r2 at time t 2 , interacting with the
the great advantage that it also reveals the phys- various ions on the way. These are: (1) freely,
ical meaning of the series, giving us a "map" without interaction; (2) freely from rl, tl (= "1"
which helps us to keep track of all the se- for short) to the ion at rA at time tA, inter-
quences of interactions the drunken man can action with this ion, then free propagation
have in going from 1 to 2. from the ion to point 2; (3) from 1 to ion
443 FEYNMAN DIAGRAMS

2 222

= +~++ +.... + + ... +

(a)

+ + ...

(b)

FIG. 2. (a) Feynman diagram series for drunken man propagator, or for electron propagating in crystal. (b)
Approximate series ("partial sum") for drunken man propagator.

TABLE 1. FEYNMAN DIAGRAM DICTIONARY


FOR DRUNKEN MAN PROPAGATOR

Word Picture Meaning


o ® ©r. 2

*: ® ®
Probability of propagation from I
P(1,2) to 2.

Po(r, s)
t. Probability of free propagation
from r to s. FIG. 3. Electron propagating among ions A, B, C ...
in crystal.

~
Probability of stopping off at bar X
P(X) for a drink. the following changes are made: change "proba-
bility" to "probability amplitude," and change
the meaning of the circle with an X to "proba-
bility amplitude for an interaction with the ion
at X." When this is done, the series for the
B, interaction at B, then from B to 2, etc. propagator can be translated immediately into
Or we could have the routes I-A-A-2, I-A-B-2, exactly the same diagrams as in the drunken
etc. We can now use the dictionary in Table man case! That is, Fig. 2(a) is also the propa-
I to translate this into diagrams, provided gator for an electron in a crystal, provided that

a
-
b c d
FIG.4. Electron propagating in electron gas.
FEYNMAN DIAGRAMS 444

r~r'
FIG. 7. Feynman diagram for interaction between
two electrons.

the charge distribution of the gas, and puts


these lifted-out electrons down again further
away. This picture is shown in Fig. 5.
Using this picture, we can decompose the
FIG. 5. "Hole" picture of empty space around elec- propagator into the sum of the probability am-
plitudes for all possible ways that the extra
tron in electron gas. electron can propagate through the system in-
teracting with the other electrons. The simplest
way is free propagation without interaction.
we just use a quantum dictionary to translate Another way is shown in the "movie" in Fig. 6.
the lines and circles into functions. The series Figure 6(a) shows the extra electron entering
can be partially summed, and from the resulting the electron gas. In 6(b) we see the extra elec-
propagator we obtain immediately the energy tron interacting with a nearby electron in the
of the electron moving in the field of the ions. gas, creating a hole nearby and a lifted-out elec-
IV. Electron Propagating in an Electron Gas tron further away. The extra electron, the hole,
We now look at the problem of many interact- and the lifted-out electron then propagate freely
ing particles, taking as an example a large num- through the gas as in 6(c). In 6(d), the extra
ber of electrons interacting with each other by electron interacts with the lifted-out electron,
means of the Coulomb force. It is assumed that causing it to fan back into the hole, thus annihi-
there is a uniform, fixed positive charge "back- lating both the lifted-out electron and the hole.
ground" present which keeps the whole system Figure 6(e) shows the extra electron propagat-
electrically neutral. This system is called the ing out of the system. (It should be noted that
"electron gas" and is used as a simple model for unlike the drunken man case, the processes
electrons in a metal. The propagation of an shown in Fig. 6 are not physical but rather
added electron through this system is shown in "virtual" or "quasi-physical" processes, since
Fig. 4. Figure 4(a) shows the uniform charge they do not conserve energy and they may vio-
distribution in the undisturbed system, with an late the Pauli exclusion principle.)
extra electron entering from the left. In 4(b) Let us represent the Coulomb interaction be-
the extra electron has entered, repels other elec- tween an electron at point r and one at r' by a
trons away from it (Coulomb repulsion be- wiggly line as in Fig. 7. Then the sequence of
tween like charges), so we get an "empty space" processes in Fig. 6 may be represented by the
near the extra electron and repelled electrons Feynman diagram shown in Fig. 8(a) or 8(b).
further away. The extra electron surrounded by Note that hole lines are drawn as electron lines
the empty space is called the "quasi electron" with a direction opposite to the direction of
and it propagates through the system as shown increasing time. By analyzing the other possible
in Fig. 4(c), (d). It is convenient to view the processes which can take place, we find the
empty space around the extra electron as com- series for the electron propagator shown in Fig.
posed of "holes" in the electron gas. That is, 9. Again, these diagrams may be evaluated by
the Coulomb repulsion "lifts out" electrons writing the appropriate factor for each free
from the electron gas in the immediate vicinity electron and hole propagator and each interac-
of the extra electron, thus creating "holes" in tion, and carrying out a partial sum. The result-

,~! n ~, ! : ~~. ~ ~:!


~ • I'.' • \ ;.

.
~ , ~, ,,> ...
,
,'"
. ~ ,
I •

a b c d e
FIG. 6. One possible way in which an electron can propagate through the electron gas.
445 FIBER OPTICS

or
t
time --+
(0 ) ( b)
FIG. 8. Feynman diagram for the sequence in Fig. 6.

~ = j ~ + + ~ ~ + + ~ ~ + + ••••

FIG. 9. Feynman diagram series for electron propagator in the electron gas.

ant expression for the propagator yields directly FIBER OPTICS


the energy and lifetime of the quasi electron.
For a more extensive and detailed account of As the name suggests, fiber optics deals with
Feynman diagrams, see Ref. 1. . the transmission of light through thin strands of
transparent dielectric materials such as glass or
R. D. MA TTUCK plastic. The general subject of fiber optics can
be subdivided into three broad classes: fiber
bundles for illumination and image transmis-
References sion, fiber optic sensors, and optical fiber tele-
communkations. However, the physical prin-
1. Mattuck, R. D., "A Guide to Feynman Diagrams in ciples involved in light transmission through
the Many-Body Problem," London, McGraw-Hill, optical fibers are the same in all three areas of
1967. (Most elementary account, examples from applications.
solid-state and nuclear physics.) Light Transmission Through Optical Fibers
2. Fetter, A. L., and Walecka, J. D., "Quantum The- The simplest kind of optical fiber consists of an
ory of Many-Particle Systems," New York, Mc- inner circular cylinder of a homogeneous dielec-
Graw-Hill, 1971. (Advanced, examples from solid tric material, the core, that is surrounded by
state and nuclear physics.) another homogeneous dielectric material of
3. Mandl, F., "Introduction to Quantum Field The- lower refractive index, the cladding. Because
ory," New York, Interscience Publishers, 1959. the refractive index profile of core and cladding
(Elementary account, examples from elementary resembles a step, this structure is called step-
particle physics.) index fiber. A light ray traveling along a straight
4. Rickayzen, G., "Green's Functions and Condensed line in the core suffers total internal reflection
Matter," Academic Press, London, New York, if it impinges on the core-cladding boundary at
1980. (Intermediate, examples mostly from solid a sufficiently small angle. Rays traveling at
state physics.) angles larger than a critical angle are only par-
5. Bjorken, 1. D., and Drell, S. D., "Relativistic Quan- tially reflected and lose all their energy after
tum Mechanics" and "Relativistic Quantum Fields," only a few repeated reflections. Thus, the fiber
New York, London, McGraw-Hili, 1964. is able to trap light within a cone defined by
the critical angle. (See Fig. I).
More complicated optical fibers consist of a
Cross-references: ELECTRON, ELEMENTARY PAR- core whose refractive index is inhomogeneous,
TICLES, GRAND UNIFIED THEORIES, MANY- changing continuously from a relatively high
BODY PROBLEM, PHOTON, QUANTUM CHROMO- value on axis to a lower value at the core-
DYNAMICS, QUANTUM ELECTRODYNAMICS. cladding interface. Rays in such a graded-index
FIBER OPTICS 446

INDEX OF
REFRACTION

JACKET
/
n( r )
CORE

CLADDING

JACKET

STEP- INDEX FIBER

Fig. l.

fiber travel on curved paths and remain trapped in arrival time of the slowest and fastest ray or
inside the core if their trajectory does not en- mode. In graded index fibers, this intermodal
counter the core-cladding interface (Fig. 2). dispersion can be compensated partially. The
Light propagation in optical fibers can also travel time of light depends not only on its
be described in terms of wave optics. Instead geometrical distance but also on the index of
of rays traveling at different angles we now refraction of the medium. Light moves more
speak of modes. A mode is defined as an elec- slowly in regions with high refractive index and
tromagnetic wave that travels through a uni- faster in a low-index medium. Since the light
form fiber without changing the shape of its trajectory in a graded-index fiber samples high-
transverse field distribution. Depending on index regions near the fiber axis and lower-
their core diameters and refractive index dis- index regions farther from the axis, averaging
tributions, fibers can support varying numbers takes place. It is possible to design graded-index
of modes. A fiber with a large diameter and/or fibers that very nearly compensate the travel
large index difference can support many modes. time differences along all possible rays or for
On the other hand, it is possible to design a fiber all possible modes. Such optimized fibers have
with sufficiently small core diameter and/or a refractive index distribution that is nearly
small index difference so that it can support (but not exactly) parabolic as a function of the
only one mode in each of two possible polariza- radial coordinate. Even though no perfect
tions. Each mode can be represented by bundles compensation is possible, a very substantial
of rays traveling at angles characteristic of the improvement over intermodal dispersion in
mode in question. step-index fibers can be achieved.
In step-index fibers, the distance and hence Fibers that support only a single mode (single-
the travel time of light depends on the angle at mode or monomode fibers) do not suffer from
which a ray propagates. Or, expressed in the intermodal dispersion. However, a different
language of wave optics, the travel time is dif- source of pulse spreading is operative in all
ferent for each mode. This undesirable property fibers: chromatic dispersion. This means that
of step-index fibers has the consequence that light of slightly different wavelength propa-
pulses, whose energy is spread over many rays gates at slightly different velocities. Light
traveling along different paths or on different sources used to excite optical fibers have finite
modes, arrive at the fiber end spread out over a spectral width. But even if the unmodulated
time interval corresponding to the difference source were strictly monochromatic, the pro-
447 FIBER OPTICS

INDEX OF
REFRACTION

JACKET
/
CLADDING

n (r )

CLADDING

JACKET

GRADED-INDEX FIBER

Fig. 2.

cess of modulation introduces new spectral to a few methods for making glass fibers. A
components. Thus, chromatic dispersion is simple method consists in placing a solid glass
operative in all fibers. Fortunately, one of the rod of relatively higher refractive index into a
most important materials for making optical glass tube of lower refractive index. The end of
fibers-fused silica-has the property that dis- this rod-in-tube assembly is heated to the
persion vanishes to first order at a wavelength softening point and is pulled into a step-index
near 1.3 pm. If operated at the minimum dis- fiber. The diameter of the resulting fiber de-
persion wavelength, extremely short pulses pends on the diameter of the starting tube, the
(psec) can be transmitted over long lengths drawing temperature, and the drawing speed.
(tens of km). More sophisticated methods use an arrangement
In addition to dispersion , the performance of of concentric crucibles filled with different
an optical fiber is characterized by the power molten glasses. The core and cladding glasses
loss that a light signal suffers when traveling in combine at the tips of the crucibles and are
it. The loss parameter depends mostly on the drawn into step-index fibers.
materials out of which the fiber is made. Fibers Low-loss fibers for telecommunications appli-
made of plastics have high losses and are useful cations must be made of very pure materials.
only for light transmission over a few meters. Contamination of the pure glass with metal ions
Fibers made of fused silica have achieved ex- or with water can increase the losses dramati-
tremely low losses indeed. The loss spectrum of cally. In some chemical vapor deposition meth-
fused silica has its lowest value near 1.5 pm. ods , glass is deposited on the outside of a man-
Losses as low as 0.2 dB/km have been measured drel by converting silicon tetrachloride and
in fused silica fibers. This means that one-half dopant vapors into doped silicon dioxide. In
of the original input light power still arrives at the modified chemical vapor deposition (MCVD)
the end of a fiber of 15 km length. For longer method a mixture of silicon tetrachloride and
wavelengths the loss increases sharply because oxygen is passed through the inside of a heated
the light can excite vibrations of the atoms of silica tube. Added to this mixture are dopant
fused silica. At shorter wavelengths the loss gases such as germanium tetrachloride or phos-
increases sharply because the light induces elec- phorus trichloride. Germanium or phosphorus
tronic transitions. In silica fibers, the window increases the refractive index of fused silica,
of high transparency extends roughly from whereas the addition of boron or fluorine to
0.5 pm to 1.8 pm. silica glass decreases its refractive index. The
Fiber Fabrication Even though plastic fibers chlorine compounds are converted to oxides
are cheap to fabricate, their high losses preclude and are deposited on the hot inner surface of
their use in most telecommunications applica- the silica tube, where they fuse to form layers
tions. For this reason we limit the discussion of doped silica glass. The rate of deposition is
FIBER OPTICS 448

controlled by the concentration of the vapors Short sections of such fibers act as tiny graded-
inside the tube, by the temperature, and by index lenses. Inside a parabolic-index fiber an
the speed with which the heating torch is passed image focused onto one of its ends is refocused
along its outside. By careful control of the ratio periodically along its length. The quality of the
of silicon tetrachloride and dopant materials, transmitted image depends on the quality of
desired refractive index distributions can be the index profile and the length of the fiber.
achieved. After the deposition step the tube is Fiber Sensors Because of their great length,
heated until it collapses to form a solid rod-the minute changes of the refractive index along
preform-out of which the fibers are drawn. the fiber can result in sizeable phase changes of
Fibers for optical communications are protected the transmitted light. The index change may be
against scratches (which weaken their strength) induced by many different effects such as mag-
by a plastic jacket which is applied while the netic fields, temperature, pressure, stress etc.
fiber is drawn. Observation of the propagation properties of
Fiber Bundles An important application of light in response to such external disturbances
fiber optics is the use of fiber bundles for illu- make fibers ideal as sensors. Pressure induced
minating or viewing inaccessible places such as refractive index changes due to underwater
body cavities (endoscope). Fiber bundles are sound make fibers useful as hydrophones. Their
formed by combining individual fibers. If no sensitivity to magnetic fields (Faraday effect)
attention is paid to the relative position of the permits their use as current sensors on high
fibers at either end, we speak of an incoherent voltage transmission lines; their sensitivity to
bundle. Fibers in incoherent bundles may have stress enables their use as strain gauges.
diameters between 50 to 200 Jim. If the relative A particularly interesting application of fiber
positions between all fibers are carefully main- optics is their use as rotation sensors. In prin-
tained, a coherent bundle results. Incoherent ciple, rotation can be sensed with any device
bundles are cheaper to manufacture and are that allows a comparison of the time of flight
used for guiding light to illuminate inaccessible of light in the two opposite directions along a
places. To make it coherent, the fiber bundle is closed, rotating loop. Fibers have the advantage
first wound as a closed loop of the desired that the light path along the loop can be made
thickness and circumference. Next, the fibers very long by using fiber coils to magnify the
of this closed-loop bundle are fused tightly to effect. Rotation rate sensors for ships and air-
each other at one point of the circumference craft are being developed.
before they are cut at that point. Finally, the Optical Fiber Telecommunications Because
ends are polished. The fibers at both ends of of their low losses, optical fibers are ideally
the resulting linear bundle are now restrained suited as waveguides for the transmission of
to maintain their relative position but are free messages modulated on light signals. The dis-
to flex in between. A coherent bundle can be cussion of the use of fibers as sensing elements
used to transmit images. The image of an object may lead to the suspicion that optical fiber
is focused by lenses onto one end of the bundle. communications might be plagued by inter-
Each picture element is transmitted along one ference from electromagnetic fields, thermal
of the fibers. The resulting image can be viewed changes, or sensitivity to vibrations. Fortu-
at the far end of the bundle through an eye- nately, quite the opposite is true. The ability
piece with a resolution that depends on the to sense changes in their environment is due
density and diameter of the fibers. A coherent to phase changes imparted by the fiber to a
bundle may contain from 5,000 to 50,000 light signal passing through it. However, for
fibers with diameters between 5 and 25 Jim and communications applications the phase of the
may be several meters long. Flexible, coherent light field is not being sensed, so that one of
fiber bundles of sufficiently small diameter are the advantages of fiber communications is its
used to view the inside of the stomach or other insensitivity to electromagnetic interference.
body cavities. Illumination is provided either by If fiber optics are used improperly, sensitivity
shining light through a subsection of the bundle to vibrations could be a problem (modal noise)
or by a separate bundle that need not be coher- but can be minimized by proper design of the
ent. Tapered coherent fiber bundles can be used system.
to magnify or demagnify images. If desired, Typically, an optical fiber communications
images can be scrambled by scrambling the rela- system consists of the following components:
tive positions of the fibers in the bundle. a modulated light source, the fiber waveguide,
Very short, coherent fiber bundles-fiber optic and a detector. For use with multimode fibers
plates-are used for special applications. A fiber the light source can be either a cheap, reliable
optic plate inserted into the face of a cathode light emitting diode (LED) or a more sophisti-
ray (picture) tube can enhance image brightness cated and expensive injection laser. Since single-
by capturing more light from a fluorescent coat- mode fibers cannot be excited efficiently with
ing on its inside surface. Fiber optics plates are an LED, a laser must be used. This is the reason
also used to connect successive stages in image why, in spite of their higher dispersion, multi-
intensifiers or infrared image converters. mode fiber systems coexist with single mode
An image can be transmitted through a single fiber systems.
fiber with a parabolic refractive index profile. Both LEOs and lasers work basically on the
449 FIELD EMISSION

same principle. Electrons and holes are injected flow in an external circuit. Thus, detectors and
into the junction region of a semiconductor light sources complement each other in their
diode where they combine to emit a light mode of operation.
photon. In an LED the photons escape from In time, optical fibers may well replace most
the junction as soon as they are generated. In of the metallic conductors that are now being
an injection laser each photon is reflected back used in the telephone plant.
and forth between the mirrors of a resonant
cavity and is forced to stimulate the emission D. MARCUSE
of more photons. Consequently, the light of an
LED is completely incoherent while the output References
of a good laser is highly coherent. Lasers and
LEDs are pulse modulated by pulsing their 1. Kapany, N. S., "Fiber Optics," Academic Press,
electrical feed current. New York, 1967.
With the optimum refractive index profile 2. Giallorenzi, T. G., "Fiber Optic Sensor," Optics
and operating at the wavelength of minimum and Laser Technology, 13(2),73-78 (April 1981).
chromatic dispersion (near 1.3 Mm) multimode 3. Miller, S. E., and Chynoweth, A. G. (Eds.), "Opti-
fibers can transmit pulses at rates of 100 Mbits/ cal Fiber Telecommunications," Academic Press,
sec over distances of several kilometers. With New York, 1979.
single-mode fibers, pulse transmission rates of
several Gbits/sec can be achieved. Cross-references: BIOMEDICAL INSTR UMENTA-
Several optical fibers are usually combined TION; DIELECTRIC THEORY; OPTICS, GEOMET-
into fiber cables of various constructions. Each RICAL; REFLECTION; REFRACTION.
fiber in the cable provides one independent
transmission path for light pulses that them-
selves may carry thousands of telephone con- FIELD EMISSION
versations or tens of television programs. Some
cable designs first combine several fibers into a Field emission of electrically charged particles
ribbon and then incorporate several fiber rib- occurs when a sufficient high electric field is
bons into one cable. Other designs support applied to the surface of a conductor. Specifi-
loosely fitting individual fibers in plastic tubes cally, field emission of electrons from cold
several of which are incorporated into a cable. metals into a vacuum is a basic physical effect
In all cases plastic or metallic support members comparable to thermionic, photoelectric, or
are included in the cables for added strength. secondary emission. Field electron emission is
Metallic conductors are sometimes used for the also termed cold emission or autoelectronic
purpose of powering repeaters along the route emission. Field emission of electrons from
of the cable. metals at room temperature requires an electric
Optical fibers can be spliced. In ribbon cables field of an order of magnitude of 3 X 10 7
all fibers in one ribbon are usually spliced simul- volts/cm; this can be obtained by applying a
taneously. In some methods, splicing is accom- few thousand volts to a sharply curved cathode,
plished by sliding the ends of the fibers into which may be either a fine wire, a sharp edge,
plastic sleeves and bonding them into place. or a needle tip. From such cathodes, field
It is also possible to heat the ends of two fibers emission was first observed by R. W. Wood in
until they join to form a fusion splice. 1897, and technical application in high-voltage
Optical fiber cables are used to connect local rectifiers and x-ray tubes was attempted by
exchanges and for long distance transmission J. E. Lilienfeld in the early 1920s. He failed
between cities, and they are being considered as because of the inadequate vacuum techniques
submarine cables. available at that time. The quantum mechanical
In applications where the length of the opti- theory of field emission was given by R. H.
cal fibers does not exceed a few kilometers Fowler and L. W. Nordheim in 1928, which
(for use between local exchanges for example) agreed with the current-voltage relationship
no repeaters are needed. But for applications measured by R. A. Millikan and C. C. Lauritsen,
between cities or for submarine cables, repeaters but experimental work to further verify the
must be used at distances from 5 to 20 (or even predictions advanced only with the possibility
more) kilometers. At each repeater the light of controlling highly perfect emitter tips in the
signal is first converted to an electrical signal field emission microscope by E. W. Mi.iller in
which is used to regenerate the pulses before 1937. Subsequently field emission microscopy
they are launched into the next section of the became an established research technique for
optical communications link. Amplification and surface phenomena connected with adsorption.
pulse regeneration of the light signal itself, even By operating a point emitter at a positive po-
though possible in principle, it not yet contem- tential in the presence of gas, Mi.iller discovered
plated for any practical fiber optic communica- field ionization (1951) and developed the low-
tion system. temperature field ion microscope (1956) which
Light detectors are semiconductor devices surpassed all other microscopic devices with its
which absorb photons and convert them into capability of showing the individual atoms as
electron-hole pairs which cause a current to the building blocks of the crystal lattice of the
FIELD EMISSION 450

metal specimen. In the last three decades, tech- Amp /cm 2


nical application of field emission has also been
successful with the development of powerful
flash x-ray tubes (W. P. Dyke, 1955). The ex-. 108
tremely large current densities of up to 10 8
amperes/cm 2 make a field emission cathode
very attractive for mm wave tubes, cathode ray
tubes, and electron microscopes, but the sensi-
tivity to contamination and cathode sputtering
are detrimental to stability and long lifetime.
Field emission can be explained with the con-
cepts of quantum mechanics. The conduction
electrons of a metal are moving in a potential
trough, from which they can escape ordinarily
only by addition of thermal energy (thermionic
emission), by an energy transfer from photons i
(photoelectric emission), or by collision with
other energetic particles (secondary emission).
If the barrier of the trough is narrowed by the
application of an external electric field to be
comparable with the wavelength of the elec-
trons inside the metal, then a small amplitude 100 50 30
of the electron wave will penetrate outside the FIG. 1. Current density of field emission for a tung-
barrier. Thus there is a finite probability that sten cathode plotted as a function of reciprocal field
the electron will tunnel through the potential strength, for various temperatures.
barrier, even if its kinetic energy is insufficient
to go over it. According to the Fowler-Nordheim
theory of field emission, the current density J a few angstroms above the surface (field ioniza-
(in amperes/cm 2 ) as a function of field strength tion). Ion currents are small (less than 10- 9
F (volts/cm) and of the work function <p (elec- amp) because of the limited supply of the gas
tron-volts) is approximately given by molecules. The fields needed for ionization are
very high, about 2.2 X 10 8 volts/cm for hydro-
gen and 4.5 X 10 8 volts/cm for helium.
J= 1.55 X Field electron emission has been successfully
applied to develop a powerful flash x-ray tube
(W. P. Dyke), a high resolution scanning elec-
A more refined theory takes into account the tron microscope (A. V. Crewe), and a bright
effect of the image force on the electron, which point electron source for high resolution electron
reduces the exponent of the above equation by lithography. Field ion emission from liquid
a factor slightly smaller than unity and which metals is now actively pursued as a continuous
depends upon <pyp. The field required for a high brightness point ion source with a poten-
given current density is thus reduced by some tial application in high resolution ion lithography
10-20%. The temperature dependence of field (R. L. Seliger). The continuous miniaturization
emission is found to be very small below about of electronic devices makes these techniques
1000 K (Fig. 1). Considerable increase in emis- quite attractive because of their small sizes and
sion is observed when both the temperature and high brightness.
the field are high. This effect is called T-F Although its technical application is not yet
emission. widespread, field emission has been a subject of
Field emission of positive ions can occur when intensive investigations, and the field electron
an adsorbed layer is desorbed from the emitter and the field ion microscopes have become
surface by a positive electric field (field desorp- productive tools of basic research in the study
tion). At a field of a few volts per angstrom, of solid surfaces. The specimen of the field
lattice atoms can also be evaporated in the form emission microscope is a needle-shaped field
of multiply charged ions (field evaporation). emitter with a hemispherical tip of a radius of
These ion currents cannot be sustained for a some 10- 5 to 10-4 cAt, arranged in a vacuum
long period of time, since the adsorption layer tube opposite a fluorescent screen. With a
or the emitter surface will soon be consumed. few thousand volts applied, field emitted elec-
Continuous field ion emission is obtained by trons move away from the tip in a radial di-
operating the positive emitter in a gas of low rection, displaying on the screen an enlarged
pressure. Gas molecules attracted to the tip projection image of the distribution of electron
surface by polarization effect can be adsorbed emission at the emitter. The magnification of
on the apex of surface atoms (field adsorption), this microscope is approximately equal to the
or can be ionized when their valence electrons ratio of screen distance to tip radius and can
tunnel into the metal when the molecules are exceed one million. The lateral resolution is
451 FIELD EMISSION

limited to 25 A by a random tangential velocity individual organic molecules, such as phthalo-


component of the electrons due to the Fermi cyanine or similar aromatic compounds, cannot
distribution inside the metal and by diffraction yet be fully explained. The interpretation of
due to the de Broglie wavelength. As even thermal desorption experiments is often diffi-
traces of adsorption layers change the work cult because of the occurrence of surface migra-
function and, thereby, the emission, the pres- tion, while the well-defined field strength ob-
ence of such layers can be readily detected on tained in field desorption experiments cannot
the screen. The imaged tip cap usually repre- be fully utilized because of the complex polar-
sents a single crystal, so that the behavior of ization conditions at the various adsorption
adsorption layers can be studied in its depen- sites.
dence on crystallographic orientation of the The field ion microscope (Fig. 2) was devel-
substrate. As little as 10-3 of a monolayer of oped to overcome the limited resolution of the
oxygen is clearly discernible on many metals. field electron microscope. In contrast to the
Because of the small temperature dependence case of electrons, the undesirable tangential
of field emission, such layers, their surface component of the motion of the imaging ions
migration, adsorption and desorption rates, and can be reduced by lowering the emitter temper-
the activation energies of these processes can be ature. The short de Broglie wavelength of ions
measured in a wide temperature range. This is is also an advantage. Usually operated with
done by immersing the entire microscope into helium or neon as the imaging gas and the tip
a cryogenic bath and by heating the emitter to cooled down to 20 to 80 K, a spatial resolution
any desired temperature up to near its melting of -2.5 A can be achieved. Thus most atoms
point. Most studies have been done with the on the lattice can be resolved (Fig. 3). The
high-melting metals, W, Re, Ta, Mo, Nb, Ir, Pt, specimen surface developed by field evapora-
Rh, Pd, V, Ni, Fe, Ti, Cu, and various alloys tion at low temperature, is atomically perfect
and with adsorption layers of Hz, N z , Oz, CO, and free from contamination. Helium atoms
CH 4 , and the noble gases. Detailed patterns of are ionized above field adsorbed atoms, and
are then accelerated toward the screen as-
sembly. Nowadays, the screen assembly con-
sists of a micro channel plate and a fluorescent
screen. The microchannel plate converts the ion
image into an electron image and amplifies the
image intensity by a factor of about 100 to
1000. Each protruding surface atom is imaged
by 10 3 to 104 helium ions/sec. The surface
stays atomically clean as all impurity gases,
having a lower ionization potential than helium,
are ionized in the low field region in space
before they can reach the tip. Image stability
requires that the evaporation field of a sample
be higher than the ionization field of the imag-
ing gas. Use of helium is thus limited to refrac-
tory metals. For soft metals such as Au, Cu, Fe,
Ni, AI, etc. neon, argon or hydrogen may be
used with a slightly inferior resolution. Many
metals yield to the mechanical field stressF2 /8n
produced by the image field, which amounts to
about 1000 kg/mm 2 at the helium best image
field of 4.5 X 10 8 volts/cm.
The field ion images usually do not tell the
chemical identity of the imaged atoms. This
can be done by combining the field ion micro-
scope with a time-of-flight mass spectrometer
with a single atom detection sensitivity. This
new instrument (Fig. 4), the atom-probe field
ion microscope, allows the observer to select
one atom by tilting the tip until its image falls
onto a small probe hole in the screen. By super-
FIG. 2. Schematic diagrams of an early field ion imposing a 1O-nanosecond high voltage pulse to
microscope. The hemispherical cap of the tip, having a the dc imaging voltage the selected atom is field
typical radius of 500 A, is radially projected on to the evaporated and travels through the probe hole
fluorescent screen with the help of helium ions. Modern and the time of flight tube to reach the particle
FIM uses a stainless steel chamber, and is equipped detector, which is a Chevron channel plate. The
with a channel plate for image intensification. Other- flight time can be measured either with an oscil-
wise, the construction is essentially the same. loscope or with a set of electronic timers. From
FIELD EMISSION 452

... -. .. -. .....,;;.••..
. .... .. •.•.•..J.. . ..... . . .. .
'

-- ... ....
-...: ,.. . •
:
. -,
,

... ~ "
...
," '
'

..•••..•• J•"
~ -.- ... .. .. ".;. .....
.....
........ •
'
' 0
:" :.~.
" .•.••.
. . . . . . . . . .'
'
.....
••.•

. ".eO . 'e• • •
'. :.'.
..e .

•.. .:.......
- -...... 0':'
.. _.......
..... '.
... • •••. •. •••. ••.•• e' ••

,,- ...........
~. .~ •
- "I·, - ... .
0

.' . ... ::: ~ ~'"


~...........;"!···t·:·:~: :"?'~:f~'" "... :/.:.-,':. '~. . '.'
4
~.
• • ._
' _" '

t··..'. . . ..... ......"-- -.-...'......... ".' .... ..........'."'.,. ...


.. .. ':-~-:: _;.:~
. ;'4:~"'::. :.. .. . i '''' • '. .... . -4 " •.••• _

- ..
.- • ! .' ';"::.
- . . .i ••••.• -.:. e' .': ..'
.o.. ...... ,' .~.".~ ~' '.. -. ' '"'
... . •.••••
.' : •.. •• ." .. '."' .... • ' .• '

....• ...... ... , " ,... ." .....•


t. ... " ..
'
". ". .; '
t; , I

.• '."'" t, , ... • •••: -:~.:


' .... • . .'..
"'" .. " . .
-'-:" .,- ..... ". :. , .r
_
'1 •

~ f ·", ............
.. ..... ... ...._-_....
j :' ,••• t._':.......
. : .. '.' .......••.
.
.... '. . . .. . .- '.' ...•. ... .
-......... .. - ....' '.:: .;.':,. . -
. '. ...•. - ....
- ':.
.....
' '
•'. . ......
" ,

:
"

-;:.
'..
"

.~.. •" ::~...~ ..... '.-' . .. .'


.. -.i. '..
• • ." '. • e· .....
,.~._. •••• .~ ~..
.
, ~ ~..
_ ... '

...-,- .. .: '.•~:'. .- .•.......


'. '~ -' ' f", _...
. '.' :: .
.,
,e'
~ '.,:~: '. ' •.,~,
'.

. e· _" ':'. :'. ~'....~:. ......~.• ' • :.


..."
•••• ' '.. ..,... • '0 •

. ..'~... .'-.. ...•..." .


"., ,
'. ~. •

.. ..
Q

...
' ."

'. ., '.-
'
" ,
,
, •
.. \• ..
.


• •
". f.

.• •

:


• "
FIG. 3. Field ion micrograph of a fully ordered equiatomic platinum-<:obalt alloy crystal, showing the indio
vidual atoms as single dots. Original magnification on the S-incn screen was 1.5 million. In the image, only Pt
atoms are visible.

the flight time, the mass to charge ratio and the displayed. A pulsed laser imaging atom-probe is
chemical identity of the atom are identified. shown in Fi~ 6.
The energy focused time of flight atom probe is The atom-p~obe and the field ion microscope
capable of separating isotopes of an element to have been wid'ely applied to study fundamental
the root of the mass lines (Fig. 5). The nano- problems in metallurgy, materials science, and
second pulsed field evaporation can also be surface science. Single vacancies produced by
done with laser pulses. High resistivity materials quenching, cold working, or particle irradiation
such as high purity silicon can only be studied can be counted and localized, Interstitials re-
with the pulsed laser atom-probe. A slightly dif- sulting from impurities can be seen and their
ferent version of the atom-probe is the imaging thermally activated migration to the tip surface
atom-probe. It is essentially an ordinary FIM, can be measured. The structures of dislocations
but uses a screen assembly with single ion de- and the matching of the lattices along grain
tecting sensitivity, or the Chevron channel plate. boundaries can be studied in atomic detail. Sur-
Although the mass resolution is severely limited face diffusion of single atoms and simple atomic
by the short flight path, there is the advantage clusters on atomically perfect surfaces, inter-
of finding the spatial distribution of field de- action between adsorbed atoms, and adsorbed
sorbed atoms from its desorption image. In fact layer superstructure formation can be studied
by activating the chevron to detect only a certain quantitatively. The charge distribution of an
ionic species by a time gating technique, the adsorbed atom as manifest in the dipole moment
spatial distribution of a selected species can be and polarizability can be measured. The long-
453 FIELD EMISSION

TOP VIEW

II' I
A(LAIIO
I HHIU&C£5

SCOP f

FIG. 4. The energy-focused time of flight atom-probe. The tip orientation can be adjusted with the help of an
external gimbal mount and a metal bellows seal. The image can be seen through the 45 0 inclined mirror. By flip-
ping the channel plate assembly away, the pulsed field evaporated atom can travel through the 1630 spherical
condenser plates, and be detected. The flight time is displayed in the oscilloscope, as well as recorded in an
electronic timer.

Atom-probe mass spectrum


100
01 a PI-Au alloy

50

197Au Z+

lA
198pt H

pt3+ 19Z PI Z+

o /l,. 1. t~ n.
65 96 97 98 99
MASS CHARGE RATIO
FIG. 5. An atom-probe mass spectrum of a Pt-Au alloy where all the isotopes are separated.
FIELD EMISSION

+
xyz lens manipulation :w
~
p .
mountIng
assembly
454

Pulsed laser
Las~r.--,~ulse
~~==-'- ==i.... ~ .::
Quartz lens :'.

Trigger signal
+r----------. . . .: · . CEMA
Oscilloscope
or
~screen
Electronic ~ Fiberoptic
timer plate
S Ion image
computor Ion signal

FIG. 6. A pulsed-laser imaging atom-probe. Desorption images can be displayed, and simultaneously chemical
species of desorbed atoms identified from the flight times.

range and short-range order parameters in alloys Field Ionization and Field Evaporation," hog.
can be investigated. The composition of surface Surface Sci. 4, part I (1973).
layers in surface segregation of alloys can be Gadzuk, J. W., and Plummer, E. W. "Field Emission
measured with the atom-probe with true atomic Energy Distribution (FED)," Rev. Mod. Phys. 4S,
layer depth resolution. The composition of pre- 487-548 (1973).
cipitates, segregation of impurities to the grain Panitz, J. A., "Imaging Atom-probe Mass Spectros-
boundaries, and cluster formation in alloys can copy," Progr. Surface Sci. 8,219-262 (1973).
be studied in the atom-probe. The atom-probe, Tsong, T. T., "Quantitative Investigations of Atomic
an analytical tool of ultimate sensitivity, has . Processes on Metal Surfaces at Atomic Resolution,"
also revealed the phenomena of field adsorption, hogr. Surface Sci. 10, 165-248 (1980).
thus provides new insights into the mechanism Kellogg, G. L., and Tsong, T. T., "Pulsed-Laser Atom-
of the field ion image formation. The atom- probe Field Ion Microscopy," J. Appl. Phys. SI
probe has also been used to study gas adsorp- 1184 (1980).
tion, field evaporation, and exchange diffusion
of surface atoms. As scientists are always in- Cross-references: ADSORPTION AND ABSORPTION,
terested in understanding physical phenomena CRYSTALLOGRAPHY, ELECTRON MICROSCOPE,
through microscopic theories, atom-probe and LATTICE DEFECTS, POTENTIAL, SOLID-STATE
field ion microscopy have a lot to contribute in PHYSICS, SOLID-STATE THEORY, THERMIONICS.
the future.
ERWIN W. MULLER
TIEN T. TSONG
FIELD THEORY
The description of the physical world has
evolved profoundly through the ages, at times
References
because of, at other times being the cause of,
Good, R. H., Jr., and MUller, E. W., "Field Emission," sweeping changes in our philosophical, mathe-
in "Handbuch der Physik," Second Edition, Vol. matical and experimental knowledge. Greek
XXI, pp. 176-231, Berlin, Springer-Verlag, 1956. geometry concerned itself essentially with prop-
Gomer, R., "Field Emission and Field Ionization," erties of "objects as such," a triangle or a cube
Cambridge, Mass., Harvard Univ. Press, 1961. being studied, for example, without any thought
MUller, E. W., and Tsong, T. T., "Field-Ion Microscopy, of their spatial environment; the Ptolemaic sys-
Principles and Applications," New York, London, tem enhanced this view into a c1ockmaker's
Amsterdam, Elsevier, 1969. dream, where celestial bodies parade around the
Swanson, L. W., and Bell, A. E., "Recent Advances in earth, rigidly driven in circular motions. Only
Field Electron Microscopy of Metals," Adv. Electron. with Descartes' analytical geometry did objects
Electron Physics 32, 193-309 (1973). become "portions of space" and the properties
MUller, E. W., and Tsong, T. T., "Field Ion Microscopy, of space itself the main object of study; with
455 FIELD THEORY

Galileo and Newton, a correct science of dy- magnetic field, or the sources and sinks of hy-
namics was born, which permits the prevision of drodynamics. Moreover, we have to describe in
an amazing number of mechanical phenomena which way the values of the field quantities
in that space from a few first principles. change when the point at which they are con-
Field theory studies the phenomena of the sidered, or the time, is changed. In the absence
physical world as due to interactions which of discontinuities, for instance in vacuum, one
propagate through space; the "geometrical expects these values to differ by infinitesimal
emptiness," which is the space of mathematics, amounts if the corresponding points are infini-
becomes the medium into and through which tesimally close, in some way which is typical of
actions take place or, even more drastically, a the field considered; in other words, that the
structure which is itself determined by the prop- rates of change of the field quantities with re-
erties of matter, as in general relativity (which spect to the space coordinates and time be con-
will not be discussed in this article). nected by relations which specify both how
Suppose two bodies interact in space, e.g., the these changes can occur compatibly with the
sun and the earth with Newton's law, or two geometrical properties of space, and how they
electric charges with Coulomb's law . Two pic- are related to the sources. Group theory deter-
tures of this situation are equally possible and mines all the possible forms which are permis-
correct. One is that this interaction cannot be sible for these relations, which take the name of
conceived if both bodies are not there and that field equations; each field theory is character-
we should study primarily its effects without ized by a special set of field equations, which
looking for a detailed mechanism for its propa- are clearly partial differential equations.
gation from one body to the other; this is the RELATIVITY and QUANTUM THEORY have
description of "action at a distance", in which played a great role in the development of field
forces are the main concepts and space is a vac- theory; we shall briefly discuss, later, their in-
uum into which bodies follow trajectories deter- fluence both in the explanation of new physical
mined by the forces acting upon them. The phenomena and in the mathematical formula-
other picture consists of imagining that each tion of the theory.
body, whether alone or not, modifies the struc- Field theory has taken an entirely new shape
ture of the space which surrounds it, geometri- with the so-called second quantization, which
cally or because in each point of that space has led to several modern developments, of
there is now potentially a force, which becomes which some embody faithfully the concepts
active if another body occupies that point, but outlined thus far and others instead represent
should be conceived as existing there in any new views in natural philosophy; this is still a
case; the main objective here is to study how matter of controversy at present, and it is yet
these "fields of force" are created in space by unpredictable whether a reasonably lasting de-
material objects and how they propagate; this is scription of nature will come out of such at-
the point of view of "action with contact," tempts or whether a new drastic turn in human
which finds its full development in field theory. thought will be necessary before we can hope to
Mathematically, a field is characterized by understand the fundamental laws of physics. Be
assigning to each point of space a quantity that as it may, the ideas and the computational
which is intrinsically associated with it; a tem- techniques of field theory have proved already
perature, for instance, or a velocity, or a tensor of invaluable help in the description of many
or a spin or of arbitrary rank. "Intrinsic" means phenomena, from particle physics to supercon-
that if we change our frame of observation, this ductivity.
quantity does not change; supposing, e.g., that It is convenient to examine first the theories
our field is that of the velocities at a given in- in which the field quantities are ordinary func-
stant of all the points of a moving fluid, if we tions of space and time points, regardless of
rotate our coordinate system we shall observe whether they have a direct physical meaning (as
different values for the components of those with the velocities of hydrodynamics and the
velocities, just because we, not the velocities, electromagnetic forces) or not (as with the wave
have changed position. It is therefore essential function which obeys a Schrodinger equation).
that, together with the specification of the field This comprises of course most of classical and
quantities, their transformation laws also be as- modern physics; mathematics permits again,
signed under changes of the reference frame; however, a tremendous conceptual simplifica-
these laws are indicated by the description of tion. In the study of continuous media or fields
the field quantity as a "scalar" (which does not one is, most often, interested in one of the fol-
change), a "vector" (which changes with the lowing classes of phenomena:
same law as the coordinates), a "tensor," etc.; (1) Phenomena which consist of the propa-
the complete specification of all such possible gation of some action; the medium itself is not
laws is a standard chapter of group theory. transported from one place to another; typical
Physically, we have to account for the creation is the propagation of waves, whether they be
or the existence of the field, by describing the seismic, fluid or electromagnetic;
field quantities as generated by "sources," such (2) Phenomena in which there is transport or
as positive or negative charges for the electro- diffusion of a quantity in a medium: of heat in
FIELD THEORY 456

a wall, of solute in a solvent, of neutrons in a (instead of the inverse-square law of static


pile; fields); this part constitutes the radiation field,
(3) Equilibrium phenomena: deformations of which is responsible for the transmission of en-
strained elastic bodies, electro- or magnetostatic ergy and signals (the radiated energy is, of
fields as determined by charges and boundaries. course, supplied by the mechanism which drives
Each class is ruled by essentially one type of the generating charges or currents). This is easy
equation. Let ~ = d2 /dx2 + d2Idy2 + d2/dz 2 to understand: the energy radiated through a
denote the Laplace operator; </>= </>(x,y, z, t) large sphere around the source is proportional
the field quantity; F some function of x, y, z, t, to the area of the sphere times the square of E;
</> and, at most, of the first-order derivatives of it vanishes therefore with increasing radius for
cf>; v a velocity; and D a diffusion constant. The all but the radiative component, for which it
corresponding equations can be brought into the stays constant: energy is actually removed from
standard forms: the source and radiated away to all distances.
The study of radiation is a most important part
~cf> - ~ d2 cf> = F (hyperbolic partial differential of the theory, both macroscopically (telecom-
v2 dt 2 munications, radar) and microscopically (atoms,
equation) (I) nuclei, elementary particles).
Relativity and quantum mechanics have ex-
~</> - .!.. :cf> = F (parabolic partial differential tended and modified profoundly the classical
picture presented so far. The very concepts of
D ut
space and time change with special relativity:
equation) (2) events which are simultaneous for an observer
~cf> = F (elliptic partial differential equation) are not such when seen by another observer
in uniform motion with respect to the first, be-
(3) cause time and space are mixed together by the
If the field quantity has more than one com- Lorentz transformations which relate the refer-
ponent, one may deduce for each of its com- ence frames associated with the two observers.
ponents an equation which is essentially of the As a consequence, the laws of nature can retain
same type, although it may be difficult or im- their universal validity only if they are formu-
possible to obtain an independent equation for lated in the same form by any such observer,
each component. i.e., if their form is not altered by a Lorentz
This classification of physical phenomena ac- transformation-technically speaking, if they are
cording to the type of equation to which their "Lorentz covariant." This requirement becomes
study can be reduced is of the greatest impor- a stringent dogma; it suffices to determine, with
tance: Eqs. 0), (2) and (3) are called in fact the help of group theory, the possible equations
"the equations of mathematical physics"; more for any conceivable relativistic field theory; it is
specifically, Eq. (1) is also called the wave equa- of invaluable help, when computations are made,
tion, Eq. (2) the heat equation, and Eq. (3) the in checking or correcting them.
Laplace or potential equation. The study of The nonrelativistic Schrodinger equation for
their mathematical properties gives complete the wave function of a particle is, but for the
information on all the physical phenomena appearance of complex quantities, of the type
which they describe. (2) described before : this is not acceptable in a
The equations of quantum mechanics can also relativistic world, because time and space are
be brought, at least formally, into the form of not treated alike. One needs either an equation
Eq. (1) or (2); the intervention of complex which contains only second-order derivatives, or
quantities modifies the situation somewhat, in a one with only first-order derivatives; for a free
way which we cannot discuss here. particle, this leads either to the Klein-Gordon
Electromagnetic phenomena fall typically into equation, which is of type (1), or to the Dirac
the category of Eq. (I): each of the components equation, which contains linearly only the first-
of the electric field E == (Ex, E r , E z ) and of the order derivatives of the wave function, but has a
magnetic field H == (Hx , H y , Hz) satisfies, in vac- mathematical structure which necessarily assigns
uum, Eq. (I), with F = 0; the connections be- special physical properties to the particles de-
tween E and H are given by the Maxwell equa- scribed by it. It was one of the greatest triumphs
tions, which characterize completely the theory, theoretical physics ever witnessed, to discover
and lead in vacuum to the result just mentioned. that such properties are actually displayed by all
When E or H does not vary with time, Eq. (I) particles which obey the Dirac equation: spin,
reduces to Eq. (3), thus yielding electro- or and the existence for each Dirac particle of a
magnetostatics. corresponding antiparticle, i.e., a particle having
The electromagnetic field, i.e., the vectors E the opposite mechanical and electrical prop-
and H, generated by a distribution of moving erties. (See article on ANTIPARTICLES.)
charges or currents confined within a limited The requirement of relativistic covariance has
volume has a part which becomes dominant at a thus led to fundamental physical discoveri~s; for
large distance from that volume, because it de- each particle obeying the Dirac equation, the
creases only with the inverse of that distance corresponding antiparticle has been experimen-
457 FIELD THEORY

tally found in nature; electron and positron, Lamb shift, were accounted for with amazing
proton and antiproton, neutron and antineu- accuracy by quantum field theory. Qualitatively,
tron, etc. What is more, the theory predicts that the new conceptual framework has proved ex-
a particle-antiparticle pair can be created in a tremely useful in understanding elementary
collision phenomenon, if sufficient energy is phenomena, especially with the belp of the dia-
available, or can annihilate itself, giving away its grams devised by R. P. Feynman, which give a
energy in the form of electromagnetic radiation simple intuitive picture of collision and radiation
or other particles. processes involving elementary particles. Very
The classical theory allowed only for the little has been achieved quantitatively, though,
electromagnetic radiation emitted by moving for theories other than electrodynamics, because
charges or currents; the creation or absorption of the tremendous mathematical difficulties
of particles in collision phenomena, as well as which arise as soon as the simplest approxima-
the creation or annihilation of pairs, were out- tion techniques are not applicable because the
side its scope and possibilities. A new formula- interaction is too strong; nevertheless, these
tion of the theory was needed, which could ac- ideas have proved greatly helpful in many ways,
count consistently for all such phenomena, in combination with general principles of sym-
handling situations in which particles can be metry, Lorentz invariance and causality.
created and destroyed in any numbers. The .A beautiful consequence of this conception,
formalism devised for this purI>0se is that of which assumes that particles are the quanta of a
quantum field theory. field (as photons were recognized by Einstein to
The basic idea is to describe each type of par- be the quanta of the electromagnetic field) was
ticle by means of a field which is not any more the discovery of H. Yukawa, that whenever such
an ordinary numerical function of space and quanta have a mass different from zero, the
time, but an "operator," i.e., a quantity which force they create between two bodies which
changes the number of particles existing in any interact by exchanging such quanta with each
given state of the system. If the field operator other must be an exponentially decreasing func-
is known, one can then evaluate the probability tion of distance; this force can become of a
of a given state (so many particles, with deter- coulombian type only if the mass of the quanta
mined energies and momenta) changing into an vanishes. Thus, the Coulomb force can be ex-
equally determined, different state. If the par- plained as due to the exchange of photons
ticles do not interact among themselves or with among electric charges, the nuclear forces
other particles, no change is possible; if there is (which have typically short ranges) as due to the
interaction, the field operator has a structure exchange of massive particles among nucleons.
which can cause such transitions. The field equa- Exchanges of this nature are not observable in
tions appear to be essentially the same as those the laboratory, because this runs against Heisen-
of the classical Maxwell, Klein-Gordon, Dirac berg's indeterminancy principle; if enough en-
theories, etc.; their structure is however funda- ergy is supplied, however, such quanta can ac-
mentally different, because they now must be tually break loose and do appear as the particles
equivalent to infinite sets of ordinary equations, created in collision processes.
which couple states with different and ever- The mathematical difficulties encountered in
increasing numbers of particles. quantum field theory are many, and there is as
Fields which are associated with particles yet lack of agreement as to the best way to cir-
obeying Bose-Einstein statistics (of which any cumvent some of them. Besides mathematical
number can be found in any given state) have complexity, which prevents all but the simplest
radically different mathematical properties from calculations, there are many unsolved problems
fields associated with particles obeying Fermi- of mathematical rigor and apparent inconsisten-
Dirac statistics (of which at most one can be cies which can be removed only by delicate
found in any given state); examples of the first analyses. Typical of the latter is the fact that
are photons (the massless neutral quanta of the unsophisticated calculations give infinite values
electromagnetic field), pions (massive particles, for masses and charges of interacting particles,
with or without electric charge, which are be- and a painstaking analysis is required to retrieve
lieved to be responsible for nuclear forces), etc.; from them the significant physical values; this is
examples of the second are electrons and posi- the so-called renormalization procedure, which
trons, protons and antiprotons, etc. copes with infinities which partly are already
The passage from numerical fields to operator present in the classical theory (such as the in-
fields is called "second quantization"; quantum finite electromagnetic contribution to the mass
field theory deals with operator fields. of a point-like charged particle, when computed
Striking successes have been met with this ap- from Maxwell's equations) and partly originate
proach. From a quantitative point of view, they from the new formalism (which permits, for in-
are confined mostly to electrodynamics, where stance, pair creation).
very small deviations from the values predicted It is not yet certain whether such difficulties
by the non-quantized theory, which were ob- are due to the lack of adequate mathematical
served in the measurement of the magnetic techniques or are the expression of a funda-
moment of the electron and in the so-called mental inadequacy of the theory to describe
FIELD THEORY 458

ultimate laws of nature. For this reason, while, lure. Later radiochemical work indicated that
on the one hand, the attention of some the- some of the new activities were from elements
oreticians has been directed to perfecting the chemically similar to the much lighter elements
mathematical foundations of quantum field Ba, La, etc.
theory (giving rise to axiomatic field theory and Fission remained unrecognized until O. Hahn
to more rigorous methods of obtaining and and F. Strassmann, German radiochemists,
studying the quantum field equations, etc.), on showed by very careful work that these prod-
the other hand, most physicists have been trying ucts were not merely chemically similar to
new avenues, such as the S-matrix theory, dis- lighter elements, but were lighter elements.
persion relations, the so-called Regge poles, etc.; They published their startling results in the
these approaches have certainly led to very use- January 6, 1939 issue of Naturwissenschaften.
ful results, but they leave altogether at least as On January 16, Lise Meitner and O. Frisch sent
many doubts as hopes. in to Nature (from Stockholm and Copenhagen)
Whatever may be the future prospects of field a paper in which they named the new process
theory as the correct means for describing the "fission," predicted that the fragments should
fundamental laws of nature, its tremendous use- have large kinetic energies, and explained the
fulness in providing a conceptual framework, in process in terms of a liquid-drop model. On the
inspiring new ideas, and in suggesting computa- same date, Frisch sent in to Nature another
tional techniques has been overwhelmingly dem- paper in which he reported that he had ob-
onstrated in the last decades. It has now found served the large electrical pulses from fission
a new, very fertile ground of application in the fragments in an ionization chamber.
study of systems containing a very large number Niels Bohr in the meantime had taken the
of particles, where it has already provided a rea- news of the discovery of fission, and the predic-
sonably good quantitative understanding of tion of large energies, to a conference on
superconductivity and superfluidity, and prom- physics in Washington, D.C. During the confer-
ises many other results of interest in the study ence physicists in a number of laboratories in-
of solids and liquids. dependently verified the tremendous kinetic
energies of fission fragments, unaware as yet of
E. R. CAIANIELLO Frisch's results.
During the months following the discovery of
Cros&-references: ANTIPARTICLES, ELEMENTARY fission, there was feverish activity in laborato-
PARTICLES, QUANTUM THEORY, RELATIVITY, ries around the world. It was soon discovered
SCHRODINGER EQUATION, VECTOR PHYSICS. that neutrons were being produced by fission,
and that almost all of the fission of U was tak-
ing place in the relatively rare isotope, 235U. In
that same year (I939), Bohr and Wheeler pub-
FISSION* lished their theory of fission, based on the
Nuclear fission is the breakup of a heavy nu- liql1id-drop model, which is still basic to mod-
cleus, such as that of uranium, into two eql fission theory.
medium-weight nuclei, with the release of a Development of Atomic Energy On the date
considerable quantity of energy. Also produced of publication of the Bohr-Wheeler paper, Sep-
are a few neutrons, some gamma rays, and a tember I, 1939, Germany invaded Poland, the
number of beta-particles (electrons) from the Second World War was underway, and fission
radioactive decay of the two fragments. Fission suddenly had a new importance. It was realized
occurs spontaneously in some cases, or may be by many that a chain reaction was possible for
induced by bombardment of the fissionable fission, with the neutrons from each fission pro-
material with neutrons, protons, or other ducing more fissions, resulting in the release of
particles. very large amounts of energy.
Discovery of Fission Although fission was The fission process results in the conversion
of 0.09 per cent of the mass of the original nu-
not discovered until 1939, it had been realized, cleus into kinetic energy. This amounts to
ever since Einstein published his theory of rela- about 200 MeV per fission, or 3.20 X 10- 11
tivity in 1905, that there was a theoretical pos- joules. The fission of 1 kilogram of 235U thus
sibility of releasing tremendous energy from releases a total energy equal to 8.21 X 10 13
matter. joules, or 2.28 X 10 7 kilowatt-hours. This is
Fission is now known to have been first pro- roughly equal to the daily output of a large
duced by Enrico Fermi and his co-workers in hydroelectric power plant such as Hoover Dam,
1934, when they irradiated many elements, in- and very much greater than the energy released
cluding uranium, with the newly discovered in chemical reactions. One kg of 235 U is equiva-
neutrons. They found a number of different lent in energy release to the burning of 3.45 X
l3-activities to be produced from uranium, but 10 6 kg of coal (C) by 9.20 X 10 6 kg of oxyfen.
believed that these were due to neutron cap- This chemical process releases 7.2 X 10- 1 of
the mass as heat energy, more than 10 million
*Work performed under the auspices of the U.S. times smaller than for fission.
Dept. of Energy. The fission of 1 kg of 235 U is also equivalent
459 FISSION

in energy released to the detonation of 19.6 X The work of designing and building nuclear
10 6 kg (19.6 kilotons) of high explosive. A weapons was carried out at a laboratory set up
kiloton, 1000 metric tons, is conventionally at Los Alamos, New Mexico. This laboratory,
taken to be equivalent to 10 12 calories, or under the direction of J. Robert Oppenheimer,
4.186 X 10 12 joules; a megaton is 1000 times began its work in early 1943. Before the end of
larger. the war, a good fraction of the world's most
Thus it was known to many people in 1939 eminent nuclear physicists had come to work at
that it might well be possible to produce the Los Alamos, including Bohr, Fermi, Frisch, and
destructive effect of many thousands of tons of many British scientists.
high explosive with a single bomb containing a One of their most basic problems was to find
relatively small amount of fissionable material. a way to assemble the fissionable components
It seemed probable that Germany would press of a weapon rapidly enough to produce a pow-
ahead with this development. Aware of this erful chain reaction, lasting less than one [J.sec.
danger, scientists in the rest of the world largely The individual masses of enriched uranium or
ceased publishing fission results by 1940. plutonium had to be of such a size and shape as
Work on fission was continued quietly at an not to be capable of a chain reaction; i.e., they
increasing rate. In June 1942, the Manhattan had to be of less than critical mass. If these
Project (under U.S. Army direction) was set pieces were not assembled at sufficient speed,
underway in the United States, with the objec- the result would be only a minor explosion, or
tive of producing nuclear weapons, if possible. perhaps the melting of the device. Two ap-
The first chain reaction was produced on De- proaches were tried. One involved firing one
cember 2, 1942, under the direction of Enrico piece of fissionable material at another in a
Fermi, who had arrived from Fascist Italy in short "gun." The other was the implosion
January 1939. Fermi and his co-workers had method, in which the fissionable material is as-
piled up blocks of ordinary uranium and sembled into a highly compressed mass by the
extra-pure graphite (carbon) to produce a nu- explosion of a surrounding spherical shell of
clear reactor, under Stagg Field Stadium of the high explosive.
University of Chicago. The carbon was used to Both methods of achieving a nuclear explo-
slow down fission neutrons and thus increase sion were ultimately successful, but it was not
the likelihood of fission. Cadmium rods in- known whether either method would work un-
serted in the reactor (at that time called a til July 16, 1945, when the first "atomic
"pile") were used to control the chain reaction bomb," a plutonium implosion device, was set
by capturing a certain fraction of the neutrons. off in a desert area near Alamogordo, New
Thus, by December 1942, a fission chain re- Mexico. On August 6 a uranium gun weapon
action had been achieved. In the following was exploded over Hiroshima, Japan, and on
years, research vital to the Manhattan Project August 9 a second plutonium implosion warhead
was carried on at many laboratories. In order to was detonated over Nagasaki. Each weapon had
produce the fissionable material for nuclear a yield equivalent to about 20 kilotons of high
weapons, two tremendous industrial plants explosive and caused tremendous destruction.
were set up beginning in 1943, at Oak Ridge, On August 10 the Japanese first offered to
Tennessee, and Hanford, Washington. surrender, and accepted Allied terms on August
The Oak Ridge plant was for the purpose of 15; the mobilization of the massive United
separating the more fissionable isotope, 235U, States invasion force was called off. Germany,
from the much more common isotope, 238U, which had surrendered on April 8, 1945, was
which accounted for 99.3 per cent of the mass found to have made little progress toward
of ordinary uranium metal. The most successful nuclear weapons during the war.
separation process, which is still in use at Oak Following the end of the war, the work on
Ridge, was that of gaseous diffusion. Uranium nuclear weapons and nuclear power in the
in the form of a gas, uranium hexafluoride, is United States were placed under the newly
passed through a long series of porous barriers. created (1946) Atomic Energy Commission.
The lighter isotope 235U can diffuse more Similar agencies have since been set up in many
readily than 238U, and the result of the process countries. On August 29, 1949, the U.S.S.R.
is enriched uranium, the 23SU content having detonated its first nuclear device. During the
been increased from 1 part in 140 to around next several years, enormous production plants
95 per cent. for 23SU were built at Paducah, Kentucky, and
The Hanford plant consists of giant nuclear Portsmouth, Ohio, as well as facilities for pro-
reactors to produce a new element, plutonium, ducing 239pU and hydrogen isotopes on the
from ordinary uranium by neutron capture. It Savannah River in South Carolina. On Novem-
was thought, and eventually proved, that plu- ber 1, 1952, the Los Alamos National Labora-
tonium should be fissionable by slow neutrons tory exploded the first thermonuclear device
in the same way as 235U. Capture of a neutron (hydrogen bomb), with a yield equivalent to
by 238U produces the heavier isotope 239U, many megatons of high explosive. Such a
which then decays by beta-emission to the new weapon uses nuclear fission to trigger nuclear
element 239Np (neptunium), and then by an- fusion, in which light elements (such as the
other beta-decay to 239pu. various isotopes of hydrogen) are combined at
FISSION 460

exceedingly high temperature to give heavier ture, the most probable velocity is 2200
nuclei, with considerable release of energy. On meters/sec., which corresponds to a neutron
August 12, 1953, the USSR set off its first energy of 0.0253 eV. Intermediate reactors use
thermonuclear weapon. In the years since then, partially slowed down neutrons, with less than
Great Britain, France, China, and India have all 100-keV energy.
developed their own nuclear weapons. In order for a chain reaction to continue, it is
Nuclear reactors and nuclear power have been necessary that each fission produce, on the av-
developed extensively since the war, in many erage, at least one more fission. The average
countries. Nuclear-powered submarines, aircraft number of fissions produced by the neutrons
carriers, and other vessels have revolutionized from one fission is called k, the criticality fac-
naval strategy. Nuclear-propelled rockets (the tor or multiplication constant. If k is less than
"Rover" program at Los Alamos) have been de- 1.0 (the critical value), a chain reaction, even if
veloped for space exploration. The possible use begun with many fissions, will soon die out.
of nuclear explosives for earth-moving, increas- A reactor must have k = 1.0 during normal,
ing gas-well yield, and other engineering pur- steady, operation, and must have k greater than
poses has been an extensive project ("Plow- 1.0 during the start-up operation. These changes
share") of the Lawrence Livermore National in criticality factor can be brought about by
Laboratory, as well as of Los Alamos. Nuclear introducing or removing from the reactor con-
power plants already account for a significant trol rods made of neutron-absorbing elements,
fraction of electrical power production. such as cadmium or boron.
Chain Reactions The basic feature which This regulation of reactor power is made
makes both nuclear reactors and nuclear much easier by the phenomenon of delayed
weapons possible is the "chain reaction," in neutrons, which are emitted from some fission
which each fission produceS another fission, or products for a few seconds after the fission. In
several, by means of the several neutrons the case of thermal-neutron fission of 235U, 0.7
emitted from fission. If there is not enough per cent of the neutrons produced (0.017 out
fissionable material, or it is not arranged com- of 2.43 neutrons per fission) are delayed in this
pactly enough, no chain reaction will be possi- way. Only if k were increased above 1.007 in
ble. The fission neutrons emitted in such a situ- this case could the reactor power increase
ation will have too great a chance of escaping rapidly, since each fission would then, on the
from the fissionable material, or of being ab- average, produce one more fission promptly.
sorbed in non-fissionable material, to continue Reactors are of course designed to avoid this
the chain of fissions should one fission occur. "prompt critical" condition, since this can lead
There is thus a "critical mass," or minimum to overheating and destruction of the reactor. In
amount of fissionable material, necessary for a any case, reactors cannot possibly achieve the
chain reaction in any given arrangement. For a high values of the criticality factor needed for a
spherical mass of metal in air, the critical mass true nuclear explosion.
of highly enriched (94 per cent) 235U, for in- A "breeder" reactor is one which produceI' as
stance, is 52 kg; the critical mass for 233U or much nuclear fuel as it consumes, such as by
239 Pu metal is lower about 16 kg. It is the cap- using the capture of neutrons in 238 U to produce
ture of neutrons by 238 U which leads to a higher thermally fissionable 239 Pu. It is of course nec-
critical mass in the first case. The critical mass essary to reprocess the fuel elements to recover
can be lowered by mixing fissionable material the 239pu. Breeder reactors are now being
with graphite or other material as a "modera- developed for more efficient power production.
tor" to slow down the neutrons, or by sur- It is of interest that the uranium contained in
rounding the fissionable material with a reflec- coal, which is discarded with the fly ash and flue
tor to scatter neutrons back. gases, could produce more power than the coal
The smallest critical masses are achieved by if efficiently used.
water (or heavy water) solutions of fissionable The Fission Process The fission of a nucleus
material, since Hand D are most effective in can be understood on the basis of the liquid-
slowing down neutrons. The "Water-Boiler" drop model, which was first discussed by
thermal reactor at Los Alamos operated ordi- Meitner and Frisch and later greatly extended
narily with about 1100 grams of enriched by Bohr and Wheeler. This model explains
uranium in water solution, and achieved criti- many features of fission, but others are still not
cality with less than 600 grams. fully understood, and the complete theory of
The- energy of the neutrons producing most fission is still awaited.
of the fissions in a reactor determines whether On the basis of this model, the nucleus is as-
it is called "fast" "intermediate" or "ther- sumed to be similar to a uniformly charged
mal." Fast reacto;s use neutrons ~nly slightly drop of incompressible liquid. It will then be
slowed down from the 2-MeV average energy normally spherical, kept in this shape of mini-
with which they are emitted. Thermal reactors mum energy by the effect of surface tension.
use neutrons slowed down, by collision with However, the individual parts of the positive
moderator atoms, almost to the velocities corre- charge, actually protons, tend to repel each
sponding to thermal motion at the temperature other and to lessen the effective surface tension.
of the reactor. At 294 K, about room tempera- It has been calculated that the two effects will
461 FISSION

cancel each other out for a value of Z 2fA equal sion may also be induced by gamma rays of
to about 45 (Z is the atomic number or the energy equal to the fission barrier or by ener-
number of protons; A is the mass number or getic particles of many kinds such as protons or
total number of nucleons in the nucleus) . The alpha particles.
ratio of Z 2fA to this critical value is called the The probability that particles passing through
fissionability parameter x. a target of fissionable material will induce fis-
For a fissionability parameter equal to 1.0, sion is measured by the fission cross section.
the nucleus should have no effective surface The "cross section" is the effective area of a
tension and no stability against distortion, so nucleus for a given process. For a thin target of
that it should promptly elongate to a point thickness 1 cm having n nuclei per cubig centi-
where the coulomb (electrostatic) forces can meter, each with fission cross section or cm 2 ,
blow it apart. Such a nucleus would have no the probability that a particle passing thruugh
"fission barrier" and could not exist long. the target will induce a fission is nlor.
Known nuclides have lower values of the fis- Figure I shows the fission cross section for II
sionability parameter; for 235U the value of x is fissionable nuclei, as a function of the energy
about 0 .8. For such nuclei, it takes 5 or 6 MeV (in MeV) of the neutron ' inducing fission. For
of energy to deform the nucleus to the point those nuclides (233U, 23SU, 239pu) which lead to
where coulomb forces can cause fission. How- even-even compound nuclei, the cross sections
ever, such a fission barrier may be overcome by are very high for low-energy neutrons; these
the capture of a neutron, which adds an excita- are "fissile" nuclides. For the other target
tion energy of 5 or 6 MeV. nuclides, the fission cross section is extremely
For the even-Z, odd-N nuclides 233U, 235U, small until the neutrons have energy in excess
and 239pU (N is the number of neutrons in the of a threshold energy. All the cross sections
nucleus), fission can be induced by the capture may be seen to increase noticeably at about 7
of even a low-energy neutron. Even-even target MeV. This increase is due to the added possi-
nuclides require more energy, so that 238U is bility of fission following low-energy neutron
very unlikely to fission upon capture of a ther- emission, which becomes energetically possible
mal neutron, but needs about I MeV additional here.
neutron energy to overcome the fission barrier. The energy released when fission takes place
Even-even compound nuclei (i.e., after neutron amounts to about 200 MeV. This is the differ-
capture) are evidently more likely to undergo ence in mass between a heavy fissionable nu-
fission. Even-even nuclei are also much more cleus and the two medium-weight nuclei, plus
likely to undergo spontaneous fission, in which neutrons, into which it breaks up. This energy
an occasional nucleus manages to overcome the release of 200 MeV amounts to 21.3 per cent
fission barrier without any added energy. Fis- of the mass of a single proton or neutron. Most

If)
z
a:
'"m2
z
o
I-
oW
If)
If)
If)
o
51
Z
o
in
If)
ii:

o 4 6 8 10
NEUTRON ENERGY

FIG. 1. Dependence of fission cross section (a "barn" is 10-24 cm 2 ) on inducing neutron energy (in MeV) for
11 different fissionable nuclei (reprinted with permission from R. L. Henkel, "Fission by Fast Neutrons," in J. B.
Marion and J. L. Fowler, Eds., "Fast Neutron Physics, Part II," New York, Interscience Publishers, 1963).
FISSION 462

of the energy appears in the form of kinetic and an increase of the charge of the nucleus by
energy of the two fission fragments, which are one unit. The neutrinos ultimately carry away
sent flying apart with an average total of 167 about 10 MeV of the energy released, and for
MeV in the case of thermal fission of 23SU. all practical purposes this energy is not detec-
The two fragments share the kinetic energy in table and is never seen again. The fission prod-
the inverse ratio of their masses, since they have ucts are intensely radioactive immediately after
equal and opposite momenta. The mass division fission, as the beta decays having the shortest
is usually asymmetrical, as may be seen in Fig. half-life are completed, and gradually become
2. The probability that neutron fission of 23SU less radioactive with the passage of time.
will produce two given fragments of nearly For a small fraction of the fission products it
equal mass (symmetrical fission) is about 600 is energetically possible for a neutron to be
times lower than that of the most probable case emitted at some stage in the chain of beta de-
of asymmetrical division into light and heavy cays. These delayed neutrons, amounting on
fragments. The initial fragments (before neu- the average to 0.017 per thermal-neutron fission
tron emission) in this case have masses averag- of 23SU, are very useful in stabilizing the opera-
ing 96 and 140 mass units. The lighter fragment tion of nuclear reactors, as mentioned above.
has on the average about 99 MeV of kinetic Because they are emitted immediately follow-
energy, and the heavy fragment about 68 MeV. ing beta decays, they appear to follow the same
These energies correspond to initial velocities of radioactive decay curves as some of the beta
104 and 1.0 X 10 9 cm/sec, or 4.7 and 3.2 per decays, and have varying half-lives, of the order
cent of the speed of light, respectively. The of seconds.
total fragment energy is reasonably well ap- In rarer cases-about one fission out of 400-
proximated by 0.121 Z2/A 1/3 MeV. an alpha particle (4He) is emitted during the
Such fragment energies are due to the Cou- fission process, in addition to the two frag-
lomb interaction between the two fragments, ments. This process is usually called ternary
and would be expected on the basis of any fis- fission, as distinguished from the usual binary
sion theory. The asymmetrical mass division, fission. Because of the high energy (averaging
however, would not be predicted by liquid-drop 15 MeV) and direction (roughly perpendicular
theory and is evidently due to nuclear shell to fragment motion) of the alpha particle, it is
effects. The liquid-drop model would, in its probably formed at the same time as the two
simplest form, predict primarily symmetrical heavier fragments, and between them. In still
fission, which is rarely observed. rarer cases, other light charged particles such as
The fragments, as we have seen, carry away nuclei of tritium (3H) or of 6He are seen. There
most of the 200-MeV energy release in the seems even to be evidence that, in perhaps one
form of kinetic energy. The rest of the energy is fission out of 100 000, three fragments of
released in the form of neutron energy, gamma roughly equal mass are formed.
rays, and beta decay. The prompt neutrons, In the vast majority of fissions, however, two
which are emitted within 10- 14 sec. following fragments and several neutrons are the result.
fission, will be discussed below. The fission For low-energy fission it is clear that most, or
fragments are strong sources of prompt gamma perhaps all, of the neutrons are emitted by the
rays, emitting about 8 within a microsecond or fragments. The average number of neutrons per
less following fission. The gamma rays have a fission (ii) ranges from 2.0 to 4.0 for various
broad spectrum of energies, up to as much as 7 nuclides. Fission at high energies, such as that
MeV, but they average about 1 MeV apiece, so induced by 100-MeV alpha particles, produces
that the total energy emitted in prompt gamma many more, neutrons, most of which are
rays is about 8 MeV. This is more than simple emitted before fission.
theories would predict. In the typical case of thermal-neutron fission
Immediately after the emission of neutrons of 23SU, ii is 2.43, including delayed neutrons.
and prompt gamma rays, the fragments (now For individual fissions the number v can vary
called "fission products") begin the process of from zero to 5 or 6, the standard deviation
beta decay, which ultimately accounts for per- from the average being ±1.l0. The prompt
haps 22 MeV of fission energy release. The fis- neutrons in this case have an average energy of
sion products have neutron-to-proton ratios 1.94 MeV. The neutron energy spectrum in this
which are nearly the same as that of the heavy and other cases is well described by a Max-
nucleus from which they were formed (about wellian distribution,
1.55). This is too many neutrons for stability in
the fission-product mass region, where non- N(E) = (2/yrrT3) VEe-EfT
radioactive nuclei have neutron-proton ratios in
the range 1.3 to 104. Since there is thus energy
to be released by changing neutrons to protons, in which N(E) is the number of neutrons per
the result is a long sequence of beta decays av- unit energy E, and T is a parameter equal to
eraging 3 or 4 for each fission product. Each two-thirds of the average energy E. Such a spec-
beta decay results in the emission of a negative trum is predicted from nuclear temperature
electron, a neutrino, and usually gamma rays, theory.
463 FISSION

9 r----.---.r---.----.----.----.----.----.---.r---, 9
• INITI AL MASS YIELD
8 + + FINAL MASS YIELD 8

7 7

+t+L. ++ ...... +t (J)

('\
+ , •••• \ ••~ ••
6~

/'.\
a:
f-
5~

i +\
z

.!
....
+\ 4°
II::

l~~.·
LIJ

!
<Xl

t .·~. 3~
z
2 ~ ZlL •• ' $>.~O" -:·~~.:.·: :;;\:- f 2

t/
i ~.
.......; • . ~
. +\
j . ....".•."... +\"'
.! : ~~ ,'''- ,, '
.l .•.~. :~· ··
++~. :
+\u....... .........~....)~
~.
\\
+!t:.t....
o ! !
o
70 80 90 100 110 120 130 140 150 160 170
FRAGMENT MASS

FIG. 2. Dependence of neutron yield on initial fragment mass for thermal-neutron fission of 235U. Average
numbers of neutrons emitted by light and heavy fragments are given the symbols vL and vH; the total from both
fragments is v. Standard deviations are indicated by dotted lines. Also shown are the initial and final mass yields
[reprinted from J. Terrell, Phys. Rev., 127,880-904 (1962) J.

Of the roughly 2-MeV average energy per zero for the lightest of the light fragments, and
neutron, about 0.75 MeV is contributed by the also for the lightest of the heavy fragments, and
motion of the emitting fra~ments. There is evi- rises to high values elsewhere. Thus the two
dence that the average neutron energy increases fragments from a fission will usually emit quite
with increasing number of neutrons. This would different numbers of neutrons, which implies
be expected from considerations of nuc~ar quite different initial excitation energies. This
temperature, which lead to the relation E = phenomenon came as quite a surprise when it
was first reported by Fraser and Milton in 1954.
0.75 + 0.65~, in MeV, in agreement with It has since been found to be a common and
experimental data. perhaps universal feature of fission.
The total kinetic energy carried away by the This sawtooth dependence of neutron num-
typical 2 or 3 neutrons per fission is thus about ber on fragment mass may be understood on the
5 MeV, of which 2 MeV is taken from fragment basis of varying stiffness of the fragments
energy. The fragments after neutron emission against deformation. The fragments having
thus have a total energy reduced (in the case of "magic" numbers (closed shells) of neutrons or
235U) from 167 to 165 Me V, and lower mass protons would be expected to be stiff and to
numbers. be exceptionally resistant to deformation and
The final fragment masses may be determined the consequent excitation. The magic number
by radiochemical data on fission products. The of neutrons, N= 50, will occur for fragments in
final mass yields for 235U are shown in Fig. 2, the vicinity of mass 82, at the lower boundary
as crosses. The initial mass yields may be deter- of the light fragment peak. Similarly, two closed
mined from simultaneous measurement of the shells (Z = 50 and N = 82) occur at the lower
velocities of the two fragments and are also edge of the heavy fragment peak, near mass
shown in the Figure. The differences between 130. The low neutron yields seen at these
initial and final mass yields are accounted for masses may be quantitatively explained on the
by emission of neutrons and may be used to basis of the effects of closed nuclear shells on
determine the numbers of neutrons. The aver- the deformation parameters. It is hoped that
age numbers of neutrons emitted by individual this fragment-deformation theory will be able
fragments are shown (VL and VH) as functions to account quantitatively for the spectrum of
of initial fragment mass; the total v(= VL + VH) fragment masses, since the mass yields and neu-
is also shown. tron yields tend to vanish near the same magic
As may be seen in Fig. 2, the average numbel numbers.
of neutrons emitted by a fission fragment de- Fission isomers have been of much interest
pends strongly on the fragment mass; it is near recently. These are excited states of fissionable
FISSION 464

nuclides, produced by bombardment with en- of momentum of the vehicle equals change of
ergetic particles or nuclei, which exist for times momentum of the working fluid), and the dic-
of the order of 10-2 to 10-9 seconds before tates of the perfect gas laws dominate the design
fissioning. Such quasi~stable states, lying a few and operation of the propulsion system, be it
MeV above the unexcited ground states, are an aircraft engine with propeller, a turbojet
believed to be "shape isomers" explainable by a engine, a solid rocket motor, a gas-turbine-
double-humped fission barrier, for which there driven rotor system in a helicopter, or any
is other evidence in the intermediate structure other propulsion device.
of resonances in the fission cross sections. Such The advent of the "jet age" coincided with
a fission barrier has a second minimum, at some World War II and a wide variety of propulsion
deformation, in the energy required to deform devices have been studied and developed since
the nucleus from the ground state. It is thought then. Table I lists the major types of propul-
now that all fission barriers have such a double- sion devices and indicates their status. Note the
humped structure, first explained by V. M. three energy sources listed. In time, additional
Strutinsky in terms of variation of shell struc- energy forms may prove feasible such as the
ture with deformation. laser beam, electromagnetic waves and proton
or other particle beams transmitted to a flying
JAMES TERRELL receiver.
Fundamental Relations Assume an ideal en-
References gine inside of which the fluid receives energy
and is heated and accelerated as shown schemat-
Gindler, J. E., and Huizenga, J. R., "Nuclear Fission," ically in Fig. I. The acceleration of a mass
Chapter 7 in "Nuclear Chemistry," (Ed. by L. Yaffe) flow of air or fluid ma from an initial velocity
Vol. II, New York, Academic Press, 1968. 1J0 (which equals the forward flight velocity of
Glasstone, S., and Sesonske, A., "Nuclear Reactor En- of the vehicle) to the jet velocity at the exit
gineering," Third Edition, New York, Van Nostrand lJe will result in a net thrust F, which is equal
Reinhold,1980. to the mass flow rate multiplied by the velocity
Glasstone, S., Ed., "The Effects of Nuclear Weapons," increment. Additional terms are added to this
Third Edition, U.S. Government Printing Office, momentum relation in order to correct for the
1977. additional mass flow rate of the fuel or propel-
Hyde, E. K., "The Nuclear Properties of the Heavy lant (mp) (the fuel is carried in the vehicle)
Elements III: Fission Phenomena," Englewood and for any difference in static pressure of the
Cliffs, N.J., Prentice-Hall, Inc., 1964; New York, jet exit of the engine (Pe) and the atmospheric
Dover Publications Inc., 1971. or ambient pressure Po.
LA.E.A., "Physics and Chemistry of Fission," Pro-
ceedings of the Third LA.E.A. Symposium, Vienna, F = ma(lJe - 1J0) + mplJe + Ae(Pe - .Po)
LA.E.A., 1973.
Keepin, G. R., "Physics of Nuclear Kinetics," Reading,
The last term in this equation is called the pres-
Mass., Addison-Wesley Publishing Co., 1965. sure thrust; it is positive if Pe > Po (which
Terrel, J., "Prompt Neutrons from Fission," in Pro-
occurs when there is incomplete expansion of
ceedings of the LA.E.A. Symposium on the Physicsthe gases in the engine exit nozzle) or negative
and Chemistry of Fission; LA.E.A., Vienna, 1965.if Pe < Po (which occurs when there is over-
Vandenbosch, R., and Huizenga, J. R., "Nuclear Fis-
expansion in the engine exit nozzle). For a
sion," New York, Academic Press, 1973 . rocket engine, the air mass flow ma = 0, and
the thrust is equal to the last two terms of the
Cross-references: ATOMIC ENERGY, FUSION, NU- equation only. In the case of a propeller
CLEAR RADIATION, NUCLEAR REACTOR, NU- engine, there are really two different air flows,
CLEAR STRUCTURE. and the first term in the above equation is
split into two separate terms: one flow which
crosses the plane of the propeller and is acceler-
FLIGHT PROPULSION FUNDAMENTALS ated by the propeller blades and a second
smaller airflow which goes through the engine
Propulsion is the act of changing, or maintain- to furnish oxygen for combustion.
ing, the motion of a vehicle flying in air or in Consider a winged vehicle in equilibrium
space. This includes deceleration for reentering rectilinear flight in a two-dimensional (fixed
the earth's atmosphere with manned spacecraft. plane) trajectory; assume all control forces,
Fundamentally, the operating principles of lateral forces and turning moments to be zero
propulsion devices, regardless of type or class, and the flight direction to be the same as the
embody the basic laws of MECHANICS, THER- thrust direction. In the direction of the flight,
MODYNAMICS, CHEMISTRY, ELECTRICITY, and the instantaneous vehicle mass my times the
other sciences. In propelling a vehicle, it is vehicle acceleration dlJ/dt has to equal the sum
necessary to accelerate a mass, commonly called of all the forces, namely a component of the
the working fluid, in a direction opposite to thrust F, the aerodynamic drag D, and a com-
flight (or thrust force). Newton's basic laws of ponent of the gravitational attraction or the
motion, the conservation of momentum (change weight mvg. The angles are as defined in Fig. 2.
465 FLIGHT PROPULSION FUNDAMENTALS

TABLE 1. TYPES/STATUS OF FLIGHT PROPULSION DEVICES


(ELECTRIC PROPULSION EXCLUDED)

Energy Source*

Propulsion Device Chemical Nuclear Solar Working Fluid

Piston engine/propeller D/P Surrounding air


Turbojet D/P TFD Surrounding air
Turbofan D/P TFD Surrounding air
Turboprop D/P Surrounding air
Turbo-ramjet TFD Surrounding air
Ramjet D/P Surrounding air
Pulsejet D/P Surrounding air
Rocket D/P TFD Stored propellant
Ducted rocket TFD Stored + surrounding
air
Air turborocket TFD Stored + surrounding
air
Solar heated rocket TFD Stored propellant
Photon rocket TFND Photon ejection (no
stored propellant)
Solar sail TFD Photon absorption (no
stored propellant)

*D/P: developed and/or considered practical; TFD: technical feasibility has been
demonstrated, but development is incomplete; TFND; technical feasibility has not
been demonstrated as yet.

mvdv/dt = F - D - mvg sin 0 (2) rium flight, 0 = 0, dv/dt = 0, and thus Eq. (2)
reduces to F = D. The vehicle mass mv consists
The vehicle mass mv multiplied by the accelera- of the vehicle dry mass or final vehicle mass
tion in a direction perpendicular to the flight after expenditure of all propellant (mv)C plus
path (v dO/dt) must equal the sum of all forces the propellant or fuel mass mp. For steady fuel
perpendicular to the flight direction; here, the flow, mp = mpt. For the case of gravity-free
lift force L must be considered. flight in a vacuum (true space environment),
Eq. (1) can be rewritten for a rocket:
dO
mv v -dt =L - m v g cos 0 (3)
dv = _F_ = mpVe
dt mv (mv)C + mpt
The solution to these two equations results in
the determination of a two-dimensional trajec- Integration gives
tory, maximum flight velocity, range, and other
mv - mp (mv)j
flight performance parameters. The actual solu-
tion is three-dimensional, and must usually be a
Vv =- Ve In mv
= Ve In -(mv
-)-
C
numerical integration since mv decreases with
time, Land D vary with speed and altitude, and Thus the maximum velocity attained by a
the direction of thrust is not the same as the rocket-space vehicle operating in a gravitation-
flight direction; also, both the flight angle and less vacuum is equal to the product of the
the angle of attack are usually changing. For the average effective rocket exhaust velocity Ve and
case of a linear, simplified horizontal equilib- a logarithmic function of the initial vehicle
mass (mv)i = (mv)C + mp (fully fueled vehicle)
at start of the engine operation, divided by the

..---r
__ 'LIGHT OIRIEC T ION

__ _____ t - - ::~Z:E:r:~

FIG. 1. Simple engine. FIG. 2. Simple free body diagram of flying vehicle.
FLIGHT PROPULSION FUNDAMENTALS 466

final vehicle mass (with all the fuel expanded) HIGH PRESSURE GAS TANK

(mv)f at the end of engine operation. This


PRESSURE REGULATOR
velocity !Iv will be large when Ve is large, i.e., GAS FILL VALVE
high energy is available from the propellant or
the engine and when (mvR is small, i.e., whel
the dry mass of the vehicle (dry engine mass
tanks, payload, or structure) is small and no
unnecessary mass is designed into the vehicle.
This means that mp is large and the initial
vehicle mass (mvR consists largely of propellant.
Two important measures of engine perfor-
mance are specific fuel consumption, sfc, (for ____ 'JXIDIZER FILL PLUG
airbreathing engines) and specific impulse Is
(for rocket engines); one is the reciprocal of the
other. Specific fuel consumption for turbojets,
turbofans, and ramjets is expressed as pounds
of fuel per hour per pound of thrust, and as
pounds of fuel per hour per shaft horsepower CALIBRATION
for piston engines and turboprop engines. ORIFICE

Specific impulse is usually expressed in seconds,


a short designation for units of thrust (force)
per units of propellant mass flow per second.
Both of these parameters are an indication of
5000 0 F
engine design quality, the higher the Is the
better, and conversely for sfc.
By definition, specific impulse per second is:
F F
Is = -:- = -.- (kg force/kg mass per second)
w mg

3000 OF HOT GAS

with w being propellant flow rate per second . FIG. 3. Simplified schematic diagram of liquid pro-
pellant rocket engine with pressurized gas feed sys-
For a given Is, the nozzle expansion ratio should
be stated such as 1000 -+ 14.7 (sea-level specific tem and uncooled thrust chamber (reproduced by
impulse at 1000 psi chamber pressure), or permission from McGraw-Hill Encyclopedia of Science
1000 -+ 0.2, indicating expansion at high alti- and Technology, Vol. II, New York, McGraw-Hill
tude or space flight. Book Co.)
Some of the most significant types of en-
gines are described briefly in the remainder of
this section. dependent from its surrounding fluid. Thus a
Rocket Engines These engines use both a rocket can operate in space, air, or under water.
fuel and oxidizing propellant and both are The supersonic nozzle jet exit velocity Ve of a
stored within the flying vehicle, making it in- rocket using ideal gas laws can be derived to be

TABLE 2. TYPICAL DATA FOR VARIOUS ROCKET ENGINES

Typical Range of Typical Range of


Type Thrusts (lbs) Duration Application

High thrust liquid 1 000 000 to 4 000 000 I to 5 min Booster and sustainer stages of large
propellant rocket for each engine with missiles and space vehicles
several engines in a
cluster
Large solid propellant 100 000 to 3 000 000 1 to 3 min Long range missiles, space
rocket boosters
Small missiles, lunar landing
Prepackaged storable 100 to 100000 1 to 60 sec and takeoff
liquid propellant missiles, lunar landing and takeoff
Jet assisted takeoff 200 to 10000 5 to 30 sec Assist takeoff of airplanes
(solid propellant)
Space vehicle attitude 1 to 150 0.01 to 10 sec/cycle; Control position, angle and
control accumulate up to orientation of spacecraft
an hour
467 FLIGHT PROPULSION FUNDAMENTALS

LOw PRE SSURE GAS FOR


2gkR T [1 _ (pe\<k-l)/k SLIGHT I ATION OF TANKS
Ve =
(k- l)M Pc)
where
Ve = nozzle exit velocity
g = gravitational constant
k = ratio of specific heats of gas
R = universal gas constant
M = molecular weight of hot gas FILL PLUG

T = absolute combustion temperature


Pe = nozzle exit gas pressure
=
Pc combustion chamber pressure. GAS GENERATOR VALVES

The exhaust velocity (or the specific impulse GAS GENERATOR


COMBUSTION CHAMBER
which is Is = Ve/g) increases as the molecular (1400 0 F GAS)

weight M is decreased or as the combustion STARTER MOTOR


TURBINE

temperature T is increased. Because of the pres- FUEL AND OXIDIZER GAS


sure ratio effect, there is actually a slight (10 to GENERATOR fEED LINES
(I TO 4%. OF
FUEL PUMP
20 per cent) increase in specific impulse as the OXtolZER PUMP
PROPELLANT FLOW)

altitude is increased (lower ambient pressure) or MAIN PROPELLANT VALVES


as chamber pressure is increased. Values of Ve TURBINE EXHAUST
or Is calculated for a given propellant and THRUST CHAIMBEI.--- GAS DUCT

engine from thermochemical and thermody-


namic data are usually very close to actual
performance (usually within 5 to 10 per cent),
because rocket combustion efficiencies are high
and nozzle losses are usually low. Schematic FIG.4. Simplified schematic diagram of liquid pro-
diagrams of several liquid and solid propellant pellant rocket engine with turbopump feed system
systems are shown in Figs. 3 to 6, and some and regeneratively cooled thrust chamber (reproduced
important applications are shown in Table 2. by permission from McGraw-Hill Encyclopedia of
In liquid propellant rocket engines the pro- Science and Technology, Vol. II, New York, McGraw-
pellants are fed under pressure from tanks in Hill Book Co.)

FIG. 5. F-l rocket engine used in booster stage of advanced Saturn space vehicle.
(Courtesy of Rocketdyne, A Division of North American Aviation, Inc.)
FLIGHT PROPULSION FUNDAMENTALS 468

the vehicle into a thrust chamber where they Solid Propellants. In a solid propellant rocket
are injected, mixed, and burned at high pres- engine all the propellant is contained within the
sures and very high temperatures to form the combustion chamber. The hardware includes, in
gaseous reaction products, which in turn, are addition to the combustion chamber nozzle, an
accelerated in a nozzle and ejected at high igniter and provisions for mounting the rocket
velocities. The feed system for transferring the (Fig. 6). Solid propellants themselves usually
propellants into the thrust chamber includes have a plastic, cakelike appearance (specific
valves and controls. gravity is approximately 1.6) and burn at high
The principal components of a thrust chamber pressure (10 to ISO atmospheres) on their
(Figs. 3 and 4) are the nozzle, the chamber, and exposed surfaces to form hot exhaust gases
the injector. An injector introduces and meters which are ejected through the nozzle. The
the flow of the liquid propellants and also physical mass or body of the propellant is
atomizes and mixes them in the correct pro- called the grain.
portions in such a manner that they can be Processed (including curing) propellants fall
readily vaporized and burned. In the combus- into three general types: (1) double-base; (2)
tion chamber, the burning of the liquid propel- composite; and (3) composite double-base.
lant takes place at high pressure, usually be- A double-base propellant forms a homoge-
tween 5 and 150 atmospheres. neous cured propellant, usually a nitrocellulose-
The gas pressure feed system (Fig. 3) offers type of gunpowder dissolved in nitroglycerin,
one of the simplest and most common means of plus minor percentages of additives. Both the
transferring propellants by displacing them with major ingredients are explosives, and both con-
a high-pressure gas which is fed into the tanks tribute to the functions of fuel, oxidizer, and
under a regulated pressure. In a turbopump binder.
feed system, the propellant is pressurized by A composite propellant forms a heterogeneous
means of pumps driven by one or more tur- propellant grain with the oxidizer crystals and
bines (Fig. 4) which derive their power from a powdered fuel (usually aluminum), held to-
the expansion of hot gases. A separate gas gether in a matrix of synthetic rubber (or ap-
generator ordinarily produces these gases in the propriate plastic) binder, such as polybutadiene.
required quantities and at a temperature which Normally, composite propellants are less haz-
will not hurt the turbine buckets (1200 to ardous to manufacture and handle than double-
2000°F). base propellants.
Liquid Propellants. A bipropellant rocket unit Composite double-base propellants are a com-
has two separate propellants, a fuel and an bination of the double-base and composite pro-
oxidizer (such as liquid hydrogen and liquid pellants, usually comprising a crystalline oxi-
oxygen), which are not mixed until they come dizer (ammonium perchlorate) and powdered
in contact with each other in the combustion aluminum fuel, held together in a matrix of
chamber. Most liquid propellant rockets have nitrocellulose-nitroglycerin. The hazards of pro-
been of this bipropellant type. A monopropei- cessing and handling this type of propellant are
lant contains oxidizing agent and combustible similar to those of double-base propellants.
matter in a single substance. It can be a mixture Representative formulations for the three
of compounds, such as hydrogen peroxide with basic types of propellants are given in Table 4.
liquid alcohol, or it may be a homogeneous In actual practice, each manufacturer of a pro-
chemical agent, such as hydrazine. Typical pellant has a proprietary precise formulation
values of liquid and solid propellant character- and processing procedure. The exact percentages
istics are given in Table 3. of ingredients, even for a given propellant, such

Thrust termination
port (typical)

I!destruct Protective coating or


system insulation liner

FIG. 6. Typical solid propellant motor with case-bonded grain and other components.
...
~
'"

TABLE 3. TYPICAL PERFORMANCE OF SEVERAL CHEMICAL ROCKET PROPELLANTS.

Molecular
Theoretical Optimum Weight of
Specific Impulse Bulk Mixture Ratio Combustion Exhaust Gas Burning Rate
Propellant (1000 psi -+ 14.7) Specific Gravity (Oxidizer-Fuel) Temperature, F lb/mole in/sec

Cryogenic Liquid
Oxygen and kerosene 300 1.02 2.6 5800 22
High Energy Liquid
Fluorine and hydrogen 412 0.46 8.0 4700 9
Fluorine and hydrazine 365 1.31 1.3 7300 18
Oxygen and hydrogen 390 0.29 4.2 4500 8
Storable Liquid
Nitrogen tetroxide and
hydrazine 292 1.21 1.3 5100 22
Monopropellant (Liquid) ...,
Hydrogen peroxide 149 1.44 1365 34 t""

Hydrazine 245 1.0 1800 14.6 :;;


:t
Solid
...
."
:o:l
Double base solid 255 1.6 5300 24 .4 to .5 0
."
Composite solid 264 1.9 5500 24 .4 to .8 ~
t""
Composite double base 272 1.9 6600 21 .4 to .8 00
0
...,z
~
z
0
>
:::t"l
...>z
t""
00
FLIGHT PROPULSION FUNDAMENTALS 470

TABLE 4. REPRESENTATIVE SOLID PROPELLANT FORMULATIONS.*

Double Base Composite Composite Double Base

Ingredient Wt% Ingredient Wt% Ingredient Wt%

Nitrocellulose 51.5 Ammonium 70.0 Ammonium perchlorate 2004


perchlora te
Nitroglycerin 43.0 Aluminum powder 16.0 Aluminum powder 21.1
Diethylphthalate 3.2 Polybutadiene- 11.78 Nitrocellulose 21.9
acrylic acid-
acrylonitrile
Ethyl centralite 1.0 Epoxy curative 2.22 Nitroglycerin 29.0
Potassium sulfate 1.2 Triacetin 5.1
Carbon black <1% Stabilizers 2.5
Candelilla wax <1%

*Furnished by U.S. Air Force Rocket Propulsion Laboratory, Edwards, Calif.

as PBAN, will not only vary from one manu- Air-breathing Engines As shown by Figs. 7
facturer to another, but also often varies from to 10, there exists a variety of different types of
one motor application to the next. air-breathing propulsion engines. The range of
Compared to liquid propellant, the solid pro- performance values shown in Table 5 for dif-
pellant requires no pump or pressurization for ferent types of air-breathing engines is repre-
the fuel tank, and hence is mechanically simpler. sentative and does not correspond to data for
The combustion chamber with solid propellant specific engines. Each engine is optimized for a
is larger, especially for large rockets, and is fre- specific flight operating condition of speed
quently operated at higher pressure than for a (Mach number) and altitude. For example the
liquid engine. Solid rockets are simpler, more turbojet engine listed in Table 5 for a specific
storable, and are usually more immediately fuel consumption of 1 pound of thrust per
ready for use, but are generally lower in per- pound of fuel flow per hour operating at a Mach
formance when compared to liquid propellant number of M = 0.8, is a different engine from
rockets. the one operating at M = 3.0 with a fuel con-
In addition to producing a thrust force, solid sumption of 1.5; also it operates at different
and liquid propellant engines can be used also altitudes.
for producing auxiliary power and control In an aircraft or missile engine, the atmos-
torques to be applied to the vehicle. This latter pheric air is usually first compressed (by a
is called thrust vector control and basically it is mechanical rotary compressor or by a diffuser);
usually a mechanical means for altering the then heated by burning with fuel; sometimes
direction of the engine's thrust during flight. sent through a turbine (which provides the

INLET AND SUPERSONIC


DIFFUSER

FUEL INJECTOR

OBLIOUE SHOC~ CENTRAL - BODY HOUSING


ACCESSORIES

FIG. 7. Simplified schematic diagram of ram jet (reproduced by permission


from H. H. Koelle, "Handbook of Astronautical Engineering," New York,
McGraw-Hill Book Co., 1961).
471 FLIGHT PROPULSION FUNDAMENTALS

CQMPRESSOR- -r-- -

TUR81N

FIG. 8. Simplified schematic diagram of turbojet (reproduced by permission from H.


H. Koelle, "Handbook of Astronautical Engineering," New York, McGraw-Hill Book
Co., 1961).

rotary power for driving the compressor and an inlet spike. To maintain good efficiency and
accessories, such as electrical generators or hy- the desired airflow, some diffusers and nozzles
draulic pumps); and then ejected at high incorporate a variable wall contour or cross-
velocities (usually supersonic) through a nozzle. section geometry. In general, air-breathing en-
The process in air-breathing engines approaches gines have been well developed to a high state
'>n ideal thermodynamic cycle (such as the Otto of reliability and have given millions of hours
cycle for reciprocating engines or the Brayton of good service.
cycle for a turbojet) which limits their maxi- The available oxygen from the air limits the
mum theoretical efficiency. The inlet duct combustion process. For example, at constant
serves to scoop up the desired air mass flow flight speed, the thrust thus decreases with alti-
and to convert some of the kinetic energy of tude (or oxygen density) and below a combus-
the flow into pressure, thus reducing the tion pressure of approximately 3 psi, combus-
velocity. After heating the flow in a combustion tion is not easily sustained (flameout limit).
chamber, the reverse process occurs adiabati- Available high-temperature materials will set an
cally in the nozzle, where it is desired to attain upper limit to the maximum combustion tem-
a maximum exhaust velocity. The efficiency perature at approximately 1700 to 2400 oF. At
of energy conversion in the inlet duct, nozzle, high speeds and high altitude, the ram-compres-
combustors, compressors, and turbines is a very sion of the air causes its temperature to rise
important factor, and becomes a predominant substantially, so that the amount of energy
criterion at supersonic and hypersonic velocities, that can be added by combustion (without
when compression is achieved usually by a damaging turbine materials) is thus limited;
series of oblique shock waves commencing at also, special cooling provisions are required.

FIG. 9. Photo of 1-47 turbojet engine. (Courtesy of General Electric Company.)


'"l
~

c:i
..,::r:
~
~
o~
c:
~
00
::;
2
'"l
TABLE 5. TYPICAL DATA FOR AIR-BREATHING ENGINES. c:
2
Cruise Specific Fuel
o
>
Consumption or Thrust to Weight or :::t'l
Engine Type Flight Speed (mph) Altitude (feet) Specific Thrust Power to Wt. Ratio Typical Applications ..,2
>
~
Piston engine 0-400 mph 0-35000 0.37 to 0.52 lb/hp-hr 1.0 to 2.0 Transport aircraft 00
with propeller Small airplanes
Helicopters
Target drones
Turbojet 0-2000 0-100000 0.8 lb/lb-hr at M = 0.8 0.8 to 5.0 Bomber
1.5 lb/lb-hr at M = 3.0 (depending on Fighter
afterburner) Missiles
Transport aircraft
Target drones
Turbofan 0-600 0-40,000 0.5 lb/lb-hr at M = 0.1 3.5 to 5.5 Transport aircraft
0.65 lb/lb-hr at M = 0.9
Turboprop 0-450 0-35000 0.8 lb/hp-hr at M = 0.6 2.0 Fast transport
0.25 lb/hp-hr at M =0 aircraft
Small airplanes
Helicopters
Ramjet 1200 to 4200 0-120000 1.5 lb/lb-hr at M = 2 up to 20.0 Anti-aircraft missiles
2.0 lb/lb-hr at M = 6

...
....
....
473 FLIGHT PROPULSION FUNDAMENTALS

FIG. 10. Sectional photo ofCFU56-2 turbofan engine (Courtesy of General Electric Company).

The simplest air-breathing engine ~ a ramjet tion pressure at the distance of the earth from
(Fig. 7). It does not produce static thrust (at the sun is about 5 X 10- 7 kg/m 2 ). Although
zero flight speed, such as during takeoff) and this force is limited by being directed only
thus needs a rocket engine or some other en- "away" from the sun, solar "sailing" can pro-
gine to bring it to its minimum operating speed. vide low thrusts, and attitude control, turning
The piston engine with a propeller was the a spaceship completely around in a few hours.
very first engine to fly. It is the most economic No working fluid is carried in the vehicle.
engine for subsonic flight speed and is now used For the vehicle to carry its own light source,
mostly in small airplanes. The hot gases do a photon rocket engine of adequate thrust
work against a piston (not a turbine), which in would necessitate energies and techniques far
turn requires a crank mechanism to convert the beyond present capabilities.
reciprocating piston motion into shaft rotation.
The use of a variable pitch propeller and super- G.P.SUTTON
chargers for precompression of the inlet air D. M. Ross
further increases the economy.
The turbojet (Figs. 8 and 9) can be designed References
for a variety of speeds, altitudes and thrust
ranges. It often includes special design features 1. Sutton, G. P., and Ross, D. M., "Rocket Propulsion
such as afterburners and water injection to in- Elements," Fourth edition, New York, J. Wiley &
crease the thrust. Sons, 1976.
The turbofan (Fig. 10) followed the turbojet 2. Morgan, H. E., "Turbojet Fundamentals," Second
in development and utilizes the same design edition, New York, McGraw-Hill Book Co., 1958.
principles. It is widely used in cargo and pas- 3. "Janes' All the World's Aircraft, 1981-82," New
senger airplanes. Its advantages over a turbojet York, Janes' Publishing Inc., 1982.
include reduced fuel consumption, higher 4. Constant, E. W., "Origins of the Turbojet Revolu-
thrust to weight ratio and reduced noise-but tion," Baltimore, Johns Hopkins Univ. Press, 1980.
it is limited to subsonic flight speeds.
The fuel used is usually a narrow-cut petro- Cross-references: AERODYNAMICS, ASTRONAU·
leum refinery hydrocarbon product, having the TICS, CHEMISTRY, DYNAMICS, ELECTRIC PRO·
approximate formula of CH 1 .9S • PULSION, FLUID DYNAMICS, IMPULSE AND
Photon Propulsion. Solar sail engines rely on MOMENTUM, MECHANICS, STATICS, THERMO·
the reflection of photons from the sun (radia- DYNAMICS.
FLUID DYNAMICS 474

FLUID DYNAMICS information on situations in which electro-


magnetic forces exist, see MAGNETO-FLUID-
Fluid dynamics is study of the motion of mat- MECHANICS.
ter in the gas, liquid, plastic, or plasma state. In addition to internal forces (pressure,
When restricted to flow of incompressible (Le., stress) and external body forces (for purposes
constant density) fluids, it is called hydro- of this article, weight), additional "fictitious"
dynamics; when dealing with electrically con- body forces appear in those cases where an ac-
ducting fluids with magnetic fields present, it celerated frame of reference is used to describe
is called magneto-fluid-dynamics; when dealing the flow situation. Particularly common are
with practical problems of air flow past airplaneCoriolis and centrifugal forces which arise in
wings, through ventilating equipment, etc., a frame of reference rotating relative to an in-
it is called aerodynamics; when dealing with ertial frame. (See CORIOLIS EFFECT, DYNAM-
nonlinear materials such as wax and polymers, ICS.) A frame fixed to the earth has this prop-
it is called rheology. erty, but the fictitious forces can usually be
Fundamentally, two approaches to describ- ignored; the centrifugal force is combined
ing the motion of fluids are taken. The first, with the gravitational force to produce the
and most common, is continuum or field dy- ordinary measured weight (see FLUID STAT-
namics, the physical elements of which were ICS), and the Coriolis force is so small that it
stated in modern form by Stevin (about 1590), usually can be ignored. The Coriolis force is,
Torricelli and Pascal (1647), D. Bernoulli however, extremely important in fluid dynamics
(1738), Euler (1755), and Stokes (1845). Clari- of the atmosphere, which involves motion over
fication of the complex mathematical nature global scales and is the basis for long range
of the correct physical equations describing weather prediction.
the deformation and flow of matter has involved A fluid is, by definition, a substance in which
and continues to involve the most astute there are no shear stresses when at rest. All sub-
mathematicians. stances in motion have shear stresses acting in
The second approach is kinetic theory and addition to the normal stresses. In motion, the
normal stresses in a fluid are not equal in all
non equilibrium statistical mechanics. In this ap-
proach, matter is not thought of as a contin- directions, but the average of the normal stresses
uum or field, but as consisting of discrete is employed and called pressure as an extension
molecules moving in empty space (gas) or of the concept of pressure which arises in fluid
bound to one another by forces (condensed statics. Many fluids, e.g., tar, waxes, oils, honey,
matter-liquids and solids). In either case, the bread dough, and many synthetic polymers,
molecules are in continuous motion, either have both normal stress and shear stress compo-
colliding with other molecules or vibrating nents which depend on the recent deformation
about a "home position." This approach, dis- history as well as on the current rates of de-
cussed even in antiquity, was formulated in formation (see RHEOLOGY).
more modern form by D. Bernoulli (1738), The relations between the stress components
Clausius (1857), Maxwell and Boltzmann and the current strain components and rate of
(1860s) and Gibbs (1902), and continues to be strain components are extremely important in
developed and applied (see KINETIC THEORY). formulating the equations of fluid dynamics,
Continuum Dynamics In continuum dynam- elasticity, and rheology. The relations which
ics, fluid properties-namely, velocity com- exist for a particular substance determine, in
ponents u 1, U2, U3, density p, pressure p (morefact, whether the flow of the substance is to be
generally stress), temperature T, viscosity /J., described within the field of fluid dynamics,
conductivity, dielectric constant, electric field,
elasticity, or rheology. In fluid dynamics, the
electric current density, etc.-are assumed to bestress-deformation relation is assumed to be a
meaningful functions of three spatial variables simple linear dependence of each stress compo-
nent on the current rates-of-strain at the same
Xl, X2 , X3, and time t. The internalforces (pres-
sure, stress), the velocity components, and the point. Because the substance is also considered
density are usually unknowns to be calculated, to be isotropic (no crystal axes), these linear
while weight, dielectric constant and some otherrelations can be shown mathematically to in-
properties are presumed given. volve only two proportionality constants. The
Measurement of these physical quantities in a proportionality constants are called the first
moving field is important both for practical viscosity coefficient and the second viscosity
reasons and as the means for developing, the coefficient; except in very unusual situations,
working concepts of fluid dynamics. Examples the second viscosity coefficient is zero, so that
of methods for measuring fluid velocity com- it is common to consider the first viscosity co-
ponents Uj are given later in this article. For efficient as the viscosity /J.. A substance which
methods of measuring pressure, see FLUID adequately follows this simple linear stress-
STATICS. deformation relation is called a Newtonian
When electromagnetic body forces are present, substance (see VISCOSITY).
the situation is quite complicated, and we will With the linear relation between stress compo-
in this article consider only the simpler situa- nents and rate-of-strain components, the stress
tions where the only body force is weight. For components can be expressed in terms of the
475 FLUID DYNAMICS

pressure (i.e., the average of the normal an example being the adiabatic equation,
stresses), the viscosity, and the spatial deriva-
tives of the velocity components. Then the P (P)'1 (3)
basic dynamical equation of continuous media-
which equates the sum of all forces on an ele-
Po = Po
ment to the element's mass times its accelera- applying to an ideal gas. (r, the ratio of heat
tion-takes the following form, known as the capacities of the gas, is equal to 1.4 for air.)
Navier-Stokes equations: A variety of assumptions, usually involving the
first and second laws of thermodynamics and
thermodynamic equations of state of the sub-
stance, are needed in various situations.
Two further special assumptions are made in
formulating situations dealt with in a large
part of the fluid dynamics literature. One is
the assumption that the viscosity coefficient
J.l. is zero. The Navier-Stokes equations, then,
The Navier-Stokes equations are components with J.l. = 0, are called the Euler equations.
of a vector equation. The index notation used These equations are not really applicable to any
for subscripts is that an index i or j may be 1, fluid substance, but there are types of flows
2, or 3, representing components of ,:ectors where important characteristics are illustrated.
along axes 1, 2, or 3, respectively, and 1!1 any Such flows are said to be ideal fluid or in viscid
term where the same index occurs tWice, a fluid flows. The second special assumption is
sum of terms with that index taking suc- that the fluid is incompressible, i.e., has con-
cessively the values 1, 2, and 3 is implied. J.l. stant density.
is assumed to be a known and constant value These partial differential equations, even the
characteristic of the fluid, and gj are com- Euler equations, are nonlinear and have no
ponents of the known weight per unit mass, general solutions, even for the most restrictive
nominally 9.8 N kg- l downward. boundary conditions. Particular solutions are
The left-hand side of Eq. (1) represents the carried out for various idealized flows. I ,2 Ex-
acceleration component of the parcel of fluid at amples of particular solutions for selected geo-
the point Xl, X2, X3 at time t, and the right- metrical boundaries are described later in this
hand side is the force per unit volume divided article.
by the mass per unit volume. Kinetic Theory and Nonequilibrium Statistical
Two additional equations are needed to de- Mechanics In this approach, fluid properties
termine the five unknowns UI, U2, u3, P, P, are associated with averages of properties of
since the three equations of motion (with sub- microscopic entities. Density, for example is
script i taking on the values 1, 2, and 3) are the average number of molecules per unit
insufficient. volume times the mass per molecule. While
One of these needed equations is the so-called much of molecular theory in fluid dynamics
continuity equation, aims to interpret processes already adequately
ap a
- + - (pJ.l.j) =0. (2)
described by the continuum approach, addi-
tional properties and processes are presented.
at aXj The distribution of molecular velocities (i.e.,
how many molecules have any particular ve-
This equation, employing the same index nota- locity), time dependent adjustments of internal
tion, and based on the concept that p represents molecular motions, and momentum and energ~
the quantity of matter as well as the inertial transfer processes at boundaries are examples.
mass of the fluid, essentially states that matter See KINETIC THEORY and STATISTICAL
flowing into a fixed volume is equal to the time MECHANICS.
rate of increase of the matter in that volume. Examples of Flows In Fig. 1, the special
It is a kinematical statement. flow called Couette flow is indicated sche-
The fifth equation is, in various situations, matically. The flow is between parallel plates,
some form of thermodynamic equation of state, lower plate at y = 0 at rest, upper plate at YB

"Us U-
~~~~~~-~
-.--.
_'US

J'I_'I~i~ FIG. 1. Couette flow .


FLUID DYNAMICS 476

moving with constant speed UB in the x direc- variety of fluid motions are described as sound
tion. Stress throughout the fluid is constant, waves; when the small-motion or acoustic de-
given by Pxy = l1(du/dy) = I1(UB/YB). This is scription can be used the principle of super-
pure shear flow and experimentally is often position is valid. This powerful principle
considered to define and measure the viscosity allows addition of simple simultaneous motions
coefficient 11 assumed constant for the homo- to represent a more complex motion, such as
geneous fluid. The velocity profile appearing the sound reaching the audience from the
at the right in the figure shows by velocity ar- instruments of a symphony orchestra. The
rows of different length at the various positions superposition principle does not apply to large-
Y how the velocity varies with position. Steady scale (nonacoustical) motions, and the subject
flow (no dependence of any quantity on time), fluid dynamics (as distinct from acoustics)
constant pressure, constant density and laminar treats nonlinear flows, i.e., those which cannot
flow are additional assumptions for Couette be described as superpositions of other flows.
flow. The flow is realized experimentally by The description of small motions in a small
confining the fluid in the narrow annulus be- region of even a nonlinear flow is useful; at
tween rotating concentric cylinders of nearly each place in the flow there is a "local sound
equal radius; the cylinders rotate at different speed."
speeds. Since sound waves travel with a sound speed
In Fig. 2, the special flow is in a pipe of uni- relative to the fluid, waves moving in a moving
form cross section, pressure is assumed to be fluid can sometimes be carried off in a direc-
constant across each cross section but to vary tion opposite to the direction of sound travel.
linearly with distance x along axis of pipe so The flow where such a thing happens is called
dp/dx = (Pi - P2 )/L. Pistons driving the flow supersonic; the flow speed is greater than the
are assumed to be infinitely far away, so that sound speed at the spot where the flow is super-
the flow velocity, parallel to pipe axis, has the sonic. Supersonic flow occurs around high-
same dependence upon y and z for all x. The speed vehicles and missles, and in pipes when
velocity profile is parabolic in both the two- high pressure gas escapes through a constric-
dimensional case (infinite parallel plates) and in tion such as a partially open valve into a region
the circular cross-section case. Mean flow veloc- of sufficiently lower pressure.
ity Urn and viscosity coefficient 11 are assumed A steady supersonic flow always must pass
constant; the flow is assumed steady and lami- through a shock front to slow down to sub-
nar. For a circular cross-section pipe of radius a, sonic flow again. i ,4 A shock front is a surface
at any distance r from the center, u = 2urn(1 - of discontinuity separating fluid of high pres-
r2 /a 2 ), and the volume passing a cross section sure, high density, and high temperature from
per second is Q = 1Ta 2 Urn = 1Ta 4 (pi - P2 )/8p.L. lower pressure, density, and temperature fluid.
Since these formulas do not apply near pipe en- If the shock front is in motion, it is called a
trances, caution in applying them to pipes of shock wave. The component of fluid velocity
finite length is necessary even when the flow is perpendicular to the shock front is also dis-
steady and laminar. (See later discussion of tur- continuous, while the other components are
bulent and laminar flows.) continuous. See SHOCK WAVES.
Other examples of idealized solutions are one- The continuum description of flow fails to
dimensional flow of an ideal gas through a nor- describe nearly all actual flows because actual
mal shock wave, flow of an ideal gas without flows when looked at carefully are turbulent.
viscosity through a pipe of slowly changing cross Turbulent flows have violent and erratic fluctu~
section (wind tunnel), and one-dimensional ations of velocity and pressure which are not as-
finite waves in an ideal gas. Many other solutions sociated with any corresponding fluctuations of
involve making whatever approximations and the boundaries containing or driving the fluid.
assumptions are necessary to obtain descriptions Turbulence is generally considered to be the
of observed flows. manifestation of the nonlinear natur'e' of the
When motion of the fluid consists of only fundamental equations. Under certain condi-
small fluctuations about a state of near-rest, tions as mentioned earlier in describing Couette
the continuum equations are linearized by and Poiseuille flows, nonturbulent or laminar
neglecting nonlinear terms and become the flow exists. A common example is· cigarette
equations of acoustics. See ACOUSTICS. A large smoke rising from a cigarette held at rest; near

~ L

4~t~iJ:~,~);tl~it;w,\:~~I;~~I~~i~jl;jD ~tt
)
u
----
rf
-_.-
- --.....

...
FIG . 2. Poiseuille flow.
477 FLUID DYNAMICS

the cigarette, the stream is smooth and straight,


or laminar, and further up the flow breaks into
turbulence.
Reynolds showed that Poiseuille flow in a
T
h
pipe occurs when PUma/Jl is smaller than 2000.
The combination of variables is 4imensionless
and is called the pipe Reynolds number. Blood
-L
flow in capillaries is laminar, but water flow
in household pipes is turbulent unless the flow
is about that allowed by a leaky faucet or less.
Other types of flow have Reynolds numbers FIG.4. Discharge through a small hole in a bucket.
characterizing transition from laminar to turbu-
lent; for example a sphere falling in a fluid of
viscosity Jl obeys Stokes' Law
density) fluid from a small hole. For the cylin-
mg = 61raJlu drical bucket of water in Fig. 4, equate the sum
of the quantities in the Bernoulli formula at the
where u is the constant speed of fall, m is the top surface to the sum at the hole: Pl + pgh +
sphere mass, a its radius, and g the weight per o = P2 + 0 +! PU2 2. The pressure at both top
unit mass, but the law is obeyed only if the and bottom is atmospheric pressure P l; the
sphere Reynolds number pua/Jl is smaller than speed at the top is approximately zero because
about 1. the hole is considered to be very small. The pre-
Because of turbulence and viscosity, the very dicted speed of the emerging water is therefore
simple and useful Bernoulli formula is not valid; (2gh)ll2 regardless of the size of the hole (as
it can be derived as applying to a constant- long as the hole is small).
density fluid with zero viscosity in laminar flow. A valuable instrument in the form of a probe
However, under certain conditions, the formula for observing fluid speed is the Pitot tube; its
applies approximately even when the flow is operation is described by the Bernoulli formula.
turbulent, predicting properties within 5 to 20 A glass or metal tube with an open end points
per cent of the observed values. The Bernoulli into the flow, and the pressure difference flp
formula states between the stagnant fluid in the tube and the
moving fluid allows calculation of the fluid
p + pgh +! pu 2 = Silme constant in all places in speed at the place where the tube tip is inserted
the fluid by u = (2 flpfp)l/2 where p is the fluid mass
where p is the fluid pressure, p is the fluid mass density. When observing air speed, the pressure
density (which must be treated as constant), u is difference flp is easily measured by connecting
the fluid speed, h is the vertical height above the open ended tube via rubber hose to a glass
some convenient reference level, and g is the U-tube water manometer. Errors as much as 50
weight per unit mass. per cent may easily occur in various practical
When combined with the equation of conti- situations, but order-of-magnitude measure-
nuity, the Bernoulli formula gives a simple de- ments at least are usually possible.
scription of the Venturi, used in the automobile The Pitot tube is the only simple instrument
carburetor (see Fig. 3). Continuity states ulA 1 = for measuring fluid velocity at a point. It is
u2A2 = volume crossing any cross section of the useful only in certain steady flows (no de-
pipe per second. Ul fU2 =A2 fA 1 is small so Ul pendence of any quantity on time) and the de-
is smaller than U2' The Bernoulli formula states gree of its accuracy is hard to determine in a
Pl+!pUl2=P2+!PU22, and since Ul is given situation. Absolute velocity measure-
smaller than uz, Pl is larger than P2' The atmo- ment is possible only by observing the velocity
spheric pressure Pl pushes the liquid up into the of tracer particles by optical methods. s Many
lower pressure region P2 . other velocity measuring instruments are cali-
Another common situation descfloed by the brated, either by comparison to tracer velocity
Bernoulli formula is the discharge of (constant measurements or by towing the instrument at
a known speed through the fluid at rest. A
tracer must be small, particularly if its density
differs from that of the fluid, so that it moves
--u I. exactly as the fluid surrounding it is moving.
'1tmosphet6 Usable tracers are liquid droplets or solid par-
oi, A{ ticulates in gases, and gas bubbles or solid par-
inlet ticulates in liquids. Tracer dimensions are usu-
ally 1 to 100 micrometers, the smaller ones
liquid being required when the velocity being measured
fuel is ch,mging spatially or temporally. Observation
of the tracer is carried out either by photography
FIG. 3. Pipe flow with a constriction; carburetor with repetitive flash illumination (called chrono-
employing Venturi. photography) or by Doppler methods.
FLUID DYNAMICS 478

The laser Doppler velocimeter has had many at small scales, and in rapid succession, is
applications in recent years. s In this instru- carried out by optical means. For example,
ment light is scattered by tracers from two co- intensity of Raman components of scattered
herent laser beams which cross one another in a laser light can be analyzed to determine tem-
chosen small volume inside the moving fluid; perature, density and even species concentra-
the fluid carries micron-size tracer particles and tion. Pressure at walls of channels or at surfaces
the scattered light from them in the inter- of inserted objects is measured by "wall taps"
section volume is gathered by a lens on to a or by miniature diaphragm or piezoelectric
photomultiplier. If the direction of the velocity gages. A "wall tap" is a small hole at the point
of the tracer is known relative to the bisector on the surface where the pressure is to be
of the directions of the laser beams, the mea- measured which connects with a pressure gage
sured frequency of oscillation of the photo- by a small channel filled with a static fluid. s
multiplier current gives the speed v of the tracer
by means of the formula: RAYMOND J. EMRICH

VD AO References
vb =V cos e =-----''-- 1. Landau, L. D., and Lifshitz, E. M., "Fluid Mechan-
2 sin (0//2)
ics," London, Pergamon Press, 1959.
e
where is the angle between the bisector of the
2. Lamb, H., "Hydrodynamics," First American
Edition, New York, Dover Publications, 1945.
incoming laser beams and the tracer velocity 3. Hirschfelder, J., Curtiss, C. F., and Bird, R. B.,
v, AO is the wavelength of the light in the fluid "Molecular Theory of Gases and Liquids," New
medium, 0/ is the angle between the incoming York, John Wiley & Sons, 1954.
laser beams, and VD is the measured frequency 4. Gaydon, A. G., and Hurle, I. R., "The Shock Tube
of the photomultiplier signal, called the Doppler
in High-Temperature Chemical Physics," New York,
frequency. The quantity determined is the mag- Van Nostrand Reinhold, 1963.
nitude of one component of the velocity; the
5. Emrich, R. J. (Ed.), "Fluid Dynamics," Vol. 18 of
sense (sign) of the component is not mea-
series "Methods of Experimental Physics," New
sured but must be learned from other informa-
tion. The electronic equipment to measure and York, Academic Press, 1981.
display or record the Doppler frequency is
Cross·references: COMPRESSIBILITY, GAS; FLUID
rather elaborate, but available commercially.
STATICS; GAS LAWS; KINETIC THEORY; LIQUID
Fluid speed as a function of position is mea-
STATE; MAGNETO·FLUID·MECHANICS; STATIS·
sured by moving the optical system mechanically TICAL MECHANICS.
about in the fluid field. The intersection volume
is ordinarily small-much less than a cubic
millimeter.
Measurement of fluid velocity at small scales
and in small times is the most promising method FLUID STATICS
for study of turbulence. The laser Doppler
velocimeter can measure the speeds of succes- The definition of a fluid-in distinction to a
sive tracer particles passing through a small solid-is a substance in which there are no shear
measuring volume in times as close together as stresses when it is at rest. A fluid is an infi-
1 ms, but the measurement is not very accurate. nitely slippery material. In reality, a substance
Another instrument which must be calibrated, is considered a fluid if shear stresses in it vanish
the hot-wire anemometer, is used to assemble so rapidly when motion ceases that, for times
more accurate data collected in an air or water under consideration, it may be considered shear-
stream. The hot-wire anemometer is a probe free. Substances like tar and glass require very
consisting of two steel prongs supporting at long times for shear stresses to relax, in com-
their ends a tungsten wire of diameter 2-5 parison with fluids like water and gasoline. The
fJ.m and length 1-3 mm carrying an electric cur- sense that everyone has of the difference be-
rent which would heat it white-hot except for tween a fluid and a solid is more clearly felt
the cooling produced by the fluid passing over than is the nature of internal forces (stress) be-
it. Its resistance is related to its temperature, tween the parts of the substance. Shear stresses
which in turn is related to the fluid velocity are the internal forces opposing sliding of one
component perpendicular to the wire. Thus, part of a substance past the neighboring parts.
after it is calibrated, its resistance as a function All real fluids in motion do have shear stresses
of time can be interpreted as a time record of (see FLUID DYNAMICS), and solids also have
the stream speed in which it is immersed. At shear stresses which help maintain their solid
larger scales than millimeters, data on the suc- shape when weight and other forces act on them
cessive positions of injected dye spots or hydro- (see MECHANICAL PROPERTIES OF SOLIDS).
gen bubbles in water are collected by cinema- Historically the complexity of internal forces
tography using television recorders. introduced great confusion (from our modern
Measurement of other fluid variables-density, viewpoint) into the fundamental principles of
composition, and temperature-with precision, mechanics. Principles applying to fluids at rest
479 FLUID STATICS

were clarified by Torricelli and by Pascal whose forces acting-weight and electromagnetic forces
experiments in 1647 settled the question of the can act also. Both these are forces per unit vol-
existence of an absolute vacuum and of weight ume and are called body forces, in distinction
of the air. Sir Isaac Newton devoted over one to stresses which are surface forces or contact
quarter of the famous Principia to analysis of forces.
fluids (Book II, 1713); his principles are more In a fluid the only forces acting (see, how-
subtle in application to fluids, and Newton's ever, "surface tension," described below) are
contributions to fluid mechanics are seldom weight, electromagnetic forces, and pressure.
quoted today. Probably Euler was the first to For a volume element at rest the sum of these
give, in 1755, a clear idea of the complex rela- must be zero:
tion of the internal forces (stress and body
forces) and the motion of fluids and solids. pg+JX B+PeIE-\lp=O. (1)
The concept of pressure, and methods of Here P is density (mass per unit volume), g is
measuring pressure of gases, soon led to the weight per unit mass, J is electric current per
discovery of Boyle's law (1662), and eventually unit area perpendicular to J, B is magnetic field,
to the clarification of the concept of tempera- Pel is electric charge per unit volume, and E is
ture through its measurement by the gas electric field. In a dielectric liquid, Pel can in-
thermometer. clude polarization charge and, in a magnetic
Meaning of Pressure If we consider a surface liquid, J may include ampere an currents repre-
within a substance we expect that material on senting magnetic polarization. For more infor-
one side will be pulling material on the other mation on the nature of the electromagnetic
side with a force having components both nor- force, see MAGNETO-FLUID-MECHANICS.
mal and tangential to that surface. In a static Eq. (1) is the basic equation of fluid statics; it
fluid, however, the tangential component must is valid at all interior points.
be zero; otherwise the material would have a In all the foregoing, it is assumed that "rest"
shear stress and would not be called a fluid. is reckoned relative to an inertial frame of ref-
Pascal realized that at a given place in the fluid, erence. In a frame of reference translating or
if a surface with a different orientation is con- rotating relative to an inertial frame, additional
sidered, the force per unit area on it is again
perpendicular to that surface and has the same "fictitious" body forces must be added to Eq.
magnitude as the force per unit area on the (1) (see CORIOLIS EFFECT, DYNAMICS).
original surface. (This is not true for a solid ex- An additional force called surface tension is
cept for very special cases of loading.) The mag- evident at interfaces between liquids which do
nitude of the force per unit area for all these not mix (e.g., oil and water), between liquids
orientations at the same point is called the pres- and vapors, and between liquids and solids (see
sure at that point. The strange property of a SURFACE TENSION). This force plays the role
force per unit area, which is a vector, having the of a boundary condition that must be satisfied
same value at a point for all orientations seems when Eq. (1) is integrated over an interface in a
to give it the property of a scalar. Actually, fluid statics problem. Across a curved interface,
since the property of a tensor is to relate one with no electric or magnetic field present, the
vector to another vector , the stress tensor for following relation between the surface tension
a static fluid is expressed by the negative of the S, the radii of curvature R I and R 2 , and the
pressure times the unit tensor at all points. This pressure jump PI - P2 over the surface exists at
each point on the surface:

.
expresses the force per unit area as perpendic-
ular to whatever surface is considered at the
point. (In general in a solid or in a moving fluid,
the force per unit area is related to the vector
PI - P2 =S (~I + ~J (2)
giving the orientation of the surface by the The surface tension S has units of force per unit
stress tensor. The vectors have different direc- length in the surface, and is a joint property of
tions, and all six components of the stress ten- the two fluids . It depends markedly on temper-
sor have their own dependence on spatial posi- ature and, for a liquid and its vapor, becomes
tion.) In a static fluid, only the magnitude of zero at the critical temperature of the substance.
the force per unit area varies from point to The surface tension still exists in a case of a
point, and the pressure gradient \lp can be plane interface (it is a force which will do work
shown to be equal to a force per unit volume in in a quasistatic deformation of the fluids), but
the direction in which the pressure decreases no pressure difference exists across the plane
most rapidly. interface. This follows from the nature of pres-
Stress components are positive when the sure as normal force per unit area. At a dielec-
neighboring material is pulling, and negative tric boundary, with electric or magnetic fields
when pushing: tension (+) and compression (- ). present, an additional contribution to the pres-
Pressure is, however, treated as positive when sure discontinuity given by Eq. (2) exists. At a
pushing, since in gases tension never occurs and liquid-vapor boundary, the additional contribu-
in liquids tension seldom occurs. tion is known as electrostriction. I
Body Forces and the Law of Fluid Statics Equation (1) is not only the fundamental ex-
Stresses on a sample of matter are not the only pression of the meaning of pressure; it is also
FLUID STATICS 480

the basis for the measurement of pressure. Be- ecules before reaching a distant wall, and if the
fore describing methods of pressure measure- molecules with which they collide have in a
ment, however, we will look at the meaning of certain measure the density and velocity dis-
pressure from a molecular viewpoint. tribution characteristic of the point, it is
Molecular Models of Pressure 2 Pressure is thought that the reverse flux of molecules may
nearly always used as one of the thermodynamic also have the local characteristics. (Again it is
variables in equations of state. Our extensive necessary, in making this statement, to limit
knowledge of properties of substances, as com- ourselves to cases of no net flow of mass
piled in thermodynamic tables, provides a through the area.)
wealth of data for testing molecular models of This justifies, to some extent, the application
liquids and gases, as well as practical informa- of an equation of state locally within a fluid
tion for engineering design. The model em- with gradients of density, temperature, and pres-
ployed in the kinetic theory of gases has been sure. It is apparent that only within volume ele-
so successful that there is a tendency on the ments larger than about one mean free path
part of some to accept this molecular model's cubed can an equation of state have meaning in
view of pressure as "real" and universally ap- this model.
plicable. It is however restricted to gases, and Liquid state models suppose the molecules to
models of liquids are less successful and their be in close contact with one another, as in a
view of pressure is somewhat more complicated. solid, but over a range of several molecular
The kinetic theory of gases, rooted in ideas diameters the forces between molecules are
going back at least as far as Daniel Bernoulli such that long range order is not maintained as
(1738), pictures matter in gaseous form as con- in a crystal. The picture of pressure in this
sisting of isolated molecules flying about in a model is quite similar to the picture of stress in
force-free vacuum and making occasional col- a solid, and the absence of shear stresses in
lisions with one another and with solid contain- liquids at rest is actually only a qualified one in
ing walls (see KINETIC THEORY). At a hypo- that the time for a shear deformation to relax
thetical small surface within the gas, there is is shorter than some time considered to be of
nothing most of the time, but occasionally a interest. The substance is called a glass if the
molecule flies through the surface unimpeded. relaxation time is longer. (See LIQUID STATE.)
The model assumes such events to be random In statistical models of a monatomic sub-
and uncorrelated, but to have a distribution of stance such as argon, the pressure is defined 3 as
directions and times between occurrences. The a sum of two terms, one a kinetic term equiv-
model further assumes that the number of alent to the momentum flux of molecules
molecules flying through a surface per unit time through a hypothetical area in empty space as
and in one sense is proportional to the area con- described above for the kinetic theory of gases,
sidered, and that the num ber per unit area, per and a collisional transfer of momentum term
unit time, is the same for all orientations at any representing the ever-present forces between
one position. This is the case, however, only for adjacent pairs of molecules. The microscopic
fluid statics-not fluid dynamics-that is, there pressure tensor tij (i.e., the negative of the
must be no net transfer of mass through the stress tensor) for isotropic molecules such as
area (no mean motion.) argon can be expressed as
If the position of the surface is at or near and
parallel to one of the solid boundaries, from
which the molecules are considered to rebound, tij =L { POI.iPOI.j
the equal flux of molecules into the surface and 01. m
back out is associated with an average force per
unit area applied to the wall. This average force
per unit area at the wall, which is equal to twice + -I .t......
" (qR· 1'1
- qOl..) OUOI.{j}
-
J oq.
-
2 {3 01.1
the normal component of momentum flux
through the surface in one sense, is then con-
sidered to be the pressure at the wall, and the where the subscripts i and j take on the values
flux through a surface not at a wall for which I, 2, 3 representing coordinate axes, and the
there is likewise an equal flux in the other sense, subscripts 0: and {3 are numbers characterizing
is interpreted as the pressure within the gas. In each of the molecules in the system so that the
this way, a meaning is given to "normal force sums are over all the molecules and all pairs of
per unit area" at any point within the gas, and molecules. The momentum components of the
if the density or velocity distribution of mole- molecules are denoted by POI.i, the mass of each
cules varies from place to place, twice the flux by m, the separations of two molecules by
of normal component of momentum in one (q{3j - qOl.j), and the forces between them by
sense through a hypothetical surface, namely (oUOI.{3/oqod). The delta function 8(qOl. - x) re-
the pressure, will vary from place to place as stricts the sum to only those molecules lying
well. Since molecules actually have some size in within a volume cell denoted by position x in
the model, those passing through a hypothetical space. The macroscopic pressure tensor is ob-
surface at a point may collide with other mol- tained when tij is averaged over the equilibrium
481 FLUID STATICS

statistical distribution of all coordinates qa and the atmosphere, the arrangement is called a
momenta Pa. barometer. When the pressure in some reservoir
The above pressure tensor expression implies is measured relative to the ambient atmosphere,
that the thermodynamic and mechanical vari- one speaks of the pressure difference as the
able pressure has the same meaning in a vapor "gage pressure" of the reservoir.
and in a liquid; the first, kinetic term predom- For accurate measurement, the liquid must
inates in a vapor, and the second, collisional not evaporate; a "low vapor pressure" liquid
transfer of momentum term predominates in a such as mercury or silicone oil is commonly
condensed liquid, so that the expression pro- used.
vides a continuous pressure across a plane inter- If the vertical height of either reservoir is ap-
face (when there is no net transfer of matter preciably different from the corresponding
and static equilibrium prevails). Because of liquid-gas surface in the manometer, a correc-
mathematical complications associated with the tion for the variation in pressure with height in
statistical mechanics of many body systems, the the gas must be made. An equation similar to
consequences of the model have not yet been Eq. (3) is used, but p is then the gas density.
fully worked out. The picture of pressure in the A more likely correction will be necessary
model is however clear. due to different diameters of the glass tubing
Measurement of Pressure Two basic instru- on the two sides and consequent differences in
ments are used to measure pressure, and all the curvatures of the meniscuses. This latter
other pressure gages are calibrated by these. correction is minimized by using very large di-
U-Tube Manometer. Two columns of liquid ameter tubing. Tubing of I cm radius is needed
partly filling a piece of glass tubing bent in to a when the liquid is mercury, and accurate mea-
U-shape with hoses connecting to two reservoirs surement of h by interferometry is used.
in which the pressure is to be measured are seen Use of the manometer is so common that
frequently in laboratories. The liquid typically pressure units stated as h and the name of the
has a density 10 3 times the density of the gas liquid employed in the manometer have be-
which is connecting the manometer to the res- come common. Table I lists some of the be-
ervoirs. The manometer equation wildering variety of pressure units found in the
literature. The value of each unit is given in
terms of the SI unit Pa. The table permits pres-
sure units to be converted among each other, as
gives the pressure differences P2 - PI between well as to the standard SI unit.
the reservoirs, where p is the mass density of Piston and Cylinder Gage. The other abso-
the liquid in the manometer, g is the weight per lute pressure measuring instrument uses a tight-
unit mass (nominally 9.8 N kg-I), and Z I and fitting piston in a precision bore cylinder. It is
Z2 are the vertical coordinates of the respective the basic standard for measurements above
surfaces between liquid and gas on the two about 100 kPa and even within the range of
sides of the U-tube. Eq. (3) is derived by integra- usefulness of the U-tube manometer it is used
tion of Eq. (I) along a path through the liquid, to calibrate other gages. Also called a dead-
under the assumptions that p is constant (ho- weight gage and a gage tester, it is seldom seen
mogeneous liquid) and that the liquid-gas sur- outside a standards laboratory because its oper-
faces are either plane or have equal curvatures. ation requires skill, care, patience, and condi-
If absolute pressure is to be measured in one tions of cleanliness.
reservoir, the pressure must be maintained at The vertical piston supports loads of known
zero in the other, usually by a vacuum pump. weights, and a liquid (usually oil) or a gas (ni-
When used to measure the am bient pressure in trogen or air) in the cylinder below the piston,

TABLE 1. PRESSURE UNlTS.*

Name of Unit Value Name of Unit Value

1 kgf/cm 2 98.07 kPa 1 in. Hg 3.38 kPa


1 dyne/cm 2 0.1 Pa 1 in. H 2 O 249 Pa
1 bar 100 kPa 1 mm Hg (Torr) 133.3 Pa
1 atm 101.3 kPa 1 mm H 2 O 9.81 Pa
o acoustic decibel 10 Pa (transducers) 1 Ib/ft 2 47.9 Pa
(reference level)* * or
20 ILPa (hearing) 1 Ib/in 2 (psi) 6.895 kPa

·The SI unit of pressure is the pascal (symbol Pa); 1 Pa = 1 N m -'.


··Sound pressure level, in decibels, is 20 times the logarithm to the base 10 of the ratio of the sound
pressure to the reference pressure. The reference pressure should be explicitly stated; if it is not, the
levels noted in the table are often used. Also, unless otherwise explicitly stated, it is understood that
the sound pressure is the rms pressure change.
FLUID STATICS 482

communicating with a gage to be calibrated, is Deductions from the General Principles of


compressed by a hand pump until the piston Fluid Statics We have stated the elementary
and loads rise a short distance. As the liquid or basis of fluid statics along with the meanings of
gas slowly leaks out through the small piston pressure and surface tension in Eqs. (1) and (2).
and cylinder clearance, the pressure is main- Some traditional wisdom now derivable from
tained. The effective area of the piston and these equations preceded their statement, such
cylinder has been determined in principle by a as "water always runs downhill," "water seeks
metrological microscope in a standards bureau. its own level," and "a body is buoyed up by a
Piston and cylinder gages are obtainable from force equal to the weight of the displaced fluid"
commercial manufacturers, who supply the (Archimedes' principle). Let us derive some
effective area by comparing gages with gages general conclusions from the laws of statics,
built and maintained by national standards namely from Eqs. (1) and (2).
bureaus. Pressure Variation with Depth. We have al-
With care, pressures can be measured with ready, in discussing the manometer, established
errors as small as 0.01% in ranges from 100 Pa the variation of pressure with depth in lakes
to 100 MPa (1 millibar to 1000 bars). and in the oceans. This is Eq. (3), valid because
Other Gages. The common gages with a the density of water in the ocean varies so little
pointer indicating on a circular dial contain a with depth, and because even at the greatest
metal chamber which deforms with the applied depths, the distance from the center of the
pressure. By combining metallurgical art with earth has changed by such a small fraction that
mechanical art to translate the motion of the g may be considered practically constant.
deforming part into rotation of a needle on a Hydraulic and Pneumatic Actuating Systems.
dial without backlash, manufacturers have been Provision for application of a force or forces at
able to supply gages with a wide variety of distant sites by means of a fluid in tubing con·
operating ranges, compensated for temperature nected to piston-and·cylinder arrangements is
change, and retaining precision on the order of very common. Movement of the control piston
0.1 % of the range. increases the pressure in the fluid and the in-
Mechanical distortion of a thin diaphragm crease in pressure occurs almost immediately
held rigidly at its edges is another effect widely (transmitted at the speed of sound in the fluid)
used for pressure measurement. Sensing of everywhere in the fluid. This uniform pressure
diaphragm deflection by capacitatively coupling increase is sometimes called Pascal's law and
it to a fixed electrode and conversion of the ca- follows from Eq. (1). The aspect of such ar-
pacitance change to an electrical signal has been rangements which causes the most technical
developed to form a gage of high sensitivity and trouble is that the fluid finds the smallest holes
reliability. Such gages are the best way of mea- or cracks in the containing system and leaks
suring pressure in the range below 10 Pa, and out. Sealing of pistons, valves and other moving
are often found on vacuum systems. Their out- parts with gaskets introduces friction, so that
put is assumed to be a linear function of pres- the computed forces are not precisely found
sure, so that, with points at the ends of a in practice. By choosing the size (area) of the
straight line on a graph determined at vacuum piston-and-cylinder, the magnitude of force to
and at 10 Pa (by means of a piston and cylinder be delivered can be chosen. An extremely large
gage) to an accuracy of 0.1%, a pressure scale in force is achieved by this means in the hydraulic
the range 0-10 Pa is defined. press, which allows forces to be "magnified,"
Minute diaphragms formed in silicon chips by and by means of mechanical valves (in a pump)
integrated circuit etching techniques are mass to produce significant displacements of the
produced and employ deflection sensors of dif- large force.
fused dopant in the diaphragm itself. The sen- Surface of Water at Rest Is Horizontal. The
sors are electrical resistance paths which change existence of sea level on the earth, combined
their resistance when stretched. While quite sen- with the detectable curvature of the sea surface
sitive to temperature as well as to pressure, their over several kilometers, follows from the ab-
small size, adaptability to electrical signal gen- sence of all body forces in Eq. (1) except
eration, and cheapness make them useful in weight. Weight, as represented by the common
processing equipment and in other systems. g, is the resultant of the gravitational force in
Their accuracy is less than that of the other the nearly spherical earth and the local centrif-
gages described. ugal force due to the earth's rotation. (Centrif-
Many other devices capable of calibration are ugal force is the fictitious force introduced by
used for pressure measurement. 4 It is apparent describing pressure on the earth in a rotating
that pressure cannot be measured in the interior frame of reference.) Due to the oblate shape of
of a fluid but only at a solid surface bounding the earth, as well as due to the centrifugal force,
the fluid. Sometimes a probe with a pressure g is not directed exactly toward the earth's cen-
sensitive element is inserted into a fluid to at- ter. The meaning of "vertical" is given by the di-
tempt a measurement within, but there is no rection of g. That the surface of a liquid is
assurance that the pressure is not changed by perpendicular to g, i.e., "horizontal," follows
inserting the probe. A "contactless" pressure from the vector nature of Eq. (1) and the
measurement is impossible. knowledge that the surface is one of constant
483 FLUID STATICS

pressure. Since g changes in direction as we con-


sider various locations on the earth, the surface
of constant pressure, i.e., sea level, also changes
direction, giving the nearly spherical shape of
the oceans' surfaces. So long as no solid body-
one able to maintain shear stresses-is between
water in a stream and water in the ocean, the
water will move to form the static surface at
atmospheric pressure which is the surface at sea
level. Thus water runs down until it gets to the
ocean, which is the final static equilibrium state.
Capillary Rise of Liquid. That sap rises to the
top of the tree-well above sea level-is under-
stood by Eq. (2) and the knowledge that the
pressure is reduced below the atmospheric pres-
sure by the curvature of the meniscus in the sap
column.
Archimedes' Principle. That an immersed
body is buoyed up by a force equal to the
weight of the displaced fluid follows readily
from Eq. (I), the assumption being that the
only body force is weight. The pressure over
the surface of the body varies with depth and is
larger underneath than on top; the pressure at
opposite sides is the same at the same depth, so FIG. 1. Archimedes' principle.
the sideways forces pushing on the body are op-
posite and there is no net sideways force. Equa-
tion (I) with weight g as the only body force
takes the form dp
- =-pg ( 4')
pg - 'Vp = 0, (4) dz
where both terms are forces per unit volume. and the adiabatic equation of state for air
We divide the immersed body into many im-
aginary vertical prisms, one of which is shown
in Fig. I, with cross sectional area A and height P)(l-'Y)h
T= To ( - , (6)
h. The net upward force on the top and bottom Po
surfaces of the prism is 'Vph times A and is, ac-
cording to Eq. (4), identical to the weight of where 'Y is the ratio of heat capacities at con-
fluid in the prism of cross section A and height stant pressure and at constant volume, and is a
h, namely, pghA. This is true for each ~rism constant equal to 1.4 for air, and To and Po are
and we can add the results for all the pnsms, the temperature and pressure at any reference
thus deriving Archimedes' principle. height Zo.
Criterion for Unstable Atmosphere. Air is If a parcel of air at height Zo should undergo
compressible, and over heights of hundreds of a displacement to a different height Zo + dz it
meters the density of the atmosphere changes may find itself surrounded by air at a different
appreciably. Mixing of the air at levels up to temperature and density from its own tempera-
about 400 m above the ground is important in ture and density. If it has not changed its den-
predicting whether rain will occur and whether sity by the correct amount, it will not be in
pollutants inserted near the ground will stay equilibrium and will rise or fall further (un-
there or be diluted as unstable air mixes. The stable), or will be pushed back to its initial
criterion for stability is that the temperature height (stable). The amount its pressure has
T increase with height z at a rate greater than changed is given by Eq. (4') and the amount
dT g its temperature has changed is then given by
-=-- (5) Eq. (6). The criterion for stability is
dz Cp '
the so-called adiabatic lapse rate. The heat ca- dT = f.
dT) dz = (dT) (d P ) dz
pacity per unit mass at constant pressure, ~P' \dz 0 dp 0 dz 0
is a property of air: 1010 J kg- 1 K-l. EquatlOn
(5) is derived from the basic equation of fluid
statics, with weight the only body force, = (dT) (_ Pog) dz. (7)
namely Eq. (4), in the form* dp 0

*The sense of the vector g is down, whereas z is posi- Using Eq. (6) and the ideal gas equation of state
tive upward. in the form
FLUID STATICS 484

Po known as the Fourier series representation of


= --...:.....:'----
Po
1-
Cp ( -r- To
r) (8) f(x). If the left and right hand sides of Eq. (2)
are multiplied by cos 2nmxjL or by sin 2nmxjL,
and the resulting equation is integrated with re-
spect to x over one period L, the orthogonality
Equation (5) for the adiabatic lapse rate is of the harmonic sine and cosine components on
obtained. this interval implies the following formulas for
the Fourier coefficients ao, an, b n :

J
RAYMOND J . EMRICH
L 2
References ao = -1 / f(x)dx

f
1. Landau, L. D., and Lifshitz, E. M., "Electrodynam- L -L/2
ics of Continuous Media," Oxford, Pergamon Press,
1960, pp. 64-69. 2 L/2 2nnx
2. Hirschfelder, 1. 0., Curtiss, C. F., and Bird, R. B., an = - f(x) cos - - dx n = 1,2,· ..
"Molecular Theory of Gases and Liquids," New L -L/2 L
York, John Wiley & Sons, Inc., 1954, pp. 694 et
seq. 2 {L/2 2nnx
3. Irving, J. H., and Kirkwood, John G., J. Chern. bn = - f(x) sin - - d x n = 1,2,· ..
Phys. 18, 817 (1950). L -L/2 L
4. Soloukhin, R. I., Curtis, C. W., and Emrich, R. J.,
"Measurement of Pressure," in "Methods of Ex- (3)
perimental Physics-Fluid Dynamics" (R. J.Emrich,
Ed.), Vol. 18, New York, Academic Press, 1981, Other equivalent forms of the Fourier series
pp.499-515. can be obtained from Eq. (2). For example, f(x)
may be written
Cross-references: COMPRESSIBILITY, GAS; FLUID
DYNAMICS ; FLUID MECHANICS; GAS LAWS;
KINETIC THEORY ; LIQUID STATE. f(x)=ao + L~ C n cos (-
2-n-
nx¢>n) (4)
n=l L
where
Cn = yan2 + b n 2

FOURIER ANALYSIS bn
¢>n = tan-! - . (5)
Fourier analysis is the mathematical representa- an
tion of functions as linear combinations of sine,
cosine, or complex exponential harmonic com- Another widely used representation is the ex-
ponents. Such representations take their name ponential (or complex) form of the Fourier
from Jean Baptiste Fourier (1768-1830), series,
French mathematician and physicist, who used
them extensively in developing a mathematical L
~

f(x) = cxnei(2rrnx/L) (6)


theory of heat conduction. They continue to
be widely used in mathematical physics and en- n=-oo
gineering, finding application in the study of

Iii
where
such diverse subjects as diffraction, diffusion,
image formation, spectroscopy, electrical net-
works, x-ray crystallography, and the theory of L/2
probability.
CXn =- (an - ibn) =- f(x)e- i(2rrnx/L) dx.
2 L -L/2
Fourier Series Let f(x) be a periodic function
with period L, i.e.,[(x) satisfies (7)
f(x) = f(x - L) (1) For a real-valued f(x), cx+ n and cx- n are not in-
dependent, since cx- n = cx+ n *, where * indicates
for all x. For a wide class of such functions, we complex conjugate. However, when f(x) is
can represent f(x) by an infinite summation of complex-valued, cx+ n and cx- n are in general not
sine and cosine harmonic components related .
An important property of Fourier series,
2nnx 2nnx known as Parse val's theorem, states that the
f(x) =ao + L
~
an cos --+ L~
bn sin - - , mean quadratic content of f(x) (which often
has the physical interpretation of average power)
n=l L n=l L
may be found by summing the squares of the
(2) Fourier coefficients. Thus, for the three repre-
485 FOURIER ANAL YSIS

sentations used,
F(v) = 2 J o
00 [(x) cos 21TVX dx. (12)

J
Other forms of the Fourier integral and
Fourier transform are often found in the litera-
1 Cn 2 ture. For example, frequency v is often replaced
-
L/2
1[(x)12 dx = a0 2 + L00

by angular frequency w = 21TV, in which case


L -L/2 n=l 2 Eqs. (9) and (10) become

(8)
[(x) = -
1 f 00
F(w)e'WX dw

i:
2n _00

No discussion of Fourier series would be com-


plete without some mention of the conditions F(w) = [(x)e- iwx dx. (13)
under which the series actually converge to the
functions they are to represent. Conditions
which are both necessary and sufficient are not Alternatively, the forms

J
known. However, for the series to converge to
[(x) at each point of continuity, it is known to
1
be a sufficient condition that [(x) have only a [(x) = . r;:;::: F(w)e iwx dw
00

finite number of maxima and minima in the in- V 21T -00

f
terval - L /2 EO;; x EO;; L /2. Under this same condi-
tion, at points of discontinuity of [(x) the series
converges to the arithmetic mean of the values F(w) = -1- 00
[(x)e- iwx dx (14)
of [(x) immediately to the left and right of the
discontinuity.
...n:rr -00

Fourier Integrals Just as a periodic function are sometimes found. Here we shall continue to
can be represented by the sum of harmonic use the forms of equations (9) and.( 10).
components in a Fourier series, so too an aperi- As in the case of Fourier series, the Fourier
odic function may be represented by an integral integral representation will converge to [(x)

r:
over a continuous spectrum of complex-expo- only for a certain class of functions. Conver-
nential components. In this case we write gence at points of continuity is assured if [(x)
satisfies the following set of sufficient condi-
tions:
[(x) = F(v)ei27rllx dv (9) (i) [(x) is absolutely integrable;
(ii) [(x) has only a finite number of maxima
and minima in any finite interval; and
which is referred to as the Fourier integral rep- (iii) [(x) has no infinite discontinuities.
resentation of [(x). Here F(v), known as the Under these conditions, at points of disconti-
Fourier transform (or Fourier spectrum) of [(x), nuity of [(x), the Fourier integral converges to
is given by the arithmetic mean of the values of [(x) im-
mediately to the left and right of the disconti-
F(v) =[ : [(x)e-i27rIlX dx. (0) nuity. In general, anyone of the above sufficient
conditions can be slightly relaxed at the price of
strengthening one or both of the additional
The variable v is generally referred to as the conditions.
frequency variable. While the class of functions encompassed by
Note that a simple expansion of the exponen- the above conditions is wide, there do exist
tial in Eq. (10) yields F(v) in terms of sine and certain functions important' in mathematical
cosine integrals, physics which are not included, such as sin x,

L:
cos x, and [(x) = 1. These may be included if
the concept of the Dirac delta function, l) (v), is

1:
F(v) = [(x) cos 21TVX dx
introduced, having the properties
o(v) = 0 v* 0,
-i [(x) sin 21TVX dx. (11)

If [(x) is an even function of x, the sine integral


f"-00
l) (v) dv = 1. (5)

vanishes, leaving F(v) expressed as a Fourier co- Strictly speaking, 0 (v) is not a function in the
sine integral, usual mathematical sense. However, it can be
FOURIER ANALYSIS 486

TABLE 1. FOURIER TRANSFORM PAIRS

=exp (_1I'x2)

t
[(x) F(v) = exp (-w 2 )

Ixl " ! sin 1I'V


[(x) = F(v) = - -
1I'V
Ixl >!
I - Ixl Ixl" 1 sin 1IV]2
[(x) ={ F(v)= [ ~
o Ixl> 1
[(x) = 1 F(v) = S (v)

[(x) = cos 1I'X F(v) =! 6 (v -1) +! c'i (v +!)


[(x) =exp (i1l'x2) F(v) = exp [ -i1l' (v 2 - 1)]

=L L
00

[(x) 6 (x - n) F(v) = 6 (v - m)
n=-OD m_- oo

treated rigorously using the theory of distribu-


tions.
In Table I are presented the Fourier trans-
forms of a number of the more important func-
f(x,y) = IL:
F(vx, Vy)

tions encountered in mathematical physics and . exp [i21f(vxx + Vyy)) dvXdvy

If
engineering. From the basic Fourier transform
pairs listed in the table, many other pairs may
be derived with the help of the following F(vx, Vy) = f(x,yl 00

Fourier transform theorems. -00

Linearity: If f(x) = a g(x) + b hex), then


F(v) = a G(v) + b H(v), where G(v) and H(v) . exp [-i21f(vxx + Vyy)) dxdy. (17)
are the Fourier transforms of g(x) and hex),
respectively. For the most part, the properties of one-dimen-
Similarity: If f(x) = g(ax), then F(v) = sional Fourier transforms carry over directly to
(1/laI)G(v/a). the N-dimensional case. There is extra richness
Shift: If f(x) = g(x - a), then F(v) = G(v) exp in the theory for the multidimensional case,
(-i21fav). however. For example, if [(x,y) exhibits circu-
lar symmetry, i.e., is a function of only radius
r in the plane, then Eqs. (17) can be reduced to
Convolution: If [(x) =[: g(~)h(x- ~)dt the form of the so-called Fourier-Bessel trans-
form (or Hankel transform of zero order),
then F(v) = G(v)H(v).
A relation entirely analogous to Parseval's
theorem of Fourier series is Plancherel's theo-
rem,
fer) = 21f fo
00 pF(p)Jo(21frp) dp

F(p) = 21f 1o
00
rf(r)Jo(21frp) dr (I8)

where r = (X2 + y2 )112 , P = (vx 2 + Vy2) 1/2 , and


In many physical applications this theorem leads J o is a Bessel function of the first kind, zero
to the interpretation of IF(v)12 as the energy order.
spectrum of the function f(x). Discrete Fourier Transforms Of particular
The Fourier integral representation (and the importance in the analysis of experimental data
Fourier series representation) can be generalized is the discrete Fourier transform (DFT), which
to apply to functions of two or more indepen- is readily implemented on a digital computer.
dent variables. Most important are functions de- Let f(x) be a function of finite duration (i.e.,
fined over a plane or over three-dimensional [(x) is zero outside the range 0 < x < 2X), and
space. In the case of two independent variables suppose that the bandwidth of f(x) is limited
we have to the approximate range - B < v < B. Let f (x)
487 FOURIER ANAL YSIS

be sampled with spacing I /2B between samples, branches of science that benefit from Fourier
and define fn to be equal to (1/2B)f(n/2B). analysis in each case.
The number of such samples is N = 4BX. Then The applications of Fourier analysis can be
the OFT of the sampled data set fn (n = 0, I, conveniently divided into two major categories .
. . . N, - I) is defined by Within the first category we find Fourier analy-
sis used purely as an analytical tool, for the cal-
culation and prediction of the results of experi-
Fm =Nil fn exp (-i21T mn) ments not yet performed, or for the explanation
n=O N (19) of experimental results already obtained. For
example, in certain important cases the wave
m=O,I, ... ,N-I. antplitude produced by a diffracting structure
may be calculated using Fourier transforms, a
The corresponding inverse discrete Fourier consequence of the fact that the complete dif-
transform relation is fraction integrals may be reduced to Fourier
transforms.
fn = -I L:
N-l (mn)
Fm exp i21T- . (20) As a second example of this category, it might
be desired to predict the image obtained when a
N m=O N
certain object is viewed through an imaging sys-
tem. The characteristics of such a system are
The relation between the resulting coefficients often most easily specified in terms of its effects
Fm calculated by Eq. (19) and the desired sam- on spatial sine wave objects of various spatial
ples of the continuous Fourier transform F(v) frequencies. To predict the image obtained for a
is relatively complex, but can be stated explicitly nonsinusoidal object, that object may be de-
as c?mposed, by means of a Fourier i~tegral, into
sme wave (or complex exponential) compo-
L:
00

Fm = F[(m - kN)/2X]. nents. The image is then found by synthesizing,


k=-oo
again with a Fourier integral, the various sine
wave image components, after their amplitudes
In the event that there is little energy in the spec- and phases have been modified by the known
trum of f(x) outside the interval-B<v<B, properties of the imaging system. A similar ap-
as supposed at the start, then to a good ap- proach is used in the analysis of linear, time-
proximation Fm is equal to sampled values of invariant electrical networks. It can also be used
the continuous spectrum F(v). For indices in the to calculate the diffusion of heat through a con-
range m = 0, I, ... , (N/2) - I, Fm = F(m/2X) ducting plate when a known temperature distri-
(these are the "positive-frequency" components bution is impressed across one edge of the plate.
of f(x)), while for indices in the range m = A second major category of applications uti-
(N/2) + I, . .. , N - I, Fm = F«m - N)/2X) lizes Fourier analysis as an experimental tool,
(these are the "negative frequency" compo- applied to experimental data to derive related
nents of f(x)). physical information. For example, in Fourier
The OFT can be computed very rapidly using spectroscopy the interferogram obtained from a
a digital computer with the help of so-called scanned Michelson interferometer may be
fast Fourier transform algorithms. Note that a Fourier-analyzed to yield the spectral intensity
brute-force calculation of any single coefficient distribution of the radiation source. In x-ray
Fm of the OFT would require N complex mul- crystallography, the x-ray diffraction pattern
tiply-and-adds, one for each term in the summa- produced by a crystal may be Fourier-analyzed
tion. (We refer to each complex multiply-and- to yield information about the crystal lattice
add as an "operation.") Since there are N such structure. In radio astronomy the correlation
coefficients, the total number of operations coefficients produced when the signals collected
required would be N2 . When N is a power of 2, by various elements of an array are cross-cor-
a reorganization of the order in which compu- related may be Fourier-analyzed to yield a map
tations are performed allows the total number of the radio brightness distribution of the sky.
of operations to be reduced to N log2 N. The In all of these latter examples, Fourier analysis
savings in computation time can be enormous plays a fundamental role in transforming data
for large N, and it can be said that the discovery which are readily collected by experiment into
of fast Fourier transform algorithms has made data which are more directly related to the
possible the widespread use of Fourier analysis physical properties of interest.
in modern computational physics. JOSEPH W. GOODMAN
Applications The applications of Fourier
analysis are so widespread throughout physics
and engineering that to summarize them ade- References
quately in a brief space is nearly impossible. At
best we can only outline some of the most im- 1. Bracewell, R. N., "The Fourier Transform and Its
portant classes of applications, illustrating with Applications," New York, McGraw-Hill Book Co.,
a simple mention some of the particular 1965.
FOURIER ANALYSIS 488

2. Goodman, J. W., "Introduction to Fourier Optics," ikro sin [(kw sin 8 )/21
New York, McGraw-Hill Book Co., 1968. A = Ce I 8' (3)
3. Papoulis, A., "Systems and Transforms with Appli- k sin
cations in Optics," New York, McGraw-Hill Book
Co., 1968. But since we are interested in the intensity 1,
4. Arsac, J., "Fourier Transforms and the Theory of which is the square of the amplitude, we find
Distributions," Englewood Cliffs, N.J., Prentice- that
Hall, Inc., 1966. 1 sin 2 a
5. Titchmarsh, E. C., "Introduction to the Theory of
Fourier Integrals," Oxford, Oxford University Press, 10 =----;;- (4)

1937.
6. Gaskill, J. D., "Linear Systems, Fourier Transforms, where a =! kw sin 8, and 10 incorporates all the
and Optics," New York, John Wiley & Sons, 1978. constants. r
7. Brigham, E. 0., "The Fast Fourier Transform," The function (sin 2 a)/a 2 (which is the square
Englewood Cliffs, N.J., Prentice-Hall, Inc., 1974. of what is sometimes called the sampling func-
8. Papoulis, A., "Signal Analysis," New York, McGraw- tion) is shown in Fig. 2; the alternate light and
Hill Book Co., 1977. dark bands are the Fraunhofer diffraction pat-
tern for a long, thin slit.
Cross-references: CALCULUS OF PHYSICS, DIFFER- A similar calculation 1 for a circular aperture
ENTIAL EQUATIONS IN PHYSICS, FOURIER of radius R gives the expression
OPTICS.
~= (2/1(P»)2 (5)
10 P
FOURIER OPTICS where p = kR sin 8 and J 1 is a well-known Bes-
sel function. Tabulated values gives the plot of
Diffraction Theory The articles OPTICS, GEO- Fig. 3, which is qualitatively similar to Fig. 2.
METRICAL and OPTICS, GEOMETRICAL AD- The bright central spot, the A iry disk, contains
VANCED are based on the approximation that about 84% of the energy which passes through
light is propagated as rays. For systems whose the aperture. Its periphery corresponds to the
dimensions are of the magnitude of the wave- first zero of J1(p), for which p=3.832. The
length of light, we must take into account its angle is then given by
wave nature. A simple examle is the rectangular
slit of Fig. I, whose width w is about equal to sin 8 =p/kR
the wavelength A.. Consider a plane wave strik-
ing this slit normally. Some energy from this or, approximately,
wave will be absorbed and reradiated by the
edges of the slit; the interaction of the initial 8 = 3.83A./2rrR = 1.22A./D. (6)
and reradiated waves produces a change in This formula indicates the angular resolving
direction, which we call diffraction. power of a lens of diameter D, showing that
The phenomenon of diffraction can be ex- Fraunhofer diffraction imposes a fundamental
pressed quantitatively by using the fact that limit. The same result applies to the problem
waves are specified in terms of sines or cosines; of~resolving two separate points; if their Airy
for mathematical convenience, we shall use the disks overlap in the fashion indicated by Fig. 4,
equivalent function exp Okr), where r is the they are considered to be just distinguishable;
displacement along the wave. Since the wave this is the famous Rayleigh criterion.
is periodic, its amplitude at r = 0 and r =A. is It is possible to improve on this limit by put-
unchanged. Hence, exp Uk A.) = 1, k A. = 2rr, or ting more of the radiant energy into the central
k = 2rr/A. (1) spot. This is done by apodization (from the
Greek "without feet," which refers to the re-
gives the defining equation for the propagation duction of the side lobes in Figs. 2 or 3). The
constant k. To find the amplitude A that this method involves placing a nonuniformly trans-
diffraction wave has on a screen at some arbi- mitting plate in front of the lens. For example,
trary distance to the right, we integrate over let a coating of the form
the area S of the slit, obtaining
A (x) =cos (rrx/w)
A =C Je ikr dS (2) be used with the slit of Fig. 1. This additional
term inside the integral of (2) will lead to a
transmission curve as shown in Fig. 5. The cen-
where C is the constant involving the maximum tral bright line is now wider and more intense.
strength of the wave and dimensionality factors. Spatial Filtering We next consider the dif-
As Fig. 1 shows, r =ro + x sin 8; taking dS = fraction process for a series of N sUfts of width
1dx, and integrating from -w/2 to w/2, we w with a uniform spacing d (Fig. 6); such a
obtain structure is called a Ronchi diffraction grating.
489 FOURIER OPTICS

FIG. 1.

The integral in Eq. (2) is generalized to a sum where r =! kd sin (). The first term on the right
of N integrals, one for each slit, and the result is like Eq. (4), and comprises the shape factor,
of the combined summation and integration is representing the contribution of the N identical
slits to the pattern. The other term is the grat-
~ =(Sin
10 a
a)2( Nsmr
sin.Nr)2 (7) ing factor, and indicates the effect of repeating
the slits. Figure 7 shows the shape factor as a

.,:
I
I

. .j:--.
I

""-1:"z -"-
y;,~

".
FIG. 2.
FOURIER OPTICS 490

FIG. 3.

FIG. 4.

dotted envelope for the complete pattern; note The application we wish to consider involves
how raising N from 5 to 20 (or higher) sharpens the combination of a grating and a lens (Fig.
the bright lines forming the diffraction pattern. 8a). There will be the usual image of the grating
The pattern we show here is found in connec- (Fig. 8b) at the place specified by the Gaussian
tion with other physical phenomena as well; lens equation (see OPTICS, GEOMETRICAL). In
electron and X-ray diffraction are examples addition, the diffracted beams corresponding to
(see discussion in DIFFRACTION BY MATTER 'Y = 0, ±1T in Fig. 7, or to ()o = 0, ()1 ='A/d, ()-1 =
AND DIFFRACTION GRATINGS). -'A/d (the zero and first order beams) are shown
491 FOURIER OPTICS

1/1 0
\.5
, \
\ Apodized
\~ aperture

\
\ Original
aperture

\
\
0.5 \
\
\

7r
2
"
7r 37r
2
27r 511'
2
Q

FIG. 5.

2 3 N-\ N

~ 1-W~ ~ ~
'r4-d--+-j

FIG. 6.

- --1i10

.........

20 slits

o 7r

FIG. 7.
FOURIER OPTICS 492

H.H' Image
Diffraction
of
pattern
grating

(a)

1====
I

Diffraction Pattern

enting
(b)
FIG. 8.

in detail in Fig. 8a. It is quite clear that the being specified by the square wave of Fig. 9a,
parallel zero-order beams will meet at the focal this is equivalent to removing the high frequency
point F'. It can easily be shown by using the components of the Fourier series representing
paraxial matrices of OPTICS, GEOMETRICAL, the wave and the result is Fig. 9b; Le., the sharp
ADY ANCED, that a set of rays which are parallel corners of the image have been rounded or
to themselves but not to the lens axis (such as smoothed. This procedure could be applied in
the four for 9 1 > 0) will meet on the plane principle to satellite photographs, which are
through F' but off-axis. composed of strips; low-pass spatial filtering
Now we place an opaque aperture at F' which will remove the sharp boundaries. The converse
blocks all diffraction orders except for 9 0 ; this situation-an opaque disk at F'-will then give
is the process of spatial filtering. If we think of high-pass spatial filtering (Fig. 9c), and one
the transmission properties of the grating as application could be to improve the readability
493 FOURIER OPTICS

I lens of variable thickness from center to edge,


the time of transit varies with distance travelled
f-d· in the glass, causing a shift in the phase of the

- -
wave from one part to another. This shift simply
contributes an additional term in the exponent
of the diffraction integral (2), where it has an
.. w
effect analogous to the apodization mentioned
x above. It can be shown l with some elementary
geometry and some very formidable algebra
that the phase shift due to a spherical lens has a
quadratic dependence on the x and y coordi-
(a) nates but that by positioning object and image
so that they obey the Gaussian lens equation,
I this dependence reduces to a linear one. It will
then be recognized that the diffraction integral
(2), involving an exponential term of the form
exp [(ik (x + y) 1, is generating a two-dimen-
sional Fourier transform of the remaining
factors inside that integral, these factors speci-
fying the nature of the incoming wave. Thus,
a lens is a device which performs a Fourier
x transform on an object, and also forms an
image. We have already seen an example of this
in Fig. 8; the Fourier transform of a periodic
(b)
object, the grating, reduces to a discrete Fourier
series, the result being located at the focal plane.
I Fourier transform methods are used in science
and engineering to solve the differential equa-
tions associated with transient phenomena;
they give a way of computing the response of
a circuit to an electrical impulse, for example.
This response is identified as the admittance
(the reciprocal of the impedance) and it is also
called the transfer function. The optical equiva-
lent of an impulse is a point source and the
response is the optical transfer function. Like
admittance or impedance, it will depend on
(e) frequency, but this is measured in line pairs/
mm and is known as the spatial frequency vx.
FIG. 9. It is possible to calculate the dependence of the
transfer function I on spatial frequency, with
results as shown in Fig. 10. As already pre-
of illegible documents by sharpening edges. The dicted, there is a maximum value Vxo of Vx
details of evaluating the Fourier series are given beyond which the resolution vanishes. The
by Garrard. 2 curves shown represent the situation for a
Optical Transfer Functions The effect of a perfect lens; the presence of aberrations will
lens on light rays is the subject of geometrical
optics (see associated articles) and the major
problem is the inherent aberrations, which dis-
I{lI~
play themselves as blurry or distorted images.
Light treated as waves brings in additional prob-
lems. The first of these is a direct consequence
of Eq. (6), which indicates that the finite diam-
eter of any optical system imposes a limit on its
ability to resolve closely spaced objects. It is RECTANGULAR APERTURE
customary to use a test object in the form of
alternate black and white bars; the resolution is
expressed in terms of the largest number of CIRCULAR APERTURE
line-pairs/mm that can be distinguished at the
image plane. The other limitation is imposed by
the ability to distinguish bars from spaces, i.e.,
the image contrast.
Both resolution and contrast are affected by
the fact that when a light wave passes through a FIG. 10.
FOURIER OPTICS 494

degrade these results. In addition, the transfer cast transmitters and receivers within allocated
function can be shown to depend as well on frequency bands or providing signals for labora-
the relative contrast of object and image and tory equipment such as counters and spectrum
predict how this aspect of image quality decays analyzers. The most commonly used device
with increasing spatial frequency. for such applications is the quartz crystal
oscillator. Other applications require time-
ALLEN NUSSBAUM keeping in addition to frequency control. Pre-
cise navigation on earth, navigation in space
References very far from the earth, and certain digital com-
munications applications require much greater
1. Nussbaum, A., and Phillips, R. A., "Contemporary performance than may be obtained with quartz
Optics for Engineers and Scientists," Prentice-Hall crystal oscillators. These more stringent require-
(1976). ments are generally provided by atomic or
2. Garrard, A.,Am. J. Phys. 31,723 (1963). molecular frequency standards called ATOMIC
CLOCKS. In addition, scientists require a very
Cross-references: DIFFRACTION BY MATTER AND precise and accurate frequency standards for
DIFFRACTION GRATINGS; FOURIER ANALYSIS; applications such as precision spectroscopy
LENS; OPTICS, GEOMETRICAL; OPTICS, GEO- and the measurement of relativistic effects.
METRICAL, ADVANCED; OPTICS, PHYSICAL. These special applications make use of a wide
variety of frequency standards including those
mentioned above, but also including such unu-
FREQUENCY STANDARDS sual devices as superconducting cavity stabilized
oscillators, saturated absorption stabilized lasers,
Introduction A frequency standard is simply or even the Mossbauer effect.
an oscillator whose characteristics of either Operating Principles For frequency standard
frequency stability or accuracy satisfy certain applications, the most critical parameter of the
subjective performance criteria. Such a device resonator is its relative linewidth. The loaded
is approximately characterized by its nominal Q of the resonator is the most commonly used
frequency and nominal amplitude of oscilla- figure of merit. It is equal to the total stored
tion. But noise and systematic effects cause the energy divided by the energy dissipated in one
actual frequency and amplitude of operation to cycle of the oscillation and is inversely propor-
differ from nominal. The level of the noise- tional to the linewidth. Quartz crystals have
induced frequency deviations is characterized Q's as high as a few million. Superconducting
by the quality called frequency stability, while cavities operating at microwave frequencies
the ability of the oscillator to achieve a pre- may have Q's as high as lOll, and atomic reso-
determined frequency without calibration is nances suitable for use in frequenc? standards
called accuracy. Both quantities are usually today have Q's ranging from 10 to 10 12 •
stated on a fractional basis, /::"f/ fo, where to is A fundamental limitation on a frequency sta-
the minimal operating frequency. Although bility of the device is determined from the Q
oscillators cover the entire range of frequency and signal-to-noise-ratio, SIN: The stability
stabilities smaller than 10- 2 , for the purpose of cannot be better than (l/Q)(SIN).
this article frequency standards are considered There are many different types of noise
to cover the range of stabilities smaller than sources. For quartz crystal oscillators, the
10- 6 • Their size and weight can range any- fundamental sources are thermal (Johnson)
where from 10 cc and 50 gms to 1000 liters noise which is associated with the dissipation
and 100 kilograms. of energy in real resistors and shot noise which
Nearly single frequency oscillation results is associated with the discrete nature of the
from the use of a frequency selective element charge carriers in the electrical circuitry. Since
called a resonator. Two types of circuitry are the frequency of the atomic standard is derived
used to produce a voltage output. An active from the observation of a small number of
oscillator is constructed by combining the atoms or molecules, frequency instabilities are
resonator with an amplifier and a limiter in a observed which derive directly from the proba-
feedback loop. However, it is sometimes de- bilistic nature of quantum mechanics.
sirable to obtain the characteristics of more Noise affects the performance in two distinct
than one resonator in a frequency standard. ways. First there is noise in the amplifiers
This is accomplished by using an active oscil- which buffer or isolate the frequency standard
lator of the type just described to probe a from the user. Thermal or shot noise in these
second resonator. Feedback maintains the amplifiers adds to the output signal. The re-
frequency of the active oscillator equal to sultant frequency fluctuations have a low pass
the nominal center frequency of the second characteristic, i.e., they become smaller at
resonator. higher frequencies. There are also noise sources
The range of frequency stability from 10- 6 within the feedback loop of the standard which
to 10- 9 is generally important for frequency do not simply add to the output signal, but per-
control applications, such as maintaining broad- turb the frequency of the oscillator directly.
495 FREQUENCY STANDARDS

In this case a white voltage noise source pro- level, temperature compensation is the tech-
duces white frequency fluctuations. nique most often used. The circiut of the oscil-
Accuracy is a property of atomic frequency lator is modified by adding a second tempera-
standards. High accuracy is obtained by mini- ture sensitive element such as a thermistor
mizing perturbations on the atomic system due which compensates for the temperature de-
to collisions with other atoms, interaction with pendence of the resonator. Such devices can
electric and magnetic fields, and the effect of the achieve frequency stability of 10-6 over wide
velocity of the atoms themselves. For example, temperature ranges and are both small and in-
the definition of the second is 9,192 ,631,770 expensive, finding wide application in communi-
oscillations of the radiation from the cesium 133 cations, radio, and television.
atom in its ground state. The term ground state When higher precision is required, compensa-
refers to the lowest energy state of an atom tion is usually abandoned in favor of tempera-
which is at rest in a field-free region of space. All ture control. The frequency of the resonator
present-day atomic frequency standards actually is a cubic function of the temperature, and there
observe atoms in thermal equilibrium with their are two temperatures called the turnover points
surroundings and therefore moving at velocities at which the frequency variation with tempera-
on the order of 100 m/s. From the point of view ture is zero. The crystal is placed in an oven
of the observer, the frequency emitted by such and operated very near one of the turnover
atoms is perturbed by two effects. The first- points. The stability of oscillators containing
order Doppler effect is proportional to the ve- ovenized crystals varies between approximately
locity vector of the atom. It is analogous to the 10- 9 to 10- 12 for one-second averages and the
change in pitch of a train whistle passing a sta- aging rate typically varies between 10- 10 and
tionary observer. The second-order Doppler ef- 10- 11 per day. Typical applications include
fect, so called because it is proportional to the high-quality laboratory instrumentation such as
square of the velocity, is a relativistic effect re- frequency counters and clocks for space ve-
sulting from the property of time dilation. A hicles which stress the small size, low power,
radiating atom in motion behaves like a clock and high reliability of quartz crystal oscillators.
and according to the special theory of rela- Atomic Frequency Standards The change in
tivity, runs slow compared to a stationary state of an atom or molecule between two of
clock. its discrete energy levels is a resonance which
One might think that the ideal frequency may be used in a frequency standard. The fre-
standard would combine the best characteristics quency associated with the transition is the
described above. It would have as high a Q as energy difference divided by Planck's con-
possible, very small interactions, very small stant. It is generally not convenient or practical
velocity for the atoms and a very high signal- to use most of the allowed transitions between
to-noise ratio. However, these characteristics energy states of an atom because the frequencies
are inconsistent. For example, in an atomic are usually too high for coherent signal process-
standard, high signal-to-noise ratio generally ing techniques or because the transitions are
means observing large numbers of atoms which extremely sensitive to either electric or mag-
in turn implies frequent atomic collisions and netic fields. Consider a simple atom having
substantial interactions. The required com- a single unpaired electron such as hydrogen or
promises have resulted in the development of any of the alkalai metals. The frequency cor-
a wide variety of different frequency standards responding to the transition of the electron be-
emphasizing different aspects of performance: tween two of its discrete energy levels is typi-
accuracy, stability, size, weight, power, re- cally in the ultraviolet or visible portion of the
liability, and cost. spectrum and therefore the atom must be used
Quartz Crystal Oscillators A quartz resonator in its ground electronic state. If the electron has
is fabricated from a thin slice of material cut an asymmetric orbit, then it will have an as-
from a commerical or naturally grown quartz sociated magnetic moment which can interact
crystal. Since the material is piezoelectric, an with the magnetic moment of the nucleus.
acoustic vibration results in an electric poten- This interaction separates the ground state into
tial between the two faces of the crystal. Coup- several energy levels called fine structure. How-
ling is provided by depositing thin film metal ever, the transition frequencies are usually in
electrodes on the two faces of the crystal. the infrared, still too high for practical applica-
To a first-order approximation the crystal acts tions. If the electron is in a symmetric orbit
like an electrical resonator composed of an then it has no orbital angular momentum but
inductor, capacitor, and resistor and is used by it still has a magnetic moment due to its spin.
installing it in a standard oscillator circuit. The interaction of this magnetic moment with
The center frequency of the resonance is the magnetic moment of a nucleus produces
determined from the dimensions of the reso- a set of energy levels called hyperfine structure
nator and is therefore inherently very sensitive which are typically at microwave frequencies.
to temperature. Frequency standard operation Transitions between such energy levels form the
therefore requires some means of temperature basis for all commercially available atomic
compensation or control. At the 10- 6 stability clocks. Not until practical methods of fre-
FREQUENCY STANDARDS 496

quency synthesis are developed for frequencies standard illustrates all the aspects of the atomic
higher than 100 GHz will atomic clocks be used beam frequency standard. The atomic transi-
extensively in that range. Nevertheless, various tion used is the hyperfine transition in the
laser techniques are used today at quite high ground state. This transition has a very small
frequencies as standards for atomic and molec- dependence on electric fields and is independent
ular spectroscopy and length. of magnetic field to first order at zero applied
Conventional electronic or acoustic resonances field. The atomic beam is formed by placing the
are observed through the frequency selective cesium metal in an oven with a small exit
nature of their power absorption. When coupled hole. Cesium metal melts at 37°C, making it
to an amplifier they can be used to provide very easy to form an atomic beam of reasonable
frequency selective gain. The analogous situa- density. The desired hyperfine level is selected
tion holds for atomic resonances provided that by passing the beam through the inhomogeneous
an ensemble of atoms can be produced in one magnetic field of a dipole magnet which de-
of the two energy states of the desired transi- flects the desired atoms through a region con-
tion. In the case of hyperfine spectroscopy, the taining a microwave cavity. A microwave field
upper and lower energy states have very nearly at the transition frequency of 9.2 GHz stimu-
equal populations, since their separation is lates a change in atomic state which is analyzed
very small compared to the thermal kinetic by a second region of inhomogeneous mag-
energy of the atoms. Microwave frequency netic field. The number of cesium atoms which
standards generally employ either magnetic make the transition between the two hyperfine
or optical state selection to create imbalance levels is measured using a hot wire detector
in the state populations. which ionizes the cesium atoms that success-
Magnetic state selection depends upon the fully passed through the spectrometer. The
fact that the two hyperfine energy levels behave microwave signal applied to the spectrometer
very differently in a strong magnetic field is swept slowly back and forth across the
whose effect on the electron is larger than the resonance while the atom flux is measured.
electron's interaction with the nuclear magnetic The signal is maximum when the frequency
moment. In an inhomogeneous magnetic field, of the applied microwave field is equal to the
the atoms feel a force proportional to the field frequency of the atomic transition. Electronic
gradient and the atoms in one state are de- circuitry provides a correction signal which is
flected oppositely from the atoms in the other used to maintain the average frequency of the
state. By intercepting one state while allowing microwave oscillator at the center of the
the other to pass through an aperture, an atomic atomic transition.
beam composed of atoms in the desired state Commercial cesium standards have frequenc~
is obtained. stabilities which varr between six parts in 10 1
Alternatively, optical state selection is ac- and six parts in 10 1 for observation periods of
complished by inducing a transition to a 100 s and the frequency stability improves
higher electronic energy level from which the inversely as the square root of the observation
atom decays spontaneously to the ground time. The best stabilities for cesium beam
state. The light source must be narrower than frequency standards generally vary between a
the separation between the upper and lower few parts in 10 14 and one part in 10 13 . Labora-
ground-state hyperfine levels. Atoms induced tory cesium standards, called primary cesium
to make transitions out of one hyperfine level standards, are constructed in such a way that
to an excited state relax back to the two the systematic effects which perturb the
ground-state hyperfine levels in nearly equal atomic resonance frequency may be measured
proportions. Thus the hyperfine level to which very precisely. As a result, they are accurate
the light source is tuned is depopulated. and reproducible to a few parts in 10 14 . Ac-
The lifetime of the hyperfine transition is curacy is limited principally by the ability
extremely long compared to the maximum experimentally to determine the residual first-
length of time the atom could be observed in order Doppler shift, although several other
a practical spectrometer. However, without affects such as the relativistic time dilation,
taking suitable precautions the observer will the effect of electric fields and the stability and
detect only a very broad line due to the Doppler homogeneity of the magnetic field also con-
shifts from many atoms traveling in random tribute to the inaccuracy. Fundamental ad-
directions. Various techniques are used to vances in the accuracy of atomic frequency
eliminate the first-order Doppler effect. For standards will probably come from devices
example, in an atomic beam spectrometer the which more nearly observe atoms at rest in
microwave radiation is applied to the atoms free space. .
perpendicular to the mean velocity vector of
the atomic beam and is reflected back upon SAMUEL R. STEIN
itself. The net effect is to reduce the observed
width of an atomic resonance to a value equal Cross-references: ATOMIC AND MOLECULAR
to the inverse of the observation time. BEAMS, ATOMIC CLOCKS, FEEDBACK, VIBRA-
The operation of a cesium beam frequency TION.
497 FRICTION

FRICTION percent of the total resistance to sliding. The


widely held belief that friction is due mainly to
Introduction Friction is the resistance to mo- a roughness effect does not find experimental
tion which exists when a solid object is moved support. Cleaved mica (smooth to an atomic
tangentially with respect to the surface of an- scale) shows very high friction.
other which it touches, or when an attempt is Laws of Friction If a normal load L presses
made to produce such motion. Friction thus
takes its place as one of the general systems of two surfaces together, then we may apply a
force which are considered in MECHANICS tangential force up to some limiting value F s ,
(others being GRAVITY, ELASTICITY, etc.). Un- and the surfaces will remain at rest. Sliding
fortunately friction depends to a marked extent occurs when the tangential force exceeds the
on the material properties of the contacting static friction force F s , and almost as soon as
surfaces, and even more importantly, on any motion starts the tangential force takes on a
surface contaminants which may be present, so characteristic value, Fk, and always acts in a
that it is very difficult to estimate values of the direction opposite to the relative velocity of
friction force theoretically, with an uncertainty the surfaces. Fs is often some 30 per cent larger
of less than about 20 per cent. In many calcula- than Fk, but sometimes they are equal.
tions in solid mechanics, this uncertainty in the The ratio Fs/L is the static friction coefficient
friction constitutes the limiting factor in deter- fs(or J.l.s), while the ratio Fk/L is the kinetic
mining the accuracy of the over-all calculation. friction coefficient!k. We may cite quite general
The friction force arises from the fact that, statements, or "laws," involving these coef-
when two solids are pressed together, bonding ficients of friction. The three classical laws
between their surface atoms occurs, and these dating back to the seventeenth and eighteenth
bonds have to be broken before sliding can com- centuries, are:
mence. Bonding of any considerable strength (1) The friction coefficient is independent of
occurs only in places where the surface atoms the load.
come within range of each other's strong force- (2) The friction coefficient is independent of
fields (i.e., closer than about 3 X lO-IOm); thus, the contact area.
when ordinary solids with appreciable roughness (3) The kinetic friction coefficient is inde-
are used, bonding is confined to a few small pendent of the sliding velocity.
patches (called junctions) over their interface, More recently, it has been found possible to
where the high spots (or asperities) of one make another general statement about the fric-
material have made contact with asperities on tion coefficient:
the other material. (4) The friction coefficient is independent of
The energy used up in the friction process the surface roughness.
appears almost entirely in the form of heat. We may summarise these laws in the compre-
Generally this consists of a moderate tempera- hensive statement.
ture rise over the contacting bodies, and super- (5) The friction coefficient is essentially a
posed on this there are higher "flash tempera- material property of the contacting surfaces,
tures" at the junctions. At high sliding speeds, and of the contaminants and other films at their
softening or even melting of the tips of the interface.
asperities may occur. Although in practice these "laws" are reason-
In addition to the bonding or "adhesion" ef- ably well obeyed, there are often systematic
fect, which is the principal cause of friction, divergencies, some of which have important
there are four other mechanisms which use up consequences. In the rare cases in which the
energy during sliding; energy which must be friction coefficient varies with load, it is often
supplied by the friction force. These mecha- because of some special effect (e.g., a surface
nisms are: coating which is broken up at heavy loads);
(I) A roughness effect, caused by the inter- however, materials with a high elastic limit (e.g.,
locking of asperities and the need to lift one polymers) generally show a friction coefficient
surface over the high spots of the other; which goes down somewhat as the load is in-
(2) A ploughing effect, whereby an asperity creased. The fourth law applies closely to the
on a hard material can dig a groove in a softer intermediate ranges of surface roughness, but it
material; is found that very smooth surfaces have higher
(3) A hysteresis effect, whereby there is friction because they tend to seize, and very
elastic and plastic deformation of the material rough surfaces have higher friction because of
at or near the junctions, and not all the defor- asperity interlocking.
mation energy is recoverable; Velocity and Time Effects in Sliding Viola-
(4) An electrostatic effect (with electric insu- tions of the third friction law are important
lators), where work must be done to separate because they can lead to friction-induced oscil-
electrically charged regions on the sliding .sur- lations. With some materials fk is almost inde-
faces. pendent of velocity over a very wide range;
In a great majority of applications these four however, with hard materials the friction gen-
mechanisms do not account for as much as 20 erally goes down as the speed goes up (a so-
FRICTION 498

TABLE 1. COEFFICIENTS OF FRICTION

Materials Surface Conditions Is Ik


Metals on metals (e.g. steel on steel, Carefully cleaned 0.4-2.0 0.3-1.0
copper on aluminum) Unlubricated, but not cleaned 0.2-0.4 0.15-0.3
Well lubricated 0.05-0.12 0.05-0.12

Nonmetals on nonmetals (e.g., leather Unlubricated 0.4-0.9 0.3-0.8


on wood, rubber on concrete) Well lubricated 0.1-0.2 0.1-0.15

Metals on nonmetals Unlubricated 0.4-0.6 0.3-0.5


Well lubricated 0.05-0.12 0.05-0.12

called negative characteristic), while soft mate- tion can produce drastic differences in the
rials show a negative characteristic at high friction.
sliding speeds and a positive characteristic at Some typical friction values are given in
low sliding speeds. Often, fk changes by about Table 1.
10 per cent during a factor-of-ten change of Materials with Unusual Frictional Properties
velocity. In a sliding system which has elastic In this category we may place the hard non-
compliance, a negative characteristic introduces metals (diamond), which give low friction
an instability, and at high speeds this takes the (- 0.1); the elastomers (natural rubber) which
form of harmonic oscillations (e.g., a violin give very high friction (-0.9); the layer-lattice
string), while at low speeds relaxation oscil- substances (graphite, molybdenum disulfide,
lations occur (e.g., a creaking door). The relax- cadmium iodide) which give low friction (-0.1)
ation oscillations are usually referred to as stick- and are used as solid film lubricants; the hexag-
slip, because during a part of each cycle the onal close-packed metals (rhenium, magnesium,
surfaces are at rest. The severity of the stick- titanium, cobalt) which when clean give lower
slip is enhanced because, while the surfaces are friction coefficients (-0.5) than do other met-
at rest, fs increases somewhat with time of stick als; "Teflon," which adheres very poorly to
(about 10 per cent for every factor-of-ten in- other solids and accordingly gives very low
crease in time of stick above 10-3 seconds). friction (-0.05); and metals in group IVa of
Value of the Friction Coefficient In most the periodic table (titanium, zirconium, haf-
situations the total real area of contact, or sum nium) which cannot be well lubricated by any
of all the asperities, is produced by plastic known liquid substances. Generally, the fric-
deformation of the asperities, and if we assume tional anomalies of solids may be traced to
that strong bonds are formed joining the mate- peculiarities in their structure or surface prop-
rials across the interface we find that the fric- erties.
tion coefficient is given by Current Problems and Areas of Research
Interest At one extreme there is concern
f=s/p with obtaining high and reproducible friction
for traction purposes, as in the development
where s is the plastic shear stress, and p the of better brakes and clutches, and in the de-
plastic indentation hardness, of the softer of sign of runway surfaces at airports which give
the two contacting materials. Since sand pare high friction even in very wet weather. At the
similar plastic strength parameters, their ratio other extreme, work is being done to reduce
tends to be constant, within the range 0.3 to friction in devices which run continuously and
0.6, for a wide variety of materials. With some dissipate energy, such as automobile engines.
materials the real area of contact increases Often, problems arise when new materials must
during sliding, and friction coefficients above be used. Thus the reduction in the use of as-
1.0 are then commonly observed. Especially bad bestos because of health hazards and of cobalt
in this connection are materials with a high because of restricted availablity has stimulated
ratio of surface energy to hardness, namely much research and development work. Per-
clean soft metals such as lead, aluminum and haps the most persistent problem is that of con-
copper. trolling stick-slip, which is responsible for
A lubricant acts by introducing a layer of much of the noise pollution of our environment.
lower shear stress Sl at the interface, and lowers Related Fields of Interest Friction is con-
the friction coefficient accordingly, down to sidered one of a group of mechanical interac-
0.05 in favorable cases. Many nominally clean tion phenomena, and research in the other
materials also give low friction, as result of members of the group, generally referred to as
contamination during manufacture or handling. tribology, is of great interest to workers in the
Slight differences in the degree of contamina- friction field. There is an extensive common
499 FUSION POWER

literature. The other tribological phenomena where the fuel is deuterium (D) and tritium (T)
are wear (the removal of surface material as and the reaction products are helium (the 0:
result of mechanical action), lubrication (the particle, 4He) and a neutron (n); the numbers
properties of surface films which reduce fric- in parenthesis are their energies. The energy
tion and wear), adhesion (the tendency of solid from fusion may be appreciated by noting that
objects pressed together to remain together), 36 m 3 of natural water contains I kg of deute-
and electrical contact effects (the electrical rium, which on fusion with tritium produces an
resistance at the interface between contacting energy equivalent to 5 X 104 m 3 of gasoline.
or sliding solids). The deuterium is easily extracted from water,
Other effects which are often classified as but the tritium, which does not occur naturally,
basic tribological effects are erosion (the wear is made by allowing the neutrons to combine
produced by particles traveling in a fluid and with lithium, using the nuclear reactions
impinging on a solid surface) and fretting (the
friction and wear produced as result of vibration 6Li + n = T + 4He + 4.8 Mev
at the interface between contacting surfaces). (2)
There has been a recent upsurge of interest in 7Li + n = T + 4He + n - 2.5 Mev.
these fields, based on the realization that the
various tribological phenomena have great There is sufficient high grade lithium ore to
practical importance . Thus, in a modern in- supply the world's electricity requirements for
dustrialized country excessive friction wastes thousands of years. The possibility of an abun-
about 0.5% of the Gross National Product, dant energy source combined with low poten-
while the damage done by wear is about an tial radioactive hazard make fusion power at-
order of magnitude greater. tractive. Even allowing for the radioactive
tritium, the hazard is much less than that for
ERNEST RABINOWICZ fission reactors, in that no actinides, iodine,
cesium, strontium, or plutonium are produced.
References Ultimately there is the possibility of fusion
Bowden, F. P., and Tabor, D., "Friction and Lubrica- between deuterium nuclei alone using the
tion of Solids," Oxford, Clarendon Press, Part I, reactions
1950, Part II, 1964 .
Buckley, D. H., "Surface Effects in Adhesion, Fric- D + D = He 3 (0.82 MeV) + n (2.45 MeV)
(3)
tion, Wear and Lubrication," New York, Elsevier, D + D = T (1.01 MeV) + H (3.02 MeV)
1981.
Ku, P. H. (Ed.), "Interdisciplinary Approach to Fric-
thereby eliminating the need for lithium. How-
tion and Wear," NASA Report SP-181, Washington, ever the energy produced is less, and the peak
D.C., National Aeronautics and Space Administra- reaction rate for DID fusion occurs at a tem-
tion, 1968. perature that is nearly ten times that for D/T
Rabinowicz, E. , "Friction and Wear of Materials," fusion and is about ten times smaller. This
New York, Wiley, 1965. results in a ratio of energy produced by the
Suh , N. P., and Saka, N., "Fundamentals of Tribol-DID fusion reactions to energy supplied as heat
ogy," Cambridge, MA, MIT Press, 1980. in the reactants of about 70, compared to about
Szeri, A. S., 'Tribology-Friction, Lubrication and 1800 for D/T fusion reactions, making an over-
Wear ," New York, McGraw-Hill Book Co., 1980. ali energy gain more difficult to achieve. Thus
Cross-references: DYNAMICS, MECHANICS, STAT- attention is primarily directed first to achieving
ICS. fusion power using D/T fuel. It is worth noting
that D/D reactions are appreciable in a D/T
fuelled reactor, increasing the average energy
FUSION POWER per reaction from the 17.6 MeV given in Eq.
(1) to 22.4 MeV.
Introduction When nuclei of certain light ele- The D/T fusion cross section as a function of
ments fuse together an excess binding energy energy rises rapidly to a peak at 110 ke V, equiv-
is liberated. This is the energy source of stars, alent to a temperature of 10 9 K. If the density
and it is the object of fusion research to find is sufficient for a thermal distribution of particle
ways of using the energy for electrical power velocities, the rapid rise in cross section results
production on earth. Because of the abundance in the few particles with energies in the Max-
of suitable light elements the energy source is wellian tail contributing most to the reactions
virtually unlimited. and so the peak reaction rate is at a rather lower
In order to fuse, the reacting nuclei must have temperature, around 6 X 10 8 K, and conceptual
sufficient energy to overcome their coulomb fusion reactors are often designed to operate at
repulsion, so nuclei with a high ratio of mass to a temperature of 10 8 K or so. At this tempera-
charge heated to a high temperature are most ture the fuel is fully ionized by collisions, so
suitable. The fusion reaction of interest is the electrons are not attached to any particular
nucleus and electron charge density equals the
D + T = 4He (3.5 Mev) + n (14.1 Mev) (1) nuclear charge density. This state of matter is
FUSION POWER 500

called a plasma, and the problems of fusion the charged particles produced by the thermo-
power generation are mainly plasma physics nuclear reactions, provided the plasma dimen-
combined with engineering, rather than nuclear sions exceed the particle cyclotron orbit diam-
problems. eter. This makes a self-sustained thermonuclear
Research into the feasibility of thermonuclear reaction possible because the energy from the
fusion power began in the mid-l 950s, and for particles can thermalize with fuel particles and
25 years has been concentrated on the problems balance power lost from the plasma as a whole.
of heating and containing high temperature The minimum temperature for a self-sustained
plasma. Progress in plasma physics has been thermonuclear reaction is called the ignition
sufficiently encouraging for a number of temperature, and the plasma must be heated by
countries to start constructing large experi- external sources supplying power greater than
ments to investigate plasma heated by its own the power loss up to the ignition temperature.
thermonuclear fusion reactions: these will At a minimum, the power loss is by bremsstrah-
begin in the latter part of the 1980s. Successful lung photon radiation from free electrons de-
operation of the experiments will demonstrate flected in coulomb collisions. With bremsstrah-
the scientific feasibility, and lead to research lung loss the ignition temperature is about
reactors to investigate the engineering and com- 4 X 107 K. Much greater power loss may be
mercial potential of fusion power near the be- produced by excitation radiation from elec-
ginning of the next century. trons tightly bound to partially ionized heavy
The demonstration of the feasibility of fusion impurity atoms: only 0.1 % of iron atoms will
power is taking longer than fission because of increase the ignition temperature to 7 X 107 K
the large characteristic dimensions required for and ~2.s% of iron will prevent ignition com-
thermal isolation of the plasma, and because pletely. Since the fusion and radiation powers
a very high temperature is required to start the scale with density in the same way, the ignition
reactions. This means that comparatively ex- temperatures are independent of d.ensity. It is
pensive inflexible experiments are required to therefore important to keep the plasma very
explore new ideas. pure.
Fusion Reactor Parameters A fundamental Theoretically a uniform static magnetic field
problem of fusion power is thermally to isolate reduces the particle diffusion coefficient by
and confine the plasma while it is heated and about a factor (WTc)2 , where Tc is the collision
the reactions occur. In stars the plasma is held time, W (=eBlm) is the cyclotron frequency, and
by gravitational force, but this force is too elm is the charge-to-mass ratio of the particle.
weak to hold the small mass of plasma in a This is because the magnetic field restricts the
reactor. Also, practical electrostatic fields are trajectory of charged particles to small circular
too weak, even if Ernshaw's theorem (that orbits so that the displacement at each collision
there is no equilibrium for a system of charged is a cyclotron orbit diameter rather than a mean
particles in an electric field) can be overcome. free path. The reduction of diffusion velocity
Two approaches to solve the containment prob- is proportional to the square of the magnetic
lem are being studied: one, which uses magnetic field intensity, and since elm is larger for the
forces, is described in this article; the other, electron than for ions, the electrons determine
which uses inertial forces, is described in the the overall plasma diffusion velocity; the ion
article on LASER FUSION. density follows the electron density closely
A magnetic field can exert pressure on plasma because very little charge imbalance can exist
because the free electrons make the plasma a in a plasma. When the electron-ion collisions
good electrical conductor. A magnetic field B are simple two-body coulomb collisions, the
external to the plasma causes an induced cur- diffusion is called classical. Fusion reactor
rent in the plasma surface which interacts with parameters are such that WTc is sufficient for
the field to produce a pressure perpendicular to classical containment for the period of the
the magnetic field lines proportional to B2 ; this fusion burn. However, in practice it is found
follows from the fundamental laws of electro- that there are waves and instabilities in the
magnetics. The pressure of plasma that can be plasma-magnetic field system and the particles
supported is usually appreciably lower than the act together in groups which effectively collide
applied magnetic field pressure because some even under conditions where two-body coulomb
magnetic field must be mixed with the plasma collisions are negligible. The mechanism of these
to make the overall system stable. The external "cooperative phenomena" is not completely
magnetic field pressure must then balance the clear, but it appears that gradients of tempera-
sum of the plasma pressure and the internal ture, current, and density play a part. It has
magnetic field pressure. A measure of the effi- been argued that, at worst, the diffusion velocity
ciency of using the confining magnetic field in the absence of a magnetic field may be
is ~, defined by the ratio of plasma pressure reduced by a factor WTc and not (WTcP as
divided by the external magnetic field pressure, with classical diffusion. Such enhanced diffu-
i.e. ~ a: nTIB2, where n is the particle number sion, with an appropriate numerical factor,
density and T is the temperature. called Bohm diffusion, scales as TIB and is
The magnetic field also hinders the escape of independent of the collision frequency. Early
501 FUSION POWER

experimental results indicated diffusion times containment time is required to control the
of only a few times the Bohm rate, which is plasma temperature and therefore the thermo-
inadequate for a reactor, but in the last ten nuclear burn, which could last hundreds of
years the ratio has been increased to 10 3 -1 04 , seconds. Too high a temperature may cause
which is good enough. the {3 to exceed a stability limit, causing the
Theoretically the thermal conduction should plasma to expand and the burn to extinguish.
also be reduced by a factor of about (W1'c)2 by Ideas to control the temperature are to vary
the magnetic field, but, as with particle diffu- the density, or inject particles, or adjust the
sion, much greater values are observed experi- loss by producing nonuniformities in the mag-
mentally and often dominate the loss power. An netic field, or change the magnetic field slightly
increase of ~ 1OOX for electron thermal conduc- to compress/decompress the plasma. Although
tion and up to ~ 1OX for ion thermal conduction the temperature of a thermonuclear fusion
are commonly observed. The loss is expressed reactor is high, the low density means that the
in terms of an empirical energy containment heat content is so low that there is no explosion
time (1') defined by the plasma energy density hazard if the plasma were to expand to the wall
(nT) divided by the loss power. The fusion of the containment vessel.
power balanced against the loss power gives the At the operating density and temperature,
product n1' as a function of temperature. This thermonuclear power density averaged over
has a minimum of the plasma and blanket is considerably less than
the power density in the core of a fission reac-
(4) tor. For instance, with n = 2 X 10 20 m -3 and
T ~ 10 8 K the total fusion power density is
at about T = 3 X 10 8 K, corresponding to an- ~4 MW/m 3 compared to ~80 MW/m 3 core
other condition for ignition. Over a wide range power density in a PWR fission reactor and
of temperature near ignition the thermonuclear ~500 MW/m 3 in a fast fission reactor. An
power is proportional to n 2T2, which together important difference between fission and fusion
with the definitions of l' and {3 gives related reactors is the form in which the power appears.
ignition conditions n T1' ;;;;. 4 X 10 28 or {31' > In fission the power is mostly thermal, whereas
3.3/B2 in the range of temperature of interest. the power from D/T fusion is produced in two
For example, with T = 1 X 10 8 K and n = 2 X parts: 25% in IX power and 75% in very fast
10 20 m- 3 an energy containment time of 2 neutrons. The neutrons pass out of the plasma
seconds is required for ignition, which, with and generate heat in a "blanket," which is a heat
a typical value for the magnetic field of 5 Tesla, exchanger combined with a tritium breeder; it
gives a requirement on {3 of 6.6%. is coaxial with the plasma and has a thickness
It is an important object of present research of ~0.5 m, fixed by the neutron mean free
to understand the scaling and physical processes path. The minimum diameter of the tube con-
that determine l' and {3 and obtain ignition. The taining the plasma (the "first wall") is deter-
ignition condition given in Eq. (4) is similar to mined by the thermal insulation produced by
the condition for net power output from the the magn..etic field and the economic need to
reactor, derived by Lawson. If the efficiency of make it ,G the thickness of the blanket and
recycling the total power output to heat more neutron shield. In practice the first wall diam-
plasma is 0.2 (probably optimistic), Lawson's eter is in the region of 4 m.
condition gives n1' > 1 X 10 20 s/m 3 for a net The large minimum diameter of the first wall
power output. in fusion reactors also means that the first wall
Since the thermonuclear power density at the power load, which is proportional to the diam-
operating temperature is proportional to n 2 T2, eter, is an important limiting parameter in the
it follows from the definition of {3 that it is also reactor economy. The first wall conditions are
proportional to B4{32. This shows that it is very more severe than for the pin clad in a fission
important to operate at a high magnetic field reactor because of the following factors; a larger
and {3. The limit on magnetic field is the strength temperature gradient; a higher neutron energy
of the magnet coils (which must support the (~10 times fast reactor value); thermal cycling
magnetic pressure) or the loss of superconduc- due to pulsed burn of tens to hundreds of
tivity. Typically, fusion reactors are designed seconds; it must not liberate impurities; it must
for a magnetic field of between 5 and 10 Tesla. withstand the external atmospheric pressure;
The limit on {3 is determined by equilibrium and and it is more difficult to change because the
stability of the plasma in the magnetic field complicated magnetic field coils and blanket
which depending on the magnetic configuration system must be moved. At present, tentative
of the system, may range from a few times fusion reactor designs are based on first wall
10- 2 to a few times 10- 1 . total power loadings in the region of 1 MW /m 2
The {3, temperature and magnetic field deter- and allow for a new wall every 1 or 2 years.
mine the operating particle density of the fusion The crucial design base values for the first wall
reactor to be near 10 20 m -3, which is about loading must await experimental measurements
10-5 of atmospheric particle density. After with high energy, high fluence neutrons to be
ignition some means of controlling the energy produced in fusion engineering test facilities.
FUSION POWER 502

Although the size and cost of these facilities Second, to optimize the charge exchange effi-
will be a considerable fraction of a full scale ciency in the gas cell, the beam energy should
fusion reactor cost, a number are in the con- be chosen such that the velocity is approxi-
ceptual design stage, aiming at operation near mately the orbital velocity of electrons in the
the end of the 1990s. atoms; against this the heating increases with
The large characteristic dimension of the energy so a compromise is chosen, typically
plasma and the limitation of power loading in -160 keY for a deuterium beam with a cor-
the first wall may make the cost of the fusion responding efficiency of -15%. Third, the
nuclear island larger than that of the fission plasma density must be such that the beam is
case. Recent estimates indicate a cost ratio of absorbed in the plasma-too great a density
-8, and, taking into account the cost of the causes absorption at the surface and too little
conventional plant, the overall fusion cost has density allows the beam to pass through the
been estimated to be -2 times that of a PWR. plasma. The density is probably rather lower
However, because so many of the factors are than the density required for the thermo-
difficult to quantify, the costing of a fusion nuclear burn, so further fuelling may be re-
power station can at best be only very approxi- quired, even allowing for fuel added by the
mate. beam. Last, because of the beam cross section,
Plasma Heating It can be shown simply from which is determined by the ion source current
a balance of thermonuclear fusion power and density limitations and beam divergence, a
power loss, allowing for a contribution from number of large holes are required in the vessel
the hot a particles, that the power required to containing the plasma to allow the beams to
ignite the plasma is -B4{31 2 /2 MW/m 3 where {3I, enter. A significant fraction of fusion power
the value of {3 at ignition, is -1.3/reB2. For and tritium production may be lost when
example, a thermonuclear fusion reactor with neutrons escape through the holes, and, perhaps
a plasma volume of 200 m3 , B =5 T and {31 of more important, neutron bombardment of the
6% would require a pulse of -56 MW heating ion source will cause activation and maintenance
power listing for a few energy containment difficulties. A way of overcoming the energy
times (of -I second). With a heater efficiency limitations which is at an early stage of develop-
of 15%, the power input to the heater would ment is to use a negative ion source. To produce
be -380 MW. After ignition, the fusion burn the beam of neutral atoms, electrons can be
may last for 100 seconds at a power output of removed from negative ions with a much higher
I GW, so the total fusion energy produced is efficiency (50-90%) than the charge exchange
-150 times the energy required to heat the efficiency to positive ions. The efficiency is
plasma. insensitive to the beam energy, enabling beam
Many fusion reactor designs employ ohmic, energies of 500 keY and a higher plasma den-
power to ionize and start heating the fuel sity to be contemplated. At high beam energies,
charge. When the electron temperature (Te) the time to thermalize the beam energy with
rises, the ohmic heating power falls because the plasma may be important, during which a
the resistivity is proportional to Te -3/2, and significant fraction of the fusion power may
to ignite the plasma additional heating is re- come from nonthermal particles.
quired. Adiabatic compression heating, pro- Electromagnetic (RF) heating is at an earlier
duced by suddenly increasing the confining stage of development than neutral injection
field, is unsuitable because of the limited vac- heating using positive ion sources. It has the
uum space left around the plasma which must advantages of higher efficiency and reduced size
be filled by expensively produced magnetic of the access holes in the blanket. There are
flux, and also because of engineering limits three types of interest: electron cyclotron
on the maximum field available. There are two Fesonance heating (ECRH) which, typically,
methods being developed for the additional occurs at a frequency of 200 GHz (wavelength
heating: neutral injection and the application -1.5 X 10-3 m); lower hybrid resonance heat-
of electromagnetic waves. At the moment ing, which occurs at a frequency around 3 GHz
which method is best is not clear. (wavelength -10- 1 m); and ion cyclotron reso-
The principle of neutral injection heating is nance heating (ICRH) with a frequency of
to use a very high power beam of atoms which, about 65 MHz (wavelength -5 m). Generation
being neutral, penetrates the magnetic field. problems increase with frequency, whereas
When the beam collides with the plasma it is coupling the power to the plasma is best for
ionized and so is confined by the magnetic field, ECRH.
subsequently sharing its energy with the plasma. The most successful method so far has been
The neutral beam is made by passing positive ICRH. This employs standard low frequency
ions from a high current ion source through a sources which have a high efficiency (around
gas cell so that they pick up electrons from the 90%) and overall, the efficiency of heating the
gas atoms by charge exchange. Neutral injec- plasma is between 20 and 60%. Input power
tion heating has some fundamental limitations. densities of -50 MW/m2 have been investigated.
First, the gas source current is limited by heating The RF is launched from antennae near the
of the acceleration electrode grid to -2 kA/m 2 . plasma edge, producing a compression wave
503 FUSION POWER

which propagates across the magnetic field radially, the wave spirals slowly to the plasma
towards the plasma center. The magnetic field center, so that is spends a long time near the
is slightly nonuniform and the wave frequency surface where it may heat the walls and liberate
is chosen to resonate with the ion cyclotron impurities; and lastly, like ICRH, the density
frequency at the plasma center so that absorp- must exceed a critical value for the wave to
tion occurs. Originally it was intended to use propagate. Powerful klystron (~l MW/unit)
absorption of the wave at the second harmonic sources are available to generate the waves,
because no absorption is possible in the funda- although development is required for long
mental mode. Although absorption at the sec- pulse operation. Low power experiments indi-
ond harmonic is expected to be strong near cate an overall efficiency of ~30%.
reactor temperatures it is rather weak in present ECRH may have the potential for controlling
experiments. Fortunately it was found that the certain plasma instabilities associated with
presence of a few per cent concentration of adverse plasma current profiles as a function
another species (e.g., H or He) greatly increased of radius. It is hoped to avoid the wrong cur-
the absorption efficiency because of the ion rent profile by using the heating to control the
hybrid resonance and subsequently the majority temperature and thereby the conductivity as a
species is heated by collisions. Potential diffi- function of radius.
culties with ICRH in a reactor are neutron bom- The currents are induced by a changing mag-
bardment and electrical breakdown of the netic flux. Because of the limited magnetic
insulator which support the antennae and power flux swing that is possible in a transformer,
absorption near the wall which may liberate fusion reactors that depend on plasma current
impurities. It may be possible to reduce the may only be suitable for a pulsed burn. Pulsed
latter problem by focusing the waves toward systems are not desirable because the thermal
the plasma center, but with ICRH it cannot be and mechanical strains impose severe engineer-
eliminated because the wavelength is apprecia- ing limitations. A potential way of overcoming
ble compared to the plasma dimensions. A the problem is to use the heating method to
further problem is that the density must exceed drive the current continuously. This has been
~l 0 19 m -3 for the waves to propagate, so that demonstrated in principle with lower hybrid
the antennae must be placed in the surface heating, tangental neutral beam injection, and
plasma. ECRH. Driven systems may have some economic
In contrast, ECRH waves have no launching penality in that about 10% of the reactor elec-
problems as long as the frequency exceeds the trical output might be required to drive the
electron plasma frequency, which is propor- current.
tional to the square root of the plasma density. Magnetic Containment Although a uniform
The generation of powerful very high frequency magnetic field greatly restricts diffusion perpen-
fields has only recently been possible with the dicular to the field lines, it has no effect on
development of the gyrotron oscillator and the transport parallel to the field lines. To confine
frequency condition should not seriously limit the plasma parallel to the field, the lines must
the plasma density. The waves are directed to be curved to form either a toroid or a mirror.
the plasma via guides which are overmoded A toroid is effectively a doughnut shape pro-
(approximately a few cm dimensions) to enable duced by bending a cylinder into a circle so
the high power densities to be carried. It is that there are no ends through which the plasma
envisaged that ~200 0.3-MW gyrotons will be can escape. The axis of the cylinder is called the
used. Advantages over ICRH are that because of minor axis and the axis about which it is bent
the small wavelength the heating can be better is called the major axis. The magnetic field is
focused at the plasma center; smaller holes are produced by solenoidal coils placed around the
required in the blanket; and dog-leg waveguides bent cylinder. The magnetic mirror geometry
can be used to avoid neutrons escaping. The is a straight cylinder with magnetic field again
overall efficiency is similar to ICRH, ~ 50%. produced by coaxial solenoid coils. To restrict
A crucial element of success will be the devel- plasma flow parallel to the cylinder axis the
opment of reliable high power sources. magnetic field is increased near each end so
Lower hybrid heating uses compression waves, that the field lines are curved near the region
like ICRH. The frequency, which is between of increased field, creating a component of
the electron cyclotron frequency and the ion magnetic pressure which reflects particles mov-
plasma frequency, is around a few GHZ, so ing axially back toward the main plasma. Un-
dog-leg waveguides can be used, as with ECRH, fortunately there is always a fraction of particles
avoiding neutrons escaping. The wave electric with sufficient energy directed parallel to the
field must be polarized parallel to the plasma axis to overcome the magnetic pressure and
magnetic field for absorption. At the resonance, escape. Furthermore, the fraction is maintained
the wave becomes electrostatic and ions are by collisions in the main body of the plasma.
heated directly. Difficulties are that the absorp- The great majority of magnetic confinement
tion depends on the plasma temperature, and experiments are now toroidal, although in the
the frequency of the waves must be changed US and USSR there is a "second line" of work
as the heating proceeds; rather than move on the mirror geometry. Advantages of mirror
FUSION POWER 504

over toroid are better access, possibly better poloidal field to achieve confinement is the
stability, and smaller unit size, but adequate poloidal beta, defined by {3p ex; nTIBp2 which
containment of the axially moving particles is ~l. Since the current must not be allowed
remains a severe problem. to fall to zero, the thermonuclear burn must
In a toroid, the magnetic field gradient is be produced in a pulse lasting for the period of
inward along the major radius because the the transformer action.
solenoid coils must be more closely spaced on The helical transform in the tokamak is not
the inside of the cylinder, so concentrating the able to provide equilibrium because, unlike
field. In this simple form, there would be loss the stellarator, the current that produces the
of containment because the ions and electrons helical field is not flowing in a rigid conductor.
shift in opposite directions in the field gradient Equilibrium is, however, easily achieved either
and the resultant charge separation causes an by using a conducting wall (called a flux-
electric field to develop which, combined with conserving tokamak) so that the outward pres-
the magnetic field, cause an outward drift of sure compresses flux between the plasma and
both ions and electrons. Containment can be wall, or, more commonly, by applying a vertical
restored by adding another magnetic field magnetic field (Bv) which, acting with the
orthogonal to the first, looping the minor axis. plasma current I, produces a force I X Bv to
The resultant magnetic field is then helical balance the outward pressure force. With the
about the minor axis, and the angular displace- latter method, Bv may be varied by a feedback
ment of the field line about the minor axis for circuit controlled by optical sensors of the
one revolution about the major axis is known plasma position. This method of maintaining
as the rotational transform. The effect is to equilibrium does, however, limit the plasma
allow particles moving parallel to the helical pressure, because of the condition that there
field lines to reduce the charge accumulation, shOUld be no null field point in the plasma
thereby reducing the drift. which would be a source of plasma loss. This
Three types of toroidal confinement geometry gives a maximum {3p::S; 1/2e where e = plasma
are being investigated: the stellarator, the toka- inverse aspect ratio = minor radius/major radius.
mak, and the reversed field pinch. In a stellara- For a typical e of! , {3p ::s; ~ .
tor toroid, the helical transform field is gen- A third type of toroidal confinement geometry
erated by I pairs of helical coils outside the is the reversed field pinch. It differs from the
plasma, wound around the cylinder. Experi- tokamak in that a shear condition for plasma
mentally, values of 1=2 or 3 are investigated. stability is used. The condition is obtained by
The windings in each pair carry a positive and making Bp - BT with the field reversed near
a negative current. A disadvantage of the the plasma surface. As with the flux-conserving
stellarator field configuration is the fall-off of tokamak, eqUilibrium is obtained by using a
field from the coils, making a given field more conducting shell.
expensive to produce and the maximum mag- An important theoretical aspect of toroidal
netic field in the plasma less than with a simple systems is the effect of the helical geometry
solenoid. Another problem is that, to accommo- on confinement. Particles moving along helical
date the stellarator coil structure, a large ratio magnetic flux tubes experience a decrease in
of major radius to minor radius is required which magnetic field as the flux loops around the
may result in an inconveniently large reactor outside of the minor axis, causing the tube to
size. Against this, there are the important po- bulge. Thus, if a length of helical flux is un-
tential advantages that the confining field is wound it would have a series of constrictions
determined by currents in fixed conductors which act like a succession of magnetic mirrors.
external to the plasma enabling the possibility Thus particles with a low velocity parallel to
of continuous operation, and containment the flux tend to congregate in the low magnetic
and heating are possible without the necessity field regions: these are called trapped particles.
of currents in the plasma which may cause The rest of the particles, which have a high
instabilities. parallel velocity, pass unimpeded along the flux
The second and most developed confinement tube; these are called passing particles. Because
system is the tokamak toroid; it originated in of either trapping or collisions some particles
the USSR and its name derives from the Russian cannot move around the helix to allow cancel-
words for toroidal chamber and magnetic field. lation of the charge separation due to drifting
A simple solenoid, coaxial with the minor axis in the magnetic field gradient. The noncancella-
is used to produce the main or (toroidal) mag- tion results in an outward drift, enhancing the
netic field (BT), and the transform is made by diffusion rate above the classical value. Three
a toroidal plasma current parallel to the minor regimes of enhanced diffusion (called neoclas-
axis instead of by external coils. This produces sical) have been identified, depending on the
a weak component of magnetic field (the po- electron-ion collision rate ll. At low density the
loidal field, B p) around the minor axis; typically collision rate is so low that some particles can
Bp/BT ~ 0.1. The current is induced by trans- bounce back and forth many times between the
former action (either air-core or iron-core). mirrors. This is called the banana diffusion
A measure of the effectiveness of using the regime, which is named after the shape of the
505 FUSION POWER

trajectory of the particle motion. The diffusion the poloidal magnetic field (Bp) produced by
rate is q2 E- 3/ 2 times the classical value where the plasma current when the toroidal magnetic
the safety factor q = EBT/Bp. Typically, q - 2 field is comparatively small. The system is
and E -1 and the classical diffusion is increased unstable because Bp increases as the plasma
by -20X. When collisions within the mirror are radius decreases and so a given initial inward
much more frequent than the bounce fre- perturbation grows. The resultant deformation,
quency there are no trapped particles and known as the sausage instability, may be
charge cancellation is inhibited by collisions of stabilized by an axial field inside the plasma
the passing particles, increasing the classical of magnitude> Bp/V2. The next surface
diffusion by a factor 1 + q 2 which is - 5X; this mode, when m = 1, n = 1, is caused by in-
is called the Pfirsch-SchlUter regime. In the creased pressure inside a given kink perturba-
intermediate collisional regime the enhance- tion, where the field lines are close together,
ment factor is ve/Rqv where Ve is the electron increasing the amplitude of the perturbation.
thermal velocit~ Since the classical diffusion is It is stable when q at the plasma surface is
proportional to v, the enhanced rate is inde- greater than 1; this is the Kruskal-Shafranov
pendent of v, producing a flat region of the condition and is basic to tokamak operation.
graph of diffusion vs. v. Consequently this is In a tokamak higher mode stability can be
called the plateau regime. achieved by suitably peaking the current profile
Other nonuniformities in the magnetic field toward the plasma axis. Kink modes may also
may lead to other trapped particle effects. For be stabilized by image currents if a conducting
example in stellarators, the multipole fields wall is put close to the plasma.
cause helical modulations which make the mean Inside the plasma the interchange of plasma
position of the bananas drift, leading to plots and tubes of magnetic flux can cause instability
of the orbit positions that look like large ba- (called internal pressure driven modes). For
nanas, and consequently called super-bananas. stability, the average curvature of the lines of

f
A related effect of gradients on particle trap- force must be convex towards the plasma, lead-
ping is ripple caused when solenoidal cells are
used. A ripple of a few per cent on the minor ing to the "minimumB" condition 0 dilB > 0
axis can cause serious loss, particularly for a
particles because of their large cyclotron orbits. taken along a flux tube. These internal modes
Further theoretical predictions of trapped are localized near surfaces at which the lines of
particles effects include the bootstrap current force join on themselves when they pass around
and the trapped particle pinch effect. The boot- the torus, i.e., when q =min. Suydam showed
strap current is caused by the radial diffusion that they can also be stabilized by changing the
velocity (vr) of electrons in the banana regime. angle of the helical magnetic field as a function
The diffusion velocity acting with the poloidal of distance from the minor axis by an amount
magnetic field produces a force proportional depending on the pressure gradient. The sheared
to vrBp along the minor axis acts on the un- field acts like a basket weave that prevents
trapped ions and electrons to produce an axial interchange of plasma and flux.
current. The current is predicted to be sufficient However nonlinear effects can cause a more
to provide a tokamak rotational transform up complicated situation. For example, the plasma
to a (3p ~ 4e- 1/ 2 , offering a possible method current causes heating which changes the con-
for a continuously operating tokamak fusion ductivity, so affecting the current and magnetic
reactor. The trapped particle pinch effect (also field distribution. Also the finite resistivity,
called the Ware pinch effect) is caused by the which is a function of temperature, allows mag-
toroidal electric field used to drive the tokamak netic field diffusion such that oppositely di-
current. It makes the banana orbits drift towards rected field lines may join to form loops in the
the minor axis until the pinch force is balanced plasma. This is called resistive instability and
by the pressure gradient force. the loops are called magnetic islands; resistive
Stability The temperature and density in a instabilities may limit the effectiveness of shear
fusion reactor correspond to conditions in the to stabilize internal modes. Further, various
banana neoclassical regime, and gives an nr radial regions of the plasma may be connected
value easily satisfying the ignition and Lawson by rapid parallel diffusion along wandering
conditions. However the experimentally mea- magnetic flux lines. These are called ergodic
sured diffusion coefficients often exceed the regions. Overall, it seems that while the plasma
neoclassical values. This is thought to be due to under some conditions may be grossly stable,
various instabilities in the plasma-magnetic field mild nonlinear effects of instabilities within the
system. The instabilities are characterized by plasma may tend to flatten the internal pressure
two mode numbers: m, corresponding to the profile, increasing the pressure gradient near the
number of perturbations in the minor cross surface and enhancing diffusive loss processes.
section, and n, corresponding to the number Finally, a class of interchange instabilities
of perturbations parallel to the plasma minor called ballooning modes should be mentioned.
axis. The lowest surface mode, m = 0, is sym- These are local (high m, n) modes associated
metrical about the minor axis and is due to with the unfavorable curvature of magnetic
FUSION POWER 506

field near the outer regions of the toroidal T, {3, and n may cause difficulties in achieving
plasma. Ballooning may be imagined as like ignition or obtaining economical operation.
the unstable bulge at the weak point of an over- However, they may be connected with the
pumped bicycle inner tube. Calculations indi- evolution of impurities at high power operation
cate that these modes may limit the maximum so that improvements can be expected from
{3 in a tokamak to between 5 and 10%, although methods to improve plasma cleanliness.
it is possible that nonlinear effects may allow Many of these effects seem to be connected
the limit to be exceeded without disastrous with the presence of instabilities. Under many
loss. conditions ripples move along the plasma
It should be emphasized that plasma currents (Mirnov oscillations) and flat regions of the
may be the source of many of these instabilities. temperature profile have been observed which
Thus stellarators, in which the transform may could be interpreted as the effect of magnetic
be produced without plasma current, is likely islands. At the Murakami density limit the
to be most stable, followed by tokamaks and plasma may catastrophically disrupt. Although
lastly by the reversed field pinch. The reversed the plasma energy in a reactor is small ("'1 MJ),
field pinch operates with q < 1, relying on image there is concern that rapid release of magnetic
currents in the conducting wall and magnetic energy may strain the system.
shear for stability. Its cylindrical geometry A number of tokamak experiments are aimed
necessitates a region of reversed magnetic field at finding ways of injecting fuel and keeping
near the plasma surface. This automatically the plasma clean. Fuel injection is normally by
develops at an early unstable phase of the puffing gas on the outside of the plasma, relying
discharge, and is probably maintained by con- on an inverted density gradient or perhaps the
tinuous mild resistive instabilities. The ad- trapped particle pinch effect to transport new
vantage of the reversed field pinch is its ability fuel inward. Other experiments are to investi-
to operate at a much higher {3 than the tokamak; gate the practicality of injecting small pellets
theoretically the maximum is 0.5. of fuel. Experiments to cleanse the plasma
Experiments In the last ten years world involve scraping the outer layer of impure
fusion research has mainly concentrated on the plasma into a special chamber where it is neu-
tokamak geometry showing continuous progress tralized and pumped away. The scraping is
in extending the plasma parameters towards achieved by diverting the magnetic field lines
ignition with increased size of apparatus. Recent by using either special coils for a cooled deflec-
experiments have a major radius of "'1.3 m, a tor, called a pumped limiter.
minor radius of "'0.5 m, BT '" 5 T and I'" 6 X The second type of toroidal system, stellar a-
I 0 5 A, giving the record plasma parameters: tor, has received less support than tokamaks.
T '" 7 X 10 7 K (at n '" 5 X 10 19 m- 3 ), nT '" This is probably because of the relative success
3 X 10 19 (at n '" 10 21 m- 3 and T'" 10 7 K), of tokamaks in the early days when stellarators
{3 '" 4% and T '" 10- 1 s; these values were not gave containment times of only a few Bohm
all obtained at the same time, or in the same times, probably due to a combination of im-
apparatus. In many respects the scaling of these purities, field errors, and the heating method.
quantities encourages the belief that a toka- However in the last decade stellarator progress
mak fusion reactor with reasonable dimensions has been encouraging. Small low temperature
may be physically feasible. A major milestone ("'10 4 K) devices give classical containment,
will be the control of plasma heated by thermo- while larger devices (T'" 3 X 10 6 K) are begin-
nuclear fusion, which is an object of experi- ning to give neoclassical results in the plateau
ments at present being constructed. regime, as expected. Recently ion temperatures
However, although much sophisticated theory of '" 7 X 10 6 K, densities of 10 20 m -3 , {3 '" 1%
has been developed, many of the basic results and nT '" 4 X 10 18 have been reported with a
continue to defy satisfactory explanation. The radius of only 0.1 m and B'" 3 T. In some
tokamak results give little support for neoclas- experiments the plasma is heated by an axial
sical diffusion in that for years the electron current which causes an added transform; it
thermal diffusion coefficient was observed to is found that as the current is reduced, the
scale as lin and be independent of T and B. energy confinement time increases. Experi-
This corresponds to an energy confinement ments, where there is no current and the
time scaling T ex: na 2 first discovered on the plasma is either heated by neutral beam injec-
experiment Alcator. Recent larger experiments tion or RF, show the best confinement time
show the rather different scalings T ex: na 2 Tl/2 and have no MHD activity or major disruptions.
or na 2 q 3/4 at lower density and a peak of T as This indicates that the plasma currents, essen-
a function of density at high density. Also no tial for the tokamak configuration, are detri-
bootstrap current has been observed, the ob- mental to containment and may be responsible
served {3 as a function of heating saturates at for the density limitations. Theoretically 1 = 2
high heating power, and the plasma is cata- or 3 stellarator equilibria exist for {3 up to 5%,
strophically unstable (disrupts) above an aver- and there should be stability up to (3 '" 10%
age density (the Murakami limit) which is pro- when small e ("'0.05) and superimposed fields
portional to the current density. The limits on of different pitch are used. It appears that
507 FUSION POWER

because of resistive instabilities, shear in the stable; the curved field lines are, of course, only
magnetic field may not be sufficiently main- required near the ends when the system is long.
tained to suppress instabilities, and a minimum Although the containment is grossly stable too
B magnetic configuration is required. many particles acquire enough parallel velocity
The origin of the reversed field pinch is the to penetrate the mirrors. A possible way to
observation of a quiescent period that auto- overcome the problem that is being investigated
matically developed in unstable plasma in the is to heat the plasma in the mirror regions
large Zeta toroidal experiment of the early preferentially, so that the axial temperature
1960s. The parameters were major radius gradient produces an electrostatic field directed
1.5 m, minor radius 0.5 m and toroidal current to reduce the loss. Encouraging results of
4 X 10 5 A, giving a plasma temperature -1.5 X T-3X 10 6 K,n-3X 1019 m-3,{3-0.1-0.5,
10 6 K, density -5 X 10 19 m- 3 , {3 - 0.1 and and T - a few times 10-3 , have been achieved
energy containment time -0.003 to 0.01 s. and work is proceeding on larger experiments
This was explained by subsequent low {3 the- to investigate the scaling.
oretical analysis which showed that, in certain Toward Reactors Although there are prob-
ranges of poloidal and toroidal magnetic field, lems to be overcome in all confinement sys-
a force-free shear-stabilized magnetic field tems, the reactor potential of tokamak has
configuration with reversed field near the received most attention, followed by stellara-
surface is possible. In the 1970s the results tors, mirrors, and reversed field pinches.
were confirmed and experiments to extend Table 1 shows the parameters and objectives
them to higher temperature and {3 were started. of large tokamak experiments on the path to a
A major objective is to determine whether the reactor. The first three will be in full operation
magnetic configuration can be maintained for by the mid 1980s. The largest, JET, has a D-
the burn period of a fusion reactor either auto- shaped cross section which maximizes the (3.
matically by noncatastrophic small-scale insta- The total cost will be "'$450 million. It will
bilities or by the application of a reversed field operate at high density, and may even achieve
at the plasma surface. the ignition nT product. In constrast, TFTR
The only linear containment system with aims to operate at a lower density where non-
fusion reactor potential is the mirror. In its thermal reactions rather than ignition are sig-
simple form it is unstable because interchange nificant, possibly just achieving the Lawson nT
instabilities may develop where the magnetic product for energy breakeven. As a physics
field lines are concave towards the plasma. This milestone in fusion, the nT value has been com-
was overcome by adding an extra field produced pared to criticality in fission development,
by current flowing in conductors placed outside although it should be emphasized that it is
of the plasma. This produces a complicated not so definitive.
magnetic field configuration that is minimum {3 The next three tokamak experiments listed

TABLE 1.
R a BT IT Pulse Time
Experiment Objective "em) (m) (T) (MA) (s)

JET (Euratom) Approach to ignition 3.0 0.3 X 2.1 3.4 4.8 20


Maximize (3
Long pulse
TFTR (USA) Approach to energy breakeven 2.7 0.9 5.2 2.5 1.6
JT60 (Japan) Ignition parameters in hydrogen 3.0 1.0 4.5 2.7 5.0
plasma
Impurity control
TORUS II (France) Superconducting coils 2.2 0.7 4.5 1.7
Long pulse
RF heating
T-15 (USSR) Superconducting coils 2.4 0.8 3.5 1-2
Approach to ignition
INTOR (lAEA) Ignition 5.2 1.3 5.5 6.4 100
Blanket design
Fusion burn control
Technology of electricity
producing systems
FUSION POWER 508

are to investigate particular reactor require- loss of low radiation hazard compared to fission
ments : impurity control, long pulse, RF heating, reactors.
superconducting coils, etc. Lastly, INTOR is an The cost of magnetic confinement fusion re-
ignition experiment to test crucial engineering search in 1982 was ~$330 million in Europe,
aspects such as blanket design, materials, tritium $200 million in Japan, $460 million in the
handling, plasma burn control, and even limited USA, and probably a similar amount in the
electricity production. In 1982 it was in a con- USSR. These amounts are small compared to
ceptual design stage, the final design awaiting that spent on energy as a whole. Fusion power
results on near-ignition conditions from the has a large number of unresolved problems
JET generation of experiments. It is hoped to which may only be investigated with large ap-
be able to start construction of INTOR late in paratus; the risk of failure is far from negligible,
the 1990s. The subsequent stage of develop- but the benefit of success would be great.
ment will be a full scale prototype reactor. One
of the major problems will be first wall erosion J. A. REYNOLDS
caused by the high energy neutrons, estimated
to be ~ 100 metric tons of material per gigawatt References
(electric )-year.
It has been suggested that advantage could be "Comments on Modern Physics, Part E," New York,
taken of the high neutron energies by using the Gordon and Breach.
neutrons to produce fission power from U238 , Reports on the bi-annual conferences on Plasma
or to breed plutonium from U238 which is ~10 Physics and Controlled Nuclear Fusion Research,
times more efficient than in fast reactors be- Vienna, Austria, International Atomic Energy
cause of the harder neutron spectrum. These Agency.
schemes are known as hybrid fusion-fission.
They have the advantage of lower energy break- Cross-references: BREMSSTRAHLUNG, FISSION,
even nr than pure fusion, but at the expense of NUCLEAR REACTIONS, NUCLEAR REACTOR,
complicated engineering construction and the PLASMAS, LASER FUSION.
G
GAS LAWS the molecules together. The equation of state
of a real gas is therefore determined by the na-
The term "gas law" refers to the thermody- ture of the intermolecular forces. One of the
namic equation of state of a gas, which is an earliest, simplest, and most useful equations is
equation relating the pressure P, the volume V, that of van der Waals
the absolute temperature T, and the number of
moles v. The equation of state is a valid relation
when and only when the gas is in a state of ther-
modynamic equilibrium; the pressure and tem-
(p + ;2) (V- b) =RT(for 1 mole) (1)
perature are then constant and uniform through-
out the volume occupied by the gas. where a and b are constants, determined empiri-
Ideal or Perfect Gas The ideal or perfect gas cally for each gas, which are related to the at-
is defined thermodynamically by the two condi- tractive and repulsive forces, respectively. This
tions: (1) it obeys the equation of state: pV= equation can be related theoretically, in first ap-
vR T where R is the gas constant per mole (R = proximation, to a molecular model in which the
8.3169 X 10 7 erg mole- 1 °C-1), and (2) the molecules are represented by rigid elastic spheres
internal energy U is independent of pressure and that weakly attract each other. The van der
volume and is a function only of the tempera- Waals equation accounts qualitatively for the
ture «aUla V)T = 0). The statistical-mechanical liquid-vapor phase transition. The constants a
definition of an ideal gas is that it is a gas of and b can be determined from critical point
noninteracting molecules, i.e., the molecules data.
exert no appreciable forces of attraction or re- Other equations of state for an imperfect gas
pulsion on each other. Since the notion of a have been proposed which are more accurate
finite "size" of a molecule connotes the exis- than the van der Waals equation, e.g., the equa-
tence of a repulsion which prevents two mole- tions of Dieterici, Berthelot, Beattie-Bridgeman,
cules from overlapping each other, the mole- and Benedict-Webb-Rubin. These empirical
cules of an ideal gas must be of negligible "size." equations are useful in treating the thermody-
The two thermodynamic properties can be de- namic properties of gases at high densities. At
duced from the statistical mechanical definition. low densities, the empirical equations have been
The ideal gas equation: p V = vR T embodies superseded by the virial equation of state
the experimental laws of Boyle, Charles and
Gay-Lussac. It can be derived either from ki- - pV B(T) C(T) D(T)
= I +--+--+--+ ... (for I mole)
netic theory or from statistical mechanics. It is RT V V2 V3
often written in the form: p = nkT where n is
the molecular number density and k is Boltz- (2)
mann's constant (k = 1.3804 X 10- 16 erg
°C-1). In the case of a mixture of inert, ideal where B(T), C(T), and D(T) depend on· the na-
gases, each gas obeys the equation: Pi = nikT ture of the gas and are called the second, third
where Pi and ni are, respectively, the partial and fourth virial coefficients, respectively. The
pressure and partial density of the ith compo- departures of a gas from ideality are represented
nent gas. Boyle's law Will not hold if the gases in in this case by a power series in the density. We
the mixture react chemically since a change in p may rewrite Eq. (2) as
or V will in general change the value of v. plkT = n + iJ(T)n2 + C(T)n 3 + D(T)n4 + . . .
Real or Imperfect Gas The ideal gas law is, of
course, only an approximation which holds at (3)
temperatures sufficiently far above the critical
temperature and at sufficiently low densities. where iJ, C, D are the virial coefficients referred
The ordinary properties of bulk matter in the to one molecule. .
liquid and solid states require the existence of The basic experimental problem in this field
strong intermolecular repulsions which endow is to measure the virial coefficients of different
the molecules with a finite "size" and also re- gases as functions of the temperature. The
quire the existence of attractive forces to hold higher-order coefficients beyond B(T) and C(T)

509
GAS LAWS 510

are very difficult to measure. The basic theoreti- lightest gases Hz and He and especially at low
cal problem is to calculate the virial coefficients temperatures, it is necessary to introduce quan-
from an assumed form for the intermolecular tum corrections in the equation of state.
potential energy and, ultimately, to derive the Intermolecular Forces and the Equation of
intermolecular potential from quantum me- State In order to calculate the cluster integrals
chanics. and virial coefficients, one must first choose a
Statistical Mechanics of the Imperfect Gas form for the intermolecular potential u(r). In
The derivation of the virial equation of state principle, the potential u(r) is determined by
from the intermolecular potential involves sev- quantum mechanics for any pair of molecules
eral steps which may be summarized as follows: and could be found by solving the Schrodinger

Virial Equation -+- Helmholtz Function -+- Canonical Partition Function


-+-Grand Potential (PV) -+-Grand Partition Function
-+-Configuration Integral -+-Cluster Integrals -+-Intermolecular Potential

The two routes indicated are via the canonical equation. In practice, this is virtually impossible,
and the grand ensembles. The most difficult and quantum-mechanical calculations have been
step is the evaluation of the configuratior. made only for the very simplest molecules. In
integral: the case of interactions between neutral, non-
polar, spherical molecules, e.g., noble-gas atoms,
the quantum-theoretical interaction energy can
be approximately decomposed into several parts,
of which the two most important are the dis-
persion energy and the valence repulsion energy.
dTI dTz "'dTN (4) The former corresponds to the van der Waals at-
traction and the latter to the van der Waals re-
where <I> is the total intermolecular potential pulsion. The dispersion energy varies inversely
energy of the gas of N molecules. The proper with the sixth power of the distance. The va-
way to evaluate QN was first sketched by Ursell lence-repulsion energy takes account of the
and later carried through by Mayer who as- short-range repulsion that sets in when the elec-
sumed central forces that were pairwise additive, tron distributions of the two molecules begin to
Le. , overlap, and it is associated with the Pauli ex-
<I> =L u(rij)
clusion principle. There is no simple, general
form for the dependence of the valence repul-
i<j sion potential on the distance: it is often em-
where u(rij) is the potential between molecules pirically represented by A e-ar or by pr-n where
i and j. Neither of these assumptions is correct, n = 12 is commonly used. In the case of mole-
but they appear to be good approximations, and cules that possess permanent electric dipole or
with their aid, it is possible to evaluate QN quadrupole moments, there are additional con-
rigorously in terms of the so-called cluster tributions to the van der Waals attraction but
integrals, b/, which are integrals over the co- these are usually less important than the dis-
ordinates of I molecules only. B(T) is obtained persion energy (HzO is an exception).
directly from b 2 , C(T) is obtained from b 3 and In the absence of a complete quantum-
b 2 , and the lth virial coefficient requires evalua- mechanical expression for the intermolecular
tion of the cluster .integrals \.1p through b/. Ex- potential, it is necessary to approximate the
plicit formulas for B (T) and C(T) are: potential by a semi-empirical formula, contain-
ing one or more adjustable constants, which is
chosen on the grounds of physical plausibility
and mathematical convenience. The semi-empir-
ical force law is then used to calculate macro-
scopic properties that are known from experi-
ment, and the parameters in the force law are
adjusted to give the best agreement with experi-
ment. Given the form of the intermolecular po-
tential, it is possible to calculate not only the
virial coefficients in the equation of state but
where Ii; == exp [- u(rij)/kTj - 1. Higher coeffi- also the kinetic-theory transport coefficients
cients in the virial series are increasingly more (Le., the viscosity coefficient, the thermal con-
difficult to evaluate. ductivity, and the various diffusion coefficients
The calculations just described are based on of the gas) and the density, compressibility, and
the classical Maxwell-Boltzmann statistics and sublimation energy of the solid. A particular
are sufficiently accurate for most gases at ordi- functional representation of the intermolecular
nary temperatures. However, in the case of the potential can be considered satisfactory only if
511 GASES: THERMODYNAMIC PROPERTIES

it is possible to secure agreement with all ex- A promising theoretical approach to the equa-
perimental data involving a particular pair of tion of state of a dense gas or liquid is provided
molecules with a single choice of the parameters by the method of the radial distribution func-
that appear in the law of force. tiong(r, n, T). Because of intermolecular forces,
The semiempiricallaw that is most frequently the actual density at a small distance r from a
used to represent the interaction between non- given molecule is different from the bulk den-
polar molecules is the Lennard-Jones (12,6) sity n and is represented by ng(r, n, T). Thus
potential: the radial distribution function measures the
u(r) = 4e[(alr)12 - (alr)6] (7) effect of intermolecular forces on the probabil-
where e and a are parameters characterizing thil ity of finding two molecules close together.
particular pair of molecules. This simple two- While it is difficult to determine g(r, n, T)
parameter function, when inserted in Eq. (5), theoretically, it can be found experimentally
predicts a temperature variation of the second from the diffraction pattern observed when
virial coefficient in good agreement with experi- x-rays are scattered by the fluid.
ment. The same potential with slightly different R. D. PRESENT
values of e and a also explains the temperature
variation of the viscosity coefficient over a sub-
stantial temperature range. Third virial coeffi- References
cients calculated from Eq. (6) do not agree with Cowling, T. G., "Molecules in Motion," London,
experiment at low temperatures near the criti- Hutchinson & Co., Ltd., 1950 (for the general
cal point. The large discrepancies have been reader).
attributed to three-body forces which invalidate Hill, T. L., "Statistical Mechanics," Ch. 5 and 6, New
the assumption of pairwise additivity. Calcula- York, McGraw-Hill Book Co., 1946 (statistical me-
tions of virial coefficients and transport coeffi- chanics of imperfect gases and dense fluids).
cients of gases and of equilibrium properties of Hirschfelder, J. 0., Curtiss, C. F., and Bird, R. B.,
the crystal lattices have also been made for "Molecular Theory of Gases and Liquids," Chs. 3 and
other semiempirical potential functions, but the 4, New York, John Wiley & Sons, 1954 (covers all
results are not very different from those found aspects of the subject and is the standard reference
with the (12, 6) potential. Nevertheless, there is in this field).
accumulating evidence both from experiment, Present, R. D., "Kinetic Theory of Gases," Ch. 6 and
e.g., atomic beam scattering, and from theory, 12, New York, McGraw-Hill Book Co., 1958 (kinetic
e.g., quantum-mechanical calculations of the theory of the second virial coefficient; intermolecular
dispersion energy, that the (12, 6) potential has forces).
serious defects, and it has been replaced in re- Rushbrooke, G. S., "Introduction to Statistical Me-
cent work by more complicated multiparameter chanics," Ch. 16, London, Oxford University Press,
potential functions. 1949 (good introduction to imperfecti\as theory).
Further advances in this field will come, on Levelt Sengers, J. M. H., Klein, M., and Gallagher, J. S.,
the theoretical side, from a more detailed "Pressure-Volume-Temperature Relationships of
knowledge of the intermolecular forces, and on Gases, Virial Coefficients," AEDC-TR-71-39, 1971,
the experimental side, from more accurate ways and American Institute of Physics Handbook.
of extracting virial coefficients from thermo-
dynamic data. Current values of C(T) are not Cross-references: COMPRESSIBILITY, GAS; GASES:
only subject to experimental errors in the p, V, THERMODYNAMIC PROPERTIES; INTERMOLEC-
T measurements but also to substantial uncer- ULAR FORCES; KINETIC THEORY; THERMODY-
tainties incurred in fitting the data with poly- NAMICS.
nomials in the density.
Dense Gases High-density gases cannot be
conveniently represented by the virial equation GASES: THERMODYNAMIC PROPERTIES
of state because of the slow convergence of the
virial series. Furthermore, the theoretical evalua- Fundamental Principles The thermodynamic
tion of the higher virial coefficients on the basis properties of a substance may be classified as
of any plausible molecular model would meet either reference properties, energy functions, or
with great computational difficulties. Other ap- derived properties. I The reference properties of
proaches are therefore needed, e.g., the empiri- a single-component system with their symbols
cal equations of state already mentioned and and units are pressure, p, Pa or Nm- 2 ; volume,
the principle of corresponding states. In the V, m 3 ; temperature, T, K; and entropy, S,
latter method, one introduces the reduced, di- JK-I mol-I. For a specific amount of a pure
mensionless variables: Pr = PiPe, Vr = VI Ve, and gas, it is necessary to specify only two of these
Tr = TITe where the subscript c refers to the reference properties to fix the state of the sys-
critical point. The principle of corresponding tem and its properties. For mixtures of gases,
states then asserts that all substances obey the the composition must also be specified to fix
same equation of state in terms of the reduced the system completely. The energy functions
variables. The variables may also be reduced in with their sym bois and units are internal energy,
terms of intermo)ecular potential parameters. U, J mol-I; enthalpy, H, J mol-I; Helmholtz
GASES: THERMODYNAMIC PROPERTIES 512

energy, A, J mol-I; and Gibbs energy, G, states are important for diatomic molecules at
J mol-I. These functions represent the energy higher temperatures but are entirely negligible
available for performing useful work under for most polyatomic molecules. Vibrational
various process conditions. Derived properties energy levels are obtained from an analysis of
with their symbols and units include heat infrared and Raman spectroscopic data by
capacity, C, JK-I mol-I; fugacity, t, Pa or applying the principles of wave mechanics and
Nm- 2; compression factor, Z, unitless; and the group theory.4, 7, 8 The interpretation of the
Joule-Thomson coefficient, fJ" K Pa- I or Km2 spectra includes the assumpti()n of a model for
N-I. the molecule with parameters such as bond
Properties are termed intensive if they are in- lengths, bond angles, and force constants. The
dependent of the amount of the material. Ex- parameters are varied within certain limits until
amples are pressure and temperature. Properties the best agreement with observed spectra is ob-
such as volume and entropy, which are depen- tained. Rotational energy levels are observed in
dent on the amount of material, are termed infrared, Raman, and microwave spectra. The
extensive. rotational energy includes not only the rotation
Absolute values may be determined for the of the molecule as a whole but also internal
reference properties, but the energy functions rotations by groups of atoms within the mole-
must be determined relative to an arbitrary zero cule and a pseudorotation in some ring
reference point. The internal energy, ug, of the molecules. 9
ideal gas at the absolute zero of temperature is For higher orders of accuracy in the calcula-
generally taken as the zero reference point of tion of the thermodynamic properties, addi-
the enthalpy and free energy functions. Other tional contributions may be determined that
reference points include a zero value for the en- are caused by the interaction between vibra-
thalpy of the ideal gas at the ice point, H273.IS, tional and rotational motions, centrifugal dis-
and another in which the sensible enthalpies are tortion of the molecule during rotation, and
combined with chemical energies. 2 In the latter anharmonicity of the vibrations. Another con-
base, the value of H298.IS is zero for the tribution due to nuclear spin can be, and
assigned reference elements so that the values generally is, neglected for all molecules except
of H298.15 for the various compounds are equal H2 and D2 since it causes a detectable effect
to their heats of formation from the assigned on measurable quantities only at very low
reference elements. temperatures. Adjustments are made for the
Thermodynamic properties of gases are cal- presence of isotopes in some diatomic and
culated for both the ideal gas state and the real polyatomic molecules.
gas state. A gas is defif\ed to be ideal if it fol- The calculation of the thermodynamic prop-
lows the simple equation of state, p V = R T, for erties of an ideal gas is based not only on
1 mole of gas. Gases behave in this manner only theory but also on accurate experimental vapor
at very low pressure, but the ideal gas state is a heat capacity, heat of vaporization, and low-
convenient reference state for the calculation of temperature calorimetric data. 9 The ideal gas
the thermodynamic properties. Thus, the ther- heat capacity cP and entropy SO are derived
modynamic standard state 3 is defined as the from these data and are compared to theoreti-
ideal gas at 1 atmosphere (= 101 325 Pa) pres- cal values. When differences are found, the
sure at each temperature, and it is denoted by a theoretical calculations are revised until there
superscript degree mark as in HO and So. The is good agreement. In this manner, new informa-
ideal gas properties have been calculated for tion is gained about the conformation of the
many substances, but the real gas properties are molecule, its frequencies of vibration, etc. Thus,
known for relatively few substances. experimental data provide a firm base to test
Thermodynamic properties are used in the theoretical calculations and improve the cal-
calculation of energy balances, reaction com- culation of all of the thermodynamic properties.
positions at chemical equilibrium, reaction tem- Theoretical calculations become increasingly
peratures, and the work involved in the com- complex as the molecular size increases. Thus, a
pression or expansion of gases in various method of increments has been devised to cal-
systems. culate the thermodynamic properties of large
Ideal Gas Properties The thermodynamic molecules based on an "anchor compound" for
properties of an ideal gas such as heat capacity a given homologous series. I ,10
at constant pressure, cp, enthalpy function, Real Gas Properties The thermodynamic
(HO - Hg)IT, entropy, So, and Gibbs energy properties of real gases are determined primarily
function, (Go - Gg)IT, in units of JK-l mol-I, from experimental compressibility (pressure-
are calculated from theoretical equations and volume-temperature) measurements (see COM-
from an analysis of spectroscopic and molecular PRESSIBILITY, GAS). All other properties are
structure data. 4- 1 These complex calculations calculated either from equations of state or
are based upon the contributions from all of from a correlation of the individual experi-
the energy states available to the molecule, such mental data points. In addition, the properties
as translational, electronic, vibrational, and ro- may be estimated from generalized correlations
tational. Contributions from excited electronic of the compression factor (Z = p VIR T). Here
513 GASES: THERMODYNAMIC PROPERTIES

and in the equations that follow, V denotes an an~ysis of isometric and isothermal p- V-T
molar volume in m 3 mol- I and R denotes the data. IS The slopes of the isometrics (aplaT)v
molar gas constant, in JK-I mol-I. are found by analytical, residual, and graphical
Experimental compressibility measurements techniques. The integrals as in Eq. (2) are in-
have been made by a variety of methods l1 ,12 tegrated graphically or numerically.
such as constant volume cells (Eucken and Extensive correlations of data have been made
Meyer), variable volume cells (Beattie and to develop methods for estimating the prop-
Douslin, Michels, and Sage and Lacey), expan- erties of gases.1 ,13 One method based on the
sion systems with variable sample mass (Bur- theory of corresponding states presents the
nett), and differential systems (Whytlaw-Gray). thermodynamic properties as a function of re-
The apparatus of Beattie and Douslin is used to duced temperature (Tc = TITe), reduced pres-
measure both isometrics and isotherms up to sure (Pc = pIpe), and the compressibility factor,
350°C and 400 atmospheres. The Michels appa- Zc. The subscript c refers to the critical state.
ratus may be used up to 3000 atmospheres but Gas Mixtures The thermodynamic properties
is limited to a temperature of 150°C. Sage and of a mixture of gases may be calculated, but
Lacey have made extensive measurements on the procedures are only approximate unless
both gas and liquid systems up to 240°C and compressibility data are available for the par-
670 atmospheres. ticular mixture. 11 - 13 Since few data for mix-
Equations of state have been derived from tures are available, the properties must be
compressibility data to represent the behavior estimated from: (a) the equations of state of
of a gas over wide ranges of temperature and the pure gases assuming either additive volumes
pressure.11 ,13-14 Numerous equations have or additive pressures, (b) an equation of state
been published but one of the most important for the mixture, or (c) generalized correlations
is the virial equation, of the compressibility factor based on pseudo-
reduced conditions. 1
pV=RT(l + BfV+ CfV2 + DIV3 + . . . )
ROLAND H. HARRISON
(1)
It is quite important because the parameters B, References
C, D, etc., are related to the interactions be-
tween molecules according to the intermolecu- 1. Hougen, O. A., Watson, K. M., and Ragatz, R. A.,
lar potential energy theory. Other equations "Chemical Process Principles, Part Two: Thermo-
having wide applications are those of Beattie dynamics," New York, John Wiley & Sons, Inc.,
and Bridgeman and of Benedict, Webb, and 1959.
Rubin (see GAS LAWS). 2. Guggenheim, E. A., "Thermodynamics," pp. 244-
The energy functions , entropy, and heat ca- 248, Amsterdam, North Holland Publishing Co. ,
pacity of a real gas are calculated as the sum of 1967.
the ideal gas properties and a correction for the 3. "Selected Values of Properties of Hydrocarbons
nonideality of the gas. The corrections for the and Related Compounds," Thermodynamics Re-
nonideality of the gas are called difference or search Center Hydrocarbon Project, Texas A&M
departure functions. For example, S-So is the University, College Station, Texas, 3444 loose-
entropy of the gas in the real state less that of leaf sheets extant October 31, 1981.
the gas in the standard state at the same tem- 4. Herzberg, G., "Molecular Spectra and Molecular
perature. Theoretical equations needed for the Structure. II. Infrared and Raman Spectra of Poly-
computation of the thermodynamic properties atomic Molecules," New York, Van Nostrand
have been derived in terms of pressure, volume, Reinhold Co., 1945.
temperature, and the first derivatives (a Vja T)p 5. Stull, D. R., Westrum, E. F., Jr., and Sinke, G. C.,
or (opjaT)v. I3 "The Chemical Thermodynamics of Organic Com-
For example : pounds," New York, John Wiley & Sons, Inc.,
1969.
6. Stull, D. R., and Prophet, H., "JANAF Thermo-
H-HO= fOG [p- T(apjaT)vl dV+pV-RT chemical Tables," National Standard Reference
Data Series, National Bureau of Standards (U.S.),
v
37,1141 pp. (June 1971), U.s. Government Print-
(2) ing Office, Catalog No. C 13.48:37, Washington.
7. Knox, J. H., "Molecular Thermodynamics: An In-
The heat capacity differences Cp - C~ and troduction to Statistical Mechanics for Chemists,"
C u - C~ are functions also of the second de- Revised Edition, New York, John Wiley & Sons,
rivatives (a 2 V/OT2)p or (a 2 plaT 2 )v depending Inc., 1978.
upon whether the equations are written in 8. Wilson, E. B., Jr., Decius, J. C., and Cross, P. C.,
terms of p and T, or V and T. The quantities "Molecular Vibrations-The Theory of Infrared
appearing in Eq. (2) are usually evaluated from and Raman Vibrational Spectra," New York,
equations of state. However, the most accurate McGraw-Hill Book Co., Inc., 1955. Reprinted by
properties are those which are calculated from Dover, New York, 1980.
GASES: THERMODYNAMIC PROPERTIES 514

9. McCullough, 1. P., Pennington, R. E., Smith, J. C., ries that many believe describe all known fun-
Hossenlopp, 1. A., and Waddington, G., ''Thermo- damental interactions except gravitation; the
dynamics of Cyc1opentane, Methy1cyclopentane, theories of this type are called gauge theories.
and 1,cis-3-Dimethy1cyclopentane: Verification They are characterized by a principle that is a
of the Concept of Pseudorotation," J. Arn. Chern. generalization of the gauge invariance principle
Soc. 81,5880 (1959). of electromagnetism, namely: Gauge theories
10. Scott, D. W., and McCullough, J. P., "The Chem- describe relativistic quantum fields that are in-
ical Thermodynamic Properties of Hydrocarbons variant under a set of local symmetry transfor-
and Related Substances," U.S. Bur. Mines Bull. mations, i.e., operations of a continuous group
595, 1961. (Lie group), whose elements are denoted by
11. Rowlinson, J. S., in Flugge, S. (Ed.), "Encyclo- parameters that are functions of space and
pedia of Physics," Vol. XII, pp. 1-72, Berlin, time. QED itself is a highly successful gauge
Springer-Verlag, 1958. theory, whose gauge group is Abelian, i.e., it
12. Douslin, D. R., Harrison, R. H., and Moore, R. T., has only mutually commuting elements.
"Pressure-Volume-Temperature Relations in the A non-Abelian generalization of QED is the
System Methane-Tetrafluoromethane. I. Gas Den- "standard" electroweak theory, which is a
sities and the Principle of Corresponding States," gauge theory that unifies the electromagnetic
J. Phys. Chern. 71,3477-3488 (1967). with the weak nuclear interactions that are re-
13. Beattie, 1. A., and Stockmayer, W. H., in Taylor, sponsible, e.g., for nuclear beta decay. Another
H. S., and Glasstone, S. (Eds.), "Treatise on Phys- non-Abelian gauge theory is quantum chromo-
ical Chemistry," Vol. 2, pp. 187-290, New York, dynamics (QeD), designed to explain the bind-
Van Nostrand Reinhold Co., 1951; Beattie, J. A., ing of unobserved elementary particles called
Chern. Rev. 44, 141-192 (1949). quarks into observed strongly interacting par-
14. Goodwin, R. D., and Haynes, W. M., "Thermo- ticles called hadrons (e.g., proton, neutron,
physical Properties of Propane from 85 to 700 K pion). It postulates the exchange between
at Pressures to 70 MPa," National Bureau of Stan- quarks of eight massless "color gluons," analo-
dards (U.S.), Monograph 170, 1982. gous to the ordinary photon of electromagne-
15. Harrison, R. H., and Douslin, D. R., "Derived tism, in order to account for the total confine-
Thermodynamic Properties of Ethylene," J. Chern. ment of quarks (whose nonintegral electric
Eng. Data 22(1),24-30 (1977). charges may not be observed) and to account
for the strong nuclear force. 2 Non-Abelian
Cross-references: COMPRESSIBILITY, GAS; GAS gauge theories have also been proposed that
LAWS; HEAT CAPACITY; KINETIC THEORY; unify all three of the above interactions (i.e., all
THERMODYNAMICS. fundamental interactions except gravity); the~
are called grand unification theories (GUTS).
QED is formulated in terms of a relativistic
GAUGE THEORIES four-vector field A.u (/1 = 0, 1, 2, 3), which is
the quantum generalization of the electromag-
The principle of gauge invariance in classical netic four-potential. The principle of gauge in-
electromagnetism expresses the postulate that variance states that A.u can be replaced by A.u +
electric and magnetic field strengths are mea- oAjox.u, where A is any (relativistic) scalar func-
surable, and thus have objective meaning, while tion of space and time, without affecting any of
the scalar and vector potentials (from which the the consequences of the theory. Among these
fields are usually derived) are neither measurable consequences are:
nor uniquely defined. For example, a static
electric field is the negative gradient of a scalar • The appearance of massless photons (at all
potential function whose value is undefined up frequencies, spanning the entire electro-
to a constant. Similarly, the magnetic field B is magnetic spectrum)
the curl of a vector potential, to which may be • A unique form of electromagnetic interac-
added any gradient of a scalar function without tion with charged particles
changing B, because the curl of a gradient van- • The conservation of electric charge.
ishes identically. The importance of gauge in-
variance was first pointed out by Hermann The last property follows from gauge invariance
Weyl in 1922. Its quantum mechanical signif- alone.
icance was emphasized by Fritz London in QED, proposed by P. A. M. Dirac in 1927,
1927, namely: Gauge invariance fixes the form was plagued for two decades by so-called di-
of the electromagnetic interaction and guaran- vergences, i.e., ~nfinite values predicted for
tees the atomicity of electric charge (without, physical quantities that are actually finite. In
however, fixing its value). Its application to rel- the late 1940s, it was found that this problem
ativistic field theories of elementary particles could be circumvented by an ingenious calcula-
was made by Wolfgang Pauli and others.l tional procedure called renormalization. Thus
Quantum electrodynamics (QED) is the quan- far, among quantum field theories applicable to
tum theory of the electromagnetic field. It is elementary particles, only gauge theories have
the prototype of important quantum field theo- been found to be renormalizable.
515 GEODESY

The first gauge theory other than QED was group that has eight generators, has eight kinds
formulated in 1954 by C. N. Yang and R. L. of "photons," called gluons. In all three cases
Mills. Among their motivations were: (QED, Yang-Mills, and QCD) we deal with an
"unbroken" gauge symmetry, characterized by
• To find the consequences of assuming a a vacuum state which is itself gauge symmetric.
law of conservation of isotopic spin (which It is possible, however, to have a nonsym-
was thought to be the strong-interaction metric vacuum state, leaving open the possibil-
analogue of electric charge) ity that the quanta of the field (the "photons")
• To have a principle that would select, from are massive. This occurs as a result of a phenom-
many possibilities, a unique form for the enon known as spontaneous symmetry break-
strong nuclear interaction. ing. An example of such a field is the electro-
weak field theory of Weinberg, Salam, and
The isotopic spin is a quantum-mechanical Glashow.
concept, having no classical analogue; it is rep-
resented by a vector operator whose compo- LAURIE M. BROWN
nents have the same commutation relations as
the quantum mechanical angular momentum, References
hence the term spin. These operator compo-
nents are the generators of a non-Abelian Lie 1. See Pauli, W., "Relativistic Field Theories of Ele-
group. In order to ensure that the strong- mentary Particles," Rev. Mod. Phys. 13, 203-32,
interaction Lagrangian be invariant under this (1941).
local isospin group, a four-vector field bit is 2. So far, only one experimental group has claimed to
seen to be necessary, having three components, observe free nonintegral charge.
and bearing the same relation to the isospin that 3. For history and further descriptive materials, see
the electromagnetic field bears to the electric Rosner, Jonathan L., Am. J. Phys. 48, 90-103
charge. The field bit (because of non-Abelian (1980),' and Yang, Chen Ning, Annals. of N. Y.
isospin gauge invariance) turns out to obey a Acad. of Sci. 294, 86-97 (1977).
nonlinear equation of motion.
In the case of electromagnetism, the interac- Cross-references: CONSERVATION LAWS AND
tion with an electric charge, described by a SYMMETRY, ELECTROWEAK THEORY, ELE-
wave function 1/1 (x), where x = (x, t), is ob- MENTARY PARTICLES, GRAND UNIFIED THE-
tained by replacing any gradient acting upon ORY, ISOSPIN, QUANTUM CHROMODYNAMICS,
1/1 (x), by the so-called covariant derivative DIl; QUANTUM ELECTRODYNAMICS, QUARKS.
that is,
all """* DIt = all - ie All,
GEODESY
where all == a/axil and All is the electromag- Geodesy comprises the determination of the
netic four-potential. It follows from quantum earth's external form and gravitational field,
mechanics that an arbitrary phase is allowed the and the location of points with respect to earth-
complex wave function 1/1 (x). That is, the fixed reference systems. The earth's external
theory is invariant under the replacement form is customarily defined by the geoid: the
1/1 (x) """* eiCi(x) 1/1 (x). equipotential of the earth's gravity field which
most closely approximates the mean sea level.
To insure invariance under this transformation, The geoid is irregular in form, so that the
terms in the Lagrangian must contain deriva- mathematical representation thereof is neces-
tives in the form DIl, not all, for only then: sarily an approximation. The most important
approximation is an oblate ellipsoid of revo-
DIl {eiCi(x) 1/1 (x)} =eiCi (x)D'1l1/l(x) lution, which is conventionally defined by its
equatorial radius, a, and the flattening,f, equal
where to (a - b)/a, where b is the polar semidiameter_
D'1l = all + iolla(x) - ie AIl(x). Location is conventionally expressed in coor-
dinates referred to such an ellipsoid, in terms of
D'1l is equivalent to DIl because of the gauge in- the latitude I/J, the angle between the normal to
variance of All (x), which allows the addition of the ellipsoid and the equator; the longitude A
the gradient of an arbitrary scalar function, i.e., from the reference meridian, Greenwich; and
olla(x). altitude h above or below the ellipsoid.
Like the All of QED, the isotopic spin invari- If the ellipsoid is considered to be rotating
ant bll field of Yang and Mills, whose gauge with rate w, and to be an equipotential for the
group is SU(2), consists of massless "photons." combined effects (called gravity) of centrifugal
However, since SU(2) has three generators, and gravitational acceleration, additional param-
while the Abelian group of electromagnetism eters customarily required are 'Ye, the accelera-
has only one, there are three kinds of "pho- tion of gravity at the equator; and m, the ratio
tons." Similarly QCD, based upon the SU(3) of the centrifugal acceleration at the equator,
GEODESY 516

w 2 a, to 'Ye. 'Ye and m are connected to the traverse-a series of measured distances at mea-
total mass M contained in the ellipsoid by: sured angles with respect to each other; or by
trilateration-a system of overlapping triangles
GM=a 2'Ye[I-I+3m/2- ISml/14 with all sides measured. Much of the land area
of the world is covered by triangulation, which
+ 0([3)] (1) gives the difference in latitude and longitude
where G is the constant of GRAVITATION between points in the same network with a
(6.67 X IO- B cm 3 g- 1 sec- 2 ). The customary for- relative error of about 10-5 •
mula for the acceleration of gravity 'Y at geo- (2) Vertical control comprises the determina-
detic latitude q, is: tion of heights, which is performed separately
from horizontal control because of irregularities
'Y = 'Ye [1 + (Sm/2 - 1- 17ml/ 14) sin 2 q, in atmospheric refraction. The most accurate
method, leveling, measures successive differ-
+ ([2/8 - Sml/8) sin 2 2q, + 0([3)] (2) ences of elevation on vertical staffs by horizon-
tal lines of sight taken at intermediate points
The customary formula · for the gravitational over short distances (less than 150 meters) bal-
potential external to the ellipsoid is: anced so as to minimize differential refraction

V=G;[I-J2 (;-Y P2(sinq,)


effect. The datum to which vertical control
refers is mean sea level as determined by tide
gages. The accuracy is such that the error in

-J4 (;-f P4 (sinq,)- 0([3)] (3)


difference of elevation between points on the
same principal network should be a few tens
of centimeters or less. Current research empha-
sis in horizontal and vertical control is on the
In Eq. (3), V is written as positive, which is the determination of temporal changes in areas of
convention of astronomy and geodesy, contrary earthquake potential.
to that of physics. In Eq. (3), P2 and P4 are (3) Geodetic astronomy comprises the deter-
Legendre polynomials, and mination of the direction of the gravity vector
J 2 = 2/(1 - 1/2)/3 - m(1 - 3m/2 - 2[17)/3 and the direction of the north pole at a point
on the ground. Astronomic longitude is the
+ 0([3) (4) angle between the meridian of the gravity
vector and the Greenwich meridian and is
while J 4 is usually taken as a quantity deter- determined by measuring the time of inter-
mined observationally from satellite orbits. section of a line of sight by a star. Astronomic
The discrepancies in location of the actual latitUde is the angle between the gravity vector
geoid and a well-fitting ellipsoid are nearly al- and the equatorial plane, and is determined by
ways 10- 5 or less of the radius vector, while measuring the maximum altitude attained by a
the discrepancies in intensity of the gravitational star. In these types of astronomic observation,
acceleration from that of the standard ellipsoid several stars are normally observed which are
are nearly always 10-4 or less of the total in- selected so as to minimize error due to atmo-
tensity. The mathematical representation of spheric refraction. Astronomic azimuth is de-
these discrepancies may either be in the form termined by the measurement of the horizontal
of spherical harmonic coefficients or in the angle between a target and Polaris or other
form of mean values for areas; the former being reference star.
preferable for effects on satellites in orbit, and (4) Gravimetry comprises the determination
the latter for use of terrestrial data. The poten- of intensity of gravitational acceleration. Most
tial theory dealing with the relationship between gravimetric observations are made differentially,
variations in the location of the geoid and the by determining the change, with change in loca-
intensity of gravitational acceleration is known tion, of the tension on a spring supporting a
in geodesy as Stokes' theorem. constant mass. These measurements are connec-
There are five principal categories of measure- ted through a system of reference stations to a
ment in geodesy. few laboratory determinations of absolute ac-
(1) Horizontal control comprises the deter- celeration of gravity. The relative accuracy of
mination of the horizontal components of posi- gravimetry is about ±.OOI cm/sec 2 on land and
tion-latitude and longitude-starting from fixed ±.OOS cm/sec2 at sea. The principal difficulty in
values for a certain point. It includes measure- its geodetic application is irregular distribution
ment of distances over the ground by metal of observations.
tapes or by pulsing or modulating radio or light (5) Satellite tracking comprises the deter-
signals, and measurement of angles about a mination of the directions, ranges, or range
vertical axis by theodolites. Over the land, the rates of earth satellites (including the moon)
relative horizontal position of points is obtained from ground stations or other satellites, plus
either by triangulation-a system of overlapping altitudes above the sea surface. These observa-
triangles with nearly all angles measured, but tions will be affected both by errors in posi-
only occasional distances measured; or by tions of the station with respect to the earth's
517 GEOPHYSICS

TABLE l. GEODETIC PARAMETERS. actions between the earth and extraterrestrial


forces and phenomena. It consists of a number
Current Estimate and of interlocking sciences dealing with physical
Parameter Standard Deviation properties, its interior, its atmosphere, oceans,
glaciated areas, its age, motions and paroxysms,
Mean sidereal and the practical applications of the acquired
rotation rate, w 0.7292115085 X 10- 4 sec- 1 knowledge. All of these sciences use the meth-
Gravitational con- ods of physics for measurements and analysis.
stant X mass, GM 39860047 (±5) X 10 7 m3 /sec 2 From observational material, often of an in-
Equatorial radius, a 6378137 ± 2 meters direct nature, attempts are made to derive
Oblateness, lz 108263 (±0.5) X 10-8 abstract models of states and processes through
advanced mathematical concepts, but in many
cases through statistical relations.
Geophysics is an ancient science. In its early
center of mass and by perturbations of the stages it was developed by the Greeks who
orbit by the earth's gravitational field; hence, attempted to determine the shape and size of
in conjunction with a dynamical integration the earth (Eratosthenes, 275-194 B.C.). Among
of the orbit, they are used to determine the its most illustrious contributors have been
position of tracking stations and the variations Galileo Galilei (1564-1642), Sir Isaac Newton
of the gravitational field. Radar altimetry from (1642-1727) who dealt with the motions of
satellites is accurate to ± 10 cm, and hence af- the earth and its gravitational field, Alexander
fected by oceanic temperature, salinity, and von Humboldt (1769-1859) and Karl Friedrich
dynamics, as well as geoidal height and orbital Gauss (1777-1855) who observed and devel-
perturbation. Satellites can also be used as oped theories of the geomagnetic field; and
elevated targets by simultaneous observations Vilhelm Bjerknes (1862-1951) who laid the
from several ground stations. foundation for the hydrodynamic theories of
The principal practical application of geodesy the atmosphere and the oceans. During the
is to provide a distribution of accurately meas- current century its roster of distinguished
ured points to which to refer mapping, naviga- scientists includes: L. Vegard (polar aurora),
tion aids, engineering' surveys, geophysical sur- Sidney Chapman (aeronomy), Sir Harold Jef-
veys, etc. The principal scientific interests in freys and F. A. Vening-Meinesz (structure of
geodesy are the indications of the earth's the earth); Emil Wiechert, Boris Galitzin,
internal structure from the variations of the Beno Gutenberg and J. B. Macelwane (seis-
gravity field and surface motions. mology); H. U. Sverdrup (oceanography); C. G.
Numerical values of the leading geodetic pa- Rossby and T. Bergeron (meteorology); A.
rameters are given in Table 1. Wegner, H. Hess and J. T. Wilson (tectono-
WILLIAM M. KAULA physics).
Geophysics is the outstanding example of
organized international scientific cooperation.
References In the 19th Century the International Polar
Bomford, G., "Geodesy," Fourth Edition, London, Year (1882) served coordinated observations
Oxford Univ. Press, 1980. of the geophysical mysteries of the polar re-
Moritz, H., "Geodetic Reference System 1980," Bu/[. gions. Major milestone's in the development
Geod., 54,395-405 (1980). of the science were the International Geophys-
Garland, G. 0., "Introduction to Geophysics," Second ical Year (IGY) and the period of International
Edition, Saunders, 1979. Geophysical Cooperation (1957-1959), when
Heiskanen, W. A., and Moritz, H., "Physical Geodesy," 8000 scientists from 66 nations tackled a host
San Francisco, W. H. Freeman, 1967. of physical mysteries of our planet. The IGY
Committee on Geodesy, National Research Council, started man's most spectacular ventures to
"Geodesy: Trends and Prospects," Washington, D.C., date: The launching of artificial earth satellites
National Academy of Science, 1978. and other space probes. It started the conquest
Kaula, W. M., "Theory of Satellite Geodesy," Waltham, of the icy wastes of the Antarctic continent
Mass., Blaisdell, 1966. where many geophysical secrets are locked, and
through the Antarctic Treaty created the only
Cross-references: ASTROMETRY, ASTRONAUTICS, weapon-free area in the world solely devoted to
GEOPHYSICS, GRA VIT ATlON, POTENTIAL, RO- scientific research. The IGY took place in an
TATION-CURVILINEAR MOTION. interval of high solar activity. It was followed
by the year of the quiet sun (IQSY, 1964-65)
to cover the range of solar influences on the
earth (see SOLAR-TERRESTRIAL PHYSICS PRO-
GEOPHYSICS GRAM). The Global Atmospheric Research
Program (GARP) started with a number of
Geophysics is the physics of the earth and the observational experiments covering monsoons,
space immediately surrounding it and the inter- tropical circulations, and a coordinated complete
GEOPHYSICS 518

global coverage of weather phenomena (FIGI, geoid is 35.7 cm, the largest depression 17.8 cm,
1980). for the lunar effect; the total solar tide can
A brief survey of various subfields of geo- reach 24.6 cm. The combined total at new and
physics follows. full moon is 78.1 cm.
Geochronology is study of the age of the earth Geomagnetism deals with one of the most
and its various geological formations. Inade- important physical characteristics of the earth.
quate earlier methods of sedimentology have Its magnetic field can be represented by a cen-
been replaced by the use of radioactive decay tered dipole, which at present is inclined at
constants and isotope ratios. This has made it an angle of about 11 0 to the axis of rotation.
possible to date approximately all major geo- Its field strength at the earth's surface is about
logical eras. For the oldest rocks, the decay 0.3 gauss at the equator and 0.6 gauss near the
of 238 U to 206Pb has led to ages of around poles. There are small slow secular changes both
3 X 10 9 years. in ionosphere direction and strength of the
Geodesy deals with the size and shape of the field. About ~ of the field strength, designated
earth and its gravitational field. Because of the as the main field, is assumed to be caused by
rotation of the earth, its lack of absolute rigid- internal forces, principally by differential rota-
ity, crustal mass distribution, and tidal forces, tion between the earth's core and its mantle.
the shape is not quite spherical. The largest About ~ of the field is externally produced by
deviation from sphericity is the flattening at a ring current in the ionosphere. That part is
the poles. Historically this polar flattening often disturbed by electron and proton invasions
has been determined by surface measurements from solar eruptions. Major flares cause magnetic
and lunar observations, but since 1957 the most storms lasting from a few minutes to several
accurate determinations have been made by hours. In the polar zones these are often ac-
observations of the motions of artificial satel- companied by auroras. In those regions field
lites. The current estimate of the flattening is disturbances of 0.035 gauss have been observed.
1 : 298.257. The sea-level surface of the earth, Small, short disturbances of the field are com-
which is irregular, is referred to as the geoid, mon. The earth's magnetic field, the magneto-
which represents an equipotential surface. The sphere, extends far beyond the atmosphere,
undulations of the geoid have been determined especially on the side away from the sun, where
by surface gravity measurements and by utiliza- the distance of the magnetopause is about
tion of data from artificial satellites. The value 10 earth radii. The force of the magnetic field
of gravity depends mainly on latitude because in the space surrounding the earth has created
of the flattening and the variation of centrifugal two zones where solar protons and electrons
force from pole to equator. The normal value are trapped, the van Allen belts.
of gravity at the earth's surface, in centimeters The magnetic field distribution at the earth's
per second per second is represented by surface has been mapped, showing not only the
total intensity but also inclination and declina-
'Y = 978.0327(1 + 0.0053024 sin 2 c.p tion. The latter is of importance for direction
finding with magnetic compasses. The elements
- 0.0000058 sin 2 2c.p), of the earth's magnetic field undergo slow
where c.p is latitude. This formula is based on the secular changes and in geologic time intervals
Geodetic Reference System 1980, representing even reversals of polarity of the field have
geodetic constants officially approved by the been observed. Paleomagnetisms, determined
International Union of Geodesy and Geophysics from the remanent magnetization from the
in 1979. time of formation of iron-bearing rocks, not
Gravity measurements have shown that, in only show rotations of the field but divergent
spite of large mass differences at the surface, motions of the continents.
the earth is nearly in isostatic equilibrium. Seismology deals with study of earthquakes.
Various crustal blocks act as if they floated in Most of them can be attributed to breaks in the
a dense subcrustal material. The undulations earth's crust. A few may be subcrustal and
of the geoid do not exceed 100 meters. Ap- caused by phase transformations. Some minor
proximate dimensional figures for the earth ones are associated with collapse of cavities and
are: surface area 510X 10 6 km 2 ; volume volcanic eruptions. All of them are characterized
1.083 X 10 9 km 3 ; average density 5.516g/cm 3 ; by the sudden release of elastic waves. These
mass 5.974 X 10 27 grams; equatorial radius are longitudinal, transverse, Rayleigh or Love
6,378.137 km. (surface) waves. They are recorded at seismo-
The deformations of the solid earth by tidal graphic stations, now often consisting of arrays
forces are an additional consideration. The of digitally recording sensors responding to
twice daily occurring tides are observed by various frequencies. Through analysis of the
deflections of the vertical or variations of travel times of various wave groups from the
gravity. For the lunar tides the variations point of origin and their amplitudes, the hypo-
amount to 0.165 milligal, for the sun to 0.076 center and epicenter of an earthquake as well
milligal (one milligal equals 10 micrometers as its magnitude can be determined. Like other
per sec per sec). The maximal elevation of the waves, earthquake waves can be reflected and
519 GEOPHYSICS

refracted upon entering a different medium. about 1300 volcanoes have been active with
The internal constitution of the earth has been over 5500 dated eruptions. Duration of eruptive
derived primarily from seismic evidence. This activity and amounts of ejecta vary widely. In
reveals an upper and a lower crust, an upper historical times the 7-12 April 1815 eruption
and lower mantle, an outer and an inner core. of Tambora (Indonesia) was the greatest, with
The boundary between crust and mantle was an estimate of 100 km 3 of tephra ejected. The
first discovered by A. Mohorovicic in 1909. 26-27 August 1883 eruption of Krakatoa
Its depth varies from about 5 km in some (Indonesia) which killed 36,000 persons, threw
oceanic areas to 60 km in some continental an estimated 10 7 tons of dust into the air. This
regions. Seismic wave analysis also permits caused spectacular sunsets. The dust veil stayed
some statements on the strength of materials in the stratosphere for several years. Some
in the earth. The crust and upper mantle are hypotheses of climatic fluctuations have been
rigid (lithosphere) with a more plastic layer based on such veils. The dust from the much
below (asthenosphere). Present interpretations smaller eruption of Mt. St. Helens (Oregon) on
assume that the inner core (1370 km thick) is May 18, 1980, after 123 years of dormancy,
solid and the outer core (2100 km thick) is caused observable reduction in local maximum
liquid. The scattering properties of various temperatures. Prediction of volcanic eruptions
rock types in the crust reveal many inhomoge- remains an uncertain art. It is based on pre-
neities and seismic procedures, and are widely cursory seismic activity, gaseous exhalations,
used in geophysical prospecting for mineral and lava movements in or near the craters,
resources. Areas of present or ancient volcanic activity
Careful records kept during the 20th Century, show abnormal geothermal gradients and are
together with historical records, have yielded a in some areas being tapped for electric energy
fair picture of magnitude and distribution in production.
time and space of earthquakes. Such seismicity Meteorology and aeronomy are concerned
studies permit risk estimates and hazard mitiga- with the physical state and the motions of the
tion. The effort devoted to earthquake predic- atmosphere, which is divided into a number
tion has not yielded much in the way of progress of layers. The lowest is the troposphere with
toward that elusive goal. The principal ap- an average thickness of 7 to 8 km in polar
proaches are geodetic observations of ground regions and 13 km in the equatorial zone. Tem-
motions, interpretation of foreshocks, changes peratures decrease to an interface, called the
in wells, radon exhalations, and even animal tropopause, where polar temperatures are about
behavior. Earthquakes under the oceans often 218 K, at the equator about 193 K. The next
result in seismic sea waves (tsunamis). A warn- higher layer is the stratosphere, where tem-
ing system for endangered coastal areas has peratures stay nearly isothermal in the first few
been quite successful because of the slow travel kilometers and then increase again above about
times of these waves. 20 km to a maximum of 270 K at 50 km where,
Geodynamics deals with the slow motions of above the stratopause, the mesosphere begins.
crustal material. In 1912 Alfred Wegener pre- The warming in the stratosphere is produced
sented a hypothesis of continental drift. This by absorption of the short wavelengths of solar
was fiercely contested until the 1960s. By then radiation by photochemically produced ozone
suboceanic cores had shown ocean-floor spread- (0 3 ). The anthropogenic threat to the ozone
ing and satellite geodetic observations yielded layer by oxides of nitrogen and halomethanes
drifts of continental plates of several centimeters has become a major environmental concern. In
per year. This has led to the plate tectonics the mesosphere temperatures again decrease to
model which postulates the motion of litho- 180 K at about 80 km, where notable dissocia-
spheric and upper mantle layers (100 km thick) tion of the permanent constituents of the
over the less viscous asthenosphere, driven by atmosphere occurs. This is the region of rare
convective subcrustal thermal circulations. noctilucent clouds; above this region lies the
There is evidence for different and separate thermosphere. Here by vigorous interaction
convective systems under the continents and with the solar wind and cosmic rays many of
oceans. At the edges of the various plates form the atmospheric constituents are in excited
mountain systems, volcanic arcs, and earth- atomic states and ionized. There are several
quake zones. Extrusions of material from lower temporarily or permanently ionized layers in
layers take place and some of the plates seem at the thermosphere, often designated as iono-
their edges to submerge under others. Although sphere: The D-layer below 90 km, the E-layer
problems and contradictions still remain with between 90 and 180 km, and the F-layer above
this model much research in this area promises 160 km. The ionization undergoes a diurnal
clarifying discoveries. variation and large changes in response to the
Volcanology deals with some of the most solar activity rhythms.
spectacular phenomena in geophysics. They Most manifestations of weather take place
include explosive eruptions, lava flows, gaseous in the troposphere. They are governed by the
exhalations, magma intrusions, geysers, and hot general atmospheric circulation, which is stimu-
springs. Since the last ice age it is estimated that lated by the differential heating between tropi-
GEOPHYSICS 520

cal and polar zones. The resulting motions in It can be readily applied also for mapping of
the air are subject to the laws of fluid dynamics off-shore stratigraphy. Radioactivity and geo-
on a rotating sphere with friction. They are chemical procedures are auxiliary methods,
characterized by turbulence of varying time often used in boreholes. Distortions of arti-
and space scale. Evaporation of water from the ficially created electric fields in the ground can
ocean and its transformation through the vapor indicate the presence of shallow ore bodies but
state to droplets and ice crystals are important are now more extensively employed for ground-
symptoms of the weather producing forces (see water exploration.
METEOROLOGY).
Hydrology studies the water cycle on the H. E. LANDSBERG
earth in detail. It includes the runoff from pre-
cipitation, the surface courses of water and
their floods, the deposited forms of water as References
snow and ice, and the return of water to storage
underground or the ocean. The study of ice Academic Press, Advances in Geophysics, Vols. 1-25,
caps, sea ice, and glaciers (cryosphere) is an (H . E. Landsberg, H. E. Landsberg and J. van Mieg-
important phase of this field and contributes hem, B. Saltzman, succeeding editors), New York,
to the understanding of the earth's heat balance. 1952-1983.
Stream flow has been successfully mathemati- Aki, K., and Richards, R. G., "Quantitative Seis-
cally modelled but flash floods still present mology," 2 Vols., W. H. Freeman and Co., San
a major problem. Francisco, 1980.
Oceanography has a broad overlap with both American Geophysical Union, Reviews of Geophysics
meteorology and hydrology. It includes the and Space Physics, Vols. 1-22, Washington, D.C.,
study of wind-driven waves and currents, and 1963-1983.
the storage of heat and its release to the atmo- Bird, J. M. (Ed.), "Plate Tectonics," American Geo-
sphere. Ocean currents are an important mecha- physical Union, Washington, D.C., 1980.
nism for the equalization of the temperature Chovitz, B. H., "Modern Geodetic Earth Reference
differences between low and high latitudes. Models," EOS 62, 65-67 (1981).
Near some coast lines, wind-induced upwelling Dziewonski, A. M.; and Boschi, E., "Physics of the
of highly nutrient deep ocean water is of major Earth's Interior," Elsevier/North Holland Pub-
importance for the fishing industry. Tempera- lishers, New York, 1980.
ture and salinity differences are additional Eather, R., "Majestic Lights," American Geophysical
causes for the dynamics of the ocean but most Union, Washington, D.C., 1980.
exchanges take place in a shallow mixed surficial Houghton, J. T., "The Physics of Atmospheres,"
layer, only about 100 meters thick, but oc- Cambridge Univ. Press, New York, 1977.
casionally reaching to 500 meters depth. Below Jacobs, J. A., "The Earth's Core," Academic Press,
that is a thermocline with high density cold New York, 1975.
water to the bottom. Exchange time of this L'vovich, M. I. "World Water Resources and their
abyssal water with higher layers is measured Future," (English Translation, R. L. Nace, Ed.),
in centuries. American Geophysical Union, Washington, D.C.,
The tidal motions and their dependence upon 1979.
configuration of ocean basins and coastal lines McElhinny, M. W., "The Earth," Academic Press, New
were among the earliest geophysical phenomena York, 1979.
observed and analyzed by man. They were also Melchior, P., "The Tides of the Planet Earth," Per-
the first to be predicted by computer. These gamon Press, Oxford, 1978.
predictions are now quite precise in timing and Ratcliffe, J. A., "An Introduction to the Ionosphere
amplitude and are of great importance for ships and Magnetosphere," Cambridge Univ. Press, New
with ever increasing draft. York, 1972.
Exploration geophysics uses physical methods Simkin, T., Siebert, L., McQennand, L., Bridge, D.,
derived from the study of earthquakes and Newhall, C., and Latter, J. H., "Volcanoes of the
magnetic and gravitational fields. These, using World," Hutchinson Ross Publishing Co., Strouds-
highly sensitive instruments, permit the dis- burg, PA 1981.
covery and mapping of the structure and in- Simpson, D. W., and Richards, P. G. (Eds.), "Earth-
homogeneities in the upper geological layers. quake Prediction," Maurice Ewing Series, Vol. 4,
These result in local anomalies and permit American Geophysical Union, Washington, D.C.,
inferences about potential mineral resources. 1981.
Magnetic anomalies, often associated with ore Telford, W. M., Geldart, L. P., Sheriff, R. E., and
bodies, can be rapidly mapped by sensitive Keyes, D. A., "Applied Geophysics," Cambridge
airborne magnetometers. Seismic techniques, Univ. Press, New York, 1979.
using small explosions and a multiple array of Thurman, H. V., "Introduction to Oceanography,"
sensors coupled to a computer use reflection 2nd Ed., Charles Merrill Publishing Co., Columbus,
and refraction of wave energy to map geological OH,1981.
structures three-dimensionally. This technique Torge, W., "Geodesy," Walter de Gruyter, New York,
is extensively used for petroleum exploration. 1980.
521 GRAND UNIFIED THEORIES

Cross-references: FLUID
DYNAMICS, GEODESY, non-strongly interacting particles or leptons
GRAVITATION, INTERNATIONAL SOLAR-TER- (electron, muon, tau, and neutrinos) as funda-
RESTRIAL PHYSICS PROGRAM , MAGNETISM, mentally similar.
METEOROLOGY, PLANETARY ATMOSPHERES, Just as the electro weak unification pre-
SEISMOLOGY, VOLCANOLOGY. dicted the existence of the Z boson, grand uni-
fied theories predict the existence of new, ex-
tremely massive vector bosons. In most GUTs,
GRAND UNIFIED THEORIES these new bosons can mediate the decay of the
proton into lighter particles, such as a positron
Introduction Much of our progress in under- and neutron pion , as shown in Fig. 2. This is
standing the basic interactions of matter has a remarkable prediction because it has generally
involved the development of unified theories, been believed that the proton is absolutely
in which two or more apparently unrelated stable. It is known experimentally that if the
forces are realized to be different manifestations proton does decay, its average lifetime must be
of a more fundamental underlying force. The longer than 10 30 yr, which corresponds to fewer
first modern example was the unification of the than one decay per ton of matter per year! The
electric and magnetic forces into the electro- strength of the new interactions can be pre-
magnetic interaction by Maxwell and others in dicted from the observed strength of the strong
the 19th Century. Maxwell's equations not and electroweak interactions. Within the sim-
only reveal the intimate relation of the electric plest grand unified theories, the proton life-
and magnetic fields, but also predict the exis- time is predicted to be 7p '" 10 29 - 2 yr, pre-
tence of electromagnetic waves that can be cisely the range that may be observable in new,
emitted or absorbed by charged particles. We dedicated experiments!
now know that the quanta of these waves are
massless photons (-y) carrying an internal spin
of one unit of Planck's constant It . They are
therefore referred to as spin -lor vector (or
gauge) bosons.
The next successful unification combined the
electromagnetic force not with gravity (as had
been unsuccessfully attempted by Einstein
during much of his life) but rather with the )

weak force responsible for {3 decay. This electro- (0 ) (b)


weak theory of Glashow, Weinberg, and Salam
(GWS), which reached its fruition in the early
1970s, combined the intermediate vector boson
model of the weak interactions (Fig . la , b), in
which the weak interactions are mediated by
the exchange of massive electrically- charged ) )
vector boscllls W+ and W-, with quantum elec- ) )

trodynamics, in which the electromagnetic


force is mediated by the exchange of photons (c) (d)
(see Fig. lc). The result was a mathematically
and physically consistent quantum field theory .

: fG~ :
)
Furthermore, the electroweak unification pre- )
dicted the existence of a new electrically- uR
neutral vector boson (the Z). The subsequent
discovery of the weak neutral current interac- dG
)
tion, mediated by the Z (Fig. I d) along with )
the discovery of the charm quark (needed to
correctly describe strongly interacting particles
in the model) provided a dramatic verification (e)
of the electro weak unification. FIG.!. (a) Beta decay n ~ pe-v. (b) vN ..... e-X,
Grand unified theories (GUTs), which are so where N is a nucleon (proton or neutron) and X rep-
far untested, go one step further and unify the resents any strongly interacting final state (e.g., X =
electro weak interactions with the nuclear or N, N1T, N1T1T). (a) and (b) occur via the exchange of
strong force. (The strong interactions are thought charged weak bosons Wi . (c) The electromagnetic
to be due to the exchange of massless vector process e-N ~ e-X. (d) The weak neutral current
gfuons (G) between quarks, which are believed process vN ..... vX. In (a)-(d), the quark color quantum
to be the constituents of the proton, neutron, number is not changed, but the flavor (u, d, etc.) is
pion, and other strongly interacting particles. changed in charged current processes. (e) A strong
See Fig. Ie). In addition to unifying three of interaction between quarks via the exchange of a
the four known interactions, GUTs have the at- gluon. Quark color (but not flavor) is changed in the
tractive feature of regarding the quarks and the transition.
GRAND UNIFIED THEORIES 522

l- X
e+
j .

p{ :: Ix:
u >
U
u }M" .)2.
(0 ) (b)

FIG. 3. (a) Vertex for fermion k to be transformed


d~ .. ~ .. ~+ into fermion j upon emission or absorption of boson i.
(b) Self-interactions between three or four bosons.
U~U
U ) u actions between the gauge bosons are also pre-
scribed by the gauge symmetry (Fig. 3b).
FIG. 2. Typical diagrams in which proton decay is For example, the GWS electroweak theory is
mediated by the exchange of superheavy bosons carry- based on the gauge group SU2 ® U 1 . TheSU 2
ing both flavor and color quantum numbers. group implies three gauge bosons W+, W-, and
Wo, with gauge coupling g, while the U 1 group
involves one neutral boson B with coupling g'.
The same new interactions that lead to the The charged bosons W+ and W-, with masses
instability of the proton may explain the long- mw± ~ 83 GeV/c 2 , mediate the ordinary
standing puzzle of the apparent excess of mat- charged-current weak interactions, the neutral
ter over antimatter in the universe. Instead of boson Z = WO cos Ow - B sin Ow, with mz ~
having to postulate this excess as an asymmetric 94 GeV/c 2 , is exchanged in neutral current
initial condition on the big bang, it is possible processes, and the massless photon 'Y = WO sin
that it was generated dynamically by the new Ow + B cos 0 w is responsible for electromagne-
interactions in the first 10- 35 seconds after the tism. The weak angle is Ow == tan- 1 g'/g and the
big bang when the universe was incredibly hot electric charge is e = g sin 0 w. Parity noncon-
and dense! Of course, all matter will ultimately servation in the weak interactions is introduced
decay into a gas of electrons, positrons, neu- by assigning the left- and right-handed fermions
trinos, and photons in a time ~ 10 3o yr. if the to different representations of the SU2 ® U 1
universe does not collapse first! group. The left-handed up and down quarks
In addition to their theoretical elegance and (UL, dL) and the left-handed neutrino and
their fascinating predictions about the creation electron (VeL, eL) transform as doublets, which
and decay of matter, grand unified theories means that they are related by the symmetry
yield explanations of the relative electric charges operators and can be transformed into each
of quarks and leptons and of the value of the other upon the emission or absorption of a
weak interaction angle, and they predict the W+ or W-. (The color quantum number is un-
existence of superheavy magnetic monopoles. changed in these processes). The more massive
These as well as difficulties of GUTs are de- left-handed particles (CL, sL), (tL, bL>, (V/J.L,
tailed below. Particle physicists eagerly await IlL-) and (V'TL, TL-) also transform as doublets.
the confirmation or refutation of GUTs in pro- The right-handed fermions are unaffected by
ton decay experiments. SU2 transformations and do not couple to W±
The Standard Model There now exists a bosons (they do couple to the Z and 'Y).
renormalizable (Le. free from unmanageable The basic equations of gauge theories do not
divergences) field theory of the elementary allow mass terms for the gauge bosons. The W+,
particles and their interactions that is compatible W-, and Z masses are introduced by spon-
with all known experimental facts. In this taneous symmetry breaking. This can occur if
standard model the electroweak and strong the ground state contains a nonzero expecta-
interactions are gauge invariant, which means tion value of spin-O Higgs fields (essentially
that the forms of the equations of motion are a Bose condensation). The W± and Z bosons
left invariant under a group of transformations (but not the photon) couple to the Higgs field
of the fields into each other, which can be per- and develop a nonzero effective mass.
formed independently at different space-time The strong interactions are described by
points. Gauge invariance requires the existence quantum chromodynamics (QeD), which is a
of N vector or gauge bosons Ai, one for each of gauge theory based on the gauge group SU 3
the N generators of the symmetry group. The with ga.uge coupling gs. There are eight massless
amplitude for a fermion (lepton or quark) k to gauge bosons or gluons in QeD. Quarks are
emit or absorb boson i and be transformed transformed from one color state (R = red,
into fermion j is gLijk, where g is the strength G = green, B = blue) into another upon emission
parameter or coupling constant of the theory or absorption of gluons, with the flavor quan-
and Lijk is the jk element of the ith group gen- tum number (u, d, c, s, etc.) unchanged. The
erator (Fig. 3a). The forms of the self-inter- QeD couplings of left-and right-handed quarks
523 GRAND UNIFIED THEORIES

are equal (parity is conserved), and there are no


direct couplings of gluons to the SUa invariant
leptons, which therefore do not participate in
the strong interactions. N
Despite the many successes of the standard a
model, most physicists believe that it is too
arbitrary and complicated to be taken seriously
as the fundamental theory of elementary par-
ticles. For example, the standard model involves
the direct product of three different gauge
I
~ '~Q2
groups with three distinct coupling constants / M~ DESERT M.
(as = gs2/4rr co: 0.25, ag == g2/4rr co: 0.03, a = e 2/ PRESENT EXPERIMENTS UNIFICATION
4rr co: 0.007 when measured at laboratory ener-
gies). There is no fundamental explanation for FIG.4. Variation of the effective couplings a3 = as =
the pattern of spontaneous symmetry breaking, gi/41T, a2 = ag =g2/41T, and a1 = %a' =%g,2/41T with
for the breaking of parity in the weak but not momentum.
the strong interactions, for the existence of
repeated fermion families, or for the relation
between the electric charges of the quarks and (b) Because of vacuum polarization and other
leptons. There are many arbitrary parameters, higher order corrections, measured gauge coup-
such as coupling constants, fermion masses, lings actually depend logarithmically on the
and weak mixing angles (19 in all if the neu- typical squared momentum Q2 in an experi-
trinos are massless). Finally, gravity is not in- ment. The observed low energy couplings are
corporated in the theory. predicted to approach a common value (up to
Grand Unified Theories Grand unified the- known normalization factors) QG = gG 2/4rr at
ories constrain some of the arbitrary features of momenta Q2 ~ MX 2 for which spontaneous
the standard SU 3 ® SU2 ® U 1 model by em- symmetry breaking can be ignored, as in Fig.
bedding it in a simple (i.e., not a direct product) 4. From the observed ratio of as/a at low
group G with a single coupling constant gG. energy and the theoretical Q2 dependence, one
The simplest example is the SUs model of can predict: (i) Mx co: My co: 2.4 X 1014±0.S
Georgi and Glashow. SUs implies 24 gauge GeY/c 2, which is twelve orders of magnitude
bosons, of which 12 are those of the standard larger than Mw and Mz! (The threshold-free
model. The other 12 are supermassive (m > 10 14 region between Mw and Mx is referred to as
GeY/c 2 ) and carry both flavor and color quan- the desert.) (ii) The requirement that all three
tum numbers. They are the X and Y bosons, couplings meet at Mx yields the additional
with electric charges ~ e and! e, each of which prediction: sin 2 Ow =a/Qg=0.214±0.00S,
comes in three color states, and their anti- which is in remarkable agreement with the ex-
particles. Quarks, antiquarks, leptons, and anti- perimental value 0.21S ± 0.012 obtained from
leptons are related by SUs transformations and neutral current reactions. Predictions (i) and (ii)
are fundamentally similar. The left-handed hold for essentially all grand unified theories
fields (VeL, eL-, daL, a = R, G, B) form a five- with no new mass scales in the desert and no
dimensional representation S *, which means exotic fermions or weak interactions.
that they can be transformed into each other (c) The X and Y bosons can mediate proton
upon emission or absorption of gauge bosons. decay through diagrams such as those in Fig.
Similarly, the fields (eL+, ULa , dLa, and UaL) 2. The average proton lifetime in years in the
fdrm a 10-dimensional representation (the SUs model is predicted to be Tp ~ "AMx4J
right-handed fields are related by CPT). The (aG 2 m 5) ~ (0.8 - 13) X 10 29 (Mx/2.4 X 10 14
heavier fermion families are assigned to addi- GeY/c 2)4 ~ 3 X 1029±2 yr, where the coeffi-
tionalS* and 10-dimensional representations. cient A haS' been estimated using several phe-
Other grand unified theories are based on still nomenological models of the proton and meson
larger groups, such as SOlO or E 6 . They involve wave functions. This extremely long lifetime
additional supermassive bosons emitted in tran- is due to the very short range (Rx -1f/Mxc co:
sitions between the S* and 10, between dif- 10- 28 em) of the interaction. The predicted
ferent families, or between the ordinary and lifetime is comparable to current experimental
hypothetical new fermions. limits T p exp ~ 10 3o _10 31 yr and is within the
Implications of Grand Unification (a) There sensitivity of several dedicated experiments now
is basically only one interaction. The compli- under construction. Most other theories give
cated structure of interactions observed at low similar predictions for Tp, although models
energies is due to the pattern of spontaneous with supersymmetry or large mixing effects can
symmetry breaking (gauge boson masses). lead to longer lifetimes.
Similarly, the quarks (q) and leptons (1) are The dominant decay modes are p ~ e+ MO and
closell related. This explains the simple rela- p ~ vM+, where MO represents neutral mesons
tion 2' qu = -qd = -1 1
qe = e between quark (rrO , pO, 'T/, w, rr+ rr-, ... ) and M+ = rr+, p+, rr+
and lepton charges (charge quantization). rro, .... Bound neutrons can decay into vMo
GRAND UNIFIED THEORIES 524

and e+ M-. Different gauge groups (SUs, SOlO, and flatness problems. However, it is not clear
etc.) can be distinguished by their predictions that this expansion would terminate in an ac-
for the ratio of e+ MO to iiM+ deca~s. All models ceptable manner. Phase transitions and other
with a desert between Mw and 10 4 GeV/c 2 im- aspects of GUTs may also be of relevance for
ply the selection rule D.B = D.L, where Band L galaxy formation.
are baryon and lepton number. (B = + 1 or -1 for (e) Almo&t all GUTs (with the exception of
nucleons (p or n) or antinucleons, respectively. the SUs model) predict nonzero masses for the
L = +1 for leptons (e-, v) and -1 for antilep- neutrinos. Estimates are very model dependenti
tons.) Theories with lower mass scales (which typically ranging from mv = 10-9 eV/c 2 to 10+
are disfavored by the sin 2 Ow test) can predict eV/c 2 . Neutrino oscillations associated with
baryon number violating processes with dif- mass >10- 5 eV/c 2 could explain the missinf
ferent selection rules. For example, some solar neutrinos. Neutrinos in the 10-100 eV/c
models with particles with mass ~ 10 6 Ge V/ c 2 range could account for the missing mass in
predict that free neutrons can oscillate into anti- galactic clusters and would dominate the mass
neutrons in a time scale Tnn 2: 10 7 sec and that density of the universe.
two nucleons in a nucleus can annihilate in a Some of the simpler GUTs, such as the
time Tnuc 2: 10 30 years. simplest SUs model, correctly predict the ratio
(d) The baryon-number-violating interactions of the b quark and T lepton masses.
would have been very important in the first Outstanding Problems Despite their attrac-
10- 35 sec after the big bang when the tempera- tive features and successful predictions, grand
ture kT was comparable to Mx c2 • Relics of that unified theories have several serious problems
first instant may be observable today. and shortcomings. (a) Perhaps the most severe
For example, grand unified theories may ex- is the gauge hierarchy problem, which refers to
plain the baryon (matter-antimatter) asymmetry the incredibly tiny ratio (Mw/Mx)2 ~ 10-24 .
nB/n'Y ~ 1O- 10 ±1, njj/nB ~ 0, observed in our This quantity is not a natural feature of GUTs:
part of the universe, where nB, njj, and n'Y are in the usual models it must be put into the
the number densities of baryons (p, n), anti- theory by hand by fine tuning parameters.
baryons (p, n), and blackbody photons. If Theories with composite Higgs bosons and
baryon number were absolutely conserved and models with an approximate low energy sym-
there is no large scale separation of baryons and metry between bosons and fermions (super-
antibaryons in the universe (no plausible separa- symmetry) are promising approaches to this
tion mechanism has been found) then the net problem, but neither has been completely
baryon number would have to be postulated as successful.
an asymmetric initial condition on the big (b) Another problem is that GUTs have not
bang. Alternatively, nB - njj may be generated shed much light on the fermion mass spectrum
dynamically in GUTs. The ingredients needed and mixings, the reason for heavy fermion
are (i) B violation (this alone is sufficient to families, or the origin of CP violation.
erase or dilute any initial net baryon number), (c) Superheavy magnetic monopoles are a
(ii) nonequilibrium of the D.B =1= 0 interactions, serious problem. Much work remains to be
(otherwise nB = njj), and (iii) CP and C viola- done on the dynamics of phase transitions in
tion (to distinguish baryons from antibaryons). the early universe and on possible monopole
A possible specific mechanism is that super- suppression mechanisms.
heavy spin-O Higgs bosons and antibosons drop (d) Finally, gravity has so far resisted all
out of equilibrium at. kT :s !rfxc 2 • Their CP- attempts at unification. Supergravity theories,
violating decays into qq, ijij, iji, and qi can cre- in which spin-~ fermions are related to spin-2
ate a slight excess of quarks over antiquarks. gravitons, appear to be a promising direction.
These eventually form nucleons after most qij
pairs annihilate. Detailed estimates are diffi- PAUL LANGACKER
cult, but the ratio nB/n'Y ~ 10- 10 appears quite References
plausible. Introductory papers include:
Grand unified theories also predict the exis- 1. Georgi, H., and Glashow, S. L., Physics Today,
tence of superheavy (m ~ 10 16 GeV/c 2 ) mag- Sept., 1980, p. 30.
netic monopoles, which would have been pro- 2. Weinberg, S., ScientificAmerican, June 1981, p. 64.
duced prolifically in the big bang. The monopole
density depends sensitively on the nature and Proton decay is reviewed in:
sequence of phase transitions (in the ground 3. Goldhaber, M., Langacker, P., and Slansky, R.,
state spontaneous symmetry breaking pattern) Science 210,851 (1980).
in the early universe. Second-order transitions
probably imply many orders of magnitude Grand unified theories are reviewed in:
more monopoles than are allowed by limits on
4. Langacker, P.,Phys Reports 72, 185 (1981).
the total mass density of the present universe.
Strongly first-order transitions may sufficiently Cross-references: ELEMENTARY PARTICLES,
suppress monopole production and may lead to GAUGE THEORIES, QUANTUM CHROMODY-
a period of exponential expansion of the uni- NAMICS, ELECTROWEAK THEORY, ANTIPAR-
verse that could solve the cosmological horizon TICLES, COSMOLOGY, QUARKS.
525 GRAVITATION

GRAVITATION The success of the space program to date is an


additional tribute to Newton's genius. Other
Gravitation is the phenomenon characterized calculations study powered space flight in order
by the mutual attraction of any two phys~cal to examine possible orbits for exploration of
bodies. This universal character of the graVIta- the solar system. There is every reason to be-
tional force was first recognized by Sir Issac lieve that Newton's gravitational theory is suffi-
Newton who also gave its quantitative expres- ciently accurate for this purpose. Einst~in's
sion. For point masses or spherical bodies a modification of the theory, to be descrIbed
simple expression results: below, will probably have little effect on our
GM I M 2 space program for some time to come.
F=--!....--"- (1) The weight of a body of mass M on the earth
R2 is the force with which it is attracted to the
center of the earth. On the surface of the earth
In addition to the masses M I, M 2 of the two the weight is given by
bodies and their distance apart R, the force de-
pends only on a constant G = 6.670 X 10-8 W=Mg
dyne cm 2 which is independent of all proper-
ties of the particular bodies involved. The same
force law describes the motion of the planets where the acceleration due to gravity is ob-
around the sun, of the moon around the earth, tained from Eq. (1):
as well as the falling of an apple to the earth. A GME
body moving under an inverse square law as g = - - = 980 cm/sec 2 = 32 ft/sec 2
given in Eq. (1) satisfies the three laws estab- RE2
lished by Kepler for the motion of the planets
around the sun: All freely falling bodies near the surface of the
(I) The planets move in elliptical orbits with earth are accelerated at the same rate g. It is for
the sun at one focus (the general orbit is a conic this reason that Galileo found that both light
section) (Fig. I). and heavy objects take the same time to reach
(2) The radius vector sweeps out equal areas the ground when dropped from the Leaning
in equal times. Tower of Pisa.
(3) The square of the period of r~vo~ution. is An astronaut is said to be in a state of weight-
proportional to the cube of the semimaJor aXIS: lessness when he is in orbit. Strictly speaking,
a 3 = (2rr)-2GM 0 T2. Here M0 is the mass of he still has weight, for the earth's gravity .still
acts on him. Otherwise he would fly off mto
the sun and T is the period of the planet. These outer space. However, when in free fall, the
results together with a detailed analysis of local effects of the gravitational field are elimi-
anomalies in the motion of the moon estab- nated for the astronaut. Objects which are re-
lished the correctness of the Newtonian theory leased fall together with him and hence remain
of gravitation (see KEPLER'S LAWS). in his vicinity, unlike the situation on the
Recently, careful calculatio!ls have bee':l .c~r­ ground. Therefore, the organs of the body re-
ried out to determine the orbIts of the artIfICIal spond as though the gravitational field were
satellites which have been launched by the absent and this gives the sensation of weight-
United States and the Soviet Union. These have lessness. Conversely, we sense the earth's gravity
required modifications in the force law Eq. (l) and feel weight because we are supported by
to take into account the deviation of the earth's the earth's surface.
figure from a sphere and the anistropy. of the Gravitational Field According to Newtonian
earth's density as well as the atmosphenc drag. theory, the sun exerts the gravitat~onal fo!ce
directly on the earth without an mtervemng
medium for transmitting that force. The be-
havior of such forces is called "action at a dis-
tance." To overcome the conceptual difficulty
of a force acting directly over large distances,
one assumes that a gravitational field fills all
space. The force acting on any mass is deter-
mined by the gravitational field in its neighbor-
hood. Thus at the point P a distance R from
the center bf the earth, the gravitational field
has the magnitude

FIG. 1. An elliptical orbit for a planet around the


sun. The shaded areas indicate equal areas swept out
in equal times at different ~arts ~f t~e orb~t: C1~r~y, and magnitude of the force on a mass M at P is
the speed of the planet vanes WIth Its posItIon In Its simply F = M§. Note that the field is to exist at
orbit. P even in the absence of the mass M.
GRA VITA nON 526

It is sometimes convenient to introduce the thought of in terms of an infinite velocity of


gravitational potential which determines the propagation for the gravitational field. Letting
field through its gradient. For a spherical earth, the velocity of light become infinite in Max-
it is defined as well's equations eliminates all independent dy-
namical behavior for the electromagnetic field.
In that case there could be no radio or televi-
sion. The special theory of relativity which is
based on the velocity of light in vacuum being
In general I/> will satisfy Poisson's equation the maximum velocity for the transmission of
energy, implies that Newton's theory requires
a21/> a21/> a21/>
-+-+-=4rrp (2)
modification.
ax 2 ay2 az 2 Principle of Equivalence The mass of a body
may be measured either by weighing W = Mg
p is the density of matter. The potential energy
(gravitational mass) or by observing its motion
of a mass M, in the field is simply expressed in under a known applied force using Newton's
terms of 1/>, second law of motion F=MA (inertial mass).
The equality of these two differently defined
V=MI/> masses has been measured by V. B. Braginsky
and V. I. Panov of the Soviet Union to an ac-
This is equal to the work which must be done curacy of 2 X 10- 12 improving on earlier mea-
by an outside agent if he were to move the mass surements by R. H. Dicke and R. V. Eotvos.
M to its location in the gravitational field from It is this equality which distinguishes the gravi-
infinitely far away . tational force from all other forces in giving all
A body has enough speed to escape from the bodies the same acceleration. The discussion of
earth's attraction if its total energy, kinetic plus weightlessness pointed out that local effects of
potential, is zero or greater. From the definition the gravitational field are eliminated for an ob-
of potential energy above, the escape velocity server in free fall precisely because all bodies
at a distance R from the center of the earth is fall at the same rate. It follows that the gravita-
given by tional field measured by an observer will de-
pend on his state of motion. In a sense there is
2 2GME an equivalence between a gravitational field
Ves =-- (3) down and an acceleration up for the observer.
R However, the equivalence is not complete, for
At the surface of the earth the Ves = II km/sec. real gravitational fields converge on their
It is interesting to note that when R = R s , the sources so that two particles released at the
Schwarzschild radius, the escape velocity is just same time will drift closer together as they fall.
On the other hand, acceleration fields have no
equal to the velocity of light, c(3 X 105 effect on the separation of particles moving on
km/sec.): parallel paths (Fig. 2). In a curved space, ini-
2GM tially parallel geodesics-the "straight lines" -do
Rs=-- (4) not maintain a constant separation (e.g., great
c2 circles on a sphere). Thus, the gravitational field
may have its explanation in the geometry of a
The subscript E has been dropped from M be- curved space-time.
cause the Schwarzschild radius Rs is defined for Red Shift According to the quantum theory,
any mass. For the earth, Rs = 0.9 em while for a photon of frequency v has an energy hv (h is
the sun, Rs = 3 km. Now, according to special Planck's constant), and by the relation E = mc 2 ,
relativity, the speed of light is a limiting speed
for matter and cannot be exceeded by any sig-
nal. Therefore, it is fortunate that the Schwarz-

\\ II
schild radii of the sun and the earth are interior
to their surfaces. Otherwise we would receive
no energy from the sun and we could not ex-
plore the solar system.
Although one can introduce the gravitational
field, it is an auxiliary concept in Newtonian
theory for the field has no independent dynami-
cal behavior as is true of the electromagnetic
field (e.g., electromagnetic waves). At any time, ~
the Newtonian gravitational field is determined
(a) (b)
by the configuration of masses at that instant
and does not depend on previous history or FIG. 2. The paths of particles releaSed in: (a) an
state of motion. Thus, if the sun were to vanish, acceleration field (the acceleration is up, the apparent
the gravitational force on the earth would im- force is down); (b) a gravitational field showing
mediately be removed. This property may be convergence toward the source.
527 GRAVITATION

this quantum has a mass m =hv/c 2 • To lift a The curvature of space-time is defined in terms
mass m a height H requires expenditure of the of a four-index tensor Rvpajl., the curvature
energy mgH. Therefore, a photon emitted at tensor. The vanishing of the curvature tensor
the surface of the earth arrives at the height H means that no real gravitational field is present.
with the energy The field equations are ten linear combinations

hv - (hv/c 2 ) gH = (1 - ~~)=
hv hv'
of the curvature components which are of the
second order in the derivatives of the metric
tensor and are a generalization of Poisson's
At the surface of the earth, the frequency shift equation [Eq. (2) I. Symbolically these equa-
amounts to tions are written
!w Gpa = 87T1< Tpa
- = 1.1 X 10- 16 H(H in meters)
v
where Tpa is a symmetric tensor which de-
This shift was measured by Pound and Rebka, scribes the distribution of matter and energy
using the Mossbauer effect, in good agreement throughout space-time and I< = G/c 2 . In a weak-
with the prediction. As standard clocks are de- field static approximation, these equations con-
termined by atomic transitions in freely falling tain Newton's theory of gravitation with the
atoms, it follows that if the same photon were Newtonian gravitational potential given by
emitted at the height H, it would be measured 2¢/c 2 =g44 - 1.
to have the frequency v not v'. Therefore, an The metric tensor outside a static spherically
observer at H must conclude that clocks at H symmetric mass distribution is given by the
run fast compared with the identical clocks on Schwarzschild solution:
the ground in the ratio (v/v') (Fig. 3).
Einstein's Theory of Gravitation Albert Ein-
stein assumed that gravitation is a physical ds 2 = (I 21<m)
- -r- dt 2 - (1 - 2:mf dr 2
effect produced by the curvature of a four-
dimensional space-time. The generalization of
Newton's gravitational potential is the metric - r 2d8 2 - r2 sin 2 8dI{J2
tensor gpa in terms of which the four-dimen- This geometry exhibits the red shift described
sional distance, and hence the geometry of above and in addition shows three other effects:
space-time, is determined: ( I) The bending of a ray of light passing near
4
the sun's edge by
ds 2 = L: gpa dx P dx a 58 = 1.75".
p, a=l
(2) The precession of the perihelion of Mer-
cury by
z 5¢ = 43".03/century.
(3) The retardation of signals passing near
the sun. For a radar pulse reflected from Mer-
cury, this amounts to a maximum time delay

H~----,--------r-
!::1t = 1.6 X 10-4 sec.
/ /

/
/

/
/
/
Observations and experiments to check these
/
/
/

/
/ predictions are still in progress.
/
/
/
/ Since one can see stars near the sun's edge
//
/
/
/
only during an eclipse, the optical data on the
GROUND / _ _ _ _ _ _- L_
/ _ _ _ _ __ bending of light have been slow and difficult to
obtain and such measurements have poor reli-
LEVEL~

TIME ability-about 10 to 25 per cent. However, us-


ing radio frequency measurements, a group in-
FIG. 3. Photons are emitted on the ground and are cluding I. I. Shapiro has observed the angular
received at the height H. Between the two dotted lines positions of two sources, 3C279 and 3C273
representing the beginning and end of a pulse, the which have an angular separation of about 10°.
same number of oscillations, n, are received at H as The latter source acts as the reference as 3C279
are emitted at the ground level. Because of the red is occulted by the sun each year on October 8.
shift, the interval t' between oscillations at H is greater Their results are in agreement with Einstein
than the interval t between oscillations on the ground. theory within 6 per cent. E. Fomalont and
Therefore, the time measured at H for the reception R. Sramek have carried out similar observation
of the n oscillations is greater than the time required of three radio sources, the central one being
for their emission on the ground: nt' > nt. This result occulted by the sun on April 11 of each year,
implies that clocks run faster at H than on the ground. which gives agreement within 2 per cent. Im-
GRAVITATION 528

provements in these observations are expected Gravitational Lens Effect A consequence of


in coming years. the bending of light by a massive object is that
Shapiro has also reevaluated the optical data a distant galaxy may behave like a lens by bend-
with regard to the solar system and has also ing light and other electromagnetic signals from
taken new data using radar ranging. In both even more distant objects.
cases he finds agreement with the predicted Two quasars, labeled by their astronomical
value for the perihelion precession of Mercury coordinates 0957 + 561 A, B, were observed to
within 3 per cent. If he combines the data, he have identical red shifts, identical spectra, and
reduces the error to 1 per cent. very similar shapes. Further analysis indicated
As a new test of Einstein's theory of general that they are two images of the same object
relativity, Shapiro suggested measuring the re- produced by an intervening galaxy which acts
tardation of radar echo signals from Mercury as a gravitational lens. Qualitatively, if the earth,
when the planet moves into a position of su- a galaxy, and a distant quasar are lined up, one
perior conjunction. The gravitational field of can expect that signals passing near or through
the sun, as represented by the Schwarz schild the outer limbs of the galaxy would give us two
solution, not only produces a bending of the images displaced from the center while a central
ray, but also affects the time of flight of the image might also be formed. Thus one may ex-
signal. Therefore, the time delay between the pect to see three images plus a central galaxy
transmission of a radar pulse to Mercury and which might obscure the central image. As the
the reception of the reflected signal will depend galaxy moves off the earth-quasar line of sight,
not only on the relative positions of the earth two of the images would move closer together,
and Mercury in their respective orbits, but also coalesce, and eventually would not be refo-
on whether the radar signals pass near the sun cused to the earth if the galaxy is too far off
(Fig. 4). Current measurements give agreement the line of sight. Since, in fact, only two images
within 2 per cent. are observed one infers that two of the images
C. H. Brans and R. H. Dicke have proposed a have merged and, indeed, it appears that the
"scalar-tensor" theory of gravitation in which image of the galaxy itself is obscured by one of
the added scalar function determines the the images (see Fig. 5).
strength of the gravitational interaction. In this Gravitational Collapse. The gravitational force
theory the gravitational constant G is no longer between any two masses is attractive. There-
a universal constant, but depends on location fore, given a quantity of matter, under action
and time. This effect has not been verified, but of gravity alone it will become as compact as
the theory also introduces small changes in the possible. In the planets the compaction process
above predictions of the Einstein theory. Un- is stopped by the electrical forces which act
fortunately, the experimental errors in the ob- between atoms and molecules in close range.
servations are such that one cannot distinguish The pressure in the sun, however, is much too
between the Brans-Dicke and the Einstein great to be supported by such solid body forces.
theories. Nonetheless, the seeming arbitrariness This tremendous pressure is balanced primarily
of the scalar interaction makes the simpler by the counter pressure of electromagnetic
Einstein theory preferable at the present time. radiation which is produced by the nuclear
processes at the sun's center. Stars in which the
nuclear processes have ended undergo a further
contraction which is stopped by the pressure of
A free electrons at the densities associated with
EARTH ORBIT white dwarfs. This pressure, which occurs be-
I \ cause electrons obey the Pauli exclusion prin-
I \ ciple, is capable of supporting up to 1.4 solar
I \
\ masses within a volume of 10-4 to 10-8 of the
solar volume. Objects which are more massive
I continue the crush. Neutrons become the most
I stable particles in the interior and the contrac-
I
tion is stopped by repulsive nuclear forces when

I
f® a neutron occupies only about 10- 39 cm 3 , the
nuclear volume. If the resulting neutron star is
I one solar mass, its radius is just 10 km and its
I
I volume 10- 15 the sun's volume . Objects with
more than about 1.2 solar masses cannot be
B stable as neutron stars. They continue to con-
tract. Beyond this point the situation is con-
FIG. 4. There will be very little time delay, due to fused by the abundance of exotic elementary
the sun's gravitational field, in the reception of the particles, but there is no theoretical evidence
reflected radar signal when the earth-Mercury position that the contraction can be stopped .
is as in A-C compared to that when their position is as One might have hoped that Einstein's theory
in A-B. of gravitation would contain a short-range re-
529 GRAVITATION

FIG. 5. This is a black and white print of a false color photograph. The display at the
left was generated by a computer program that summed and color-coded many images of
the twins. Elongation of both the northern twin and the southern one in the direction of
four o'clock is an artifact of imperfect alignment of the telescope optics. The northern
image was adjusted so that it had the same brightness as the southern one. Display at the
right was made by subtracting the adjusted northern image (top) from the unadjusted
one (bottom). What remains is an image of the galaxy one arc second north of the south-
ern twin. (Courtesy of the Institute for Astronomy and Planetary Geoscience Data Pro-
cessing Facility, University of Hawaii.)

pulsion which would stop this endless contrac- ing neutron star.) So, not everything continues
tion. However, the opposite is the case. First of to collapse. But, there are many very massive
all, all forms of energy contribute to the attrac- stars and in the absence of more information it
tive mass in general relativity, and secondly, the is unreasonable to rule out the possibility that
fact that matter determines the geometry means some will indeed go through an indefinite col-
that there should be peculiarities in the space lapse or that some may have already done so.
when an object is highly collapsed. There are What physical effects result from the col-
several general theorems, particularly by R. Pen- lapse? It was pointed out above, Eq. (4), that
rose and S. Hawking, whose general conclusion at the Schwarzschild radius the escape velocity
seems to be that as long as the energy density from a point mass is the velocity of light. Thus,
remains everywhere positive, collapse is inevita- no signal can escape from a body which has col-
ble. This does not mean that collapse actually lapsed below Rs. This result can in fact, be de-
occurs in nature. As a very massive star pro- duced from the Schwarzschild solution of the
ceeds through the various stages indicated in Einstein equations which is given above. As a
the above paragraph, it may become unstable result, our knowledge of events is limited at the
and throw off enough mass through an explo- Schwarzschild radius; the surface r = Rs is an
sive process, such as a supernova, that it may absolute event horizon. (Because we can receive
settle down at a planetary size, or as a white no light or other signal from a source which has
dwarf, or as a neutron star. We have observa- collapsed below its Schwarzschild radius, we
tional evidence for the existence of these ob- call such objects black holes.)
jects. (A pulsar is thought to be a rapidly rotat- Note that a neutron star of one solar mass has
GRAVITATION 530

a radius of 10 km while Rs = 3 km; a neutron


star of 10 solar masses will have a radius of 30
km and Rs = 30 km. Thus, we have observa-
tional evidence for the existence of objects
which are very nearly black holes. There is at
present an active search for peculiar stellar mo-
tions which might indicate the presence of a
black hole. On the other hand, if there should
be a collapsed object in the nucleus of a galaxy,
the very high concentration of stars near the
black hole would produce a cusp in the surface
·brightness. So far there is no positive evidence,
but the existence of black holes in the center of
galaxies or as collapsed individual stars has not
been ruled out.
Gravitational Waves Einstein's field equa-
tions require that the gravitational field have a
finite velocity of propagation-the same as that
for light. Therefore, the gravitational field has FIG. 7. One of the aluminum cylinders used by
independent dynamical degrees of freedom Professor Joseph Weber as a detector of gravitational
which permit gravitational waves to exist in waves. (Courtesy of Professor Weber.)
two states of polarization. These states are
wholly transverse. That is, the waves act on
matter only in planes which are orthogonal to
the direction of propagation. In passing through cident gravitational wave sets up transverse
matter, one state produces oscillations such oscillations in the cylinder. These oscillations
that there is a compression followed by elonga- are transformed into electrical signals by piezo-
tion along one axis and a corresponding elonga- electric crystals which are bonded to the sur-
tion followed by compression along the perpen- face of the cylinder (Fig. 7). However, the ob-
dicular axis (Fig. 6). For a periodic wave this servations are made at room temperature and
process repeats at the frequency of the wave. the thermal fluctuations produce displacement
The other state of polarization has the same of the end faces of about 10- 14 cm-an amount
effect along axes rotated by 45°. This character which, although only one-tenth the diameter of
for the modes is caused by the tensor nature of a nucleus, interferes with the expected gravita-
the potentials gpa which limits the lowest order tional effect. The gravity wave signal must be
of gravitational waves to quadrupole radiation. separated out from this background noise. To
A crude estimate of the energy radiated by the eliminate fluctuations due to earth tremors,
earth-sun system per year amounts to 10 16 ergs trucks, or people dropping hammers, the whole
(about 106 kWh). Radiating at this rate, the apparatus is acoustically insulated from the
earth has lost about 10- 15 of its available me- ground.
chanical energy since its formation 5 X 10 9 i The initial detection program used principally
years ago. Presumably there are stronger sources two identical cylinders, 153 cm long and 66 cm
of gravitational waves available in the universe. in diameter. One is located at the University of
Experiments to detect gravitational radiation Maryland and the other at Argonne National
were begun in 1958 by J. Weber. For a detector Laboratory, 1000 km away. The electronic
he used an aluminum cylinder which is sus- recording system is narrowly tuned to 1660 Hz
pended in the earth's gravitational field. An in- which has an acoustic half-wavelength of 153
em in aluminum. Thermal oscillations are ran-
domly generated and one does not expect cor-
relation between the outputs of two detectors,
• •
particularly if they are 1000 km apart. There-
• • • • fore, Weber looks for coincidences in the out-
put signals of the two detectors. The observa-
tion technique is to record each signal sepa-
• • • • rately at its own location and at the same time
• • • • • to transmit the Argonne signal to Maryland
where it can be compared directly with the
• Maryland signal. Coincidences of a certain
(a)
pulse height are then marked (Fig. 8). A careful
(b)
statistical analysis compares the coincidence
FIG. 6. (a) A circular arrangement of dust particles rate due to random fluctuations with the ob-
before a gravitational wave arrives. (b) The same served rate. Weber concludes that there is a
particles after a passage of a wave consisting of one "significant coincidence rate of about one
state of polarization. The second state of polarization every two days."
would produce the same effect, rotated at 45° . Unfortunately other centers which have con-
531 GRAVITATION

tern has been observed which appears to exhibit


the dynamical effect resulting from its emission
of gravitational waves. A pulsar has been ob-
served which is part of a double star system.
The pulsar, identified as PSR 1913 + 16, has a
precise spin rate of 16.94 revolutions per sec-
ond. It is an object of 1.4 solar masses and has a
diameter of about 20 kilometers. Its companion
is about the same size, but there is no search-
light to indicate that it is also a pulsar.
Using equations of motion derived from Ein-
J stein's theory, the parameters describing the
1111 /1/1//1/ I II III/III orbit have been determined. This is possible
'--AA1IL ~J/. I / U ~ JIr~ '.)./ U j Ii u.J. with high accuracy because the precise spin rate
//////171////// ,/ '17 n 1 / 1// / can be used as a clock. One found that the orbit
is tilted 45° from the perpendicular plane to
Coincidence Time Mark! Ma ry land Detector the line of sight, the separation of the two com-
FIG. 8. Argonne National Laboratory and Uni- panions varies from 1.1 to 4.5 solar diameters,
versity of Maryland coincidence. (Courtesy of Pro- and that the periastron advances 4.2 degrees per
fessor Weber and Physical Review Letters.) year. Even more remarkable, however, the pe-
riod of periastron passage is decreasing (Fig. 9).
Over the six years of observation the period has
decreased by 1 second. This means that the
structed detectors for gravitational waves have bodies have moved a little closer together,
been unable to corroborate Weber's observa- which indicates a loss of energy. This loss has
tions. Therefore, at this time there is general been calculated to be that expected if the sys-
agreement, among most workers in the field, tem PSE 1913 + 16 is radiating gravitationally
that Weber's results cannot be accepted as evi- in accordance with the Einstein theory.
dence for the existence of gravitational waves. Quantum Theory The gravitational interac-
However, his claims have stimulated many other tion among the elementary particles is down by
people to begin the search for gravitational a factor of 10-40 from the electromagentic or
radiation . Many improvements in the design of strong nuclear interactions. Therefore, one can-
detectors have been proposed. The more ob- not expect to observe quantum effects at the
vious is to cool the detector with liquid helium level of today's experiments. Nonetheless, be-
to reduce thermal oscillations which mask the · cause the gravitational field has two indepen-
change in dimensions produced by an imping- dent modes, the quantum field formalism re-
ing gravitational wave. Several ingenious ideas quires that the gravitational field operators
have been proposed which could improve the exist and satisfy appropriate commutation rela-
sensitivity by as much as a factor of lOs. tions. However, the gravitational field equations
Among these is the use of very large (several are complicated by being nonlinear and by
kilograms), carefully grown and annealed sap- their covariance under arbitrary coordinate
phire crystals instead of aluminum bars. An- transformations. From its linear approximation,
other is the use of a carefully designed micro- one expects that the quantized field will be a
wave cavity whose frequency would shift by a spin 2 boson field . However, the linear approx-
very small amount when a wave impinges on it. imation may be deceptive because the gravita-
A third design would use the shifting interfer- tional field determines the geometry of space-
ence fringes of laser light when the distance be- time . Quantization of the geometry may have
tween mirrors changes because a gravity wave is the effect of undermining our current descrip-
incident . Progress is slow because the techno- tion of particle properties. Therefore, the labels
logical problems are difficult. But there is con- we use to distinguish particle properties may be
fidence that in about ten years we will be lost. Indeed, one can show that a black hole is
monitoring the arrival of gravitational radiation characterized completely by its mass, angular
inaugurating gravitational astronomy. When momentum, and electric charge. Thus, when
radio astronomy began, one did not know or elementary particles are absorbed by a black
understand the origin of the radiation. Now, in hole, all of their characteristics, e.g. , baryon
fact , radio observations rival and in some cases num ber, flavor, and color, disappear. In this
exceed the resolution of optical observations. way, the gravitational field may be a link among
These have shown structures which can not be all particles and all interactions.
seen optically. The same problems will arise As a result, a number of novel developments
with gravitational radiation; the interpretation have occurred. One is an attempt by Roger Pen-
of the observations will be much more difficult. rose to smear out space-time points by identify-
But we may learn much about the universe ing them with the intersections of light rays.
which can not be discovered otherwise. The light rays themselves are to be described
Although scientists are still waiting for the by new mathematical objects called twistors.
observation of gravitational waves, a stellar sys- (Twistors have a similar relation to the con-
GRA VIT A nON 532

0
!
iii'
0
z
0
0
w
~
w
:::;;
i=
z
0
a: -1
~
(J)
«
crw
a..
~
t::
x
(J)

-2

1974 1982

FIG. 9. Emission of gravitational radiation by PSR 1913 + 6 leads to an increasing deviation in


the time of periastron passage compared with a hypothetical system whose orbital period remains
constant. The solid curve corresponds to the deviation predicted by the general theory of relativity
while the dots represent the measured deviation. The pulsar now reaches periastron more than a
second earlier than it would if its period had remained constant since 1974. The data provide the
strongest evidence now available for the existence of gravitational radiation. (Courtesy of J. H.
Taylor, J. M. Weisberg, and The Astrophysical Journal.)

formal group in four dimensions as spinors have An intiutive understanding of this process can
to the Lorentz transformations.) If one can be given as follows. The fluctuations of a quan-
construct a quantum theory of twistor fields, tized field produce particle-antiparticle pairs. If
then space-time and its geometry would like- one particle of a pair should be produced with
wise be quantized. Another interesting sug- momentum toward the horizon, the other will
gestion is that through quantum gravity the be produced with momentum away from the
distinction between Fermi-Dirac particles (half- horizon. There is a finite probability that one
odd-integral spin) and Bose-Einstein particles particle will fall across the horizon while its
(int,gral spin) can be broken down. This work antiparticle moves away from the horizon and
comes under the heading of "supergravity." is observed. The net result is that black holes
While many interesting results have been ob- are unstable because of quantum fluctuations
tained from studies of both of these new ideas, of the various quantized fields. However, the
the results are still too preliminary to suggest decay of black holes is a very slow process, the
that either is in the right direction. temperature of a IM0 black hole being T0 =
On the other hand, Stephen Hawking has pro- 10- 6 OK. It is estimated that any black hole
duced a striking result in a semiclassical treat- created in the initial big bang smaller than 10 15
ment. He considers a quantized scalar field in grams would have radiated away all of its mass-
a classical background metric-that of the energy by this time.
Schwarzschild metric which has an absolute Cosmology One expects the gross structure
event horizon. Hawking assumes that in the in- of the universe to depend on global geometrical
finite past the scalar field is in its lowest energy properties and the mean distribution of matter
state-the vacuum state. He then shows that in more than on the details of particle interac-
the infinite future the scalar field is in a state tions. Among the various cosmological solu-
corresponding to a black body temperature of tions of Einstein's equations are those models
based on a homogeneous and isotropic matter
fzc 3 M0 distribution in which matter is streaming away
Tb = ~ 10- 6 - from every point with a velocity dependent on
8rrGMk M . its distance from that point. Such models agree
533 GRAVITATION

with observations made by Hubble on the red expansion. One attempts to distinguish between
shift in the spectra of distant galaxies. He inter- the steady-state theories and the evolutionary
preted this red shift as a Doppler effect due to a models by counting the number of galaxies and
recessional velocity of the galaxy which is pro- the number of radio sources as a function of
portional to its distance: distance. These observations, though not defini-
tive, suggest that the evolutionary models are
v =Hr correct.
There are two other observations which
with H = 30 km/sec/l 0 6 light-years and r mea- strengthen this conclusion. One of these is the
sured in millions of light-years. This value of H recognition of the QSO (quasi-stellar objects) as
leads to an age for the universe of about 10 10 extra-galactic objects. The QSO are very small,
years. very luminous objects, and with very large red
The models based on Einstein's equations are shifts. The simplest and most satisfactory inter-
evolutionary. This means that the rate of ex- pretation of the observed red shifts is that it is
pansion of the universe, hence Hubble's con- the cosmological Doppler shift. With this inter-
stant H, depends on time. Bondi and his co- pretation, 3C273, the nearest, most luminous,
workers have suggested that the universe is in a and the first identified QSO, is at a distance of
steady state and that H is in fact independent 1.6 X 10 9 light-years. This distance is so great
of time. The steady-state theory says that the that it is possible that an early state in the de-
distribution of galaxies and the density of mat- velopment of galaxies is being observed. On the
ter remain the same on the average. New mat- other hand, the amount of energy being re-
ter is created in order to fill in the thinning leased is so great ('" I 049 ergs/sec) that a nuclear
density of matter as the universe maintains its source for the energy is ruled out, and it is sug-

20.-------.-------.--------,-------.-------,
This Work:
f----i Resolution

~ Measured CBR
I 15
E
u
"-
II>

N
E
I p - wave
u T Opt i cal
'- 10 J..

3 2 .96 K B a
l ckbody
N

0
)(
~
5
LL

O~----------------~~~~~~~

o 5 10 15 20 25
Frequency (cm- I )

FIG. 10. Measured spectrum of the cosmic background radiation plotted as


±lu error limits. There are gaps in the data at the frequencies of strong atmo-
spheric emission lines where the errors become very large. The shaded region in-
cludes contributions to the error which are uncorrelated across the spectrum.
Additional solid lines are shown to represent the effect of changing the frequency-
independent scale factor by ±lu. The shaded region can be scaled up or down
within these limits. The spectrum of the 2.96°K blackbody, which has the same
integrated flux as the measured spectrum, and selected microwave and optical
measurements of the CMB are also shown for comparison. (Courtesy of David P.
Woody and Paul L. Richards and The Astrophysical Journal.)
GRA VITA TION 534

gested that the energy comes from the gravita- does not select a particular cosmological model
tional contraction of 10 10 M0 down to its from the many which are allowed by Einstein's
Schwarzschild radius of about 10 10 km. How- gravitational theory, those permitted by the
ever, it is equally possible that the QSO repre- Brans-Dicke modification, or other possibilities
sent the opposite effect, the exploding of mat- which have not yet been constructed.
ter from a highly compacted state which is
being thrust out. At present the models giving JOSHUA N. GOLDBERG
the energy through collapse seem to fit the ob-
servations. From the point of view of the References
steady-state cosmology, the QSO create serious 1. Bergmann, P. G., "The Riddle of Gravitation,"
problems. They are far away and there seems to New York, Charles Scribner's Sons, 1968.
be an overabundance of them at very large dis- 2. Bonnor, W., "The Mystery of the Expanding Uni-
tances, hence a long time ago. verse," New York, The Macmillan Co., 1964.
In 1949, George Gamow and his co-workers 3. Chaffee, Frederic H., "The Discovery of a Gravita-
suggested that at an early stage in an evolution- tional Lens," Scientific American 243(5), 70
ary expanding universe, matter and radiation (November, 1980).
interacted very strongly and were in thermal 4. Davies, P. C. W., "The Search for Gravity Waves,"
equilibrium. As the universe expanded, the op- Cambridge, U.K., Cambridge Univ. Press, 1980.
portunity for interaction decreased, the radia- 5. Gamow, G., "Gravity," Garden City, N.Y., Dou-
tion and matter were effectively decoupled. bleday and Company, 1962.
Matter cooled more rapidly and condensed into 6. Partridge, R. B., "The Primeval Fireball Today,"
the galaxies and stars. The radiation maintained American Scientist 57, 37 (1969).
its thermal equilibrium distribution, its black- 7. Rindler, W., "Essential Relativity," New York,
body distribution, behaving like a gas which is Van Nostrand Reinhold, 1969.
undergoing an adiabatic expansion. Gamow 8. Sciama, D., "Modern Cosmology," Cambridge,
estimated the temperature of this remnant of U.K., Cambridge Univ. Press, 1971.
the primeval fireball at about 2S o K. Fifteen 9. Weber, J., "General Relativity and Gravitational
years later, Dicke came to a comparable con- Waves," New York, Interscience Publishers, 1961.
clusion independently and set about to observe 10. Weisberg, J. M., Taylor, J. H., and Fowler, L. A.,
this radiation. In the meantime, Penzias and "Gravitational Waves from an Orbiting Pulsar,"
Wilson, while tracking down residual noise in a Scientific American 245(4), 74 (October 1981).
sensitive microwave radiometer, discovered an
isotropic background radiation at a wavelength Cross-references: ASTROMETRY, ASTROPHYSICS,
of 7.4 cm whose intensity corresponded to the COSMOLOGY, POTENTIAL, RADIO ASTRONOMY,
emission of a blackbody at a temperature of RELATIVITY, SOLAR PHYSICS.
3.5°K. Subsequent measurements by many ob-
servers have verified the existence of this back-
ground radiation, but have reduced the tem-
perature to 2.7°K. GYROSCOPE*
The use of microwave radiometers from the
ground was very successful in mapping the low The gyroscope consists of a flywheel or a sphere
frequency region. Unfortunately, because of that is spinning (usually at high speed) about an
atmospheric absorption and interference from axis. If this axis is free in space (such freedom
local sources, this method of measurement may be provided by gimbals, by floating the
could not be carried over the peak of the curve spinning mass on a column of gas or fluid, or
for black body radiation at 2.7 OK. However, by suspension in a magnetic or an electrostatic
recent measurements by Woody and Richards field), the axis will remain parallel to its original
used a balloon-based spectrophotometer to position even though the gyroscope is mounted
measure the cosmic background radiation from on a vehicle that translates and rotates in three
the low frequency side well over the peak into dimensions. This property of the "free gyro"
the high frequency side of the black body curve. often referred to as "spatial memory" was used
Within the errors of their measurements the as an artificial horizon as early as 1744 by
total flux in the frequency range 2.4 - 13.5 Serson. This permitted a ship's navigator to
cm -1 (A -1) agrees with the flux from a black take readings with a sextant (measurement of
body at a temperature of 2.96 OK with lower the angular elevation of a star with respect to
and upper limits of 2.88 and 3.09 OK respec- the horizon) when the horizon was obscured by
tively. These results together with those from darkness, fog, mist, etc. This early instrument
earlier measurements are shown in Fig. 10. has led to the modern vertical gyro.
The prediction and observation of the black- In 1852, Leon Foucault built one of the first
body radiation and its interpretation as evi- precise gyropscopes. Utilizing a flywheel 80
dence of a primeval fireball give very strong millimeters in diameter supported in near-fric-
support to the concept of an evolutionary uni- tionless gimbals, this instrument was sensitive
verse, a universe which has expanded from a
compact volume to its present size. However, it *I11ustrations and text excerpted from Refs. 1 and 2.
535 GYROSCOPE

enough to detect the rotation of the earth. This rate of turn. The principle is also used for plat-
free gyro maintained a fixed orientation in form stabilization where the platform carries
space as the earth turned. The relative motion instruments which must bear specific orienta-
was observed through a microscope. tions with respect to the earth. By carefully
In 1896, the gyroscope saw its first applica- regulating the torque, the gyroscope and plat-
tion for guidance when Obry used it in a self- form rotate with the earth without being in
propelled torpedo. An unguided torpedo, under contact with the earth.
the influence of winds, currents, and ocean Gyroscopic Torque If the gyro is not free,
waves would not follow a prescribed course. but is forced tor turn , its spin axis is forced to
The gyroscope, on the other hand, was pointed change its orientation. Refer to Fig. lea). Let
at the target before launching the torpedo, and the initial position of the spin axis be repre-
by means of linkages connected to the spin axis sented by the vector WI and the position after
support, it actuated the rudder of the torpedo, time !:1t be the vector Wz. The vector !:1w rep-
steering it along a straight course. resents the change in angular velocity. It has a
When the gyroscope is not free, that is, when magnitude
the spin axis is forced to turn in space, the gyro-
scope develops a torque about an axis that is (1)
perpendicular to the plane containing the spin
axis and the axis about which turning takes where !:1¢ is the angle turned during time !:1t.
place. This property was utilized to find north Likewise, the angular momentum vector H
by continually reorienting the spin axis until may be shown initially and after time !:1t, as
indicated in Fig . l(b) .
the gyroscope could not detect the rotation of
the earth. This instrument, called the gyrocom- H = Iw (2)
pass, was perfected in 1908 by Anschutz of
Germany and by Elmer Sperry (1911) of the where I = mass moment of inertia of wheel
United States. This instrument used a wheel about the spin axis
0.1 to 0.2 meters in diameter, and was sus- W = angular velocity of wheel.
pended in such a way that it tended to remain
vertical. Since the vertical turned with the earth, The change in magnitude of angular mo-
the gyroscope following it produced a torque. mentum is
This torque, in turn, was used to provide self-
turning of the spin axis in a direction that !:1H = I!:1w (3)
would reduce the torque. In the final equilib- The torque required to change the angular mo-
rium position, the instrument pointed north. mentum is Ti, and the equal and opposite
The gyrocompass is still widely used today . torque T felt by the structure is
The gyroscopic torque is also used to stabilize
ships. Due to the motion of waves, the unstabi- !:1H
lized ship rolls considerably. It is impractical to T=- (4)
shift huge masses fast enough to counteract the !:1t
irregular motion. On the other hand, a gyro-
scope develops torque instantly and with much Apply Eqs. 0)-(3) to Eq. (4):
less effort. This led to the development of the !:1¢
gyroscopic ship stabilizer. Utilizing a wheel 3 T=H- (5)
to 6 meters in diameter, and a hydraulic turning !:1t
mechanism, angular rates are applied to the spin
axis support producing tremendous gyroscopic In the limit, !:1¢/!:1t approaches 4>. If the turn
torques upon the ship . In moderately rough rate vector 4> makes an angle 1/1 with the plane
seas, the gyro stabilizer can eliminate 70 to 80 containing the vectors HI and Hz, then the
per cent of the roll motion. The control signal perpendicular component is
to the turning mechanism is provided by a small 4> sin 1/1.
"vertical gyro" which is very much like Serson's
artificial horizon gyro . This essentially is a guid- Applying the above concepts to Eq. (5), we
ance system (like Obry's) using a small guidance have
gyro to provide the vertical and a large gyro to
provide the "muscle ." T= H4> sin 1/1 (6)
If the motion of the gyro is free, and if a as the magnitude of the torque. In vector nota-
torque is applied to the structure containing tion, this becomes
the spinning mass, it will turn or precess about
an axis that is perpendicular to the plane con- T = H X 4>. (7)
taining the torque axis and the spin axis. This is
the converse of the gyroscopic torque phenom- In order to determine the sense of the torque
enon. By adjusting the amount of torque, the vector T, rotate the angular momentum vector
gyroscope provides a controlled rate. This prop- H toward the input rate vector 4> and apply the
erty is utilized in an autopilot during a constant right-hand rule.
GYROSCOPE 536

~
H2 = IW2

W2
~H='W"' H, = Iw ,

w,

(a)

~ ¢

1
, T, H
./
,/
./ I
./ I
H II
II
II
} II
..- ,, ')
-
~

T " "~
(e) -- --
--~ " "
~

--

(d)

FIG. 1. Gyroscopic torque.

Gyro Drift The free gyro maintains its orien- is approximately,


tation in space, provided that there are no dis-
turbances to the gyro. However, the presence H=MR 2 w. (10)
of small torques produces a precession or gyro Then the gyro drift is given approximately by
drift. Usually the magnitude and direction of
the torque are unknown. Hence, gyro drift is
usually unknown, and thus, constitutes an O.d =Mgb gb/R
--=-- (11)
error. MR w
2 Rw·
One common source of this unknown or un-
certain torque is the axial shift of the spin axis In superprecise gyroscopes, the axial shift b is
bearings. In a vertical gravity or' -acceleration of the order of one millionth of the wheel ra-
field g, the torque Td is given by dius, the wheel speed w = 25,000 rpm = 2510
rad/sec, and the wheel radius R = 25 mm =
Td =Mgb (8) 0.025 meters. Then,
where M = mass of gyro wheel
g = acceleration along the I/J axis 8d = (9.8 m/s2) 10- 6
b = axial shift. (0.025 m) (2510 rad/s)
The resulting gyro drift rate 8d is given by =0.156 X 10- 6 rad/s
. Td Mgb = 0.0322° /hr.
Od=-= - - . (9)
H H
A tolerance of one part in a million may seem
If the gyro wheel has most of its mass concen- extraordinary. However, as extreme as this
trated in its rim, then the angular momentum seems, it is not good enough for typical applica-
537 GYROSCOPE

tions in inertial guidance systems, where a drift 2. Cochin, Ira, "Analysis and Design of the Gyro-
rate of 0.010 Jhr is required. For such applica- scope," New York, John Wiley & Sons, 1963.
tions, exceptional accuracy is demanded-pos- 3. Bulman, D. N., and Maunder, L. J., "Dynamics
sible only in ultrasophisticated instruments. of the Dynatune Gyro," Mech. Engr. Sci., 22(3),
One such device is the "ring laser gyro," which 137-141, (June 1980).
consists of three lasers (each about 0.1 meter 4. Chow, Weng, and Hambenne, Jare!, "Multioscil-
long) arranged in a triangle. This instrument lator Laser Gyros," IEEE Trans. Quantum Elec-
makes use of the linear momentum of a beam. tron., QE-16 (9),918-935 (Sept. 1980).
5. McLeod, D. L. "Solid Rotor Electrostatic Gyro,"
IRA COCHIN IEEE Proc. Nat!. Aerosp. Electron. Conf., vol. 3,
pp. 1199-1205, May 1979.
References
Cross-references: ROTATION-CURVILINEAR MO-
1. Cochin, Ira, "Analysis and Design of Dynamic TION: INERTIAL GUIDANCE; MASS AND IN-
Systems," New York, Harper & Row, 1980. ERTIA; MECHANICS.
H
HALL EFFECT AND RELATED hydrodynamics. For example, it was pointed
PHENOMENA out in a paper given at the symposium men-
tioned above that the Hall currents in the
If a current of particles bearing charges of a aurora can be as large as one million amperes.
single sign and constrained to move in a given In magnetohydrodynamic technology, Hall
direction is subjected to a transverse magnetic phenomena enter into the design of MHD
field, a potential gradient will exist in a direction electrical power generators.
perpendicular to both the current and the mag- Principles Involved The Hall effect is a mani-
netic field. This phenomenon is called the Hall festation of the force, usually known as the
effect, after E. H. Hall who discovered it in a Lorentz force, which is exerted on a charged
metai -in- -r879-. To commemorate the 100th particle moving perpendicular to a magnetic
anniversary of Hall's discovery, a symposium field. The direction of the force is perpendicular
was held on November 13, 1979 at The Johns to both the magnetic field and the velocity of
Hopkins University. It was at this university, the charge. If this sidewise thrust is not counter-
while a graduate student under Henry Rowland acted, the charged particle undergoes a deflec-
in the Physics Department, that Edwin Herbert tion. The result of such motion is to create a
Hall did the experiments which led to his dis- charge unbalance and to produce a transverse
covery of what is now universally known as the electric field component, known as the Hall
Hall effect. The published proceedings of the field. The force on the charged particle due to
centennial symposium, entitled "The Hall this Hall field tends to oppose the force result-
Effect and Its Applications," occupy a total of ing from the magnetic field. Mathematically, the
550 pages and illustrate the wealth of activity forces and fields in question are represented by
which has evolved from Hall's work. For ex- the following vector equation:
ample, studies of the Hall effect provide ex-
tremely useful techniques for obtaining infor- F=e[E+(l/c)vXH] (1)
mation about the electronic properties of
solids. As will be discussed in more detail, where F is the force, e the charge, and v the ve-
Hall data can be used to obtain concentrations locity of the particle, E is the electric field, and
and numerous other properties of the charge H is the magnetic field. A permeability of
carriers. In addition, the Hall effect is the unity is assumed, and for the Gaussian system
basis of a variety of specialized devices, in of units, c is the speed of light and F is in dynes.
which the fundamental unit (referred to as a The use of the symbol H in the above equation,
Hall element or, sometimes, as a Hall generator) when more precisely the magnetic induction
is made of a material in which the voltage or, B is meant, is perhaps unfortunate. However,
depending upon the application, the power pro- this notation was used almost universally in
duced by the Hall effect is especially large. the early literature on Hall phenomena. In the
More specific details are given later in the sec- Gaussian system the values of Band H are the
tion entitled "Hall Effect Applications." same in nonmagnetic materials (a permeability
Broadly stated, the utilization of the Hall ef- of unity). As the equation indicates, the total
fect yields a variety of miniature, light-weight, force on the particle of charge e is the vector
highly reliable devices which can efficiently sum of that due to the electric field E and that
be used in a number of applications such as: resulting from the magnetic field. The latter
(l) replacement of the troublesome contacts enters via the vector cross product v X H, which
commonly found in conventional circuitry; represents a vector perpendicular to both v and
(2) measurement of position or velocity of ro- H and in the direction iridicated by the right
tating elements; (3) measurement of magnetic hand rule as v is rotated into H. The magnitude
fields or magnetic field gradients by means of of the vector v X H is equal to the product of
a simple probe and associated circuitry. The the individual magnitudes of v and H times the
above applications are of especial importance sine of the angle between them. Hence, only
in the guidance and sensor systems of satellites. the component of v perpendicular to H con-
Finally, Hall phenomena play an important role tributes to the sidewise force.
in plasmas in magnetic fields and in magneto- The physics involved in the Hall effect is con-

538
539 HALL EFFECT AND RELATED PHENOMENA

+ + + ++ +

FIG. 1. Hall field due to action of magnetic field on positive charge carriers. For example shown, e, lx, and Hz
are positive and, therefore, ux , Ex, and Ey are also. The resultant electric field and the Hall angie are shown at
lower left. In the case of electrons, the Hall field is reversed for positive I x' both Ux and e now being negative.

viently illustrated by considering a confined the electronic charge (4.8 X 10- 10 esu in
stream of free particles, each having a charge e Gaussian units ; 1.6 X 10- 19 coulomb in prac-
and an initial velocity Vx. A magnetic field in tical units), and its sign is negative for trans-
the z direction produces initially a deflection of port by electrons and positive for transport
charges along the y direction. This charge un- by holes (deficit electrons). The Hall coefficient
balance creates an electric field E y, and the is defined by the ratio Ey/IxHz , namely
process continues until the force on a moving
charge due to the Hall field Ey counter-balances RH =-Ey/IxHz = l/nec (Gaussian units) (3)
that due to the magnetic field so that further
particles of the same velocity and charge* are Thus a very simple relation exists in the free-
no 10l)ger deflected. A pictorial representation particle example between the Hall coefficient
of this is shown in Fig. 1. The magnitude of the and the charge-carrier density. It is also seen
Hall field follows at once from Eq. (I), with F = thatRH is negative for conduction by electrons,
o at steady state, namely positive for conduction by holes. Now the elec-
tric current density I x exists by virtue of an ap-
={l/c)uxHz =IxHz/nec (Gaussiant units)
*
Ey plied electric field Ex. With the Hall field pres-
ent, the resultant electric field in a solid lies at
(2) some angle (J to the x axis. This angle is called
In obtaining the last equality, the electric the Hall angle, namely
current density Ix was expressed in terms
of the density of charge carriers n by the prod-
() =- tan-lEy/Ex (4)
uct neux . For electronic (i.e., as opposed to Thus the Hall effect may be described as a rota-
ionic) conduction in solids, the magnitude of e is tion of the electric field. At zero Hall field, the
equipotential lines are perpendicular to J, but
*If the particles have a distribution of velocities, when the Hall field appears, they are oblique,
then it is only those particles having a certain "aver- so that a Hall voltage exists across the specimen
age" velocity, which are undeflected. This point is ex- in a direction normal to the current. The rota-
panded later. tion aspect is also brought out by considering
tin the Gaussian system, mechanical quantities are the components of the conductivity tensor,
in cgs units, electrical quantities in esu, and magnetic which relate electric current densities and fields.
fields in gauss or oersteds. In the practical system, For the boundary condition that Iz = 0, we
mechanical quantities are in cgs units, electrical quan- may write
tities in volts and coulombs, and magnetic fields in
gauss or oersteds. In all equations in this article, ex- Ix = uxxEx + uxyEy
cept for Eq. (1), conversion from the Gaussian system Iz = 0, H = Hz (5)
to the practical system is effected by replacing c by
unity and replacing H by H /1 0 8 , where these quanti- Iy =- UxyEx + UxxEy
ties explicitly occur. The practical units for R Hare
cm 3 /coulomb; those for mobility are cm 2/volt-sec. Equations (5) hold for media of sufficient sym-
The corresponding units in the Gaussian system metry§ that Uxx = UY.l' Uyx = - Uxy, and Uxz =
are cm 2-sec/esu and cm 2/statvolt-sec. If practical units Uyz = Uzx = uzy = O. The latter insures that Ez
are to be used in Eq. (1), to obtain the force in dynes
it is necessary not only to make the changes specified
above but also to include a factor of 10 7 on the
*Although the Hall effect can be discussed for any
constrained electron gas, we shall, for simplicity, talk
right-hand side. This results from the additional pro- about a solid conductor.
cedure of converting charge in coulombs to esu and § Isotropic media, e.g., or cubic systems with coordi-
the field terms in volts/cm to statvolts/cm. nate axes along cube axes.
HALL EFFECT AND RELATED PHENOMENA 540

vanish when Jz is zero. Since the boundary IlHH/c = tanO, or /JH =RHUC
conditions for Hall effect require that Jy (Gaussian units) (10)
vanish, Eqs. (4) and (5) yield the following re-
sult for the off-diagonal elements Uxy: With the use of Eqs. (8) and (9), it can be seen
that the ratio of Hall and conductivity mobilities
Uxy = Uxx tanO (6) is precisely the Hall coefficient factor. It follows
that the Hall angle is proportional to Il and H.
The inverse of the conductivity tensor is the When transport by two or more kinds of
resistivity tensor, which relates E to J. A general charge carriers occurs, Eq. (8) is not applicable,
definition of the Hall effect involves relating it and one must return to the general relations of
to the antisymmetric* components of the resis- Eq. (5). For conduction by electrons and holes
tivity tensor. This leads to the vector equation (ofrespective densities n, p), the weak-field Hall
for the Hall field, namely coefficient can be written in the form
EH = R H H X J (Gaussian units) (7) Ro = -(re!le2n - rhllh2p)/
The identity in the first part of Eq. (3) is recog-
nized as a special case of Eq. (7) [ Ie 1c(llen + Ilh p)2) (II)
Application to Real Solids In a real solid the
idealized free particle treatment no longer ap- where lel,/Je, etc., are positive. We note that the
plies and must be replaced by a theory that charge-carrier densities are now weighted by the
takes into account the distribution of velocities mobilities!le, Ilh. There are also Hall coefficient
and the interactions of the charge carriers with factors for each carrier. For arbitrary H, the
impurities, defects, and lattice thermal vibra- terms involve field-dependent factors.
tions of the solid (i.e., the scattering), as well as Analysis of Hall-effect data is one of the most
the band structure of the solid. The latter con- widely used techniques for studying conduction
sideration relates to the fact that the charge car- mechanisms in solids, especially semiconductors.
riers in the solid are not free but exist in a po- For the single-carrier case, one readily obtains
tential energy field having the periodicity of the carrier concentrations and mobilities, and it is
lattice. As a result of these constraints, only cer- usually of interest to study these as functions of
tain energy states, or bands, are allowed for the temperature. This can supply information on
charge carriers. In addition, the relationship be- the predominant charge-carrier scattering mech-
tween the energy and velocity of the carriers is anisms and on activation energies, i.e., the ener-
not the simple! mv 2 of the free electron, but is gies necessary to excite carriers from impurity
more complex. As an approximation, one fre- levels into the conduction band. Where two or
quently characterizes the charge carriers by an more carriers are present, the analysis is more
effective mass, m*. Taking into account most of complicated [cf. Eq. (II)], but much informa-
the complexities mentioned above, one still tion can be obtained froni studies of the temper-
ature and magnetic field dependencies.
obtains an expression similar to Eq. (3), namely Unlike, for example, the magnetoresistance,
RH = r/nec (Gaussian units) (8) the Hall effect is a first-order phenomenon. At
weak magnetic fields it depends linearly on H,
The Hall coefficient factor, r, actually depends and it does not vanish in isotropic solids if all
on the nature of the scattering, the band struc- the carriers have essentially the same velocity or
ture, the magnetic field strength, and on the if the scattering is characterized by a relaxation
statistics characterizing the distribution of veloc- time which is independent of the carrier energy.
ities of the carriers. Fortunately it depends The Hall effect forms the basis of a number of
weakly on these factors, and its value is usually devices used in isolating circuits, transducers,
within, say, 50 per cent of unity. multipliers, converters, rectifiers, and gauss-
An important attribute of charge carriers is meters (for measurement of magnetic fields). As
their mobility, Le., their drift velocity per unit mentioned earlier, the fundamental component
electric field. The conductivity mobility Il is re- of such devices is a slab of material (called a
lated to the conductivity u by "Hall element" or a "Hall generator") possess-
ing favorable Hall characteristics. Additional
Il = u/ne (9) devices are mentioned in the last section.
Experimental Determination A number of
It is also customary to define a Hall mobility IlH techniques are available, the most direct being
for conduction by a single type of charge carrier to measure, by means of a potentiometer or
by the relation other high-impedance device, the Hall voltage
VH across a parallelepiped in a direction normal
*These components are defined by the condition to both Hand J. The Hall coefficient follows
that Pik =-Pki, where the Pik [cf. Eq. (5») are de- from Eq. (3) or (7), with attention to the foot-
3 note to Eq. (2):
fined by Ei = L PikJk, 1 ==x, 2 ==y, 3 == Z.
k=1 RH = 1011 VHt/IH, (practical units) (12)
541 HALL EFFECT AND RELATED PHENOMENA

I
6
4
2

5
3

FIG. 2. Arrangement of contacts for measurement of Hall coefficient and related effects. Hall probes are 3 and
4. Probes 2 and 6 are for resistivity and magnetoresistance. Probes land 5 allow a check of the uniformity of the
specimen. For Hall effect and transverse magnetoresistance, the magnetic field is in the direction of the thickness
t; for longitudinal magneto resistance, it is along I. To avoid disturbances due to contact shorting, all probes
should be at a distance of at least 2w from the end contacts.

where t is the thickness of the specimen and I is none will be deflected since the Hall field
the total current. The arrangement is shown in cancels the v X H force. This is the case in an
Fig. 2. Since VH may be the order of microvolts, isotropic metal, where the velocity of all the
extreme care must be taken to avoid extraneous electrons is essentially the Fermi velocity. It is
voltages. An example is the misalignment volt- also true for electron scattering mechanisms de-
age, caused by probes 3 and 4 not being on an scribed by a relaxation time which does not
equipotential plane when H = O. This may be depend on energy. In these cases-assuming, of
eliminated by taking measurements for opposite course, a single type of carrier-there is no
directions of H and taking half the difference, magnetoresistance. If, however, there is a distri-
inasmuch as VH(H) is odd in H. Alternatively, bution of electron velocities-as in a semicon-
one can adjust the position of the Hall probes ductor-then it is clear that only those electrons
for a null reading with H = O. Other techniques of a certain "average" velocity will be unde-
involve use of resistances with sliding contacts flected. The remaining carriers, having velocities
suitably attached to the specimen. Other spu- either larger or smaller than the "average" will
rious voltages can arise from temperature gradi- be deflected and will traverse longer paths, thus
ents and resulting thermoelectric or thermo- increasing the resistance of the conductor. A
magnetic emf's (see following section). Some of similar situation obtains if more than one type
these can be eliminated by taking appropriate of carrier is present. It is also apparent that any
averages among measurements for reversed po- mechanism which shorts out the Hall voltage-
larities of I and direction of H. It is usually e.g., special geometry, shorting contacts, in-
desirable to maintain good thermal contact be- homogeneities in the specimen-will increase the
tween all points of the specimen and a constant- magnetoresistance.
temperature bath. Although it is possible to For the reasons discussed and the fact that
analyze adiabatic Hall data (no heat flow to or !:::.p/Po varies as H2 in weak fields, magnetore-
from the specimen during measurement), iso- sistance is a second-order effect. It tends to sat-
thermal data are preferred because of the sim- urate in strong magnetic fields, unless there is a
plicity of the equations. All of the relations in disturbance of the Hall field as mentioned
this article are for isothermal conditions. Errors above. Magnetoresistance is even in H, and it is
in measurement can also result from a shorting related to the symmetric components of the re-
of the Hall voltage, especially by the end con- sistivity tensor, i.e., those for which Pik = Pki
tacts. With regard to the latter, the error is es- [cf. footnote on p. 401]. It can be measured
sentially negligible if the length-to-wid th ratio of with the geometry shown in Fig. 2 by maintain-
the specimen is about 4 or more, and the Hall ing the current constant and determining the
probes are near the center. potential difference between probes I and 5 or
Related Effects A widely-studied galvano- 2 and 6 as a function of magnetic field. Mag-
magnetic effect is transverse magnetoresistance, netoresistance data can supply information on
usually written !:::.p/Po. This phenomenon can be charge-carrier scattering and band structure. In
illustrated by the pictorial scheme in Fig. I. If a the latter case, if anisotropy exists, it is useful
charge carrier is deflected by the Lorentz force to obtain data for different crystallographic di-
so as to traverse a longer path, it will contribute rections. There is also a longitudinal magneto-
less to the conductivity, and there will be a pos- resistance, measured when H is parall~l to J.
itive magnetoresistance. It was noted, however, This effect vanishes in isotropic solids, and its
that if all charge carriers have the same velocity, presence indicates anisotropy in scattering or
HALL EFFECT AND RELATED PHENOMENA 542

band structure, or inhomogeneities in the speci- in Semiconductors: Helicon and Alfven Waves," esp.
men. p. 30 ff., London, Taylor and Francis Ltd., 1971.
By shorting the Hall field or by choosing a Beer, A. C., "Galvanomagnetic Effects in Semicon-
disk geometry so that such a field does not ductors" (Supplement 4 to "Solid State Physics,"
exist, one obtains a "magnetoresistance" (more F. Seitz and D. Turnbull, Eds.), New York, Academic
strictly, a magnetoconductivity) which does not press, 1963.
saturate. This is called the Corbino magnetore- Chien, C. L., and Westgate, C. R. (Eds.), "The Hall
sistance or Corbino effect, after O. M. Corbino Effect and Its Applications" (Proc. Commemora-
who studied circulating secondary currents in a tive Symposium on the Hall Effect and its Applica-
"Corbino" disk carrying a primary radial elec- tions, Nov. 13, 1979), New York & London, Plenum
tric current in a magnetic field. Press, 1980.
There are a number of thermal effects in a Dunlap, W. Crawford, Jr., "An Introduction to Semi-
magnetic field, which can produce transverse conductors," New York, John Wiley & Sons, Inc.,
voltages or temperature gradients. These result 1957.
from the velocity separation of charge carriers Fritzsche, Hellmut, "Galvanomagnetic and Thermo-
by the Lorentz force-the energetic ones going magnetic Effects," in "Methods of Experimental
to one side, the slower ones going to the other. Physics" (1. Marton, Ed.), Vol. 6B, "Solid State
Temperature gradients are produced, and also Physics" (K. Lark-Horovitz and V! A. Johnson, Eds.),
electric fields. In the Righi-Leduc effect, a longi- p. 145, New York, Academic Press, 1959.
tudinal temperature gradient produces a trans- Hall, E. H., Amer. J. Math., 2, 287 (1879); A mer. J.
verse temperature gradient (thermal analog of Sci., 19, 200 (1880).
the Hall effect); in the Nernst effect, it produces Kuhrt, F., and Lippmann, H. 1., "Hallgeneratoren" (in
a transverse electric field. In the Ettingshausen German), Berlin, Springer-Verlag, 1968.
effect, a longitudinal electric current produces a Lindberg, 0., Proc. Inst. Radio Engrs., 40, 1414
transverse temperature gradient. This latter ef- (1952).
fect, if large, can lead to complications in the Putley, E. H., "The Hall Effect and Related Phenom-
measurement of the Hall field since the po- ena" ("Semi-Conductor Monographs," C. A. Ho·
tential probes and leads are seldom made of garth, Ed.), London, Butterworths, 1960.
the same material as the specimen. Therefore Rosa, R. J., "Magnetohydrodynamic Energy Conver-
the Ettingshausen temperature gradient can pro- sion," esp. p. 59 ff., New York, McGraw-Hill, 1968.
duce a thermoelectric voltage which adds to Seitz, Frederick, "The Modern Theory of Solids," New
the Hall voltage. York, McGraw-Hill, 1940.
Hall Effect Applications* The Hall effect Shercliff, J. A., "A Textbook of Magnetohydrody-
finds application in many practical devices. namics," esp. p. 18, Oxford, Pergamon Press, 1965.
These include a variety of contactless switches Shockley, William, "Electrons and Holes in Semicon-
such as solid-state keyboards, plunger-operated ductors," New York, Van Nostrand Reinhold, 1950.
switches, proximity switches, and vane switches. Swift-Hook, D. T., in "Direct Generation of Electric-
Other device applications include tachometers ity" (K. H. Spring, Ed.), pp. 143-155, New York,
and various other counting systems, and the Academic Press, 1965.
control of automobile ignition. For space appli-
cation, such Hall effect devices as sensors, Cross-references: CONDUCTIVITY, ELECTRICAL;
gaussmeters, magnetometers, brushless DC ELECTRIC POWER GENERATION; ELECTRICITY;
motors, and uses in connection with satellite MAGNETOMETRY; MAGNETO-FLUID-MECHAN-
magnetic-hysteresis damping and gravity-gra- ICS; POTENTIAL; SEMICONDUCTORS; THERMO-
dient stabilization at synchronous altitude are ELECTRICITY.
extremely important. Inasmuch as Hall elements
can be made of silicon, their processing is com-
patible with silicon integrated-circuit tech-
nology and a number of devices utilizing Hall HEALTH PHYSICS
effect in conjunction with IC functions are now Health physics is the profession that is con-
produced on silicon chips. cerned solely with the protection of man from
ALBERT C. BEER the damaging effects of radiation. t It attempts
to understand the action of radiation on man
and his environment, to establish appropriate
References limits for exposure to radiation, and to devise
appropriate methods for detection, measure-
Alfven, H., and Fiilthammar, C.-G., "Cosmical Electro- ment and control of radiation exposure. Al-
dynamics," esp. pp. 180-185, Oxford, Clarendon though this profession is relatively new, man's
Press, 1963. awareness and concern for the harmful effects
Baynham, A. c., and Boardman, A. D., "Plasma Effects of ionizing radiation are not of recent origin.
Perhaps the earliest record of damage to man

*The information presented in this section can be tThe Health Physics Society in 1971 extended the
found in the centennial symposium proceedings "The scope of health physics to include also non-ionizing
Hall Effect and Its Applications." electromagnetic radiations.
543 HEALTH PHYSICS

from iOnlzmg radiation dates back to about By far the most serious exposures were to
1500 when the high incidence of lung diseases young women engaged in the radium dial paint-
was recognized among the Schneeberg miners ing industry. Some of these women ingested
of Saxony and the Joachimsthal miners of Bo- relatively large quantities of radium as a conse-
hemia. In 1879 Herting and Hess performed the quence of tipping brushes with their lips as they
first autopsies on these miners and reported applied radium paint on the dials of clocks and
malignant growths in the lungs; however, the watches. The total number of radium dial paint-
cause of these malignancies was not understood ers and others who took radium by mouth or
until after 1896 when Roentgen first announced injection and, as a result, died with readily de-
his discovery of x-rays and Becquerel reported tectable symptoms of radiation damage is not
the discovery of radiation due to uranium. Even known. The first recorded fatality due to radium-
during the first year following the discovery induced cancer, resulting from exposure in the
of x- and 'Y-radiation, many things were learned radium dial industry, was in 1925, and since
about both the harmful and useful characteris- that time histories of over 1000 Ra cases have
tics of this new source of energy. Grub be, a been recorded and studied in the United States.
manufacturer of Crookes tubes in Chicago, In 1942-43 there was begun at the University
Illinois, was using his equipment to study the of Chicago a program to explore the possibility
fluorescence of chemicals even before the public of assembling a critical mass of natural uranium
press on January 4, 1896, heralded Roentgen's in such a way that a "pile" or NUCLEAR REAC-
discovery of x-rays. During January, 1896, he TOR could be operated for the production of
first noticed an erythema on the back of his the new element plutonium to be used in atomic
hand and later the formation of a blister with weapons. A. H. Compton, the director, and his
skin desquamation and epilation. His hand was associates debated the wisdom of proceeding
sufficiently painful that he sought medical aid with this project because they realized that in a
on January 27, 1896. Realizing from first-hand single large reactor there would be produced
experience the destructive power of x-rays ionizing radiation equivalent to that from thou-
Grubbe on January 29, 1896, treated a patient sands of tons of radium. Yet all the radium that
for carcinoma of the breast with his Crookes had been available to man throughout the world
tube. Not only was this treatment significant only amounted to about two pounds, and these
because it was one of the earliest - if not the men were acutely aware of the extensive suffer-
first-therapeutic uses of ionizing radiation, but ing and death that had resulted from its misuse.
it is noteworthy that he acted as one of the They decided to call together a rather unique
first health physicists when he used lead as a group of scientists to evaluate these health
shield to protect the rest of the body of the problems, to develop new instruments, equip-
patient. Even Becquerel and Madame Curie ment, and techniques and to establish radiation
learned from first-hand experience the need standards for the protection of nuclear energy
for radiation protection when they received workers. The group assembled was concerned
skin burns from the careless handling of radium with the health of the workers and consisted
(see RADIOACTIVITY). mostly of physicists; hence, they were called
S. Russ in 1915 made a comprehensive series health physicists. E. O. Wollan was the leader
of recommendations for radiation protection to and the other senior members were H. M.
the British Roentgen Society, and if these rec- Parker, C. C. Gamertsfelder, K. Z. Morgan, R. R.
ommendations had been heeded, many of the Coveyou, 1. C. Hart, L. A. Pardue, and O. G.
early radiation fatalities might have been averted. Landsverk. Thus in 1942-43 health physics had
It was not until 1928, when the International its beginning at the University of Chicago. Al-
Commission on Radiological Protection was though prior to this time many early pioneers
formed and published the first set of recom- such as S. Russ (England), L. S. Taylor (United
mendations for radiation protection, that there States), G. Failla (United States), A. Mutscheller
began to be widespread interest and concern for (Gennany) and R. Sievert (Sweden)-to name
this problem. In the following year the National only a few-had devoted considerable attention
Council on Radiation Protection was formed to the radiation protection problem, it was not
and it has set the standards in the United States. until the advent of health physics that a profes-
Beginning at the time of the First World War sional group was organized with this as its sole
and continuing until about 1930, there were objective.
many unfortunate exposures to radium. Some As the nuclear energy programs expanded,
of these were the result of therapeutic injec- large laboratories were established to carry on
tions of radium, the drinking of radium and the program at Oak Ridge, Los Alamos, Han-
radiothorium water, and occupational exposures ford, Berkeley, Brookhaven, Savannah River,
of radium chemists. During this period, radium etc., and as the need for health physicists was
was considered to be a useful therapeutic agent, recognized beyond nuclear energy programs and
and as a result it was administered by physicians in private industry, hospitals, military organiza-
in the United States to hundreds of patients. In tions, utility power companies, state and federal
some cases it was taken as a general tonic, and agencies of public health, and colleges and uni-
in others it was given as a curative agent for hy- versities, the profession of health physics grew
pertension, anemia, arthritis, and many other and expanded very rapidly so that today (1982)
human ailments-even for insanity. it is estimated there. are about 14 000 health
HEALTH PHYSICS 544

physicists in the world. In 1956 the Health unnecessary exposure may be permitted unless
Physics Society was organized which now has a it can be balanced by benefits of equal value.
membership of about 5000. This society pub- At the same time they must be made to respect
lishes the journal, Health Physics. In 1959 the ionizing radiation-not fear it.
American Board of Health Physics was formed The duties of the health physicist in applied
for the certification of persons whose technical operations are very diverse and differ consider-
competence and judgment qualify them to be ably from place to place, depending upon the
responsible for handling major problems of ra- size and nature of the operation. For example,
diation exposure and/or contamination control. the health physicist in a reactor operation would
The International Radiation Protection Associa- have duties quite different from those of the
tion was organized at the First International health physicist associated with an accelerator
Congress on Radiation Protection in Rome on program or the health physicist connected with
September 7, 1966, with K. Z. Morgan as its a state public health organization charged with
first president. In 1982 it included 28 affiliated the survey of medical x-ray equipment. A few
societies with 10 000 members living in 70 typical applied health physics activities may be
countries. summarized as follows:
There are three principal areas in which health (1) Aid in the selection of suitable locations
physicists are employed-education and training, for buildings in which radioactive materials are
applied activities, and research-and these three to be produced or used, and conduct pre-
areas will be discussed below. operation background surveys.
In the early period, health physicists were (2) Offer advice in the design of laboratories,
scientists who had to develop their own com- hoods, remote control equipment, radiation
petence during employment on the various shields, etc.
atomic energy programs. In 1948 AEC Health (3) Provide personnel monitoring meters for
Physics Fellowship programs were established radiation dosimetry to all persons subject to
at Oak Ridge National Laboratory in coopera- radiation exposure and read these meters fre-
tion with Vanderbilt University and the Univer- quently; make thyroid counts, breath measure-
sity of Rochester and later with other national ments, urine and feces analyses; check body
laboratories and universities. This program, over with scanners and total body counters and con-
a twenty-five-year period in the U.S., became duct other tests to aid in estimating how much
the principal source of senior health physicists. (if any) radioactive material is fixed in the
Unfortunately, during the period 1973-1976 body; and maintain accurate records of the ac-
the programs, which had expanded into some cumulated dose from each type of ionizing radi-
twenty universities, were phased out. Although ation received by each individual for his protec-
most of the early health physicists began as tion and for the protection of the employer.
physicists, there is today a need for health (4) Make frequent surveys of all accessible re-
physicists with many different backgrounds, actor areas, radioactive sources, x-ray equip-
e.g., physics, biology, chemistry, mathematics, ment, high voltage accelerators, chemistry and
engineering. In the AEC Health PhysiCS Fellow- physics laboratories, metallurgical shops, and
ship program the student satisfied the usual other working areas where radiation exposure is
Ph.D. requirements of courses, research, and possible.
thesis in one of these major departments, and (5) Advise scientists, supervisors and research
at the same time, he met additional require- directors of all radiation exposure hazards, of
ments such as special courses and summer work permissible working time in a given area, and of
at one of the National Laboratories where he radiation protection measures and techniques
was given practical experience in health physics. (e.g., protective clothing, shields, remote con-
Likewise, some of the graduate programs of trol equipment) and aid supervision in the solu-
the U.S. Public Health Service during this tion of new radiation problems as they develop.
same period provided training in health physics. (6) Make frequent surveys of all radioactive
In spite of the various opportunities for educa- waste discharged beyond the area of immediate
tion and training in health physics, the supply control, maintain accurate records of the level
has not kept pace with the demand because of of this radioactivity in the air, water, soil, vege-
the expanding nuclear industry and the increas- tation, milk, etc., and advise management of
ing uses of radiation in hospitals. remedial measures as they are needed.
In addition to the above-mentioned graduate (7) Aid in all emergency operations where
programs in health physics, there are education there are associated radiation hazards.
and training programs at all operating levels. (8) Purchase and maintain in working order
The on-the-job programs are important because and in proper calibration suitable health physics
the success of health physics can be measured, survey and monitoring instruments which are
to a considerable degree, in terms of how well used as aids in the protection of personnel from
plant managers, supervisors, scientists, engineers, radiation damage.
technicians and operations personnel are made (9) Prepare operations manuals on "Rules and
to realize their responsibility for protecting Procedures Governing Radiation Exposure" and
themselves and their associates from radiation assist in preparation of Environmental Impact
damage. They must be ever aware that to some Statements required of nuclear power plants.
degree all radiation exposure is harmful and no (l0) Assist in radiation protection problems
545 HEALTH PHYSICS

related to civil defense, emergency planning, are innumerable, complex energy exchanges
weapons fallout, space radiation, nuclear power that take place as the ENTROPY of the system
plants, etc. increases. An understanding of the many
(11) Interface with members of the public in low energy transitions is basic to a proper inter-
helping them to understand the risks of low pretation of the effects of ionizing radiation on
level exposure and the desirability of keeping man. Health physicists are working toward the
all exposures as low as reasonably achievable. ultimate goal of developing a coherent theory
(12) Many health physicists work as con- of radiation damage. Only when such a theory
sultants assisting radiation workers, atomic is available can they have complete understand-
veterans, medical patients, etc. by litigation to ing and confidence in the many extrapolations
receive compensation for alleged radiation to man of the effects of ionizing radiation on
damage. animals. Such information is essential in de-
Health physics research ranges from the ap- veloping reliable radiation protection standards
plied and engineering programs to very basic and measures that are enforced by the applied
studies. It is a working together of scientists of health physicist. Many health physicists are
many disciplines-physicists, chemists, biolo- working on programs to reduce unnecessary
gists, engineers, geologists, mathematicians-all medical diagnostic exposure, which at present
studying the effects of ionizing radiation on accounts for over 90 percent of exposure of the
man and on his environment. In these studies, United States population.
they are working at all levels-nuclear, atomic, The units used in health physics often result
molecular, plasma, gas, solid, liquid, cell, animal, in some confusion. In the early period units
and the ecosystem. In this research program, such as the threshold erythema dose, roent-
radiation ecologists are studying the effects of gen, and rep were used, but they have been
low levels of radiation exposure on the environ- phased out and replaced by the rad and the rem,
ment. Some essential organisms in the environ- and in some cases these two units are now being
ment are known to concentrate radioactive replaced by the gray and the sievert. The rad
waste by a factor of 100 000 or more, and the corresponds to the absorption of 100 ergs per
health physicist must determine the importance gram of medium and is the unit of absorbed
of the indirect damage of ionizing radiation to dose. The rem is the unit of dose equivalent.
man's environment as well as its direct effects. We have the following relation:
Internal dose studies are under way by research- Dose Equivalent (rem) = Absorbed Dose
ers in health physics, studies which have led to (rad) X Q X N in which Q is a qualtiy factor that
the publication of the official handbooks on corrects for LET (linear energy transfer) or the
maximum permissible concentration of the vari- density of ionization along the tracks of ioniz-
ous radio nuclides in food, water, and air. These ing particles. Thus we have Q = 1 for x-, gamma,
handbooks are issued by the National Council and beta radiations and Q = 20 for alpha radia-
on Radiation Protection and the International tion, recoil ions, and fast neutrons. The N is a
Commission on Radiological Protection. They biological correction factor which attempts to
are under constant revision by the health physi- correct for such things as nonuniform distribu-
cist as more reliable and detailed information tion of radionuclide deposition in body organs,
becomes available. Biologists in health physics the radiosensitivity of the irradiated tissue and
are studying the uptake distribution and elimi- its essentialness to proper body function. It is
nation of radionuclides which are taken into common to set N equal to:
animals and man by the several modes of intake
-ingestion, inhalation and skin penetration. for x- and gamma radiation
Engineers in health physics are exploring and for external exposure (radionuclide not in-
demonstrating new methods for the disposal of side body)
radioactive waste in deep wells thousands of for radium when it is the parent radio-
feet below the earth's surface in salt mines and nuclide
in a number of other geological formations. 1 for all tissue except bone
They are studying the seepage rates of radio- 5 for bone if internally deposited radio-
nuclides into various soil formations, its slow nuclide is not radium and the radiation is
dissipation from packages of radioactive waste not x- or gamma.
deposited on the ocean floor, and the dilu-
tion of airborne radioactive waste as it is dis- The gray corresponds simply to 100 rads and
charged from stack of nuclear power plants the sievert to 100 rem.
under varying meterological conditions. The The curie corresponds to an amount of a
physicists in health physics are making basic radionuclide which disintegrates at the rate of
studies of the various energy exchanges that 3.7 X 10 10 disintegrations per second. The
take place in matter when it is exposed to becquerel has been introduced to replace the
ionizing radiation. This information aids in the curie. It corresponds to a quantity of radio-
development of better radiation detection nuclide which disintegrates at the rate of 1 dis-
systems and leads to an understanding of the integration per second. There is considerable
true meaning and consequence of radiation opposition to the gray, sievert, and becquerel
exposure. When high energy radiation (in the units-especially in the U.S., where they are
MeV or keV region) strikes living matter, there seldom used.
HEALTH PHYSICS 546

There are many so-called maximum permis- (UV, visible, microwave, rf, infrared, sonic,
sible dose levels, but the one most commonly ultrasonic, and infrasonic), but here many new
used is the limit of 5 rem per year to the total problems are being faced and there are un-
body of a radiation worker. The present limits certainties. For example, the maximum per-
for internal dose in the U.S ., upon which the missible occupational exposure limit of rf and
present U.S. levels of maximum permissible microwave radiation is 1000 times h~her in
body burden of radionuclides and their maxi- the U.S. than in the USSR (10 mWjcm in the
mum permissible concentration in our water, U.S. and 10 p.Wjcm 2 in the USSR).
air, and food are based, are set at 5 rem per year There is disagreement and strong debate 3
to total body, gonads, and red bone marrow; among health physicists regarding the cancer
30 rem per year to bone, thyroid, and skin; risk from low-level exposure to ionizing radia-
and 15 rem per year to all other body tissue. tion. The common estimates of general cancer
Various maximum permissible dose levels are risk range between I X 10-4 to about 8 X 10-3
in use for exposure to members of the public, lethal cancers per person rem. This author 4
but perhaps the most generally applied levels believes one should not use a value smaller
are those of the Environmental Protection than 10-3 lethal cancers per person rem, or
Agency of the U.S. in reference to the uranium twice this value for total cancer incidence, until
fuel cycle (i .e. , nuclear power plants, reprocess- some important questions can be answered and
ing plants, etc.). These limits are 75 mrem per some of the biases in the present studies can be
year to the thyroid and 25 mrem per year to all evaluated.
other body organs and the whole body. Health physics continues to grow as a most
The health physicist makes use of many types interesting and challenging profession for scien-
of instruments to determine the dose of ionizing tists of many backgrounds. The success of these
radiation. For area survey measurements he programs is attested by the fact that ionizing
uses geiger counters, ion chambers, propor- radiation with unparalleled potential for radia-
tional counters, and scintillation counters. tion hazards has expanded in its use and appli-
Sometimes the instruments have sliding windows cations into almost every area of human en-
so that they can respond preferentially to alpha, deavor and yet the nuclear energy industry has
beta, gamma, and low-energy x-rays. Neutron become one of the safest of all industries.
doses are measured with proportional counters,
BF-3 counters, and threshold detectors (i .e., KARL Z. MORGAN
activation of radionuclides of indium, Pu-239,
U-238, Np-237, S-32, etc.) The Ge-Li detectors References
(lithium drifted germanium detectors) are
widely used to determine the energy of radio- 1. Morgan, K. Z., and Turner, J. E., "Principles of
nuclide emissions and thus identify them. When Radiation Protection," R. E. Krieger Pub. Co.,
the radioactive material is contained inside the 1973.
body, the body burden is determined by anal- 2. Morgan, K. Z., "Graduate Program for the Health
yses of urine and feces and by total body and Physicist in the United States," Health Phys. 11,
scan-counting, using large scintillators or Ge- 895 (1965).
Li detectors. Personnel monitoring used to be 3. Gofman, J. W., "Radiation and Human Health ,"
done by the use of pocket ion chambers and Sierra Club Books, San Francisco, 1981.
film badges. Today the pocket ion chambers 4. Morgan, K.\Z., "Hazards of Low-Level Radiation,"
are often replaced by direct-reading fiber in "Encyclopaedia Britannica," pp. 213-229, 1980.
dosimeters or small G-M counters which display
the dose rate and the integrated dose. The film Cross-references: MEDICAL PHYSICS, NUCLEAR
badges have for the most part been replaced by INSTRUMENTS, NUCLEAR RADIATION, NU-
TLD's (thermoluminescent dosimeters) for CLEAR REACTORS, RADIOACTIVITY, REACTOR
beta, gamma, and x-ray monitoring. In the past, SHIELDING.
neutron personnel monitoring was done mostly
by the use of thick nuclear emulsion photo-
graphic films . However, these are being replaced
by polycarbonate foils which are read with HEARING
optical instruments. This foil technique is a
big improvement over the photographic film, The role of the sense organ of hearing is to code
in which fading of the tracks is a serious prob- acoustic disturbances into neural signals suitable
blem (the half-life of the tracks ranges from 3 for transmission to the brain. The study of this
weeks to 6 months depending on the type of process necessarily involves anatomy and physi-
emulsion and the relative humidity). For ology of the ear, the nature of auditory path-
poly carbonate foils, track fading is almost ways and central nervous system activity in
nil and fast neutron detection sensitivity hearing, properties of acoustic signals that elicit
is about 1,000 times higher than that of photo- auditory responses, and observed phenomena of
graphic emulsions. auditory behavior. These aspects serve to define
Only a relatively few health physicists are and delineate areas for investigations of hearing
engaged in work with nonionizing radiations and are the topics of discussion for this article .
547 HEARING

In this approach to hearing, questions are asked is concerned with empirical testing of these
about the structure of the system, how the sys- models, with results that are in the direction of
tem functions, and the relationships between a more coherent understanding of hearing. In-
inputs and outputs of the system. These three vestigators often endeavor to relate many phe-
kinds of data-morphological, physiological, and nomena of hearing to a particular model. An
psychological-need to be compared and cor- example is the critical-band model, a concept
related for a full understanding of hearing. It is of a filterlike mechanism early in the auditory
important, however, that these three frames of system that affects various auditory functions
reference be kept separate and not be confused. in a similar manner. Frequency analysis, mask-
Although phy'siological functions may corre- ing, pattern perception, and changes in hearing
spond in a general way to anatomical sequences, with aging are but a few of the features of hear-
several physiological functions may occur in the ing that can be related to a critical-band filter
same anatomical structure or a single function model.
may require several anatomical units. In a sim- When the structure of the ear is examined, it is
ilar manner, psychological functions cannot convenient to consider the external, middle, and
usually be identified with specific physiological inner ear separately . A cutaway drawing of the
functions, and it is recognized that the central ear is shown in Fig. I. From a functional point of
nervous system, as well as the auditory system, view, the ear may be divided into an outer and
is involved in any auditory response. The corre- inner part. The outer is concerned with the trans-
lations between and knowledge about structure formation of acoustic energy into mechanical
and functions are best developed for peripheral, energy and the inner with the transduction of
rather than central, parts of the auditory system mechanical energy into neural impulses. The
because the ear is more accessible for examina- auricle and external auditory meatus constitute
tion and study than are the more central parts the external ear. The meatus is an irregularly
of the auditory system. shaped tube approximately 27 mm long with a
Traditional theories of hearing have been diameter of about 7 mm, terminated by the
largely concerned with pitch perception. A com- tympanic membrane. The outer ear serves as an
prehensive theory does not exist that can ac- acoustic resonator to produce a gain in acoustic
count for the complexities of the processing of energy of a bout 10 to 15 decibels in the fre-
speech or music by the auditory system. How- quency range from 2000 to 5000 Hz. This is a
ever, there are a number of excellent models com bined effect of the concha (the area at the
that depict various physiological, sensory, and entrance to the ear canal) and of the ear canal
perceptual functions of hearing. Much research itself, each having a separate resonance. The

Auricle

Semicircula r

Eustachian Tube

FIG. 1. A cutaway drawing of the ear.


HEARING 548

former has a resonance at about 5000 Hz, the


latter a resonance at slightly above 200 Hz. The
ear drum is in a protected position at the end
of the canal, and humidity and temperature
conditions at the drum are relatively indepen-
dent of those external to the ear.
The middle ear is an irregular, air-filled space
in the petrous portion of the temporal bone.
The three ossicles of the middle ear-the mal-
leus, the incus, and the stapes-provide mechan-
ical linkage between the tympanic membrane
and the fenestra vestuli, an opening in the vesti-
bule of the inner ear commonly known as the
oval window. The handle of the malleus attaches
to the tympanic membrane, and the footplate
of the stapes attaches to the oval window. The
important function of the middle ear is imped- FIG. 2. Cross section of cochlea.
ance matching and transformation of energy of
air vibrations in the ear canal to fluid in the membrane and is bound below by the basilar
cochlea. This efficiency of sound transmission membrane. The cochlear duct is filled with a
from one medium to another is accomplished viscous fluid called endolymph, and the duct is
primarily because the area of the tympanic surrounded by a fluid called perilymph that has
membrane responding to vibrations of air is about twice the viscosity of water. The scala
considerably larger than the area of the foot- vestibuli and the scala tympani join at the apical
plate of the stapes at the oval window to the end of the cochlea at a passage called the helico-
inner ear. The lever action of the ossicles con- trema. The scala tympani terminates at the basal
tributes to the efficiency of this transformation end at the round window, a membrane-covered
to a lesser extent than the area ratios of the two opening into the middle ear. The scala vestibuli
membranes. The amplification gain due to the is continuous with the vestibule; the oval win-
middle ear is approximately 30 dB. The effec- dow opens into the vestibule. Vibrations at the
tiveness of the middle ear action in increasing footplace of the stapes are transmitted into the
hearing sensitivity is evidenced in middle ear fluid adjacent to the oval window. Vibration of
pathologies where the ossicular chain is dis- the stapes and resultant disturbances in cochlear
rupted. A hearing loss on the order of 25-40 dB fluids results in movement of the basilar mem-
occurs. The second function of the middle ear, brane. The cochlear duct contains the sensory
that of protecting the inner ear from loud receptors, specifically the organ of Corti, which
sounds, is accomplished by reflex action of the lies upon the basilar membrane. There are about
middle ear musculature, the stapedius, and the 12 000 hair cells; one end of each rests on the
tensor tympani. The action of the muscles is to basilar membrane. The other ends of the hair
draw the stapes away from the oval window, re- cells are the cilia, very fine hairline processes,
tract the ear drum, and change ossicle vibrations which make contact with the tectorial mem-
in such a way as to decrease the transmitted brane, a membrane that overlaps the organ of
pressure. The stapedius may be more responsive Corti and that functionally behaves as if it were
to accoustic stimulation than the tensor tym- hinged at the cochlear wall. There are three rows
pani. Latency of muscle contraction and pos- of outer, and one row of inner, hair cells along
sible muscle fatigue limit protection of the inner most of the length of the basilar membrane.
ear by these mechanisms. Middle ear air pressure When vibrations are introduced into the inner
is equalized by virtue of the eustachian tube ear and cause displacement of the basilar mem-
which connects the middle ear and the naso- brane a shearing action of the cilia occurs that
pharynx. The pressure equalization is necessary results in neural activity. It is assumed that
for normal ear drum movement. amplification occurs in the inner ear in that
The inner ear is a system of cavities in the small pressures on the basilar membrane result
dense petrous portion of the temporal bone. One in a shearing force of considerably greater mag-
of the cavities is the cochlea, a bony labyrinth nitude that distorts the hair cells. The result is
that is approximately 35 mm in length coiled increased sensitivity of the hearing system. Phys-
around a central core for two and three-quarters ical properties of the cochlea are such that
turns. The spiral-shaped cochlea is divided into different frequencies tend to localize at differ-
three ducts, two bony and one membranous. ent points along the basilar membrane. The
A cross section of cochlea showing the three basilar membrane is narrowest and stiffest at
ducts is shown in Fig. 2. The upper bony duct, the basal end, and most lax and widest at
the scala vestibuli and the lower bony duct, the the apical end of the cochlea. High-frequency
scala tympani, are separated from each other by sounds result in the greatest disturbances near
a membranous labyrinth, the cochlear duct. The the basal end, and low-frequency sounds tend
cochlear duct is bound on top by Reissner's to localize near the apical end. When the role
549 HEARING

of the cochlea in pitch and loudness analyses


is considered, it is now realized that more is in-
volved in pitch perception than the place of
localizat~on on the basilar membrane, although
the partIcular neural fibers involved are prob-
ably relevant. Of particular interest in this re-
gard is the phenomenon called low pitch where emporal
there is no energy at the frequency that corre- Lobe
sponds to the perceived pitch. Our impression
that we are hearing a fundamental pitch for a
person talking over a telephone is a good ex-
ample. Telephone circuits do not transmit fre-
quenc~es . in the fundamental pitch range so
there IS, m fact, no energy in this range yet an
experience of pitch is experienced by the lis-
tener. Loudness is probably related to the total
number of neural impulses per unit time.
The auditory pathways provide for the neural
impulses from the ear to be transmitted to the
cerebral centers of the auditory cortex. Process-
ing of the neural signals probably occurs at hY..,.,....,___ Dorsal
& Ventral
synaptic connections as well as in the cortex. Cochlear Nuclei
The cell bodies of the receptor neurons are
located in the spiral ganglion. Neurons of the
auditory nerve make synaptic connections with
the hair cells of the cochlea. Nerve fibers typi-
cally inneryate many hair cells, and more than
one nerve fIber may make a connection with the FIG. 3. Schematic depicting auditory pathways.
same hair cell. There is evidence to indicate
that there are also descending neural pathways
as well as ascending ones. The central nervous The stimulus for hearing is sound. Sound
syst~m may thus be involved in auditory pro-
involves a disturbance in the air that is a forward
cessmg at the cochlea. Spiral ganglion axons and backward, rarefaction and compression
make synaptic connections with cells of the cen- movement of air particles. The unit of forc~
usu~y used in acoustics is the dyne. Sound pres-
tral nervous system at the cochlear nucleus. At
this point, there is interconnection between the sure IS frequently expressed in dynes per square
pathways for the two ears. Other synaptic centimeter. Intensities of sounds are usually
stations between this point and the auditory measured on a decibel scale, a logarithmic ratio
cortex include the inferior colliculus and the scale. The tremendous loudness range of the ear
medial ~eniculate body. A schematic depicting is exemplified by the fact that the most intense
the audItory pathways is shown in Fig. 3. It can sound that can be tolerated is a million million
be observed that the major projection is to the times greater in intensity than a sound that is
contralateral (opposite) cerebral hemisphere for just audible. This is a range of approximately
each ear. That is, the right ear projects primarily 120 dB. The frequency range of hearing is fre-
to the left hemisphere and the left ear to the quently given as 16 to 20 000 Hz. The ear is most
right hemisphere. Some evidence has been pre- sensitive in the middle frequency range of 1000
sented that ~or right-handed people, the left to 6000 Hz. In terms of discrimination of fre-
cerebral hemIsphere processes speech and the quency and intensity, it is possible for about
rig~t hemisphere processes non-speech signals.
1400 pitches and 280 intensity levels to be
EVldence from pathological auditory systems is distinguished.
of particular interest with respect to the audi- The truly phenomenal aspects of hearing can
tory pathways. In fact, one excellent way to be observed in such behavior as localization of
understand the auditory system is to study the sounds, speech perception and particularly the
system when it is in a breakdown or patholog- understanding of one voice in the noisy environ-
ical state. An impaired cochlea, for example, ment of many, and the recognition of acoustic
may result in a better than normal response to events that only last a few milliseconds. It is
small amplitude changes in a sound. A lesion of these and other behavioral phenomena that need
the VIIIth Nerve is frequently manifested by a to be accounted for in theories of hearing.
rapid decrease in the ability to respond under ROBERT W. PETERS
sustained stimulation. The ability to process
speech is markedly affected when there is an in- References
volvement of the lower central nervous system. Books
Cortical involvement does not affect usual Bekesy, Georg yon, "Experiments in Hearing," New
speech or pure tone inputs. York, McGraw·Hill Book Co., 1960.
HEARING 550

Rasmussen, Grant, Ed., and Windle, William F., "Neural previously Centigrade) and 100° C are assigned
Mechanisms of the Auditory and Vestibular Sys- to the fixed points.
tems," Springfield, Charles C. Thomas, 1960. In the modern metric Systeme International
Stevens, S. S., and Davis, Hallowell, "Hearing: Its d'Unites (abbreviated SI in all languages) tem-
Psychology and Physiology," New York, John Wiley perature is measured in units of Kelvin (the
& Sons, 1947. word degree is dropped). The triple point of
Green, D. M., "An Introduction to Hearing," New pure water is assigned the value of 273.16 K
Jersey, Lawrence Erlbaum, 1976. (corresponding to O.OloC) and a temperature
Moore, B. C. J., "The Psychology of Hearing," Sec- interval of one Kelvin is the same as one degree
ond Edition, New York, Academic Press, 1982. Celsius.
Pickles, J. 0., "An Introduction to the Physiology of When comparing the definitions of heat and
Hearing," New York, Academic Press, 1982. temperature, it is possible to encounter a cir-
Schubert, E. D., "Hearing: Its Function and Dysfunc- cular argument. Por example, heat may be
tion," in "Disorders of Human Communication, Vol. defined to be the flow of energy due to a tem-
1 (G. E. Arnold, F. Winckel, and B. D. Wyke, Eds.), perature difference; and temperature may be
Vienna and New York, Springer-Verlag, 1980. defined as a property of an object which de-
Tobias, J. V. (Ed.), "Foundations of Modern Auditory termines the direction of heat flow. The cir-
Theory," New York, Academic Press, Volume I, cularity in reasoning will be broken later in this
1970; Volume 11,1972. article when heat will be shown under certain
conditions to be equivalent to work, which can
Periodicals be independently defined and measured.
The Journal of the Acoustic Society of America. Measurement of Heat The calorimetric def-
inition of heat was established by Joseph Black
Cross-references: ACOUSTICS;
ARCHITECTURAL (1728-1799) during the eighteenth century.
ACOUSTICS; MUSICAL SOUND; NOISE, ACOUS- Equal masses of water, initially at different
TICAL; PHYSICAL ACOUSTICS; REPRODUCTION temperatures, were mixed in a container called
OF SOUND. a calorimeter, which was insulated from its sur-
roundings. It was observed that the final tem-
perature of the mixture was always exactly half-
HEAT way between the two initial temperatures (if
the effects of the calorimeter and the thermom-
Heat and Temperature Heat is (internal) en- eter were negligible). The temperature changes
ergy in transit due to a decrease in tempera- t:..T of the hot and cold masses of water were
ture between the source from which the energy equal and opposite,
is coming and the sink toward which the energy
is going. When heat is added to a system, the t:..Th = -t:..Tc.
result may be either to raise the temperature or
to cause a thermal transformation (e.g., melt- If different masses of water were mixed, the
ing, boiling, expansion, increase in resistance). temperature changes were observed to be in-
Conversely, the removal of heat from a system versely proportional to the respective masses,
may either lower the temperature or cause
freezing, condensation, contraction, decrease in
resistance, etc.
The concept of temperature begins with the
physiological sensation of hotness and coldness. or
The hotter an object feels, the higher is the
temperature, and vice versa. However, the sen- (1)
sation of temperature is easily influenced by
the thermal condition of the skin and the ther- This experiment suggested thinking about
mal conductivity of the object being touched. something lost by the hot mass of water and
Thus the concept of temperature must be quan- gained in equal amount by the cold mass of
tified using inanimate instruments that re- water; or, in other words, that there was a trans-
producibly measure ranges of temperature over fer of heat. Until the middle of the 19th Cen-
a continuous scale. tury heat was assumed to be a material fluid
Many different types of thermometers are that could be neither created nor destroyed, ca-
available, but the two most common are the pable of flowing between atoms and molecules,
colored-alcohol-in-glass and the coiled bimetallic and attracted by some materials and repelled by
strip. Historically the normal freezing and nor- others. Antoine Lavoisier (1743-1794) gave the
mal boiling points of pure water were chosen as fluid principle of heat the name caloric theory.
the two fixed points of the principal tempera- Equation (1) was shown to be valid only
ture scales. In the English system of units the when two samples of water were mixed at ini-
values of 32°p (P for Fahrenheit) and 212°p tially different temperatures, but failed to pre-
are assigned to the fixed points. In the practical dict the correct temperature changes when
metric system the values of O°C (C for Celsius, water and another substance were mixed. Thus
551 HEAT

Eq. (1) was modified to the general case: material fluid that flowed without limitation.
Rumford proposed instead that heat was a form
CA mAATA = -CwmwATw, (2) of motion, but was unable to explain the na-
ture of the motion.
where A stands for a substance with a higher The attack on the caloric theory continued
temperature mixed with water w, and C is a in the 1840s, especially by the quantitative ex-
property of the material known as the specific periments of James Prescott Joule (1818-
heat capacity (usually shortened to specific 1889). In several different experiments Joule
heat). The left-hand term in Eq. (2) is the heat allowed falling objects to perform work on a
QA transferred to or from substance A, and the sample of water that consequently experienced
right-hand term is the corresponding amount of a rise in temperature. Heating one pound of
heat Qw transferred from or to the water. water through 1°F he was able to determine
More accurate observations showed that the that I Btu of heat was produced by 772 foot-
value of C for any particular substance was dif- pounds of work, which is sometimes called the
ferent and depended on the physical circum- mechanical equivalent of heat. In other words,
stances under which the final temperature was energy could be converted from one form to
obtained. Gases, for example, showed a smaller another, an idea that would be incorporated
value of C if the temperature changes occurred into the formulation of the law of conservation
at constant volume than if the temperature of energy.
changes occurred at constant pressure. Thus Recognition that heat is another form of en-
there needed to be a Cv for specific heat at con- ergy means that the mechanical equivalent of
stant volume and Cp for specific heat at con- heat is simply a conversion factor between dif-
stan t pressure. ferent units of measurement. The currently ac-
It was thought to be necessary to define a cepted conversion factors are: I Btu is equiv-
special unit in which calorimetric heat would be alent to 778.28 foot-pounds and I calorie
measured. Water was selected as the standard (1SoC) is equivalent to 4.1858 joules.
substance and Cp given the numerical value of In order to discuss the modern definition of
unity. In the English system the British thermal heat based on Joule's experiments it is neces-
unit (or Btu) is the heat needed to raise the sary to specify the physical properties of the
temperature of the mass associated with one walls surrounding the system being investigated.
pound of water from 60° F to 61 ° F at one at- When an adiabatic wall is used to separate two
mosphere of pressure. systems A and B, the macroscopic properties of
In the pre-SI metric system the calorie was one system are found to be unaffected by those
defined as the heat required to raise the tem- of the other. For example, if a thick asbestos or
perature of a mass of one gram of water from styrofoam wall separates a gas from a wire, the
14.SoC to IS .SoC at one atmosphere of pres- properties of the gas (pressure, volume, temper-
sure. A large calorie or kilocalorie (Calorie ature) have no influence on the properties of
written with a capital C) corresponds to 1000 the wire (tension, length, resistivity). In other
ordinary calories and is used almost exclusively words, an adiabatic wall is a perfect insulator
in dietetics. In the modern SI metric system to heat.
there is no unique unit for heat. Instead heat is If, however, the two systems are separated by
recognized as a form of energy and is measured a diathermic wall, such as a thin sheet of
in joules, as are all forms of energy. copper, the properties of the two systems will
Heat and Work The caloric theory was change spontaneously, Ulitil an equilibrium
widely accepted toward the end of the 18th state of the combined system is reached, be-
Century because of its simplicity and many suc- cause a diathermic wall is a perfect conductor
cessful applications. The first really conclusive of heat. The two systems are then said to be in
evidence that heat could not be a material fluid thermal equilibrium, which means they have
was given by Benjamin Thompson (1753-1814), the same temperature. Thus, temperature may
born in Woburn, Massachusetts, who left the also be defined as the property that determines
United States because of royalist sympathies whether or not two systems are in thermal
during the American Revolution and later be- equilibrium.
came Count Rumford in Bavaria. In 1787, using Consider next the concept of work. If there
a very delicate balance, he determined that exists a macroscopic force F, undergoing a
there was no detectable difference in the weight macroscopic displacement ds in a direction
of an object whether hot or cold, which raised parallel to the force, the work dW is defined as
doubts about the material nature of the caloric Fds. Each simple system has its own expression
fluid. for work, such as -PdV for a hydrostatic
In 1798 Rumford observed the temperature system, Tdl for a wire, fJ.oHdM for a magnetic
rise in brass chips produced during the boring system, etc. Simple calculations show that the
of a cannon. The heat generated by friction be- work in a finite process depends on the external
tween a moving boring tool and the cannon processes (so-called path) used to change the
appeared to be inexhaustible. Obviously the system from its initial state to its final state.
caloric theory of heat could not be based on a There is no such thing as the "work in a sys-
HEAT 552

Diathermic walls at
Adiabatic wall the same temperature

--
F
f

(a) (b)
FIG. 1. The behavior of the system surrounded by
adiabatic walls in (a) is the same as that whose dia-
thermic walls and surrounding enclosure are at the
same temperature in (b).

tern." The work is not a function of properties


(so-called coordinates) of a system. In mathe-
matical terms work is an inexact differential
and the element of work may be written lfW.
In Fig. I (a) there are shown two ways in
which work may be performed on a system sur-
rounded by adiabatic walls. A force may cause
th~ piston to move some distance; and a falling Volume
object may rotate an electric generator that
provides a current in a resistor embedded in the FIG. 2. The adiabatic work done on the system is
system and included as part of the system. Con- the same whether going by path I or path II from the
sider two arbitrarily chosen states of the sys- initial state i to the final state f
tem, i and f (initial and final). It is possible
using the apparatus in Fig. 1(a) to proceed from
i to f along many different paths. One process ~ith a~ adiabatic 'Yall. This temperature equality
might be to move the piston in and then main- ~s achIeved expenmentally by noting the read-
tain the current until the final state is achieved. mgs of thermocouples whose junctions touch
Such a process is shown by path I in Fig. 2. An- the inner wall enclosing the system and the
other process might be to perform the electrical outer wall surrounding the inner wall. When the
work first and the work of compression second ther;ffi0couple readings differ, energy is provided
as shown by path II in Fig. 2. The total work i~ to eIther the inner wall or the outer wall.
proceeding from state i to state f is the same for Using this equipment, it is possible to mea-
each path. In other words adiabatic work W d sure the work Wad when the system is brought
is independent of the path' and depends only gn from state i to state f adiabatically (by holding
bo~h dia~hermic walls at the same temperature).
the initial and final states of a system.
Interestingly enough, it seems that experi- It IS agam observed that Wad is independent of
ments of this type may never have been per- the path. The consequences of independence of
formed. There was a rapid and universal ac- path may now be derived. In mechanics the
ceptan.ce of the law of conservation of energy w?rk need.ed. to move an object slowly and
followmg the work of Joule and especially WIthout fnctIOn from one height to a higher
Hermann von Helmholtz (1821-1894). It is ~levation is independent of the path. Thus one
an accepted fact, indeed a law of nature known IS lead to the concept of gravitational potential
energy. Similarly, since the adiabatic work in
as .the first law of thermodynamics (in a re- bringing the system in Fig. 1(b) from state i to
stncted case), that adiabatic work is indepen-
dent of path. This statement is established be- state f is also independent of the path then
yond any reasonable doubt, however, because of themust
there exist a function U of the coordinates
system such that
all the consequences of the first law of thermo-
dyanics have been so well verified by direct ex- /1U= Ur Ui = Wad. (3)
perimental evidence.
It is ~n e.xperimental fact that the apparatus The function U might justifiably be called the
shown 10 FIg. l(b), in which the adiabatic wall "adi~batic work function," but instead is called
is replaced by two rigid diathermic walls, whose t~e mternal. energy function. Notice that if posi-
temperatures are maintained equal at all times tive work IS done on the system its internal
is in all respects equivalent to the apparatu~ energy is increased, a sign conventio~ universally
553 HEAT CAPACITY

used in physics. In some older discussions of we can only speak of the internal energy of the
the first law of thermodynamics there is a system. It is impossible to subdivide the in-
minus sign on the left in Eq. (3) and work is ternal energy into a mechanical and a thermal
negative when performed on a system. The part, just as it is impossible to subdivide the
latter sign convention followed the engineer- water in a lake into parts resulting from (1)
ing practice of defining work to be positive flow from a stream, (2) rain, (3) springs, etc.
when a machine (the system) operates on its
surroundings. RICHARD H. DITTMAN
Suppose now that the inner wall of the ap-
paratus in Fig. I(b) is now deliberately set at Reference
a different temperature from the outer wall.
Zemansky, Mark W., and Dittman, Richard H., "Heat
While the temperature difference is maintained, and Thermodynamics," 6th Edition, New York,
the system is again brought from state i to state McGraw-Hill Book Company, 1981.
f and the work W is measured. This work would
be found to be different from Wad, that is, dif- Cross-references: CALORIMETRY; HEAT CAPAC-
ferent from Vf- Vj. Since it is undesirable to ITY; KINETIC THEORY; TEMPERATURE AND
give up the law of the conservation of energy, THERMOMETRY; THERMODYNAMICS; WORK,
the conclusion is inescapable that the difference POWER, AND ENERGY.
between Wand Wad (or Wand Vf - Vj) is due
to another form of energy that must have en-
tered or left the system by virtue of the temper-
ature difference between the system and its HEAT CAPACITY
surroundings. This energy flow, which is in-
ternal energy moving between the system and The heat capacity of any thermodynamic sys-
its surroundings, is none other than the quan- tem is
tity that is called "heat."
We see that the old calorimetric definition and oq
the modern thermodynamic definition of heat, c=-
dT'
(1)
as energy transferred by virtue of a temperature
difference, are in agreement. However, any cir- where oq is the quantity of heat* required to
cular reasoning involving definitions of heat and produce the temperature increment dT. If the
temperature is now broken; heat can be inde- system consists of a single substance with defi-
pendently defined by measurements in terms of nite chemical composition and physical state, C
work performed on a system surrounded by is proportional to the total quantity of matter
different walls. Denoting the heat that has en- in the system. The heat capacity of 1 gram is
tered or left the system again by the symbol Q, the specific heat, and the heat capacity of one
we get mole is sometimes called the molar heat capac-
ity. Commonly used units of heat capacity in-
Q: Wad - W: Vr Vi - w, (4) clude cal mole- 1 deg- 1 , cal g-l deg- 1 , and J
mole- 1 deg- 1 •
which is the mathematical formulation of the The heat capacity of a substance depends on
first law of thermodynamics. the variables that determine its thermodynamic
Rewriting Eq. (4) in the form
state-e.g., temperature, pressure, electric and
Vr Vj: W+Q (5) magnetic field-and also on the constraints im-
posed during the absorption of heat. Here we
shows that Wand Q are methods of producing limit consideration to a substance subject to a
a change in internal energy. W is the mechanical hydrostatic pressure P and to no other forces.
method of energy transfer, and Q is the non- In this case, the thermodynamic state is deter-
mechanical or thermal method. Together they mined by two variables, usually taken to be
constitute the two thermodynamic methods in temperature and either pressure or molar vol-
which V may be changed. ume V. The usual conditions for which C is of
Work and heat are processes of energy trans- interest are constant volume and constant pres-
fer. When all flow is completed, the words sure, and the symbols Cv and CP are used for
"work' and "heat" have no longer any useful- the heat capacities measured under these
ness or meaning. It would be just as incorrect to conditions.
refer to the "heat in a system," as it would be The difference between Cp and Cv is related
to speak of the "work in a system." The per- to other thermodynamic properties by the first
formance of work and the flow of heat are law of thermodynamics, which, for processes of
methods whereby the internal energy of a sys-
tem is changed. As a result of these processes, *The symbol {; indicates a differential that is not
the internal energy of the system has either in- exact, i.e., one for which a line integral is not deter-
creased, decreased, or remained constant. Once mined by the initial and final points alone, but also
the processes of transferring energy are over, depends on the path.
HEAT CAPACITY 554

interest here, may be written and solids to the energies associated with differ-
ent degrees of freedom of the constituent par-
l}q = dU + P dV = dH - V dP, (2) ticles. The heat capacity of a liquid is more
where U is the energy and H = U + PV is the
complicated and is not considered here.
enthalpy. (We write this and all subsequent ~x­ Ideal Gases An ideal gas obeys the equation
of state, PV = RT = NkT. The gas constant R is
pressions involving thermodynamic properties 8.314 J mole- 1 K-l, N is Avogadro's number,
for one mole.) From Eqs. (1) and (2), we have and k = R/N is Boltzmann's constant. By Eqs.
(7), (8) and (9), Cp - Cy = R.
Cy = (au)
aTp
(3) In the application of statistical mechanics to
the calculation of CP or Cy it is convenient to
consider first the predictions based on classical
and principles. The pertinent result of classical sta-

Cp = (~~)p + p(~~)p = (~~)p. (4)


tistical mechanics is the principle of equiparti-
tion of energy: any term in the energy of a
particle proportional to the square of either a
coordinate or a momentum contributes an aver-
Subtraction of Eq. (3) from Eq. (4) and substi- age of! kT to U. Since the molecules of an ideal
tution of the mathematical relation gas have three translational degrees of freedom,
each with an associated kinetic energy propor-
(~~)p = (~~)y + (~~)T (~i)p (5)
tional to the square of a momentum, the ex-
pected translational contributions to the energy
and heat capacity are U =!NkT =!RT and, by
gives Eq. (3), Cy = ~R. For monatomic gases this is
the only contn'bution to Cy and this value is in
Cp - Cy = {(~~)T +p} (~i)p· (6) excellent agreement with experiment.
Diatomic molecules can rotate about each of
two independent axes perpendicular to the in-
The quantity of heat required to produce a cer- ternuclear axis, and the two atoms can vibrate
tain temperature increase dT is greater at con- with respect to each other along the internu-
stant pressure than at constant volume by the clear axis. Each of the two rotational degrees of
sum of the work of expansion in the constant freedom has a kinetic energy proportional to
pressure process, p(a V/aT)p dT, and the in- the square of a momentum, and the vibrational
crease in internal energy accompanying the ex- degree of freedom has both a kinetic energy
pansion, (au/a (a
V)T v/anp dT. proportional to the square of a momentum and
An expression for Cp - Cy that is more use- a potential energy proportional to the square of
ful than Eq. (6) is a coordinate (the vibration is approximately
Cp -Cy TVa2/K,
= (7) harmonic). In classical statistical mechanics, Cy
is therefore expected to be ~R, but for most
which can be obtained from Eq. (6) by intro- diatomic gases at room temperature Cy is close
ducing a relation based on the second law of to ;R. This discrepancy, one of the historically
thermodynamics. important failures of classical theory, is resolved
by proper consideration of quantum effects as
suggested by Einstein. The allowed energy lev-
(~~)T + P= T(~i)Y' els of a harmonic oscillator are not continuous
as in classical mechanics, but are given by
the mathematical relation En = nhv, with n = 0, 1,2,3,···,
where h is Planck's constant, v is the natural
frequency of the oscillator, and the energies
En are measured from the lowest level. For the
and the definition of the coefficients of thermal average energy of a single oscillator, quantum
expansion a and isothermal compressibility K, statistical mechanics gives

a=..!.(av)
V aTp
, (8)
_
E=
hv
e hv/ kT -

and Instead of the classical value k, the contribution


to Cy is
= -~(~~)T· (9)
(hV)2 ekv/kT
K

The subject of the following sections is the de =k kT (10)


relationship of the heat capacities of ideal gases dT (ehv/kT - 1)2
555 HEAT CAPACITY

This contribution to Cv is negligible for T ..;; empirical rule of Dulong and Petit: for most
0.1 hv/k because most of the oscillators remain monatomic solids Cv is approximately 3R at
in the n = 0 level, but it increases rapidly near room temperature. However, this value is often
T ~ !hv/k and approaches the classical value exceeded at very high temperatures, partly as a
for T > hv/k. For many diatomic molecules, consequence of anharmonicity in the inter-
hv/k is a few thousand degrees Kelvin or more, atomic forces. At low temperatures , Eq. (11) is
and a room-temperature Cv of ~R can be un- in good agreement with exreriment; Cv is
derstood as the sum of the translational and ro- found to be proportional to T , although often
tational contributions. The details of the tem- only at temperatures below 8 0 /100, and 8 0
perature dependence predicted by Eq . (10), is given accurately by the sound velocities. This
including the approach of Cv to !R at suffi- agreement is to be expected because at low
ciently high temperatures, have been verified temperatures only low-frequency vibrations
experimentally for a number of gases . contribute, and these are treated accurately in
In a f ew cases (H 2 , HD and D2)' diatomic the Debye model. At intermediate tempera-
gases exist at temperatures for which kT is com- tures, the agreement with experiment is only
parable to the spacing of the rotational energy approximate.
levels, and quantum effects can be observed in Occasionally the lattice heat capacity of a
the rotational contribution to Cv . For H2 , Cv molecular crystal is represented by the sum of a
drops below ~R as the temperature is reduced Debye heat capacity for the vibrations of the N
below about 300 K, and below 50 K it becomes molecules as units and the appropriate number
equal to the ~R that is characteristic of transla- of terms given by Eq. (10) for the intermolecu-
tion. The translational energy levels of ordinary lar vibrations.
gases are so closely spaced that quantum effects In metals, there is an electronic heat capacity
are never important in the translational heat related to the translational motion of the con-
capacity . duction electrons. The small mass and high den-
The heat capacity of poly atomic gases can be sity of the electrons make quantum effects im-
treated by a straightforward generalization of portant, and Sommerfeld showed that their
the foregoing discussion. A molecule with n heat capacity should be proportional to tem-
atoms has three translational degrees of free- perature for temperatures below about 104 K.
dom and, if it is nonlinear, three rotational and This contribution is usually significant below a
3 n - 6 vibrational degrees of freedom . If it is few degrees Kelvin, where the lattice heat ca-
linear, it has two rotational and 3 n - 5 vibra- pacity is small, and also at high temperatures,
tional degrees of freedom. where it contributes to deviations from the rule
Solids For every solid there is a lattice heat of Dulong and Petit .
capacity associated with vibrations of the The heat capacities of a number of solids have
atoms. If the interatomic forces are harmonic, bumps or peaks superimposed on the smoothly
the N atoms in one mole of a monatomic varying lattice and electronic heat capacities.
solid have 3N independent vibrational modes, These are usually called anomalies and two dis-
and the lattice heat capacity is the sum of 3N tinct types can be recognized. A Schottky
terms given by Eq. (10) . Since the spectrum of anomaly is a smooth bump that arises from a
the 3N frequencies of a real solid is complicated set of energy levels for a single particle. The
and difficult to calculate, an approximation in- splitting of the rotational states of magnetic
troduced by Debye is widely used . In the Debye ions by electric fields, and of nuclei by electric
model, the vibratIOnal modes are sound waves in or magnetic fields, are examples for which the
an elastic continuum; the boundaries of the solid associated anomalies occur in the ranges 10- 1
determine the allowed wavelengths and the to 10 3 K and 10- 3 to 10- 1 K, respectively.
sound velocities then determine the frequencies. Lambda anomalies are sharp peaks produced by
To limit the number of frequencies to 3N, the cooperative processes involving many particles
spectrum is cut off at a maximum frequency in a transition from a low-temperature ordered
k8 0 /h where 8 0 , the Debye temperature, is state to a high-temperature disordered state.
determined by the sound velocities and is typ- Examples are: the momentum ordering of 4He
ically a few hundred degrees Kelvin. The cutoff atoms in liquid 4He at 2.18 K and of electrons
corresponds to the fact that in a real crystal, in superconductors at temperatures ranging
the vibrations have a minimum wavelength com- from 10- 1 to 10K; the magnetic ordering of
parable to the interatomic distance . The heat ferro magnets and antiferromagnets at tempera-
capacity is tures from 10- 3 to 10 3 K; the spatial ordering
of different atoms of an alloy on a superlattice,
12 4 R -
Cv = -rr
5
(T)3
8
(11)
e.g. , ~-brass at 750 K; and the rotational disor-
der in certain molecular crystals e.g., H2 at
0 1.5 K.
NORMAN E. PHILLIPS
for T ..;; 8 0 /20, but it increases less rapidly at
higher temperatures and approaches the classi- Cross-references:
CALORIMETRY, GAS LAWS,
cal limit 3R for T > 8 0 , The predicted high- HEAT, TEMPERATURE ANO THERMOMETRY,
temperature limit is in agreement with the THERMOOYNAMICS.
HEA T TRANSFER 556

HEAT TRANSFER Predictions for the value of the thermal conduc-


tivity k can be made from considerations of the
Establishment of thermodynamic equilibrium atomic structure. Accurate values, however, re-
for a system consisting of a number of media quire experimentation in which the heat flux q
requires that the temperature be locally unifor!? and the temperature gradient, grad T, are mea-
and time-wise constant. A departure from this sured and these values are inserted into Fourier's
condition causes a transfer of energy in the form equation. Figure I presents thermal conductiv-
of heat from locations with high temperature to ity values for a number of media in a large tem-
locations with low temperature. Such an energy perature range. It can be recognized that metals
transfer occurs very frequently and is encoun- have the largest conductivities and, among
tered in our everyday life as well as in many en- those, pure metals have larger values than alloys.
gineering applications or in scientific experi- Gases on the other hand, have very low heat
ments. It has, therefore, been known for a long cond~ctivity values. Electrically nonconducting
time. The fact that quantitative predictions have solids and liquids are arranged in between. The
become possible in the recent past only is due low thermal conductivity of air is utilized in the
to the situation that several mechanisms are development of thermally insulating materials.
usually involved and interrelated in such an en- Such materials, like cork or glass fiber, consist
ergy transfer process. They are generally classi- of a solid substance with a very large number of
fied as conduction, convection and radiation. small spaces filled by air. The thermal transport
Heat transfer by conduction is that process occurs then essentially through the air spaces,
which transports heat in a medium from one and the solid structure only supplies the frame-
location to another without involvement of any work which prevents convective currents. The
visible movement. It is generally the only or the range of thermal conductivities in Fig. I at am-
dominating mode of heat transfer in a solid bient temperature extends through S powers of
medium· however it occurs also in liquids and 10. This range is still small compared with the
gases. I~ such fl~ids, this energy transport is range for the electric conductivity of various
often augmented when parts of the fluid are in substances where electric con'ductors have values
movement and carry energy along. This mecha- which are larger by 2S powers of 10 than elec-
nism of heat transfer is classified as transfer by
convection. All media can also release energy in
the form of photons (electromagnetic waves).
This energy travels in space essentially with the o4...----,,t!::*---.,------r-----,
velocity of light until the photons are recap-
tured by some other atoms, causing in this way
heat transfer by radiation. An example of this
energy transport is the transfer of heat from the
sun to the earth. The three modes of energy 10
transfer mentioned above will be discussed con-
secutively in the following sections. Howe,:er, it ;.:
must be realized that they often occur Simul-
taneously, so that in some cases the tota~ en~rgy 1:
I
transport will be the sum of the contnbutlOns E 10
of the individual mechanisms. In other cases, ...,
.....
~
such a summation will not lead to the correct
result when the individual transport mecha- >-
nisms mutually interfere. t-
;; 10
Conduction From a microscopic standpoint, t=
thermal conduction refers to energy being u
::>
handed down from one atom or molecule to the a
z
0
next one. In a liquid or gas, these particles U
change their position continuously even with- .J
out visible movement and they transport energy <t
::<
also in this way. From a macroscopic or con- a::
w
tinuum viewpoint, thermal conduction is quan- I
t-
titatively described by Fourier's equation which
states that the heat flux q per unit time and 10-1
unit area through an area element arbitrarily
located in the medium is proportional to the
drop in temperature, - grad T, per unit length in
the direction normal to the area and to a
transport property k characteristic of the me- 1~·~1----L----~·---~·~---J104
10 10 10
dium and called thermal conductivity.
TEMPERATURE, K

q = -k grad T (1) FIG. 1.


557 HEAT TRANSFER

tric insulators. As a consequence of this fact, it stream tube to another and is the dominating
is much easier to channel electricity along a de- mode of transfer near solid walls. Convection
sired path than to do so with heat, a fact which transports heat along the stream lines and is
accounts for the difficulty in accurate experi- dominating in the main body of the fluid where
mentation in the field of heat transfer. the velocities are large. In many situations, the
Fourier's equation can be used together with a flow is turbulent; this means that unsteady mix-
statement on energy conservation to derive a ing motions are superimposed on the mean flow.
differential equation describing the temperature These mixing motions contribute also to a trans-
field in a medium. Fourier was the first one to port of heat between stream tubes, a process
develop this equation and to devise means for which can be thought of as being described by
its solution. In vector notation, this equation is an "effective" conductivity which often has
values by several powers of ten larger than the
aT
pc - = I/(kl/T) (2)
actual conductivity of the fluid.
at The movement of the fluid may be generated
by means external to the heat transfer process,
where p is the density, c is the specific heat, t is as by fans, blowers, or pumps. It may also be
time, 1/ is the Nabla operator. The temperature created by density differences connected with
field in a substance can either change in time the heat transfer process itself. The first mode
(unsteady state) or it can be independent of is called forced convection, the second one nat-
time (steady state aT/at = 0). For a steady-state ural or free convection. Convective heat transfer
situation, the temperature field depends pri- may also be classified as heat transfer in duct
marily on the geometry of the body involved flow or in external flow (over cylinders, spheres,
and on the boundary conditions. The simplest air foils, or similar objects). In the second case,
case of a steady state temperature field is the the heat transfer process is essentially concen-
one in a plane wall with temperatures which are trated in a thin fluid layer surrounding the ob-
uniform on each surface, however different at ject (boundary layer).
the two surfaces. The temperature in the wall Of special interest in such heat transfer proc-
then changes linearly in the direction of the sur- esses is the knowledge of the heat flux from
face normal as long as the variation of the ther- the surface of a solid object exposed to the
mal conductivity in the temperature range in- flow. This heat flux qw per unit area and time is
volved can be neglected. For an unsteady conventionally described by Newton's equation
process, the capacity of the medium to store
energy enters the energy conservation equation; (3)
correspondingly, the specific heat of the mate-
rial and its density become factors for the con- where Tw is the surface temperature and Tf is a
duction process, as well as the thermal conduc- characteristic temperature in the fluid. This
tivity. A combination of these properties, equation defining the heat transfer coefficient
defined as the ratio of the thermal conductivity h is convenient because in many situations the
to the product of specific heat and density, heat flux is at least approximately proportional
called thermal diffusivity (k/pc), then deter- to the temperature difference Tw - Tf. Informa-
mines how fast existing temperature differences tion on the heat transfer coefficients can be ob-
in a medium equalize in time. It is found that tained by a solution of the Navier-Stokes equa-
metals and gases have thermal diffusivity values tions describing the flow of a viscous fluid and
which are approximately equal in magnitude the related energy equation, or they are found
and considerably higher than thermal diffusivi- by experimentation. The availability of elec-
ties of liquid and solid nonconductors. This tronic computers has tremendously increased
means that temperature differences equalize our ability to study heat transfer analytically
much faster in metals and gases than in other at least for laminar flow, whereas in turbulent
substances. flow the bulk of our information is based on ex-
Various other physical processes lead in their periments. Such experimentation is made dif-
mathematical description to equations of the ficult by the large number of parameters in-
same form as Eq . (2), especially in its steady- volved. Dimensional analysis has therefore been
state form. Such processes are, for instance, the applied to reduce the number of influencing pa-
conduction of electricity in a conductor or the rameters, and relations for convective heat
shape of a thin membrane stretched over a transfer are correspondingly presented in mod-
curved boundary. This situation has led to the ern handbooks as relations between dimension-
development of analogies (electric analogy, soap less parameters. Such an analysis demonstrates,
film analogy) to heat conduction processes for instance, that heat transfer in forced flow
which are useful because they often offer the can be described by a relation of the form
advantage of simpler experimentation.
Convection It has been mentioned before Nu = f(Re, Pr) (4)
that in fluids, energy is often transported by
convection. In such a situation, conduction in which the Nusselt number Nu is a dimension-
takes care of the transport of heat from one less parameter hL/k, containing the heat trans-
HEA T TRANSFER 558

fer coefficient h; the Reynolds number Re = TABLE I RELATIONS FOR CONVECTIVE


p( VL/I1) describes essentially the nature of the HEAT TRANSFER
flow; and the Prandtl number Pr = cpl1/k can be
considered as a dimensionless transport property Forced Convection
characterizing the fluid involved. L and V are an
Channel Flow
arbitrarily selected characteristic length and ve- Flow through a tube:
locity, respectively, p denotes the density, 11 the laminar (Re < 3000)
viscosity, and cp the specific heat of the fluid at
constant pressure. Occasionally the Stanton Nu = 3.65
number, St = Nu/Re Pr, is used instead of the
Nusselt number as a dimensionless expression of turbulent (Re > 3000, Pr > 0.6)
the heat transfer coefficient. Equation (4) is Nu =0.116 (Re 2/3 -125)PrI/3
based on the assumption that the thermody-
namic and transport properties involved in the =hD = VD
heat transfer process can be considered as con- ( NU Re
k' v '
stant. Larger variations of such properties are
usually accounted for by additional terms in D = diameter, V = mean velocity)
Eq. (4) expressing the ratio of the varying trans-
port properties or of parameters (temperature, External Flow
pressure) on which they depend. Fluids occur- Flat plate parallel to flow:
ring in nature cover a very large range of Prandtl laminar (1000 < Re < 500000, Pr > 0.6)
numbers. Liquid metals, for instance, have values Nu =0.332.Ji[e ~
of order 0.001 to 0.01. Gases have values be-
tween 0.6 and I, and oils have values up to turbulent (Re > 500000, Pr > 0.6)
10 000 and more. Some heat transfer relations 0.0297 Re4 15 Pr
for forced convection are presented in Table I. Nu=--------------------
Relations for other situations can be found in 1 + 1.3Re- IIlO Pr-1/6 (Pr - 1)
the various texts mentioned at the end of this
section or in corresponding handbooks. With ( hx pVsx
regard to the relations in Table I and other sim-
ilar equations, it has to be kept in mind that
they are valid over a restricted range of the in-
Nu • "R,' :. .' :~:~:'::~:::::: : :)
dependent parameters only. Cylinder normal to flow : 1< Re < 4000
Heat transfer by free convection is described
by relations of the form Nu = 0.43 + 0.48.Ji[e
hD pVoD
Nu = f(Gr, Pr) (5) ( Nu =--
k '
Re =---- D =diameter
JJ' ,
In this equation, the Grashof number
Vo = upstream velOcity)
p2g(j(Tw - Tr)L3
Gr=~~------~-- Natural Convection
112 Vertical flat plate:
laminar (10 4 < Gr < 10 8 )
(where g is the gravitational constant, (3 is the
thermal expansion coefficient, Tr is the fluid Nu =0.508Gr 1/4Pr 112 (0.952 + Pr)-1/4
temperature at a distance where it is not influ- turbulent (Gr > 10 8 )
enced by the heated or cooled object with sur-
face temperature Tw) replaces the Reynolds Nu = 0.0295 Gr 2l5 Pr7/5 (1 + 0.494Pr2/3)-2I5
number. Sometimes a dimensionless parameter
called Rayleigh number (Ra = Gr Pr) is used in- hx p2g(3(Tw - Tr)x 3
(Nu = -- Gr = x = distance
stead of the Grashof number. Equation (5) as- k ' JJ2'
sumes again that the fluid properties involved
are nearly constant. Examples for such relations from leading edge, Tw = wall temperature, Tr =
are also contained in Table l. Examples for free fluid temperature at some distance from plate)
convection situations are the heat transfer from (All surfaces are assumed to have uniform temper-
a heating register in the room. Free convection is ature)
also an important faCtor in the establishment of
the temperature in the atmosphere. In the free
convection relations of Table I and in Eq. (5), it Space does not permit the discussion of other
has been assumed that the convection flows are heat transfer processes, although such processes
generated by the gravitational field. Natural con- have found increasing attention in recent years.
vection can also be created by other body Especially large heat transfer coefficients are
forces, like centrifugal and Coriolis forces or created by a boiling or condensation process.
electromagnetic forces. Boiling heat transfer is therefore used in appli-
559 HEA T TRANSFER

cations which have to deal with very large heat iv


c 2 -=f(T, v) (7)
fluxes like chambers and nozzles of rockets or Qv
the anodes in electric arc devices. Heat transfer
is also often combined with mass transfer pro- in which iv is the monochromatic intensity
cesses. This is, for instance, the case in evapora- (flux per unit area and unit solid angle) of the
tion devices. Heat transfer may also occur com- radiative flux at frequency v originating in me-
bined with chemical reactions as in processes dium 2 and traveling through the interface into
involving gases at very high temperature where medium 1 per unit solid angle and area nor-
combustion, dissociation, or ionization occur. mal to the direction of the radiant beam. Qv is
Radiation Energy can also be transferred the monochromatic absorptance or absorptiv-
from one location to another within a medium ity, i.e., that fraction of a radiant beam ap-
or from one medium to another in the form of proaching the interface in the medium 1 in the
photons (electromagnetic waves). Usually a opposite direction that is absorbed in medium 2.
multiplicity of wavelengths A is involved in such c is the wave velocity in medium 1. Kirchoff's
energy transfer. * In vacuum, all waves regardless law states that the combination of the param-
of their wavelength move with the same speed eters of the left-hand side of Eq. (7) is again a
2.9977 X 10 8 m/sec. In various substances, the function of temperature and frequency only,
wave velocity c changes somewhat with wave- but does not depend on the nature of the me-
length, and the ratio of the wave velocity in dium. A medium which absorbs all the radiation
vacuum to the velocity in a substance IS equal to traveling into it through an interface (Qv = 1) is
the optical refraction index. Air and generally called a blackbody. The intensity of radiation
all gases have refraction indices which differ emitted by an arbitrary medium is, according to
from one only in the fourth decimal. Their Eq. (7), in the following way related to the in-
wave velocity is, therefore, practically equal to tensity of radiation ibv emitted by a blackbody
that in vacuum. at the same temperature and frequency
Prevost's principle states that the amount of
energy emitted by a volume element within a
radiating substance is completely independent (8)
of its surroundings. Whether the volume element
increases or decreases its temperature by the From the consideration of a system in thermo-
process of radiation depends on whether it ab- dynamic equilibrium, it is also easily shown that
sorbs more foreign radiation than it emits or ibv is independent of direction and that the total
vice versa. One talks about thermal radiation monochromatic radiant flux emitted by a black-
when the emission of photons is thermally ex- body per unit interface area and unit time into
cited, i.e., when the substance within the vol- all directions in space is equal to rribv.
ume element is nearly in thermodynamic equi- The law describing the monochromatic inten-
librium. The discussion in this section will be sity of radiation of a blackbody is given by
restricted to thermal radiation. For such radia- Planck's equation
tion Kirchhoff was able to derive a number of
relations by consideration of a system of media 2hv 3
in thermodynamic equilibrium. If jv indicates
ibv = - - - - - - (9)
c2(ehv/kT - I)
the co-efficient of emission, i.e., the radiative
flux at the frequency lJ* emitted per unit vol- (where h is Planck's quantum constant and k is
ume into a unit solid angle, and K is the co- Boltzmann's constant). Experimentalists prefer
efficient of absorption at the same frequency, to use the intensity ibi\ per unit wavelength
that is the fraction of the intensity of a radiant (ibi\dA = ibvdv). Figure 2 presents the wave-
beam Zvhich is absorbed per unit path length, length dependence of the intensity of black-
then one of these relations states body monochromatic radiation for a number of
temperatures. It may be observed that for each
jv temperature, the intensity has a maximum at a
c2 - =f(T, v) (6)
Kv certain wavelength and that this maximum shifts
toward short wavelengths with increasing tem-
with c denoting the wave velocity. According perature (Wien's law). The wavelength A is
to this relation, the combination of parameters plotted on the abscissa in micrometers, J.1m,
on the left-hand side of the equation is a func- (1000 J.1m = I mm). Our eye is sensitive to radia-
tion of temperature T and frequency v of the tion in the range 0.4 to 0.7 J.1m (the dashed
radiation only, but does not depend on the range). It may be observed that for tempera-
substance under consideration. Kirchhoff's law tures with which we have largely to deal, the
can also be expressed in parameters which refer bulk of blackbody radiation is contained in the
to the interface of two media 1 and 2. It then range of wavelengths larger than the visible ones
takes the form (infrared range). This statement also holds for
other media because Eq. (8) shows that no me-
*Frequency and wavelength are used interchange- dium can have a monochromatic intensity
ably. They are connected by the relation I\V = c. which is higher than that of a blackbody. Only
HEA T TRANSFER 560

about radiative interchange between surfaces.


BOOOn7.r-,,---r.---rr---.----, Kirchhoff's law, Eq. (8), additionally connects
the intensity of a beam emitted through the in-
terface with the absorptivity and the intensity
of a beam leaving a blackbody at the same tem-
perature. Electromagnetic theory shows that
electric nonconducting materials have in the
infrared range generally high values of the ab-
sorptivity and correspondingly low values of
the reflectivity. Metals (electric conductors), on
the other hand, behave in the opposite way,
having low absorptivity and' high reflectivity
values. This fact is utilized in aluminum-insula-
tions and in vacuum thermos bottles. In the
visible range, the appearance of surfaces on vari-
ous materials to the eye already supplies infor-
mation on approximate values of the reflectivity
and absorptivity. The values of these parame-
ters largely determine the reflective and ab-
4 6 10 sorption characteristics of surfaces for solar
irradiation. A white surface, for instance, has a
FIG. 2. very low absorptivity and reflects the major
part of solar irradiation. Gases behave differ-
ently with respect to radiation. They need fairly
radiation coming from the sun has a major large layers in order to absorb the major part of
portion of the energy in the visible wave- incident radiation and radiate only in restricted
length range (corresponding to a temperature wavelength ranges whereas solids and liquids
of 6500 K). have a more continuous spectrum. Diatomic
The total energy flux q emitted per unit area gases (0 2 , N2 ) do not absorb energy with good
and time from a blackbody into all directions in approximation. Solar irradiation is absorbed in
space can be obtained by integration of Eq. (9) the atmosphere by carbon dioxide and water
over all frequencies and by multiplication of vapor. Values for reflectivities, absorptivities,
the result by 'fr. For blackbody radiation into a and transmissivities of various substances are
vacuum (or with good approximation into a contained in the Handbook reference listed at
gas), the result is the end of the ariticle.
Values for the radiation properties (Pv, O:v, Tv)
q = aT4 (10) together with the relations for blackbody radia-
tion are the basis of calculations to determine
The Stefan-Boltzmann constant a has the value heat exchange by radiation in a system with lo-
5.67 X 1O-8W/m3 K4 cally varying temperature. Calculations of such
The following additional relation exists at an interchange are in general very involved, espe-
interface between two substllnces. cially when substances with small absorption
coefficients are involved. The formulation of
Pv + O:v + Tv = I (11) such interchange leads to jntegral differential
equations. The reader has in this connection to
This equation describes that monochromatic be referred to the books listed at the end of this
radiant energy in a beam approaching in a me- section, and only a few relations for simple
dium I the interface with a medium 2 is found geometries will be presented here. Consider two
again either as radiation reflected back into the area elements, dA 1 and dA 2, belonging to two
medium 1 or absorbed in the medium 2 or blackbodies with the temperatures T 1 and T 2.
transmitted through the medium 2 into other The distance between the two area elements is
media or back into medium 1. The monochro- s, and (31 and (32 are the angles between the two
matic reflectance or reflectivity Pv is the ratio surface normals and the interconnecting line.
of reflected to incident radiant energy, O:v is the The following equation then describes the net
corresponding ratio for the absorbed, and Tv for heat transfer dQ from area element dA 1 to area
the transmitted energy. The vast majority of element dA 2 per unit time assuming that no ra-
solids and liquids absorb radiant energy over diation is absorbed or emitted in the space be-
most wavelengths in the infrared range within a tween the two surfaces.
very thin layer adjacent to the interface (of
order IJ.l to I mm). In heat transfer calculations, cos (31 cos (32
it can therefore usually be assumed that the dQ= dA 1 dA 2 a(T I 4-T2 4)
transmissivity of such substances is equal to 'frS2 (12)
zero and that reflectivity and absorptivity are
connected to the temperature of the interface. If non-black surfaces are involved, then the
One talks then often in a simplified manner process of radiant interchange is much more
561 HEISENBERG UNCERTAINTY PRINCIPLE

involved, since part of the incident radiation is Cross-references: HEAT, HEAT CAPACITY, IN·
now reflected from the surfaces and travels in FRARED RADIATION, PHOTON, REFLECTION,
this way back and forth until it is finally ab- THERMAL RADIATION, THERMODYNAMICS.
sorbed. Simple relations exist in this case for the
radiative interchange between the surfaces of
two concentric spheres or cylinders with areas
Al and A2 and temperatures Tl and T 2 . It is HEISENBERG UNCERTAINTY PRINCIPLE
further assumed that both surfaces are emitting Classical physics is based on two assumptions
and reflecting in a perfectly diffuse way and that that have been found experimentally to be un-
they are separated by a medium which does tenable. The first of these is the existence of
neither emit nor absorb radiation. The net signals that can travel with infinite speed; the
monochromatic interchange d<l>v between the second is that the magnitude of the interaction
two surfaces is then described by the equation between two systems can be reduced to arbi-
rr trarily small values. The realization that the
1) dA, (ib"> - ;b,,) speed of propagation of signals has a finite upper
limit led to the development of relativity theory.
The recognition of the existence of a finite
quantum of action has been incorporated in
(13) quantum (wave) mechanics.
Quantum mechanics assigns a physical reality
The monochromatic intensities ibvl and ibV2 are only to those variables whose value can, in prin-
calculated with Eq. (9). The relation changes ciple, be experimentally determined. About the
when the outer cylinder reflects radiation mir- existence of phenomena or systems that cannot
ror-like (specularly) to be experimentally observed, quantum mechanics
is noncommittal. Questions regarding an isolated
system are meaningless in quantum mechanics,
for any observation made on such a system nec-
essarily disturbs its isolation by at least one
quantum of action. Heisenberg S observed that
(14) any measurement made on a system destroys
some of the knowledge gained about that system
Both equations merge asymptotically into the through previous measurements. Any prediction
same relation when the differences between the about the future course of a system must be
two radii become small. The corresponding rela- contingent on a knowledge about the measure-
tion then also holds for two parallel infinite ments that will be made on that system, and is
planes. Equations (13) and (14) have to be inte- subject to uncertainties introduced by the mea-
grated over all frequencies to obtain the net surements. Whereas one might speculate with
heat transfer between the two surfaces. The re- some reliability about the future course of a
sult is simple when the absorptllnces (XVI and system under the restriction that no more mea-
(XV2 are independent of frequency (gray sur- surements will be performed on it, such speCUla-
faces). Equations (13) and (14) describe then tions would be physically meaningless, as they
the net heat transfer, when ibvl and ibV2 are could not be experimentally confirmed or
replaced by (a/rr)T I 4 and (a/rr)T 2 4 , respec- denied.
tively. In the broadest sense, then, the Heisenberg
Uncertainty Principle states that the partitioning
E. R. G. ECKERT of the universe into observer (either a human
observer, or a recording device such as a photo-
References graphic plate) and observed is subject to a finite
inaccuracy; one might say that the "knife" or
Eckert, E. R. G., and Drake, R. M., Jr., "Analysis of "pencil" that makes the partition has a finite
Heat and Mass Transfer," New York, McGraw-Hill, "thickness," h.
Inc., 1972. The concept of a monochromatic beam of
Arpaci, V., "Conduction Heat Transfer," Reading, radiation is not a difficult one to accept. Yet the
Mass., Addison-Wesley Publishing Co., 1966. experimental determination of the frequency of
Kays, W. M., "Convective Heat and Mass Transfer," such a beam requires an infinite time interval;
New York, McGraw-Hill Book Co., 1966. any finite portion chopped from the beam is
Hottel, H. C., and Sarofim, A. F., "Radiative Trans- shown by Fourier analysis to have a spectrum of
fer," New York, McGraw-Hill Book Co., 1967. finite width, hence not a single frequency at all.
Patankar, S. V., "Numerical Heat Transfer and Fluid Quantum mechanics does not deny the exis-
Flow," Washington, D.C., Hemisphere Publishing tence of a monochromatic beam, but it does
Corp., 1980. render the assignment of a definite frequency in
"Handbook of Heat Transfer," Rohsenow, W. H., and a given time interval meaningless.
Hartnett, J. P. (Eds.), New York, McGraw-Hill Book Fourier analysis shows that the specifications
Co., 1973. of the time interval, /).t, and of the spectral
HEISENBERG UNCERTAINTY PRINCIPLE 562

width, .::lv, are reciprocally related: two variables, a and b, of a system, whose opera-
tors are (i and ~. The expectation values of a
(1) and b are called 7l and Ti respectively. The uncer-
tainties in a and b can then be defined quanti-
This equation represents the uncertainty rela- tatively as their respective rms deviations from
tion for classical waves: any attempt to specify their expectation values; it follows from the
the frequency at an instant of time results in a postulates that
broadening of the frequency spectrum. This un-
certainty relation applies to any wave, whether
electromagnetic, acoustic, or otherwise. (.::la)2 ==f\fl*«(i - ii)2 \fI dq
Interference patterns observed when electron
beams are reflected from crystalline surfaces 3 or

f ...
where \fI is the normalized wave function of the
transmitted through thin metallic films 7 indicate
that these beams possess some wave character- system under observation, and dq indi-
istics. DeBroglie 2 , by independent theoretical
considerations (see WAVE MECHANICS), postu- cates integration over all values of all coordi-
lated the following relations between the dy- nates. Similarly,
namic variables, energy (E) and linear momen-
tum (p) of the beam, on the one hand, and the
wave variables, frequency (v) and wavelength
(A), on the other:
E =hv (2) If (i and ~ are Hermitian, * «(i - a)2 and (~- b)2
are also Hermitian. Therefore:
p = h/A (3)

Substitution of Eq. (2) in Eq. (1) gives the (.::la)2 . (.::lb)2 =


uncertainty relation between energy and time:
.::lE·.::lt~ h (4) ~
=f\fl*( (i - a)2 \fI dq ·f\fl*( - b)2 \fI dq

For a beam of free electrons of mass m, travel-


ing in the x direction,.::lt = m .::lx/Px, where Px =f( (i - a) \fI( (1- a)* \fI* dq
is the linear momentum and .::lx is the distance
traveled in the time interval .::l t. Since E =
p~/2m,.::lE ~ px .::lpx/m, so that.::lE· .::It=
.::lPx . .::lx, and ·f( ~- b) \fI (~ - b)* \fI* dq

'f'(~ -
(5)
=f/«(j - a)\fI/ 2 dq b)\fI/2dq.
The pairs of variables (E, t) and (Px,x) are
called canonically conjugate pairs of variables.
In quantum mechanics, the operators corre- To put this product of two definite integrals in a
sponding to canonically conjugate variables do more useful form, consider the function
not commute (see QUANTUM THEORY). Heisen-
berg originally stated his uncertainty principle f(q)=A«(i - a)\fI+i('i3 - b)\fI
in the following form: the values of canonically
conjugate variables of a given system can only where A is real, and independent of coordinates,
be determined with a finite lower limit of and i == v'-1. The function / f(q )1 2 must be non-
accuracy. negative:
Among the many important experimental
phenomena illustrating the uncertainty principle A2 1«(i- a)\fI12+iA[«(i*- a)\fI*(~- o)\fI+
in the COMPTON EFFECT. Here, a photon is scat-
tered by an electron; the momentum of the - «(i - a) \fI (~* - D) \fI * 1 + I(~- b) \fII2 ~ 0
photon is rendered uncertain as a result of its
scattering by the electron and the electron is
moved from its original position by the impact When the left-hand side of this inequality is
received from the photon. If we consider the integrated over all values of all coordinates:
photon and the electron as separate systems,
then their interaction (the collision) introduces *An operator :J is Hermitian, if for any properly
uncertainties for each system, given by Equa- behaved functions u and u: pu:Ju dq = g;uCf*u dq,
tion (5). where * indicates complex conjugation. Quantum
A general uncertainty relation follows from mechanical operators usually are, or can be made to
the postulates of WAVE MECHANICS. Consider be, Hermitian.
563 HIGH-VOLTAGE RESEARCH

are given respectively by: P = - if! "V, where


;\2(Aa)2 +(Ab)2 + iA~«(i *- a)W*(~ - O)W dq + f! = h/2rr, and !l = q (multiplication by q).
For linear motion in the x direction,
Px =-ifla/ax, !l=x. Hence (P!l- H)\}!=
-feCi - a)W(~ * - Ii)w* dq] ~ 0 = -ill(a/ax) (x\}!) + i1ix(a\}!/ax) =-ifl\}!. When
this expression is substituted into Eq. (6), it fol-
lows that:
Since Ais real, the left-hand side of this inequal-
ity becomes negative unless the discriminant IAPxllAxl ~ h/4rr
becomes zero or negative:
which is in agreement withEq. (5). Equation (4)
4(Aa)2(Ab)2~_[f(i*- a)W*(~- b)Wdq+ can be similarly derived from the postulates of
wave mechanics by setting 8. = (ih/2rr) (a/at),
3'= t.
-f«(i- a)W(~* - b)w*dqf A. L. LOEB
References
The right-hand side of this inequality is reduced
1. Bohm, David, "Quantum Theory," Englewood
as follows (remember that W is normalized): Cliffs, N.J., Prentice-Hall, 1951.
2. De Broglie, 1., J. Phys. Ser. 6, 7, 1 "Introduction
~
cf(i * - a)W*( - Ii)W dq =f(l W)*~ W dq + to Wave Mechanics," London, Methuen, (1926).
3. Davisson, C. J., and Germer, 1. H., Phys. Rev., 30,
705 (1927); Proc. Natl. A cad. Sci. U.S., 14, 317
~
- afw* W dq - bfW( (l W)* dq + ab (1928).

f.
4. Harris, 1., and Loeb, A. 1., "Introduction to Wave
Mechanics," New York, McGraw-Hill Book Co.,
(l - a)W(~ * - b)W* dq =f«(i W)(~ W)*dq + 5. 1963.
Heisenberg, W., Z. Physik, 43, 172; "The Physical
Principles of the Quantum Theory," Chicago, Uni-
- afW(~W)*dq - brLW*«(l\}!)dq + ab versity of Chicago Press, (1927).
J 6. Margenau, H., and Murphy, G. M., "The Math-
ematics of Physics and Chemistry," New York,
Since b =f\}!*~\}! dq =f\}!(~\}!)* dq, and Van Nostrand Reinhold, 1943.
7. Thomson, G. P., Proc. Roy. Soc. London Ser. A,
117,600 (1928); 119,651 (1928).
a=t\}!*(i\}!dq =t\}!«(l\}!)*dq ,
Cross references: COMPTON EFFECT, QUANTUM
THEORY, WAVE MECHANICS.
4(Aa)2 (Ab)2 ~ - [t«(i\}!)* (~\}!)dq +
-f(el \}!) (~\}!)*dqr HIGH-VOLTAGE RESEARCH
High-voltage research deals with phenomena
evoked by high voltages and intense electric
Since (j and ~ are Hermitian fields, with the behavior of dielectrics and elec-
trical components under such electrical stress,
f«(j \}!)* (~\}!) dq == f(~ \}!)«(j \}!)* dq = and with the utilization of electrostatic fields
and forces for various purposes of science and
industry. In the laboratory, this electrical stress
=t\}!* (j ~\}! dq, is produced by the presence of electric charge
on the opposing surfaces of two electrodes be-
tween which a voltage is applied . Pulsed, alter-
f.(j\}!)(~\}!)*dq =f\}!*M\}!dq (6) nating and constant voltages ranging from a few
kilovolts to 20 MY' have been used in such
studies. In nature, air currents and water pre-
:. 4(Aa)2 (Ab)2 ~ - [ f\}!*( (i 'B - 'B (j ) \}! dqf cipitation cause electric charge to become
separated between cloud and earth or between
clouds; the stressed region may reach electrical
and IAal'IAbl~!if\}!*«(l~ - ~(j)\}!dq pressure differences in excess of 1000 MV.
Lightning, a rapid high-current discharge, com-
pletes the breakdown of the over-stressed air
According to the postulates of wave mechan- and dissipates the accumulated electrical energy.
ics, the operators for momentum and position Many aspects of this natural high-voltage phe-
HIGH-VOLTAGE RESEARCH 564

nomena have been the subject of investigation the 60 years centered on the turn of the twen-
because of their scientific interest and the danger tieth century, physicists studied the passage of
of lightning to life and to susceptible structures electricity through gases at nonnal and reduced
such as electric power systems. pressure, sought an understanding of long sparks
In the industrial high-voltage laboratory, the and corona in atmospheric air, measured the
direct and induced effects of lightning discharges conductivity and breakdown strength of liquid
on electrical apparatus are often simulated by dielectrics, and examined the flashover of solid
high-voltage impulse generators. These use the insulators in these media.
Marx method of first slowly charging a number More recent research has been directed at the
of condensers in parallel and then suddenly con- perfonnance under high electrical stress of a
necting them in series by spark-gap switches wide range of solid, liquid, and gaseous dielec-
which at the same instant impress the multiplied trics as well as vacuum-insulated systems. The
voltage upon the test circuit. A typical voltage solid materials vary from porcelains and glasses
wave produced by such impulse generators rises to hydrocarbon polymers and loaded epoxies.
to its peak value of several million volts in 1 Ilsec Superior gaseous insulation is now obtained by
and then diminishes exponentially reaching half- the combined use of such electronegative gas
voltage in 10 Ilsec. moleculesas sulfur hexafluoride (SF 6) and Freon
Electric Field between High-voltage Electrodes (CCI 2 F 2 ) with elevated pressures sometimes ex-
The region between and around electrodes which ceeding 5 or 10 atmospheres. Solid insulator
have been charged by the application of a voltage supports are indispensable in gas, liquid, and
V between them is occupied by the electric field vacuum insulated systems and are studied in
of that charge. In an isolated system, the positive combination with these media.
electrode has a deficiency of electrons exactly Research in the high-voltage field may also be
equal to the excess electrons on the negative directed at testing and increasing the insulation
electrode. The amount of electric charge Q on strength of power equipment subject to lightning
either electrode surface at any instant is given or switching surges. The increasing trend toward
by Q = CV where Q is in coulombs and C is the higher voltages for the transmission of electric
capacitance of the electrode system in farads . power over long distances has directed research
Energy is stored in the electrically stressed space toward reduction of radio interference and
between the electrodes; the amount of this elec- power loss by corona and surface leakage from
trical energy W can be expressed in terms of high-voltage transmission line conductors and
applied voltage or separated charge by W = their suspension insulators. The need to bring
cVl /2 = Q2 /2C joules. such power into urban areas has produced the
The electric field intensity at any point is requirement for ac power cables in which the
defined by the magnitude and direction of the center conductor is reliably insulated for hun-
force which would be experienced by a unit dreds of kilovolts above earth. The inherent
positive charge placed in the field at that point. efficiency and stability of dc power transmission
Following Faraday, if we define a line of electric have led to the development of more adequate
flux as a line drawn so that its direction is every- high-voltage rectification and conversion ap-
where the direction of the force on a positive paratus. For these purposes, the low-pressure
particle, and require that one line must originate metallic vapor tube has reached the highest
on each unit positive charge and terminate power levels though solid state devices offer
on each unit negative charge, then the lines of much promise for high-voltage, high-power
flux will map out the electric field and the lines switching.
per unit area will be directly related to the elec- In the field of science and medicine, high-
trical field intensity E. The electrostatic force voltage research seeks improved methods of pro-
acting on a particle with charge q placed in an ducing high constant voltages, of measuring and
electric field of intensity E volts per meter is stabilizing such voltages, and of applying them
given by f = qE newtons. The electric field dis- to the acceleration of atomic ions and electrons
tribution depends upon the geometry of the to high energies. Such particle accelerators are
electrodes but is affected by the presence of needed for nuclear structure research; for the
dielectric materials or of charged particles; a study of the properties of energetic atoms,
quantitative picture of the static or changing electrons, x-rays, and neutrons; for the treatment
field picture is usually essential to high-voltage of deep-seated tumors with energetic particles
research. and radiation, and for the radiation processing
Objectives of High-Voltage Research From of materials.
antiquity to the present, the history of high- Industrial objectives of high-voltage research
voltage research sparkles with many names well- include the development of high-power, high-
known in electrical science-Thales of Miletus, frequency tubes and their power supplies for
von Guericke, Newton, Franklin whose kite ex- radar and long-range radio communication, the
periment established the identity of natural ionization of particulate matter by corona and
lightning and electricity, Cavendish, Faraday, its electrostatic collection as in smoke and chem-
and Roentgen whose discovery of x-rays in 1895 ical precipitators, and the elimination of electro-
marked the beginning of the atomic age. During static hazards which arise in processes such as
565 HISTORY OF PHYSICS

the transfer or mixing of volatile hydrocarbons, maintains that most of the major achievements
dusts, and explosive gases. of physical science of our age can be traced back
to the stream of knowledge started by this
JOHN G. TRUMP Ionian intellectual giant in Miletus. Pythagoras,
Democritus, Anaxagoras, Aristarchos of Sam os,
References to mention a few, represent as one historian
exclaims, that "miracle of ancient Greece" that
Graggs, J. D., and Meek, J. M., "High Voltage Labo- prepared and shaped the climax of Plato and
ratory Technique," Butterworths Scientific Publi- Aristotle, who for two millenia were to inspire
cations, London, 1954. and guide, for better or worse, the evolution of
Trump, J. G., "Electrostatic Sources of Electric Power," physics.
Elec. Eng., 66, No.6, 525-534 (June 1947). Liberated from the mythology of pre-Socratic
Trump, J. G., "New Developments in High Voltage time, these ancient astounding thinkers repre-
Technology," IEEE Trans. on Nuclear Science NS-14 sent every school of philosophy from the ex-
(3),113-119 (June 1967). treme idealism of illusion and nonexistence of
the world of sense perception, in the Pythago-
Cross-references: ACCELERATOR, V AN DE GRAAFF; rean-Platonic sense, to the atomism and materi-
ELECTRICITY; POTENTIAL; STATIC ELECTRIC- alism of Democritus and Anaxagoras with their
ITY. doctrine of the primacy of matter in a universe
manipulated by accidental mechanism. It has
been said that everyone by nature is a disciple
HISTORY OF PHYSICS of either Plato or Aristotle, and Raphael in his
Any attempt to trace the beginning of man's famous painting of the School of Athens on the
ideas on the nature of the physical world brings wall of the Vatican Palace appropriately illus-
us to anthropomorphic, mythological cosmol- trates Plato pointing upward and Aristotle
ogy. It is impossible to trace any absolute adum- downward to the ground. Thus, the Alexandrian
bration of some scientific doctrine. If a hint School in Hellenistic Egypt with its lighthouse,
could be suggested, then the Western world ap- Pharos, as a symbol irradiated the glory of
pears to have drawn more from Mesopotamian Hellenic science for centuries, nourishing West-
sources and traditions than from the Nile civili- ern civilization with Euclides' Stoicheia and
zation. No archeological dust will reveal the in- Ptolemy's Almagest which, outside the Bible,
scrutable past. As for physics, in the limited were the most widespread literary sources in
space available we can start only with the era physical science relegated to antiquity.
when specific names emerged that are associ- With the rise of Christianity, followed by Islam
ated with the earliest formation of scientific half a millenium later, interest in the studies of
ideas, described at that time as philosophy. The natural science was temporarily paralyzed, to be
Ionian period, that amazing flow of intellectual later zealously renewed under the aegis of the
energy, paved the way for the rise of the fathers theology of the new religion. A convert to
of Hellenic science. Physics, then closely linked Christianity, St. Augustine (A.D. 354-430),
with philosophy and mythological astronomy, later Bishop of Hippo, was a pioneer in the
played an auxiliary role in man's overwhelming realization that Plato's ideas on sense delusion
concern in cosmography and cosmogony. conveniently responded to the devout Christian's
Although physics as an independent field search for the salvation of the soul. Like Socrates
originated in the age of Renaissance and was and Sophists about a millenium previously, so
associated with such giants as Galileo, Kepler, St. Augustine also turned his back on nature
and Newton, even in this brief sketch it is and advised, "Return to thyself. In· the inner
indispensable to trace the fountainhead of this man dwells truth." Thus, for a thousand years
science, which is in the cornerstone of Western the men who guided the thought of the Western
civilization. The port of Miletus in Ionia, on the world did not observe out-of-doors and learn
eastern coast of the Aegean sea, can be singled from natural phenomena. Monasteries became
out as among the most significant birthplaces of leading establishments where sedentary monks
physical science. Located in a favorable geo- pored over volumes of Plato and Aristotle. More
graphic position, its flourishing commerce since than half a millenium after St. Augustine, St.
the second millenium before our era provided Thomas Aquinas (A.D. 1227-1274) petrified
an excellent clearing house between two river Aristotelian peripatetic scholasticism into an
civilizations, Egypt and Mesopotamia. Exposing authoritarian Summa Theologica that included
people to divergent ideas and traditions, it all the answers man should know on the nature
created an atmosphere of open-mindedness of the physical world. It must be mentioned
where new, unrestrained ideas could flourish. that the science of Western Islam helped to
Consequently, several most unique rationalizing shape the cosmological ideas of St. Thomas
minds had their roots in this town alone. Aquinas and, through him, the whole thought
Of the galaxy of great pioneers of Ionian of Catholic Europe. We should stress that
science, Thales of Miletus (624-565 B.C.) stands Ptolemy's masterpiece, Almagest, first reached
out as a symbol of the era. Sir James Jeans Europe in the Arabic language.
HISTORY OF PHYSICS 566

This marked the climax of the Middle Ages, mathematical discovery of the planet Neptune
described unjustly as "dark ages." Although by J. C. Adams and Leverrier in 1845. It was
theology was the queen of sciences, this period then considered the final shape of knowledge
was not devoid of scientific activity because man was in position to realize. Previously,
speculations in physics were constantly being Laplace produced an overwhelming impression
nourished by the mystery of the Primum on the entire century when, in 1798, he used
Mobile, a sphere beyond the fixed stars. It was in his Origin of the World System his determin-
this Prime Mover to which Aristotelians ascribed istic equations in the formulation of his hypoth-
the first supernatural impulses or "impetus." esis on the origin of the solar system .
How else could motion first have started? With With advancing crystallization, Newtonian
the rise of Humanism, the rediscovery of Greek physics radiated with a galaxy of great names,
literature of antiquity revealed vast subjects inspired builders of the classical view of the
dealing with fields other than theology. Invig- physical world. Only a few principal milestones
orating new studies spread through Western can be indicated, each a giant of his own in a
Europe invading universities, not by-passing panoramic view of the glorious century of
some monasteries. Bold, unusual views on the promise of a scientific paradise. From Laplace
nature of the physical world and methods of at the beginning of the nineteenth century, the
investigation aroused the suspicion of watchful epic unfolds from Avogadro to Faraday, from
scholasticians but the age of Renaissance could Carnot and Joule to Kelvin and Helmholtz,
not be diverted. Even the prominent ecclesiastic attaining its pre-Einsteinian peak in Maxwellian
Oresme (1332-1382), Bishop of Lisieux, suS- equations formulating the electromagnetic the-
tained the main interest by challenging the ory of light. These equations were impressively
Aristotelian doctrine of the fixity of the earth. described by Boltzmann, himself at the cradle
Nearly a century later this was continued by of thermodynamics, when he quoted from
Cardinal Nicolas of Cusa who, like the Fran- Goethe's Faust: "Who was the god who wrote
ciscan monk, Roger Bacon, in the thirteenth these lines?" Yet, these equations so brilliantly
century also advocated experimentation in order describing natural processes, pointed inevitably
to learn how the laws of nature operate. Very to a deterministic and mechanistic universe.
penetrating studies on the mystery of "violent" Although rigorous mechanicism had flourished
motion and inertia were accomplished long in the past in varying degrees, Newton must be
before their actual fruition with the appearance regarded as the founder of the mechanistic
of Galileo and Newton. world view even though he had difficulty in
Signs of a new era in physics were imminent harmonizing mechanistic natural philosophy
with ideas of the universal genius, Leonardo with his belief in a God. who not only created
da Vinci, maintaining like. Democritus and the world but also constantly preserves it. This
Anaximander that the whole universe conforms mechanization of the world picture systemat-
to unalterable mechanical laws. The coming ically led to the conception of God as a retired
dawn was evident when Nicolaus Copernicus engineer, and it was only another step to His
(1473-1543) came to study in Bologna and complete exclusion. Therefore, the universe was
Padua. Being more an ancient Greek philosopher ultimately knowable and predictable. This Welt-
in his use of geometry in support of the helio- anschauung of triumphant physics encouraged
centric system, Copernicus at least prepared the the rise of materialistic philosophy, that actually
way of Galileo (1564-1642), who finally mobi- shaped the dialectic materialism of Marx and
lized all known physics with his inventive exper- Engels, which became the official doctrine of
imentalism that adumbrated the first full stream the ruling communist state in the twentieth
of scientific revolution. Symptomatically, Gal- century.
ileo's first work was on motion, De motu, a Newtonian physics was not destined to remain
subject of great concern through the Middle the last form in the evolution of physics. When
Ages, and his was the final challenging blow to it appeared to reach its perfection, as some
the Aristotelian doctrine, when he verified that leading physicists advocated, the turn of the
force primarily produces acceleration instead of twentieth century again witnessed another tidal
mere movement. wave that changed the course of physics. By
As a sign of continuity, Newton was born 1895, this second scientific revolution started
(1642-1727) the year Galileo passed away. The with discoveries of the first magnitude, con-
trend of mechanism of physical phenomena as taining unfathomable consequences for the
a consequence of mathematical determinism future. Becquerel's radioactivity, Roentgen's
reached its portentous finalization with the x-rays, J. J. Thomson's electron, Planck's quan-
Galilean-Newtonian revolution. Newtonian clas- tum, Bohr's atom, Rutherford's nucleus, Ein-
sical physics, associated with the world view of stein's relativity and equality of mass and energy,
the majestic Newtonian universe, eternal and represent a revolution that will carry the
infinite, was formulated in three laws of motion, exploring mind incomparably farther into the
climaxed by the universal law of gravitation. mysteries of the universe than Copernicus,
This physics continued in its progressive refine- Galileo, Kepler, or even Newton ever dreamed
ment until it was confirmed by the triumphant of. Heisenberg's principle of indeterminacy in
567 HISTORY OF PHYSICS

the realm of microphysics not only shatters the when he suggested our rooting in spatiotem-
once cherished corpuscular-kinetic determinism poral provincialism. The Victorian physicist op-
but points to a microcosmos much more com- erated in what Reichenbach called the "world
plex than what Whitehead called "provincialism of middle dimensions," his Newtonian-Euclidian
in time and space," and as valid in Newtonian space between microcosmos and macrocosm os.
mechanism. Combining this with the staggering In relativistic physics the rigid homogeneous
discoveries in astrophysical macrocosmos, the Newtonian-Euclidian infinite space melts away
Dopplerian red shift of external galaxies, quasars, in a constant spatiotemporal dislocation that
neutron stars, and black holes, we confront a depends on the presence of matter and a finite
truly unprecedented era of future centuries that four-dimensional universe. Matter produces
will bring about unimaginable amendments in local distortions of space-time metric with its
the physics we know today and its subsequent non-Euclidean geometry, which manifest them-
world view. selves through force fields, inertia, and the
As the end of the twentieth century looms "constant" of universal gravitation (G) that is
ahead with all the problems of turbulent hu- subject to change with the progressive expan-
manity combining utility with the catastrophic sion of the universe.
abuse of applied physical science, it is impor- In the microworld, Heisenberg's principle of
tant to note in retrospect that the unparalleled indeterminacy denies the objective reality of
advance of man's understanding of the physical sharply localized particles, so that the so-called
world in the present century was not appropri- particle blends into the background from which
ately balanced by any corresponding increase in it should be distinct to deserve to be identified
man's ethical enlightenment. Einstein's centen- as a particle. The shocking problem of the in-
ary, celebrated throughout the entire cultural constancy of the reality of such particles as
world in 1979, more than justifies the inevitabil- the electron is well described by Oppenheimer:
ity of this comment in this encyclopedia. The
genial author of the new, post-Newtonian con- If we ask whether the position of the electron
cept of our universe never failed to express his remains the same, we must say "No;" if we ask
concern in the fate of mankind alongside his whether the electron's position changes with time,
world view of New Physics. we must say "No;" if we ask whether the electron
The Second Scientific Revolution, as his- is at rest, we must say "No;" if we ask whether it
torians like to call it, starting at the turn of this is in motion, we must say "No."
century, is not only intellectually longer in its
three quarters of a century than were the three The Second Scientific Revolution has hardly
centuries since Newton, but far surpasses the begun to build its history with Einstein's un-
length separating the world view of Newton finished dream of unified field theory and un-
from that of Aristotle, two thousand years ago. fold a new universe from the immensity of
Even Laplace with his classical, mathematical extraordinary facts, whether observable or un-
physics belongs in a category that separates him observable, pointing to an unprecedented mag-
from such wizards of the New Physics of our nitude of the changing new world of physics
contemporary age as Einstein, Bohr, Planck, and its cosmology in the forthcoming centuries.
de Broglie, and Heisenberg. These conclusions It dwarfs all preceding world concepts of ac-
are drawn because both Newton and Aristotle commodating, enduring anthropomorphic views.
built world views from facts acquired by our
KAREL HUJER
sensory perceptions, as demanded by Victorian
physics. Yet we can never perceive nor directly
observe an electron, a nucleus, a quantum, or References
any ultimate particle of microphysics. We build
their representation from a highly complex lab- 1. Sarton, George, "A History of Science," 2 Vols.,
oratory procedure which does not guarantee Cambridge, Mass., Harvard Univ. Press, 1959.
their actual reality. The great pioneer of New 2. Singer, Charles, "A Short History of Scientific
Physics, Ernst Mach, once stated that "Senses Ideas to 1900," London, Oxford Univ. Press,
do not lie, but they do not tell us truth." This 1959.
was supplemented a century later by J. M. 3. Dijksterhuis, E. J., "The Mechanization of the
Jauch, the late Director of the Institute of World Picture," London, Oxford Univ. Press,
Theoretical Physics at the University of Geneva: 1969.
"Bohr, by denying objective reality to prop- 4. Capek, Milic, "Philosophical Impact of Contem-
erties whose simultaneous presence would re- porary Physics," D. Van Nostrand Co., 1961.
quire for their verification mutually exclusive 5. Frank, Philip, "Einstein, His Life and Time," New
physical situations, puts in question that very York, Alfred Knopf, 1947.
concept of reality which has been the corner- 6. Dingle, Herbert, "The Sources of Eddington's
stone of all of physics as it has developed from Philosophy," Cambridge, U.K., Cambridge Univ.
the time of Galileo and Newton." Press, 1954.
The dilemma arising from the Einsteinian 7. Koyre, Alexandre, "Des Revolutions des orbes
world view was aptly described by Whitehead celestes," Paris, Librairies Felix A1can, 1934.
HISTORY OF PHYSICS 568

8. Heisenberg, Werner, "Physics and Philosophy," incident on the object is scattered by it, and
New York, Harper Torch Books, 1958. this scattered radiation, now called the object
9. de Broglie, Louis, "Physics and Microphysics," wave, impinges on the recording medium. The
New York, Harper Torch Books, 1960. wave that proceeds directly to the recording
10. D'Abro, A., "The Rise of New Physics: Decline medium is called the reference wave. Since the
of Mechanism," New York, Dover Publications, object and reference waves originate from the
1951. same source, they are mutually coherent and
11. Jauch, J. M., "Are Quanta Real;" Bloomington, form a stable interference pattern when they
IN, Indiana Univ. Press, 1971. meet at the recording medium. The detailed
12. Oppenheimer, J. Robert, "Science and the Com- record of this interference pattern constitutes
mon Understanding," New York, Simon and the hologram.
Schuster, 1954. When the hologram is illuminated with a
13. Taylor, A.M., "Imagination and the Growth of beam similar to the original reference wave, it
Science," The Tallman Lectures, 1964-65, John modulates the phase and/or amplitude of the
Murrey Publications, 1966. illuminating wave in such a way that the trans-
14. Manuel, F. E., "A Portrait of Isaac Newton," mitted wave divides into three separate compo-
Cambridge, Mass., Harvard Univ. Press, 1969. nents, one of which exactly duplicates the
15. Calder, Nigel, "Violent Universe: An Eyewitness original object wave.
Account of the New Astronomy," New York, The If the two interfering beams are traveling in
Viking Press, 1969. substantially the same direction, the recording
of the interference pattern is said to be a Gabor
hologram or an in-line hologram. If the two in-
terfering beams arrive at the recording medium
HOLOGRAPHY from substantially different directions, the re-
In 1948, British scientist Dennis Gabor (1900- cording is a Leith-Upatnieks or off-axis holo-
1979) proposed and demonstrated a new, two- gram. If the two interfering beams are traveling
step method of optical imagery. However, be- in essentially opposite directions, the recorded
cause of the extreme difficulty of implementing hologram is said to be a Lippmann or reflection
the method in those early days, it was not until hologram, first invented by Y. N. Denisyuk.
the decade of the 1960s that the method be- Electromagnetic radiation is most commonly
came widely known and used. This renaissance used, although acoustic radiation can be used.
of interest in the method was largely due to the The most common electromagnetic radiation
general availability of the laser with the great employed is light, but holograms have also been
temporal and spatial coherence of its light and recorded successfully with electron beams,
to the efforts in the early 1960s of J. Upatnieks x-radiation, and microwaves.
and E. N. Leith of the University of Michigan. Holograms can be classified by the way they
As Gabor's method grew in significance, the diffract light. In an amplitude hologram the
genius of the idea was recognized and Gabor varying irradiance distribution of the interfer-
was awarded the Nobel Prize in physics in 1971. ence pattern is recorded as a density variation
The method proposed by Gabor is now of the recording medium. In this type of holo-
known as holography. It is similar to photogra- gram the illuminating wave is always partially
phy in many respects, yet is fundamentally absorbed, i.e., the illuminating wave is ampli-
different. With photography, one generally re- tude-modulated. In a phase hologram, a phase
cords, by means of lens and film, the two- modulation is imposed on the illuminating
dimensional irradiance distribution in the image beam which in turn results in diffraction of the
of an object. With holography, however, one light. Phase modulation occurs when the optical
records not the optically formed image of an path (thickness X index) varies with position. A
object but the object wave itself. This wave is phase hologram results from either relief-image
recorded (usually on photographic film) in such or index variation, or both.
a way that a subsequent illumination of this Either phase or amplitude holograms can be
record, called a hologram, reconstructs the classified further as Fresnel holograms or as
original object wave. A visual observation of Fraunhofer holograms. Generally speaking, if
this reconstructed wavefront then yields a view the object is reasonably close to the recording
of the object which is practically indiscernible medium, say just a few hologram or object di-
from the original, including three-dimensional ameters distant, the field at the hologram plane
parallax effects. is the Fresnel diffraction pattern of the object.
Figure l(a) shows schematically how a holo- A hologram recorded in this manner is termed a
gram is recorded. One starts with a single, mo- Fresnel hologram.
nochromatic beam of light that has originated If the object and hologram are separated by
from a very small source. This single beam is many object or hologram diameters, the field at
split into two components, one of which is di- the hologram due to the object alone is the
rected toward the object and the other to a Fraunhofer diffraction pattern of the object. A
suitable recording medium, most commonly a hologram recorded in this manner is termed a
photographic emulsion. The component that is Fraunhofer hologram.
569 HOLOGRAPHY

(bl

Illuminating Hologram
Beam

..... ...... "-


Primo ry Imag e ...... "-
(Virtuol l
"-
"-
"-
"-
'0
O'il
(el Hologram Con j ugate Image
(Reali

--------~~------------~------- g6
Conjugote tmoge
Illuminot i ng Beam (Reali

ol:. Primary Imoge


(Virtuoll

FIG. 1. A typical holographic arrangement. (a) Recording the hologram. (b) Reconstructing the primary
object wave. (c) Reconstructing an undistorted conjugate wave.

Any of these hologram types may be recorded ence pattern is sinusoidal. As long as the record-
as either a thick or a thin hologram. A thin ing is essentially linear (irradiance proportional
hologram is one for which the thickness of the to final amplitude transmittance), the hologram
recording medium is thin compared to the spac- will be a diffraction grating varying sinusoidally
ing between the recorded interference fringes. in amplitude transmittance, and 0nly the first
A thick or volume hologram is one in which the diffracted orders will be observe' '1!le of these
thickness of the recording medium is of the or- first-order waves will be travelinl n the same
der of or greater than the spacing of the re- direction as the object wave; this is the recon-
corded fringes. structed wave.
Conceptually, the simplest form of an off-axis To describe the recording of a hologram of a
hologram is one for which the object is just a more complicated object, let 0 be a mono-
single, infinitely distant point so that the object chromatic wave from the object incident on the
wave at the recording medium is a plane wave. recording medium H, and let R be a wave co-
If the reference wave is also plane, and incident herent with O. The wave 0 contains informa-
on the recording medium at an angle to the ob- tion about the object, since the object has
ject wave, the hologram will consist of a series uniquely determined the amplitude and phase
of Young's interference fringes. These recorded of O. The object can be thin and transmitting,
fringes are equally spaced straight lines running such as a transparency, or it can be opaque and
perpendicular to the plane of incidence. Since diffusely reflecting. Both the amplitude and
the hologram consists of a series of alternating phase of the wave 0 can be recorded with the
clear and opaque strips, it is in the form of a aid of the reference beam R. The total field on
diffraction grating. When the hologram is illu- H is 0 + R. A square-law recording medium,
minated with a plane wave, the transmitted such as a photographic emulsion, will respond
light consists of a zero-order wave traveling in to the irradiance of the light 10 + R 12. Assume
the direction of the illuminating wave, plus two that after processing, the hologram possesses a
first-order waves. The higher diffracted orders certain complex amplitude transmittance t(x)
are generally missing or very weak since the ir- that can be expressed as a function of the ex-
radiance distribution of a two-beam interfer- posure E(x),
HOLOGRAPHY 570

t(x) = f[E(x)]. (1) Holograms may be recorded with diverging,


parallel, or converging reference beams. If care
The expansion of this function will yield a term is taken to maintain the recording geometry
linear in exposure, and by ignoring all terms ex- during reconstruction, it is even possible to
cept this one, one can write form holograms with an arbitrary reference
t(x) = ~E(x) = ~ 10 + R 12 . T beam, the only requirement being that it be
coherent with the object beam.
=~T(1012 + IRI2 +OR*+O*R), (2) It is possible to produce color holograms by
recording three separate holograms on a single
where T is the exposure time, * indicates com- photographic plate, each in a different color.
plex conjugate, and ~ is the constant coefficient Subsequent illumination with a three-color
of the linear term. When the hologram with this beam yields three separate wave fronts, one in
amplitude transmittance is illuminated with a each of the three colors representing the por-
wave C, the transmitted field at the hologram is tion of the object corresponding to that color.
C . t (x) = ~ T[ C 10 12 + C 1R 12 It is possible to make holograms that can be
viewed in reflection. This is done by allowing
+ CR*O + CRO*]. (3) the reference and object beams to enter the re-
cording medium from opposite sides [Fig.
If the illuminating wave C is sufficiently uni- 3(a)]. The fringes formed are planes lying ap-
form so that CR * is approximately constant proximately parallel to the plane of the holo-
across the hologram, the third term of Eq. (3) gram. When such a hologram is illuminated by a
is ~TCR*O = const. X O. This term represents a beam similar to the reference wave, a reflected
wave identical with the object wave O. This wave is formed which exactly duplicates the
wave has all of the properties of the original object wave. The image is viewed in reflected
wave and can form an image of the object. The light [Fig. 3(b)]. It is possible to illuminate
fact that this wave is separated from the rest this type of hologram with white light. The in-
can be seen most clearly by analogy with the terference planes filter the light by acting as a
diffraction-grating hologram described above. It A/2 multilayer interference filter, in the same
can be shown that the other first-order dif- way as in Lippmann color photography.
fracted wave corresponds to the conjugate im- Many applications have been proposed for
age term ~TCRO* of Eq. (3) and that the zero- holography. The most important of these are
order wave corresfonds to the first two terms the ones that exploit the unique features of
~ TC( 101 2 + 1R 1 ). The object wave 0 is sepa- holography, rather than those that just do old
rated from the others and may be viewed tasks in a new way. The best example of this
independently. is holographic interferometry, whereby arbi-
Figure 1 illustrates the recording of a holo- trary wavefronts interfere with each other in
gram and the subsequent reconstruction. In such a way that interference bands are pro-
Fig. l(a), the laser beam is first expanded and duced that depict only the differences between
then divided by a mirror, which directs part of them. In practice the two wavefronts are nearly
the beam directly onto the photographic plate; identical. There are several types of holographic
the rest of the light is reflected from the object. interferometry .
After processing, the hologram plate may be re- For single-exposure holographic interferom-
placed in its original position [Fig. l(b)] and etry, a hologram of the object wave 0 is re-
the object removed. The light diffracted by the corded in the usual way. The hologram is then
hologram forms, in part, the same wavefront placed in exactly the same position it occupied
that was originally scattered by the object. A during exposure and it is illuminated with the
viewer looking through the hologram will see an reference wave and a slightly distorted object
undistorted view of the object, just as if it were wave 0'. The reference wave reconstructs the
still present. Figure 2(a) shows a photomicro- original object wave 0, which interferes with
graph of an actual hologram, and 2(b), (c), and the directly transmitted wave 0'. This is a real-
(d) photographs of three perspectives of the time technique, because when the wave 0'
resulting image. changes, so do the interference bands. This
In addition to this virtual image, or primary technique can be used to measure surface defor-
image, a real, or conjugate image will be formed mations of all kinds, including thermal expan-
on the observer's side of the hologram. This im- sion and contraction, swelling caused by ab-
age will appear unsharp and highly distorted, sorption, and any minute changes that might
and it will also be inverted in depth, i.e., re- occur in an object.
versed front to back, as shown in Fig. 1(b). Double-exposure holographic interferometry
However, a distortion-free real image can be requires making two holograms on a single re-
formed by changing the position of the illumi- cording medium. One of the two holograms
nating beam so that all of the rays of the refer- yields a primary image which constitutes the
ence beam are reversed in direction. In this way, comparison wave, just as in the single-exposure
an undistorted, real, three-dimensional image of case. The test wave is not the object itself, how-
the object scene appears in front of the holo- ever, but a reconstructed wave from the
gram as shown in Fig. l(c). changed object. Interference phenomena,
571 HOLOGRAPHY

FIG. 2. A hologram of a diffusely illuminated object. (a) A highly magnified image of the hologram. (b), (c),
(d) Three perspectives of the resulting image.

(a) (b)

Fringes

Reference . ~ Illuminating "-


,,- '-
Beam ---<.~ ~ Obiect Beam -
" '- "Primary
~ Be'am " Image
(Virtual)
Recording ~ Hologram
Medium

FIG. 3. Volume holograms that can be viewed in reflection. (a) Recording the hologram with object and
reference waves incident in nearly opposite directions. (b) Reconstructing the primary wave in reflection.
HOLOGRAPHY 572

caused by changes in optical path through the object clearly and concisely. Complicated mo-
object between exposures, are produced when lecular or anatomical structure can be simply
the doubly exposed hologram is illuminated. presented with a single holographic image, with
This technique is well suited to interferometric little chance of error or misinterpretation on
recording of transient phenomena, such as the part of the viewer. Such a hologram could
shock waves and fluid flow, when a pulsed laser take the place of several conventional drawings
is used as the source. All of the principles dis- or photographs. The use of holograms in text-
cussed thus far apply equally well to time-de- books would be a great aid to the student in
pendent events, and the very short pulse of light many fields. Holograms made to be viewed with
from a ruby laser can record the interference a small penlight and a colored filter have al-
phenomena at a single instant of time. The very ready been produced in large quantities and
wide range of applicability of the method has distributed in magazines and books.
been well demonstrated. This technique can Holographic microscopy is another important
also be extended to multiple exposures. application. There are basically two distinct
The idea of multiple-exposure interferometry methods of holographic microscopy: (l) con-
can be extended to the limiting case of a ventional holography with magnification
continuum of exposures, resulting in what is achieved by changing the scale of the hologram,
called time-average holographic interferometry. the illuminating wavelength, or the radius of
This technique lends itself nicely to the prob- curvature of the illuminating wavefront; by op-
lem of vibration analysis and may well be the tically magnifying the holographic image; or by
best method yet devised for such analysis. The using any combination of these; or (2) holo-
basis of the method is that since holography graphically recording the optically magnified
itself is an interferometric process, any instabili- wavefront. For the first method, the lateral
ties of the interferometer cause fringe motion. magnification is given by the formula
Thus the hologram of a vibrating object is a
record of the time-average irradiance distribu- prnZcZR
tion at the hologram plane. Since the amount M= 2'
J,lZc(ZR - Zo) + m ZoZR
of light flux diffracted from any region of the
hologram depends on the fringe contrast, any where p. is the ratio of the illuminating and re-
object motion that causes the fringes to move cording wavelengths, m is the factor by which
during the exposure, causing a loss of contrast, the hologram has been scaled, and Zc, Zo, and
will result in less diffracted flux from that re- Z R are the radii of curvature of the illuminating,
gion of the hologram. The strength of the re- object, and reference wavefronts, respectively.
constructed wave is therefore a function of the Because aberrations are introduced when the
fringe motion during the exposure. If the object wavelength is changed or the wavefront radii
is vibrating in a normal mode, there will be are changed, most of the current work in holo-
standing waves of vibration on the surface, so graphic microscopy involves optically magnify-
that at the nodes the object motion will be very ing the holographic image or using the second
small or nonexistent. At the antinodes the vi- method, in which an optically magnified wave-
bration amplitude will be large. A hologram of front is recorded holographically. Holographic
such an object will then produce a bright image microscopy offers the following advantages:
of the regions of the object for which little or One can avoid the problems of limited depth of
no motion occurred during the exposure, focus, off-axis observation, and the short work-
whereas it will not produce images of antinodal ing distance of the classical microscope. The
points at all. This holographic method of vibra- hologram records a large volume of object space
tion analysis has all of the advantages of all instantaneously. All of the usual image-process-
holographic interferometry. The method can be ing techniques can be applied to the recon-
used regardless of the shape or complexity of structed wave, which may represent a large vol-
the object; the vibration nodes cim be examined ume of space at an instant of time. No other
in three dimensions, or at least from a variety method for doing this exists.
of perspectives; and the method works regard- Recently, holograms used as optical elements
less of whether the surface is optically smooth (holographic optical elements, or HOEs) have
or diffusely reflecting. become an important aspect of holography. An
One of the most striking aspects of the mod- HOE is a generalized diffraction grating having
ern hologram is the three-dimensional image many advantages over conventional optics (such
that it is capable of producing. This three- as their light weight, thinness, and low cost),
dimensional image indicates that there is a large along with some disadvantages (such as requir-
amount of information contained in a single ing narrow bandwidth light). In principle, HOEs
hologram-much more than is contained in a are able to perform the functions of conven-
conventional photograph of the same size. Be- tional optical elements, such as beamsplitters,
cause of the many perspectives available, the lenses, scanners, mirrors, and even spectral fil-
hologram is well suited to display purposes. ters. HOEs can not only be designed with a
With a hologram, one can present all of the ob- computer, as can conventional optical elements,
servable characteristics of a three-dimensional but they can also be generated with a com-
573 HOLOGRAPHY

puter, allowing inexpensive fabrication of such "Optical Holography," New York, Academic Press,
things as aspherics and off-axis elements. 1971.
4. Vest, C. M., "Holographic Interferometry," New
HOWARD M. SMITH York, John Wiley & Sons, 1979.
5. Lee, T-C., and Tamura, P. N. (Eds.), Proc. SPIE
References 215, "Recent Advances in Holography," 1980.

1. Smith, H. M., "Principles of Holograrhy," New


York, Wiley-Interscience, 1969. Cross-references: COHERENCE; DIFFRACTION BY
2. DeVelis, J. B., and Reynolds, G. 0., "Theory and MATTER AND DIFFRACTION GRATINGS; INTER-
Applications of Holography," Reading, Mass., Ad- FERENCE AND INTERFEROMETRY; LASER;
dison-Wesley, 1967. OPTICS, GEOMETRICAL; OPTICS, PHYSICAL;
3. Collier, R. J., Burckhardt, C. B., and Lin, L. H., WAVE MOTION.
I
IMPULSE AND MOMENTUM two billiard balls, A and B. Taking each ball
separately, the impulse on A by B is equal in
The concept of impulse and momentum derives magnitude but opposite in direction to the
directly from Newton's law of motion. impulse on B by A. Thus, upon collision, the
Consider first the case of linear impulse and change of momentum of A is also equal and
linear momentum. Newton's law states that in opposite to that of B. On the other hand, if we
the proper frame of reference, force is equal to take both A and B as our system, then the two
the (time) rate of change of momentum, where impulses are acting on the same system. Since
momentum is defined as the product of mass and the two are equal and opposite, they cancel out
velocity. Consider a force F acting for a time and the total impulse on this system due to col-
D. t on a particle of mass m, thereby changing its lision is zero. The Conservation Law of Momen-
velocity from VI to V2' The rate of change of tum then predicts that the total momentum of
momentum in this case is the change of momen- the system (which is the sum of momenta of all
tum divided by the time interval during which particles in the system) remains a constant no
the change occurs. Thus, Newton's law states matter what goes on in the system as long as
there are no external forces acting on the sytem.
(mvh-(mV)1 The Conservation Law of Momentum forms
F = -'-----'-..::---'--:...:. (1)
D.t one of the basic cornerstones in physics and
engineering. Its application is all-pervading, from
Now if we multiply each side of this equation by the motion of stars to the encounter and scat-
D.t, we obtain tering of molecules, atoms and electrons.
F D.t = (mvh-(mv)1 (2)
We will now continue our discussion at a more
precise level. First of all, in calculating the
The left-hand side of this equation, F II t, repre- momentum mv, the mass m should be relativis-
senting the product of a force and the time tic mass defined as
interval through which the force acts, is called mo
"impulse." Thus, this equation states what is m =y
-"F-}=_=:=(v====jC=:)=:='2
often known as the Law of Impulse and
Momentum: Impulse is equal to the change of where mo is the rest mass, i.e., mass at zero velo-
momentum. city, and c is the velocity of light. It !S seen .that
It becomes apparent that a body will experi- at velocities much less than the velOCIty of hght,
ence the same change of momentum irrespective the relativistic mass and the rest mass are indis-
of the separate values of F and II t as long as their tinguishable, and m may be considered constant.
product F llt is the same. Thus, a large force Secondly, both force and velocity. are vector
acting briefly may have the same net effect as a quantities, i.e., they have a magmtude and a
smaller force acting longer, if the two impulses direction and obey the parallelogram law of
are the same. Going to the limit, we may con- addition, and we shall use letters F and v to
sider an infinite force acting for an infinitesimal represent them. Finally, as the time intervalll t
time such that their product remains a finite approaches zero as a limit, the rate of change of
quantity. Under the action of such an impulse, momentum during llt becomes a derivative and
a body will experience an instantaneous change Newton's law becomes:
of velocity. A common example is the change
of velocity of a baseball as it is hit by a bat. For d
all practical purposes the change occurs instan- F =-(mv) (3)
taneously. dt
As a consequence of the Law of Impulse and After integrating each side of this equation with
Momentum described above, we find that if the respect to time t and taking the integration
total force is zero, so is the total impulse, and limits from t = tl to t = t2, the result is:
the momentum will remain unchanged. This is
known as the Conservation Law of Momentum. t2
]
It applies either to one particle or to a system of F dt = (mvh - (mv)1 (4)
particles. Consider, for instance, the collision of tl

574
575 IMPULSE AND MOMENTUM

The integral on the left-hand side, analogous to We now integrate this equation with respect to t
the left-hand side of Eq. (2), is the "impulse," and take the integration limits from t = t, to
and we again reach the Law of Impulse and Mo- t = t2. The result is
mentum given previously. In evaluating this in-
tegral, we must know the variation of F as. a
function of time, i.e., F = F(t). Furthermore, 1ll (6)
applying this law to a system of particles, we
need only to count those impulses that are
caused by external forces acting on the particles Analogous to the linear impulse, we may call
due to sources outside the system. As has been the integral on the left-hand side of the above
illustrated by the previous example on two bil- equation the angular impulse. Thus this equation
liard balls, the internal forces that any two par- states: Angular impulse is equal to the change of
ticles exert on each other will generally cancel angular momentum. In particular, if the total
out if both particles are included in the system. torque is zero, so is the total angular impulse,
A dramatic illustration of the above principle and the angular momentum will remain un-
is the case of an exploding shell. Before the ex- changed. This is known as the Conservation
plosion, the shell will travel a parabolic path Law of Angular Momentum. •
under the influence of the only external force,
When applied to common problems, ~e take
the gravitational force, if the forces due to air the axial component of the vector equatIOns (5)
resistance are neglected. During explosion, all
the forces acting on the exploding pieces due or (6), i.e., along the direction perpend~cula! to
to explosion are internal forces and they cancel the plane of motion. Along such a duectIOn,
out if we include all pieces of the shell as our the magnitudes of T and L assume the follow-
system. The conservation law of momentum ing values:
then predicts that the center of gravity of the
system will continue to travel along the same T=rFt
parabolic path even after explosion, until some
new external force acts on the system (as, for L = mrVt
example, when a piece of the shell hits an
object). where r again is the distance from the axis to
Up to this point we have discussed the la~s of the particle, Ft the tangential component of the
linear impulse and linear momentum. Entnely force acting on the particle, and Vt the tangen-
similar laws hold for angular impulse and angular tial velocity of the particle. Equation (5) then
momentum. The angular momentum L of a becomes a scalar equation.
particle about a point 0 is defined as the vector
product of r, the radius vector from 0 to the d
particle, and mv, the momentum vector of the T = rFt = - (mrvr) (7)
dt
particle. Th.us,
L = r X mv If T = 0, then the angular momentum mrVt re-
mains constant.
The moment of force or torque about 0 is de- Equation (7) is the basis of the operation of
fined as pumps and turbines. In the case of a turbine,
the angular momentum of the working fluid is
T = rXF large at entrance and small or zero at exist.
Hence the fluid imparts a torque to the shaft.
If we take the cross product of r with each side In a pump or compressor the reverse is the case,
of the expression for Newton's law, Eq. (3), we i.e., the angular momentum of the working
obtain fluid is small at entrance but large at exit, hence
d the shaft supplies torque as well as energy to
r X F = T = r X - (mv) the fluid.
dt A remarkable example of the conservation
law of angular momentum is the motion of
The right-hand side can be identified to be just planets around the sun. The attractive force be-
dLldt on account of the vector identity: tween a planet and the sun is in the direction
of the line connecting the two, hence it has
dL d d no tangential component and contributes no
- = -(r X mv) = v X mv + r X -(mv) torque. Equation (7) then concludes that rVt
dt dt dt must be a constant. This means that when the
where the first term on the right-hand side planet is at a larger distance from the sun, its
vanishes. Thus, tangential velocity must be smaller, and vice
versa. This led Newton to explain the experi-
mental finding of Kepler, namely, a planet
dL sweeps equal sectorial areas in equal times. The
T=- (5)
dt same explanation applies to figure skaters and
IMPULSE AND MOMENTUM 576

ballet dancers when they increase their spin It might appear from Eq. (2) that the emf
speed by pulling in their arms. depends on the forward velocity with which the
test charge is moved along the path C. However,
HSUAN YEH this is not the case. If v and dl in Eq. (2) have
the same direction, then the vectors (v X B) and
References dl are normal, and their scalar product is zero.
Consequently, only the component of v normal
Yeh, Hsuan, and Abrams, Joel I., "Principles of Me- to dl can contribute to the emf. This component
chanics of Solids and Fluids," Vol. I, New York, has value only if the differential path length dl
McGraw-Hill Book Co., 1960. has sideways motion. Thus v in Eq. (2) repre-
Synge, John L., and Griffith, Byron A., "Principles of sents the sideways motion, if any, of dl. The
Mechanics," New York, McGraw-Hill Book Co., fields E and B of Eq. (2) may be functions of
Third Edition, 1959. time as well as functions of the space coordi-
Goldstein, Herbert, "Classical Mechanics," Reading, nates. In addition, the velocity v of each dif-
Mass., Addison-Wesley, 1950. ferential path length dl may vary with time.
Feynman, Richard T., "Lectures on Physics," New However, Eq. (2) correctly expresses the emf,
York, Addison-Wesley Pub. Co., 1963. or voltage drop, along the path C as a func-
Simon, Keith R., "Mechanics," 3rd. Ed., New York, tion of time. That component of the emf
Addison-Wesley Pub. Co., 1971. consisting of the line integral of v X B is known
Cross-reference: CONSERVATION LAWS AND SYM· as the motional emf, because it has value on~y
METRY, DYNAMICS, MECHANICS, ROTATION-
when the path C is moving through a magnehc
CIRCULAR MOTION, STATICS, VECTOR PHYSICS.
field, cutting lines of magnetic flux. For station-
ary paths there is no motional emf, and the volt-
age drop is simply the line integral of the elec-
tric field E.
INDUCED ELECTROMOTIVE FORCE For an emf to exist along a stationary path, it
is necessary to have an electric field present. As
Electromotive force and voltage drop are usu- electric charges are surrounded by electric fields,
ally regarded as synonymous. When an electro- emfs are generated by devices that separ~te
motive force, or simply emf, is impressed on a charge. A familiar example is the battery, WhICh
closed metallic circuit, current results. The emf utilizes chemical forces to separate charge. Some
along a specified path C in space is defined as other methods of separating charge are the heat-
the work per unit charge W/q done by the elec- ing of a thermocouple, the exposure of a photo-
tromagnetic fields on a small test charge moved cell to incident light, and the rubbing together
along C. Since work is the li~e integ!al C?f force of different materials. Electric fields are also
F, the work per unit charge IS t~e line mtegral produced by tirne-changi~g magnetic. fields, an.d
of the force per unit charge. Lettmg F /q denote this principle is extensIvely exploIted, as IS
the vector electromagnetic force per unit charge motional emf, to generate electric power. The
in newtons per coulomb, we have

=1 ~.
remainder of this article is devoted to electro-
motive force induced by magnetic means.
emf dl volts (1) A fundamen tal law of electromagnetism, often
cq called the Maxwell-Faraday law, or the first law
The scalar product (F /q) • dl is the product of electromagnetic induction, states that the line
(F/q) cos (J dl, with (J denoting the angle be- integral of the electric field intensity E around
tween the vectors F /q and dl. any closed path C equals -acp/at, with cp repre-
The electric force per unit charge is the elec- senting the magnetic flux over any surface S
tric field intensity E (volts per meter) an~ the having the closed path C as its contour. The
magnetic force per unit charge is v X B, wIth. v positive side of the surface S and the direction
denoting the velocity of the test charge ~n of the line integral around the contour C are re-
meters per second and B denoting the magnehc lated by the right-hand rule; by this rule, the
flux density in webers per square meter. In curled fingers are oriented so as to point around
terms of the smaller angle (J between v and B, the loop in the direction of the integration.a!1d
the cross product v X B is a vector having mag- the extended thumb points out of the poslhve
nitude vB sin 0; the direction of the vector side of the surface S. The magnetic flux cp is the
v X B is normal to the plane of the vectors v and surface integral of the magnetic flux density B;
B with the sense of that of the extended thumb that is,
of the right hand oriented so that its fingers curl
through the angle (J from v toward B. As the
total force per unit charge is E + v X B, the emf
in terms of the fields is
cp = IIs B • dS webers (3)

""1f =Ic (E + v X B) • dl (2)


In Eq. (3) the vector differential surface dS has
area dS and is directed normal to the plane of
dS out of the positive side. The partial time de-
577 INDUCED ELECTROMOTIVE FORCE

rivative of I/> is defined as Clearly the integral of Bn v around the closed


contour C, with v denoting the magnitude of

al/> =
at
Jfs aB. dS volts
at
(4)
the sideways velocity of each dl, is simply the
time rate of increase of the magnetic flux over
the surface bounded by C due to the path C
cutting lines of magnetic flux. Hence, the com-
and this is often referred to as the magnetic cur- plete expression in brackets is the time rate of
rent through the surface S. For a moving surface increase of the magnetic flux 1/>, over any surface
S the limits of the surface integral of Eq . (4) are S bounded by the closed path C, due to the
functions of time, but Eq. (4) still applies. It is changing magnetic field and also due to the
important to understand that in evaluating moving path cutting through the magnetic field .
al/>/at over a surface that is moving in a region Equation (8) is often written
containing a magnetic field, we treat the surface
at the instant under consideration as though it dl/>
were stationary. The partial time derivative of
emf=- - (9)
dt
I/> is the time rate of increase of the flux over the
surface S due only to the changing magnetic It is important to note carefully the distinc-
field B; any increase in I/> due to the motion of tion between Eqs. (5) and (9). Equation (5) is
the surface in the B-field is not included. The only the variational emf, and Eq. (9) is the sum
Maxwell-Faraday law is of the variational and motional emfs. In Eq. (5),
the partial time derivative of the magnetic flux
I/> is the rate of change of the flux due only to
£ E' dl = _ al/> (5) the time-changing magnetic field; in Eq. (9),
Ie at the total time derivative is the rate of change of
the flux due to the time-changing B-field and
with I/> being the magnetic flux in webers out of also to the path cutting through the magnetic
the positive side of any surface having the path field . Of course, if the closed path is not cutting
C as its contour. The small circle on the integral lines of magnetic flux, then Eqs. (5) and (9) are
sign indicates a closed path. We note from Eq. equivalent. It is also important to note that
(5) that an electric field must be present in any dl/>/dt in Eq. (9) does not necessarily mean the
region containing a time-changing magnetic total time rate of change of the flux I/> over the
field. surface S. For example, the flux over a surface
The application of Eq. (2) to a closed path C S bounded by the closed contour C of the left-
gives hand electric circuit of Fig. I is changing when
the coil is being unwound by the rotation of the
emf= fe E' dl+ fe (vX B)' dl (6)
cylinder. However, as B is static there is no vari-
ational emf, and since the conductors are not
cutting flux lines, there is no motional emf.
Consequently, dl/>/dt as used in Eq. (9) is zero
Utilizing Eq. (5) enables us to write Eq. (6) in even though the flux is changing with time. Note
the form that df/>/dt in Eq. (9) was defined as representing

emf = - a
- I/> +
at
fe
(v X B) • dl (7)
the bracketed expression of Eq. (8), and df/>/dt
must not be more broadly interpreted.
In the applications of the equations which
have been presented, we must refer all flux den-
sities and movements to a single specified coor-
Thus the emf around a closed path consists, in dinate system. In particular, the velocities are
general, of two components. The component with respect to this system and are not relative
- al/>/at is often referred to as the variational velocities between conductors and moving lines
emf (or transformer emf), and the second com- of flux. Of course the coordinate system is arbi-
ponent is, of course, the motional emf. trarily selected, and the relative magnitudes of
In Eq. (7) the relation (v X B) • dl can, by the variational and motional emfs depend upon
means of a common vector identity, be replaced the selection. Let us consider two examples.
with - B • (v X dl has magnitude v dl and Example 1 An electric generator is shown in
of dl, the vector v X dl has magnitude v dl and Fig. 2. The parallel stationary conductors sep-
direction normal to the differential surface dS arated a distance I have a stationary voltmeter
swept out by the moving length dl in the time connected between them. The electric circuit is
dt. Letting Bn denote the component of B nor- completed through a moving conductor that is
mal to this area, we note that - B • (v X dl) be- connected electrically by means of sliding taps.
comes - Bn v dl, and Eq. (7) can be written This conductor is at y = 0 at time t = 0 and
moves to the right with constant velocity v =
vay. The applied flux density B, represented in
Fig. 2 by dots, is Bo cos ~y cos wt ax. Unit vec-
tors in the directions of the respective coordi-
INDUCED ELECTROMOTIVE FORCE 578

A
FIG. 1. The current generator produces a steady magnetic flux in the
iron cylinder, which rotates as the wire is pulled at A .

x
FIG. 2. Elementary electric generator.

nate axes are ax, ay, and az. Find the instanta- The motional emf is the line integral of v X B
neous voltage across the voltmeter. along the path of the moving conductor. As
Solution. Let S denote the plane rectangular v X B is - Bov cos {3y, cos wt az and as dl is
surface bounded by the closed electric circuit, dz az, we evaluate the integral oC-Bov cos {3y,
with the positive side selected as the side facing cos wt dz from z = 0 to z = I, obtaining a mo-
the reader. The counterclockwise emf around tional emf of - Bolv cos (3y, cos wt. This com-
the electric circuit is -dC/J/dt, with C/J signifying ponent results from the cutting of lines of mag-
the magnetic flux out of the positive side of S. netic flux by the moving conductor.
As dS = 1dy ax, the scalar product B • dS is If the voltmeter draws no current, there can
Bol cos ~y cos wt dy; integrating from y = 0 to be no electromagnetic force on the free elec-
y = y, gives trons of the wires. Therefore, the emf along the
path of the metal conductors, including the
C/J = (Bol/~) sin ~y, cos wt (10) moving conductor, is zero. The total voltage of
Eq. (11) appears across the voltmeter.
with y, denoting the instantaneous y-position Example 2 Suppose the conductor with slid-
of the moving wire. The counterclockwise emf ing taps in Fig. 2 is stationary (v = 0) and lo-
is found by replacing y, with vt and evaluating cated at y = y, . Also suppose that the magnetic
-dC/J/dt. The result is field B is produced by a system of steady cur-
emf = wBol/{3 sin {3vt sin wt rents in conductors (not shown in Fig. 2) that
are moving with constant velocity v = val. At
- B olv cos (3vt cos wt (11) time t = 0 the magnetic field B is Bo sin py ax.
Determine the voltage across the voltmeter.
The variational (transformer) component is Solution. There is no motional emf because
-aC/J/at, which is determined with the aid of Eq. the conductors of Fig. 2 are stationary with re-
(10) to be wBol/{3 sin (3y, sin wt, with y, = vt. spect to our selected coordinate system. How-
This is the first component on the right side of ever, the magnetic field at points fixed with
Eq. (11). Note that y, was treated as constant respect to the coordinate system is changing
when evaluating the partial time derivative of C/J. with time, and hence there is a variational emf.
579 INDUCTANCE

As the B-field at t = 0 is Bo sin ~y ax and Neumann equation*


moving with velocity v ay, the B-field as a func-
tion of time is Bo sin W(Y - vOl ax. This is
verified by noting that an observer at Yo at t = 0
moving in the y-direction with the velocity v of
MI2 =M21 =~;f 12 dl l ~ dl 2
the moving current-carrying conductors, would
have a y-coordinate of Yo + vt; hence according where dl l and dl 2 are vector elements of length
to the expression for B, he would observe a con- in circuits 1 and 2, respectively, and r is the
stant field. The magnetic current density is distance between these two elements. Note that
this expression is completely symmetrical with
aB/at = - ~vBo cos ~(y - vt) ax respect to the two circuits. The self-inductance
The negative of the integral of this over the rec- is calculated from the same equation, where the
tangular surface bounded by the electric circuit, elements dl l and dl 2 are now situated on the
with the positive side selected as the side facing same circuit. The double integral is evaluated by
the reader and with y limits of zero and y I, first keeping the position of dl 2 fixed and inte-
gives the counter-clockwise emf. The result is grating dll/r around the circuit. The process is
then repeated for all other elements such as d1 2 ,
emf=Bolv[sin~(YI - vt)+sin~vt] and the results are summed.
The ex ternal inductance of a circuit is the part
This is the voltage across the meter. of its self-inductance which is due to flux lying
outside the surface of the conductor while the
CHARLES A. HOLT internal inductance is the contribution from the
magnetic flux within the conductor itself.
References If the current, and therefore the magnetic flux
Bewley, L. V., "Flux Linkages and Electromagnetic associated with it, varies with time, an electro-
motive force (emf) will be induced in any circuit
Induction," New York, Dover Publications, 1964. linked by the flux. This provides an alternative
Fano, R. M., Chu, L. J., and Adler, R. B. "Electro- method of defining inductance in terms of the
magnetic Fields, Energy, and Forces," New York, emf induced by a given rate of change of current.
John Wiley and Sons, 1960.
Holt, C. A., "Introduction to Electromagnetic Fields di l
and Waves, New York, John Wiley & Sons, 1963. &2 =-M-
Moon, P., and Spencer, D. E., "Foundations of Elec- dt
trodynamics," New York, Van Nostrand Reinhold,
1960.

Cross-references: AL TERNA TING CURRENTS, CIR·


CUITRY, INDUCTANCE, INDUCTION HEATING, di
& =-L-
POTENTIAL. dt
The negative signs are used to imply that the
direction of the emf is always such as to tend to
INDUCTANCE oppose the change of current (Lenz's law). Note,
however, that although the self-inductance must
Inductance is a ratio of a magnetic flux <I> to an always be positive, the mutual inductance of
electric current i. The unit is the henry; I henry. two circuits may be either positive or negative.
== 1 weber/ampere. The mutua! inductance M of An inductance of I henry corresponds to an
two circuits is defined as the ratio of the mag- induced emf of 1 volt for a rate of change of
netic flux, linking with one circuit, to the cur- current of I ampere/sec.
rent in the other In order to maintain a current i, the induced
emf - Ldi/dt must be balanced by an equal and
<1>2 <1>1
M 12 = -.-, M2I = -.- opposite applied voltage so that the total applied
II 12 voltage is
The self-inductance L of a single circuit is defined di
as the ratio of the flux linking the circuit to the v=Ri+L -
current flowing in the circuit. dt

<I> and the power supplied to the circuit is


L=-
i
vi = Ri 2 + ~(~ Li 2\
In the absence of any magnetic material, M dt 2 J
and L depend only on the geometry of the
circuits concerned. The mutual inductance of *Ito is the permeability of free space; Ito = 41T X
two circuits 1 and 2 can be calculated from the 10- 1 henrys/m.
INDUCTANCE 580

where the resistanceR is a measure of the power wL


dissipated . This equation shows another way of Q = - = tan I/>
R
interpreting inductance, namely as a measure of
the amount of energy stored in the magnetic where I/> is the phase angle of the complex im-
field when a given current flows. The stored pedance z.. An alternative term is the power-
energy is !Li2 joules when L is measured in factor cos 1/>, particularly when the emphasis is
henrys and i in amperes. on power dissipated rather than on the damping
Electric circuit theory is based on the use of of tuned circuits.
sinusoidally varying currents and voltages. When An impedance Z can be represented by an
the current varies sinusoidally with time, the equivalent circuit consisting of a resistance R in
rate of change of current has the same time series with an ideal series inductance L. An alter-
waveform except for a phase shift of Tr/2 radians. native is to start with the admittance Y = lIZ
This leads to the idea of a complex impedance, and to represent this by the parallel combination
of which the real part is associated with the of a resistance R' and a parallel inductance L'.
power lost from the circuit and the imaginary
part is a reactance given by the ratio of the R'
R=--
magnitude of the induced emf to that of the 1+ Q2
current. Since the phase of the applied voltage
leads that of the current, the magnetically
induced reactance is taken as being positive. We L=L(~)
1+ Q2
thus have
Note that, when Q is sufficiently large,
i = I sin wt
L'~L
di
v=Ri+L - R'~Q2R
dt
Inductance Coil (Inductor). This is a device
The circuit impedance is Z where which is specially designed to possess inductance.
v The winding may have a ferromagnetic core
Z = -:- = R + jwL composed of a dust-core material, metallic lam-
I inations or ferrite. An air-cored coil is one with
no magnetic core.
The self-inductance of a circuit can be regarded Although it is the inductance which is of
as a parameter which determines the inductive interest, the influence of the electric field often
reactance presented to a sinusoidally varying cannot be neglected and causes the effect known
current of given amplitude and frequency . In the as self-capacitance. At relatively low frequencies,
same way, the mutual inductance determines the inductor will behave as if an equivalent
the mutual reactance between two circuits. lumped capacitance were shunted across its
Owing to magnetic hysteresis, the voltage ana" terminals. This simple equivalent circuit fails
current time waveforms cannot both be sinusoi- as the frequency is approached at which the
dal if any magnetic material is present. Under apparent lumped capacitance would resonate
these conditions, the reactance is defined as the
ratio of the fundamental-frequency components with the inductance of the coil. The problem
of the voltage and current waveforms. then becomes one of electromagnetic wave
The sign of the mutual inductance can be propagation, and the concept of inductance is
specified in the following way. Suppose that the no longer relevant. The effect of a fixed parallel
two circuits are connected in series. Then the capacitance C is to reduce the Q-factor to Qc,
total induced emf will be where

di
- (L 1 + L2 + 2M)-
dt
When Q is large, this becomes
and the total reactance will therefore be
Qc ~ Q(I-w 2 LC)
jw(L 1 + L2 + 2M)
The self-resonance effect is sometimes an
and will be greater than the sum of the reac- advantage, e.g., in choke coils, where the object
tances of the separate circuits when M is positive. is to obtain a high impedance, irrespective of its
A combination of two coils is said to be series phase angle. Generally however self-capacitance
aiding or series opposing, depending on whether is an undesirable property, particularly in elec-
they are connected so that M is positive or nega- trical networks in which the inductor forms part
tive, respectively. of a series resonant circuit.
The Q-factor is used in describing the proper- The self-capacitance of a coil can be kept to a
ties of an inductor. It is defined as minimum by spacing the turns of the winding
581 INDUCTANCE

well apart and also, in the case of a multi-layer tion of the electromagnetic field becomes com-
winding, by ensuring that wires which lie physi- parable with the thickness of the laminations.
cally close together always belong to adjacent The depth of penetration d is given by
parts of the winding so that the potential dif-
ference between them is relatively small. 2
The inductance of some types of coil can be d 2 = w/lo/lO
calculated directly, without recourse to the
Neumann equation . An example is the toroidal where 0 is the conductivity of the core material.
coil, i.e., one consisting of a uniform winding The inductance of a coil with a magnetic core
around a ring-shaped former. In this case the is reduced if there is a superimposed unidirec-
magnetic flux exists in a well-defined magnetic tional polarizing flux, caused for example by a
circuit and the inductance is given, very nearly, dc current in the winding. This may be an un-
by wanted effect or it may be used as a means of
L = /loN2A controlling the inductance.
Transverse air gaps are often introduced into
I the magnetic circuit. There are several reasons
where N is the number of turns in the winding for this. An air gap reduces the flux density for
and A and I are, respectively , the cross-sectional a given magnetizing force, thus reducing the
area and the length of the magnetic circuit. The inductance of a given coil. The power losses are,
same formula also holds for a long, thin sole- however, reduced at a greater rate than the
noidal coil. Corrections must be applied unless inductance so that the power factor is improved.
both the diameter of each tum and the radial The performance of the inductor is made less
depth of the winding are small compared with dependent on the magnetic parameters of the
the length of the magnetic circuit for the toroid core material. The waveform distortion due to
or the length of the coil itself for the solenoid. hysteresis is also reduced . An air gap may be
If the toroidal former is replaced by one made
used to prevent saturation of the core by the
of a magnetic material having a relative per- polarizing flux when the coil is required to carry
meability /l, the inductance will be increased by a dc current.
a factor /l. For coils of other geometrical shapes, If the inductance of a coil, with a closed mag-
where some of the flux linking the winding may netic circuit of length t, is L, an air gap of length
g will reduce the inductance to Lg where
lie partly or wholly outside the magnetic core,
the presence of the latter will increase the induc-
tance by a factor called the effective permeabil-
ity /le. This can never exceed /l and may be con- Lg=L· [ /l ]
siderably smaller than /l . I + /l(glt)
The relative permeability of a ferromagnetic
material is not a constant but is a function of As the gap ratio gil is increased, the inductance
the instantaneous flux density . This increases tends to become independent of /l, particularly
power losses and causes distortion of the cur- when /l is large. Note however that the formula
rent and voltage time waveforms. assumes that the presence of the air gap does not
Power dissipation in a metallic core can be change the geometry of the flux distribution; an
represented approximately by an equivalent assumption which is generally only justified for
series resistance R, where relatively small gap ratios.
Dust-cored or ferrite-cored coils, for which the
R core eddy-current losses are small, are often
- - = c + hB max + ef
/lefL wound with stranded wire. This is done to keep
the eddy-current losses in the winding low by
and Bmax is the peak flux density. The param- ensuring that the individual conductor strands
eters c and h depend on the hysteresis properties have diameters which are small compared with
of the core material. The parameter e is a mea- the depth of penetration of the electromagnetic
sure of eddy-current losses in the core. field into the material of which they are made.
Ferrite materials are practically insulators; At very high frequencies, where this condition
therefore eddy-current losses are negligible. For cannot be met, solid wire is often used and the
a ferrite, however, the relative permeability must diameter of the wire is increased to compensate
be regarded as a complex quantity for the fact that only part of its cross-sectional
area is effective as a conductor.
/l=/l'(I-jtano)
v. G. WELSBY
Both /l' and the dissipation coefficients tan 0 are
functions of frequency. References
The useful frequency range of coils with lami-
nated cores is often limited by magnetic skin Scott, W. T., "The Physics of Electricity and Mag-
effect. This causes both the inductance and the netism," New York, John Wiley & Sons, 1959.
Q-factor to start to fall as the depth of penetra- Plonsey, R., and Collins, R. E., "Principles and Appli-
INDUCTANCE 582

cations of Electromagnetic Fields," New York, Mc- TABLE 1. EFFECTIVE DEPTH OF CURRENT
Graw-Hill Book Co., 1961. PENETRATION (in inches)
Welsby, V. G., "The Theory and Design of Inductance
Coils," London, MacDonald & Co., 2nd Edition, Frequency 50Hz 1000 Hz 100 kHz
1960.
Copper 0.300 0.080 0.008
Cross-references: INDUCED ELECTROMOTIVE Aluminium 0.450 0.120 0.012
FORCE, INDUCTION HEATING, TRANSFORMER. Brass 0.700 0.180 0.018
Steel 0.065 0.015 0.0015
Steel (above Curie) 3.200 0.750 0.075

INDUCTION HEATING
Induction heating is a technique for generating of the workpiece. This effect increases with in-
heat in electrically conductive articles by causing crease in frequency, and as the current at any
current to circulate in them by induction from depth from the surface determines the rate of
an adjacent coil carrying alternating current. heat generation at that depth, the frequency
The frequency of the current exciting the coil can be chosen to control the heat pattern.
ranges from supply freq1,lency, 60 Hz, up. to The effective depth of current penetration is
typically 450 kHz. Exc()'pt for 60-Hz apphca- a function not only of frequency, but also mag-
tions current may be obtained from rotary netic permeability and specific resistivity. It can
mot~r alternators, up to 10kHz; magnetic be expressed by the relation:

1~
multipliers using saturating cores, up to ~40 Hz;
valve oscillators to over 1 MHz; and solid-state
frequency changers up to 10kHz. Coil-exciting
voltages range from 400, 800, and 1 200 V in D = 1. 98 p f
low-frequency systems (up to 10kHz), to as
much as 5 000 V for high-frequency valve where D = depth in inches, p = resistivity in
oscillators. prl p = permeability, and f = frequency in Hz.
Typical applications include general heating The very marked change in current depth in
of metal parts, annealing, case and through ferromagnetic material (steel) as it passes from
hardening, melting in crucible-t~pe furnaces, its high permeability state through Curie tem-
and heating of billets prior to forgmg. perature to its low permeability, higher re-
In all cases of induction heating the funda- sistivity state should be noted.
mental principle is transformer action between The exciting coil together with the workpiece
the exciting coil and the workpiece which forn~s can be represented by an inductor in parallel
the secondary circuit. A typical arrangement IS with a resistor. To achieve efficient operation
a solenoidal coil with a solid cylindrical work- when used with the power sources described it
piece. The currents induced in the char~e f!ow is necessary to compensate for the reactive
in cylindrical paths centered on the longltudmal KV A taken by the inductance by adding ca-
axis of the workpiece. Each of these paths has pacitance in parallel to bring the power fac~or
inductance and there is a consequent progressive close to unity. Once this is achieved the effective
decrease in current strength toward the center load resistance must then be matched to the
source resistance to achieve maximum power
transfer to the workpiece.
Induction heating makes possible power densi-
ties (KW /in.2 of workpiece area) up to 100 KW /
in. 2 compared with the maximum equivalent
power density from an oxyacetylene flame of
about 10 KW/in.2.
P. 1. BILLING
References
Simpson, P. G., "Induction Heating, New York,
McGraw-Hill Book Co., 1960.
Tudbury, C. A. "Basics ofInduction Heating," Vol. 1,
132 pp. and Vol. 2, 133 pp., New York, John Rider,
1960.
Vaughan, J. T., and Williamson, J. W., "Design of In-
duction Heating Coils for Cylindrical Nonmagnetic
t===~ '!==:= - leoll Load," AlEE Transactions 64, 587-592 (1945).
FIG. 1. Heating coil./w = current induced in ma- Vaughan, J. T., and Williamson, 1. W., "Design of
terial to be heated. I coil = current through the water Induction Heating Coils for Cylindrical Magnetic
cooled coil. Loads," AlEE Transactions 66, 887-892 (1947).
583 INERTIAL GUIDANCE

Baker, R. M., "Design and Calculation of Induction tion of constraint (; (Coriolis, centripital accel-
Heating Coils," AlEE Transactions 76, Part II, ApI. erations, etc.). This must be computed and then
and Industry. 31-40 (1957). it must be subtracted from the accelerometer
"Heat Treating, Cleaning, and Finishing," in "Metals signal. (Return to Fig. 3.) Following this with
Handbook," 8th Ed., Vol. 2, p. 173, Metals Park, al1 integration determines the vehicle velocity
Ohio, American Society of Metals, 1962. X. Diyision by earth's radius R yields the angular
rate X (latitude rate about the earth), which
Cross-references: ALTERN A TIN G CU RRENTS ; CON- upon integration provides the instantaneous
DucTIvITy' ELECTRICAL; INDUCED ELECTRO- latutude A. (See Fig. 5.)
MOTIVE FORCE; INDUCTANCE. Schuler Tuned INS The simple pendulum
used in Fig. 3 is unsatisfactory, since vehicle
acceleration would cause the pendulum to seek
a false vertical. Einstein's special theory of
INERTIAL GUIDANCE* relativity states that it is impossible to distin-
Inertial guidance is an on-board means to pro- guish between inertial reaction (acceleration)
vide a steering control function that will hold a forces and mass attraction (gravitational) forces .
moving vehicle on a prescribed course, relying Consequently, if a real pendulum were em-
solely upon inertial measurements. Inertial ployed, the accelerometer would erroneously
navigation is an on-board means to determine include a component of gravity. With an in-
the location of a vehicle relying solely upon correct vertical, and an incorrect measure of
inertial measurements. Hence, an inertial acceleration, there would be no way to correct
guidance system (IGS) employs an inertial either.
navigation system (INS). Instead of a simple pendulum, a more sophis-
The steering function is formulated by com- ticated scheme is required. The pendulous ef-
paring the location vector Qins (as determined fect is achieved by means of the accelerometer
by the INS) to the steering command vector and a gyroscope. The same accelerometer will
Qsc . (See Fig. I.) Note that solid lines rep- serve in two capacities (since they cannot be
resent signals while dotted lines represent separated anyway). It will read the vehicle ac-
physical or actual dynamics. In performing its celeration and act as part of the vertical seeking
task, th~ INS measures the vehicl<; acceleration device. (See Fig. 6.) The vehicle rate A is ap-
vector Q and vehicle rate vector n. Employing plied to the gyro , which precesses at that rate.
a gyro-stabilized coordinate reference system, After a timet, the platform turns through an
the INS computes the position or location vec- angle A which is the integral of A. Thus, when
tor Qins. In actual practice, the vectors are the system is operating properly, the platform
handled as three scalar components, utilizing is physically turned through the same angle that
three channels each like that in Fig. 1. the vehicle has traveled around the earth. Con-
Basic INS Essentially, the INS measures sequently, the platform is still level. The INS
vehicle acceleration and integrates twice. See is an analog of the actual system, maintaining
Fig. 2. Consiq.er the X component of vehicle a true vertical and measuring true acceleration.
acceleration, X. The X accelerometer ax mea- As such, the INS acts like a pendulum whose
sures this, producing an electrical signal. The period T is given by
signal is integrated twice, thus computing the
X component of vehicle position. For a three T= _1 • /R; (1)
dimensional system, there would be a similar 2rr V-;
channel each for the Y and Z components.
Stable Platform While Fig. 2 represents the whereupon, applying numerical values, the pe-
basic INS, it is not practicable. It has several riod becomes
shortcomings-the lack of a coordinate system T = 84 minutes. (2)
(implied in Fig. I) being the primary one. In
order to provide a coordinate system, it is neces- This is referred to as a Schuler pendulum or a
sary to physically orient the accelerometers Schuler tuned system.
appropriately. This can be accomplished by Complete INS The preceding section indi-
mounting the accelerometer on a stabilized cates one means by which the INS may be

-S_3__ -
platform (a surface whose orientation in space instrumented. Other schemes involve considera-
is maintained by physical means) . For discus-
sion purposes , let us employ a pendulum and a
compass. (See Fig. 3.) For this hypothetical
case, the X accelerometer would be level and
pointed north.
The price paid for this particular form of
.. I
QI
I'
,n.
stabilization is that the accelerometer rotates.
(See Fig. 4). As a result , it senses the accelera-

*Illustrations and text excerpted from Refs. 1 and 2. FIG. 1. Inertial guidance system.
INERTIAL GUIDANCE 584

Computer
_ _ _ __ _ _ _ _ __ _ ____________
X
Accelerometer Integrator Integrator
X m!Ch x

FIG. 2. Basic INS eX channel).

I co~pass I
Computer
I
I

I
I
@pendulum

FIG. 3. INS stabilized with pendulum and compass.

tions such as: the shape of the earth, free gyros, involving gravity and terrestrial coordinates,
platform fixed in space, platform affixed to the respectively.
vehicle frame (strap down), free azimuth, slaved A specific INS, one that is level and north-
to arbitrary heading, etc. A universal model is slaved, is shown in Fig. 8. Note that there are
shown in vector form in Fig. 7. The boxes two Schuler loops, one each for the X and Y
marked G and T are universal transformations axes. The third axis, Z, is aligned along the

North Pole
11..'..:"

FIG.4 . Rotating spherical coordinates.


585 INERTIAL GUIDANCE

w. Earth 's ra te

l1p - w, + C Polar ra te

East

v, Up

vehIcle

FIG, 5. Spherical terrestrial coordinates.

Computer

c.

Ix ~~
L---L S- - -
L-----------------~vr-------------------
Gyrostabilized Platform

FIG. 6. X axis stabilized by gyroscope.

G ravity Computer

r----------------------
I Acceleration of Constraint
1
Ig t
Terrestrial
I
P

L___ @ __ _
I 1 Schu ler Loops
I
I
I
: Heading Loop

L---- G -----
FIG. 7. Universal INS (in vector form).
INERTIAL GUIDANCE 586

i G;vityC~m-;;-u;t;-;;n-l
_g(2
I
I
II
t-~I--Gz

l/Iv ~ I
r-YleV;--D-- I
I
I Y Schuler Loop I I
~~~S~O~~~~ ~Gj
A_-I.!7l
Latitude

r - - - - -_-_-_-_I....:....__ A _

we
I Terrestrial Computation I
I
LJ
Longitude
I
I

I X Schuler Loop

~-.!~--J7l--
X Level ~
I
Gz
LTRANSFORMATION
________ [TJ11_
Altitude

FIG. 8. A level north-slaved inertial navigation system.

FIG. 9. Errors in level north slaved INS.


587 INERTIAL GUIDANCE

Qw wsinwt-PsinPt
O:n W2 - p2

O~-------1--~-==r------------------~

\ w P sin wt - w sin Pt
'fw P w2 _p2

~ cos Pt - cos wt
Qe w2 - p2

FIG. 10. Typical errors in level north-slaved INS.

earth's radius vector (up). The accelerometer In order to quantitatively accomplish a com-
for this channel senses the full magnitude of plete error analysis, first perturb the system, as
gravity, requiring that this be computed ac- shown in Fig. 9. Approximate the error sources
curately. The latitude channel A. has already in the system as phantom inputs (step func-
been described. The longitude channel rj) dif- tions) by the following six quantities: accel-
fers somewhat, since it requires some trigo- erometer errors ax and ay ; computer error Ih;
nometric manipulations. Refer to Fig. 5. A and gyro drift rates 'Yx, 'Yy, and 'Yz. Assum~ that
n
rate x , about a vector that is tangent to the
earth and pointing north, is multiplied by sec
the vehicle travels at constant altitude (R = 0)
and at constant latitude Oi. = 0). Also assume
A. in order to obtain the polar rate Q p , where, that the Schuler loops limit the tilt errors to
small values. The closed loop cross coupled
(3) analysis is treated with matrices, * and the
resulting errors are shown in Fig. 10. Note the
The polar rate includes the earth's rate of rota- following:
tion We, which must be suj:>tracted in order to
derive the longitude rate rj). Upon subsequent W = Schuler frequency whose period is given
integration, this reveals the longitude rj). The by Eq. (1), which is equal to 84 minutes
azimuth Z gyro. torquing rate Qz is generated a t sea level. .
by multiplying Q x by tan A.. P = Polar frequency Q p given by Eq. (3),
System Errors Upon examination of the INS which is about 2 rev/day for a vehicle
in Fig. 8, it should be pointed out that many traveling east at the equator at about
quantities are fed back, making the INS a multi- 1600 km/hr.
coupled system. Thus, error sources in various
parts of the system will effect many other It should be noted that a gyro drift rate equal
parts. Also, since these are fed back and cross to 0.0 I 0 fhr results in maximum velocity
coupled, the errors may either grow unbounded,
or they may be self-limiting. *See chapters 8 and 9 of Ref. 2.
INERTIAL GUIDANCE 588

errors of 2 km/hr; maximum latitude error of result of thermal excitation and by the emission
2 km; maximum longitude error of 6 km; of molecules of gases. Thermal emission from
maximum platform tilt off the vertical of 2 solids is contained in a continuous spectrum,
arc-min; and maximum heading error of 6 arc- whose wavelength distribution is described by
min. The most significant conclusion is that the relation
no errors are unbounded -a bonus provided by
21T c2 hfX
the multicoupling in the INS. Mx dA = AS -ec-=-h/=X-=-k=T-_-I dA (2)
IRA COCHIN
where
References Mx = spectral radiant exitance of the solid
1. Cochin, Ira, "Analysis and Design of Dynamic into a hemisphere in the wavelength
Systems," New York, Harper and Row, 1980. range from Ato (A + dA)
2. Cochin, Ira, "Analysis and Synthesis of INS in Uni- h = Planck's constant = 6.62 X 10- 27 erg sec
versal Terms," Ph.D. thesis Cooper Union, New fX = spectral emittance
York,1969. k = Boltzmann's constant = 1.38 X 10- 16
3. Bachman, K. 1., "Ring Laser Gyro Navigator," erg/K
Navigation 25(2), 142-152 (Summer 1977). T = absolute temperature of the solid emit-
4. Daniel, H. 1., and Hulslander, D. B., "Standard ter, K.
INS Program Status," Navigation, 27(1), 65-71, The spectral emittance, fA, is defined as the
(Spring 1980). ratio of the emission at wavelength A of the ob-
5. Harrison, J. V., "Reliability and Accuracy Predic- ject to that of an ideal blackbody at the same
tion for a Redundant Strapdown Navigator," Guid- temperature and wavelength. When fX is unity,
ance and Control, 4(5), 523-529 (Sept. 1981). Eq. (2) becomes the Planck radiation equation
for a blackbody. The distribution of radiant
Cross-references: ASTRONAUTICS, GYROSCOPE, exitance with wavelength for blackbody radi-
MECHANICS, ROTATION-CURVILINEAR MO- ators at different temperatures is shown in
TION. Fig. 1. It is apparent from the figure that black-
body radiation from emitters at temperatures
below about 2000 K falls predominantly in the
infrared region. An emitter which exhibits a
INFRARED RADIATION constant value less than unity of spectral emit-
The region of the electromagnetic spectrum be-
tween the wavelength limits 0.7 and 1000llm
(7 X 10- 5 and I X 10- 1 cm) has become known
as infrared radiation. The lower wavelength
limit is set to coincide with the upper limit of
the visible radiation region. Radiation of wave- -0:
length greater than 1000llm is generally thought
of as the microwave spectrum. Both limits are ,.~
:?u
arbitrary, and represent no change in character-
i 2~--~~~~~r-~------~------~
istics as they are passed. Conventionally, the
region between 0.7 and I.Sllm is called the near
infrared region; that between I.S and 201lm, ~ 1 ~--~~1-~--~~------~------~
the intermediate infrared region; and that be- S
~ o~~~~~~~~~~----~------~
tween 20 and 10001lm, the far infrared region. ...u
For many applications, the location of infrared ~ -'~-+--,,~----,-~~~--~------~
radiation in the spectrum is described by :s
wavelength in micrometers, 11m, (111m = 10-4 8
cm). In applications where the relative energy .
~ -2~~--~~------~~~~~------~

of the radiation is of interest, the wave number, ~ -3~+----Y1-----"--f~~~~------~


a, is used . The wave number is defined as the
reciprocal of the wavelength, A, in centimeters,
.
..J
a:
~ -4
and is expressed in units of cm- I (called the
kayser). This quantity is used more commonly Q-5~----+'-+---+---~~~~~~Ar--~
than the frequency v of the radiation, which is "9
related to a as follows:
a = vic (I)
10 100 1000

where c is the velocity of light. WAVELENGTH - MICROMETERS

Infrared radiation is produced principally by FIG. 1. Spectral radiant exitance of a blackbody at


the emission of solid and liquid materials as a various temperatures.
589 INFRARED RADIATION

tance at all wavelengths is called a gray-body Detection of the presence, distribution and/or
radiator. Most solid radiators show a general quantity of infrared radiation requires tech-
decrease in spectral emittance with increasing niques which are, in part, unique to this spectral
wavelength in the infrared; however, over limited region. The frequency of the radiation is such
spectral ranges, many materials are approxi- that essentially optical methods may be used to
mately gray-body radiators. Radiators which collect, direct, and filter the radiation. Trans-
approach the characteristics of ideal black- mitting optical elements, including lenses and
bodies can be made in the form of uniformly windows, must be made of suitable materials,
heated cavities. A relatively small aperture, which mayor may not be /transparent in the
through which the cavity can be observed, visible spectrum. Table I gives characteristics
serves as the source of blackbody radiation. of several transmitting materials suitable fm use
Infrared radiation is also observed as emitted in infrared optical systems. To avoid chromatic
from excited molecules of gases. Many of the aberration, reflecting mirrors are commonly
energy transitions which take place in gases ex- used in infrared optical systems to focus and
cited thermally or electrically result in radiation deviate the radiation when broad spectral bands
emission in the infrared region. Gaseous emission are observed. Filters for the infrared are de-
differs in character from solid emission in that signed and constructed like those for the visi-
the former consists of discrete spectrum lines ble, except for the choice of materials and, in
or bands, with significant discontinuities, while the case of interference filters, the thickness of
the latter shows a continuous distribution of the layers.
energy throughout the spectrum. The pre- The detector element for infrared represents
dominant source of molecular radiation in the the most unique component of the detection
infrared is the result of vibration of the mole- system. Photographic techniques can be used
cules in characteristic modes. Energy transitions for the near infrared out to about 1.3J.Lm.
between various states of molecular rotation Photoemissive devices, comparable to the visible-
also produce infrared radiation. Complex mo- and ultraviolet-sensitive photocells, are available
lecular gases radiate intricate spectra, which with sensitivity also extending to about 1.3J.Lm.
may be analyzed to give information of the na- The intermediate infrared region is most effec-
ture of the molecules or of the composition of tively detected by photoconductors. These ele-
the gas. ments, photosensitive semiconductors, are es-
The propagation of infrared radiation through sentially photon detectors, which respond in
various media is, in general, subject to absorp- proportion to the number of infrared photons in
tion which varies with the wavelength of the the spectral region of wavelength shorter than
radiation. Molecular vibration and rotation in the cut-off wavelength. This cut-off wave-
gases, which are related to the emission of length corresponds to the minimum photon
radiation, are also responsible for resonance ab- energy necessary to overcome the forbidden
sorption of energy. The gases in the atmo- gap of the semiconductor. A number of sensi-
sphere, for example, exhibit pronounced ab- tive photoconductors are available with spectral
sorption throughout the infrared spectrum. cutoff at various wavelengths in the infrared.
The principal gases of the atmosphere, nitrogen Photoconductors are employed as resistive ele-
and oxygen, do not absorb significantly in the ments, as photovoltaic p-n diodes, or as photo-
infrared region. However, the lesser constituents, electromagnetic elements, according to the par-
water vapor (H20), carbon dioxide (C02), and ticular electrical advantage to be gained. All
ozone (03), are responsible for strong absorp- spectral regions from ultraviolet through visible,
tion in the infrared. The absorption of radiation infrared, and microwaves, can be detected by
is so prevalent that those spectral bands in which an appropriately designed thermal element,
relatively little absorption occurs are identified which responds by being heated by the absorp-
as atmospheric windows. tion of the incident radiation. In the infrared,
Solid and liquid materials show, as a rule, thermal detectors take the forms of thermo-
strong absorption in the infrared. There are, couples, bolometers, and pneumatic devices.
however, many solids which transmit well in The thermal elements, in general, are not as
broad regions of the infrared spectrum. Many sensitive or as rapidly responding as photo-
materials, such as water and silica glasses, which conductors in spectral regions where they both
show little absorption in the visible, are opaque respond. However, the broad spectral response
to infrared radiation at wavelengths greater and uniform energy sensitivity characteristics
than a few microns. Many of the electrically in- make them highly useful. Two-dimensional
sulating crystals, such as the alkali halides and image information is obtained in the infrared
the alkaline-earth halides, which transmit well by one-dimensional scanning of a linear array
in the visible, also are transparent to much of of detectors, or by a two-dimensional array of
the near and intermediate infrared spectrum. detector elements which are scanned electron-
Several of the semiconductor materials absorb ically or by electron beam. A "push-broom"
strongly in the visible, but become transparent scan arrangement of a row of detectors is
in the infrared beyond certain wavelengths commonly used for the preparation of infrared
characteristic of the semiconductor. maps of the terrain. "Staring" sensors, with a
INFRARED RADIATION 590

TABLE l. INFRARED TRANSMITTING MATERIALS

Useful Refractive
Transmission Index Near
Material Region (Jlm) Transmission Peak Special Characteristics

Optical glasses 0.3-2.7 1.48-1.70 Best for near infrared


Fused silica 0.2-3.5-4.5 1.43 Some types show absorption near
2.7101m
Arsenic trisulfide 0.6-12.0 2.4 A glass; subject to striations
Calcium aluminate 0.3-5.5 1.8 A glass; subject to attack by water
Sapphire 0.17-6.0 1.7 Single crystal, hard, refractory
Silicon 1.1->20 3.4 Low density; opaque to visible
Germanium 1.8->20 4.0 Opaque to visible
NaCI 0.2-15 1.52 Water soluble
KBr 0.21-27 1.54 Water soluble
LiF 0.11-6 1.35 Low solubility in water
CaF2 0.13-9 1.41 Insoluble
Thallium bromide-iodide (KRS-5) 0.5-40 2.38 Fairly soft; cold flows
AgCI 0.4-25 2.0 Soft; cold flows
Irtran 1 0.5-9 1.3 Polycrystalline MgF2
Irtran 2 0.4-14.5 2.2 Polycrysta11ine ZnS
Irtran 3 <0.4-11.5 1.3 Polycrysta11ine CaF 2
Irtran 4 0.5-22 2.4 Polycrysta11ine ZnSe
Irtran 5 <0.4-9.5 1.7 Polycrysta11ine MgO
Irtran 6 0.9-31 2.7 Polycrysta11ine CdTe

two-dimensional array of detectors, are used for 2 gives representative characteristics of several
imaging fields-of-view in which the object of commonly used infrared detectors.
interest may appear unpredictably. Imaging The most common application of infrared
sensors may contain built-in amplification in radiation is, of course, radiant heating. Solid
the form of charge-coupled devices or electron radiators, such as hot tungsten filaments, alloy
beam scan with electron multiplication. Ther- wires, and silicon carbide rods are employed ex-
mal-type response for infrared detection is also tensively as sources of infrared to provide
obtained by means of pyroelectric materials, surface heating by radiation.
such as triglycine sulfate, which change their Infrared spectroscopy has become a powerful
surface charge with temperature. This type of analytical tool in the chemistry laboratory.
detector shows wide spectral band response with Organic molecules, in general, contain inter-
short time constant at room temperature. Table atomic valence bonds which exhibit character-

TABLE 2. INFRARED DETECTORS

Specific
Detectivity, D* Time
Detector at Peak Constant
(operating temperature) Region (Jlm) (cmHz l/2w- l ) (sec) Special Features

Sj(295K) Visible-1.0 2 X 10 12 5 X 10-6 Photovoltaic crystal


PbS (295 K) Visible-2.8 8 X 10 10 2 X 10-4 Thin-mm photoconductor
PbSe (195 K) Visible-5.6 2 X 10 10 2 X 10-3 Thin-film photoconductor
InSb (77 K) 1-5.6 1011 <2 X 10-7 Photovoltaic crystal
(Hg . Cd)Te (77 K) 2-14 5 X 109 5 X 10-7 Spectral cut-off varies with
alloy composition
Ge (Hg doped) (25 K) 1-16 2X 10 10 <10-6 Photoconductor crystal
Ge (Cu doped (5 K) 1-29 3X 10 10 <10-6 Photoconductor crystal
Ge (Zn doped) (5 K) 1-40 3X 10 10 10-8 Photoconductor crystal
Thermistor bolometer All 2X 108 10-3 -10-2 Flake of mixed oxides
(295 K)
Golay cell (255 K) All 2 X 109 1.5 X 10-2 Pneumatic
Thermocouple (295 K) All 2 X 10 8 1.5 X 10-2 Used in spectrometers
Pyroelectric (295 K) All 3 X 10 8 2 X 10 9 High impedance
591 INTERFERENCE AND INTERFEROMETRY

istic resonance frequencies which can be identi- provided to increase the weak signal from the
fied in the absorption spectrum of the material detector to a level which can be used to operate
in gaseous form. Such information can be used controls or displays, as demanded by the appli-
to study the structure of complex molecules. It cation. Optical filtering is applied in order to
also serves in aiding the identification of the restrict the observed spectral region to one in
presence of known valence bonds in chemical which the target is effectively detected, with a
analysis. Most absorption lines and bonds due minimum of interference from radiation from
to molecular vibrations fall in the frequency its background. The wavelength of detection is
range 500 to 5000 cm- I (wavelength range 2 such that angular resolution capability, as set
to 20llm). A large quantity of data has been by diffraction, is much greater with infrared
gathered on the detailed absorption spectra of devices than that of radar devices. Detection of
many gaseous materials. The characteristic spec- targets at great distances through intervening
tra of many organic molecules are such that atmosphere is more effective in the infrared
identification of the presence of the molecules, than in the visible because of the much lower
as well as the presence of particular radicals atmospheric scattering in the infrared.
within the molecules, can be readily observed. Detailed discussions of the characteristics, de-
Petroleum chemistry, for example, has been tection and applications of infrared radiation
greatly aided by the application of infrared may be found in the references.
spectroscopy to the identification of many of
the complex constituents in petroleum products. R. H. McFEE
Observation of infrared absorption spectra is
carried out by means of an infrared spectro- References
photometer, in which the transmission of mono-
chromatic radiation by a gaseous sample in 'a Jamieson, J. A., McFee, R. H., Plass, G. N., Grube,
cell is compared with that of a blank cell, while R. H., and Richards, R. G., "Infrared Physics and
the wavelength of the radiation is scanned Engineering," New York, McGraw-Hill Book Co.,
through the spectral range of interest. Prism 1963.
dispersing elements are usually used in the in- Smith, R. A., Jones, T. E., and Chasmar, R. P., "The
frared, rather than gratings, because of the diffi- Detection and Measurement of Infrared Radiation,"
culty with the latter of separating the several Fair Lawn, N.J., Oxford University Press, 1957.
orders in the wide spectral range covered. Far Herzberg, G., "Infrared and Raman Spectra of Poly-
infrared spectroscopy is complicated by the atomic Molecules," New York, Van Nostrand Rein-
omnipresence of background and scattered hold, 1945.
radiation of shorter wavelength emitted inside Szymanski, H. A., and Alperts, N. A., "IR: Theory
the instrument at room temperature. Special and Practice of Infrared Spectroscopy," Plenum
techniques of filtering must be employed to Press, 1964.
eliminate the effects of the short-wavelength Kruse, P. W., McGlauchlin, L. D., and McQuistan,
radiation. R. B., "Elements of Infrared Technology," New
Optical-electronic devices of many varieties York, John Wiley & Sons, 1962.
have been designed to determine the direction Hudson, R. D., Jr., "Infrared System Engineering,"
of weakly radiating remote objects by means of New York, John Wiley & Sons, 1969.
detection of their infrared emission. Military Wolfe, W. L., and Zissis, G. 1. (Eds.), "The Infrared
applications have been found which have been Handbook," Ann Arbor, MI, Environmental Re-
made possible uniquely by this technique. search Institute of Michigan, 1978.
Missiles can be guided to their target by infra- Keyes, R. J. (Ed.), "Optical and Infrared Detectors,"
red detection of the self-emission of heated seg- Second Edition, Heidelberg, Springer-Verlag, 1980.
ments of the target. Detailed maps of the earth's Martin, A. E., "Infrared Interferometric Spectrom-
surface can be made from aircraft at night by eters," Amsterdam, Elsevier Scientific Publishing
observing the varying infrared emission of the Co., 1980.
ground. Personnel can be detected in total dark-
ness by the infrared radiation they emit as Cross-references: ABSORPTION SPECTRA; LIGHT;
warm objects. Space applications have included RADIATION, THERMAL; SPECTROSCOPY.
remote sensing of terrestrial surface character-
istics for weather mapping, pollution detection
and agricultural conditions detection from INTERFERENCE AND INTERFEROMETRY
satellites. Nondestructive testing of electronic
components, heat-processing plants, and thermal Interference is the term used to denote the
insulation effectiveness are now commonplace physical effects of superimposing two or more
applications of infrared radiation detection. waves of the same wavelength. Interferometry
Such devices require the detection of low- is the technique of measurement using these
level radiation in the intermediate infrared re- effects. Interference occurs whenever waves
gion. Optical lenses or mirrors are used to emanating from sources which have a constant
collect the observed radiation and concentrate phase relationship (i.e., which are coherent) are
it onto the sensitive infrared detector. High- present at the same place. Depending on the
gain, low-noise electronic amplifiers must be phase difference between the waves, the inter-
INTERFERENCE AND INTERFEROMETRY 592

ference may be constructive, so that the waves bands of cancellation. If the sources were light
reinforce, or it may be destructive, so that the sources, a screen placed along the right edge of
waves cancel each other. It may seem para- the figure would display alternating light (L)
doxical that two waves-e.g., sound waves-can and dark (D) bands. Note that there is a light
combine at one place to produce no effect- band at the center, equidistant from the two
e.g., silence. However, the effects are easily ob- sources. Because the path of each wave from its
servable under the proper conditions and are source to that point is the same, there is no
explicable by elementary theories. phase difference between them and construc-
Interference can occur for all wave phenom- tive interference results. Away from the center,
ena. Vibrational waves such as sound or water there is a path difference and hence a phase dif-
waves and electromagnetic waves such as radio ference. When that path difference is an integral
and light waves all behave similarly. Interfer- num ber of wavelengths, constructive interfer-
ence is a consequence of the principle of super- ence results; when the path difference is an odd
position. That principle states that the resultant number of half-wavelengths (e.g., X/2, 3X/2,
effect of two or more waves-that is, the atomic 5X/2, ... ), there is destructive interference.
displacements in the case of vibrational waves Figure 2 is a schematic representation of
or the electric and magnetic fields in the case Thomas Young's classic experiment performed
of electromagnetic waves-is simply the alge- in 1802. His two sources were closely spaced
braic sum of the effects that the individual pinholes which were both illuminated by light
waves would produce alone. Superposition from a single pinhole (SL). Diffraction at this
holds as long as the wave amplitudes are within pinhole ensured that the two sources were co-
the range of the normal, linear response of the herent. Young placed a screen some distance
propagating medium. It always holds when elec- away and observed a series of alternating bright
tromagnetic waves traverse a vacuum. Coher- and dark fringes. The fringes would vanish if
ence of the sources is essential to the observa- either of the sources was covered; they would
tion of interference (see COHERENCE). If the be replaced by a nearly uniform illumination
sources of the waves do not have a constant of the screen. By this experiment, Young pro-
pl,1ase relationship, the superposition of the vided the first firm experimental evidence for
waves, while instantaneously producing a the wave nature of light. The competing cor-
rapidly varying pattern of reinforcement and puscular theory of light, advocated by Isaac
cancellation, will, on the average, produce no Newton a century earlier, still had many ad-
observable interference effects. herents. Young was also able to calculate the
A simple illustration of interference is shown wavelengths of various colors of light; these
in Fig. 1. Concentric circles represent wave- were the first such measurements.
fronts moving out, at regular intervals (with Theory A traveling wave can be represented
wavelength X), from the two sources Sl and S2' mathematically by
The dark areas represent the minima of the
wave patterns and the light areas between them
represent the maxima. The overlapping patterns M(x, t) =a cos (27Tvt - 27T ~ + ex) (1)
show bands of reinforcement alternating with
where M(x, t) is the magnitude of the wave at
distance x from the source and at time t, a is
the amplitude, which is the maximum magni-
D tude, v is the frequency, X is the wavelength,
and ex is the phase. The wave velocity v = Xv.
The intensity of the wave, which is the observ-
l able quantity, is proportional to the square of
the amplitude.
To calculate the effects of interference, one
considers the sum of two waves of the same
D frequency and wavelength. Denoting the two

D .....
1--':":::--,------- -------- R

0 -- --::_-: .... -
l
1 -- - - .:::.
5

FIG. 1. FIG. 2.
593 INTERFERENCE AND INTERFEROMETRY

waves by subscripts I and 2, one obtains a re- separations of less than 1 mm were difficult to
sultan t magnitude: attain. This resulted in fringes that were weak
and indistinct. The experimental results, while
M=MI +M2 historically decisive, were not immediately con-
vincing. However, over the following few years,
= al cos (21rvt - 21rXI /A + al)
refinements were made by Fresnel and others.
+ az cos (21rvt - 21rX2 /A + (2)' (2) These permitted greatly reduced separation of
the sources which, in turn, increased the spac-
For simplicity, one considers the case of two ing and observability of the fringes. In addition,
sources of equal amplitude (al =a2 =a), which Fresnel produced a more complete theoretical
are in phase (al = (2)' This situation is exem- formulation of interference.
plified by Young's experiment. The logical extension of Young's experiment
Using the trigonometric relationship for the to more than two slits yields a tremendously
sum of the cosines of two angles, it is found useful device, the diffraction grating. A grating
that the amplitude A of the resultant wave is: consists of a large number of equally spaced
rulings. (See DIFFRACTION BY MATTER AND
A = 2a cos (8/2) (3) DIFFRACTION GRATINGS.) Analysis of its
properties, along the same lines which led to
where 8 = 21r(XI - Xl )/A is the total phase dif- Eq. (4) above, predicts very narrow fringes.
ference between the waves at the point of inter- These occur in the same places as those pro-
ference. The quantity Xl - X2 is the difference duced by two slits whose separation is the same
in the paths from the sources to that point. The as that of the rulings of the grating. Narrow
amplitude A = ±2a when 8 = 2m1r (m = 0, I, 2, fringes permit the resolution of closely spaced
3, ... ), so that Xl - X2 = mA; A = 0 when 8 = spectral features. Many spectrometers and spec-
(2m + I) 1r, so that Xl - X2 = (m + t) A. trophotometers used in the infrared, visible,
Because the intensity is proportional to A 2, and ultraviolet spectral ranges incorporate dif-
the peak intensity is 4a -four times that due fraction gratings as their central elements for
2
to either wave acting alone and twice that due spectral analysis.
two incoherent sources (which would not in- Thin Film Interference An important appli-
terfere). Peaks in intensity occur whenever A = cation of the theory is in the analysis of inter-
±2a; i.e., whenever Xl - X2 = rnA, as stated ference which occurs in thin films of transpar-
above. ent media. The most common example of this is
In Young's experiment (Fig. 2), the path dif- the interference of visible light in soap bubbles
ference X I - X2 = PS - QS may be easily cal- and in oil slicks, which produces a range of
culated. Point S denotes the location of the colors in reflection. These may be explained
first (rn = 1) peak in intensity. Let PQ =sand easily by the use of Eq. (3) and reference to
RS =y; let D be the distance of the screen from Fig. 3. Here, light from the source reflects
the plane of the slits. Then, using the Pythago- from the upper surface of the film at A and
rean theorem: from the lower surface at B. The two coherent
sources in this case are the two reflected images
of the source. For light that is incident nearly
perpendicular to the film, the geometrical dif-
ference between lengths of the two paths
Xl - X2 = 2d. The optical path difference is
2dn, where n is the index of refraction of the
film (see REFRACTION). One additional com-
PS - QS = 2ys/(PS + QS). plication is the phase reversal (corresponding
to a phase shift of 1r) that occurs whenever light
For all cases of practical interest, y «D, so that is reflected from a medium whose index of re-
(PS + QS) ~ 2D. At S, PS - QS = Xl - X2 = A, fraction is greater than the index of the me-
so that dium in which the light is propagating. This
is exactly the case at point A in Fig. 3; no such
(4)
Equation (4) provided Young with the basis for
the first experimental determination of the
wavelength of light. His results were remarkably
accurate, yielding 420-700 nm as the range of
visible wavelengths.
Practical problems made Young's experi- Air
ment difficult to perform. For visible light,
A~ 500 nm. If the separation of the sources i Film d i
s = I mm and the distance to the screen D = 2 m, Air
the separation of the fringes y ~ I mm. Light
sources available to Young were weak and slit FIG. 3.
INTERFERENCE AND INTERFEROMETRY 594

reversal occurs at point B since the index of The Fabry-Perot interferometer is another ap-
refraction of air (n = 1) is less than that of the plication of thin film interference, one which
film. The condition for constructive interfer- leads directly to the topic of interferometry. It
ence remains a phase difference (j = 2m1T [in consists of two parallel plates of glass or quartz
Eq. (3»); however, (j is now the sum of two (see Fig, 4). Their inner surfaces are optically
terms: flat and partially silvered to enhance their re-
flectivity. Light from the source (S) is colli-
(j = 1T + 21T(2nd/A.). (6) mated by lens Ll and is focused by lens L2 . The
high reflectivity results in a large number of re-
The first term is due to the phase reversal at A, flected images of the source combining to in-
the second to the optical path difference 2nd. terfere in the focal plane (F) of lens L2 . As
Equation (6) has one surprising prediction: in the case of the diffraction grating, this pro-
for very thin films, there is no light reflected. If duces very sharp interference fringes, permit-
d/A.« 1, (j ~ 1T so that A ~ 0 in Eq. (3). This ting high resolution measurements. Optical
effect is easily demonstrated by viewing in re- filters based on the Fabry-Perot interferometer
flected light a film of soapy water held vertical are used to produce nearly monochromatic light
in a loop of wire. As the fluid drains downward, from polychromatic sources.
and the thickness of the film decreases, the film Interferometry Interferometers are instru-
appears to change color. The portion of the ments which utilize the principle of interference
film at the top of the loop seems to disappear to make measurements. As in the case of the
(no light is reflected from it) once its thickness Fabry-Perot interferometer, a single wave is split
is reduced to much less than the wavelength of into two or more waves which are recombined
light. after they have traveled paths of different op-
Equations (3) and (6) can be used to predict ticai lengths. This introduces a phase difference,
the apparent color of a thin film. For example, so that interference effects will generally be ob-
a film of soapy water whose thickness is 300 nm, served. The discussion here will focus mainly
and whose index of refraction n = lAO pro- on several applications of the most widely used
duces a phase difference in reflection of (j = 1T + one, the Michelson interferometer.
21T (840 nm/A.). Setting this equal to 2m1T (for The Michelson interferometer is shown sche-
integer m) to obtain the conditions for con- matically in Fig. 5. The discussion which fol-
structive interference and solving for A., it is lows will be in terms of visible light, but any
found that only m = 2 gives a solution lying other wave phenomenon will produce the same
within the visible spectrum: A. = 560 nm. This effects. Collimated light from source S is inci-
corresponds to a yellow-green shade; if the film dent on the beam splitter B which reflects half
is viewed in reflection, it will appear that color. of the intensity to mirror M1 and transmits half
Soap bubbles and oil slicks vary in thickness to mirror M2, After reflection from the two
over their surfaces. Depending on the thickness, mirrors, the two beams again encounter the
different spectral regions are enhanced by in- beam splitter. Half of each beam is now directed
terference and brilliant bands of various colors toward the observer or detector of O. Because
are o'bserved. Each continuous band corre- the two beams at 0 originate at the same source,
sponds to a region of constant film thickness. they will be coherent (as long as the difference
A very common practical application of thin in their optical paths does not exceed the co-
film interference is the "anti-reflection" coating herence length of the light-typically ~ 10 cm
used on lenses in optical instruments and cam- for conventional light sources). Thus they will
eras. The coating is a thin layer of transparent interfere. Neglecting phase changes that may
material whose thickness is carefully selected to occur at the beamsplitter, one expects con-
minimize reflection (and thus maximize trans- structive interference if the optical path differ-
mission) in the visible. The analysis is similar to ence (Xl - X2) equals mA. and destructive inter-
that which led to Equation (6). t)
ference if it equals (m + A.. If the mirrors M1

Plates

L1
Silvered F

FIG. 4.
595 INTERFERENCE AND INTERFEROMETRY

r:::::::~:=:::J1 M1 ented along the direction of the earth's motion


I relative to the (fixed) ether, the other must be
I perpendicular to it. A calculation of the effec-
tive optical paths and transit times for light
t along the two paths through the ether is straight-
I forward. It predicts a phase difference between
I the two beams, even if the geometrical path
lengths are identically equal. It also predicts
I that when the interferometer is rotated 90° so
I M2 that the roles of the two arms are interchanged,
there should be a significant shift in the ob-

---~7r~'~------~
served fringe pattern.
For the interferometer used in the Michelson-
Morley experiment, this shift was predicted to
s be 0.4 of a fringe; the instrument could resolve
B I a shift of about 0.01 fringe. Repeated measure-
ments at differing times and in differing seasons
t never showed any shift upon rotation through
I 90°. This null result pointed to a fatal flaw in
I the classical theory of relativity. Only Einstein's
o theory of special relativity (1903), in which
there is no ether and in which the speed of light
FIG. 5. is independent of the relative motion of source
and observer, can provide a consistent explana-
tion of the Michelson-Morley null result. (See
and M2 are aligned precisely perpendicular, RELATIVITY.)
analysis predicts the observation of a series of Another application of the Michelson inter-
circular interference fringes at O. As the path ferometer is in the measurement of the indices
difference is changed, say by moving mirror M1 of refraction of gases. An evacuated cell with
toward the beam splitter , the fringe pattern windows at each end is placed in one arm of
shifts: each time M1 is moved a distance '11./2, the interferometer so that light passes through
the optical path difference changes by A, and the windows. As gas is admitted slowly into the
the fringe pattern shifts by one fringe. cell, the optical path in that arm is increased
Among the applications of the Michelson in- according to the laws of refraction. Thus the
terferometer that will be discussed here, there fringe pattern will shift due to the change in
are two of historical significance. In 1892, optical path. From the known length of the cell
Michelson and Benoit measured the primary and the number of fringes that are observed to
standard of length, the International Prototype shift, the index of refraction of the gas may
Meter which is kept in Paris, in units of the easily be found.
wavelength of the red line emitted by cadmium. In the last 20 years the interferometer, in con-
By carefully counting fringes as one mirror was cert with digital computers, has revolutionized
moved slowly, they determined that the stan- infrared spectroscopy. The light intensity de-
dard meter is equivalent to 1,553,163.5 wave- tected at 0 in Fig. 5, as one mirror is moved
lengths of this line. Ultimately, it was decided steadily, is a sinusoidal function of the mirror
by the International Commission of Weights displacement. The period of that function is
and Measures (1960) to define the meter as ex- proportional to the wavelength of the light. If
actly 1,650,763.74 wavelengths of the orange- the source is not monochromatic, but instead
red line of krypton-86. This made the primary emits a range of wavelengths, the output signal
standard of length easily accessible to all scien- at 0 is more complicated but is easily shown to
tists around the world, obviating the need to be the Fourier transform of the spectral dis-
travel to Paris for a precise comparison of mea- tribution of the source.
sured lengths with the standard. This line of High-speed digital computers are capable of
krypton was chosen because it is extremely nar- performing the transform in real time as the
row and therefore has a very long coherence mirror is moved. Compared with conventional
length. This permits the use of long optical path grating spectroscopy, the need for a computer
differences needed for measurements of objects is a disadvantage. However, this is heavily out-
of appreciable length. weighed by the number of highly significant
The Michelson-Morley Experiment of 1887 advantages in the areas of signal-to-noise ratio
was an ingenious attempt to measure the veloc- and spectral resolution. Such FTIR (Fourier
ity of the earth in its orbital motion through transform infrared) spectrometers have gained
the "luminiferous ether." Scientists up to that rapid acceptance and application in several
time had postulated the existence of the ether branches of physics and chemistry, including
as the medium of propagation for light through plasma diagnostics, astronomy, quantitative
the otherwise empty reaches of outer space. If analysis, and solid state studies.
one arm of a Michelson interferometer is ori- Two other applications of interference effects
INTERFERENCE AND INTERFEROMETRY 596

t
I t
I I
I I
I I

"\--~9' - -y
I I

M2
Ml , '
I :
l: \
>L I
\ /
, f
\ /
V F
FIG. 6.

will complete this discussion of interferometry. tremendous difference in wavelength: resolu-


The stellar interferometer constructed by Mi- tion is proportional to A, and radio wavelengths
chelson (1920) can be understood as an ex- are typically a factor of 10 4 greater than those
ample of the principle of Young's experiment. of light. The actual resolution of the VLBI sys-
Here the distant star is the single source illumi- tem is close to that attainable theoretically
nating the device (see Fig. 6). However, the because methods of eliminating the image-
slits are replaced by two mirrors (Ml and M2), degrading effects of atmospheric turbulence
separated by several meters, which reflect the have also been developed. This is not the case in
starlight into the objective lens (L) of an astro- optical astronomy, where such turbulence de-
nomical telescope. This produces a pattern of grades the actual resolution of the Palomar in-
interference fringes in its focal plane (F). By strument to about 1 arc-second. Thus, distant
varying the separation of the mirrors, the fringe galaxies and quasars are being observed in far
pattern can be shifted. Analysis of the shifts greater detail by very long baseline radio wave
permits the resolution of closely spaced binary interferometry than by optical means.
stars or the measurement of the diameters of
nearby red giant stars such as Betelgeuse and BRUCE P . CLAYMAN
Antares. By this method, higher resolution than
otherwise attainable with the same telescope
can be realized. References
Recent technological developments have per-
mitted the extension of such stellar interferom- Born, M., and Wolf, E., "Principles of Optics," 5th
etry to the radio portion of the electromagnetic Ed., Chs. 7 and 10, New York, Pergamon Press,
spectrum. In an exact analog of Michelson's 1975.
stellar interferometer, signals from radiotele- Cook, A. H., "Interference of Electromagnetic Waves,"
scopes separated by long distances can be Oxford, Clarendon Press, 1971.
com bined to produce interference effects. This Jenkins, F. A., and White, H. A., "Fundamentals of
is termed very long baseline interferometry Optics," Chs. 12, 13, 14, and 16, New York,
(VLBI). A Fourier transform converts the McGraw-Hill, 1976.
signals, which correspond to the fringes in Klein, M. V., "Optics," Chs. 5 and 6, New York, John
Michelson's device, into an image of the astro- Wiley & Sons, Inc., 1970.
nomical object. Steel, W. H., "Interferometry," London, Cambridge
Originally, cables carried the signals to a cen- Univ. Press, 1967.
tral location for comparison. Subsequent devel- Strong, J., "Concepts of Classical Optics," Chs. 8, 11,
opment of highly stable and accurate atomic 12, and App. A, B, and F, San Francisco, W. H.
clocks permitted separations of the individual Freeman and Co., 1958.
telescopes by thousands of kilometers. This
has resulted in greatly improved performance: Cross-references: COHERENCE, DIFFRACTION BY
VLBI now yields angular resolution of 0.0001 MATTER AND DIFFRACTION GRATINGS, ELEC-
arc-seconds, compared with 0.023 arc-seconds TROMAGNETIC THEORY, LIGHT, MICHELSON-
theoretically attainable with the 5 m optical MORLEY EXPERIMENT, REFRACTION, RELA-
telescope on Mt. Palomar. This is despite the TIVITY, WAVE MOTION.
597 INTERMOLECULAR FORCES

INTERMOLECULAR FORCES explanation for the intermolecular attractive


forces that are evidenced by imperfect-gas be-
The terms "intermolecular forces" and "van havior the Joule-Thomson effect, and the very
der Waals forces" refer to the weak forces be- existe~ce of the liquid state. When the dipole-
tween molecules and these forces are to be dipole term is calculated quantum-mechanical.Iy,
distinguished fr~m the much stron~er in.ter- the resulting attractive potential energy, varymg
atomic intramolecular forces of chemical bmd- as r- 6 , is called the "dispersion potential." It
ing. The simplest example <;>f an in~ermolecular was first treated by Wang (1927) and then by
force is provided by the Interaction between London (1930).
two noble-gas atoms: The atoms are spherical A Priori Calculations It has long been cus-
and the force is central in this case. tomary to assume that the intermolecular
Short-range Repulsion The volume of mat- potential between two neutral, no~pola~ mole-
ter in the solid and liquid states is an extensive cules consists of a long-range dlsperslOn at-
property, i.e., is proportional to the number ~f traction combined with a short-range overlap
moles or molecules in the specimen, and thiS repulsion . The combined potential-energy curve
implies a molecular "size" or the existence of for two noble-gas atoms then has a minimum at
repulsive forces that prevent two molecules a distance rm, and the potential vanishes at a
from occupying the same space at the same distance a which is referred to as the slow-
time. The very low compressibilities of solids collision diameter. However, it is questionable
and liquids indicate a very strong repulsion of to decompose the interaction potential into
two molecules when they begin to overlap each parts, and in particular, to use t.he disper~ion
other. These self-evident conceptions were em- potential, which is an asymptotIc ex~r~sslOn,
bodied in the nineteenth-century representation at distances as small as rm. The coeffICient of
of molecules as little billiard balls. The origin the dispersion potential for interactions betw~en
of the "overlap forces" of repulsion was not pairs of rare-gas atoms has been calculated With
explained until the advent .of qu.ant.um me- great accuracy in recent years, starting from the
chanics and the Pauli exclUSIOn prInCiple. Ac- Schrodinger equation and using a small number
cording to the latter, when two noble-gas ~toms of optical data. The a priori calculati~n of the
begin to overlap, the electrons tend to migrate complete potential energy curve, startmg from
from the crowded region in the middle to the Schrodinger's equation, is an extraordinarily
far ends outside the nuclei, where they exert difficult and laborious procedure even for the
electrostatic forces that tend to pull the nuclei simplest atoms and even with the aid of modern
apart. This effect, which greatly exceeds. the
direct electrostatic repulsion of the partially electronic computers. Only since (1972) has
shielded nuclei, cannot be represented. by any it been possible to achieve sufficient accuracy
simple analytic potential energy function. For to obtain the He-He potential in the neighbor-
hood of its minimum.
reasons of mathematical convenience, however, Nonadditivity Effects When three or more
an inverse-power law, especially r- 12 , ~r a simple rare-gas atoms are in proximity, the mutual
exponential function e-ar , where r IS the diS- energy of interaction cannot be expressed as
tance between the nuclei of the atoms, is often the sum of potential energies for all the inter-
used to represent the potential energy of the acting pairs, except to a first approxima.tion.
overlap repulsion. The first calculation of an The deviations from additivity are assOCiated
overlap repulsion (between H atoms) was made with many-body forces, and in recent years the
by Heitler and London in 1927. three-body interaction has received particular
Long-range Attraction When two noble-gas attention. The asymptotic form of the nonad-
atoms are so far apart that there is negligible ditive three-body interaction is called the triple-
overlap of their charge clouds or wave packets, dipole potential; it arises from an instantaneous
the atoms although neutral and nonpolar, moment in one atom which induces moments
interact th;ough a set of electrostatic multipole in two other atoms, the induced moments
terms of which the most important is the then interacting to produce a three-body force.
lowest-order dipole-dipole term. An instan- The quantum-mechanical calculation gives the
taneous electric dipole moment in one atom, result
which averages to zero, induces a dipole mo-
ment in the other atom which is proportional
to the inducing moment and which interacts U123 = v (rI2'23 '13)-3
with it. It is readily seen that the interaction . (1 + 3 cos (J I cos (J 2 cos (J 3)
energy varies as r- 6 , does not average to zero,
and corresponds to an attraction between the
atoms. Similarly an instantaneous electric qua- for the triple-dipole potential, where 'ij are the
drupole moment in one atoI!1 induces ~ dipole sides and (J i the interior angles of the triangular
moment in the other which Interacts With It to array and v is a constant which can be calcu-
.
give an interaction energy, varyIng as r-8 , lated for rare-gas combinations. Little is known
which is called the dipole-quadrupole term. about three-body interactions in the region of
These long-range multipole interactions, es- overlap, and higher-order many-body forces have
pecially the dipole-dipole term, provide the usually been neglected. Three-body forces ap-
INTERMOLECULAR FORCES 598

pear to make a large contribution to the values r are both greatly in error) which preclude its
of the third vi rial coefficients of the noble use in accurate, realistic investigations. Other
gases at temperatures close to the critical point. simple two-parameter and three-parameter po-
Polar Molecules When the molecules possess tentials have been proposed but they are in-
permanent electric moments, additional elec- sufficiently flexible to be able to account for
trostatic terms contribute to the intermolecular the results of increasingly accurate measure-
force. The dipole moments of simple polar mol- ments of numerous physical properties extended
ecules, such as LiH, can be calculated with over wide ranges of the experimental variables.
considerable accuracy from the Schrodinger As an example of the multiparameter po-
equation and compared with experimental val- tential functions which have been employed,
ues deduced from the temperature dependence we mention the so-called Morse-spline-van der
of the dielectric constant. The potential energy Waals function which uses a Morse potential
of alignment of two polar molecules varies as for the short-range interaction, a combination
r- 6 and is usually smaller than the dispersion of r- 6 and r- 8 for the long-range interaction,
energy, but in the important case of H2 0 at and a spline interpolation formula to join the
20 0 e the alignment energy is four times as two segments. Even more elaborate functions
large as the dispersion energy. The nonspherical are in current use, some containing as many as
shape of polar molecules and the noncentral 12 adjustable parameters.
character of the electric force greatly compli- Experimental Methods The types of ex-
cate the mathematical analysis for most physical perimental measurements that can be used to
properties. Permanent electric quadrupole mo- derive information about intermolecular forces
ments Q have also been calculated and mea- include the following: (1) thermodynamic prop-
sured, and their contributions to intermolecular erties of gases, (2) transport properties of
forces through dipole-quadrupole and quad- gases, (3) equilibrium properties of solids and
rupole-quadrupole interactions are important in liquids, (4) molecular beam scattering cross
special cases, e.g., for H2 0. Methods of mea- sections, and (5) spectroscopic and x-ray dif-
suring Q include molecular beam radiofre- fraction data. Although much used in the past,
quency resonance, microwave pressure broaden- the calculation of lattice energies, lattice dis-
ing, collision-induced absorption, nuclear tances, compressibilities, and elastic constants
magnetic resonance, and second dielectric virial of solids from assumed pair potentials cannot
coefficients. be regarded as an accurate method of de-
Empirical Potential Functions Enough has termining intermolecular pair potentials be-
been said to indicate that the nature of the cause of uncertainty with regard to the contri-
intermolecular forces is qualitatively well under- butions from many-body interactions, i.e., from
stood but that accurate quantitative results nonadditivity effects. The use of the radial
cannot be obtained from wave mechanics except distribution function g(r) of a liquid, deter-
in a few special cases. The most detailed in- mined from x-ray and neutron diffraction ex-
formation about the actual magnitudes of inter- periments, in conjunction with the vapor pres-
molecular forces is obtained by the empirical sure near the triple point, provides information
procedure of (I) choosing an intermolecular about the pair potential, and one may also use
force law, with two or more adjustable con- g(r) in combination with the isotopic separa-
stants, on the grounds of physical plausibility tion factor in liquid-vapor equilibrium to test
and mathematical convenience, (2) selecting a the repulsive region of the potential. The x-ray
macroscopic property that can be accurately diffraction results for dense gases and liquids
measured and accurately calculated from the can be analyzed in terms of theories of dense
force law assumed, and (3) using the results of fluids (e.g., Percus-Yevick) to deduce a numer-
measurements over a wide range of experimental ical intermolecular potential. All of these liquid-
conditions to fix the values of the adjustable state methods are subject to uncertainties with
parameters in the force law. The most popular respect to nonadditivity effects.
of the empirical potential functions has been Although the latter may be small, it is safer to
the Lennard-Jones (12,6) potential deduce the pair potentials from phenomena that
involve only binary interactions. In this category
u(r) = 4€ [(a/r)12 - (a/r) 6 ), € == -u(rrn). are the second virial coefficients B(T), the
viscosity coefficients fI(T), and other gas
Here a and € are the adjustable parameters. transport properties. The values of B(T) ex-
The 6th-power term represents the dispersion tending from high to low temperatures do not
forces and the 12th-power term, the repulsive provide a sensitive test for assumed forms of the
forces. The parameter € is the maximum energy pair potential; even the square-well and (12, 6)
of attraction, and a is the value of r for which potentials give a good representation of B(T)
u (r), the potential function, is zero. Although data. It is also possible to fit viscosity data over
mathematically convenient and still widely used, a wide temperature range with simple potential
this pair potential is now known to have serious functions; former disagreements at high tem-
defects (e.g., its behavior at small r and at large peratures have recently been shown to have re-
599 INTERNATIONAL SOLAR-TERRESTRIAL PHYSICS PROGRAMS

suited from experimental errors. However, the References


parameters of the simple potentials that give Hirschfelder, 1. 0., Curtiss, C. F., and Bird, R. B.,
the best fit to B(T) measurements differ from "Molecular Theory of Gases and Liquids," New
those needed to fit the viscosity data. Other York, John Wiley & Sons, 1954.
transport properties which provide useful in- Hirschfelder, J. 0., Ed., "Intermolecular Forces,"
formation about the pair potential are the Adv. Chern. Phys., 12 (1967).
isotopic thermal-diffusion coefficient, the iso-
"Intermolecular Forces," Disc. Faraday Soc., 40
topic mutual-diffusion coefficient, and the ther- (1965).
mal conductivity (of monatomic gases) for
like-molecule interactions and the mutual-dif- Margenau, H., and Kestner, N. R., "Theory of Inter-
fusion coefficient for unlike-molecule interac- molecular Forces," Oxford, Pergamon Press, 1971.
tions. Schlier, C., in Eyring, H., Ed., Ann. Rev. Phys.
The most direct method of investigating the Chern., 20,191-218 (1969).
force between two molecules is to measure the
elastic-scattering differential and total cross Cross-references: CRITICAL
PHENOMENA, GAS
LAWS, LIQUID STATE, MOLECULAR SPEC-
sections at different energies. Molecular beams
formed from neutralized accelerated ions may TROSCOPY, SOLID-STATE PHYSICS.
be used with stationary gas targets to determine
the repulsive wall of the potential at high
energies. In order to investigate the region of INTERNA TIONAL SOLAR-TERRESTRIAL
the potential near its minimum, it is necessary PHYSICS PROGRAMS
to employ energies close to thermal, and the It has been recognized for some time that the
motion of the target molecules then precludes sun ultimate source of practically all the energy
the use of a single beam. Two molecular beams utilized on earth (the only current exception
passing through velocity selectors and inter- being the relatively small amount of power pro-
secting at right angles have been used but much duced by nuclear fuels), influences many earthly
greater intensity is available from crossing two phenomena. Aside from the obvious solar con-
supersonic nozzle beams. Differential elastic- trol of the weather through visible and invisible
scattering cross sections measured by the latter light -electromagnetic radiation -continuously
method, showing well-resolved rainbow struc- emitted by the sun, other less well-understood
ture and symmetry oscillations, have recently effects occur. For example, the so-called earth
provided accurate potential functions for pairs storm (not to be confused with a weather
of rare-gas atoms (Lee et aI., 1972). disturbance in the troposphere near the ground)
Molecular spectroscopy furnishes a precision results from the arrival not only of radiation
tool for investigating the potential energy curves but also of matter-streams of electrically
for different electronic states of diatomic mol- charged particles-spewed out by the sun follow-
ecules, but these interatomic potentials refer to ing violent eruptions or flares in the chromo-
intramolecular rather than intermolecular inter- sphere, just above the sun's visible surface
actions and will not be further discussed. The (photosphere ).
pressure broadening of spectral lines and the A multitude of geophysical effects arise from
phenomenon of collision-induced absorption, the interactions of these radiation and particle
measured in the microwave and infrared regions fluxes with the upper reaches of the earth's
of the spectrum, provide additional methods for atmosphere, manifesting themselves as phen-
studying intermolecular forces. These methods nomena such as magnetic storms, radio dis-
are not accurate but they have yielded useful burbances and auroral displays. The frequency
information about molecular quadrupole mo- of occurr~nce of these transitory happenings
ments. A recent and important spectroscopic in the upper atmosphere waxes and wanes as
method of determining the intermolecular po- the level of solar activity changes during the
tential near its minimum, which has been well-known sunspot cycle (approximately 11
applied to Ar and Ne, is to measure the ultra- years).
violet absorption bands that correspond to The phenomena that occur in the earth's
electronic transitions out of the ground state of upper atmosphere (i.e., the entire region above
the dimer (Ar2 and Ne2). The spectroscopic the troposphere) obviously recognize no
constants derived from the vibrational levels of international boundaries. Thus, it was natural
the ground state provide accurate information that the earliest international scientific programs
about the bowl of the potential curve including were organized by geophysicists. In 1625,
the well depth (10), the curvature at the mini- Francis Bacon emphasized the need for "experi-
mum and the anharmonicity (Tanaka and ments in concert." One early effort, organized
Yoshino, 1970, 1972). The best-known inter- by Friedrich von Humboldt in early 19th
molecular potential, for the interaction between Century, investigat~d the earth's magnet.ism by
two Ar atoms, currently has an accuracy of means of many SImultaneous observatlOns at
about ±4 per cent in the well depth. widely different locations. Karl Friedrich Gau~s
R. D. PRESENT subsequently founded the G6ttingen MagnetIc
INTERNATIONAL SOLAR-TERRESTRIAL PHYSICS PROGRAMS 600

Union in 1834 for studies of geomagnetic to what we now know as an exceedingly com-
variations by a network of 44 magnetic obser- plex region surrounding it, its program em-
vatories dispersed around the world. braced many, sometimes overlapping, scientific
IPY 1 The First International Polar Year disciplines. Thus, in addition to investigating
(IPY) in 1882-83 was the original direct the properties of the earth's interior (seismol-
progenitor of the current mode of conducting ogy, latitude and longitude, and gravimetry)
international cooperative scientific enterprises and studying its surface characteristics (ocean-
on a global scale. Its program, confined to the ography and glaciology), the program embraced
Arctic and the approaches to the Antarctic, was observations in and beyond the earth's atmo-
limited to meteorological, geomagnetic, and sphere, including investigations of sun-earth
auroral observations at 14 stations established relationships (meterology, nuclear radiations,
by 11 participating countries. The most com- geomagnetism, ionosphere, aurora and airglow,
monly cited result was the discovery of the oval- cosmic rays, and solar activity). An additional
shaped auroral zone-a band with geomagnetic discipline, rockets and satellites, was defined in
latitude roughly 65-70° where this phenomenon terms of technique rather than scientific sub-
is most prevalent. ject matter. As a consequence, the Space Age
IPY 2 The first IPY was followed 50 years was spawned. And, to increase basic knowledge
later by the Second International Polar Year in about the solar influences acting upon the earth,
1932-33, with 44 participating nations. Planned IGY was planned to cover a period at or near
and coordinated by the International Meteoro- solar maximum. (See also GEOPHYSICS and
logical Organization, the second IPY program SOLAR PHYSICS.)
included stations not only in the polar regions, As hoped, IGY did take place at a peak of
but at lower latitudes as well, and investigated sunspot activity. Propitiously, it was not only
for the first time the role played by polar the maximum for that particular eleven-year
phenomena in the concert of worldwide effects. cycle-but the level of solar activity in 1957-58
To the three types of observation conducted has probably not been matched at least since
during the First International Polar Year was Galileo first observed sunspots in the early
added a fourth one, the new field of ionospheric 17th Century.
research which had been opened up by the de- IQSY The International Years of the Quiet
velopment of radio during the intervening Sun (IQSY) in 1965-66 was to be the sequel
period. of IGY aimed at catching the myriad inter-
IGY The International Geophysical Year related changing phenomena of sun, space, and
(IGY) in 1957-58 was originally conceived as earth at the sun's nadir of activity. But the
a third Polar Year, but it was generally agreed later enterprise was not a small-scale repetition
during the early planning phases that the terms if its illustrious predecessor. On the contrary,
of reference should be extended beyond a study in the fields that it embraced, the level of effort
of polar influence to include the whole earth far exceeded that which it had been possible to
and its far-reaching atmosphere. IGY was a attain only seven years earlier.
huge success, and its impact upon the develop- However, in a real sense, IGY had set the
ment of many fields of science was tremen- stage for IQSY. A new scientific discipline,
dous and permanent. It was truly distinctive solar-terrestrial physics (STP)-alternatively
in that it initiated the first extensive program called solar-terrestrial relationships or solar-
of in situ investigations of the earth's tenuous terrestrial research (STR)-was emerging to em-
envelope utilizing the rapidly developing new brace all processes whereby diverse forms of
technology for transporting scientific instru- energy generated by the sun influence the
ments into space. Furthermore, it opened up terrestrial environment, with the resulting com-
Antarctica as a scientific treasure trove. plex interplay of physical-chemical processes in
Responsibility for some part of the IGY every element of the sun-earth system.
program was shared among 67 nations operating The purpose of IQSY was threefold. Some of
4,000 primary stations and an equal number of the studies tnat were conducted were feasible,
secondary posts. It is estimated that approxi- or are best undertaken, ony at a time of solar
mately 30,000 scientists, plus another 30,000 minimum. Others were concerned with observ-
volunteer observers, participated. Although ing in detail isolated solar events, uncompli-
there is, of course, no satisfactory index of the cated by the superposition in time of a number
productivity of a scientific enterprise, it is of concurrent outbursts. Finally, some investi-
noteworthy that more than 3,000 articles on gations provided data characteristic of solar
IGY results were published in scientific journals minimum conditions as benchmarks for com-
by United States workers alone. The vast wealth parison with data observed previously during a
of data was deposited at the several World Data most remarkable solar maximum.
Centers which were established during IGY and Actually, at sunspot minimum solar out-
which still continue to be utilized extensively. bursts do not cease completely, but when they
Since the broad objective of IGY was to do occur their effects can be observed under
study every aspect of the earth as a planet, relatively "clean" conditions, free from con-
from the properties of the terrestrial interior fusing interferences. Hence, the entire sequence
601 INTERNATIONAL SOLAR-TERRESTRIAL PHYSICS PROGRAMS

of events associated with a single outburst can to plan and organize the international coordina-
be followed through all of its aspects. Further- tion of international projects of those research
more, smaller effects, previously lost in the high- workers throughout the world who are interested
level background of activity, become discern- in solar-terrestrial physics, and who wish to
ible. Thus, many IQSY projects took advantage concentrate attention on some of the more
of this period to make observations with greatly important problems that still awaited solution.
increased resolution or "amplification." Several representative recent programs will be
The IQSY program was organized into eight cited here.
disciplines, although in most cases there were MIDDLE ATMOSPHERE PROGRAM (MAP)
overlapping interests, and no sharp boundary The Middle Atmosphere Program (MAP) is
lines were drawn. These were: meteorology (see an international cooperative enterprise de-
METEOROLGY), geomagnetism (see GEOPHYS- veloped under the auspices of the Scientific
ICS), aurora (see AURORA AND AIRGLOW), Committee on Solar-Terrestrial Physics of the
airglow (see AURORA AND AIRGLOW), iono- International Council of Scientific Unions and
spheric physics and radio astronomy (see ION- the World Meteorological Organization. Sched-
OSPHERE AND RADIO ASTRONOMY), the sun uled to run from January 1982 through Decem-
and the interplanetary medium (see SOLAR ber 1985, its chief objective is a comprehensive
PHYSICS AND SPACE PHYSICS), cosmic rays and understanding of the structure, chemistry,
geomagnetically trapped radiation (see COS- energetics, and dynamics of the middle atmo-
MIC RAYS AND MAGNETOSPHERIC RADIA- sphere (defined for MAP purposes as the region
TION BELTS), and aeronomy (see PLANETARY extending upward from the tropopause, includ-
ATMOSPHERES). ing the stratosphere and mesosphere, to the
The sponsorship and supervision of the lower thermosphere, i.e., from about 10 to 100
65 worldwide networks of more than 2,000 km). Contemplated studies include ozone
IQSY stations were the responsibility of 71 climatology; stratospheric composition; meso-
participating committees, as they were called, spheric composition; basic climatology of the
established by the Academy of Science or an middle atmosphere; planetary waves in the
equivalent body in each country. Innumerable middle atmosphere; equatorial waves; tides,
balloons, some 500 rockets, and about 140 gravity waves, and turbulence; troposphere-
satellites and space probes were launched during stratosphere coupling; the influence of middle-
the two-year campaign. The IQSY bibliography atmospheric conditions on lower-level climate;
lists 5,359 publications, which themselves aerosol formation and properties; solar radia-
formed the basis for an untold number of subse- tion, especially in the ultraviolet; the effects of
quent papers in the scientific literature. energetic particles and x-rays on the middle
IQSY was the final broadly based interna- atmosphere; ion composition; and the electro-
tional scientific undertaking. It became evident dynamics of the middle atmosphere. These
that future cooperative ventures should be fourteen areas have been tagged observational
somewhat more sharply focused, and the or experimantal MAP initiatives (MIs). The
mechanism for accomplishing this was the es- operational basis for MAP is an international
tablishment of several permanent international plan for coordinated observations from space-
committees arising from the temporary single craft, rockets, balloons, aircraft, and ground-
purpose, national and international groups based facilities. This plan encourages cooperative
that had been formed specifically to organize data management, information exchanges on all
IGY and IQSY. Each of the latter were dis- appropriate time scales, and interaction between
solved after their missions, including the publi- observers, modelers, and theoreticians.
cation of their respective annals, had been THE INTERNATIONAL MAGNETO-
completed. SPHERIC STUDY (IMS) The International
IUCSTP The modus operandi of IQSY has Magnetospheric Study (IMS) is a highly co-
continued to be employed for mounting ordinated, cooperative program designed to pro-
imaginative interdisciplinary and international vide quantitative new information about cer-
research projects. Thus, when in 1966 the tain key problems involving plasmas in the
IQSY program was drawing to an orderly con- geomagnetic field. The IMS formally began in
clusion and its international organization was January 1976, and the primary data-collection
closing down, a new and continuing body, the period ran through 1979. A five-year follow-up
Inter-Union Commission on Solar-Terrestrial program for data reduction and analysis was
Physics (IUCSTP) was established by the subsequently recommended by the IMS Steering
International Council of Scientific Unions Committee, and is in progress.
(ICSU) to plan and coordinate future programs About 50 countries are participating in the
in solar-terrestrial physics. IMS, and the four primary IMS spacecraft
ISCU, from among the several interested (ISEE-l, -2, -3 and GEOS-2) were successfully
unions among its membership, had established launched by the United States (NASA) and the
the bodies that were responsible for the inter- EUiopean Space Agency (ESA). Japan and the
national coordination of IGY and IQSY. Soviet Union also provided new spacecraft for
IUCSTP was established on a permanent basis the program. The key ingredients of the IMS
INTERNATIONAL SOLAR-TERRESTRIAL PHYSICS PROGRAMS 602

were these dedicated new spacecraft and several 2. "Annals of the IQSY," Cambridge, Massachusetts,
sophisticated new ground-based instrument M.LT. Press, 1969.
arrays, which greatly advanced international 3. Pomerantz, Martin A., "The IQSY and Solar-
data-exchange standards by using simultaneous Terrestrial Research," Proc. Nat. A cad. Sci. 58,
data collection, processing, and dissemination 2136 (1967).
in near real time from central data facilities. 4. "Solar-Terrestrial Research for the 1980's," Wash-
Incoherent-scatter radars were operated at high ington, D.C., National Academy Press, 1981.
latitudes, and a variety of ground-based, bal- 5. "The International Magnetospheric Study: Report
loon, rocket, and aircraft experiments were of a Working Conference on Magnetospheric
conducted. While the IMS required a sizable Theory," Washington, D.C., National Academy of
number of specific new facilities, it also bene- Sciences, 1979.
fited greatly from the effective use of previously 6. Handbook for Middle Atmosphere Program, Scien-
planned programs and from continued opera- tific Committee on Solar-Terrestrial Physics (SCOS-
tion of certain older satellites and ground net- TEP), C. F. Sechrist, Jr., Ed., Urbana, IL 1981.
works. The IMS Central Information Exchange
(IMS CIE) Office was established in the World Cross-references: AIR GLOW, AURORA, COSMIC
Data Center-A in Boulder, Colorado, and re- RAYS, GEOPHYSICS, IONIZATION, IONOSPHERE,
gional offices were set up in Paris, Moscow, and PLANETARY ATMOSPHERES, MAGNETOSPHERIC
Tokyo; a monthly Newsletter transmitted RADIATION BELTS, RADIO ASTRONOMY, SOLAR
schedules, plans for spacecraft and ground- PHYSICS.
based measurements, and general IMS news to
several thousand scientists. NASA also estab-
lished the Satellite Situation Center, which dis-
seminated detailed trajectory data for all IMS
spacecraft. IONIZATION
SOLAR MAXIMUM YEAR (SMY) During
the next period of maximum solar activity Ionization is the name given to any process by
following IQSY, programs were conducted in which a net electrical charge may be imparted to
which several score observatories coordinated an atom or group of atoms. In the case of liquid
their observations with each other, and with solvents, molecules or ionic salts become disso-
spacecraft in particular. The NASA Solar Maxi- ciated to form positive and negative ions. This
mum Mission (SMM), carrying a new state-of- ionization process is known as electrolysis, and
the-art set of instruments, ran from August the name electrolyte is given to the solute or to
1979 through February 1981. Observing the conducting solution. The study of electroly-
schedules were arranged by the SMY Steering sis is embodied in the subject of electro-
Committee with special observational intervals chemistry. Of great interest in recent years has
being scheduled for the three activities sub- been the study of ionized gases. Rockets, hyper-
sumed under the SMY, namely the Flare Build- sonic flight, and space physics have spurred
up Study (FBS), with emphasis on distinguish- investigations of plasmas, shock waves and high-
ing the characteristics of localized solar active temperature chemical processes arising in a
regions that subsequently produce flares variety of terrestrial and celestial phenomena.
from those that do not; Study of Energy Indeed, ionized gases make up a major portion
Release in Flares (SERF), with emphasis on the of all matter in the universe. Our chronic need
flare process itself; and Study of Traveling for new energy sources has transformed the
Interplanetary Phenomena (STIP), with empha- speculation of a controlled thermonuclear fusion
sis on the aftereffects of flares and other activereaction into one of the greatest research efforts
phenomena. Coordination was effected by telex in history. These and other considerations have
from several SMY control centers. Following induced a vigorous growth in the study of ioniza-
the pattern of previous programs, the partici- tion phenomena.
pants in the SMY engaged in a number of Electrolytes The degree of ionization found
workshops to coordinate both their detailed in electrolytes is highly variable and depends
planning and the analysis of their data sets. upon the solute, the solvent, and the interaction
CSTR In the United States, the Committee between them. Weak electrolytes, such as many
on Solar-Terrestrial Research (CSTR), formed organic compounds, are solutes which are barely
in 1965 by the National Research Council's dissociated into ions except in the limit of in-
Geophysics Research Board, "looks after the finite dilution. Strong electrolytes are highly
health of this field" in our country, and repre- dissociated at any concentration. Ions formed in
sents the nation in International STR programs. solution may bear one or several electronic
charges. The electrochemical equivalent weight
MARTIN A. POMERANTZ is the atomic weight divided by the number of
charges carried by the ion. If electrodes are
References placed in an electrolyte and a current flows in
the external circuit, the ions with positive
1. "Annals of the IGY," New York, Pergamon Press, charges, called cations (cathode + ions), will mi-
1959. grate toward the negative electrode (cathode).
603 IONIZATION

Those ions possessing a negative charge (anions) hv ~ V, where h is Planck's constant and V is
will migrate to the positive electrode (anode). the ionization potential. The latter is defined as
The ions arriving at the cathode are neutralized the energy required to remove completely an
by the acquisition of electrons; the atoms or electron from an atom or molecule in the ground
molecules thus formed may then be evolved as state, leaving the resulting ion in its lowest state.
a gas or retained as a deposit on the electrode. Photons having energy less than V may be ab-
The cations are said to undergo reduction. Like- sorbed by atoms or molecules, giving rise to
wise, the anions experience a loss of electrons at excitation of internal states or perhaps molecu-
the anode; this process is called oxidation. The lar dissociation, or both.
quantity of electricity required to deposit one Many laboratory investigations of photoab-
gram equivalent is called the faraday, in honor sorption have been performed. One type of
of Michael Faraday (1791-1867). Based on the experiment requires the measurement of the ab-
physical scale of atomic weights, the faraday is sorption coefficient, a, of a photon beam:
numerically equal to 96 520 coulombs per
equivalent. 1=lo e- ax
As for any solid conductor of uniform cross
sectional area A and length I, the electrical re- in which 10 is the initial intensity of the beam
sistance R of an electrolyte is given by IjKA. The and I is the intensity after the beam has tra-
conductivity K is independent of geometrical versed a distance x in the absorbing gas. The
shape and size, and bears the units (ohm cm)-I . absorption coefficient may then be studied as a
Of greater importance in the study of electro- function of photon energy (i.e., wavelength).
lytes is the equivalent conductivity: It can be expressed in terms of a microscopic
cross section for absorption or for ionization as
A= "jC a function of incident wavelength. This cross
section curve for photoionization usually exhi-
where C is the concentration of the solute in bits a sharp peak at the ionization threshold.
equivalents per cubic centimeter. Plots of A Only a single, outermost electron is ejected from
as a function of concentration show very dif- an atom which absorbs an ultraviolet photon.
ferent behavior for weak and strong electrolytes. An x-ray photon generally will eject a more
The latter exhibit a limiting value of A as C tightly bound electron from one of the atom's
diminishes to zero, while the weak electrolytes inner shells.
do not. Such behavior provides insight into the Other types of experiments utilize photoioni-
nature of the ions, their mobilities and their zation to study the deionization process for the
interactions with the solvent material. ions thus formed. Of major importance are such
Formation of Gaseous Ions 'Studies in 1895 processes as electron-ion recombination:
by J. J. Thomson of the effects of newly dis-
covered x-rays on gases marked the beginning of A + + e- -+ A
a series of experiments which established the
existence of the electron and clarified many and ion-ion recombination if both charged
questions on the nature of atomic structure. species are ions.
Just as in electrolytes, both positive and nega- Ionization by Heavy-particle Collisions By
tive ions may exist in an ionized gas. In addition heavy particles is meant both atoms and mole-
to the ions, the presence of free electrons may cules and their ions, ranging in mass from the
profoundly influence the character of the gas. hydrogen atomic ion (proton) to very heavy
Negative ions may be formed by the attachment molecular systems of large atomic number.
of free electrons to a neutral atom or molecule, When two heavy particles collide with sufficient
by the dissociation of a neutral molecule into energy, one or more electrons may be ejected
positive and negative fragments, or by electron from either or both particles. In experimental
transfer upon collision of two neutral atoms or work, the target molecules or atoms are in the
molecules. Positive ions may be formed by form of a low-density gas having an energy cor-
dissociation, charge transfer, neutral-particle responding to room temperature and usually
or electron collisions, or by photoabsorption negligible compared to the energy of the pro-
(the absorption of electromagnetic radiation). jectile particles. The latter are usually obtained
Still another mechanism for the formation of through ionization of a selected gas in an ion
ions is the emission of a nuclear particle, such source, and acceleration through a large electric
as beta decay. Several of these processes are dis- potential difference E. Regardless of the mass of
cussed below. projectile particles, their kinetic energy will be
Photoionization Photoabsorption leading to equal numerically to E electron volts, if the
excitation and ionization is of interest because potential difference E is in volts and if the
of its significance in astrophysics and geophysics. particles carry but one elementary charge. One
The ionosphere is constituted of molecular and electron volt (e V) is equal to 1.6 X 10- 12 erg.
atomic ions which result from the absorption If a beam thus formed with an intensity of B
of solar ultraviolet and x-radiation. The fre- particles per second is incident on a target
quency v of the electromagnetic radiation giving chamber of area A containing N target particles
rise to ionization must satisfy the relation as a low-density gas, the electron ionization
IONIZATION 604

current i which is released is given by produced by collimated beams of electrons in-


cident on heavy particles. Whether the target
i = BNa-/A electrons/ sec. particles are atoms or molecules, the ionization
is usually by the removal of single electrons
This equation defines the effective ionization from the outer most shell, as in photoionization.
cross section a-. For energies at which multiple As a function of the incident electron energy,
ionization is improbable, a_ approaches the cross the ionization cross section rises rapidly from
section for singly charged ions. Often the dis- zero for energies just below the ionization po-
tinction is made between ionization of the tar- tential and increases to a maximum value in the
get particle and the beam particle. Ionization of neighborhood of 50 to 100 e V; thereafter it
the latter is referred to as stripping. decreases slowly and monotonically with in-
In the quantitative description of heavy- creasing electron energy. Since an electron with
particle collisions, one usually introduces the a given energy travels at much higher speed than
concepts of the (a) laboratory and (b) center-of- does a heavy projectile of the same energy, the
mass (CM) coordinate systems. The laboratory electron collision induces a much more rapid
system is used to describe the motion of the perturbation on the target's orbital electrons.
particles as would be viewed by an observer Thus, a larger ionization cross section at low
standing at rest in the laboratory. The origin energies is to be anticipated.
of the CM system moves with the center-of-mass Much of the definitive work on electron im-
of the two-particle system. If m and M are the pact ionization was performed in the 1930's by
masses of the projectile and target particles, Tate, Smith, and Bleakney (cf. reference 4).
the former moving with a velocity v much Collective Processes If, as in a glow discharge,
greater than the target velocity, the center-of- a large number of charged particles are created,
mass velocity Vc is the collective interactions of these particles with
each other and with external fields may permit
Vc = mv/(m + M) = I1V/M the charged fluid to exhibit very unusual and
distinct properties. An ionized gas possessing
The latter relation defines 11, the reduced mass. both positive and negative charges is called a
The kinetic energy in the CM system is !I1V 2 , plasma if the distance over which the gas can
which is the projectile energy multiplied by have an appreciable departure from charge
M/(M + m). For the case in which target and neutrality is small compared to the dimensions
projectile are identical, the kinetic energy in the of the gas. This distance is described by the
CM system is half that of the projectile. Using Debye-Huckel radiUS, a quantity borrowed from
the law of conservation of linear momentum, the theory of strong electrolytes, which charac-
one may show that the kinetic energy in the terizes the decay of the shielded Coulomb
CM system is the maximum energy available for potential surrounding the ionized particles of the
excitation and/or ionization. fluid. If the charge neutrality in a plasma is
Collisions in general are classified as (a) elastic disturbed in some manner, the electrons will be
in which no changes in internal states occur, and forced to oscillate about their equilibrium
for which kinetic energy is conserved, and (b) positions in simple harmonic motion with a
inelastic in which a part of the kinetic energy frequency characterized by the electron density.
is conve~ted to internal energy. A superelastic Longitudinal oscillation of the ions and electrons
collision is one in which internal energy is as a whole constitutes another type of motion
transformed into kinetic energy. Because of the called ion-acoustical waves. Hydro-magnetic
extreme complexity encountered in quantum- waves, which appear in the presence of a mag-
theoretical calculations of ionization cross netic field are still another form of motion not
sections for heavy-particle collisions, very little observed in a nonionized medium. The descrip-
progress has been made in this important area of tion of such phenomena goes beyond the scope
collision theory. of the ionization process.
Ion-neutral collisions may give rise to free
electrons, or simply charge exchange which., in ROBERT C. AMME
its simplest form, is expressed by the equatIOn.
A+ + B -+ A + B+ + !::.E References
The neutral particle B has been ionized, but the 1. Condon, E. U., and Odishaw, H., "Handbook of
electron has transferred to the incident ion, Physics," second edition, New York, McGraw-Hill
neutralizing it. The energy !::.E released in this Book Co., 1967.
process is the difference between the ionization 2. Spitzer, L., "Physics of Fully Ionized Gases,"
potentials of the neutral particles A and B. For Interscience Tracts on Physics and Astronomy,
the case in which A and B are identical, !::.E = 0 New York, Interscience Publishers, 1956.
and the process is called symmetric resonant 3. Loeb, L. B., "Basic Processes of Gaseous Elec-
charge transfer. A t low ion beam energies, this tronics," Berkeley, University of California Press,
transfer proceeds with a large cross section. 1955 .
Ionization by Electron Impact Of great im- 4. McDaniel, E. W., "Collision Phenomena in Ionized
portance in atomic physics is the ionization Gases," New York, John Wiley & Sons, 1964.
605 IONOSPHERE

5. Hasted, J. B., "Physics of Atomic Collisions," The boundary between the E region and the
Washington D.C., Butterworth, 1964. D region is normally taken to be at around
90 km, or the lower boundary of penetration
Cross-references:
COLLISIONS OF PARTICLES, of ultraviolet radiation sufficiently energetic
CROSS SECTIONS AND STOPPING POWER, ELEC- to ionize molecular oxygen. The production
TRICAL DISCHARGES IN GASES, ELECTRO- and loss processes in the E and F 1 regions are
CHEMISTRY, IONOSPHERE, MAGNETO-FLUID- essentially identical in the region up to 120-
MECHANICS, PLASMAS. 140 km, though there is a continuous change
in the portions of the extreme-ultraviolet
(EUV) spectrum producing the ionization.
IONOSPHERE X-rays between 30 and 100 A, in particular,
ionize N2 and O2 in the E region.
The ionosphere is the region of the upper at- An important feature of the ionosphere in
mosphere in which the density of thermal the region of 100 km is the formation, on
electrons and ions is large enough to affect the occasion, of extremely thin layers of ioniza-
propagation of radio waves. The ionosphere is tion known as "sporadic E." Direct sampling
produced by the effects of ionizing radiation has shown the ions in these regions to be
from the sun, and high energy particles imping- similar to the composition of the chondritic
ing on the neutral upper atmosphere. This pro- meteors. It is believed that these meteoritic
duces photoelectrons and ions. These then ions are produced by charge exchange with
interact with the neutral atmosphere and the ambient ions and are compressed into thin
resultant densities are controlled by the pro- layers by the interaction of wind shears with
duction and loss processes as well as horizontal the Earth's magnetic field. It is this type of
and vertical transport. Ionospheres are thus a sporadic E that is occasionaly responsible for
general feature of planetary atmospheres. long-distance television reception.
The ionosphere of the earth was divided up The F2 layer, which is generally regarded as
into vertical regions based on radio soundings beginning at about 140-150 km, is responsible
which are sensitive to the gradient in the elec- for by far the largest portion of the total
tron density; these regions also correspond to ionization. In the lower regions the molecular
regions of different ion chemistry. Typical ions NO+ and O2+ predominate, but in the
values of the densities in the earth's ionosphere region of the peak and above 0+ becomes more
are shown in Fig. 1. important. In this region of the neutral at-
The lowest region of the ionosphere was called mosphere atomic oxygen is the major con-
the D layer to allow for up to three lower layers stituent and 0+ is the major ion produced.
had they been discovered later. Ionization due The most important loss process for this ion
to galactic cosmic rays does penetrate to the is charge exchange with O2 and N2 , and this
ground, and below theD region the conductivity proceeds much faster at lower altitudes where
decreases steadily. This ionization in the region the molecular densities are larger. Vertical
of 30-50 km is detectable in in-situ observa- transport is extremely important, with most
tions and is often referred to as the C layer. In ions being produced above the peak and re-
the lower D region between 50 and 80 km, al- combining below the peak. The effect of
though N2 + and O2+ are the major ions pro- neutral winds and electric fields on this trans-
duced, the predominant positive ions are those port is large and consequently the layer displays
of water clusters, H30+' [H 20] n' During the a very complex morphology with large varia-
day the negative charge is mainly carried by tions in geographic and magnetic coordinates
electrons, but at night negative ions become as well as altitude, time, and season. Other large
important. This is the only region of the fluctuations occur whose cause is not at present
ionosphere in which negative ions are numeri- well understood. As shown in Fig. 1 there are
cally important. large systematic changes in the F region between
In the upper D region the positive ions are sunspot maximum and minimum following the
mainly NO+ and O2+, as they are in the E and solar cycle. These are due to the large variations
Fl regions above. The D region has very dif- in the neutral thermospheric densities as well as
ferent behaviors during times when the sun is to the changes in the EUV ionizing radiation.
very active and when it is quiet. The reason for Above the region where 0+ predominates is
this was discovered by Nicolet and Aikin (1960) the protonosphere. At the lower boundary the
to be that when the sun is active the X-ray protons undergo charge exchange reactions
emission is enhanced by orders of magnitude with neutral atomic oxygen and neutral hydro-
and is the dominant production source. Under gen charge exchanges with atomic oxygen ions.
quiet conditions the Lymann alpha line of the This results in a generally downward flux of
sun's radiation, which varies little with solar protons when the F layer is decreasing and an
activity, dominates. This line, which is of too upward flux when it is increasing. Because the
Iowan energy to ionize any of the major con- atomic oxygen ions are much heavier than the
stituents above these altitudes, penetrates to electrons the equilibrium profile of the densi-
the D region and ionizes a very minor neutral ties in which the fluxes produced by diffusion
constituent, nitric oxide. would be balanced by gravity are very different.
IONOSPHERE 606

IOOO~'-~Tn~--~~~~~-rrn~~~~Tn~--r.,,~

800
600

400
::i<
:.:: 300
w
o
~ 200
5
«
100
- - - - . . . - - E REGION

o REGION
0~9--'--'-.L..L.J..J..I.I10
590...,7;--L......L'-l...Ju..u.LJ10-;:8--'--'-....L...L.L.I.L1.... ....I""O-'--'-J...,L.JCLI..I.10
..1I;---'L......L........u..u.u1012

ELECTRON DENSITY PER CUBIC METER


FIG. 1. Electron density at mid-latitudes at noon for high and low solar activity.

A vertical electric field is set up which keeps sions with higher energy particles. Some of this
the two distributions the same. This electric excess energy is radiated and results in aurora
field has a large effect on the protons in the and the airglow radiations in the infrared, the
region where 0+ predominates. In particular visible, and the ultraviolet (see AURORA and
at high latitudes where the magnetic field lines AIRGLOW). The major source of visible illumi-
are open to the solar wind, the protons are nation, for example, on a moonless night is
accelerated outward, leading to the escape of from the 630 nm radiation from atomic oxygen
hydrogen. This is balanced by a poleward wind in the F region of the ionosphere.
in the protonosphere. Electrical currents flowing in the ionosphere
The ionosphere of Mars and Venus, despite cause perturbations in the earth's magnetic field
their lower atmospheres which are quite dif- that can be measured at the ground. There are
ferent from that of the earth, also have an 0+ two main current systems. The "dynamo sys-
"F2 region" and an O 2+ "Fl region." The tem" is produced by E region winds blowing
major difference is that, unlike the earth, the the ions and electrons across the earth's mag-
absence of a strong planetary magnetic field netic field. The "solar wind" of plasma flowing
allows the solar wind to interact directly with outward from the sun produces a voltage across
the ionosphere, producing an extremely turbu- the polar caps resulting in field-aligned Birke-
lent and variable upper boundary. land currents entering the ionosphere at high
The thermal state of the ionosphere is quite latitudes. The resulting ionospheric currents
interesting because the electrons, ions, and are enhanced and can penetrate to low latitudes
neutral particles all have different temperatures during disturbed times.
above about 200 km in the earth's ionosphere The ionosphere is important for ground-to-
due to small collision frequencies between ground propagation because it can refract or
species, the very different energy inputs, and reflect radio waves and thus makes communi-
the importance of vertical transport. That part cation beyond the horizon possible. For satellite-
of the excess energy given to the photoelec- to-ground communication the combined effects
trons produced in the ionization process which of refraction and electron density irregularities
is not lost in inelastic collisions with the neutral produce scintillations. These can be a problem
gas is transmitted preferentially to the ambient especially if the ray path crosses the equatorial
electrons, which in turn heat the ambient ions. region. For satellite navigation systems refrac-
As a result, above about 200 km, in the day- tive errors have to be compensated for. Radio
time or when photoelectrons are entering the astronomy from the ground is limited to fre-
upper ionosphere from the conjugate hemi- quencies below the critical frequency of the F
sphere, the electrons are hotter than the ions, region.
and the ions are hotter than the neutrals. Be-
cause of the comparative ease with which the JOHN S. NISBET
electron and ion temperatures and densities can
be measured remotely by radio waves these References
inputs provide a very valuable tool in studying
the energy balance of both the ionosphere and Al'pert, Ya. L., "Radio Propagation and the Iono-
the neutral atmosphere. sphere," 2nd Ed., Plenum, New York, 1973.
Many of the ions in the ionosphere are pro- Banks, P. M., and Kockarts, G., "Aeronomy," Aca-
duced in excited states, or are excited by colli- demic Press, New York, 1973.
607 IRRADIANCE (ILLUMINANCE) CALCULATION

Bauer, S. J., "Physics of Planetary Ionospheres," curvatures called the parallel normal curvature
Springer-Verlag, Berlin and New York, 1973. and the perpendicular normal curvature of the
Nicolet, M., and Aikin, A. C., "The Formation of the wave front. The parallel normal curvature is the
D Region of the Ionosphere," J. Geophys. Res. 65, curvature of the curve in the wave front whose
1469 (1960). tangent is a tangent of the surface and lies in
Nisbet, J., "On the Construction and Use of a Simple the plane of incidence at the point of refrac-
Ionospheric Model," Radio Science 6, 437 (1971). tion. The perpendicular normal curvature of the
wave front is the curvature of the curve in
Cross-references: AIR GLOW, AURORA, INTERNA- the wave front whose tangent is a tangent in the
TIONAL SOLAR-TERRESTRIAL PHYSICS PRO- surface and is perpendicular to the plane of
GRAM, IONIZATION, PLANETARY ATMO- incidence at the point of refraction of a ray.
SPHERES, MAGNETOSPHERIC RADIA TION There is an equation that relates the parallel
BEL TS, PLASMAS, SPACE PHYSICS. normal curvature of the incident wave front to
the parallel normal curvature of the refracted
wave front. A second equation relates the cor-
responding perpendicular curvatures of the in-
cident and refracted wave fronts. Two other
IRRADIANCE (ILLUMINANCE) equations relate the torsions of these respective
CALCULA TION curves. We call these the generalized lens equa-
Introduction In geometrical optics the paraxial tions. The two torsion equations can be shown
lens equation is applied at each surface to rough to reduce to one, leaving three equations. From
out a design. To determine the final image qual- the generalized lens equations one can relate
ity a designer may trace hundreds of nonpar- the principal curvatures of the wave front after
axial rays to obtain a spot diagram in the image refraction to the principal curvatures of the
plane. Each spot is the point of intersection of wave front before refraction, since the principal
a ray from a point on the object with the image curvatures of a surface are related to the nor-
plane, and the spot density is a measure of image mal curvatures of the surface through an equa-
spread and therefore of image quality. tion called Euler's equation. When the principal
A formula is available, however, for specify- curvatures are thereby calculated one has the
ing the flux density, that is, the irradiance along necessary data to evaluate the flux equation
each traced ray. 1 It is also applicable to ray and also to locate points on the caustic surfaces
acoustics and radiant heat transfer. For example, associated with any ray, skew, or meriodional,
the formula enables one to vary a parameter, in an optical system.
trace a new ray and immediately calculate the For the special case of meridional rays, the
irradiance along that ray. To illustrate the ideas, torsions of the normal curves are zero so the
we derive the formula here, referred to as the torsion equations do not appear. Also the nor-
/lux equation, for the special case of a two- mal curves and their associated curvatures be-
dimensional geometry. This result is useful by come identical with the principal curves and
itself since it gives some of the characteristics their curvatures. Therefore, the two generalized
of the image formed by rays in the meridional lens equations become equations that directly
plane of a three-dimensional geometry. The relate the principal curvatures of the wave front
meridional plane is the plane formed by the before and after refraction. They also relate
object point (and image point) and the optical points on the caustic surface before refraction
axis. to the caustic surfaces after refraction; the
First, omitting mathematical details, we out- caustic surfaces are therefore immediately
line the procedure for calculating illuminance determined.
in three dimensions and then proceed in detail The rather complicated three-dimensional
to the simpler two-dimensional geometry. As problem described above is greatly simplified
input to the flux equation in three dimensions in two dimensions, as we shall see. Many of the
one needs to know the principal radii of curva- key ideas are still involved, however, but can be
ture of the wave front both before and after made much clearer in two dimensions before
refraction. In any surface two curves may be proceeding to a full three-dimensional geometry.
formed at right angles to each other, one having Figure 1 shows the three-dimensional caustic
a maximum curvature and the other a minimum surfaces formed by collimated rays after refrac-
curvature; they are called the principal curva- tion by a spherical surface. A hornlike surface
tures of the surface. The radii of curvature are is formed (tangential caustic) and a caustic
the reciprocals of the principal curvatures. The spike (saggital caustic). The latter is associated
centers of the radii of curvature for a wave with the radial convergence of rays through the
front emerging from a refracting surface define optical axis because of the axial symmetry. If
converging points for the wave front. They lie the source lies off the axis, the caustic spike
along the refracted ray vector and their loci, also spreads out into a hornlike surface. In a
for a series of ray vectors, trace out two surfaces strict two-dimensional geometry only the
called caustic surfaces . In three dimensions, in tangential caustic appears.
optics, the principal curvatures after refraction Two-Dimensional Geometry We now derive
may be obtained from the values of a pair of an equation that relates the curvature of a two-
IRRADIANCE (ILLUMINANCE) CALCULATION 608

FIG. 1. Caustic cone and caustic spike along optical axis fonned by
incident collimated rays after refraction. In a 2-dimensional geometry only
a slice of the cone in the plane of the figure is present.

dimensional wave front after refraction to the The arc rate of change of the normal vector is
curvature of the wave front before refraction.
This single equation is the two-dimensional dN(s)
analogue of the generalized lens equation for - - = I<t(s). (Ie)
meridional rays in a three-dimensional geom- ds
etry. Since the center of curvature of the wave
front after refraction defines a point on the Snell's Law in Vector Form Let a ray be in-
caustic, only one caustic is formed: a caustic cident upon the surface specified by xes). A
curve, the tangential caustic. For small angles, position vector on the ray after refraction is
with the optical axis, the two three-dimensional given by
exact lens equations for meridional rays become x = xes) + rA(r)
the single paraxial lens equation. Likewise, the
single two-dimensional generalized lens equa- where A(r) is a unit vector along the refracted
tion, the exact lens equation, becomes the par ray and r is a distance along the ray. A(r) is
axial lens equation. related to A(i), the unit vector along the inci-
Definition of Curvature Let the position dent ray, and N, the unit normal to the refract-
vector xes) of arbitrary origin define the equa- ing surface, by Snell's law in vector form; that
tion of a refracting surface in parametric form. is
The parameter s may be arbitrary, but in most
derivations is the arc length along the curve. A(r) = 'YA(i) + UN. (2)
I and J are cartesian unit vectors. Then
xes) = Ix(s) + Jy(s), (1) This equation merely expresses that fact that
A(r), A(i), and N are coplanar, part of Snell's
where x (s) and y (s) are arbitrary functions of s. law, and that a vector A(r) may be expressed
The unit tangent vector to the curve is given by in terms of two independent vectors, A(i) and
N. To determine 'Y, form the cross product of
dx(s) N with the above. Thus
t(s) = - - . (la)
ds A(r) X N = 'YA(i) X N
In turn the arc rate of change of the unit tangent
vector is so that J sin cp(r) = J'Y sin cp(i), where J is a unit
vector perpendicular to the plane of the figure.
Therefore, sin cp(r) = 'Y sin Cp(i). Snell's law as
dt(s) = I<N(s) (l b) usually stated is
ds
where N(s) is the unit normal to the curve, I< is . ,f.()
SIn,/,r n(i). ,f.(')
=--SIn,/, I ,
the curvature, and 1II< is the radius of curvature. n (r)
609 IRRADIANCE (ILLUMINANCE) CALCULATION

where if>U) is the angle of incidence and if>(r) is where J is a unit vector normal to the figure,
the angle of refraction. Therefore r = n (i )/n (r), we have, after putting the coefficient of J in the
the ratio of the index of refraction on the inci- resulting equation equal to zero,
dent side to the refracted side.
To determine n, form the scalar product of K (r) cos if>(r) ds(r) - IK (i)
N with Eq. (2). Then A(r) . N = IA(i) . N + n. (3c)
Since A(r) . N = cos if>(r) and A(i) . N = cos if>(i) . cos if>(i) ds(i) - nK ds = O.
we obtain
In order that flux be conserved (except for
n = -I cos if>(i) + cos if> (r). absorption) or that the wave front map con-
tinuously across the refracting surface, the
For reflection if>(r) = 7r - if>(i) since N is taken in following constraint must hold among ds (i),
the same directional sense as A (i), that is, it is ds, and ds(r):
positive on the emerging side of the refracting
surface. Thus for reflection, dsCi) = ds cos if> (i) and ds(r) = ds cos if>(r).

I = I and n = -2 cos if>(i). (4)


With I and n now known we form the com-
Putting Eq. (4) into Eq. (3b) yields
plete differential of Eq. (2) and make use of cos 2 if>(i) nK
the fact that the ray vector is perpendicular to K (r) =IK (i) + . (4a)
the wave front. This is so because the wave front cos 2 if>(r) cos 2 if>(r)
from the point source is spherical and is perpen-
dicular to the ray vector which lies along the In terms of wave front radii of curvature, rei)
radius of the sphere. It is a general theorem in for incident, r(r) after refraction,
optics, known as the Malus-Dupin theorem,2
that if the ray vector starts out perpendicular I I I
K(r)=-, K(i)=- K =-
to the wave front it remains so after any number r(r) rei) , R
of refractions. Thus A (i), the unit ray vector
before refraction, is also the unit normal to the we have
wave front before refraction; likewise A (r), the
unit ray vector after refraction is normal to the I I cos 2 if>(i) n
wave front after refraction. The complete dif- r(r) = rei) cos 2 I/>(r) + R cos 2 I/>(r) (4b)
ferential of Eq. (2) is
where I=n(i)/n(r) and n=-/cosl/>(i)+cos
dA (r) dA(i) I/> (r). The result applies to any surface. The cor-
- - ds(r) - I--ds(i) responding mirror equation is
ds(r) ds(i)
(3)
dN I I 2
- dnN - n- ds = 0 -=-+
r(r) rei) R cos I/>{i)
(4c)
ds '

where ds (i), ds, ds (r) are elements of length Equation (4b) is the generalized lens equation,
along the incident wave front, the refracting in that it relates the radius of curvature of the
surface and the wave front after refraction. refracted wave front at the point of refraction
By Eq. (lc) we have to the radius of curvature of the incident wave
front at the point of refraction and the radius
dA(r) dN of curvature R of the surface at the point of
--=K(r)t(r), -=Kt refraction. R may vary over the surface. The
ds(r) ds ' usual paraxial optics sign conventions apply if
(3a)
we begin by placing a minus sign in front of
dA(i) r{i) in Eqs. (4b,c). When I/>(r) =::: I/>{i) =::: 0, Eq.
- - = K (i) t(i),
ds(i) (4b) becomes the paraxial equation
where K (r), K (i), and K are the curvatures of the
-
I
=-
I
+-
n n =-I + 1. (Sa)
refracted wave front, the refracting surface, and
the incident wave front, respectively, and t(r), r(r) rei) R'
t(i) and t are unit tangent vectors to the respec- For reflection, Eq. (4c) becomes
tive wave fronts.
If we form the vector cross product of N with
Eq. (3) after inserting Eq. (3a) and use the fact
that
t(r) X N = J cos I/>(r), t X N = J, the mirror equation; r{i) is negative in both
(3b) cases, when the incident wave front is diverging
t(i) X N = J cos I/>(i) from the left.
IRRADIANCE (ILLUMINANCE) CALCULATION 610

FIG. 2. Converging element of wave front of radius of curvature r2C


after refraction at surface 2. '2C is also the distance along the refracted
ray to a point on the caustic 'zp is the radius of curvature of the wave
front incident upon surface 3.

Derivation of Flux Equation If the light which expresses continuity of flux across the
originates from a point source of strength 10 , interface, into Eq. (8c), one obtains
the element of flux d-F emitted in angle dB is
Eds(l) - dS(3) = a( l)p I PZ cos cf>(l) cos 1/>(2)
dw(l)
dF=IodB =Io--=a(l)dw(l) (6) ds(l) ds(2)
r(1 )
. cos 1/>(3) dw(2) dw(3)'
where a(1) = Iolr(l) is the flux density at posi-
tion r(l) and dw(l) the element of wave front (10)
area. When the light originates from an element
of length ds(O) radiating in accordance with 1/>(1), 1/>(2), and 1/>(3), without subscripts, are
Lambert's law, then angles of incidence at each surface; cf>r 0) and
I/>r(2) are angles of refraction; dw(1), dw(2),
a(l) = B cos 1/>(0) ds(O)lr(I) (7) and dw(3), without subscripts, denote the ele-
ments of wave front on the incident side of the
where B is the brightness and cos cf>(0) = A(O) . surfaces 1, 2 and 3; dwr (1) and dWr(2) denote
N (0). Thus the flux incident upon ds (1) is the wave front as it leaves surfaces 1 and 2.
Likewise, using
Eds(o) -ds(l) =a(1) cos (p(I) ds(l). (8a)
cos I/>r(l)ds(1) =dw r (1),
If the flux is refracted to ds(2) then the flux (11)
density over ds (2) is cos 1/>,(2) ds(2) =dw r (2),
ds(l) in Eq. (10), we have
E ds (0)-ds(2) =a(l)pI cos 1/>(1)--,
ds(2) EdS(1)-dS(3) = a(l)PIPZ
(8b) cos 1/>(1) cos cf>(2) cos cf>(3)
where PI is the coefficient of absorption or re- cos I/>r(1) cos I/>r(2)
flection at ds (I). Similarly if the flux
. dwr(l) dw r (2)
(12)
Eds(I)-dS(Z) X ds(2) dw(2) dw(3)'
incident upon ds(2) is refracted to ds(3) then The elements of wave front are related to the
the flux density over ds(3) is radii of curvature as follows:
ds(1) ds(2) dwr(l) =rlC d8 1 , dw(2)=rl p dB 1 .
Eds(o)-ds(3) = a(l)PIPZ cos 1/>0) ds(2) ds(3)
(13a)
(8c) dwr(2)=r2CdB2' dw(3)=r zp dB 2 ·
and so on for any number of surfaces. Putting (13b)
ds(2) cos 1/>(2) =dw(2), See Fig, 3 for Eq. (13b), A similar figure applies
(9) to Eq. (13a) between surfaces 1 and 2. The
ds(3) cos 1/>(3) =dw(3), radius of curvature of the wave front as it leaves
611 IRRADIA nON, DISPLACED ATOMS

I
:......1 - - - - - - r2 c - - - - - 1•.-41---- r 2 p ----.:
I
I
II I

I I
I I
I I
I I
dS(2)i :
I

FIG. 3. Ray trace showing various quantities employed in


the text.

surface I is denoted by rle. Its radius of curva- If rip ~ 0, then


ture as it is incident upon surface 2 is denoted
by rip. Likewise r2C is the radius of curvature !.J.s:.. = cos 2 ¢r(2)
of the wave front as it leaves surface 2 at ds(2) rip 'Y(2) cos 2 ¢(2) ,
and r2p is the radius of curvature of the wave
front as it is incident at ds (3). The orthogonality which is a finite result. If Eq. (4b) is always
of the ray vector and the wave front is used in written as Eq. (15), this problem is automati-
writing Eqs. (13,a,b). Putting Eqs. (13a b) into cally avoided.
Eq. (12) one obtains '
DONALD G. BURKHARD
Eds(O) -ds(3) = a(0)PIP2

cos ¢(1) cos ¢(2) cos ¢(3) References

cos ¢r (1) cos ¢r (2) 1. Burkhard, D. G., and Shealy, D. L.,Appl. Opt. 20,
897 (1981).
. !.!£ r2C 2. Born, M., and Wolf, E., "Principles of Optics,"
(14 ) p. 131, Elmsford, New York, Pergamon, 1975.
rip r 2p
Cross-references: LENS; OPTICS, GEOMETRICAL;
where rip = rl2 - rle, r2p = r23 - r2C, with an
obvious generalization to n surfaces. Equation OPTICS, GEOMETRICAL, ADVANCED; OPTICS,
(14) is the flux equation for a two-dimensional PHYSICAL.
geometry, and ric, r2C are the values of r(r) at
surfaces I and 2, respectively, 0 btained from
Eq. (4b) for refraction and Eq. (4c) for reflec- IRRADIATION, DISPLACED ATOMS
tion. This completes the derivation of the
generalized lens equation and also the flux The word radiation originally implied emana-
equation. tion of electromagnetic waves (photons, gamma
A word of caution. If the caustic curve from rays), or alpha or beta rays (helium ions or elec-
surface I intersects surface 2, then for the trons), and is used in that sense in the entries
particular ray to that point on the caustic we RADIATION, THERMAL and NUCLEAR RADIA-
TION. It is now commonly accepted in the
have rip = rl2 - riC = 0, since rip = r12. Equa-
tion (14) then predicts a singular value for the broader sense of emanation or bombardment
flux density at ds (3), for this ray, and very by any elementary particle or ion. Indeed, the
large values for the small pencil close to the ray. general subject area of the irradiation of solids
i~ called radiation effects. The primary empha-
The singularity is not real, however, and is
SIS of the present entry is on irradiation which
eliminated by writing
significantly alters the relative positions of the
atoms in a solid, in contrast to cases in which
ric r 2C (r2C) I the primary effect is a change in the state of
rip r2p = ric rip r2p ionization of the atoms (see RADIATION,
IONIZING, BASIC INTERACTIONS; RADIATION
and solving for r2C/rlp from Eq. (4b). In Eq. CHEMISTRY; and COLOR CENTERS): Electron
(4b) put r(r) = r2C and r(i) = rip, ¢(i) = ¢(2) redistribution is sufficiently rapid in metals that
and ¢(r) = ¢r(2). Then ionization effects are not observable and hence
the study of ionizing radiation normally is re-
(15) stricted to insulators and semiconductors.
'Y(2)R 2 cos 2 ¢(2) + D(2)rI P • The atoms in the majority of solids are ar-
IRRADIA TION, DISPLACED ATOMS 612

ranged in very specific packing patterns which POSITRON). These charged particles then travel
repeat three-dimensionally for distances large through the lattice like incident electrons to
compared to atomic dimensions (see SOLID create primary knock-ons. To reach threshold,
STATE PHYSICS and CRYSTALLOGRAPHY). This photon energies of the order of 1 Me V are re-
crystalline order is not usually evident: a piece quired for the two electron processes and of 5
of copper wire is quite flexible and does not MeV for pair production.
look like a "crystal." However, truly amorphous Electrons are so light compared to nuclei that
solids (supercooled liquids and glasses) are un- they can only transfer a small fraction of their
common, and the copper wire is in fact com- energy to a lattice atom in an elastic collision.
posed of many crystallites joined by grain For example, energies of the order of I MeV
boundaries, each "small" by human scale stan- (and relativistic corrections in the theory)
dards, but large enough so that the average cop- are required to reach threshold recoil energies
per atom is in a crystalline environment. Thus, in the 3d transition metals (e.g., Fe and Ni).
the title of this entry refers to the decrease in High-voltage electron microscopes operate with
this atomistic order caused by irradiation. electrons in this energy range and can be used
The simplest defects in the crystal lattice both to create and observe the effects of radia-
order of solids are point defects such as va- tion damage. With electron bombardment, the
cant atomic sites (vacancies) and extra atoms primary knock-on atoms normally do not have
crammed into locations where they should not sufficient energy to create a significant number
be (interstitials), and line defects, effectively of additional knock-ons.
wrinkles in the crystal structure (dislocations). With ion bombardment, much of the incident
Although there is still some controversy, it is particle energy can be transferred to a single
generally acknowledged that self interstitials lattice atom. At high energies, often in the tens
(i.e., not impurity interstitials) migrate free~ in of MeV range, there is a coulombic interaction
the lattice at very low temperatures « 100 K), between the incident ion and the lattice atom
while vacancies typically might become ther- cores, and the scattering is approximated by
mally mobile at 100° C. Self interstitials and Rutherford theory. Here the primary knock-on
vacancies are defined as the disturbance in the can have sufficient energy to act like an inci-
lattice order. Thus, interstitial migration does dent particle, which can lead to a cascade effect.
not involve a specific atom moving through However, most of the energy loss in this range
the lattice, but the interstitial configuration, is due to electronic excitation which, because
similar to a traveling wave, and the motion of a it is less than I MeV, does not yield displaced
vacancy occurs by a neighboring atom hopping atoms. As energies decrease, the interactions
into the vacancy, leaving its site vacant. become more like hard-sphere collisions be-
The practical significance of the stUdy of tween atoms. The changeover from Rutherford
irradiation processes in solids is related, for to hard-sphere scattering occurs in the A-keV
example, to the degradation of mechanical range, with A the atomic weight of the moving
properties of materials in nuclear reactors or ion. Although most of the energy loss is in the
electronic properties of semiconductors. To ex- Rutherford range, the majority of the displaced
plain these gross effects, details of the structure atoms are produced by hard-sphere scattering.
and properties of solids must be understood, Neutrons can produce lattice atom displace-
and so irradiation can be used as a probe to ments either by scattering processes which leave
study many fundamentals of solid state physics the atom unchanged or by nuclear reactions.
and of defects in the crystalline order of solids. For nuclear reactions, a lattice atom can absorb
As an energetic particle traverses a solid, it a thermal neutron and then undergo a radioac-
interacts with and transfers its energy to the tive transformation, such as Fe 51f(n, 'Y) Fe 59
crystal in a variety of ways. Much of the em- The iron atom recoils to conserve momentum
phasis here will be on elastic collisions which as the 'Y is emitted, with energy, for example,
impart kinetic energy to the lattice atoms. The somewhat less than a ke V, but well above thresh-
probability of significant elastic interaction, old to become a primary knock-on and act to
called the scattering cross section, is sufficiently produce a small cascade. Other reactions with
small under most circumstances that elastic col- thermal neutrons, such as (n, a), (n, ~), and
lisions are spatially well separated, and can be (n, f) (fission) also produce primary knock-ons.
treated theoretically as isolated events using Neutron scattering generiilly Involves "fast"
nuclear collison theory. If, during a collision, a neutrons, i.e., with energies in the keV and
lattice atom recoils with kinetic energy above a MeV range. As in the thermal neutron case,
threshold value, it is called a primary knock-on. lattice atoms generally receive kinetic energies
The theory for the creation of a primary well above the primary knock-on threshold. At
knock-on is dependent on the irradiating parti- the low energy end of neutron scattering, the
cle. Photons, or gamma rays, first impart energy interaction is primarily elastic, but as energies
to electrons via the photoelectric effect or by increase, lattice atom nuclei can become ex-
Comyton scattering, or transform to an elec- cited. leading to inelastic scattering.
tron and a positron by pair production (see The discussion so far has been concerned with
PHOTOELECTRICITY, COMPTON EFFECT, and the process by which a primary knock-on is
613 IRRADIATION, DISPLACED ATOMS

created, and the concept of a threshold energy fects of the lattice structure and scattering
was presented as intuitively reasonable. It is theory more realistic than hard spheres indi-
similarly clear that, given the highly ordered cates that conditions for focusing should be
crystal structure, the threshold energy must quite common.
vary with direction, i.e., the threshold energy is Another focusing type of phenomenon is
a function of orientation. A minimum value of called channeling, whereby a particle (incident
25 eV is commonly accepted as typical, with or knock-on) travels with little energy loss per
4.5 times the sublimation energy yielding a unit length down an open channel in the crystal
rough correlation with the experimental data. structure. Such channels are easily seen in ball-
Minima tend to occur near high symmetry di- and-stick lattice models. Under careful labora-
rections, with the sequence from. most to least tory conditions with properly oriented single
favorable being, for example, (110), (loa), crystals, incident particles which normally
(I I 1) in fcc copper and (IOO), (110), (II 1) in would be stopped can readily pass through a
bcc iron. foil by such channeling.
If a knock-on atom only travels a short dis- Much of the interest in radiation damage
tance in the lattice, say several interatomic spac- theory is concerned with the estimation of the
ings, it momentarily becomes an interstitial, amount and spatial distribution of the disorder
leaving a vacancy at its initial lattice site, but caused by a given irradiation. The ideas dis-
then this interstitial may spontaneously recom- cussed above have been incorporated into com-
bine with its vacancy to yield no net damage. plex formulations which give good estimates of
As noted above, interstitials of this type move the formation of damage but normally require
through the lattice as a disturbance in the order, numerical methods to solve. However, even if
not as a specific atom. The atom which returns the disorder created was accurately known, di-
to the knock-on site is generally not the atom rect comparison with experiment is extremely
initially there, so that this subthreshold event difficult because the defects are so small. Also,
can, for instance, yield mixing of ordered alloys. at all but very low temperatures, some if not all
If the interstitial is somewhat further removed of the defects are mobile so that annealing takes
from the knock-on site, a stable interstitial- place continuously. The irradiation drives the
vacancy complex, called a close Frankel pair, solid away from thermal equilibrium, but, if
is created. There is a binding energy between any kinetic pathways are available, the solid
the two defects so that they preferentially re- will adjust to thermodynamically lower its free
combine when one becomes mobile. At greater energy.
separations yet, the interstitial and vacancy do The simplest annealing process is interstitial-
not interact, are called a separated Frankel pair, vacancy recom bina tion, called annihilation. The
and migrate in a random path upon becoming defects can also disappear at sinks such as sur-
mobile. faces, grain boundaries and dislocations, or
For primary knock-on energies above but not clusters of a given type may form. These clus-
too much greater than threshold, the interstitial ters can transform into dislocation loops, chang-
is formed at a distance from the knock-on site ing the effective sink concentration by orders
by a replacement chain process, which occurs of magnitude. The migrating defects can inter-
most readily near high symmetry directions. act with alloying elements leading to solute
The primary knock-on hits a neighbor almost segregation. The net effect of these, and many
head on, and, because they have the same mass, additional effects, is a continuous evolution of
transfers most of its energy to that atom. The the microstructure of a solid under irradiation.
primary knock-on atom then remains in that The initial defects created in a solid are too
neighboring position while the atom which has small to be observed directly, although strides
been there hits one of its neighbors, etc. While have been made with field ion microscopy (see
a highly energetic particle may have a long path FIELD EMISSION). Electrical resistivity is prob-
in a solid, the majority of defects created in a ably the most standard experimental technique
cascade occur through this mechanism, often used to study the amount of disorder in a solid.
yielding a depleted zone surrounded by a shell A problem is that resistivity, and most other
rich in interstitials. This configuration, called a procedures, are not specific: vacancies and in-
displacement spike, is raised to a very high tem- terstitials have about the same electron scatter-
perature for a short duration of time because so ing strength, and, for example, a divacancy has
much energy is deposited in a small region. The about twice the scattering strength of a single
hot spot only lasts for perhaps 100 lattice vibra- vacancy. Sophisticated experimental methods
tions before cooling below the melting temper- such as positron annihilation spectroscopy in
ature. Subthreshold collision chains similar to which different defects give different signals,
replacement chains are considered to be the pri- are not without their own ambiguities.
mary means by which energy is rapidly removed The most significant practical effects in
from a displacement spike event. Experience metals are related to mechanical properties. For
with billiard balls would indicate that replace- example, an increase in the dislocation content
ment chains or subthreshold collision chains produces a loss of ductility, dislocation climb
would rapidly defocus, but the combined ef- leads to high-temperature creep, clustering of
IRRADIATION, DISPLACED ATOMS 614

vacancies into voids yields bulk swelling, and cussions is necessary in order to illustrate how
segregation of impurities at grain boundaries the problem of irreversibility is related to our
can decrease fracture strength. The creation of general view of the world. We shall be con-
damage in semiconductors (see SEMICONDUC- cerned here mainly with the qualitative aspects
TORS) is similar to that in metals, but here the of irreversibility as distinct from quantitative
electronic effects are most significant. The dis- theories of particular processes (see, e.g., the
order gives rise to additional electronic states articles on DIFFUSION, HEAT TRANSFER,
in the band gap which can act as donors, ac- THERMOELECTRICITY, and VISCOSITY).
ceptors, or trapping sites, which in turn can In the seventeenth century, the "clockwork
completely alter the sensitive electronic prop- universe" theory was popularized by the French
erties built into a particular device. philosopher Rene Descartes and the British sci-
entist Robert Boyle. According to Descartes
R. A. JOHNSON and Boyle, the physical world is like a perfect
machine which, once created and set in motion
References by God , can run forever without any further
need for divine intervention. This mechanistic
1. Dienes, G. 1., and Vineyard, G. H., "Radiation Ef- view of nature dominated much scientific work
fects in Solids," New York, Interscience Publishers (and conditioned many of the influences of
1957. physics on biology, psychology, philosophy,
2. Billington, D. S. (Ed.), "Radiation Damage in and political thought) up to the nineteenth cen-
Solids," New York, Academic Press, 1962. tury . Since its most striking successes were at-
3. Peterson, N. 1., and Harkness, S. D. (Eds.), "Radia- tained in the Newtonian theory of the solar sys-
tion Damage in Metals," Metals Park, OH, Amer- tem, it has often been attributed to Newton in
ican Society for Metals, 1976. such phrases as "the Newtonian world-ma-
4. J. Gittus, "Irradiation Effects in Crystalline Solids," chine." But Newton himself rejected it, point-
London, Applied Science Publishers, 1978. ing to the importance of irreversibility in phe-
5. Peterson, N. 1., and Siegel, R. W. (Eds.), "Prop- nomena such as the viscosity of fluids and the
erties of Atomic Defects in Metals," New York, imperfect elasticity of solids; in his Opticks he
North Holland Publishing, 1978. stated that "motion is much more apt to be
lost than got, and is always upon the decay."
Cross-references: COLOR CENTERS; COMPTON EF- He also thought that mutual gravitational per-
FECT; CRYSTALLOGRAPHY; FIELD EMISSION; turbations of planets in the solar system would
LATTICE DEFECTS; NUCLEAR RADIATION; accumulate over long periods of time, produc-
PHOTOELECTRICITY; POSITRON ; RADIATION ing instabilities that could be corrected only by
CHEMISTRY; RADIATION, IONIZING, BASIC IN- divine intervention.
TERACTIONS; RADIATION, THERMAL; SEMI- In 1715 the German philosopher G. W. Leib-
CONDUCTORS. niz attacked Newton's opinion, arguing that the
suggestion that "God almighty needs to wind
IRREVERSIBILITY up his watch from time to time; otherwise it
would cease to move" (as Leibniz put it) was a
Physical systems commonly display a tendency slur on God's ability to make a perfect world-
to change spontaneously from one state to an- machine. Newton retorted that the clockwork
other, but not to change in the opposite direc- metaphor was too materialistic, and was likely
tion. Examples are the tendency of heat to pass to encourage the notion that God has no active
from regions of high temperature to regions of role in the world at all. Newton's suspicions
low temperature, the tendency of mechanical were well-founded; later improvements on his
or electrical energy to be transformed into heat calculations of gravitational perturbations indi-
by friction or resistance, and the mixing or dif- cated that a solar system governed only by
fusion of different substances. While irreversi- Newton's laws is stable , so that all changes in
bility appears to be an obvious feature of planetary orbits merely oscillate between fixed
macroscopic natural phenomena, so much so limits. Thus when Napoleon asked the French
that violations of this tendency are scarcely mathematician P. S. de Laplace why he had not
conceivable, it is not yet established whether it mentioned God in his treatise on celestial me-
should be considered a general law applicable chanics, Laplace could reply: "Sir, I have no
on both the atomic and the cosmological scales. need of that hypothesis."
Most physicists accept the "statistical" explana- When Newton asserted that motion tends to
tion of irreversibility, according to which com- be lost in processes such as friction or inelastic
plex systems with many degrees of freedom collisions, he was refusing to accept the sugges-
tend to spread out among more and more di- tion (already well-known at that time) that it is
verse states in the "phase space" of possible simply transformed into invisible molecular mo-
configurations (see STATISTICAL MECHANICS). tion, perceptible only as heat. The modern con-
Scientific discussions of this question go back cept of irreversibility is different from Newton's
to the time of Newton and continue up to the because it involves a dissipation but not a
present day. A brief review of these earlier dis- destruction of energy. One might therefore ex-
615 IRREVERSIBILITY

pect that the modern theory of irreversibility heat conduction (whose development was moti-
could be established only after the law of con- vated in part by the problem of terrestrial tem-
servation of energy had been accepted (around peratures) was the first major physical theory in
1850); in a sense this is true, but it is somewhat modern times to incorporate irreversibility in
misleading historically, as we shall see. its basic postulates.
In 1852 the British physicist William Thom- With the advent of thermodynamics in the
son (later known as Lord Kelvin) published a 185 Os, the concept of irreversibility could be
short paper, "On a Universal Tendency in Na- extended to processes involving transformations
ture to the Dissipation of Mechanical Energy." among different forms of energy . Thomson's
The immediate stimulus for this pronounce- use of the word "dissipation" introduced a
ment was Thomson's reflections on the conse- moral connotation based on the observation
quences of Carnot's theory of steam engines, that whenever heat flows through a finite tem-
which he and the German physicist Rudolf perature difference a certain amount of me-
Clausius had recently used as a basis for the chanical work could be obtained by appropriate
second law of thermodynamics. In formulating use of a heat engine; the maximum work is ob-
the second law, Clausius and Thomson argued tained when each temperature difference in the
that it must be impossible to transfer heat from cycle of operations is made infinitesimally small
a cold body to a warm body without some kind so that the direction of heat flow could be re-
of compensating process equivalent to the versed by a slight alteration of the temperatures
transfer of at least the same amount of heat in (see CARNOT CYCLES AND CARNOT ENGINES).
the opposite direction; or, as Thomson phrased Whenever that is not done, heat flows irrevers-
it, it is impossible to obtain mechanical work ibly and the chance of doing mechanical work
by cooling a body below the temperature of its has been missed, hence the energy involved has
surroundings. But these "impossibility" or "im- been "dissipated."
potency" statements of the second law were In 1854, Clausius formulated his thermody-
closely associated with the conviction that the namic theory in terms of the "equivalence-value
forbidden processes were forbidden because of a transformation," defined as the amount of
they contravened a natural tendency for heat to heat transferred at a certain temperature di-
flow from high temperatures to low. Thus, in vided by that temperature. For a cyclic process
the development of theories of energy dissipa- the total equivalence-value would be deter-
tion, the simple flow of heat from hot to cold mined by an integral over the path of this ratio,
was the first irreversible process to be recog-
nized; other processes were subsequently said
to be irreversible, in part because they were
equivalent or analogous to the equalization of
temperature differences.
Although Thomson's explicit statement of For a reversible process the value of this integral
the principle of dissipation of energy occurred would be zero, according to Clausius's state-
in a thermodynamic context involving transfor- ment of the second law; whereas for an irrevers-
mations of heat and mechanical work, it is evi- ible process it would be positive. Since this par-
dent from his own writings that he was equally ticular mathematical expression proved to be
concerned with an irreversible process that in- useful in developing his theory, Clausius finally
volved no change in the total quantity of heat: gave it the name entropy in 1865; more pre-
namely, the cooling of the earth from a hypo- cisely, the change in entropy in a process is
thetical initial molten state to its present state defined as
(moderate surface temperature with residual in-
ternal heat), ultimately reaching the desolate dS = dQ/T.
cold of interplanetary space. This was the pre-
dicted "heat death" of the earth (and perhaps Thus entropy provided an indicator of irrevers-
of the entire universe) that attracted the atten- ibility, and Clausius could state the second law
tion of popular science writers in the late nine- in the generalized form, "the entropy of the
teenth century, and was publicized further by world tends toward a maximum." It is in this
the British astrophysicists J. H. Jeans and A. S. form, or simply the phrase "entropy tends to
Eddington in the 1920s. But Thomson was not increase," that the principle of irreversibility or
the first to discuss the cooling of the earth; in- principle of dissipation of energy is now ordi-
deed, this had been a favorite topic of debate narily stated. Eddington asserted that this prin-
among scientists in the eighteenth and early ciple "holds the supreme position among the
nineteenth centuries, and provided the occasion laws of Nature" because it determines the direc-
for several assertions that there is a natural tion of time itself-"time's arrow."
tendency for heat to flow from hot to cold. Although one could simply accept the prin-
Even the threat of a heat death as a fate com- ciple of irreversible entropy increase as a funda-
mon to all bodies in the universe had been mental law of physics without further explana-
hinted as early as 1777 by the French astrono- tion (as is sometimes done in texts on
mer Jean-Sylvain Bailly. Fourier's theory of thermodynamics), there is widespread senti-
IRREVERSIBILITY 616

ment among scientists that it must be possible tion could be used to derive irreversibility. Yet
to reduce the law to a more basic postulate it was not until the advent of quantum me-
about atomic behavior (or about the universe as chanics in the 1920s that most scientists were
a whole). This reduction seemed an urgent ready to accept the idea that atomic behavior
necessity for those physicists in the nineteenth may involve an inherent element of randomness
century who believed that all properties of mat- or indeterminacy. While the late-nineteenth-
ter and energy were ultimately explicable by century debates on irreversibility thus fore-
mechanical models, using Newton's laws of mo- shadowed the breakdown of determinism in
tion. Since the kinetic theory of gases was being physics, it has not yet been established that
developed during the same period when en- irreversibility should be attributed to atomic
tropy was being introduced into thermodynam- randomness.
ics, that theory offered a natural starting point. The statistical theory developed by Maxwell,
The following conclusions were reached as a Thomson, and Boltzmann achieved a substantial
result of analyses by Thomson, James Clerk clarification by describing irreversible processes
Maxwell, and Ludwig Boltzmann (see KINETIC in terms of the mixing of atoms with different
THEORY): properties; the flow of heat, or the transforma-
(l) It is reasonable to assume that the prin- tion of mechanical energy into heat, can be
ciple of irreversibility is not an absolute law of seen as special cases in which the relevant
nature and that all processes involving individual atomic property is velocity. Thus entropy in-
atoms are perfectly reversible, since Newton's creases whenever one goes from an ordered
second law of motion is unchanged when - t is state to a disordered state, and entropy itself
substituted for t, and there is no direct evidenc.! can be regarded as a measure of disorder. That
that interatomic forces are dissipative. In that terminology also suggests that entropy is a mea-
case irreversibility can appear only when large sure of the amount of information (or rather
numbers of atoms are involved, and is connected lack of information) which we have about the
with the fact that observable macroscopic states system (see ENTROPY). An extreme interpreta-
(specified by parameters such as temperature and tion based on this idea is that irreversibility is
pressure) of a system correspond in general to not an inherent property of physical systems
enormously large collections of microscopic but is merely a necessary feature of any human
states, the latter being specified by values of the description of them.
positions and velocities of all the atoms. In a There have been numerous attempts to con-
typical irreversible process, a system passes struct or imagine devices that would permit a
from macroscopic state A corresponding to a violation of the second law of thermodynamics,
small number of microscopic states, to another i.e., to accomplish "perpetual motion of the
macroscopic state B corresponding to a much second kind" by arranging for heat to flow
larger number of microscopic states. The pro- from cold to hot regions, or by extracting me-
cess could be reversed only if there were some chanical work from an isothermal heat reser-
way to pick out the special microscopic states voir. Brownian movement has sometimes been
in B that had evolved from microscopic states considered an example of the latter kind. Never-
in A, and then to reverse the directions of all theless there are no generally accepted instances
atomic velocities. of violations of irreversibility. On the other
(2) In this view irreversibility is a statistical hand, scientists hesitate to accept the view of
property, which appears to be a general law of Clausius that irreversible entropy increase leads
nature only because the probability of reversal to a final state in which "no further change
(e.g., the probability that entropy will spon- could evermore take place, and the universe
taneously decrease) is extremely small. There would be in a state of unchanging death."
is some room for philosophical or methodologi- In addition to the statistical explanation of
cal debate as to whether a statement that is al- irreversibility, there have been attempts to dis-
ways true in practice but may be wrong once in cover violations of "time-reversal invariance" in
a billion years "in principle" (according to a the interactions of elementary particles (see
theory of atomic behavior) should be consid- PARITY) and to associate the "direction of
ered a law of nature. time" with the expansion of the universe (see
(3) The use of statistical terminology in the article by Gal-Or cited below). The persis-
explaining the principle of irreversibility never- tent philosophical interest in the nature of time
theless suggested that this principle may depend ensures that debates about irreversibility will
on randomness at the atomic level. In 1894, continue to enliven discussions of the founda-
S. H. Burbury pointed out that the proof of tions of physics for many years to come.
Boltzmann's "H theorem" involved the assump- It is sometimes suggested that there is a con-
tion that two colliding molecules are uncor- flict between biological evolution and the prin-
related after as well as before they collide, and ciple of irreversibility, since the former involves
argued that this assumption could be justified the transformation of simple forms of life into
if the system were subjected to continuous more complex ones while the latter involves the
random external disturbance. Boltzmann, and transformation from order to disorder. How-
later Max Planck, agreed that such an assump- ever, there is no conflict if it is recognized that
617 ISOSPIN

the principle predicts an increase of entropy Fraser, J. T., Haber, F. C. and Milller, G. H. (Eds.),
only in a closed system taken as a whole, and "The Study of Time," New York, Springer-Verlag,
that the statistical nature of irreversibility leads 1972.
one to expect local decreases in entropy which Gal-Or, B., "The Crisis about the Origin of Irrevers-
are compensated by larger increases elsewhere. ibility and Time Anisotropy," Science 176, 11
In the case of biological evolution the system (1972).
must include the sun; the entropy increase as- Hinds, E. A., "Parity and Time-Reversal in Atoms,"
sociated with the transfer of energy from the American Scientist 69, 430 (1981).
sun to the earth (dS = dQ/Tsun - dQ/Tearth) is Kubrin, D., "Newton and the Cyclical Cosmos: Prov-
enormous compared to the decrease in biolog- idence and the Mechanical Philosophy," Journal of
ical processes that utilize a small part of this the History of Ideas 28,325 (1967).
energy. Layzer, D., "The Arrow of Time," Scientific American
As Boltzmann himself pointed out in a lec- 233(6),56 (December 1975).
ture in 1904, Darwin's theory of natural selec- Misra, B., and Prigogine, I., "Time, Probability, and
tion can be derived qualitatively from statistical Dynamics," in "Long-Time Prediction in Dynamics"
thermodynamics. The equilibrium state of a (C. W. Horton et al., Eds.), New York, Wiley, 1982.
molecular system is not simply the one of great- Park, David, "The Image of Eternity: Roots of Time
est entropy or disorder, but involves a balance in the Physical World," Amherst, Mass., Univ. Massa-
between disorder and the forces that favor an chusetts Press, 1980.
ordered state; thus at sufficiently low tempera- Prigogine, I., Nicolis, G., and Babloyantz, A., "Ther-
tures water molecules will spontaneously form modynamics of Evolution," Physics Today 25 (11),
complex crystals. Such an ordered state, ini- 23 (Nov. 1972),25 (12),38 (Dec. 1972).
tially formed by chance, will be "selected" to Prigogine, I., "Time, Structure, and Fluctuations,"
survive under certain conditions. The chemical Science 201, 777 (1978).
reactions that occur in living systems are quite Reichenbach, H., "The Direction of Time," Berkeley,
consistent with thermodynamics, as Harold Univ. California Press, 1956.
Blum showed in his classic monograph "Time's
Arrow and Evolution (Princeton University Cross-references: CARNOT CYCLES AND CARNOT
Press, 1951). ENGINES, DIFFUSION, ENTROPY. HEAT TRANS-
During the past two decades several scientists FER. KINETIC THEORY. PARITY. STATISTICAL
have developed more detailed theories of the ir- MECHANICS, THERMODYNAMICS, THERMO-
reversible processes in biological systems. One ELECTRICITY. VISCOSITY.
of the best known is that of Ilya Prigogine and
his collaborators. Their theory shows how a sys-
tem driven far away from equilibrium may ac-
quire a new kind of order, just as a fluid heated ISOSPIN
from below will develop a regular pattern of Isospin (also called isobaric spin and isotopic
convection cells. Prigogine received the 1977 spin) is an attribute ascribed to particles in a
Nobel Prize in Chemistry for his theory of "dis- mathematical formalism introduced originally
sipative structures," including its application to to simplify calculations involving the interac-
biology. tions among protons and neutrons through
STEPHEN G. BRUSH nuclear forces. In the isospin formalism, the
neutron and proton are treated as though they
References
were two different quantum-mechanical states
of the same entity, the nucleon. A neutron is
Brush, S. G., "Kinetic Theory," Vol. 2, "Irreversible distinguished from a proton by the value of its
processes," New York, Pergamon Press, 1966. (In- isospin proiection quantum number, tz: the
cludes reprints and translations of papers by Max- value tz = l is arbitrarily assigned to the neu-
well, Boltzmann, Thomson, Poincare, and Zermelo.) tron and tz = - ! to the proton. (This conven-
Brush, S. G., "The Development of the Kinetic Theory tion is common in nuclear structure literature,
of Gases. VIII. Randomness and Irreversibility," but the opposite assignments for tz are found in
Archive for History of Exact Sciences 12,1 (1974), particle physics literature.)
reprinted in "The Kind of Motion We Call Heat: A The idea of using such a formalism for treat-
History of the Kinetic Theory of Gases in the 19th ing neutrons and protons was first suggested by
Century," Amsterdam, North-Holland, 1976, Chap- Heisenberg, l who used the Pauli spin matrices
ter 14. to represent the attribute that we call isospin.
Brush, S. G., '-'Irreversibility and Indeterminism: Wigner 2 later named the concept isotopic
Fourier to Heisenberg," Journal of the History of spin. Subsequent authors pointed out that the
Ideas 37, 603 (1976), reprinted in "Statistical Phys- attribute, when extended to nuclei, is relevant
ics and the Atomic Theory of Matter from Boyle not to isotopes but to nuclei with the same
and Newton to Landau and Onsager," Princeton, mass, hence the name isobaric spin replaced
N.J., Princeton Univ. Press, 1983, Chapter 2. isotopic spin. In recent literature the name has
Eddington, A. S., "The Nature of the Physical World," been shortened to isospin. In spite of the name,
London, Cambridge Univ. Press, 1928, Chapter IV. the concept has nothing to do with mechanical
ISOSPIN 618

angular momentum. (See MATHEMATICAL <Rt2 R >. If R is normalized so that <R IR >= I
PRINCIPLES OF QUANTUM MECHANICS for a and is an eigenfunction of t 2 , we find that
discussion of quantum-mechanical angular
momentum.)
The mathematical apparatus of the isospin
formalism belongs to the general subject of where t may be thought of as the length of the
group theory and is identical to that used for vector. The values that t may have are integers
treating electron spin in atomic physics (see for bosons and half-integers for fermions.
ELECTRON SPIN). Four quantum-mechanical The isospin representations are members of
operators, t z , t+ . t-, and t 2 , constitute the basic a rotational group in isospace. Starting with a
elements of the formalism. The number tz that vector IR > one can generate new orthogonal
distinguishes neutrons from protons is the eigen- vectors in isospace through the application of
value of the operator t z , analogous to the the raising and lowering operators t+ and Co
operator Sz that gives the projection of the The newly created vectors are characterized
electron spin on the quantum axis. The opera- by the same eigenvalue of t 2 as the original vec-
tor t+ operating on the representation of a pro- tor but different eigenvalues of t z . The opera-
ton changes it into the representation of a tions may be pictured as rotations of the vec-
neutron and C changes the representation of a tor in isospace, and the set of vectors created
neutron into that of a proton. The effects in this way are the possible orientations of the
of these operators can be written symbolically vector IR > in isospace that can be generated
as through rotation alone. There are 2t + 1 pos-
sible orientations. We say that IR)is a member
tzR(p) =! R(p); tzR(n) =! R(n) of an isospin multiplet of multiplicity 2 t + 1.
t+R(p)=R(n); t-R(n)=R(p) The concept of isospin is especially useful
whenever one wants to describe a set of particles
t+R(n)=O; CR(p)=O. differing in electrical charge but similar in other
attributes. For example, positive, neutral, and
The letter R is used to indicate a representation negative pi mesons (or pions) may be described
of the entity specified in the parentheses. The as an isospin triplet. To achieve multiplicity
representation can be a matrix, a wave function, three, t must be 1 and the particle must be a
a diagram, or any other representation ap- boson.
propriate to the method being employed in the At the time Heisenberg introduced this method
calculation. The operator t 2 is related to the of manipulating neutron and proton representa-
other operators by the equation t 2 = !(t+ c + tions, it might well have looked like a contrived
C t+) + t z 2 • (Note that t+ and t- do not com- device of questionable utility. However, later
mute. Therefore t+ C does not equal C t+.) in the 1930s, as evidence began to mount that
For the case of the nucleon, where we have set the forces between nucleons were the same
up the defining equations to include only two whether they were protons or neutrons, except
states, t2 has the eigenvalue i. The formalism for the additional electrical repulsion between
may be applied also to other kinds of particles, protons, the concept of isospin gained in ap-
and the generalized operator t 2 has eigenvalues parent utility. Eventually, the formalism evolved
of the form t(t + I), where t is either a half- into something that could be applied to aggre-
integral or an integral number, and 2 t + 1 is the gate systems in which the constituents possessed
number of possible states. The value of t is the attribute of isospin, for example, to nuclei.
called the isospin of the particle. Thus, it may To gain insight into the physics underlying
be said that the nucleon is a particle of isospin the concept of isospin one might note that part
!. From a general point of view, the relation- of the evidence for the charge independence of
ships stated above may be viewed in terms of a the nuclear force came from a study of the
vector operator t in a fictitious three-dimen- energy-level structure for mirror nuclei. Mirror
sional space where nuclei are related to each other by an inter-
change of their neutron and proton numbers.
t =tx + ty + tz For example, the nuclei 7 Li (3 protons and 4
t+ = tx + ity neutrons) and 7 Be (4 protons and 3 neutrons)
are a mirror pair and their energy levels ac-
C = tx - it y . curately match each other except for an overall
displacement. This correspondence suggests that
This space, which we call isospace, is mathe- it might be useful to view 7 Li and 7 Be as
matically analogous to ordinary coordinate though one were related to the other as the
space but represents attributes of electrical neutron is related to the proton. The concept
charge rather than ordinary positions and of total isospin in a many-nucleon system is
lengths. If we represent an object by a vector defined in a way that is consistent with the
IR> in isospace, we can determine the length of single-nucleon formalism and gives meaning to
the vector by forming the scalar product statements of the type T+RC' Be) =RC' Li). The
<RtltR>, which is equivalent in meaning to concept of total isospin together with a struc-
619 ISOSPIN

ture model can provide the relationship between would be strong enough to destroy the appro xi-
the isospin of the nucleus and the arrangement mate symmetry between the proton and neu-
of the constituent nucleons in the nucleus. tron states and isospin would not be a useful
In the nuclear shell model, one pictures the' concept.
nucleus as though it contains a set of single- The discovery of isobaric analog states3 gave
particle quantum states that result from the isospin a new importance in nuclear structure
sum of the interactions among the nucleons. physics. Analog states are energy levels of one
The different energy levels of the nucleus are nucleus that have a special relationship to
then pictured as different combinations of energy levels in the neighboring nucleus with
occupancy by the nucleons of the available the same number of nucleons. If we compare
single-particle states. The similarity of level the energy levels of a nucleus A(N, Z) with N
structure for mirror nuclei suggests that similar neutrons and Z protons, to the energy levels of
sets of single-particle states exist for protons a nucleus B(N - 1, Z + 1), we find in B a set of
and neutrons. levels at high excitation energies that match
In an isospin formalism each sin,gle-particle one-for-one the lowest few levels of A with
state occurs twice, once with tz = ~ and once respect to energy spacing and other properties.
with tz = - !, and the states are occupied by In terms of isospin, analog states can be under-
nucleons. We can define a set of total isospin stood as states that differ only in their isospin
operators for the aggregate system as follows: projection T z . That is, all quantum numbers
The total isospin projection operator Tz is the except Tz of the state in nucleus A are the same
sum of the individual nucleon isospin projec- as those of its analog in B. The ground state of
tion operators, T z = kjt z . To find its eigen- a stable nucleus with more neutrons than pro-
value we need only to sum the isospin projec- tons is always characterized by T = Tz = (N -
tion quantum numbers for all the occupied Z)/2. Its analog is characterized by the same
states. We find that the eigenvalue of Tz is Tz = value of T, but Tz is one unit lower, i.e., Tz =
!(N - Z), where N is the number of neutrons T - 1. That state may in turn have an analog in
and Z the number of protons in the nucleus. its neighboring isobar with Tz = T - 2, and so on
We can also define total operators correspond- until Tz = - T. States in mirror nuclei are simply
ing to the other single-nucleon operators that analog states with Tz (for nucleus A) = - Tz (for
we discussed previously: nucleus B).
T+ ="t+; T- ="c; The binding energy of an analog state differs
L.. L.. from that of its parent state, and the energy
T2 = t(T+T- + T-T+) + Tz2. difference is called the Coulomb displacement
energy. The approximate value of this energy
The operator T+ when applied to a represent a- can be calculated in a rather simple way, if
tion of a many-nucleon system changes it to a one assumes that the nuclear volume is filled
representation with Tz + 1, if Tz is the original with a uniform charge density. Since both
value. Similarly T- changes the representation parent and analog nuclei have the same number
to a new one with Tz - 1. (Certain numerical of nucleons, the volumes are the same. For the
factors have been omitted for simplicity. Refer parent nucleus, the fraction Z/(N + Z) of the
to a textbook on quantum mechanics for a nucleons are charged, while for the analog,
more complete treatment of raising and lower- (Z + 1 )/(N + Z) of the nucleons are charged.
ing operators.) The operator T2 has eigenvalues The value of the Coulomb displacement energy
of the form T(T + 1), where T is a number that is approximately the difference in the electro-
is referred to as the total isospin of the system. static energy for spherical volumes of charge
(For a definition cif eigenvalue, see MATHE· densities implied by those two charge fractions.
MATICAL PRINCIPLES OF QUANTUM ME- This has the value of about 12 MeV for nuclei
CHANICS). The value of T is always greater than of about 100 atomic mass units, for example.
or equal to the absolute value of Tz and is an In spite of large differences in absolute binding
integer for nuclei with an even number of energies, the analog level spacings match those
nucleons and a half-integer for nuclei with an of the parent accurately for nuclei of all masses
odd number of nucleons. An arbitrary represen- that have been studied. It thus appears that the
tation of a many-nucleon system is not neces- similarity between neutron and proton single-
sarily an eigenfunction of T2, but it can be particle states persists even when the Coulomb
reexpressed as a linear combination of repre- force produces a large change in the absolute
sentations that are eigenfunctions of T2 . binding energies of the states. The effect of
Before the 1960s the concept of total isospin the coulomb force is to introduce an appro xi-
was used in formal classifications of shell model mat ely uniform shift in the energy levels of
states, but it was not generally believed that the nuclei differing only in Tz .
simple eigenfunctions of T2 would provide Isospin considerations can be a guide to cer-
useful descriptions of real nuclear states except tain features of nuclear reactions. For example,
with respect to mirror nuclei and very light a rule. of conservation of isospin may be formu-
nuclei. It was thought that in a nucleus with a lated by analogy with the principle of conserva-
large number of protons the Coulomb force tion of angular momentum. The rule states that
ISOSPIN 620

if the initial system-the projectile plus the of a substance by observing the type of radia-
target-can be characterized by total isospin tion and the half-life. Indeed, additional evi-
quantum numbers T and T z , then the final dence came from the observation of the visible
system will also be characterized by the same spectrum of a mixture of two of these sub-
values of T and T z . The conservation of Tz is stances and finding no new lines. Further,
equivalent to the statement that the total num- Boltwood, in his studies, noted the presence of
ber of protons and the total number of neutrons lead in uranium minerals and suggested that
does not change in a nuclear reaction. The con- lead might be the end product of the uranium
servation of T implies restrictions on the rear- series. As a result of the study of the relation
rangements of the nucleons within the nuclei of lead to uranium in a large number of miner-
but is not a very strict rule, since it is based on als, this view was generally adopted. Soddy con-
neglect of the Coulomb force between protons. cluded that the end products of the uranium
It has been shown, however, that processes that and thorium series should be lead with isotopic
violate the rule are severely inhibited compared weights 206 and 208, respectively, whereas
to processes that follow the rule. Some broader ordinary lead as found in nature has an atomic
implications of isospin conservation have been weight of 207.2. Soddy and Hyman reported
pointed out by Adair.4 (1914) that the atomic weight of lead as found
in thorite, which consists mainly of thorium
C.D.GOODMAN with 1-2% uranium and 0.4% lead, indeed had
a slightly greater atomic weight than that of
References ordinary lead. It was not, however, until the
1. Heisenberg, W., Z. Physik 77 ,1-11 (1932). development of the mass spectrograph (1919)
2. Wigner, E.,Phys. Rev. 51,106-119 (1937). that a detailed study of the occurrence and rela-
3. Anderson, J. D., Wong, C., and McQure, J. W., tive abundance of such species could be accu-
Phys. Rev. 126,2170-2173 (1962). ately made and it was not until the discovery
4. Adair, R. K.,Phys. Rev. 87,1041-1043 (1952). of the neutron (1932) that a satisfactory ex-
planation could be given.
For additional reading see: "Isospin in Nuclear Physics" Of the 81 naturally occurring stable elements,
(Wilkinson, D. H., Ed.), Amsterdam, North-Holland 22 elements are anisotopic, i.e., possess one
Publishing Co., 1969. stable isotope. The largest number of stable
isotopes occurs in tin, which has ten. Since the
Cross-references: ELECTRON SPIN, ELEMENTARY atoms in multinuclidic elements are chemically
PARTICLES, MANY-BODY PROBLEM, MATHE- identical and have the same number of protons
MATICAL PRINCIPLES OF QUANTUM MECHAN- in the nucleus, the varying masses are accounted
ICS, NEUTRON, NUCLEAR STRUCTURE, PRO- for by the variable number of neutrons in the
TON, QUANTUM THEORY, STRONG INTER- nucleus (Z = 50,A = 112, 114, 115,116,117,
ACTIONS. 118, 119, 120, 122, 124). Two of the elements
below bismuth (Z = 83), technetium (Z = 43)
and promethium (Z = 61), do not have any
stable nuclides. Table 1 lists, in order of increas-
ISOTOPES ing atomic number Z (the charge or number
The word isotopes, stemming from the Greek of protons associated with the nucleus) the 106
word isos (same) and topos (place), refers to known elements, and where available, the rela-
atoms of the same element which have differ- tive atomic weights. It should be noted that the
ing masses. The term isotope was first proposed conflict regarding the Soviet and United States
by Soddy (1913) to designate substances having claims to a discovery of elements 104 and 105
different atomic weights and, at the same time, continues. The Berkeley workers have 'proposed
having chemical properties which were so closely the names of Rutherfordium for Z = 104 and
allied as to make chemical methods of pro- Hahnium for Z = 105, while the Soviet workers
ducing a separation ineffective. Hence, Soddy have proposed Kurchatovium and Nielsbohrium.
suggested that they were chemically identical, The International Union of Pure and Applied
i.e., they occupied the same place in the periodic Chemistry has proposed Unnilquadium (Unq,
table. The term nuclide is, perhaps, a more Z = 104), Unnilpentium (Unp, Z = 105), and
accurate representation, since it is defined as Unnilhexium (Unh, Z = 106). Popular usage,
a particular atomic species with an atomic however, involves the direct use of Z = 104,
number Z and a mass number A. lOS, and 106.
In 1905, Boltwood, studying the decay chains Isotopes are divided into two groups: stable
of uranium, identified a new element which he and radioactive (unstable). The total known
called ionium and which, when admixed with isotopes number about 1950, of which 280 are
thorium, could not be separated by any of the stable and the balance are radioactive, having
known chemical processes. Similar behavior a transient existence ranging from millionths of
was observed by Soddy and others in other a second to millions of years. Radioisotopes
such mixtures. These studies depended on the undergo transformation, or decay, emitting
detection and identification of minute amounts alpha, beta, gamma, or x-radiations during their
621 ISOTOPES

TABLE 1. TABLE OF MEAN RELATIVE ATOMIC WEIGHTS (1979)


BASED ON THE ATOMIC MASS OF 12C = 12.
(In Order of Atomic Number.)

Atomic Atomic Atomic Atomic


Number Name Symbol Weight Number Name Symbol Weight

1 Hydrogen H 1.0079 55 Cesium Cs 132.9054


2 Helium He 4.00260 56 Barium Ba 137.33
3 Lithium Li 6.941 * 57 Lanthanum La 138.9055*
4 Beryllium Be 9.01218 58 Cerium Ce 140.12
5 Boron B 10.81 59 Praseodymium Pr 140.9077
6 Carbon C 12.011 60 Neodymium Nd 144.24*
7 Nitrogen N 14.0067 61 Promethium Pm (145)
8 Oxygen 0 15.9994* 62 Samarium Sm 150.36*
9 Fluorine F 18.998403 63 Europium Eu 151.96
10 Neon Ne 20.179 64 Gadolinium Gd 157.25*
11 Sodium Na 22.98977 65 Terbium Tb 158.9254
12 Magnesium Mg 24.305 66 Dysprosium Dy 162.50*
13 Aluminum Al 26.98154 67 Holmium Ho 164.9304
14 Silicon Si 28.0855* 68 Erbium Er 167.26*
15 Phosphorus P 30.97376 69 Thulium Tm 168.9342
16 Sulfur S 32.06 70 Ytterbium Yb 173.04*
17 Chlorine CI 35 .453 71 Lutetium Lu 174.967*
18 Argon Ar 39.948 72 Hafnium Hf 178.49*
19 Potassium K 39.0983 73 Tantalum Ta 180.9479
20 Calcium Ca 40.08 74 Tungsten W 183.85*
21 Scandium Sc 44.9559 (Wolfram)
22 Titanium Ti 47.88* 75 Rhenium Re 186.207
23 Vanadium V 50.9415 76 Osmium Os 190.2
24 Chromium Cr 51.996 77 Iridium Ir 192.22*
25 Manganese Mn 54.9380 78 Platinum Pt 195.08*
26 Iron Fe 55 .847* 79 Gold Au 196.9665
27 Cobalt Co 58.9332 80 Mercury Hg 200.59*
28 Nickel Ni 58.69 81 Thallium 11 204.383
29 Copper Cu 63.546* 82 Lead Pb 207.2
30 Zinc Zn 65.38 83 Bismuth Bi 208.9804
31 Gallium Ga 69.72 84 Polonium Po (209)
32 Germanium Ge 72.59* 85 Astatine At (210)
33 Arsenic As 74.9216 86 Radon Rn (222)
34 Selenium Se 78.96* 87 Francium Fr (223)
35 Bromine Br 79 .904 88 Radium Ra 226.0254
36 Krypton Kr 83 .80 89 Actinium Ac 227.0278
37 Rubidium Rb 85.4678* 90 Thorium Th 232.0381
38 Strontium Sr 87.62 91 Protactinium Pa 231.0359
39 Yttrium Y 88.9059 92 Uranium U 238.0289
40 Zirconium Zr 91.22 93 Neptunium Np 237.0482
41 Niobium Nb 92.9064 94 Plutonium Pu (244)
42 Molybdenum Mo 95 .94 95 Americium Am (243)
43 Technetium Tc (98) 96 Curium Cm (247)
44 Ruthenium Ru 101.07* 97 Berkellium Bk (247)
45 Rhodium Rh 102.9055 98 Californium Cf (251)
46 Palladium Pd 106.42 99 Einsteinium Es (252)
47 Silver Ag 107.868 100 Fermium Fm (257)
48 Cadmium Cd 112.41 101 Mendelevium Md (258)
49 Indium In 114.82 102 Nobelium No (259)
50 Tin Sn 118.69* 103 Lawrencium Lr (260)
51 Antimony Sb 121.75* 104 (Unnilquadium) (Unq) (261)
52 Tellurium Te 127.60* 105 (Unnilpentium) (Unp) (262)
53 Iodine I 126.9045 106 (Unnilhexium) (Unh) (263)
54 Xenon Xe 131.29*

Values are those recommended by the Commission on Atomic Weights and Isotopic Abundances, Inorganic
Chemistry Division, International Union of Pure and Applied Chemistry, in Pure and Applied Chemistry, 52, 2349-
2384 (1980).
Values are considered to be ± 1 in the last digit or ± 3 in the last digit when followed by an asterisk (*).
ISOTOPES 622

return to a stable condition. Elements beyond of magnesium, lithium, potassium, calcium, and
bismuth (Z = 83) are radioactive. Aclear graphic zinc.
presentation of isotopes, both stable and radio- Thus the pioneering work of Aston and
active, can be seen in the General Electric Knolls Dempster in 1918-19 with the electromagnetic
Atomic Power Laboratory "Chart of the mass spectrometer unequivocally demonstrated
Nuclides," now in its twelfth edition (revised the isotopic nature of the stable elements and
to April 1977), where the nuclides are displayed is the historical starting point for separation
by plotting nuclides of elements in increasing and study of the isotopes of the elements.
atomic number (Z) against the neutron number The electromagnetic separation of isotopes
(N). (See RADIOACTIVITY) . is relatively simple in principle (see MASS
Isotopes, both stable and radioactive, have SPECTROMETRY).
continued to grow in importance to science and Production of Stable Isotopes. There are a
technology during the past 50 years. Since number of possible ways of separating isotopes
atoms can be characterized or marked by their using electromagnetic principles. However, the
radioactivity, or in some cases by a change in large-scale electromagnetic mass separator,
isotopic composition, the elements can be known as a calutron , is the device now used
traced, a procedure of great value in physical predominantly. Within a tank maintained at
and biological science, technology, and medical high vacuum, ions of an element are produced
diagnosis . Further, radioisotopes emit particu- by vaporization at high temperature, some-
late and/or electromagnetic radiations that can times assisted by a chemical agent such as car-
be used to probe into and through matter, af- bon tetrachloride (chlorination). The ions are
fect it chemically and physically, produce heat accelerated by an electric potential and pro-
and light, kill or alter microorganisms , and per- jected as a beam across a magnetic field . The
form many other functions useful in today's trajectories of these ions are dependent on their
complex industrial/technological society. masses ; hence the path of the lighter ion has
Stable Isotopes By 1900, physicists had a greater curvature than that of the heavier
found that positively charged particles formed ion. After traversing a circular path of 180-
by the passage of an electric discharge through 300 0 , the divergent particle paths are inter-
an evacuated tube consisted of molecular ions rupted by catcher pockets, usually made of
of the gas present in the tube. Deflection of slots in graphite or copper water-cooleq "re-
these positive rays by electric and magnetic ceivers." The isotopes are then chemically re-
fields offered a sensitive tool to study gaseous covered from the receiver pockets.
elements . By allowing the rays from a given Only relatively small amounts of material
element to fall on a photographic plate, a series can be separated in the calutron, since it sepa-
of parabolic streaks were observed, each corre- rates the isotopes literally atom-by-atom. Never-
sponding to a definite value of mass·to-charge theless , ion currents up to one ampere can be
ratio (m/e). Positive ray photographs of neon maintained, allowing kilograms of material to
(atomic weight 20 .2) obtained by Thomson be separated in a machine operating over a
exhibited a heavy neon line at mass 20 and a year's time. Virtually all of the isotopes of the
faint line at 22. In an effort to elucidate the elements have been separated in relatively
situation, Thomson's assistant, Aston, passed high purity at Oak Ridge National Laboratory.
neon gas through a porous pipe-clay tube re- The details on separated isotopes available and
peatedly and was able to show a significant the procedures to be used in obtaining them are
alteration in the atomic weight of the two ex- given in the ORNL Research Materials Catalogs.
treme fractions. This alteration was reflected Large-scale gaseous diffusion separation of
in changes in the relative brightness of the two 235 U is accomplished by diffusing uranium hexa-
lines in subsequent positive ray analyses. fluoride (UF 6) gas through a series of several
Aston proceeded to redesign the positive ray thousand barriers. Because of the repetitive
apparatus so that the particles having the same nature of the process, continuous diffusion
mass were brought to a focus to produce a through thousands of stages, or cascades,
sharp line rather than a parabola; the resulting gaseous diffusion plants are extremely large
instrument was called a mass spectrograph. industrial facilities.
With this instrument, Aston was able to con- More recently, large high-speed centrifuges
firm the finding that neon exists in at least have been developed for the separation of 235 U.
two forms (atomic weights of 20 and 22) and Because the desired enrichment is not obtained
that the proportions appeared to be 10-: 1 , giv- in a single centrifuge, several machines are con-
ing an average atomic weight of 20.2 to neon. nected in a series or cascade. The gas centri-
Aston next analyzed chlorine and also found fuge enrichment technology is economically
that this gave two lines, corresponding to 35 superior to the gaseous diffusion enrichment
and 37, and in time examined mercury, nitro- process, requiring only 5% as much electrical
gen, and the noble gases. energy.
Independently Dempster, working at the Gaseous thermal diffusion is used to separate
University of Chicago, developed a mass spec- the noble gas isotopes (krypton, neon, and
trometer which he used to examine the isotopes argon). Liquid thermal diffusion is currently
623 ISOTOPES

used to separate the isotopes of chlorine, artificially radioactive isotopes were identified
bromine, and sulfur. by 1944, and over 1650 by 1977. Each element
Chemical exchange is used with such isotopes has at least one radioactive isotope, and some
as hydrogen and nitrogen. have as many as 30.
Cryogenic distillation is used to separate The discovery of the first transuranic element,
isotopes of carbon, oxygen, and nitrogen. neptunium (Z = 93) in 1940 was followed by
Electrolysis is used for deuterium separation the identification of the other members of the
and distillation and has been used for the en- actinide series (Z = 89-103), which are anal-
richment of mercury isotopes. ogous to the lanthanide series or rare earths
Laser isotope separation has been used for (Z = 57-71), thus completing the series. It is
235 U separation and offers potential for the expected that element 104 and element 105
separation of other ions. should demonstrate periodic characteristics of
Other than the well-known uses of 235 U and hafnium and tantalum, respectively. Indeed,
3H in large-scale nuclear work, separated iso- predictions on both the chemical and nuclear
topes have been in the past primarily for funda- basis have been made on the properties of
mental scientific work, such as the measurement other "superheavy" elements. For example,
of nuclear reaction cross sections. There is now, islands of stability of the elements are pre-
however, a growing utilization of isotopic ma- dicted around Z = 114, N = 184 and elements
terials in all fields of fundamental research and 117-120 lend themselves to reasonably detailed
as target materials for radioisotope production. predictions of their macroscopic properties.
Radioisotopes Some radioisotopes occur in Radioisotopes are produced by disturbing a
nature, e.g., uranium, radium, and thorium- preferred neutron-proton ratio in the nuclei
ordinarily accompanied by their radioactive of elements. This is done by adding or removing
daughters (decay products). Radioisotopes that neutrons, by adding or removing charged par-
occur in nature have half-lives* greater than ticles such as protons, or by a combination of
about 10 8 years or, are the decay products of both. Usually, a nuclear reactor is used as the
parent radioisotopes of such long-lived radio- source of neutrons; a cyclotron or other particle
isotopes. These primordial radioisotopes were accelerator is used as the source of charged par-
produced when the earth was formed and have ticles. The radionuclides formed by increasing
not yet decayed away in the ensuing several the neutron-proton ratio generally decay (or
billion years. Of the naturally occurring iso- transform) back to a stable configuration by
topes of the elements, roughly 280 are stable, having a neutron transform to a proton, with
and about 25 may be considered naturally the emission of a negative electron (beta
radioactive. Some shorter-lived radioisotopes, particles, ~-) and a neutrino (v)-an almost
such as 5530-year 14C and 12.33-year 3H are undetectable uncharged particle of negligible
normally formed by cosmic ray interactions mass (see NUCLEAR REACTIONS):
with atmospheric carbon and hydrogen. Irene
Curie observed and identified the first arti- n ~ p + ~- + v.
fically induced radioactivity in 1934 by irra-
diating targets of aluminum, magnesium, and For those radionuclides resulting from a de-
boron with alpha particles from a 1OO-mCi t crease in the neutron-proton ratio (i.e., neutron-
polonium source and noting that the targets deficient nuclei), the transformation again
continued to emit radiation after the alpha tends to reverse the cause of instability, and,
source was removed, This discovery offered the where energetically possible, a proton in the
first chemical proof of artifical transmutation. nucleus is transformed into a neutron, with
After the introduction of the cyclotron and the emission of a positive electron (positron,
~+):
other particle accelerators, many elements
were bombarded with deuterons and protons
to produce hundreds of new radioisotopes,
including the well-known 131 I, 32p, and 14C. In a competing process, neutron-deficient
Large-scale production of radioisotopes, how- nuclei will regain stability by the capture of an
ever, did not come about until nuclear reactors orbital electron (EC). Indeed, it is the only
were available after World War II to supply ~-decay mode possible for such nuclei when the
enormous amounts of neutrons. The number of decay energy (the mass difference between
artificially produced isotopes had reached the decaying and product atom) is less than 2
200 in 1937, and with the nuclear reactor as mc 2 :
a source of neutrons in World War II, about 450
p + e- ~ n + v.
*A half-life (T 112) is defined as the time required The electronic vacancy produced in the K (or
for one half of an initially large number of radio- L, etc.) shell is filled by an electron from a
active atoms of a given species to decay, less tightly bound state with the simultaneous
tA curie (Ci) is defined as 3,70 X 10 10 disintegra- emission of an x-ray (characteristic of the prod-
tions/second. uct element) or an electron produced by an
ISOTOPES 624

internal photoelectric process (Auger elec- law, have used the coincidence between the
trons). These atomic rearrangements following alpha particles of the parent and the K x-rays
electron capture, particularly in a heavy atom, of the daughter to simultaneously establish
may involve many x-ray emissions and Auger the parent-daughter genetic relationship and the
processes in successively higher shells. The ad- atomic number independently of other nuclear
justment can be quite extensive and includes or chemical information.
such effects as Coster-Kronig transitions (Auger Production of Radioisotopes. The bulk of the
effect in the subshells). The fluorescence yield artificially produced radioisotopes are made
is defined as the fraction of vacancies filled by neutron reactions in the high-volume neu-
by the emission of x-rays. The Auger yield tron fluxes available in NUCLEAR REACTORS.
is, in a similar fashion, defined as the fraction Neutrons, having no charge, can easily penetrate
of vacancies filled by the emission of Auger the coulombic barriers of the nucleus. The
electrons. atomic nuclei of the elements vary in their
Many radioactive nuclei decay by two or ability to capture thermal neutrons (Le., neu-
more modes so that {3-, {3+, and EC decay with trons slowed down to 2200 meters/second or
associated emissions and x-radiation are not ~0.025 eV) according to their cross sections,
uncommon. The branching ratio defines the a term which expresses the probability of inter-
relative amount of each mode of decay. action of a neutron of a certain energy or
For heavy nuclei (Z > 82), the transforma- velocity (see CROSS SECTIONS AND STOPPING
tion to a more stable configuration usually POWER). When target materials are placed in
takes flace by the emission of an alpha particle the reactor and subjected to a flux of neutrons
(ex or He). As the nuclei become progressively (CP, number of neutrons traversing a unit area
heavier, the half-lives become shorter and the per unit time), neutrons are captured in propor-
fission process becomes more dominant. tion to cross sections of the target element
In alpha or beta decay processes the product atoms present. Cross sections are expressed in
nucleus may be left in either the ground state barns (1 barn = 10- 24 cm 2 , which is approxi-
or, more frequently, in an excited state. A mately equal to the actual cross sectional area
nucleus in an excited state may de-excite by the of a medium-weight nucleus). Certain materials,
emission of electromagnetic radiation or pho- such as aluminum and graphite, have such small
tons ('Y-radiation). Frequently the gamma neutron capture cross sections that few neu-
transition does not proceed directly to the trons are captured; others, such as cadmium,
ground state, but rather may go in several steps have such large cross sections that a thin foil
involving intermediate excited states. The will absorb almost all the thermal neutrons
angular correlation between successive gamma impinging upon it.
rays depends on the multipole character of the Radioisotopes are produced in a nuclear
radiations and on the spins of the intermediate reactor by several different processes. Those
states. An alternative to gamma-ray emission processes that produce appreciable quantities
is the internal conversion process, an electro- of radioisotopes are described below.
magnetic interaction between the nucleus (1) (n, 'Y) Process. In the (n, 'Y) process,
and the orbital electrons. Thus, the transition which is most common, a neutron is captured
between the two energy states of the nucleus by a target and simultaneously a photon is
is not evidenced by the emission of a photon. emitted. Since no change of atomic number Z
Instead the energy is imparted to an orbital occurs, the element remains the same as the tar-
electron which is ejected from the atom. The get material. The (n, 'Y) reaction is primarily
ratio of the internal conversion process to the a thermal neutron reaction; cross sections for
rate of gamma emission is known as the internal (n, 'Y) reactions vary from a few millibarns to
conversion coefficient, ex. The internal conver- many thousands of barns. For example:
sion process leaves the atom with a vacancy in
one of the shells. The subsequent atomic re- (T 1/2 = 15.02 h).
arrangement process is essentially identical to
that following electron capture. The radioelement cannot be separated chem-
For some nuclei, only gamma radiation is ically unless a recoil collection is used. In the
emitted for the deexcitation from a metastable Szilard-Chalmers process, the recoil energy of
or isomeric state. Such decay is termed iso- the residual nucleus, resulting from the emis-
meric transition (IT) and is characterized by no sion of the photon, is greater than the chemical
change in mass number or atomic number. binding energy of the nucleus in a compound.
Here, too, the internal conversion process is a Radioisotopes produced sometimes decay by
competing process. beta emission ({3- or (3+) or electron capture to
Knowledge of the energies and intensities of a radioactive daughter with a higher or lower
the particulate radiation (ex, {3-, (3+, and e-) and atomic number. For example:
the electromagnetic radiation ('Y-rays and x-
rays) serve to characterize a particular nucleus 144S m(n, 'Y) 14SS m
and as such is the principal means, along with
EC
the half-life, of identifying the radioisotope. 14SS m - - + 14SPm (T 1I2 = 17.7 y).
Indeed, transuranic workers, using Moseley's (340 d)
625 ISOTOPES

The daughter can be chemically separated to streams of DOE production facilities and
obtain high-specific-activity* material. represent an important source of such radio-
With the availability of thermal neutron isotopes as 90Sr (thermoelectric generator
fluxes well in excess of 2 X 1015 n/ cm 2 . sec, the systems for terrestrial and underwater applica-
preparation of millicurie amounts of radio- tions), 137Cs (radiography, teletherapy, and
isotopes by successive (n, ')') reactions has be- large irradiation units), and 147Pm (thermo-
come feasible. For example: electric power generators).
The basic equation for radioisotope produc-
64Ni(n, ')') 65 Ni (T1I2 = 2.520 h) tion is
65Ni(n, ')') 66Ni (T1I2 = 54.8 h).
A---+B--+~
(2) (n, p) Process. In the (n, p) process, UA UB
which requires neutrons of higher-than-thermal
energies,t a neutron enters a target nucleus The target atom A captures neutrons to pro-
with sufficient energy to cause a proton to be duce the product nuclide B, which in turn is
released. The atomic number is reduced by I, transformed by decay or further neutron cap-
and the affected atom is transmuted into a ture. The effective cross sections UA, UB and the
different element, which can be separated decay rate constant AB enable the rate and
chemically from the target material. Through equilibrium values for the transformation to be
chemical separation, high-specific-activity ma- calculated for any particular irradiation condi-
terial can be obtained. For example: tions. The exact solution for the differential
equation describing these rate processes for the
(T1I2 = 14.28 d). number of atoms N of the product formed at
time t in neutron flux </> is,
Cross sections for such reactions, typically a
few millibarns, are orders of magnitude less AB </>uANA
than those for (n, ')') reactions. AB NB = ------
(3) (n, a) Process. The (n, a) process, like the
AB + </>[UB - UA]
(n, p) process, requires high-energy neutrons . [e-</>aAt - e-(</>aB+AB)t] .
and typically has a cross section of a few milli-
barns. In the (n, a) process, a neutron of high In most cases, one can neglect the burnup of
energy enters a target atom and causes an alpha the target atoms and the product radioisotope.
particle to be emitted. The atomic number of In such cases, the above equation then reduces
the target atom is reduced by 2, and a chemical to:
separation yielding high-specific-activity ma-
terial is possible. For example: ABNB =NAo</>UA(l- e-A.Bt).

36Cl(n, a) 33p (T 1I2 = 25.3 d). Here N Ao refers to the number of original
target atoms at time zero.
(4) Fission. Under normal operating condi- For irradiations ' of sufficient length (t »
tions, research reactors have 20-50% burnup of TI/2), and again neglecting burnup, the satura-
the fissile material. The asymmetric fission tion factor (I - e-ABt) approaches 1 and the
process yields fission products of mass ranging equation further reduces to
between A = 72 and A = 162. The maximum
fission yields of 6.5% occur at A = 95 and A = ABNB = NAO </>UA.
138. Since several isotopes of anyone element With the introduction of the cyclotron in the
are often produced, the isotopic purity will early 1930s, charged particle reactions started
not be as high as that of radioisotopes produced to play a significant role in the prepara-
by (n, p) and (n, a) reactions. This isotopic tion of a large variety of neutron-deficient
purity will depend somewhat upon the length radioisotopes. Indeed, until the advent of
of time that the uranium was exposed to neu- nuclear reactors in the mid-l 940s, proton-,
trons and upon the elapsed time between re- deuteron-, and alpha-particle accelerators played
moval from the reactor and the chemical sepa- the major role in supplying radioisotopes for
ration. Fission products are routinely chemically medical, biological, and scientific research.
separated and purified from high-level waste During the period 1950-1970, the ORNL 86-
Inch Cyclotron exploiting a 2.6-mA (2 X 10 16
*Specific activity is the amount of radioisotope per particles/sec) beam of 23-MeV protons, supplied
unit weight of the total element and is usually ex- the major portion of neutron-deficient radio-
pressed as curies or millicuries/gram. isotopes. With the development of compact
t A few exceptions are found among reactions with cyclotrons, the production and use of short-
the light nuclei in cases where binding energy of a lived radioisotopes for medical purposes has
proton or particle is appreciably lower than that of gained impetus. Both hospitals and radio-
a neutron: the reactions loB(n, p) lo Be, 14N(n, p) 14C, pharmaceutical and radiochemical organizations
35Cl(n, p) 35S, loB(n, a) 7Li, and 6Li(n, a) 3H occur have found these cyclotrons extremely useful.
with thermal neutrons. Indeed, one organization is utilizing five such
ISOTOPES 626

accelerators. Hospitals have found these acceler- pared to 105 n/cm 2 • sec for a I-g radium source
ators particularly useful in producing short- mixed with beryllium). (See TRANSURANIUM
lived radionuclides such as 20.38-m HC, 9.96-m ELEMENTS.)
13N, 122 s 15 0, and 109.8 m 18F for on-site Isotope Processing The techniques used for
diagnostic procedures. Heavy-ion accelerators, the processing of ultrahigh-purity chemicals
developing beams of carbon, nitrogen, oxygen, are required for isotope work in recovering
neon, and argon, have opened additional areas stable isotopes, preparing target materials, and
for research. These devices, in conjunction with separating and purifying radioisotopes. Prac-
on-line mass separators, should make possible tically every technique from traditional wet
the production and identification of more of chemistry to ion exchange and chromatography
the 5000 or so theoretically possible nuclei. is utilized, often with high-purity radioisotopes,
The Brookhaven Linac Isotope Producer (BLIP) at very low concentration levels (e.g., micro-
and the Los Alamos Scientific Laboratory High grams per liter). Sophisticated analytical meth-
Flux Meson Facility (LAMPF) offer the pos- ods (e.g., mass and radiation spectral analysis)
sibility of producing large amounts of neutron- are also required and make up a significant
deficient radioisotopes by high-energy proton- portion of the cost of isotope preparations.
induced spallation reactions using 200-MeV The size of the radioisotope industry has not
and 750-MeV protons, respectively. Most re- changed much during the past decade; about
cently, the completion of the Oak Ridge Na- 100 private firms produce radioisotopes, radio-
tional Laboratory Holifield Heavy Ion Research pharmaceuticals, sealed sources, and equip-
Facility (HHIRF), a facility utilizing a 25- ment for medical, industrial, and scientific uses
million-volt tandem electrostatic accelerator of radioisotopes. The magnitude of sales, how-
and the Oak Ridge Isochronous Cyclotron, ever, has increased dramatically. Whereas in
makes it possible to accelerate ions up to the 1970 the U.S. estimated sales were more than
mass region of 160, i.e., rare earth elements, to $50 million, now, in 1982, a single firm has
the energies required to produce nuclear reac- gross sales of substantially greater magnitude:
tions, and extends the capability for creating New England Nuclear, a major supplier, reports
nuclear species far from stability. net sales of $66 million for 1979 and $82
A method for preparing short-lived radio- million for 1980. It is estimated that the annual
isotopes off-site has found widespread use in sales of 99Mo and associated generators well
nuclear medicine. The radioisotope generator exceeds $ 50 million (perhaps as high as $100
employs a relatively long-lived parent which is million), 201Ti exceeding $30 million, and 67Ga
sorbed onto an ion exchanger; the short- exceeding $10 million. To place it in another
lived daughter may be eluted, as required, with a perspective, approximately 225,000 thallium
suitable agent. The most common such system, doses were administered to patients in the
the 99Mo_99mTc generator, yielding 6.02-h 99Tc United States in 1980. Sales of stable isotopes
pertechnetate, has been established as a power- at Oak Ridge National Laboratory exceed $4
ful tool for diagnostic scanning and has largely million annually, that for radioisotopes is $6
supplanted the use of 13OI. Other generators million, and that for heavy elements (241 Am,
include the 87y _87m Sr (2.83 h) and the 113 Sn- 234U, 229 Th, and others) exceeds $4 million.
113m In (99.5 m) systems. The overall market today is in the tens of bil-
Recent advances in transuranic isotope tech- lions of dollars.
nology have made possible the production of It is estimated that 17 million in vivo and 100
large quantities of these radioisotopes. The thousand in vitro nuclear studies were made
original neutron irradiations to produce such during 1980, a figure which corresponds to one
nuclei were performed in the Oak Ridge and out of every two or three hospitalized patients
Hanford reactors in the 1940s, in the NRX plus a large number of outpatients (275 thou-
reactor at Chalk River Laboratory in Canada in sand/day). There are about 22,000 individuals
the late 1940s, and in the Materials Testing in the United States practicing nuclear medicine.
Reactor (MTR) in Idaho, in the 1950s. Both
neutron irradiation and charge particle bom- J. J. PINAJIAN
bardment were used to produce and identify
about 50 transplutonium nuclides covering the
atomic number range up through Z = 103 and References
mass range up through 257. During the 1960- Nuclear and Radiochemistry
1970 period, the Savannah River reactors and
the High Flux Isotopes Reactor (HFIR) at the Barbier, Marcel, "Induced Radioactivity," Amsterdam,
Oak Ridge National Laboratory were used for North-Holland Publishing Company, 1969.
large-scale production of an additional 50 Friedlander, G., Kennedy, J. W., Macias, E. S., and
transplutonium isotopes in milligram and gram Miller, J. M., "Nuclear and Radiochemistry," Third
quantities. Indeed, several grams of 252Cf have Edition, New York, John Wiley and Sons, Inc., 1981.
been produced since 1966. Milligrams of ma- Kocher, D. C., "Radioactive Decay Data Tables,"
terials such as 252Cf and 241 Am represent excel- Technical Information Center, Department of
lent fission sources of neutrons presenting peak Energy Report, DOE/TIC-l1026 (1981).
thermal neutron fluxes of 108 n/cm 2 • sec (com- Lederer, C. M., and Shirley, V. S., "Table of Isotopes,"
627 ISOTOPES

Seventh Edition, New York, John Wiley and Sons, Part I., "Irradiation Technology," (Zentralinstitut
Inc., 1978. fUr Kernphysik Dresden) ZfK-RCH-l, December
Oak Ridge National Laboratory Research Materials 1961 (in German); for English translation see
Catalog, Oak Ridge, Tennessee, in press. Pinajian, J. J., Oak Ridge National Laboratory Re-
Seaborg, G. T., "Transuranium Elements: Products of port ORNL-tr-2400, November 1970.
Modern Alchemy," Stroudsburg, Dowden, Hutchin- Oak Ridge National Laboratory, "ORNL Radio-
son, and Ross, 1978. isotope Procedures Manual," ORNL-3633, June
Subcommittee on Nuclear and Radiochemistry, Com- 1964.
mittees on Chemical Sciences, Assembly of Mathe- Pinajian, 1. J., "Oak Ridge Research Reactor for Iso-
matical and Physical Sciences, National Research tope Production," Isotop. Radiat. Technol. 1,130-
Council, "A Review of the Accomplishments and 36 (Winter 1963-64).
Promise of U.S. Transplutonium Research 1940- Rupp, A. F., and Binford, F. T., "Production of
1980," Washington, D.C., National Academy of Radioisotopes," in "Nuclear Engineering Hand-
Sciences, 1982. book" (H. Etherington, Ed.), Section 14, pp. 26-37,
New York, McGraw-Hill Book Co., 1958.
Mass Spectrometry
Roboz, J., "Introduction to Mass Spectrometry: Accelerator Production of Radioisotopes
. Instrumentation and Techniques," New York, Lange, J., and Miinzel, H., "Estimation of Unknown
Interscience Publishers, 1968. Excitation Functions for (c;, xn), (c;, pxn), (d, xn),
(d, pxn), and (p, xn) Reactions," Karlsruhe Nuclear
Atomic Weights Research Center Report, KFK-767, May 1968 (in
International Union of Pure and Applied Chemistry, German); for English translation see Pinajian, J. J.,
Inorganic Division, Commission on Atomic Weights and Kern, L. H., Oak Ridge National Laboratory
and Abundances, Pure and Applied Chemistry 52, Report ORNL-tr-3020, October 1970.
2349-2384 (1980). Laughlin, J. S., Tilbury, R. S., and Dahl, J. R., "The
Cyclotron: Source of Short-lived Radionuclides and
Stable Isotopes Positron Emitters for Medicine" in "Progress in
Subcommittee on Nuclear and Radiochemistry, Com- Atomic Medicine, Volume 3: Recent Advances in
mittee on Chemical Sciences, Assembly of Mathe- Nuclear Medicine" (1. H. Lawrence, Ed.), New York,
matical and Physical Sciences, National Research Grune & Stratton, 1971.
Council, "Separated Isotopes: Vital Tools for Pinajian, J. J., "ORNL 86-Inch Cyclotron," in "Radio-
Science and Medicine," Washington, D.C., National active Pharmaceuticals," (G. A. Andrews, R. M.
Academy Press, 1982. Kniseley, and H. N. Wagner, Eds.), Chapter 9,
Davis, W. C. et aI., "Chemical Recovery and Refine- pp.143-54, U. S. Atomic Energy Commission, 1966.
ment Procedures in the Electromagnetic Separation Rosen, L., Schillaci, M. E., Dropesky, B. J., and
of Isotopes," Oak Ridge National Laboratory Re- O'Brien, H. A., "Use of LAMPF for Isotope Pro-
port, ORNL-4583, August 1970. duction: Briefing to the AEC Division of Iso-
Underwood, J. N., Love, L. 0., Prater, W. K., and topes Development, December 15, 1970," Los
Scheitlin, F. M., "Calutron Experiments with Milli- Alamos Scientific Laboratory Report, LA-4587-MS,
gram Quantities of Charge Material," Nucl. Instrum. February 1972.
Methods 57,17-21 (1967). Stang, L. G., Jr., Hillman, M., and Lebowitz, E., "The
Villani, S. , "Isotope Separation," New York, Ameri- Production of Radioisotopes by Spallation," Brook-
can Nuclear Society, 1976 . haven National Laboratory Report, BNL-50195,
August 1969.
Reactor Production o[ Radioisotopes
Radiopharmaceuticals and Nuclear Medicine
Aebersold, P. C., and Rupp, A. F., "Production of
Short-lived Radioisotopes," in "Production and Use Schneider, P. B., and Treves, S., "Nuclear Medicine in
of Short-lived Radioisotopes from Reactors," Vol. 1, Clinical Practice," Amsterdam, Elsevier, North
pp. 31-47, Vienna, International Atomic Energy Holland Biomedical Press, 1978.
Agency, 1962. Spencer, R. P., "Radiopharmaceuticals: Structure,
Binford, F. T., Cole, T. E., and Cramer, E. N., "The Activity, and Relationships," New York, Grune &
High Flux Isotope Reactor," Oak Ridge National Stratton, 1981.
Laboratory Report, ORNL-3572 (Rev. 2), May Wagner, H. N., "Principles of Nuclear Medicine,"
1968. Philadelphia, W. B. Saunders Co., 1969.
Brookhaven National Laboratory, "Manual of Isotope
Production Processes in Use at Brookhaven National Cross-references: ATOMIC PHYSICS, CROSS SEC-
Laboratory," BNL-864, August 1964. TIONS AND STOPPING POWER, ELECTRON, NU-
Crandall, J. L., "The Savannah River High Flux CLEAR REACTIONS, NUCLEAR REACTORS, NU-
Demonstration," Savannah River Laboratory Report, CLEAR STRUCTURE, PERIODIC LAW AND
DP-999, June 1965. PERIODIC TABLE, PROTON, RADIOACTIVITY,
Knoll, Peter, "The Technology oflsotope Production," TRANSURANIUM ELEMENTS.
K
KEPLER'S LAWS OF PLANETARY MOTION accounts for the manner in which the distance
from the sun to the planet changes as the planet
The German astronomer and mathematician travels along its orbit. As shown in Fig. 1, the
Johannes Kepler (1571-1630) worked briefly perihelion is the point in the orbit at ~hi~h the
with the Danish astronomer Tycho Brahe, who planet is closest to the sun; the aphelion IS the
gathered some of the most accurate observa- point farthest from the sun.
tional data on planetary motion in the pre- Prior to Kepler's time, it was assumed that
telescope era. When Brahe ·died in 1601, Kepler the planetary orbits were concentric circles
inherited his data books and devoted many with the sun at the center, because it was
years of intensive effort to finding a mat.hemat- thought that only the perfect geometrical curve
ical description for the planetary motIon de- of the circle could describe the motion through
scribed by the data. Kepler was successful in the heavens of perfect celestial bodies. How-
deriving three laws of planetary motion which ever as the accuracy of the observational data
led ultimately to our current understanding of incr~ased it became clear that the distance from
the orbital motion of planets, moons, and com- the sun to each planet was not constant, as it
ets as well as man-made satellites and space- would be for a circular orbit. Increasingly com-
craft. The first two laws were published in 1609, plicated geometrical constructions using several
about the time Galileo was first making astro- circles were proposed in an attempt to fit the
nomical observations with his telescope; the data for an individual planet orbit. Kepler was
third law did not appear until a decade later, in the first person to recognize that an ellipse,
1619. rather than a circle, is the geometrical curve
Briefly stated, these laws are (see Fig. 1): which describes the shape of a planet orbit in a
simple and elegant manner.
1. The orbit of each planet is an ellipse with 2. The second law is a concise mathematical
the sun at one focus. description of the observed fact that the rate at
2. The line from the sun to a planet sweeps which the sun-planet line rotates through space
out equal areas inside the ellipse in equal (the angular velocity of the motion) increases as
lengths of time. the planet moves closer to the sun and decreas~s
3. The squares of the orbital periods of the as it moves farther from the sun. As shown m
planets are proportional to the cubes of Fig. I, the distance along the orbit traveled by
their mean distances from the sun. the planet near perihelion in a given length of
time (for example, one month) is greater than
1. The first law describes the geometrical the distance travelled near aphelion during the
shape of the orbit as an ellipse, which correctly same length of time. This phenomenon is de-
scribed quantitatively by the statement that
equal areas are swept out in equal lengths of
time. The time required to sweep out the total
area inside the ellipse is the orbit period of the
planet.
3. The first two laws describe the motion of
an individual planet. By contrast, the third law
Perihelion states the manner in which the motions of the
various planets are related to each other. It
states that the ratio formed by dividing the
square of the orbit period of any planet by the
cube of its mean distance from the sun is the
same value for all planets in the solar system.
The term mean distance, as used in the third
FIG. 1. Elliptical orbit of a planet with the sun at law is simply the average of the perihelion and
one focus. Equal areas P and A are swept out in aph'elion distances. This mean distance is then
equal lengths of time. (Taken from Second Edition half the distance between perihelion and aph-
article by Richard M. Sutton.) elion, and is called the semimajor axis of the

628
629 KEPLER'S LAWS OF PLANETARY MOTION

ellipse. Another way of stating the third law is note that the motion of any body orbiting the
that the period of a planet is proportional to sun in a closed, periodic orbit is governed by
the 3/2 power of its mean distance from the this same force law and must satisfy Kepler's
sun. laws. Thus the motion of comets, asteroids, and
Kepler, to his credit, formulated these three interplanetary spacecraft, such as the Mariner
laws based entirely on empirical data, without spacecraft launched to Mars in 1969 and the
the benefit of a fundamental theory which ex- Voyager spacecraft launched to Jupiter in 1977,
plained why planetary motion satisfied these are governed by these same laws.
laws. The missing ingredient was the concept of One can easily verify Kepler's third law using
gravitational force, which was developed several modern astronomical data for the planetary
decades later by Sir Isaac Newton. Newton, orbits. If one plots the orbit periods of the
starting with Kepler's first law, deduced that a planets against their mean distances from the
planet whould move in an elliptical orbit with sun on a log-log graph as shown in Fig. 2, a
the sun at one focus only if the force exerted straight line of slope 3/2 connects the points,
on the planet by the sun was proportional to indicating that the orbit period T is propor-
the inverse square of the distance between tional to the 3/2 power of the mean distance
them. This is the so-called inverse-square law of R. The unit of distance used in the figure is the
gravitational force. Newton also showed that astronomical unit (A. U.), which is approxi-
Kepler's second law was a consequence of the mately 150 million kilometers or 93 million
principle of conservation of angular momen- miles.
tum. The angular momentum is conserved be- In the few minutes it takes to plot the graph,
cause the gravitational force is a central force, one can discover what it took Kepler ten years
that is, it acts along the line from the planet to to find! The data for the planets beyond Sat-
the central body, the sun. The third law is a u{n (unknown to Kepler) also lie on this same
natural consequence of the inverse-square gravi- straight line. It is interesting to also note that
tational force field of the sun, which exists Halley's comet, which orbits the sun with a
throughout the solar system. It is interesting to period of 76 years at a mean distance from the

I
I
/
/
/
I
/
40 I

30 Saturn

20

Jupiter
10
~ 8
...>-
0
6
c
4
Qj
c
.2
a. 2
'0
"0 I Year
., 1.0
0
.;:
Q.
,
0.6 V~nus
,
,
04 ,,,
I

Mercury! :::j
0.2 ,«
,
,!
1
0.1
0.1 0.20.3 0.6 1.0 2 34 5 7 10 20 3040

Mean distance from sun in astronomical units


FIG. 2. Graph on log-log paper of periods of planets in years vs mean distance from sun in astronomical units
(sun~arth distance is 1 A.V.). (Taken from second edition article by Richard M. Sutton.)
KEPLER'S LAWS OF PLANETARY MOTION 630

sun of 18 A. U. also falls on the same straight ings, is actually traveling at this enormous speed
line. around the sun. The EMOS is a convenient unit
The motion of Halley's comet, which will of speed used in mission planning for interplan-
cross the earth's orbit in 1986, also provides etary spacecraft.
a dramatic example of one consequence of Kepler's laws also apply in the case of man-
Kepler's second law (the conservation of angu- made or natural satellite orbital motion about a
lar momentum). In order to satisfy this law, the planet, such as the earth. In this application,
ratio of-the speed of the comet at perihelion the central body is the planet rather than the
to its speed at aphelion must equal the ratio of sun. If the satellite is close to the planet in rela-
the aphelion to perihelion distances from the tion to the distance from the sun to the satellite,
sun. Since the aphelion distance is 35.3 A.U. the planet's gravitational force on the satellite is
(outside of Neptune's orbit) and its perihelion much greater than the gravitational force due to
distance is 0.6 A.U. (inside the earth's orbit at the sun. The satellite is then said to be inside
1 A.U.), the speed of the comet at perihelion is the sphere of influence of the planet. The
nearly 60 times its speed at aphelion. earth's moon and all artificial satellites closer to
The magnitudes of the speeds of the planets the earth are well within the earth's sphere of
are enormous by earth-based standards. Because the influence because the sun is 400 times far-
the orbit of the earth is very nearly a circular ther from the moon than is the earth. (Recall
orbit of radius I A.U., the distance travelled by that the gravitational force is inversely propor-
the earth in one year is approximately 27T A. U. tional to the square of the distance).
(the circumference of the circular orbit). Divid- Figure 3 illustrates on a log-log graph the pe-
ing this distance by the elapsed time of one riods in hours of satellites orbiting the earth
year gives the value of the unit of speed called plotted against their mean distances from the
the EMOS, for earth-mean-orbital-speed. One center of the earth, measured in units of earth
EMOS is equal to about 30 kilometers per sec- radii. As in the case of the planetary orbits
ond or 67,000 miles per hour. Thus the reader, about the sun shown in Fig. I, the graph is a
sitting motionless relative to his or her surround- straight line of slope 3/2, illustrating Kepler's

1000
800
600

400

200

~ 100
::l
0 80
.s::;
c 60

~ 40
2l
~
'0
20
"0
"'"
Q)
Q..

10
8
6

Excluded
2 4 6 810 20 40 6080100
Mean distance from center of earth in earth radii
(1 earth radius =6378 kilometers)

FIG. 3. Log-log graph of satellites of earth.


631 KEPLER'S LAWS OF PLANETARY MOTION

third law. The sphere of influence of the earth is usually given in terms of perigee and apogee
is located at approximately 150 earth radii altitudes. To calculate the mean distance from
from the center of the earth. the center of the earth for use in Fig. 3 to de-
The lower left-hand region of the graph is termine the orbit period, one simply adds the
excluded because no satellite can continually radius of the earth (6378 km) to the mean alti-
orbit the earth if its mean distance from the tude (the average of the perigee and apogee alti-
center of the earth is less than one earth radius, tudes). For example, a satellite having a perigee
equal to 6378 kilometers, or approximately altitude of 1548 kilometers and an apogee alti-
4000 miles. In fact, the minimum mean dis- tude of 1800 kilometers has a mean altitude of
tance is slightly greater than this value by a 1674 km and a mean distance from the center
hundred kilometers or so because of the layer of the earth of 8052 km, or about 1.3 earth
of atmosphere surrounding the earth. Satellites radii. Referring to the graph of Fig. 3, this cor-
entering the atmosphere burn up due to the responds to an orbit period of 2 hours.
heat caused by atmospheric drag. However, if Other types of orbits besides ellipses are pos-
one for a moment imagines a fictitious satellite sible in an inverse-square gravitational force
orbiting the earth at tree-top level and ignore5 field, namely parabolic and hyperbolic orbits.
the effect of atmospheric drag, the orbit period These were entirely unknown to Kepler, who
would be approximately 84 minutes or 1.4 was only aware of closed, periodic orbits. Para-
hours, at an orbit speed of approximately 8 ki- bolic and hyperbolic orbits are open, in the
lometers per second (about 18,000 miles per sense that they do not close on themselves, but
hour). This enormous speed is necessary in instead extend to infinity on either side of the
order to overcome the gravitational force at- central body. The motion along these orbits
tempting to pull the satellite down to the sur- does not periodically repeat and Kepler's first
face of the earth. and third laws do not apply in this case. However,
The satellites shown in Fig. 3 include the Kepler's second law is still valid because angular
STS-l, the first orbital flight of the Space Trans- momentum is conserved. Thus, Kepler derived
portation System, (the Space Shuttle), which a law which applies to orbits which he did not
was in orbit April 12-14, 1981. The Space even know existed! Parabolic and hyperbolic
Shuttle orbited the earth at an altitude of orbits occur when the speed of the orbiting
approximately 278 kilometers with an orbit body is too great for the body to be captured
period of 90 minutes. The altitude of a satellite by the central body. For this reason these types
is defined as the distance above the surface of of orbits are also termed escape orbits.
the earth. The distance from the center of the Figure 4 shows an example of a hyperbolic
earth is calculated by simply adding the altitude orbit, the flyby of the planet Jupiter by the
of the satellite to the radius of the earth. The
Landsat satellites, which photograph the earth's
surface, have an altitude of 920 km and a pe-
riod of 1.7 hours.
Another satellite shown is a NA VST AR GPS
(Global Positioning System) satellite. A collec-
tion of these satellites are in orbits having 12-
hour periods, and provide precision navigation
information for ships and aircraft on or near
the surface of the earth. Also shown is a geo-
synchronous satellite, which has an orbit period
of one sidereal day, equal to 23 hours, 56 min-
utes, and 4 seconds (the period of the daily ro-
tation of the earth relative to the distant stars).
The orbital motion of geosynchronous satellites
is synchronized with the daily rotation of the
earth, causing these satellites to remain directly
over fixed points on the surface of the earth at
the equator. Geosynchronous satellites are used
for radio, television, and other forms of com-
munication over large distances on the earth.
An example is the satellite WEST AR 1, which is
located above the equator at 99° west longi-
tude. The earth's moon is also shown on the
same straight line on the graph with its sidereal
period of 27.3 days.
F or a satellite in an elliptical orbit about the
earth, the points in orbit nearest and farthest FIG. 4. Hyperbolic flyby of Jupiter by Voyager
from the center of the earth are termed perigee spacecraft. Numbered areas are equal and are swept
and apogee, respectively. Data for satellite orbits out in equal lengths of time.
KEPLER'S LAWS OF PLANETARY MOTION 632

Voyager I spacecraft in March 1979. The space- tion can be readily measured; the phase retar-
craft made a single high-speed passage by the dation due to the Kerr effect is proportional to
planet and proceeded on to the next destina- a constant characteristic of the material and the
tion, the planet Saturn. The orbit is curved to- length of the sample and inversely proportional
ward the planet due to the gravitational force to the wavelength.
of the planet pulling on the spacecraft. In liquids and other optically isotropic media
Other examples of hyperbolic orbits are the a dc Kerr constant is usually defined as:
orbits of the two Voyager spacecraft about the
sun after their encounters with Jupiter in March onll - onl
and July of 1979. During their Jupiter flybys, B= AIEI2
the spacecraft were accelerated by the gravita-
tional force of the massive planet to speeds where A is the wavelength in vacuum. The in-
which placed them on escape orbits from the dividual molecules of such media are often
solar system. After their subsequent encounters highly anisotropic with dipole moments p. or-
with the other planets, they will proceed out iented along a definite molecular axis as well as
of the solar system into our galaxy never to (frequency-dependent) dielectric polarizability
return. As they escape the solar system on tensors (Xij(W) with different values along each
hyperbolic orbits, which were unknown to of three orthogonal directions. The overall isot-
Kepler, their motion satisfies Kepler's second ropy of the medium results from the random
law, derived almost 400 years ago. orientations of the individual molecules. If the
molecules are free to rotate, an applied electric
JOHN E. PRUSSING field can perturb the initial random distribution
of molecular orientations, causing more mol-
References ecules to align parallel to the field than perpen-
dicular to it. This realignment causes the ma-
1. Lodge, Sir Oliver, "Johann Kepler," in "The World jority of the birefringence detected in the Kerr
of Mathematics," Vol. 1, (1. R. Newman, Ed.), effect. 1 Even when the molecular orientations
Simon and Schuster, New York, 1956. are fixed, the applied field can perturb elec-
2. Sagan, Carl, "The Harmony of Worlds," in "Cos- tronic energy levels. The change in energy of
mos," Random House, Inc., New York, 1980. electronic transitions is called the Stark effect,
3. Bate, R. R., Mueller, D. D., and White, J. E., "Fun- and can often be detected as a shift or splitting
damentals of Astrodynamics," Dover Publishing of an absorption band. Since every absorption
Co., New York, 1971. band causes dispersion in the index of refrac-
4. Prussing, J. E., "The Mean Radius in Kepler's Third tion that extends to wavelengths beyond the
Law," American Journal of Physics 45(12) (De- absorption itself, the shift in energy levels im-
cember 1977). plies a change in the index of refraction that
contributes to the Kerr effect and is character-
Cross-references: ASTRONAUTICS, DYNAMICS, ized by an electronic hyperpolarizability tensor
GRAVITATION, ROTATION-CURVILINEAR MO· 'Yijkl' The dc Kerr constant for a liquid com-
TION, MECHANICS. posed of uniaxial molecules where the three
components of the molecular polarizability
tensor at zero frequency are all (0) and a22 (0) =
KERR EFFECTS a33 (0) = all (0) + Aa(O) and the corresponding
In electrooptics and nonlinear optics, a change polarizabilities at optical frequencies are all (w)
in the index of refraction of a medium that is and all (w) + ~(X(w) and where the dipole mo-
proportional to the square of an electric field or ment is along all can be expressed as:

:~ (n2 3+ 2Y(e;2J
to the product of two electric fields is termed
a Kerr effect. 1 In magnetooptics, various field- B=
dependent changes in the polarization of a light
beam reflected from the surface of a mag-
netized material are also called Kerr effects. 2
The details of the physics underlying each of
.[2 ( )+ ~a(O) ~a(w) + p.215k~a(w)lJ
'YxYYX 15kT 2 T2
.

these classes of phenomena depend upon the


experimental conditions and the medium. In this equation N is the number density per
The electrooptic Kerr effect is intrinsically unit volume, e is the dc dielectric constant,
anisotropic; the index of refraction for light ('Yxyyx) is the orientationally averaged molec-
polarized along the direction of the field is al- ular hyperpolarizability, k is Boltzmann's con-
tered by an amount different from that for stant, and T is the absolute temperature. When
light polarized perpendicular to the applied p. =1= 0 the term proportional to p.2 usually dom-
field. In a typical experiment a dc field is inates, while ('Yxyyx) is significant only for
applied at 45° to the polarization of the in- highly symmetric molecules such as CCI4 • The
cident light and the field-induced birefringence dc Kerr constant of nitrobenzene is 3.26 X 10-5
onll - onl causes the output beam to become cm/statvolt 2 at A = 589 nm, which is an order
elliptically polarized. 3 This change in polariza- of magnitude larger than the constants for
633 KERR EFFECTS

other simple molecules. 4 Nematic liquid crys- fringes different from that in the bright fringes,
tals can show even larger effect, however. creating a grating which can Bragg scatter other
In crystals, the polarization subscripts of the beams. The scattered beam contains informa-
nonlinear susceptibility tensor must be referred tion carried by the initial beams; the overall
to crystallographic axes. A quadratic field de- process has been widely studied and is generally
pendence of the index of refraction ellipsoid termed real-time holography.
results from the Kerr effect, but is often called The tensor nature of the nonlinear suscep-
the quadratic electrooptic effect. It is smaller tibility complicates the description of many
and less important technologically than the technologically important applications of the
linear electrooptic or Pockels effect, which only optical Kerr effect. If the applied field at fre-
occurs in noncentrosymmetric crystals. 5 quency WI has projections Ex(wI) and Ey(wI)
The optical Kerr effect occurs when the ap- along the x andy axes, the field alters the po-
plied or orienting field is at optical frequency larization of a probe beam at frequency W2 in
WI. The definition of the dc Kerr constant an otherwise optically isotropic medium . If the
above can be modified to describe some effects probing field is initially polarized along x, the
expected in the optical case. The direction of a birefringence of a sample of length I creates a
static dipole moment cannot reverse fast component polarized along y with amplitude
enough to follow a rapidly oscillating optical
field so the third term in the equation for B -12il (3)
vanishes. Also the dielectric constant € must be E y (W2)= ( )X {xyxyx(W2,-WI,
replaced with the square of the index of refrac- n W2 2
tion at the frequency of the applied field and WI, -W2 )E:(wl )Ey(WI)
the dc molecular anisotropy ~a:(O) must be re-
placed by the optical frequency value ~a:(WI ).6 + X~Vxx (W2, -WI, WI ,-W2)
Additional complexities, however, result from
the presence of resonances in the optical Kerr . E x (wd E ;(WI)}E:(W2)
constant at a variety of optical frequencies.
The optical Kerr effect and its many varia- where i =A and an asterisk denotes the com-·
plex conjugate.
tions are best discussed in terms of a third order
nonlinear susceptibility tensor X}!Jl(W2, -WI, Because the molecular reorientations can oc-
WI, -W2) where the components of the di- cur on a picosecond time scale, the optical Kerr
electric polarization density is written as Pi = effect can be used as a high-speed shutter. A
cell filled with a Kerr liquid such as CS 2 is
Lj XijEj + Lkj XiWEjEk + Ljkl XiW1Ej E k E l. 7 placed between crossed polarizers. A powerful
In the Maker-Terhune convention, the fre- IO-picosecond-long laser pulse, linearly polar-
quency W2 is that of the optical probe, while ized at 45° to the axes of the polarizers and
the frequency WI (which may be zero) de- incident from the side, can induce sufficient
scribes the applied field. The subscripts relate birefringence to permit rapidly occurring pro-
to the polarization directions of the field, and cesses to be photographed. 8
each of the four subscripts is paired with the The nonlinear susceptibilities describing the
frequency at the corresponding position in the optical Kerr effect show resonances and dis-
argument. Thus an applied field at frequency persion similar to the index of refraction, but
WI in the x direction causes a change in the often depending on more than one frequency.
index of refraction in the direction parallel to These effects have been exploited in "polariza-
the field: tion spectroscopy," Raman induced Kerr effect
spectroscopy, and other laser spectroscopy tech-
onx =onll niques. 9 These techniques benefit from the sen-
sitivity with which polarization changes can be
_ 241T (3) _ _ 2
detected. In the polarization spectroscopy ver-
- ~xx(W2' WI,WI, W2) IExl ,
n sion of saturation spectroscopy, all the fields
have the same frequency, but propagate in op-
while, perpendicular to the field, posite directions through an absorbing vapor.
When the laser frequency approaches that of a
ony =onl transition in the vapor, a field-dependent bire-
fringence (and dichroism) can often be induced.
_ 241T (3) _ _ 2 In vapors at low density, the widths of the non-
- Xyxxy(W2, WI,WI, w2)IEx l
n linear resonances can be much less than that of
the absorption itself.
in cgs units. These index of refraction changes In Raman induced Kerr effect spectroscopy,
cause self-focusing of high-power laser beams in the difference of the input frequencies WI - W2
most media. must approach that of a Raman active vibration
Coherent beams crossing in a Kerr medium of the medium. Again a resonant Kerr effect
create an interference pattern of bright and occurs-accompanied by dichroism at exact
dark fringes. The optical Kerr effect makes the resonance. The nonresonant part of the Kerr
index of refraction in the region of the dark effect can be suppressed by using circularly po-
KERR EFFECTS 634

larized light for the applied field (i.e., Ex (WI) = The polar Kerr effect has been exploited in
±iEr(WI ». Two-photon-absorption-induced bi- magneto-optical memories for computer and
refnngence occurs when the sum of the input video applications. A film of magnetic material
frequencies approaches a two-photon transition. can be magnetized with small domains having
Dramatic polarization rotations occur when the magnetic fields either up or down. An incident
applied field is circularly polarized. plane wave is reflected with polarization detect-
The magnetooptic Kerr effects result from ably rotated one way or the other by different
the different phase velocities of left and right domains. Up magnetization can be interpreted
circularly polarized light propagating along a as a binary I while down magnetization would
magnetic field (see FARADAY EFFECT). This be binary O. Such domains can be produced
difference in phase velocity can be interpreted with dimensions as small as a wavelength of
as a difference in the indices of refraction for light by heating the film with a focused laser
the two polarization states. Fresnel's laws of beam in a magnetizing field and allowing it to
reflection then imply that right and left cir- cool through a curie or compensation tempera-
cularly (or elliptically) polarized light will be ture. The calculated information density would
reflected with different amplitudes and phases be greater than 10 8 bits/cm 2 .
at the surface of a magnetized medium. Since
linearly polarized light can be decomposed into MARC D. LEVENSON
circular (or elliptical) components, the differ-
ence in Fresnel coefficients implies that the References
reflected light will be elliptically polarized with 1. Kerr, J., Phil. Mag. 50,337,446 (1875); 8,85,
major axis rotated from the initial polarization 229 (1879); Beams, J. W., Revs. Mod. Phys. 4,
plane. 2 133 (1932).
If linearly polarized light is normally incident 2. Kerr, J., Rept. Brit. Assoc. Adv. Sci. P40 (1876);
upon the surface, and if the magnetization is Phil. Mag. 3, 321 (1877); 5,161 (1878).
perpendicular to the surface,' the rotation angle 3. McClung, F. J., and Hellwarth, R. W., Appl. Op-
of the maj.or axis of the ellipse is tics Suppl. 1,103 (1962).
4. Gray, D. E. (Ed.), "American Institute of Physics
n+ - n_ } Handbook," McGraw-Hill, New York, 1972, p.
(lk =-Im {
n+n_ - I 6-232.
5. Kaminov, I. P., and Turner, E. H., Appl. Optics 5,
and the ellipticity of the reflected light is 1612 (1966).
6. Mayer, G., and Gires, F., C. R. Acad. Sci. (Paris)
n+ - n_ } 258,2039 (1964).
€k =-Re {
n+n_ - I 7. Maker, P. D., and Terhune, R. W.,Phys. Rev. 137,
A801 (1965).
where n± = N± - iK± is the complex index of 8. Dugay, M. A., and Mattick, A. T., Appl. Opt. 10,
refraction for right (+) or left (-) circularly 2162 (1971).
polarized light and Re and 1m denote the real 9. Levenson, M. D., "Introduction to Nonlinear Laser
and imaginary parts, respectively. Both (h and Spectroscopy," New York, Academic Press, 1982,
€k are typically very small with () k less than 10 pp. 73ff, 139ff.
minutes of arc for the ferromagnetic metals. 10. Freiser, M. J., IEEE Trans. on Magnetics MAG-4,
If the light is not normally incident, there are 152 (1968).
three cases. (1) In the polar Kerr effect, the
magnetization is normal to the surface, and the Cross-references: FARADAY EFFECT, LASER, PO-
complex Fresnel reflection coefficients for s LARIZED LIGHT, RAMAN EFFECT AND RAMAN
and p polarized incident light result in ellipti- SPECTRA, REFLECTION, REFRACTION.
cally polarized reflected light. (2) In the longi-
tudinal Kerr effect, the magnetization is parallel
to the surface and lies within the plane of inci-
dence. Again, linearly polarized incident light KINETIC THEORY
becomes elliptically polarized upon reflection.
(3) In the equatorial Kerr effect, the magnetiza- The kinetic theory is a branch of THEORETICAL
tion is parallel to the surface and perpendicular PHYSICS developed in the nineteenth century to
to the plane of incidence. Light polarized along explain and calculate the properties of fluids. It
the magnetization (s-polarization) is unaffected is most useful for studying the physical prop-
by the magnetic field, while p polarized light erties of gases, but it can also be applied to
has a reflection coefficient that contains a term liquids, electrons in metals, and neutrons passing
linear in the magnetization. In absorbing ma- through solids. The word "kinetic" means "per-
terials the intensity of the reflected light will taining to motion," in this case the motion of
depend upon the sign of the magnetization. The molecules or subatomic particles.
equatorial Kerr effect has been an important Historical Development The first attempt to
tool for studying absorbing magnetic materials develop a kinetic theory of gases was made in
such as ferromagnetic metals. I 0 1738 by the Swiss mathematician Daniel Ber-
635 KINETIC THEORY

noulli. Bernoulli began with the idea that mat- pressures, and should increase with temperature.
ter consists of tiny atoms moving about rapidly This seemed to go against common sense, but
in all directions, which the Greek philosopher later experiments by Maxwell himself and other
Democritus had presented, but was unable to physicists showed that the theory is correct.
prove. Bernoulli showed that the collisions of Soon afterward several scientists, starting with
atoms against the walls of a container would Josef Loschmidt (Austrian) in 1865, used the
produce a pressure which would be inversely kinetic theory to calculate the diameter of an
proportional to the total volume of the con- atom . At this time it began to appear that the
tainer; he assumed that the space occupied by atom is something that really exists in nature,
the atoms themselves is negligible compared to since it can be measured, weighed, and counted,
the total volume of the container and that the and is not merely a philosophical speculation.
rest of the space is empty. He also found that By then the atomic theory had already been
the pressure would be directly proportional to accepted in chemistry as a basis for explaining
the kinetic energy of motion of the atoms if the chemical reactions, but it was the kinetic theory
velocities of the atoms are changed while the of gases that established the place of atoms in
volume is kept fixed . (The kinetic energy of an physics.
atom is half its mass multiplied by the square of Starting from the foundations laid by Clausius
its velocity.) and Maxwell, Ludwig Boltzmann (Austrian) and
The British scientist Robert Boyle had al- J. Willard Gibbs (American) worked out system-
ready shown in 1662 that the pressure of air atic methods for calculating all the properties
varies inversely as its volume if the temperature of gases from kinetic theory (see STATISTICAL
is held constant (see GAS LAWS). Thus Ber- MECHANICS). Sydney Chapman (British) and
noulli's theory was able to explain a well-known David Enskog (Swedish) completed the theory,
fundamental _property of air and other _gases, insofar as it pertains to the transport properties
It was also known that the pressure of a gas (diffusion, viscosity, and heat conduction) of
confined in a fixed volume increases with gases at ordinary densities, although there are
temperature. However, it was not until about still some unsolved problems in the area of high-
1800 that there was enough experimental density gases and liquids, on the one hand, and
evidence, and an accurate enough temperature rarefied (very low density) gases on the other.
scale, for Gay-Lussac (French) and others to In the course of working out this theory, Chap-
establish a quantitative relation between pres- man and Enskog discovered that it should be
sure and temperature. This relation can be possible to separate the components of a mix-
expressed by saying that pressure is proportional ture of a gas by making one side of the container
to the temperature measured from "absolute hotter than the other. This effect-known as
zero" (though it was not until later in the nine- "thermal diffusion" -was soon afterwards estab-
teenth century that the idea of absolute zero lished experimentally by Chapman and Dootson
temperature was generally accepted). According (British), thus confirming the prediction based
to the kinetic theory, the absolute temperature on kinetic theory. (Thermal diffusion was used
is proportional to the kinetic energy of motion as one of the methods of separating isotopes of
of molecules in a gas. uranium during the development of the atomic
The kinetic theory was proposed again in the bomb in World War 11).
first half of the nineteenth century by two Although the kinetic theory was founded on
British scientists, John Herapath and J . J .Water- the principles of classical Newtonian mechanics
ston. Neither of them was familiar with Ber- and led to some incorrect results because those
noulli's theory, which had not made much im- principles are not valid on the molecular level,
pression on the world of science. Waterston ob- it is now generally agreed that the kinetic theory
tained one important new result, which is now is valid for calculating the statistical properties
known as the "equipartition theorem" : in a of large numbers of molecules, provided that
mixture of two or more different gases at the the properties of the individual molecules them-
same temperature, the average kinetic energy of selves are determined experimentally, or from
each kind of molecule will be the same. This the quantum theory. I t is only when one tries to
means that heavy molecules will tend to move apply the kinetic theory to matter in extreme
more slowly than light molecules, since when conditions (very low temperatures or very high
the mass of a molecule is greater, its velocity densities) that he must take account of quantum-
must be less in order to keep the kinetic mechanical modifications of the statistical
energy the same. method itself (see QUANTUM THEORY and
In 1858, the German physicist Rudolf Clausius STATISTICAL MECHANICS).
showed how the kinetic theory could be used Main Features of the Theory By assuming
to explain the rate of mixing of two gases and that the major part of the heat energy of a gas
the rate of heat conduction. His work was consists of kinetic energy of motion of the mole-
extended by James Clerk Maxwell (British), who cules, one finds that the average velocity of a
calculated the viscosity coefficient by the kinetic molecule is several hundred meters per second
theory . He found that theoretically the VISCOS- under ordinary conditions. However, it is a fact
ITY of a gas should be the same at differen t of common observation that gases do not actu-
KINETIC THEORY 636

ally move as a whole at such speeds; a gas will (3) In treating the collision of two molecules,
eventually spread throughout any container in one supposes that there is no correlation between
which it is placed, but it may be several seconds their velocities before the collision. In the ele-
or minutes, for example, before chlorine gas mentary kinetic theory, one assumes that each
generated at one end of a large laboratory is molecule has the average velocity characteristic
noticed at the other end. According to the of the region of the gas in which it has most
kinetic theory, the reason for the relative slow- recently undergone a collision. It thus "forgets"
ness of gaseous diffusion, in contrast to the high its past history every time it collides with another
average velocities of individual molecules, is molecule. This is obviously not strictly true for
that a molecule can travel on the average only each individual molecule, but it is a useful ap-
a very short distance (its "mean free path") proximation for dealing with average properties
before it collides with another molecule and of large number of molecules.
changes its direction of motion. In particular, In modern physics research, it is usual to dis-
at atmospheric pressure, if the molecular diam- tinguish between "equilibrium theory" and
eter is assumed to be about 0.00000001 cm "transport theory ," both of which grew out of
(which is approximately true for most mole- the elementary kinetic theory . Equilibrium
cules), the mean free path would be approxi- theory is described in the article on STATISTICAL
mately 0.00001 cm. At the same time, the MECHANICS (see also EQUILIBRIUM); it is used
average distance between neighboring molecules to study such properties as the heat capacity
would be somewhat more than 0.0000001 cm, (amount of heat needed to raise the temperature
so that the fraction of the total volume occupied by a certain amount) and the compressibility
by the molecules themselves is less than I part in (change in volume produced by a small change
1000. The average molecular velocity in air at in pressure). Equilibrium theory also tries to
15°C is about 460 m/sec, so that a molecule will explain the existence of phase transitions, such
have about 4 600 000 000 collisions per second. as the condensation of gases to liquids, or the
To simplify their calculations, Maxwell and appearance of magnetic ordering in solids. From
Boltzmann made the following assumptions: the theoretical viewpoint, the calculation of
(1) Instead of trying to compute the exact equilibrium properties is simpler than that of
path followed by every molecule, they assumed transport properties such as viscosity, because
that, because of the enormous frequency of one merely averages over all the possible states
collisions, the velocities and postions of mole- of the system (i.e., over all possible combinations
cules in a gas are distributed at random over all of velocities and positions of the molecules)
possible values consistent with the known phys- without having to worry about how one state
ical state of the gas. For example, it is assumed follows another in time. TRANSPORT THEORY
that the average total velocity is known, as is involves a detailed analysis of molecular col-
the temperature (which fixes the mean square lisions in order to determine how changes in the
velocity). If variations of temperature and den- state of the system are related to external forces
sity from one place to another can be ignored, or nonuniform conditions imposed on it.
then the molecular velocities can be described One of the most fruitful techniques in trans-
by a statistical distribution-the "Maxwell dis- port theory is the use of "Boltzmann's equa-
tribution" -which is similar to the normal "bell- tion ." This equation describes how the velocity
shaped curve" or law of errors in statistics. distribution changes as a result of external
It should be noted that the effect of a large forces and collisions between molecules. Un-
number of collisions is not to make all the veloc- fortunately the equation is rather difficult to
ities equal, but rather to produce a wide range solve, because the term that expresses the effect
of velocities from zero up to very large values- of collisions on the velocity distribution is an
though the probability of large deviations from integral over the values of the (unknown)
the average is quite small. The existence of this velocity distribution itself for two colliding
"spread" of molecular velocities has been veri- molecules. In order to calculate the transport
fied directly by various experiments. properties it is necessary to resort to tedious
(2) The diameter of a molecule is so small, computations with infinite series, except for
compared to the average distance between mole- certain artificial force laws (such as repulsive
cules, that simultaneous collisions of three or forces inversely proportional to the fifth power
more molecules may be ignored. The validity of of the distance between two molecules) for
this assumption for low-density gases makes it which the integral can be simplified. In most
possible to develop a very accuratr. theory of cases, the results (as worked out by Chapman
gas properties, since these properties can be and Enskog) do not differ greatly from the ones
related to the interactions of molecules taken obtained from the approximate elementary
two at a time, and the mathematical description theory. However, the important phenomenon
of such two-particle interactions is relatively of thermal diffusion was discovered only be-
simple. The corresponding kinetic theory of cause of a theoretical prediction by Enskog
dense gases and liquids, on the other hand, and Chapman; since the existence of this effect
involves the solution of difficult many-particle had not even been suggested by the earlier
problems, and reliable results have been obtained theories, this discovery must be regarded as one
only within the last few years. of the triumphs of mathematical analysis.
637 KINETIC THEORY

Irreversibility As indicated above, the kinetic and the laws of classical mechanics is shown by
theory assumes that the velocity of a molecule two famous criticisms of the kinetic theory, the
may depend on the conditions in the region "reversibility paradox" and the "recurrence
where it has just suffered a collision, but is paradox." The first paradox is based on the fact
otherwise random-in other words, independent that Newton's laws of motion are unchanged if
of its previous history. This assumption permits one reverses the time direction, so that it would
one to use the methods of probability theory seem to be impossible to deduce from these
even though, in classical mechanics, the actual equations a theorem that predicts irreversible
motions of the molecules are regarded as com- behavior. Kelvin discussed this paradox in 1874,
pletely determined by their initial configura- and concluded that while any single sequence
tions. As long as one uses the theory only to of molecular motions could be reversed, leading
calculate properties of a gas that can actually be to an ordered state, the number of disordered
measured during a relatively short time, the states is so much greater than the number of
assumption of randomness leads to no serious ordered states that it is virtually impossible to
errors. However, it introduces an element of stay in an ordered state for any period of time.
irreversibility which is inconsistent with the Thus irreversibility is a statistical but not an
reversibility of the laws of classical mechanics. absolute consequence of kinetic theory. Boltz-
(A reversible process is one that can go equally mann gave a similar answer when the problem
well forwards or backwards, in contrast to an was pointed out to him by Loschmidt a few
irreversible process, like scrambling an egg, years later. The second paradox is based on a
which cannot be undone without a great ex- theorem of Henri Poincare (French): if a mechan-
penditure of energy. The British physicist Wil- ical system is enclosed in a finite volume, then
liam Thomson, later Lord Kelvin, had pointed after a sufficiently long time it will return as
out the importance of irreversible processes in closely as one likes to its initial state. Hence H
his "Principle of the Dissipation of Energy," in must return to its original value; if it has de-
1852.) The irreversible aspect of the kinetic creased during some period of time, it must
theory is shown most clearly by Boltzmann's increase during some other period. The time
"H-theorem," which has led to a considerable between successive recurrences of the same state
amount of controversy about the foundations for the molecules in 1 cc of air is much longer
of kinetic theory. Boltzmann showed in 1872 than the present age of the universe, so one does
that a certain quantity, later called H, which not have to worry about recurrences in any
depends on the velocity distribution, must actual experiment. In his attempt to resolve the
always decrease with time, unless the velocity recurrence paradox, Boltzmann was finally led
distribution is Maxwell's distribution, in which to a remarkable psycho-cosmological specula-
case H remains constant. In the latter case, tion: he suggested that the "direction of time"
which corresponds to the equilibrium state, H as perceived by an animate being is determined
is proportional to the negative of the entropy by the direction of irreversible processes in his
(see article on THERMODYNAMICS). Thus the environment and in his body. Thus when the
H-theorem provides a molecular interpretation time comes for a recurrence, entropy will de-
of the second law of thermodynamics or, in crease but subjective time will flow in the
particular, the principle that the entropy of an opposite direction; thus the law "entropy in-
isolated system must always increase or remain creases with time" is a tautology! This idea of
constant. Irreversible processes are those in alternating time-directions in cosmic history was
which entropy increases. The entropy itself can further explored by H. Reichenbach (The
be regarded as a measure of the degree of ran- Direction of Time, Berkeley, 1956) and has
domness or disorder of the gas, although it must been proposed again in recent theories of the
be recognized that disorder really means just expanding (and contracting) universe.
our own lack of knowledge about the details of Some other aspects of this problem and its
molecular configurations. The equilibrium state connection with atomic randomness are dis-
represents the maximum possible disorder; the cussed in the article on IRREVERSIBILITY.
H-theorem implies that a gas which is initially in Recent Developments Since World War II
a nonequilibrium (partly ordered) state will there has been a revival of interest in the "clas-
eventually reach equilibrium and then stay there sical" kinetic theory of gases, based on the
forever if it is not disturbed. assumptions of Clausius, Maxwell and Boltz-
If the long-term consequences of the H- mann and ignoring quantum effects except
theorem were applicable to all matter in the insofar as these may determine the intermole-
universe, one might expect that the universe cular force law. In part this interest is due to
would even tually "run down"; although the applications involving high-speed aerodynamics
total energy might always remain the same, no and plasma physics, in part to renewed attempts
useful work could be done with this energy to construct reliable theories of liquids as well
because all matter would be at the same tem- as dense gases. New methods for obtaining
perature (see THERMODYNAMICS). This final accurate solutions of the Boltzmann equation
state has been called the "heat death" of the have been developed by H. Grad, C. L. Pekeris,
universe. E. Ikenberry and C. Truesdell, and many others.
The contradiction between the H-theorem These solutions have been used to describe the
KINETIC THEORY 638

behavior of gases in many circumstances more had been based) was established almost simul-
complicated than those treated in the nineteenth taneously by J. R. Dorfman and E. G. D. Cohen,
century _(including the interactions of charged J. Weinstock, and R. Goldman and E. Frieman.
particles and magnetic fields). Problems such as The result has been a flurry of activity in kinetic
the propagation and dispersion of sound waves theory, in which many of the intuitively plau-
have also been treated by G. E. Uhlenbeck and sible ideas about "relaxation" of initial states
his collaborators. to steady nonequilibrium states, and destruction
In 1946, three general formulations of kinetic of correlations by intermolecular collisions,
theory were published, by M. Born and H. S. have been revised.
Green, by J. G. Kirkwood, and by N. N. Bogoli- In spite of the impressive success of kinetic
ubov. In each case the goal was to derive a theory in solving numerous problems of both
generalized Boltzmann equation in a form that practical and theoretical interest, many scien-
would be valid when simultaneous interactions tists are not satisfied that the theory of non-
among more than two molecules have to be equilibrium processes rests on a solid founda-
taken into account, and thence to obtain solu- tion comparable to that of equilibrium statistical
tions of the equation from which transport mechanics. There is no single fundamental
properties of dense gases and liquids could be postulate analogous to the Gibbs canonical
calculated. In each formulation certain approxi- distribution from which everything can be
mations had to be made in order to obtain deduced. Perhaps as a result of this gap in
practical results; because of the difficulty in fundamental principles, there remains con-
estimating the error involved in these approxi- siderable disagreement about how to formulate
mations, and the great compleXity of the equa- some of the major outstanding problems which
tions involved, there was no clear evidence that involve the connection between the atomic and
the results for properties such as the viscosity the macroscopic levels, such as the nature of
coefficient would be significantly more accurate turbulence.
than those obtained by Enskog from his modi- Other applications of the kinetic theory are
fied kinetic theory for dense gases published in discussed in the articles on AERODYNAMICS,
1922. Eventually, in the early 1960s, attention BOL TZMANN'S DISTRIBUTION LAW, ELECTRI-
was centered on the systematic derivation of CAL CONDUCTIVITY, LIQUID STATE, NUCLEAR
series expansions for the transport coefficients REACTORS and PLASMA. See especially
in ascending powers of the density, together IRREVERSIBILITY.
with attempts to calculate the first few terms
in such series for special molecular models such STEPHEN G. BRUSH
as elastic spheres. In the meantime, an alter-
native and apparently more rigorous method for
deriving theoretical expressions for transport
coefficients, based on the "fluctuation-dissipa- References
tion theorem" introduced in 1928 by H. Ny- For an elementary introduction, see Cowling, T. G.,
quist in electrical engineering problems, was "Molecules in Motion," London, Hutchinson, 1950;
developed by M. S. Green, H. Mori, and R. reprinted by Harper Torchbooks, New York, 1960.
Kubo. This method had the heuristic advantage Diffusion and thermal diffusion: Furry, W. H., "On the
of bringing out clearly the connection between Elementary Explanation of Diffusion Phenomena in
transport theory and the description of fluctu- Gases," Am. J. Physics, 16,63 (1948).
ations in equilibrium statistical mechanics. Later Comprehensive treatment of modern kinetic theory
it was proved that the Green-Mori-Kubo method and its applications: Lifshitz, E. M., and Pitaevskii,
gives results precisely equivalent to those that 1. P., "Physical Kinetics," New York, Pergamon
would be obtained from the Born-Green, Kirk- Press, 1981; Berne, Bruce 1. (Ed.), "Statistical
wood, and Bogoliubov methods, and also those Mechanics, Part B: Time-Dependent Processes,"
of yet another method developed by I. Prigogine, New York, Plenum Press, 1977, especially the
if in each case the calculations are done without chapter by 1. R. Dorfman and H. Van Beijeren.
approximation. Thus,just as in the case of quan- Recent developments and current research problems:
tum mechanics, several alternative approaches Raveche, H. 1. (Ed.), "Perspectives in Statistical
are equally valid in modern kinetic theory. Physics," Amsterdam, North-Holland, 1981, es-
After intensive efforts to calculate terms in pecially the chapters by 1. R. Dorfman and R.
the density expansion of transport coefficients, Zwanzig.
it was finally discovered in 1965 that such a History of kinetic theory: Brush, S. G. (Ed.), "Kinetic
density expansion does not actually exist, for Theory," 3 Vols., New York, Pergamon Press,
mathematical reasons associated with the per- 1965-72; "The Kind of Motion We Call Heat: A
sistence of weak correlations between colliding History of the Kinetic Theory of Gases in the 19th
particles over very long times. The divergence of Century," Amsterdam, North-Holland, 1976; "Sta-
the expansion (and thus the inadequacy of the tistical Physics and the Atomic Theory of Matter,"
approximations on which most earlier theories Princeton, N.1., Princeton Univ. Press, 1983.
L
LAPLACE TRANSFORM and one is left to interpret this result. The key
to this interpretation lies in the three operational
Introduction The Laplace transform /(s) of a relations

f
function f(t) is defined to be the integral

/(s) = 1
00

o
e-stf(t)dt
and
D =-!!...
dt'
D-I = t dt,

if the integral exists. It will certainly exist if


=L =L
00 00
f(t) is itself integrable between zero and an (1- D)-l Dk D-k-l.
arbitrary upper limit, and if, for a large eno~g~
real value of k e- kt f(t) ~ 0 as t ~ 00. Then If It k=o k=O
exists for one 'complex value of s it exists also In these, D-I is an indefinite integral and
for all complex s of greater real part, and so the hence one of the major faults was that D and
integral "transforms" the function f(t) into a D-l were not commutative, as D-ID - DD-I
function 1(s) defined on a half-plane of the would in general be a constant. Heaviside does
complex Argand plane. lo ,6 not seem to have been clear which interpreta-
Operational Methods The Laplace transform tion of (I - D)-l to take, preferring the first
is perhaps the best known and most useful ,?f a but recognizing the difficulties of convergence
number of integral transforms, whose apphca- implied. The method also fails to take account
tion is that province of applied mathematics of the initial values under which the equation
usually called operational methods. is to be solved.
The essential idea is to exploit the analogy Now it follows by successive integrations by
between certain differential and algebraic oper- parts that if dky/dt k has a Laplace transform
ators. Historically, the idea goes back to Leib- it is
nitz, and has been employed by a number of
mathematicians including Laplace, Lagrange k-I
and Riemann. 2 It was treated extensively by sky(s) - L Yk-j-lS j
Boole and Oliver Heaviside developed it to the j=o
point' where he could apply it, particularly to
circuit theory in electricity. In his D-notation, where ji is the Laplace transform of y, and
given the ordinary differential equation with Yr = limt--+o dry/dt r . Thus the Laplace transform
constant coefficients has the algebraic property of the D-operator,
but the initial values are now included in the
dny dn-ly dy operation. If this is then applied to. the differ-
ao - + a l - - + . "+an-l- ential equation above, the term to be mterpreted
dtn dtn -l dt becomes
+ anY =F(t), .y(s) = {F(s) + Q(s)}/P(s),
where F(s) is the Laplace transform of F(t),
n and Q(t) is a polynomial of degree n - I with
we introduce the operator P(D) =L akDn-k coefficients depending on the constants Yr.
k=O It can easily be verified that the Laplace trans-
and so write the equation as forms of tne at , eat cos bt and eat sin bt are re-
spectively n!(s - a)-I-n, (s - a)/{(s - a)2 +b 2}
P(D)y =F(t). and b/{(s - a)2 + b 2 }. If, then, the ~orcing
function F(t) is a finite sum of terms whlch are
Then it is formally solved as products of powers of t, expone.ntials, and
I trigonometric functi<?,ns, ~he equa~lOn can. be
Y - -F(t) solved by resolving y (s) mto 'pa~tl~1 fractlOns
- P(D) , and interpreting separately the mdlVldual terms.

639
LAPLACE TRANSFORM 640

The Inversion Integral The justification for (21T)1/2 P(p)G (p) is the Fourier transform of
the procedure above involves an assumption
that the relation between a function and its
Laplace transform is unique. This would follow [ : F(u)G(t - u)du.
from a corresponding result for the Fourier
transform, as we will show that the relation-
ship for the Laplace transform can be deduced If we interpret this as a theorem for the Laplace
from the Fourier Integral Theorem. In fact, transform we obtain the result that if /(s) and
functions which differ on a set of points of zero g(s) are respectively the Laplace transforms of
measure will have the same Fourier or Laplace f(t) and g(t), then /(s)g(s) is the Laplace trans-
transform; but in most practical problems such form of the function
an equivalence class of functions would not be
distinguished in any case, so that effectively
there is a uniqueness theorem. The Fourier inte- i t f(u)g(t - u)du.
gral theorem establishes, for a restricted class of
functions, which vanish sufficiently strongly for
large values of t, a unique reciprocal relation This theorem has an immediate application to
betwexn a function F(t) and its Fourier trans- the solution of the differential equation we
form F(p), have introduced above, for we can assert that if
Y(t) is the function whose Laplace transform is
l/P(s), thenF(s)/P(s) is the Laplace transform of
(21T)1/2 P(p) = [ : eip t F(t)dt,

J
i t F(u)Y(t - u)du.
X+ic
(21T)1/2F(t) = lim e-iPtP(p)dp,
X-HO -X+ic Now pes) is a polynomial in s, and so we can
write
where c is chosen so that the integral is
meaningful. P(s)=ao n
m

k=!
(s- (Xkf"R
If we introduce the unit function H(t), which
is one for positive t, and zero for negative t, a
function often associated with the name of where the (Xk are in general complex, and
m
Heaviside, and we also write p = is, F(t) =
(21T)1/2 f(t)H(t), then P (is) is the Laplace trans-
L: Nk = n. We can then resolve l/P(s) into
k=l
form /(s). We therefore obtain the uniqueness partial fractions

E
theorem we desire, and in addition obtain an
inversion formula for the Laplace transform,
often known as the Bromwich integral t Akj .

21Tif(t)H(t) = lim
jC+iX

X-HO c-iX
e stl(s)ds,
k= 1 j= 1 (s - (Xk)l

(see Refs. 5 and 6) and so


m Nk Ak'
where c must be greater than the real part of Y(t)= L L ~tj-1eCJ.kt.
any singUlarity of /(s). k =1 j= 1 (J 1) .
We may use the method of residues to evaluate
the inversion integral for a wide class of trans- This is equivalent to solving the differential
forms,6 and so with its use we can relax the re- eqnation by the method of variation of
striction we had to place on the forcing function parameters.
above. The convolution theorem is also of great
An alternative form of the inversion formula assistance in the solution of integral equations
can be given byll with difference kernels, i.e., with kernels of the
form K(t, u) = K(t - u). In particular we see
that the Volterra equation
f(t) = lim -(-1)n
- (n)n+l
- _f(n)-
(n)
n->oo n! t t
f(t) = get) + 'X i t K(t - u)f(u)du
where 1(n)(s) is the nth derivative of /(s). o
The Convolution Theorem We may also show leads directly to
that, for a suitably restricted class of functions,
the Fourier transform has a convolution or /(s) = g(s)/ {I - 'XK(s)}.
Faltung theoremAassociate~ with it. This takes
the form that if F(p) and G(p) are respectively The corresponding Fredholm integral equa-
the Fourier transforms of F(t) and G (t), then tion with a different kernel may also be solved
641 LAPLACE TRANSFORM

using a transform. In this case, however, another useful relation, which is that if a is the singular-
integral equation is obtained which may be ity of !(s) of greatest real part, and near it !(s)
solved by the Wiener-Hopf technique. 6 has an expansion
Other Properties of the Laplace Transform
It can be proved quite easily 6 that \Yithin the !(s) = (s - a)-V-l L An (s - a)n,
half-plane in which it converges, f(s) is an n=o
analytic function of s, and so has derivatives of then for large positive values of t

1
all orders with respect to s. It then follows that
k
d f(s)
00
f(t) ~ tVeat L An .
- - = (-l)k
ds k 0
e-sttkf(t)dt.
n=o rev + 1 - n)t n

Partial Differential Equations and Other


We may apply this relation to ordinary differ- Transforms Possibly the most useful applica-
ential equations whose coefficients are poly- tion is to the solution of initial value problems
nomials in t. If we combine this formula with for some of the linear partial differential equa-
the relation for the transform of a derivative tions of mathematical physics. This is exploited
we have used before, the equation to the full in Ref. I and to a lesser extent in
n mk .dn-ky Refs. 5 and 6. In a number of these equations
"~ L ak,·tJ--=F(t)
dt n - k
time derivatives appear with constant coeffi-
cients, so the use of the transform effectively
k=O j=o
reduces by one the number of independent
is transformed into variables. In particular, if there is only one other
independent variable than the time, it produces
Ln L
mk {d
(-l)jakj J
j
(sn-ky)
an ordinary differential equation. The method
has great flexibility since the asymptotic
k=o j=o ds properties can often be used to extract in-
formation which conventional solutions in eigen-
L ily. n-k-i-l si-i } =F(s).
- n-k-l value expansions conceal. It can be shown that
if the Laplace transform of f(t)H(t) is /(s),
i=j (i - j)!
then the Laplace transform of f(t - a)H(t - a)
Hence an ordinary linear differential equation is e-saf(s). In problems involving wave propaga-
of order n whose coefficients are polynomials tion this can be exploited to analyze the solu-
in t of maximum degree m, becomes an tion into the components arising from multiple
ordinary linear differential equation for the reflections.
transform which is of order m and whose co- It is clear from the way that the initial condi-
efficients are polynomials of degree n. This is tions appear in the Laplace transform of a
exactly the same situation as arises if we attempt derivative, that it can be used primarily for the
to solve the original equation by a contour solution of initial value problems. and hence
integral of Laplace type.? for partial differential equations of hyperbolic
In fact the Laplace transform is only a special or parabolic type only. In treating the solution
case of this more general technique, which is in of equations of elliptic type on infinite domains,
consequence rather more useful. in which the boundary conditions are given at
One advantage does, however, remain for the both ends of the range, we require other integral
use of the Laplace transform in this context. transforms, the conditions at infinity being
It has asymptotic properties which can allow usually of a form which will ensure the con-
the deduction of certain properties of a func- vergence of the defining integral. The question of
tion from its transform without explicitly in- which transform to use is usually decided by the
verting it. It is an immediate deduction from equation, the coordinate system, and the form in
Watson's lemma? that if, for small t, which the boundary conditions are given. It is
convenient to illustrate some of the most com-
f(t) = tV L an tn , v> -1, monly used transforms by reference to the
Laplace equation 1j2¢ = O.
n=O In Cartesian coordinates this has the form
then for Is I large a 2¢ a 2¢ a 2¢
-+-+-=0.
ax 2 ay2 az 2
!(s) ~ s-l-V L r(n + V + l)ans- n .
n=O If the region in which a solution is sought ex-
tends over (-00, 00) in one coordinate, say x,
To deduce an expression for f(t) for small t and the boundary conditions are that ¢ ~ 0
from the transform, a converse theorem is re- as I x I ~ 00, we would use a Fourier transform,
quired and the conditions under which this is whose inversion formula and convolution inte-
valid may be found in Ref. 4. The same text gral have already been given. Like the Laplace
establishes conditions for the validity of another transform, an asymptotic expansion of the
LAPLACE TRANSFORM 642

transform is equivalent to the expansion of the


function near the origin. The expansion of the [(s) = ] 00 Rs-! [(R)dR;
transform near the singularity of least negative o
imaginary part gives the asymptotic expan~i?n
of the function near x =00, and of least posltlve I jC+iX _
imaginary part near x = -00. [(R) = lim -. R-S[(s)ds.
X-+oo 2m c-iX
If the range in x is only semi-infinite, we may
use either a Fourier cosine transform, with the This can be derived from the Fourier trans-
reciprocal relations form by substituting R = eX, w!lic1!. also. gives
the convolution theorem that [(s)g(s) IS the
2)1/2 [00
i(s) = (-; Jo [(x) cos sx dx;
transform of

2)1/2]00 {OO [(p)g (%) ~.


[(x) = (-; 0 [(s) cos sx dx
To apply this transform to the equation it. is
necessary, but not sufficient, that R.s~ vam~h
or the Fourier sine transform with the reciprocal at infinity and at R = o. Thes~ condlt.lOns ~lll
relations also dictate what choice of c m the mverslOn
integral will b~ needed for conv~rgence. The
j(s)= (-;2)1/2]0 00 [(x)sinsxdx; singularity of [(s) whose real part IS less t.han c
but nearest to it will dominate the behaVIOr of
[(R) for small R, and the singularity with real
part greater than c but closest to it will domi-
2)1/2 (00_
[(x) = (-: Jo [(s) sin sx dx.
nate the behavior of [(R) for large R.
If the Laplace equation is expressed in terms
of cylindrical polar coordinates (r, z) it has e,
Both transforms require the boundary condi- solutions of the form <l>v(r, z)e ivO , and to de-
tion </>(x) ~ 0 as x ~ 00, but the choice b.et~een termine the form of <l>v we may use a Hankel
transform of order v, defined by

L
them is decided by whether </> or a</>/ax IS g.lven
on x = 0, for if </>(x) ~ 0 as x ~ 00, and the mte- oo
grals exist, [v(s) = r[(r)Jv(sr)dr,

i
o
OO a2</>
{
o
-
ax2
cossx dx
[(r) = 00 siv (s)Jv(sr)dr
=_(a¢) x=o -S2 rOO </>(x) cossx dx,
Jo where Jv(sr) is the Bessel function of first kind
ax and of order v ~ o.
For integer order this can be derived from
while applying simultaneous Fourier transforms to

L
the x and y variables and then converting to
polar coordinates by x = r cos e, y = r sin e. For
OO a2</>
-sinsx dx noninteger order it is better to derive the result

i
o ax2
directly using a modified version of the me~h.od
oo needed for the Fourier theorem and explOltmg
=S</>X=0-S2 </>(x)sinsxdx. the analogy between the Bessel and circular
functions. There is no convenient convolution
theorem, though convolution theorems can be
Convolution theorems and the asymptotic deduced. Once again the behavior of [(r) for
properties can be deduced by noting th~t if </> is
continued for x < 0 as an even functIOn, the small r is deducible from the values of iv(s) for
cosine transform is the Fourier transform of large s, and the properties of [(r) for large
the continued function , and if </> is continued as values of r follow essentially from the singular-
an odd function we relate the sine transform to ity of iv(s) nearest the real axis. Any of these
the Fourier. transforms can be applied simultaneously with
In spherical polar coordinates the Lapl.ace other ones.
All these transforms and also the Hilbert
equation has solutions like RVS v wh~re SV IS a

f_0000 f_0000
transform
spherical harmonic of degree v. This suggests
that to solve problems in conical regions of in- - P [(t)dt I [(s)ds
finite extent we use the Mellin transform and [(s)=- --, [(t)=-P --
its inverse 1r t- s 1r S - t
643 LASER

where P f is a Cauchy principal value, are


8. Titchmarsh, E. C., "Introduction to the Theory
of Fourier Integrals," London, Clarendon Press,
1948.
treated rigorously in Ref. 8. 9. Van der Pol, B., and Bremmer, H., "Operational
An alternative to the Fourier transform, pro- Calculus Based on the Two-sided Laplace Trans-
vided f(t) vanishes strongly enough at t = too, form," London, Cambridge University Press, 1955.

f:
is the so-called two-sided Laplace transform 10. Widder, D. V., "The Laplace Transform," Prince-
ton, Princeton University Press, 1941.
11. Widder, D. V., and Hirschman, I. I., "The Con-
e-stf(t)dt, volution Transform," Princeton, Princeton Uni-
versity Press, 1955.
12. Zemanian, A. H., "Generalized In tegral Trans-
which is really a complex Fourier transform. formations," New York, Interscience, 1968.
An extended account of this and its application 13. Treves, F., "Basic Linear Partial Differential Equa-
can be found in Ref. 9. Very many other inte- tions," New York, Academic Press, 1975.
gral transforms have appeared in the literature, 14. Schechter, M., "Modern Methods in Partial Dif-
and tables of them with some theory can be ferential Equations," New York, McGraw-Hill,
found in Ref. 3. A more general transform 1977.
which embraces a number of those above is 15. Davies, B., "Integral Transforms and Their Appli-
described in Ref. II. cations," Berlin and New York, Springer-Verlag,
More Recent Work The introduction of the 1977.
concept of generalized functions has considera-
bly increased the class of functions whose trans- Cross-references: CALCULUS OF PHYSICS, FOU-
forms can be utilized. Thus, for example, the RIER ANALYSIS, MATHEMATICAL PRINCIPLES
Fourier transform of t k exists as a generalized OF QUANTUM MECHANICS.
o
function and is (-i)k(21T)1/20(k)(s), where (k)(s)
is the kth distribution derivative of the Dirac
delta function. In particular the growth condi-
tion for a generalized Laplace transform is re- LASER
laxed to become one that is bounded above by
a polynomial in 1 s I. "Laser" is an acronym for l(ight) a(mplification
A treatment of a number of these generalized by) s(timulated) e(mission of) r(adiation). This
integral transforms can be found in Ref. 12. device is identical in theory of operation to the
A number of authors have made use of the MASER except that it operates at frequencies in
transforms of generalized functions, and most the optical region of the electromagnetic spec-
notably the Fourier transform, in the rapid trum, rather than in the microwave. Laser oper-
development in the theory of Linear Partial ation has been demonstrated at wavelengths
Differential Equations which has taken place from 1500 to over 1 000000 A or from 0.15 to
in the last twenty years. 13 , 14 100J,Lm. By common usage, these devices are all
There is also quite a lot of interest in devel- called lasers, although more descriptive termi-
oping both new integral transforms, such as are nology utilizes ultraviolet maser, optical maser,
published periodically in mathematical journals, infrared maser, etc. Although the original micro-
and a general operational calculus not limited wave maser offers an extremely stable frequency
specifically to any particular transform. IS source, its main use is as an amplifier with ex-
tremely low noise output. In contrast, the
M. G. SMITH main significance of the laser is its ability
to produce a single frequency at high in-
References tensity in the optical region, a feat heretofore
1. Carslaw, H. S., and Jaeger, J. C., "Conduction of impossible at these frequencies. Not only may
Heat in Solids," Oxford, Clarendon Press, 1959. the output be a single monochromatic wave,
2. Davis, H. T., "The Theory of Linear Operators," but the wave may be coherent, or in phase,
Bloomington, The Principia Press, 1936. over the whole surface of the radiator. In
3. Ditkin, V. A., and Prudnikov, A. P., "Integral this mode of operation, the laser is actually an
Transforms and Operational Calculus," Oxford, oscillator whose output depends upon the selec-
Pergamon Press, 1965. tive amplification of one of the single frequency
4. Doetsch, G., "Theorie und Anwendung der modes of the resonant cavity containing the
Laplace-transformation," Berlin, Springer, 1937. active laser medium.
5. Jeffreys, H., and Jeffreys, B. S., "Methods of Following the development of the microwave
Mathematical Physics," London, Cambridge Uni- maser, Schawlow and Townes in 1958 proposed
versity Press, 1956. that optical maser action could be obtained by
6. Smith, M. G., "Laplace Transform Theory," placing an active medium in an optical cavity.
London, Van Nostrand Reinhold, 1966. The medium would be a gas or solid which was
7. Spain, B., and Smith, M. G., "Functions of Mathe- excited electrically or by light in such a manner
matical Physics," London, Van Nostrand Rein- that any optical wave present would be ampli-
hold, 1970. fied as it moved through the material. The cavity
LASER 644

was proposed to be a Fabry-Perot resonator-two cause more upward or absorbing transitions and
plane, parallel reflecting plates with a small trans- the net effect would be absorption.
mission through which the radiation might es- Solid-State Lasers The first optical maser
cape. Upon excitation of the material, light will was demonstrated by Maiman of Hughes Re-
be emitted with a band of frequencies deter- search Laboratories in 1960 using ruby, which
mined by the particular material. In addition, is single-crystal aluminum oxide "doped" with
the direction of emission will be nominally ran- chromium impurities. By applying semitrans-
dom. In the presence of the cavity, some of the parent reflective coatings on the ends of a rod
waves will escape after several back and forth about 2 inches long, he made the cavity and the
reflections from the parallel plates, "walking crystal an integral unit. Then, exposure to an
off" the edge of the reflectors. Those waves intense exciting light from a xenon flashtube
which travel normal to the walls will remain in was found to invert the population between the
the cavity and be amplified provided they rein- red-emitting level and the ground or lowest-
force each other after each round-trip reflection energy state of the electrons. The result was a
at the two surfaces. This reinforcement or reso- burst of intense red light emanating in a beam
nance is only satisfied if the spacing of the plates through the end reflectors. This was the first
is an integral multiple of one-half the wave- and is still one of the most powerful lasers.
length in the medium. Thu~, after a short time, Advances in the art since that time have re-
only that frequency which satisfies the resonant sulted in energies per pulse of the order of
condition and those waves traveling normal to 1000 joules or watt-seconds. Peak powers
the reflector will build up to an appreciable in- are as high as 500 000 kW in short pulses of the
tensity. The resultant light which is partially order of 10-8 sec. Because of "off-axis" modes
transmitted through one of the reflectors will and multiple resonances, the output is not a
thus be a single frequency or several discrete fre- single-frequency, single plane-wave mode, but
quencies if there is more than one cavity reso- generally consists of the order of 100 separate
nance within the band of frequencies emitted by modes. The beam is still quite narrow, being
the laser material. In addition, the wave front the order of I milliradian or 0.05 degrees. As a
will be in phase across the surface of the reflec- comparison with conventional light sources, the
tor since waves striking the surface at normal in- energy radiated from 1 cm 2 of the brightest
cidence are amplified most strongly. The resul- flash lamp is less than 10 kW and is distributed
tant beam will then be diffraction limited, i.e., over the entire visible spectrum. In addition, the
the beam will spread by an angle in radians radiation is incoherent and is spread out uni-
given approximately by the ratio of the wave- formly in all angles from the source. Thus, the
length to the diameter of the beam. In actual directivity and spectral purity of the laser source
practice, single-mode operation is obtained only are many orders of magnitude superior to that
under special conditions. Generally, several fre- of an incandescent source. The ruby laser suf-
quency modes are present due to the multiple fers from a low efficiency, about 1 per cent, and
resonances of the cavity and numerous "off- except with elaborate cooling systems, only op-
axis" modes are found which correspond to erates on a pulsed basis. Other crystalline or
resonant waves which travel at small angles from glass systems with impurity ions have been de-
the normal to reflectors. These waves "walk off' veloped, which yield wavelengths from the ul-
so slowly that they still are amplified apprecia- traviolet to approximately 3J..Lm wavelength in
bly. Refinements of the simple cavity proposed the infrared. Some, such as neodymium-doped
by Schawlow and Townes consist of concave yttrium aluminum garnet (Y AG), operate in a
reflectors which decrease the diffraction losses, continuous mode at the one-watt level while
or several parallel reflectors which limit the os- peak powers have reached values as high as 10 14
cillation to a frequency common to each pair in W in pulses of the order of 10- 12 sec. These
the set. ultrahigh powers are obtained in neodymium-
The key to successful laser operation is of doped glass systems using several stages of am-
course the active medium which amplifies the plification and novel pulse-forming techniques.
wave. Qualitatively, a material which fluoresces Gas Lasers Historically, the next develop-
or exhibits luminescence is an obvious candidate. ment came in 1961 when J avan, Bennett and
In fluorescence, electrons are excited to an up- Herriott demonstrated laser action in a gaseous
per-energy state by short-wavelength light such discharge of helium and neon. Again, the
as ultraviolet, while luminescence is produced parallel-plate reflector cavity was used but this
by passing an electron current through the me- time with a spacing of several feet. Later, con-
dium, such as in a gaseous discharge. In either cave mirrors were used to decrease the loss of
process, stimulated emission can occur only if energy out the sides of the cavity. This device
more electrons are produced in the upper-energy operates continuously and delivers power at
state than in the lower or terminal state for the levels up to one watt. Pulsing the gas discharge
radiating transition. In this case, an incident yields peak powers as high as 100 W. The first
photon will stimulate further transitions and laser radiated at 1.15 J..Lm in the infrared, while
amplification will result. If the final state were further development with different gases has
more heavily populated, then the photon would yielded outputs from the ultraviolet to 1000 J..Lm
645 LASER

or I mm in the far infrared. In contrast to the coatings. The first such device used gallium
ruby laser, the gaseous laser beam may be dif- arsenide and radiated at 8400)\ or just beyond
fraction limited and the frequency is pure, i.e., the visible region in the infrared. This laser was
oscillation may be limited to one mode. By developed by groups at General Electric, Inter-
careful design, the frequency may be stabilized national Business Machines, and Lincoln Labo-
to a few thousand cycles per second or approxi- ratory in 1962. The efficiency is high, about
mately one part in 10 13 • Although the original 40 per cent, and the power source is low-voltage
gas laser utilized electrical excitation of elec- direct current. One shortcoming is the require-
tronic transitions, later versions use vibrational ment of liquid nitrogen cooling (77 K) to main-
transitions in molecules such as carbon dioxide, tain power output and efficiency. Powers as high
and the excitation mechanism may be electrical, as 3 W continuous have been produced. The cav-
chemical, or thermal. In the chemical laser, ity in this case is extremely small, the reflector
atomic species such as hydrogen and fluorine spacing being less than a millimeter. As a result,
can be reacted to produce molecules in an it is fairly easy to limit the oscillation to one
excited vibrational state which in turn yields frequency mode although small irregularities in
amplification or oscillation. Recent electrically the junction prevent coherence over the full
excited lasers, particularly those using carbon width of the narrow radiating junction strip.
dioxide at 10 /lm, have been operated at at- The compactness and efficiency of the semi-
mospheric pressure using spark discharges or conductor laser make it particularly attractive
pre-ionization by voltages in the I OO-k V range. for systems use. Wavelengths as long as 30/lm
The high pressure and the powerful electrical and as short as 6300)\ have been generated us-
excitation result in peak powers in the 10 to ing different semiconductors such as indium
100-MW region. For continuous laser operation, arsenide, indium phosphide, indium antimonide,
the gas may be circulated rapidly to avoid ex- or alloys such as gallium arsenide-phosphide. In
cessive heating, and using an electrical discharge, addition, these lasers may be tuned over several
powers from I to 10 kW have been obtained. percent of their nominal frequency of operation
An entirely new excitation process, essentially by varying the current flow through the device.
thermal, was announced by Gerry in 1970. In The tuning results from the variation in tem-
this, the gas dynamic laser, an appropriate fuel perature with current which in turn changes the
is burned to produce carbon dioxide and nitro- index of refraction and the resultant resonant
gen at high temperature and pressure. When frequency of the cavity. Since the line width of
released through a nozzle into the optical the radiation is only about 1 MHz the tunable
resonator region, the gas cools rapidly in terms semiconductor laser is an excellent tool for high
of its kinetic or translational energy, but the resolution spectroscopy.
population of the vibrational energy levels of Liquid Lasers Laser action may also be ob-
the carbon dioxide molecules becomes inverted tained in liquids using either a flash tube or
since the lower level of the laser transition another laser as the pump. Early versions used
relaxes much more rapidly. In addition, the rare earths in an organic liquid, while more
vibrationally excited nitrogen molecules are in recently organic dyes have been found to be
near resonance with the upper laser state of the more efficient but require a separate laser for
carbon dioxide and transfer energy with high the exciting radiation. The dye laser has the
efficiency to maintain the inversion. This type special attraction that one laser may be tuned
of laser has produced continuous powers as over a significant fraction of the visible spectrum
high as 60 kW. by using a reflection grating as one of the cavity
Semiconductor Lasers The third main type mirrors. Another type of liquid laser utilizes a
of laser utilizes a solid material, in this case a different principle than those above, depending
semiconductor. Here the electron current upon stimulated Raman scattering. Raman laser
flowing across a junction between p- and n-type action was discovered by Woodbury in 1962
material produces extra electrons in the con- using a ruby laser and nitrobenzene. Here the
duction band. These radiate upon making a laser excites the nitrobenzene, which in turn
transition back to the valence band or lower- shows amplification at a frequency displaced
energy states. If the junction current is large from the ruby line by the vibrational frequency
enough, there will be more electrons near the of the molecule. There is no true inverted popu-
edge of the conduction band than there are lation in this case. The incident photon is scat-
at the edge of the valence band and a popula- tered by the molecule which absorbs an amount
tion inversion may occur. To utilize this effect, of energy determined by its vibrational energy.
the semiconductor crystal is polished with two The molecule is left in an excited state and the
parallel faces perpendicular to the junction scattered photon is frequency shifted by the
plane. The amplified waves may then propagate energy loss. This process may be stimulated,
along the plane of the junction and are reflected since the rate at which the scattered photons
back and forth at the surfaces. The gain in the are produced is proportional to the number of
material is high enough so that the reflection at photons already present in the cavity at the
the semiconductor-air interface is sufficient to scattering wavelength. As in the normal stimu-
produce oscillation without special reflective lated emission case, the frequency and phase of
LASER 646

the output wave are identical with the wave length, oscillation can occur in any mode as
which stimulates the scattering. The Raman long as it is within the natural emission line-
laser normally operates using the Stokes line, width of the laser transition. Many lasers oscil-
or the wavelength corresponding to the loss late in only one mode, since that with the
of one vibrational quantum. Other modes of highest gain takes over from any modes away
operation utilize the second or third Stokes from the line center. Now, if the cavity is
lines corresponding to double or triple vibra- modulated internally at a frequency equal
tional absorptions. Similarly, higher-order ef- to the mode spacing, all modes within the
fects in the medium may produce a series of natural linewidth become coherently coupled
anti-Stokes lines which correspond to vibra- and the result is a train of pulses at the modula-
tional energy being added to the initial energy tion frequency with a pulse width roughly the
of the photons from the driving laser. The inverse of the natural linewidth. An alternative
wavelength range of Raman lasers using differ- way of looking at the process is to assume ran-
ent liquids is from the visible to the near dom noise pulses propagating back and forth
infrared. between the cavity mirrors. Since the round-
Q-Switching The high instantaneous powers trip time is the same as the modulation period,
quoted for ruby are obtained by using the a noise pulse which passes through the modula-
"Q-switched" mode of laser operation. This tor at its maximum transmission will receive the
technique, due to Hellwarth and McClung, most net gain in a round trip. Although the dif-
uses a cavity resonator whose reflectivity or ferential gain among differentially phased pulses
"Q" may be controlled externally. The laser, is small, the cumulative effect after many round
usually ruby, is first excited by the flash lamp trips singles out the in-phase pulse train. The
while the cavity is in a state of low reflectivity width of the pulse is determined by the ampli-
and thus low feedback. As a result, the inverted fier bandwidth, which is the naturallinewidth as
population reaches an extreme value before mentioned above. After demonstration of this
oscillation occurs. At the peak of inversion, technique of pulse generation, it was discovered
the reflectivity is "switched on," and the that a laser could be mode-locked by a saturable
resultant high reflectivity produces an intense filter in the cavity, i.e., a material whose trans-
burst of energy which almost completely de- mission loss decreased with increasing light in-
populates the high-energy states in a time tensity. In this case, a random pulse increases
of the order of 10-8 sec. The switching is the transmission in the filter on each passage and
accomplished either by a Kerr electro-optic produces its own transmission modulation. This
shutter in the cavity or by rotating one of the so-called self-mode locking was soon discovered
mirrors so that it is lined up parallel with the to exist in some lasers without the addition of a
opposite reflector at the optimum time during special saturable filter material. The measure-
the flash lamp pulse. ment and discovery of these effects depends
Mode-Locked Pulses An alternative method upon newly developed techniques of measuring
for generating extremely short pulses utilizes short pulse lengths. In particular, the pulse train
the technique of mode-locking. Since a laser may be passed into a reflecting cell containing a
resonator has frequency modes equally spaced material whose fluorescence is proportional to
at a separation of c12L, where L is the cavity the square of the light intensity. Upon reflection

TABLE 1. LASERS CLASSIFIED BY TYPE, WAVELENGTH, AND PUMPING SOURCE.

Wavelength
Type Range Example Pump Comments
Solid (Insulating) 0.17-3.9 Iill1 Alz03:Cr optical highest pulse powers
Glass:Nd
Gas 0.15 /lm-l mm He-Ne electrical; highest continuous
Argon gas dynamic; powers
HF laser*
COz
Semiconductor 0.6-30/lm GaAs electrical electrically tunable over
PbSnTe narrow bandwidth
Liquid 0.2-1.3 /lm organic dye optical; wide tuning range;
laser picosecond pulses
Free electron 0.6/lm-3 mm e high-energy requires relativistic
accelerator electron beam

'Wavelengths from 50 to 500 /lm usually require CO 2 laser pumping of an organic molecular gas.
647 LASER FUSION

back through the fluorescent material, the light energy for economic electricity generation. The
output increases where one returning pulse idea was first reported in 1972.
passes an incoming pulse. In this manner, the To produce a significant reaction rate, a tem-
physical width of the light pattern measures the perature of -10 8 K is required and the cor-
pulse length. Using a laser material with a broad responding pressure causes a rapid expansion of
emission band has yielded pulse trains with the deuterium-tritium fuel. The aim is to com-
widths of the order of 10- 12 sec or a picosec- plete the nuclear fusion burn before the expan-
ond. This corresponds to physical lengths of the sion can cool and extinguish the reaction. Since
order of one millimeter. inertia limits the expansion velocity the ap-
Other Sources of Coherent Light In addition proach is sometimes called inertial confinement,
to the lasers described above and summarized in contrast to magnetic confinement (see the
in Table 1, there are several other techniques article on FUSION POWER). Although other
for generating coherent infrared and optical heating methods for inertial confinement (e.g.,
radiation. Stimulated Raman scattering, the electron, light-ion, or heavy-ion beam bom-
basis of the original liquid Raman laser, is now bardment) are being considered, laser heating
used to generate frequencies throughout the is attracting most attention because focusing
visible and infrared region. Harmonic genera- is easier, there is probably less undesirable
tion, utilizing the nonlinear index of refraction core preheat, and it is possibly more economic.
of solids, liquids, and gases at high intensities, The burn condition for a sphere of fuel of
has produced radiation out to the extreme radius r, is that the number of particles burnt
ultraviolet at wavelengths below 0.1 11m. Simi- before significant expansion must be approxi-
larly, these same nonlinear materials may be mately equal to the total number of particles,
used for parametric oscillators, which are i.e.,
tunable devices producing two output frequen-
cies whose sum is equal to the driving laser n2<av>t~n.
frequency. An excellent review of laser prin-
ciples and a description of these latter systems where t is the burn time (~expansion timescale
may be found in Ref. 1. The last entry in Table r/v), v is the ion thermal velocity, and a is the
1, the free electron laser, is included for com- fusion cross section. It happens that <av>/v is
pleteness, although the radiation generation approximately independent of v in the tem-
process is describable by relativistic electron perature range important for fusion, so the
dynamics and does not necessarily invoke the condition is equivalent to nr ~ constant or,
stimulated emission mechanism. The radiation since n is proportional to the final density
in this case arises from the interaction of an p, pr ~ constant. Numerically pr> 30 kg/m 2
electron beam near the velocity of light with a is required, determined by the need to keep the
spatially periodic magnetic field. A description dimensions greater than the 4He particle range
of this device and a detailed review of the com- so that the 4He energy can make up for heat
plete laser field may be found in Ref. 2. radiated and thereby maintain the burn. The
fusion energy is produced as a miniature ther-
R. H. KINGSTON monuclear explosion. It is estimated that the
maximum energy of each explosion that can
References be handled is -IS kg TNT, equivalent to
-10 8 J, and about 30 explosions per second
1. Yariv, A., "Quantum Electronics," 2nd Ed., New would be required for a I-GW(e) power station.
York, John Wiley & Sons, Inc., 1975. The laser energy required to heat the fuel would
2. Weber, M. 1., (Ed.), "Handbook of Laser Science be -10 6 J. Each explosion would burn about
and Technology," Vols. I and II, Boca Raton, CRC 3 X 10- 7 Kg of D/T mixture, corresponding to
Press, Inc., 1982. a small pellet of only -10-3 m radius. The burn
time is -5 X 10- 10 s, and the pressure in the
Cross-references: COHERENCE, LIGHT, MASER, OP- burning fuel would be _10 12 atmospheres. The
TICAL PUMPING, RAMAN EFFECT AND RAMAN burn condition and the fuel radius correspond to
SPECTROSCOPY. p ~ 3 X lOs kg/m 3 andacompressionof-1500
times the density of liquid D/T mixture. The
feasibility of laser fusion depends on developing
LASER FUSION a way of getting this high compression.
The object of laser fusion research is to use laser The compression is produced by laser radia-
radiation to heat small pellets of deuterium/ tion, which exerts pressure on the pellet by
tritium mixture to produce the nuclear fusion heating and evaporating the surface material.
reaction In moving away, the ablated material carries
momentum and, by Newton's Third Law, a
D+T=4He+n+3X 10 14 J/Kg. corresponding pressure is exerted on the re-
maining material (the ablation pressure is, under
The reaction is exothermic with about one-fifth the required conditions, much greater than the
of the energy in the 4He particles. Eventually ponderomotive pressure of laser photons).
it is hoped to find ways of using the excess However the simple application of an arbitrary
LASER FUSION 648

pressure pulse of laser radiation to a sphere of pressure reduces the final compression. The
fuel will not produce the required temperature preheating is wavelength dependent and is a
and pressure. In plane geometry, conventional severe limit to target design.
strong shock theory gives a compression of Another limit to the compression may be
4 because of preheating by the shock, and Rayleigh-Taylor instabilities which occur when
spherical convergence only increases the com- the low density ablated material exerts accelera-
pression to ~30 because of the outward-going tion pressure on the heavy shell. These become
shock reflected at the fuel center. To obtain more restrictive with increasing ratio of insta-
high compression, shocks must be avoided so bility wavelength (A.R) to shell thickness: when
that the compression is effectively isentropic A.R < 5 the ripples cause some turbulent heat-
and the temperature kept low until after the ing of the shell; when A.R ~ 5 the shell is
compression has been achieved. One method strongly turbulently mixed and preheated; and
that has been proposed is continuously to when A.R > 5 the instability disturbs the implo-
increase the driving pressure while the implosion sion symmetry. Although Rayleigh-Taylor
proceeds to that the shock and piston do not instabilities cause heating and are a fairly severe
part, thereby avoiding shock preheating and the limit on the symmetry of the implosion, it is
final compression limit due to the reflected possible that the problem may be reduced by
shock from the center. This is not easy to damping due to viscosity and thermal conduc-
achieve because of two factors: the high power tion around the shell. Such damping occurs
required at the end of the implosion and the when the distance between the absorption and
difficulty in getting the correct pressure varia- ablation fronts exceed about one-half the
tion in the short time available. These problems initial fuel radius.
may be reduced by using a hollow shell target. Present experiments test laser fusion prin-
A pressure p will accelerate a fuel shell of thick- ciples using glass microballoons of ~1O-4 m in
ness 5 to a final velocity (2rp/p s 5)1/2, where diameter and ~1O-6 m wall thickness, contain-
Ps is the initial density, so the final pressure ing D/T gas at a pressure of ~l 00 atmospheres.
produced at the centre is a function of r/5. The laser pulse energy is a few kJ and power
This is further increased by convergence. For ~1013 W, produced in a numberofbeams(~IO)
example, calculations show that with r/5 = 10 arranged to exert pressure from all directions.
the ratio of the final pressure to the driving In contrast to the ideal laser compression de-
pressure is ~600. The sensitivity to the pressure scribed above, the energy is delivered in a time
variation can be reduced by using a heavy short compared to the collapse time: so the
"pusher" shell of another material which may thin, dense glass wall suddenly vaporizes and
be accelerated by a pressure more uniform in exerts pressure on the D/T gas; this is called
time. A further increase in energy may be ob- an exploding pusher. A disadvantage of this
tained by surrounding the pusher with another method of operation is that considerable elec-
massive shell with an intervening vacuum space. tron preheat can develop, but an advantage is
The massive shell is accelerated by the driving that the preheat tends to inhibit Rayleigh-Taylor
pressure and collides with the pusher/fuel shells instabilities. The absorption of the laser light
which, by conservation of momentum (allowing has been found to be complicated, involving
for shocks in the shell thickness) move away such processes as inverse bremsstrahlung, stimu-
with velocity y2 times the massive shell lated Brillouin scattering, parametric instabili-
velocity. The use of massive shells to tailor the ties, stimulated Raman scattering, and nonlinear
implosion velocity would reduce the overall density profile steepening. Compressions of a
efficiency because of the energy required to few times 100 are obtained and temperatures
move the pusher/massive shells. Another im- up to ~107 K. A fraction of up to 10-8 of the
portant factor is the cost of complicated shell fuel is burnt, giving a neutron yield of ~101l
pellets. per pellet.
The absorption of the laser radiation occurs In conceptual laser fusion reactors, a reaction
at a critical radius where the radiation fre- chamber of ~10 m diameter is envisaged. A
quency equals the natural frequency of the heat exc.hanger/tritium breeder blanket sur-
electrons, called the electron plasma frequency, rounds the chamber in a way similar to magnetic
at a density 10 15 /A. 2 m- 3 , where A. is the radia- confinement systems. The heat from the blanket
tion wavelength in meters. About 20% of the would be used to generate electricity, some of
incident radiation is absorbed. Heat is trans- which is used to drive the laser. The overall
ported from the absorption to the ablation efficiency (the laser efficiency times the elec-
radius by electron thermal conduction; at high trical generation efficiency) must exceed the
power the heat transport is restricted by the inverse of the energy gain in the fusion reaction
finite flow velocity of free streaming electrons, (~103) for net energy production. Since the
reducing the pressure available as a function of generating efficiency is ~20%, the laser effi-
power. This limits the overall efficiency. Also, ciency must >~0.5%, including coupling to the
the faster electrons may have sufficient mean plasma. Short laser wavelengths favor good
free path to penetrate the ablation radius to plasma coupling and reduce preheat, but the
preheat fuel at smaller radii, so the resultant lasers are inefficient. A A. = I J.1m Nd glass laser
649 LA TTICE DEFECTS

has an efficiency of only 0.1 %, whereas a A = remarkably, so that the understanding of the
10 !lm CO 2 laser has an efficiency of 1%. behavior of lattice defects is very important in
Attempts are being made to increase the cou- modern solid-state physics.
pling efficiency of Nd glass lasers by using short Lattice defects are classified in to three classes
wavelength harmonics. However, glass lasers from the point of view of the dimensional ex-
are probably unsuitable for the continuous high tension. These are the point defect, the line
repetition rates required in a reactor, and so defect, and the plane defect. Vacancies (atomic
short wavelength gas lasers (e.g., using KrF) or ionic) and interstitials (atoms or ions) are the
are being developed. most elementary point defects. A dislocation is
A proposal to increase the overall reactor a line defect. Twin boundaries and extended
efficiency is to use a liquid lithium wall in the dislocations are some examples of plane defects.
reacting chamber, which will absorb the fusion Dislocations Dislocations are lattice defects
energy pulse by evaporation and will be con- quite unique in that their movement produces
tinuously replaced: this may give an order of the plastic deformation of crystals, and that
magnitude increase in the power that can be mechanical properties of crystals are largely
handled. Among technical problems to be over- determined by the behavior of dislocations in
come in obtaining power from laser fusion are crystals. The presence of dislocations, however,
launching the pellets, laser beam or pellet steer- affects also various physical properties of crystals
ing, pumping gaseous debris so that it does not other than plastic and mechanical properties.
scatter the laser beam, production of cheap Figure 1 shows an edge dislocation in a simple
multilayer pellets «~7¢/pellet, 1982 value) cubic crystal, the dislocation line being perpen-
and disposal of ~ 100 metric tons/year of dicular to the plane of the paper. The same
radioactive pellet debris. atomic arrangement is repeated in the direction
It is hoped to demonstrate scientific break- of the dislocation line. The edge dislocation is
even in the mid-1980s by using multibeam formed at the edge A of the inserted extra atomic
lasers of energy ~ lOs J. half-plane AB. The edge dislocation A can also
Support for inertial confinement research be formed by a partial slip of the crystal along
comes from its application in the areas of fusion a slip plane CAD by a shear stress shown by
power generation, the basic physics of matter arrows in Fig. 1. The part CA of the slip plane
at extreme densities and pressures. Indeed, con- has already slipped, but the part AD has not
fidence in the application to produce power yet slipped. Therefore, the dislocation line A,
derives from the knowledge that ignition may which penetrates the plane of the paper perpen-
be achieved by compressional heating in ther- dicularly at A, is defined as the boundary be-
monuclear weapons. The main countries in- tween the slipped and unslipped parts of the
terested in inertial confinement research are slip plane. When a dislocation sweeps the whole
USA, USSR , Japan, UK, and France. In the last slip plane from C to D, the upper and lower
10 years the US has spent ~ $1 0 billion on laser half-crystals are sheared along the slip plane by
fusion research; in 1982 the spend was ~$200 an amount and in a direction shown in Fig. I
million, which was about 50% of the spend on by a vector b, which is called the Burgers vector.
the magnetic confinement approach to fusion The slip produced by the movement of a dis-
power. However the US laser fusion spend is location is the elementary process of plastic
expected to decrease to ~ $1 00 million in 1983, deformation of crystals.
corresponding to about 25% of the spend on The Burgers vector of an edge dislocation is
magnetic confinement. perpendicular to the line of dislocation as can
be seen from Fig. I. This is not the case in dis-
J. A. REYNOLDS locations other than the edge dislocation. In the
case of screw dislocation the Burgers vector is
References parallel to the line of dislocation. Atomic planes
Dolan, T. J., "Fusion Research," New York, Pergamon perpendicular to a screw dislocation form a con-
Press, 1982. tinuous helicoid around the screw dislocation
(AD) as shown in Fig. 2. In the case of a mixed
Cross-references: FUSION POWER, LASERS, NU- dislocation, which consists of the edge com-
CLEAR REACTIONS, PLASMAS. ponent and the screw component, the Burgers
vector and the dislocation line make an angle
between 0 0 and 90 0 •
LATTICE DEFECTS The stress and the strain around a dislocation
can be accurately calculated by the theory of
The crystal consists of a regular array of atoms elasticity except in the region called the dis-
(or molecules) (see CRYSTALLOGRAPHY). But location core which is within a few atomic dis-
the perfect crystal, in which all atoms are on tances of the center of a dislocation (broken
precisely defined lattice points, is nonexistent. circle in Fig. I), a~ the crystal regions outside
Real crystals contain more or less lattice defects the core can be treated as an elastic continuum,
or lattice imperfections. Even a small amount of because the strain there is sufficiently small.
lattice defects can affect properties of crystals The following are some of the important results
LATTICE DEFECTS 650

) The stress, the strain, and the energy in a dis-


location core can be computed if the discrete
B structure of the crystal is considered. The core

,,0
0000000 ~nergy of a dislocation in copper, for example,
is of the order of 1 eV per dislocation length of
0 0, ...9_ . .0 0 0 an atomic spacing. This is considerably smaller
than the elastic strain energy outside the core.
Another important property of dislocations
~0,60QOO which can be understood by considering the
C -------T"--A--~-----D
o 0 0 \0 "
0/0 .-
0 discrete and periodic structure of the crystal is
the so-called Peierls potential. When an edge
o 0 0 6--6 0 0 dislocation is located, e.g., at position A in
Fig. 1, the atomic arrangement around the dis-
location has a mirror symmetry so that the
0000000 energy of the dislocation is at a minimum. When
the dislocation has moved a distance b the
( atomic arrangement around the disloc~tion
resumes the same configuration as before and
FIG. 1. An edge dislocation in a simple cubic crystal. the energy is again at a minimum. Between
A: center of a dislocation; AB: extra half-plane; these two positions with minimum energy the
CAD: slip plane; b: Burgers vector. atomic configuration of the dislocation is not
necessarily mirror symmetrical and the disloca-
tion energy increases. Thus the potential energy
of the elasticity theory of dislocations. The of the dislocation changes periodically with the
p~riod b, when the dislocation moves along the
stress around a dislocation decreases inversely
proportionally to the distance from the dis- slip plane. This potential barrier is called the
location axis. The total strain energy integrated Peierls potential. The Peierls potential is t}",e
over the entire crystal outside the core is pro- intrinsic resistance to the movement of dis-
portional to the square of the magnitude of the locations in crystals. I t is relatively small in most
Burgers vector, and is of the order of magnitude metals, and is relatively large in such crystals as
of Jlb 2 for a unit length of a dislocation where diamond, refractory oxides, and so on.
Jl is the shear modulus and b is the magnitude of Incidentally, there are many extrinsic origins
the Burgers vector. Two or more dislocations of resistance to the movement of dislocations
interact with one another through the stress i.e., impurity atoms, point defects, plane defects'
field. Two parallel screw dislocations attract other dislocations, etc. These impurities and
each other if their helicoidal windings are in defects interact with the moving dislocations
opposite directions (i.e., a left-handed screw concerned, and act as obstacles to the movement
and a right-handed screw), and repel each other of these dislocations. A larger resistance to the
if their windings are in the same direction. Two movement of dislocations means a larger resis-
parallel edge dislocations with parallel Burgers tance to the plastic deformation and accordingly
vectors attract or repel in a somewhat com- a higher mechanical strength of crystals.
plicated way (for details, see any textbook of Properties of dislocations discussed so far are
gene!~ ones and are not restricted to any
dislocation theory, e.g., Refs. 1,2, and 3). speCific crystal. But there are some properties
which directly reflect specific crystal structures
(e.g., extended dislocations) or speoific physical
characteristics of crystals (e.g., a space charge
around a dislocation in semiconductors).
Now in discussing an extended dislocation, it
would be more convenient to refer to a specific
crystal structure, in which a dislocation is known
to extend. The face-centered cubic (fcc) lattice
gives a good example. In fcc metals a total
dislocation, whose Burgers vector is a lattice
vector (a/2){110}, dissociates into two partial
dislocations,.. whose Burgers vectors are of the
type (a/6hI12} and are smaller in magnitude
than a lattice vector. An example of dissociative
reaction is:
(a/2)[ 1011 -+- (a/6)[ 1121 + (a/ 6)[ 211 J,

FIG. 2. A screw dislocation AD in a simple cubic where a is the lattice constant. The passage of
crystal. the first partial dislocation (a/6)[ 1121 along a
651 LA TrICE DEFECTS

(111) slip plane in a perfect fcc lattice leaves The relaxation of atoms around a single
behind it a fault of stacking of (111) atomic vacancy in most metals has a simple symmetry
layers, which is a plane defect called a stacking which is related to the symmetry of the crystal
fault. By the passage of the second partial dis- structure. But the atomic configuration around
location (aj6)[2II] along the same slip plane, a single vacancy in covalent semiconductors is
the faulted plane resumes its original perfect somewhat distorted because of the quantum-
lattice. Thus between the two partial disloca- mechanical Jahn-Teller distortion. The atomic
tion lines there is a strip of stacking fault. This configuration of a single interstitial is rather
dislocation configuration as a whole is called complicated, because different configurations
an extended dislocation. The width of the ex- for an interstitial are possible in some cases, and
tended dislocation is determined by the balance because the configuration often shows a lower
of two forces, namely the repulsive force be- symmetry than the host crystal. In cubic metals,
tween two partial dislocations and the con- for example, the so-called split interstitial or
tractive force of the stacking fault. In copper dumbbell-form interstitial is the most stable
and copper alloys, for example, the width is one because of the lowest formation energy
fairly large and is from 10 to 100 ft., while in (for details, see any textbook of point defects,
aluminum, which has a large stacking-fault e.g., Refs. 1 and 4).
energy, the total dislocation practically does In ionic crystals one or more of the following
not extend. The extension of dislocations point defects are considered according to re-
greatly affects the plastic and mechanical be- spective circumstances, i.e., vacancies of positive
haviors of crystals. ions and of negative ions, and interstitials of pos-
In elemental semiconductors with the diamond itive ions and of negative ions. Some of them
lattice, an edge dislocation has unpaired valence form complexes with impurity ions. Point de-
bonds or dangling bonds along the edge of the fects in alkali halides and similar ionic crystals
extra half-plane. In n-type germanium, for ex- have been extensively investigated in connection
ample, a dangling bond, which acts as an ac- with their optical properties, and many color
ceptor, traps an electron, so that an edge dis- centers are well analyzed in terms of point de-
location line is negatively charged. This negative fects (see COLOR CENTER; for details, see, e.g.,
line charge is shielded by a surrounding cylin- Ref. 5). In alkali haiides, for example, the F cen-
drical positive space charge. This electrical ter is a negative ion vacancy with a trapped elec-
structure of dislocations affects the electrical tron, the M center is a pair of negative ion vacan-
and other properties of semiconductors. cies with two trapped electrons, and the H center
Dislocations are introduced into crystals by is an interstitial negative ion with a considerably
plastic deformation, and by precipitation of distorted configuration, having captured a posi-
point defects such as vacancies or interstitials. tive hole.
They are also introduced during crystal growth The migration of point defects results in the
to relax various internal stresses. transport of materials in crystals, i.e., self-
Point Defects Atomic or ionic vacancies and diffusion, mutual diffusion, ionic conduction,
interstitial atoms or ions are the most elementary etc. In most metals and covalent semiconduc-
point defects. Di-vacancies, tri-vacancies, and tors the self-diffusion and the substitutional
higher clusters of vacancies, and di-interstitials, impurity diffusion occur by means of the migra-
tri-interstitials, and higher clusters of interstitials tion of vacancies. The diffusion and the ionic
can exist. Complexes of these point defects conduction in ionic crystals take place by means
with impurity atoms are formed according to of the migration of ionic vacancies and/or ionic
circumstances. in terstitials.
The most fundamental physical properties of Point defects are produced in crystals in
point defects are the formation energy, the thermal equilibrium, their concentration being
migration energy, the binding energy, and the approximately equal to exp(-EtlkT) at the ab-
atomic configuration around the point defect. solute temperature T, if the formation energy
These properties are widely different in differ- is Ef. Here k is Boltzmann's constant. Point
ent crystal structures and in different types of defects are also introduced into crystals in
crystalline solids. In most metals and elemental excess of thermal equilibrium by irradiation of
semiconductors the formation energy of a single energetic radiation or by plastic deformation.
vacancy E fV is smaller than the formation Excess point defects are annihilated by anneal-
energy of a single interstitial E fI, while the ing, the kinetics of which is well investigated
migration energy of a single vacancy Em V is under various conditions (Ref. 4).
larger than the migration energy of a single
interstitial EmI. Typical examples of energy RYUKITI R. HASIGUTI
values are: EfV ~ I eV and EmV ~ 1 eV for References
copper; EmV ~ 0.2 eV for p-type silicon and
EmV ~ 0.3 eV for n-type silicon; the binding 1. Hasiguti, R. R., "Crystal Lattice Defects," in "Solid
energies between two single vacancies to form State Physics," Ed. Kubo, R., and Nagamiya, T.,
a di-vacancy are a few tenths of an eV in noble New York, McGraw-Hill Book Co., Inc., 1969 (p.
metals. 719).
LA TTICE DEFECTS 652

2. Friedel, J., "Dislocations," Oxford, Pergamon Press, Some lenses are made with cylindrical and
1964. toric surfaces. Spectacle lenses often have sur-
3. Nabarro, F. R. N. (Ed.), "Dislocations in Solids," faces of this type. It is practical to use aspherics
in five volumes, Amsterdam, North-Holland Pub- in spectacle lenses because the beam of light
lishing Company, 1979 and 1980. entering a person's eye is small in diameter. The
4. Leibfried, G., and Breuer, N., "Point Defects in performance requirements are therefore not
Metals I," Berlin, Springer-Verlag, 1978; Dede- great. Cylindrical or toric surfaces are seldom
riohs, P.H., Schroeder, K., and Zeller, R., "Point used in telescopes, or microscopes of high
Defects in Metals, II," Berlin, Springer-Verlag, performance.
1980. There are many types of glass used in optical
5. Schulman, J. H., and Compton, W. D., "Color lenses. Some glasses are more dispersive (see
Centers in Solids," Oxford, Pergamon Press, 1962. REFRACTION) than others. By combining posi-
tive and negative lenses of different glass it is
Cross-references: COLOR CENTERS, CRYSTALLI- possible to correct for chromatic aberrations.
ZATION, CRYSTALLOGRAPHY, METALLURGY, Theory of the Lens Most of the performance
SEMICONDUCTOR, SOLID·STATE PHYSICS, of a lens or lens system may be understood by
SOLID-STATE THEORY. considering that light travels as rays in straight
lines until it encounters a change of index of
refraction. The light is then refracted according
LENS to Snell's law (see REFRACTION). Light is
emitted from a point source of light in the
A lens is any element that focuses light to form object as a diverging beam of rays. A lens is
images. Many lenses are found in nature. Ice able to collect these rays and refocus them to
crystals, waves on the surface of water, and all an image point (see Fig. 1).
the eyes of humans and animals are examples With analytical geometry, one may derive
of lenses. These lenses have one or more curved equations for calculating the path of any ray as
surfaces and are made of a transparent material. it passes through the optical system. The pro-
Manufactured lenses are usually made out of cedure is called ray tracing. The mathematical
glass, plastic or crystal material. The simplest equations used to trace rays are long and com-
lens consists of two ground and polished spheri- plicated, and have to be computed with many
cal surfaces. A line connecting the centers of the significant figures. Prior to. the use of m~dern
two spheres is called the optical axis of the lens. digital computers, the deSIgn and analYSIS of
The lens is edged to form a cylindrical surface lens systems was a long tedious job. An average
centered on the optical axis. The spherical sur- lens design required many months of calcu-
faces may be convex or concave, resulting in lation. Today most of these calculations are
lenses which are positive refracting or negative done on large computers, and few people need
refracting. A positive lens collects the light from to be concerned about being able to ray trace.
a distant object and focuses it to a real image. Paraxial Rays Paraxial rays pass through the
The negative lens disperses the light and causes center portion of the lens and the assumption
it to diverge from a virtual image. Positive and is made that the object points are close to the
negative elements are used in combinations to optical axis. The ray-tracing equations for
form optical lens systems. The optical axes of paraxial rays are simple, and by using the
each of the lens elements usually coincide to paraxial approximation, many useful theorems
form centered optical systems. Most optical for lens performance may be worked out.
systems are designed to be centered optical In the paraxial region, any optical system
systems, but in manufacture the centering is may be described by locating six cardinal points
seldom perfect, so the system will have various along the optical axis. Once these cardinal
degrees of defective performance. points are known, the position and size of the
Spherical surfaces are usually used in optical image of any object may bel computed from
systems because of stringent requirements on the following formulas (see Fig. 2).
the manufacture of optical elements. In order
to perform properly, a given surface in an ['
optical system often has to coincide with the Yk Z
m =-=--=-- (1)
prescribed surface to within a few millionths of Yo f z
an inch. Such extreme tolerances can be
achieved on spherical surfaces because spheres , , f [' 1
may be ground and polished with self-correcting zz = ff and - + -; = (2)
techniques. s s
A few lenses have been made using non-
spherical surfaces, but they usually have rota- fIno = ['Ink (3)
tional symmetry around the optical axis. These
surfaces are called rotationally symmetric P1N 1 =P2 N 2 (4)
aspheric surfaces. Aspheric surfaces of this type
are difficult to generate so they are used
infrequently. P 1P2 =N1N 2 (5)
653 LENS

Image Point

- - - - - Optical Axis

Object Point
FIG. 1. A diagram showing how a lens collects diverging rays and focuses them at an image point.

(6) Aperture Stop. The aperture stop in a lens


system is a diaphragm which determines the
(7) NA of the lens.
Lens Aberrations Lens designers attempt to
PI and P 2 are called the first and second prin- combine elements and glass types to reduce the
cipal points. NI and N2 are called the first and lens aberrations. All points in the object should
second nodal points. F 1 and F 2 are the first be imaged as points in the image and should be
and second focal points. f and f' are the front located at or very near the position predicted
and back focal lengths. no and nk are the by the paraxial rays. In lenses of large relative
indices of refraction in the object and image aperture and field of view, there are usually
space. several residual aberrations that designers are
The following terms are commonly used in unable to eliminate. There are the following
connection with lenses: types of aberrations:
Field of View. The field of view usually refers (l) Spherical
to the half angle subtended by the object as (2) Coma
seen from the first principal point PI. For (3) Astigmatic
example, it would be tan-I(yo/S) in Fig. 2. (4) Field curvature
When specified for a lens, it usually refers to (5) Distortion
the maximum size of object which may be (6) Axial chromatic aberrations
imaged by the lens. Optical designers tend to (7) Lateral chromatic aberrations.
describe the field of view by its half angle, as These aberrations are corrected by using com-
shown in Fig. 2. Marketing firms often refer to binations of positive and negative elements.
the full field which is twice the half angle. If There are two general principles one may use
not clearly stated, confusion over this term may as guide lines in correcting optical systems. (I)
result. A closely spaced positive and negative lens with
Relative Aperture. This refers to the half the aperture stop in contact may be corrected
angle of the cone of rays converging to the axial for spherical aberration, coma, axial and lateral
image point. The sine of this angle is often chromatic aberration and distortion. (2) It is
called the numerical aperture and is written NA. necessary to use positive and negative lenses
If NA is large, the lens collects a large cone of with appreciable air space between them to
light and focuses it on the image. Another way correct field curvature and astigmatism.
to describe this NA is to use the term f-number. There are many conflicting requirements in
The f-number of a lens and the NA of a lens are lens systems. Lenses of large relative aperture
related by the following equation usually are designed to cover small fields of
view. A large field of view normally dictates a
0.5 small relative aperture. Wide fields and large
f-number=- aperture may be realized by using many ele-
NA ments or by compromising some of the image
quality. For example, lenses of 140 0 total field
working at f/2 are available, but they are com-
plex and have large amounts of distortion.
Periscopes allow one to look through a long
pipe and see a wide field at high magnification,
but there is always some residual chromatic
aberration and field curvature left in the design.
Lenses are usually designed for specific ap-
plications, and the designer has made a careful
f - - - - - S'------I
balance between conflicting requirements. It is
seldom that a lens designed for one application
FIG. 2. Diagram showing the location of the six will perform optically in another. In the past
cardinal points in a lens system. there was a tendency to misuse -lenses, because
LENS 654

it was difficult to design a new system. Today tives have a wide variety of focal lengths and
it is easy to obtain a new design. It is, however, diameters. They are usually corrected precisely
expensive to build a prototype of a new lens for spherical aberration, coma and axial chro-
design. This is because the optical shop has to matic aberration. Since telescope objectives
make the lenses without adequate tooling. It cover small fields of view, astigmatism and field
takes a skilled optician to make a prototYPI! curvature usually are not corrected. Telescope
lens accurately enough to perform as designed. lenses of large diameter (20 inches or more)
When production runs are large, modern tool- become afflicted with chromatic aberration
ing can ease the burden on the skilled optician which cannot be corrected, so many of the
and the cost per lens can be drastically reduced. large telescopes used by astronomers are mir-
Lens Testing A designer specifies a set of rors instead of lenses.
curvatures, thicknesses and optical glasses for a Telescope objectives are used in binoculars,
nominal design. The work shop makes the lens opera glasses, surveying instruments, gunsights,
to these specifications within certain tolerances. and many laboratory instruments.
The final lens must then be tested to make sure Periscopes. Periscopes are used to enable one
that the over-all performance is as expected. It to look through a long tube. The submarine
is rare indeed that a lens performs exactly as periscope is one well-known example, but there
computed. It is then necessary to determine if are many other types used in industrial and
the difference is negligible and, if not, what to medical instruments. Gastroscopes and cysto-
do about it. First, one tests the complete as- scopes are examples of periscopes used in med-
sembled lens and attempts to predict the per- ical instruments. Periscopes are made up of a
formance. By studying the defective image, it train of telescope objectives and eyepieces.
may be possible to locate the sources of error. Camera Objectives. By far the largest class of
The tests consist of studying the light distribu- optical lens systems would be classified as
tion in the image of a point source. This is done camera lenses. They are used to record images
on a lens bench or testing interferometer. Some- on films as in common landscape cameras, but
times it is possible to locate the source of error today they are also used with many other types
by testing the over-all system, but usually the of image recording systems such as television
lens system is disassembled and all the lens image tubes, electrostatic plates, etc. The dis-
surfaces, spaces and centering are checked tinguishing features of camera lenses are wide
separately. field and large aperture. Usually the image is
The invention of the laser (1960), the aston- located on a flat image plane. Camera lenses
ishing progress in computing equipment (1950- range in complexity from a single meniscus lens
1980), the development of small detectors, to systems with more than ten elements. Camera
and the associated transistor technology have lenses cover such a wide range of uses that one
brought about major changes in lens testing. could claim all lenses to be a form of camera
Interferometers are used more extensively, lens. For example, a long focal length lens used
tolerances are held better in the optical shop on a 35-mm camera may actually be very
and mounting techniques have improved. The similar to a telescope objective.
trend now is to use better quality control at Zoom Lenses. Zoom lenses are in common
every step of the manufacturing process. The use on cameras and a wide variety of instru-
objective is to manufacture and mount the ments. A zoom lens has a variable focal length,
lenses to achieve a high yield of acceptable which is achieved by changing the spaces be-
lenses. The disassembly of lenses as described tween elements. The lenses are positioned ac-
above is seldom done beyond the prototype curately with precision cams, which have
stage. In large production runs the rejected become practical to manufacture since the
lenses are few and seldom are they repaired. introduction of numerically controlled ma-
Lens Types There are several optical systems chines. Zoom lenses first appeared on 8 mm
which may be classified as types. There is con- and 16 mm movie and TV cameras. They were
siderable overlap between the types, but there originally too large for the popular 35 mm still
is some value in the following classification. cameras. The latest designs and manufacturing
Microscope Objectives. Microscope objectives methods have improved so much that the sizes
are used to magnify small objects. They are are much reduced and 35 mm zoom lenses are
usually used with a microscope eyepiece. Micro- now popular.
scope objectives range in focal lengths from 2 to Photolithographic Lenses. Some of the most
48 mm and are used at magnifications ranging precise lenses made are used in the making of
from 1000 to 5 X. The high-power objectives modern integrated circuits. These lenses are
are made up of many small elements. Some of used to project images of precision masks onto
the lenses are only a few millimeters in diam- silicon wafers. The devices on the silicon wafers
eter. The lens making and mounting procedures have dimensions as small as one micrometer.
for such small lenses are quite different from The lenses have to form these minute images
larger elements. over large areas. Some of these lenses are capable
Telescope Objectives. Telescope objectives are of imaging several million independent '.,nage
used to view distant objects. Telescope objec- spots in a single exposure.
655 LIGHT

Satellite Lenses. Since the development of condenser lenses are common in condensers.
earth satellites, camera lenses have been made With some of the modern high-intensity light
to photograph the cloud patterns covering the sources, it is necessary to correct for the image
earth. These lenses are short focal length and errors in order to obtain uniform illumination.
wide field. Lenses of long focal length have
been able to photograph lines less than 6 inches ROBERT E. HOPKINS
wide on the ground from a satellite in orbit
200 miles from the earth. Many spectacular References
pictures have been taken of planets and stars
from satellite cameras. Hardy, A. C., and Perrin, F. H., "Principles of Optics,"
Laser Scanning Lenses. The wide variety of New York, McGraw-Hili Book Co ., 1932.
gas and solid state diode lasers along with Greenleaf, Allen, "Photographic Optics," New York,
modern modulating systems has lead to many The Macmillan Co., 1950.
applications using laser scanners. The extremely Conrady, A. E., "Applied Optics," London, Oxford
intense small diameter beams from lasers may University Press, 1929.
be reflected from oscillating mirrors to provide
Cross-references: ABERRATIONS; MICROSCOPE; OP-
a rapidly scanning line of light similar to that
TICAL INSTRUMENTS; OPTICS, GEOMETRICAL;
generated in a cathode ray oscilloscope. The
REFRACTION.
advantage is that the material to be scanned
does not have to be in a vacuum. Nearly any
material may be scanned with a laser beam.
Laser scanners can be used to scan printed LIGHT
material and convert the light reflected into
an electrical signal. The electrical signal may Light is a form of electromagnetic energy. It
then be stored in a computer memory, and has a physical character similar to that of radio-
image processed. Finally, it can be read back waves. In order that the human eye may get the
through a similar laser scanner and be printed sensory perception of light, the electromagnetic
out on a light sensitive material of which there waves entering the pupil should have a wave-
are now many kinds. Since these scanning lenses length A between 4000 and 7000A (IA =
work with lasers the light is monochromatic, 10-8 cm). The wavelength of a wave is inversely
making it easier to design high quality lenses. proportional to its frequency v. The product of
Some of the scanning lenses can surpass the the two quantities equals the velocity of prop-
image quality of lithographic lenses because agation. For light in vacuum one has c = VA =
the light used is monochromatic. 3 X 10IOcm/sec. The frequency of light waves
Eyepieces. Eyepieces are quite clearly a dis- is therefore almost a billion times higher, their
tinctive class of optical lens system. Eyepieces wavelength a billion times shorter, than the
are designed to match the sensitivity and phys- waves of standard radio broadcast bands. The
ical requirements of the human eye. For ex· perception of color depends on the distribution
ample, the eye is sensitive in the visual part of of the electromagnetic energy over the visible
the spectrum so eyepieces are designed for this wavelengths. White light is a superposition of
range of wavelengths. An eyepiece must also be waves at many frequencies. It can be decom-
located with its aperture stop in an external posed into its monochromatic spectral com-
position so that the observer's eye may be ponents by a prism or other spectral apparatus.
located within it. This requirement imposes The violet end of the spectrum is near 4000A,
serious limitations on eyepieces, and they are the red end near 7000A. Whereas light in its
seldom useful in any other applications. narrow definition should be confined to this
Magnifiers are essentially eyepieces except relatively narrow portion of the electromagnetic
they are designed to view opaque material while spectrum, it is customary to extend the defini-
an eyepiece is designed to view an aerial image tion to the ultraviolet and infrared portion of
formed by an objective . the spectrum. One sometimes speaks loosely of
Condensers. Condensers are used to collect ultraviolet and infrared "light," although electro-
and focus large amounts of light. They are magnetic waves at these frequencies are not
found in projectors and substages of micro· detected by the eye. The human mind and
scopes: A searchlight mirror is a form of a hands have, however, devised a large variety of
condenser. The numerical aperture of a con· instruments by which such invisible radiation
denser is usually very large, and for many ap- can be detected and measured. Photographic
plications, the image-forming properties are not plates can be made sensitive to x-rays, with a
important. Condensers are often made, there- wavelength shorter than the ultraviolet, or to
fore, with low-quality surfaces. Some condenser the much longer wavelengths of the infrared.
lenses are molded. Condensers are usually Geiger counters can detect electromagnetic
placed close to an intense light source which radiation of very short wavelength (A-rays and
heats and cracks the lenses if made of glass. x-rays). Photoelectric cells are sensitive in the
Condensers are often made out of quartz be- ultra-violet and visible portion of the spectrum .
cause of its ability to withstand heat. Aspheric Photoconductivity can be used to detect infrared
LIGHT 656

radiation. At still longer wavelengths, the micro- These are called the two transverse modes of
waves and radiowaves are detected by diode polarization.
detectors in appropriately arranged microwave All phenomena of geometrical and physical
and radioreceivers. All these types of radiation optics are described consistently by Maxwell's
can also be converted into heat by absorption equations of electromagnetic theory. Optical
in a blackbody, i.e., a material that can absorb phenomena are, therefore, closely related to
radiation at all wavelengths. The radiation can other electric and magnetic phenomena. Around
be felt as heat, if it is absorbed by the human 1900 the prevailing opinion was that the wave
skin. character of light was unambigously established
The study of the human eye as a detector of and the nature of light well understood.
light is the task of physiological optics. The im- There was, however, a mathematical difficulty
pression of light is not necessarily always con- with the intensity of radiation of ultraviolet and
nected with the simultaneous presence of elec- higher frequencies. The photoelectric effect
tromagnetic energy at the retina. We see "stars" could also be interpreted only by considering
from a heavy mechanical blow in the dark. The light to have a quality of particles. The number
impression of light is retained for about 0.1 of electrons emitted from a photosensitive
second after the light source is shut off. This surface is proportional to the intensity of the
fact is made use of in the movies to create the light. The energy of the individual electrons is,
impression of motion by a series of still images. however, determined by the light frequency.
The eye is a detector with a relatively long This led to the postulate of light quanta with
response time. Photoelectric cells can react more energy hv, where h is Planck's constant. This
than a million times faster. Color vision is also duality in nature, in which "wave-like" and
subject to physiological peculiarities which are "particle-like" properties are combined, is de-
quite complex (see COLOR and VISION AND scribed without internal contradiction by quan-
THE EYE). tum mechanics. The combined "particle and
The property of light which is most immedi- wave" character of light is revealed by the com-
ately accessible to observation is its propagation bination of properties of the light sources, the
along straight lines (shadows). If light rays electromagnetic field describing the light waves,
pass from one medium to another, their and the detectors.
direction is changed according to the law of The study of the interaction of light waves
REFRACTION. If the light in medium 1 prop- with matter in the sources and in the detectors
agates with a velocity VI and makes an angle !?l is the subject of SPECTROSCOPY. This is a wide
with the normal to the boundary between field which encompases atomic and molecular
media 1 and 2, the direction !?2 in medium 2, spectroscopy, parts of solid-state physics and
with a velocity of propagation V2 is given by photochemistry. The quantum theory was
Snell's law, sin !?dsin !?2 = VdV2 = n. The con- largely developed on the basis of spectroscopic
stant n is called the relative index of refraction data. A light quantum is emitted by an excited
of medium 2 with respect to medium 1. These atom, molecule or other material system, when
three laws are the basis of geometrical optics. an electron in such a particle makes a transition
This branch of the science of light describes the or "quantum jump" from a state with higher
paths of light rays, the formation of images energy to a state with lower energy. The energy
by mirrors and lenses, the action of tele- difference between these states is equal to the
scopes, microscopes, prisms and other optical quantum energy hv. Similarly, the absorption of
instruments. light quanta is accompanied by an electronic
The wave character of light becomes apparent transition from a state with a lower energy to a
by more refined observations. The phenomena state with an energy higher by an amount hv. In
of diffraction, interference and polarization are this manner, the frequencies of spectral lines
the subjects of physical optics. Diffraction are characteristic for the electronic energy levels
describes how waves are bent around obstacles. in each material. The frequency of the light may
They represent corrections to and deviations be said to correspond to the frequencies of the
from the laws of geometrical optics. These vibrating charges or oscillators, which are repre-
effects become pronounced only when the sented by the electrons.
material has a characteristic dimension com- Light sources are thus bodies with a sizeable
parable to the wavelength of the wave. When population of electrons in excited states. This
light waves reach the same point along different may be accomplished by raising the temperature
paths, the resulting intensity may be smaller of the material. The most important source of
than that produced by each individual wave light is the sun. The moon and other planets are
separately. The relative phases of the waves may visible only because they reflect sunlight, just as
be such that they interfere destructively, when all other objects on the earth which we can see
the arrival of one wave with maximum positive by daylight, but not at night. The sun is a star.
deflection coincides with that of another wave In stars, the temperature is maintained at a very
with maximum negative deflection. Observations high temperature by nuclear reactions.
of light in crystals of calcite (iceland spar) first Man-made light sources range from the primi-
showed that there are two different modes of tive fire, candles, and oil and kerosene lamps to
vibrations for each direction of propagation. the electric light bulb, fluorescent gas-discharge
657 LIGHT

tubes, arcs, etc. In early sources, the material dominates the spontaneous emISSIOn, a laser
particles of smoke or wick were heated by the results. This requires a high concentration of
chemical reaction of oxidation or burning; in the excited atoms and a sufficient feedback
incandescent electric lamps, a wire is heated to mechanism of light by mirrors. In its simplest
a very high temperature by an electric current. form a laser consists of a gas discharge in a
There are so many energy levels in these lumi- tube of suitably chosen dimensions and gas
nous solid materials or gases at high pressures pressure between a set of parallel mirrors. Be-
that the emitted light is white and contains cause the atoms in the laser source all act con-
essentially all frequencies. The higher the tem- structively in step, these sources provide a more
perature, the more radiation is emitted and the efficient means to transmit light energy.
higher the average frequency of radiation. It The high light intensities available in focused
should be realized that most of the energy is laser beams have led to the development of the
emitted as invisible (infrared) radiation, even in branch of nonlinear optics. The optical proper-
the best incandescent lamps. Hot gases in flames ties of materials are different at high intensities,
may also emit sharp spectral lines characteristic because the electronic oscillators are driven so
of the atoms occurring in the flame. The yellow hard that anharmonic properties become evi-
color which arises when kitchen salt is sprinkled dent. A typical effect is the harmonic generation
in a flame is due to the characteristic yellow of light in which red laser light is converted into
spectral line of sodium atoms. ultraviolet light at exactly twice the frequency,
In gas discharge tubes, atoms or molecules are when the high-intensity beam traverses a suit-
excited by collisions with electrons in the ionized able crystal such as quartz. It should be possible
gas. The energy is provided by the generator to duplicate at light frequencies all nonlinear
which provides the voltage necessary to main- effects known from the field of radio com-
tain the discharge current. An arc is a discharge munications, such as modulation, demodulation,
in air or in a high-pressure vapor. Mercury and frequency mixing, etc. It is no longer correct to
sodium dischlrges are used for street lighting. say that the propagation of a light wave is in-
Fluorescent tubes for office and home lighting dependent of the presence of other light waves.
use a gas discharge with a substantial amount of At high intensities, there is a noticeable inter-
ultraviolet components. This ultraviolet light action between light waves of different fre-
excites electrons in fluorescent centers on the quencies.
walls of the tube . The electrons drop immedi- The combination of the laws of quantum
ately from the highly excited state to an inter- mechanics and electromagnetic theory gives a
mediate state with a lower energy. From this consistent description of the generation, propa-
state they finally drop down to the original gation and detection of light. Since these same
ground energy level with the emission of visible laws also describe many other properties of
light. Gas discharges at relatively low pressure matter such as electronic structure, chemical
may serve as spectroscopic sources to study the binding, electricity and magnetism, etc. , it may
emission spectra of atoms, ions and molecules. be said that the nature of light is well under-
From the relationship between the energy levels stood. In this context, it is not necessary and
and the frequency of radiation, it follows that a not even desirable to pose the question, "What
material, when heated, can emit precisely those is it, precisely, that vibrates in a light wave in
frequencies which it absorbs when it is in the vacuum?" The electromagnetic fields acquire
lower energy level at low temperature . meaning only through their relationships with
All these light sources are incoherent in the detectors and sources. Human knowledge or
sense that there is no phase relationship between understanding is here used in the operational
the light waves emitted by the different atoms sense that a relatively simple framework of
in the source. This is quite different from the physical concepts and mathematical relation-
property of the usual sources of electromagnetic ships exists, which gives an accurate description
radiation at lower frequencies. In the oscillator of the wide variety of optical phenomena at
tubes of radio- or microwave transmitters, all present accessible to observation or verification
electrons move and vibrate in step with each in experimental situations. The following refer-
other. The analogy between light and low-fre- ences will introduce the reader to the vast liter-
quency electromagnetic radiation raises the ature of optics and spectroscopy.
question, "Can coherent light sources be con-
structed?" Recently such coherent light sources N.BLOEMBERGEN
have been developed. They are characterized by
the emission of a highly directional, highly References
monochromatic light beam of high intensity.
They are called LASERS because they are based Whittaker, E. T., "A History of the Theories of Aether
on light amplification by stimulated emission of and Electricity," Vols. I and II, London, Nelson &
radiation. In the conventional sources , all light Sons, 1952.
is emitted spon taneously. In lasers, the original Born, M., and Wolf, E., "Principles of Optics," London
spontaneously emitted light forces the other and New York, Pergamon Press, 1959.
excited atoms to emit their radiation in step, "Lasers and Light," in "Readings from Scientific
or coherently. If stimulated emission thus American," San Francisco, Freeman, 1969.
LIGHT 658

Ditchburn, M., "Light," Third Edition, New York, for each scatterer or scattering element. Many
Interscience Publishers, 1976. important cases have been solved, however,
Minnaert, M. G. J., "Light and Color in the Open Air," su bject to certain approximations. I ,2,3 Widest
Ann Arbor, Mich., Dover Publications, 1953. success has been obtained for single scattering,
i.e., for particles sufficiently dispersed that
Cross-references: ELECTROMAGNETIC THEORY; radiation scattered by anyone particle can be
INFRARED RADIATION; INTERFERENCE AND considered to escape from the medium without
INTERFEROMETRY; LASER; OPTICS, GEOMETRI- being further scattered by other particles. Even
CAL; OPTICS, PHYSICAL; PHOTOCONDUCTIVITY; for single scattering, however, exact solutions
PHOTOELECTRICITY; QUANTUM THEORY; RE- of Maxwell's equations have been obtained only
FLECTION; REFRACTION; SPECTROSCOPY; UL- for simple shapes such as spheres (Mie, 1908),
TRAVIOLET RADIATION; VISION AND THE EYE. spheroids, and cylinders. Thus, even for single
scattering, it is often necessary to resort to
approximation methods.
LIGHT SCATTERING If the particles are less than about one-tenth
of the light wavelength and if their index of
When a beam of light falls on a particle, part of refraction is near to that of their surroundings,
this incident beam is diverted from its original only the induced electric dipole radiation is
path; that part which is diverted and not ab- important. (The amplitude of the induced dipole
sorbed is scattered. moment is given by the polarizability of the
Light scattering is a familiar phenomenon. scatterer times the amplitude of the electric
The colors of visible objects (other than light vector of the light wave.) Lord Rayleigh (1871)
sources) are determined by the wavelengths explained Tyndall's principal results in terms
which they scatter most effectively. Scattering of the intensity and polarization of the induced
by small particles was first studied experimen- electric dipole radiation. This type of scattering
tally by Tyndall (1869) in connection with the has since become known as Rayleigh scatter-
blue of the sky. ing. I ,2
Classical physics is appropriate for the de- Rayleigh scattering is of particular impor-
scription of most light scattering phenomena. tance. If the particles are dispersed at random
Thus, light scattering is explained in terms of (molecules of an ideal gas or widely dispersed
the forces exerted by the electromagnetic field macromolecules in an optically homogeneous
on the electronic charges which all matter con- solution), the individual particles may be re-
tains. The oscillating electromagnetic field of garded as independent sources. In this event,
the incident light exerts a periodic force on the total scattered intensity is merely the sum
each electronic charge, causing it to execute of the intensities scattered by the individual
harmonic motion at the frequency of the light particles. The special case of isotropic particles
wave. It is the fact that an oscillating charge and unpolarized light is both simple and illumi-
radiates in all directions (except along the line nating. The Rayleigh formula is
of its motion) which accounts for the scatter- Intensity of scattered light
ing. The intensity of the radiation scattered
from a particle will be large in directions for Intensity of incident light
which the radiation from the individual ele-
ments of the particle interferes constructively, 81T4 Na 2 (l + cos 2 8)
and small in directions in which it interferes A4 r 2
destructively.
For particles comparable in size to the light where N is the number of particles, a is their
wavelength, the amount of energy scattered as polarizability, 8 is the angle of scattering, A is
well as the angular distribution of the intensity the wavelength (in the surrounding medium),
and polarization of the scattered light are and r is the distance from the scattering system
influenced by the distribution of induced oscil- to the point of observation (with r much greater
lating charge within each scatterer. Any correla- than any relevant dimension of the scattering
tion which may exist between the positions of system). Thus, Rayleigh scattering from inde-
the scatterers also affects the extent to which pendent particles is proportional to the number
the radiation interferes constructively or de- of particles. Moreover, if the dielectric constant
structively to make up the resultant scattered of the surroundings is close to that of the scat-
field. Thus, in principle, light scattering pro- terer, the polarizability is insensitive to the
vides a tool for the investigation of surfaces shape of the scatterer. Accordingly, the Rayleigh
as well as of the number, size, structure, and formula has been used to determine Avogadro's
orientation of particles and their interactions. number and, when the total mass of scatterers
The problem of relating the light scattering is known, to determine molecular weights.
to these properties and vice versa has proved For electrically conducting scatterers (e.g.,
too difficult for exact solution in general, metals), the conduction electrons are relatively
because it would be necessary to solve Maxwell's free so the incident electromagnetic wave in-
equations with the proper boundary conditions duces relatively large amplitude oscillations of
659 LIGHT seA TTERING

these electrons. For such particles, magnetic of each other. If the index of refraction of
(as well as electric dipole) radiation may be every element were identical, the solution would
important even for scatterers that are quite be homogeneous and no scattering would result.
small relative to the light wavelength. More- But the index of refraction of an element will
over, if the frequency of the incident light is fluctuate according to the number of molecules
close to the natural frequency of charge oscilla- it contains. The total scattering is found to be
tion (surface plasmon oscillation), particularly proportional to the mean square fluctuation
large amplitude oscillations may be induced. in index of refraction, which is related to the
At such frequencies, the polarizability may be thermodynamic properties of the solution
enhanced by several orders of magnitude, through free energy. For s:rystalline solids and
making the particle an extraordinarily efficient other media in which correlation extends over
and color-selective scatterer. The resonant fre- distances comparable to the wavelength, it is
quency can be controlled, since it depends on necessary to use more specialized techniques.
the shape, the index of refraction, and the In many cases, light scattering is related to
electrical conductivity of the scatterer as well the composition and structure of the medium
as on the index of refraction of the surrounding in a way similar to x-ray scattering. The criterion
medium. For example, gold and silver yield a which must be satisfied is that the electromag-
wide range of striking colors, and their suspen- netic field within the scatterers should be closely
sions were used in the Middle Ages in stained approximated by the unperturbed incident
and other colored glasses. field, just as in the x-ray case. Light scattering
As we consider larger particles which begin under this approximation is widely known as
to violate the criterion that their dimensions Rayleigh-Cans, or Rayleigh-Debye, scattering.
be very small compared with the wavelength, It is applicable if the phase shift for radiation
or the criterion that they are sufficiently dis- passing through a particle is not too different
persed that their positions are not correlated, from the phase shift which would occur for
the Rayleigh formula breaks down. This break- radiation passing through the same distance
down first appears at large scattering angles, in the surrounding medium. When this approxi-
(corresponding to large optical path differences) mation is valid, the angular distribution of the
where the destructive interference is first sig- scattered light is related, as in x-ray scattering,
nificant, and quickly spreads to moderate and to a form factor which describes the structure
small angles. Nevertheless, for scattering angles of the individual scatterers and to the previously
near zero, the Rayleigh formula retains validity, mentioned radial density function which de-
since for zero scattering angle the radiation scribes their spatial correlation.
from all volume elements is essentially in phase In general, each of the above approximate
regardless of particle positon or size (provided methods rests on approximating the dipole
that the index of refraction of the scatter is moment that is induced in the scatterers.
close to that of its surroundings so that all When inadequate, these approximations can
relevant optical path differences are small). be very significantly improved by variational
Thus, the Rayleigh formula, when properly techniques. Variational principles for identi-
applied, is useful over an extremely wide range cal scatterers were described by Levine and
of molecular weights (I 0 2 -1 0 7 ). Schwinger (1948), and have been developed
For scattering from surfaces and dense media and applied also to statistical distributions of
such as liquids in which there is local order over nonidentical scatters by many investigators. 4
dimensions comparable to the light wavelength, To this point, the frequency of the scattered
individual scatterers cannot be treated as inde- light has been regarded as identical to that of
pendent scatterers. Perhaps the most general the incident light. Actually, as predicted by
formulation of the problem is in terms of the Brillouin (1914), frequency displacement and
radial density function, which describes the line broadening occur due to thermal and
correlation between the positions of scatterers, other motion of the scatters (via the DOPPLER
and thus determines the extent to which their EFFECT), and also due to variations in the
radiation interferes constructively and destruc- directions or magnitudes of their polarizability
tively. For scattering by molecules of a solution, tensors (e.g., due to chemical reaction). Highly
however, it is often preferable to relate the light monochromatic laser light and techniques of
scattering directly to thermodynamic properties light-beating (correlation) spectroscopy extend
of the medium. This may be accomplished the use of light scattering to the study of these
through an ingenious approach due to Smolu- kinetic phenomena, e.g., to measurement of
chowski (1908) and Einstein (1910). It takes velocities and properties of particles that influ-
advantage of the fact that for molecules small ence their motion, such as diffusion constants. 3
compared with the wavelength, the scattered The scattered light also has a RAMAN SPEC-
field may be regarded as made up of radiation TRUM of relatively weak lines (or bands) first
from elements of volume small enough that discovered by Raman (1928), originating from
each element may be considered an electric the light analog of the Compton effect, and
dipole source and yet large enough that the explained by quantum theory.
elements can be considered to be independent R.W.HART
LIGHT SCATTERING 660

References for a gas such as methane to -15 times the


1. van deHulst, H. C., "Multiple Light Scattering: minimum for helium.
Tables, Formulas and Applications," Volumes 1 To achieve the minimum thermodynamic work
and 2, New York, Academic Press, 1980. requirement for cooling and liquefying a stream
2. Young, A. T., "Rayleigh Scattering," Physics of gas, an infinite sequence of perfectly efficient
Today 35,42-48 (1982). refrigerators operating at successively lower
3. Degiorgio, V. (Ed.), "Light Scattering in Liquids temperatures ranging from ambient to the boil-
and Macromolecular Solutions," New York, Plenum ing point of the material would be required.
Publishing Co., 1980. Various approximations to this theoretical ideal
4. Feinstein, M. R., and Farrell, R. A., "Trial Func- have been developed .
tions in Variational Approximations to Long- Cascade Process If the critical temperature
Wavelength Scattering," (and references therein) of the gas which is to be liquefied lies well above
J. Opt. Soc. Am. 72,223-231 (1982). the boiling point of some second fluid, whose
critical temperature in turn lies above the boil-
Cross-references: DOPPLER EFFECT; LIGHT; OP- ing point of yet another fluid, and so on to
TICS, PHYSICAL; RAMAN EFFECT AND RAMAN some fluid that is condensable at ambient tem-
SPECTROSCOPY. peratures, then one can replace the infinite
sequence of refrigerators of the thermodynamic
ideal with this discrete series, or cascade of
LIQUEF ACTION OF GASES liquid cooling baths. '
Such an arrangement is shown schematically
The liquefaction of all readily available gases in Fig. 1. The raw material is compressed to the
has become a routine operation in industrial pressure necessary to condense it at the tem-
technology. Prominent among the reasons for perature of the final refrigerant bath. The re-
converting a gas to a liquid are the net saving sulting liquid is expanded through a throttle
in the cost of storing or of transporting a nor- valve, and the vapor that boils off in the throt-
mally gaseous material in liquid form, the tling process is recycled to conserve the refriger-
convenience and flexibility of providing very ation it represents. The penalty in increased
low temperature refrigeration, in the form of work over the thermodynamic minimum arises
a low-boiling liquid, to mulitple sites of modest from the small number of steps in the sequence
or intermittent requirements, and the efficiency with the attendant irreversible exchange of heat
attainable in the separation of the components between the process gas and the much colder
of a gaseous mixture by the partial liquefaction baths; from the throttling losses for the product
of the mixture, or its total liquefaction followed liquid and the various cooling fluids; and from
by rectification. the imperfect efficiency of any real compressor.
The transoceanic shipment of liquefied natural For small-scale systems, the operational com-
gas, the commercial distribution ofliquid helium plexity and the equipment cost of a cascade are
to scientific laboratories, and the production of prohibitive. However, for large-scale operations,
pure oxygen and pure nitrogen from air are the economic performance of a cascade com-
representative examples of the first, second pares very favorably with any other process for
and third reasons, respectively. The first and the simple liquefaction of any gas for which a
last examples currently operate on scales such cascade of refrigerants can be found.
that thousands of tons of liquid are produced For large-scale operations such as the lique-
daily. faction of natural gas, one can realize signifi-
To produce a cold liquid product from gaseous cant savings in equipment costs if, rather than
raw material at ambient temperature requires a buying a smaller, separate compressor for each
heat-pumping operation. Thermodynamic anal- individual fluid in the cascade, one mixes the
ysis gives the (unattainable) irreducible mini- fluids together and buys a single, large com-
mum work Wmin, which must be expended in pressor for the mixed refrigerants. This saving
the heat pump operating in an environment at in equipment cost can more than compensate
temperature To, to convert a unit mass of warm for the slightly lower thermodynamic efficiency
gas to liquid to be
Wmin ='(Hliquid - Hgas) - TO(Sliquid - Sgas) TABLE 1. MINIMUM WORK OF
LIQUEFACTION OF VARIOUS GASES.
where Hliquid and H gas are the enthalpies and
Sliquid and Sgas are the entropies per unit mass Boiling Point Work Required
of liquid product and gaseous raw materii'll Substance (K) (kW-hr/kg)
!espectively. These thermodynamically revers:
Ible works of liquefaction are listed for various Methane 111.7 0.320
of the "permanent" gases in Table I which Oxygen 90.2 0.176
assumes that the starting material is ga~ at one Nitrogen 77.3 0.212
atmosphere pressure and 300 K. In large-scale Hydrogen 20.4 3.26
practica~ operations, the actual work require- Helium 4.22 1.90
ment will range from -3 times the minimum
661 LIQUEFACTION OF GASES

Water Cooling

Froduct GaS
Compressor '--_---.--'

efrlgerant Bath To
Precool Product And To
Condense Low Boilin~
Refrigerant

LOvV BOiling Refrigerant Bath


To Condense Product

rIG. I. Schematic arrangement for two-stage cascade.

(and higher power costs) that characterize such temperature results in the liquefaction of a
a system compared to a discrete cascade. liquid rich in the highest boiling component of
Such systems are represented schematically the refrigerant mixture. That liquid is separated
in Fig. 2. Commercially, they are frequently from the residual gaseous phase and injected
used for the production of liquefied natural gas into the first exchanger to cool the compressed
(LNG) and are referred to variously as MCR product stream and the residual gaseous refrig-
(muiticomponent refrigerant) or ARC (auto- erant stream. That cooling condenses yet more
refrigerated cascade) cycles. Cooling of the of the refrigerant stream ; this liquid is in turn
compressed mixture of refrigerants to ambient separated and injected into the main exchanger,
etc., until the final fraction of the refrigerant
stream is throttled directly and returned to the
Liquefied exchanger as coolant.
Linde and Claude Processes A stream of cold
NaTural Gas
gas, flowing countercurrent to the process stream
(L N G) - - '"""'iIii:-Qllr-~~~ in a heat exchanger that establishes perfect
thermal equilibrium between the two streams
at every point along their paths could substitute
for the infinite sequence of refrigerators in the
theoretical ideal system. The problem is just to
produce the stream of cold gas (let alone to
produce it with perfect efficiency) and to
produce a refrigerator to extract the heat of
vaporization from the product material at its
boiling point.
Application of the first law of thermodynam-
ics to the system shown in Fig. 3, consisting of
a constant high-pressure source of fluid at PI
that flows at constant rate through an insulated
Compressed heat exchanger B, then through a throttling
Natural Gas device C, and back through exchanger B, leav-

"
ing the exchanger against some constant low
pressure P2 at the same temperature at which
high-pressure fluid enters , gives
Refrigerant
Campresso-r- '-_ _ _--'

rIG. 2. Schematic arrangement for multicomponent HI and H2 are the enthalpies per unit mass of
refrigerant cascade. fluid entering and leaving the system, respec-
LIQUEFACTION OF GASES 662

Expanded Fluid Sink -14 K (where H2 > HI for helium). The pre-
cooled helium was then fed into a simple Linde
liquefier, and Onnes first liquefied helium in
1908.
For any real system, whose thermal insulation
leaks q units of heat per unit mass of entering
gas and whose exchanger permits gas to leave
the system at T 2 ' < TI, the energy balance
becomes
HI + q = fHJjquid + (1 - f)
. [H 2 +Cp (T 2 ' - Tdl.

Poor insulation (large q) and an inefficient ex-


changer (large IT 2 ' - TI I) can easily reduce f
to zero.
FIG. 3. Adiabatic throttling flow system. As the temperature of the gas entering the
exchanger of a Linde liquefier approaches its
critical temperature, the thermodynamic effi-
tively, and Q is the amount of heat absorbed ciency of the system as a refrigerator rises
by a unit mass of fluid in passing through the sharply. A simple Linde liquefier, coupled to any
system. The initial and final kinetic and gravi- of several types of efficient auxiliary precooling
tational potential energies are assumed equal. refrigerators, forms the final stage of almost
H I will be smaller than H 2 for most gases at every large-scale liquefier in common use. Linde,
absolute temperatures less than 8-10 times the himself, quickly modified his simple system to
normal boiling point and for initial pressures what, in essence, is a pair of simple liquefiers
of several hundred to a few thousand pounds operating in cascade. The first (precooling) unit
per square inch. At higher temperatures, HI operates between the common initial high
will be larger than H2 so that Q is negative- pressure and some intermediate pressure (chosen
heat is liberated within the exchanger-throttle for optimum overall efficiency) rather than
valve system. If the thermal insulation of the between the high pressure and one atmosphere.
exchanger-throttle valve system is perfect, then He also added a conventional ammonia refriger-
for HI < H 2 , the exchanger-throttle valve sys- ator for precooling to further enhance efficiency.
tem and the circulating gas itself will be con- It is possible to produce a stream of cold gas
tinually cooled until some of the circulating gas with relatively good efficiency by allowing
accumulates within the system as liquid, and compressed gas to expand in a reciprocating
a new energy balance expansion engine, or in an expansion turbine.
If the expansion engine is preceded by an effi-
HI = fHliquid + (1 - f)H 2 cient regenerative heat exchanger, relatively
modest ratios of inlet pressure to exhaust pres-
is attained. The fraction f of the entering gas is sure at the expansion engine can produce gas
withdrawn from the system as liquid product near its dew point. The thermodynamic effi-
with enthalpy Hliquid, and the fraction (1 - f) ciency of such expanders commonly approaches
returns through the exchanger as its refrigerant. or exceeds 80%. A fraction of this cold exhaust
Carl von Linde, in 1885, was the first to couple gas can be used to refrigerate the feed to a
the relatively feeble (40°C maximum for air simple Linde liquefier. Georges Claude in 1905
expanded from 4000 psi to one atmosphere at combined a reciprocating expansion engine
room temperature) Joule-Thomson cooling, with a simple Linde liquefier, as shown sche-
produced by throttling a high-pressure gas, and matically in Fig. 4, to produce an air liquefier
a regenerative heat exchanger to give the very of improved efficiency which became known
simple system that, starting from room tem- as the Claude cycle.
perature, is capable of liquefying any gas except Peter Kepitza, in 1934, combined a precool-
helium, hydrogen, or neon-albeit with poor ing bath of liquid nitrogen in sequence with a
efficiency. For helium, hydrogen, and neon, reciprocating expansion engine to precool com-
throttling at room temperature produces heat- pressed helium feed for a final Linde stage and
ing (H 2 < HI). Sir James Dewar used a bath of produced a Claude cycle which liquefied helium
liquid oxygen to precool compressed hydrogen without the use of any auxiliary liquid hydro-
to -90 K where H2 > HI. The precooled hy- gen. Collins developed a similar machine which
drogen was then fed into a simple Linde lique- was produced commercially in relatively large
fier, and Dewar first liquefied hydrogen in this numbers. Modifications in various sizes were
way in 1898. developed and produced commercially. The
Kamerlingh Onnes used a bath of liquid machines typically use a cascade of two expan-
hydrogen boiling under vacuum near its freez- sion engines in a Claude-cycle liquefier capable,
ing point to precool compressed helium to when combined with liquid nitrogen precool-
663 LIQUID STATE

Vance, R. W., "Cryogenic Technology," Ch. 2, New


Water Cooling York, John Wiley & Sons, 1964.
Product Gas
Collins, S. C., Science 116,289 (1952).
Compressor Wenzel, L. A., Advances in Cryogenic Engineering 20,
90 (1975).
Gibson, B. M., Advances in Cryogenic Engineering, 25,
715 (1979).

Cross-references: CRYOGENICS; ENTROPY; GAS


LAWS; GASES, THERMODYNAMIC PROPERTIES;
LIQUID STATE; REFRIGERATION; STATES OF
MATTER; THERMODYNAMICS.

LIQUID STATE
Liquid is the term used for a state of matter
FIG. 4. Schematic arrangement for Claude cycle. characterized by that of a pure substance above
the temperature of melting and below the
vaporization temperature, at any pressure be-
tween the triple point pressure and the critical
pressure (see Fig. I). The liquid state resembles
the crystalline in the relatively low dependence
Compressor L-_,--' of density on P and T, and resembles the gas
Raw Materia
state in the inability to support shear stresses
(see reference to glasses below). Structurally
Auxiliary Refrigerant
For Precoolmg the molecules are relatively close together but
they lack long-range crystalline order. The mu-
tual solubility of different liquids is also inter-
mediate between the complete mutual solubility
of most gases, and the relatively rare appreciable
mutual solubility of pure crystalline compounds.
Two liquids of similar molecules are usually
soluble in all proportions, but very low solu-
bility is sufficiently common to permit the
demonstration of as many as seven separate
liquid phases in equilibrium at one temperature
and pressure (mercury, gallium, phosphorus,
perfluoro-kerosene, water, anilinc, and heptane
at sooe, I atm.
Stability Limits With the exception of he-
lium and certain apparent exceptions discussed
below, Fig. I gives a universal phase diagram for
all pure compounds. The triple point of one P
FIG. 5. Composite system for helium liquefaction. and one T is the single point at which all three
phases, crystal, liquid, and gas, are in equilib-

ing, of producing up to 40 liters of liquid helium


per hour. The schematic arrangement is shown
in Fig. S. The same schematic arrangement of a
liquid precooling bath followed by expansion
engines, all precooling the feed to a final Linde
liquefier, describe in essence the large plants CRYSTAL
currently used for the large-scale liquefaction
of hydrogen or helium.
DAVID N. LYON

References
Haselden, G. G., "Cryogenic Fundamentals," Ch. 2,
New York, Academic Press, 1971. T-
Barron, Randall, "Cryogenic Systems," Ch. 3, New
York, McGraw-Hill Book Co., 1966. FIG. 1
LIQUID STATE 664

rium. The triple point pressure is normally be- to observe. In these cases, there is a continuous
low atmospheric. Those substances, i.e., C02, transition on cooling to a glass, which has the
Pt =3885 mm, Tt =- 56.6°C, for which it lies elastic properties of an isotropic solid. The
above, sublime without melting at atmospheric structure of the glass is qualitatively that of the
pressure. high-temperature liquid, lacking long-range or-
From the triple point, the melting curve der. Sinctf glass and liquid are not sharply dif-
defines the equilibrium between crystal and ferentiated, the term liquid is sometimes used
liquid, usually rising with small but positive to include glasses, although common parlance
dT/dP, and presumably always with positive reserves liquid for the state in which flow is
dT/dP at sufficiently high P values. The line is relatively rapid.
believed to extend infinitely without a critical Quantum Liquids The one real exception to
point (it has been followed to T ~ 16Tc for the phase diagram of Fig. I is that of helium,
He, and calculations indicate that hard spheres Fig. 2. Both isotopes, 4He and 3He, have no
would show a gas-crystal phase change). The .triple point, the liquid is stable to 0 K below
gas-liquid equilibrium line, the vapor pressure about 20 atm for 4He and below about 30 atm
curve, has dT/dP always positive and greater for 3He. The liquids have zero entropy at 0 K in
than the melting curve. The vapor pressure curve both cases. This is also the only case in which
always ends at a critical point, P = Pc, T = Tc, isotopic mixtures form two liquid phases at
above which the liquid and gas phase are no equilibrium, the isotopic solution separating
longer distinguishable. Since the liquid can be below I K. The isotope 4 He has itself two
continuously converted into the gas phase with- phases, He I above the dotted X-line of the
out discontinuous change of properties by any diagram, and He II with remarkable properties
path in the P-T diagram passing above the of superfluidity, second sound, etc., below the
critical point, there is no definite boundary be- X-line. The phase transition along the X-line is
tween liquid and gas. second order; that is, whereas S and V are
The term liquid is commonly reserved for continuous, heat capacity and compressibility
T < Tc, and "dense gas" is used for T> Tc. change discontinuously across the X-line.
However, certain properties, such as the ability Although no completely satisfactory single
to dissolve solids, change rather abruptly at the theory of liquid helium has yet been formu-
critical density . In many respects, the dense gas lated, one can say that most of the remarkable
resembles the low-temperature liquid of the properties are qualitatively understood and are
same density more closely than it does the due to the predominance of quantum effects,
dilute gas. including the difference in the statistics of the
The slope, dT/dP, of all phase equilibrium even and odd isotopes. Thus helium is the one
lines obeys the thermodynamic Clapeyron equa- example in nature of a quantum liquid, all other
tion: liquids showing only minor deviations from
classical behavior.
dT/dP = D.V/D.S = TD.V/D.H (1) Structure Considerable confusion in the de-
scription of liquid structure exists, due pri-
with D. V, D.S, and D.H the differences, for the marily to difficulties of precise formulation of
two phases, of volume, entropy, and heat con- verbal concepts. The geometric arrangement of
tent or enthalpy, respectively. The quantity any small number (say 10 to 12) of close lying
D.H is the heat absorbed in the phase change at molecules resembles the arrangement in the
constant P. Since always Scr. < Sliq. < Sgas and crystal, but the order rapidly disappears as
usually Vcr . < Vliq. < Vgas , one usually has larger groups are considered. Long-range order
dT/dP> 0; the relatively rare cases, including is lacking. The fact that numerical theories
water, for which Vliq. < Vcr. at low pressures
leads to dT/dP < 0 for the melting curve near
the triple point.
Figure I gives the P- T boundaries of the stable
liquid phase. Clean liquids can readily be super-
heated or supercooled, and in vessels having
walls to which the liquid adheres, they can be
made to support negative pressures of several \
tens of atmospheres. Thus the properties of the
metastable liquid can be investigated outside
the limits shown in the diagram.
t \~-lin. Critical
Point
p LIQUID JI LIQUID I
Two apparent exceptions to the universality
of the phase diagram of Fig. I deserve mention.
First, many of the more complicated molecules
decompose at temperatures below melting or GAS
boiling, and the diagram is unobservable. Sec- p·o
ondly, some liquids, notably glycerine and Si02
and many multicomponent solutions, super- oK T-
cool so readily that crystallization is difficult FIG. 2
665 LIQUID STATE

based on a lattice or cell structure have some


success is evidence only that most properties
depend on the configuration of near neighbors
alone. Insofar as the arrangement of nearest
neighbors is describable in terms of that of the
crystal, the structure of the normal liquid is
probably characterized best by a somewhat
closer spacing than the crystal of the same
molecules, the reduced density arising from a
considerable number of vacancies in the lattice;
the coordination number or number of nearest
neighbors is lower than in the crystal. The ex-
ception is water, in which the low coordination r--
number, 4, of the crystal, is increased by inter-
stitial molecules in the liquid, leading to a FIG. 3
higher density of the liquid.
Structural descriptions of this nature usually
lack the possibility of precise formulation. It can be shown to be the potential of average
is, however, possible to define for any disor- force of n molecules located at the positions
dered array of molecules in three-dimensional rl, ... , rn. That is, if there are n molecules at
space an arrangement of contiguous cells, each these positions there will be some average force,
containing one and only one molecule, the faces lxi, along the x-coordinate of molecule i. This
of the cells being the loci of the midpoints of average is the sum of the direct force due to
neighboring molecules. The statistics of the the other n - 1 plus the average of a fluctuating
fraction of cells with n faces and of the dis- force due to the others, whose average position
tances of the faces from the molecules would is affected by that of the n specified ones. This
give the fraction of molecules having a given average force is
number of nearest neighbors and the distance Ixi = - (ojoXi) Wn(rl, ••• ,rn)
distribution of these in a precisely defined
manner. Neither present experimental infor- One frequently assumes that Wn is a sum of
mation nor present theories lend themselves to pair forces only,
analysis in such terms.
The only clearly defined manner of describing Wn(rl,· .• ,rn) = L L' W2 (rij)
liquid structure in use at present involves the n~i>j;;'l
concept of a set of probability density func-
tions, pn, for ascending numbers, n, of mole- although this assumption is known to be only
cules. The function Pn depends on the vector approximate. With this assumption, the pair
coordinates rl , r2, ••. ,rn of n molecules, and average force potential, W 2 (rij), can be com-
puted as the solution of an integral equation,
Pn(rl, r2, ... ,rn)drl, ..• ,drn
and the solutions agree quite well with the
experimental curves.
The knowledge of the complete set of func-
is defined as being the probability that in the tions Pn plus that of the intermolecular forces
liquid of definite P and T, there will be at any would permit the computation of all equilib-
instant of time, one molecule at each position, rium properties of the liquid, and indeed if the
ri, within the volume element, dri. For a fluid, intermolecular forces are the sum of pair forces,
unlike a perfect single crystal, Pi(r) is a constant only a knowledge of P2 at all P, T values is
independent of r and equal to the number den- necessary. An adequate, although numerically
sity: the number, p, of molecules per unit difficult, theory of the transport properties also
volume. The first significant member of the set exists, using the equilibrium functions, Pn. At
is then the pair density function, P2 (rl , r2), present, only qualitative success is obtained in
which depends only on the distance, r =Irl - r21, the completely a priori use of the equations.
between the two molecules. At large distances Associated Liquids The description given
P2 (r ~ 00) = p2. This function can be found above is adequate only for liquids composed of
experimentally from the x-ray scattering inten- spherically symmetric molecules or molecules
sities of the liquid (it is the three-dimensional that are nearly so. These constitute the so-
Fourier transform of the scattering intensity at called normal liquids, which obey reasonably
angle 6 vs (41r(A)jsin (6j2». A typical plot is well the law of corresponding states, for which
shown in Fig. 3. The area under the ill-defined the entropy of vaporization at the boiling point
first peak integrated over 4m 2 dr is the average has the Trouton's rule value of approximately
number of nearest neighbors, and is of order 10 21 calj deg. F or molecules containing large
to 11 for normal liquids. dipole moments, or those forming mutual hy-
The quantity of dimensions of energy, drogen bonds, the concept of the probability
density functions must be extended to include
angles or other internal degrees of freedom in
LIQUID STATE 666

the coordinates. Such inclusion is conceptually erence frames moving with a relative velocity v
easy, but incredibly complicates the already along the z axis (for simplicity) would be re-
difficult numerical evaluation of any equations. lated by the Lorentz transformation:
However, certain qualitative statements may be
made. X'=X
Liquids composed of molecules with large
dipole moments are frequently referred to as y'=y
associated. Although in some instances relatively (2)
z' = (1 - v2 le 2 )-1/2 (z - vt)
stable dimer or definite polymer units of rela-
tively fixed orientation may exist, in many t' = (1 - v2 Ic 2 )-1/2 (t - vzlc 2 ).
cases, notably water, it is extremely doubtful if
an exact knowledge of the structure would re- One readily observes that these relations satisfy
veal any distinguishable entities of associated Eq. (1).
molecules other than that of the whole liquid. Every observer, related by a Lorentz transfor-
In such cases, one would, however, expect that mation, observes the same physical process in
certain mutual angular orientations between all of space-time and conversely, any physical
neighboring molecules will be highly preferred, process must be Lorentz invariant. Such pro-
whereas in the dilute gas this will not be the cesses described by Newtonian mechanics, for
case. The effect of this restriction on the inter- example, are valid approximations when v «c
nal coordinates will be to decrease the entropy of a more general relativistic mechanics. Classi-
of the liquid markedly compared to the gas. cal theories such as electromagnetism involving
This effect is qualitatively the same as in as- the velocity c are Lorentz invariant.
sociation, and the properties of these liquids, The Lorentz transformation of Eq. (2) is a
particularly the high entropy of vaporization, subgroup of a general class of rotations in
will simulate those of a liquid, composed of Minkowski space. We first consider ordinary
definite associated complexes. rotations of the spatial coordinates leaving the
X4 coordinate unchanged. If we perform a
JOSEPH E. MAYER rotation in the Xl -x2 plane about the X3 axis
through an angle W in a clockwise direction, a
Cross-references: CRYSTALLOGRAPHY, DIPOLE MO- point described by (XI, X2, X3, X4) in the origi-
MENTS, ENTROPY, SUPERFLUIDITY, SURFACE nal coordinate system is described in the new
TENSION, THERMODYNAMICS, TRANSPORT THE- (primed) coordinate system by
ORY, VAPOR PRESSURE AND EVAPORATION.
Xl'=Xl COSW+X2 sinw

LORENTZ TRANSFORMATIONS
X2' =X2 cos W - XI sin W
(3)
X3' =X3
An event specified at a point r == (x, y, z) in
ordinary space at an instant of time t defines X4' =X4·
an event described by the space-time coordi-
nates (x, y, z, t). Such an event observed in a These relations also satisfy Eq. (1). Such a
particular reference frame S may also be de- transformation may be obtained by a unitary
scribed in another reference frame S', in motion operator

a!2 - a!JJ.
with a velocity v with respect to S. A Lorentz
transformation relates the space-time coordi-
nates in frame S' describing the event in frame exp [-W(XI X2 (4)
S to the space-time coordinates in S. This
Lorentz transformation forms the basic mathe- Noting that exp M = I + (Mil!) + (M2 12!) +
matical foundation for Einstein's special theory (M3 13!) + ... , and that
o/relativity (see RELATIVITY).
An interval between two events may be de- M2n XI =(-I)n W 2n XI ;
scribed by an invariant measure of length, a
metric, that is the same in all reference frames: M2n X2 =(-I)nw2nX2;

M2n+l xl =(_I)n w 2n+l x2 ;


(~S)2 =(~Xd2 +(~X2)2 +(~X3)2 +(~X4)2
= (~XI')2 + (~X2')2 + (~X3')2 + (~X4')2 M2n+l X2 = (_1)n+1 w 2n + 1 Xl
one easily obtains the relations (2) that give the
(1)
primed coordinates in frame S' in terms of the
where we have set (Xl, X2, X3, X4) == (X, y, z, unprimed coordinates in S.
ict) and where c is the velocity of light. A Similarly if we perform a "rotation" in the
space-time with the above metric is called a X3-X4 plane in Minkowski space through an
Minkowski space. In the Einstein picture there imaginary angle iX where tanh X = vic such that
is no invariant separation of space and time. the primed coordinate system moves along the
Observation of an event from two different ref- X3 axis with a velocity v, the primed coordi-
667 LORENTZ TRANSFORMATIONS

nates may be obtained by use of the operator consequently

exp [-iX(X3 _3 - X4
3X4
~)J.
3X3
The transformation obtained by this Lorentz The part within the parentheses is related to the
transformation often called a Lorentz boost to orbital angular momentum operator L 12 such
distinguish it from the ordinary rotations yields that
XI' =XI

X2' = X2
(7)
X3' =X3 cosh X + iX4 sinh X We may generally express a finite Lorentz rota-
X4' = X4 cosh X - iX3 sinh X. tion or boost in a particular xJJ."xv plane through
an angle W by the operator
Noting that cosh X =(1 - v2 /c 2 )-1/2 and sinh X =
(v/c) (! - v 2 /c 2 )-1/2 one sees that expressions
(7) are identical to those of (1) when expressed cf>(X) = exp (1 WLlJ.v) cf>(x') (11)
in the corresponding parameters.
One can express generally the Lorentz rota- which is obtained by exponentiation of (10)
tions and boosts of the coordinate frame in The requirements of special relativity, that
matrix form: physical processes be independent of the refer-
ence coordinate frame, leads to rather profound
(8)
conservation laws that may be obtained from
with the orthogonality condition on the ma- variational principles involving infinitesimal
trices variation of the Lagrangian function describing
the system. The conservation of orbital angular
(9) momentum is a direct consequence of invari-
ance under Lorentz transformations involving
A summation is to be made over repeated in- scalar particles.
dices. The Lorentz transformations (8) corre- A more complicated example is that of the
spond to a six-parameter homogeneous group. four-vector field such as the electromagnetic
However, events occurring in a coordinate vector potential AIJ. (x) consisting of three
system involve a physical process associated spacelike components AI, A 2, A 3 and a time-
with a physical object in that coordinate system. like component A4 = icV(x). The components
An equivalent description may be obtained by may be represented by a column matrix:
"rotating" the physical system rather than the
coordinate system, but in the inverse direction.
A physical quantity may be expressed by a AI (X)]
function involving the coordinates and mayor A 2 (x)
may not have other intrinsic degrees of freedom A (x) =[ . (12)
not directly involving the space-time coordi- A 3 (x)
nates. Examples are scalars, spinors, polar four-
vectors, antisymmetric tensors, etc. These other A4(X)
intrinsic degrees of freedom are designated as
spin and they occur in the quantum mechanical Omitting details too lengthy to treat here, we
description of elementary particles. The Lorentz must introduce an operator analogous to LIJ.V
transformation properties of these wave func- that rearranges the components of A IJ. (x) into a
tions must be examined. We consider two ex- new local arrangement AJ (x') referred to the
amples, the transformation properties of a transformed coordinate system. It should be
scalar and a polar four-vector wave function. emphasized that LlJ.v involves the coordinates
Consider a wave function in frame 8' de- while the spin operator introduced here in-
scribed by (1) that is considered to differ infin- volves the components of A (x).
itesimally from 8 by an angle OW in the x I -X2 These spin operators are defined in this Car-
plane. We make a Taylor expansion about the tesian representation as
unprimed coordinate system where x I' = Xl +
OWX2 and X2' = X2 - OWX1' The coordinates
are expressed in a shorthand notation, i.e.,
cf>(x) == cf>(X1, X2, X3, X4).
8 23 =fI[~o ~ -~ ~
i 0 0
o 0 0 0
LORENTZ TRANSFORMATIONS 668

After rearranging in trigonometric series and

~{~ ~}
0
carrying out the matrix multiplication, one ob-
0 0 tains expressions for the components of the
S" four-vector field A(x):
-I 0 0 , , ,
A 1 (x ) = A 1 (x ) cos w + A 2 (x') sin w
0 0 0
A 2'(x') = A2 (x') cos w- A 1 (x') sin w
0 -i 0 0 (14)
A 3'(x') =A 3 (x')
0 0 0
S12 =fz
0 0 0 0 It is not obvious from the operation per-
0 0 0 0 formed here that, not only are the components
of the polar vector field rearranged by this op-

]
erator, but also the dependence on the coordi-

~{~
0 0 nates that are in the primed system. This feature
0 0 would be made obvious in the transformation
of the Lagrangian function involving these
S" 0 0 fields, which is beyond the scope of this article.
This local dependence may be made global by
0 0 inclusion of the operator L 12 occurring in (11)
that transforms the coordinates of the primed
0 0 0 0 system to those of the unprimed system:

S24 =fz
0 0 0 -i
0 0 0 0
A'(x) = exp [1 W(L 12 +S12)]A(X) (15)

so that we have
0 0 0
A 1'(x) =Al (x) cos W + A 2 (x) sin w

s,,~{
n
0 0 A 2'(x) =A2 (x) cos W - A 1 (x) sin w
0 0 (16)
A 3'(x) =A 3 (x)
0 0
0
The sum of L/lu and S/lu representing the or-
The action of an operator similar to that of (11) bital and spin angular momentum operators
involving these spin operators S/lu may be ob- yield that for the total angular momentum J /lU.
tained by expansion of the exponential The Lorentz rotation of a particle or field
through an angle w in the X1-X2 system is thus

"
A (x) = exp
C
i ) wS/lU A (x).
given by the operator

exp (i WJ12) . (17)


Noting that, for example,

[' f]
The corresponding Lorentz boost of a particle
0 0 to velocity v along the X 3 direction is given by

G-
. 2n 0 0 the operator.
WSu ) ~ (- I)" w'" ~ 0 0
exp (~XJ34) (18)
0 0
where tanh X = vIc. In this latter case the com-
and ponents of a four-vector field become

r ~}
0 0 AI' (x) = A 1 (x)

C
i WS12
n 1
+ = (_l)n W 2n +1
-1
0 0 0
0 0 Az'(x) =Az(x)
A 3'(x) =A 3 (x) cosh X + iA 4 (x) sinh X
(19)

0 0 0 A 4'(x) = A4(X) cosh X - iA 3 (x) sinh X


669 LUMINESCENCE

where as before Rindler, W., "Essential Relativity," Second Edition,


New York, Springer-Verlag, 1977.
cosh X = (I - V2 le 2 )-1/2 =Elme 2 Sakurai, J. J., "Invariance Principles and Elementary
Particles," Princeton, Princeton Univ. Press, 1964.
sinh X = (vic) (1 - v2 le 2 )-1/2 = plme.
In these expressions, m is the rest mass of the Cross-references: CONSERVATION LAWS AND
particle, E is the total energy, and P is the mo- SYMMETRY; ELECTROMAGNETIC THEORY; EL-
mentum. EMENTARY PARTICLES; MATRICES; MECHAN-
In any physical process involving particles or ICS; PARITY; TIME; VELOCITY OF LIGHT; WEAK
fields the invariance of the Lagrangian under INTERACTIONS.
infinitesimal homogeneous Lorentz transfor-
mations leads directly to the law of conserva-
tion of total angular momentum whenever the
particle has an intrinsic spin. The orbital and
spin angular momentum are not independently LUMINESCENCE
invariants in a dynamical process.
There are other Lorentz transformations allow- Introduction Luminescence is the phenom-
able. The six-parameter homogeneous group (8) enon of light emission in excess of thermal
may be extended to the ten-parameter inhomo- radiation. Excitation of the luminescent sub-
geneous group that includes the translations stance is prerequisite to the luminescent emis-
sion. Photoluminescence depends upon excita-
(20) tion by photons; cathodoluminescence, by
cathode rays; electroluminescence, by an applied
The constant €j.l represent a translation or shift voltage; chemiluminescence, by utilization of the
of the coordinate Xj.l. Such translations obey energy of a chemical reaction. Luminescent
the invariant metric condition (I). emission involves optical transitions between
The corresponding Lorentz translation of a electronic states characteristic of the radiating
wave function is given by substance. The phenomenon is essentially the
emission spectroscopy of gases, liquids, and
solids. The same basic processes may yield
infrared or ultraviolet radiation in substances
with suitable electronic energy states; there-
where P/l are the operators corresponding to fore, such emission in excess of thermal radia-
the components of the four momenta (PI, P2, tion is also described as luminescence.
P3, (i/e) E). Luminescence can be distinguished from the
-Ii a RamaJl effect, Compton and Raleigh scattering
Pj.l=---. and Cerenkov emission on the basis of the
i aXj.l time delay between excitation and luminescent
emission being long compared to the period of
Invariance of the Lagrangian function under in- the radiation, A/e, where A is the wavelength
finitesimal Lorentz translations leads directly to and e is the velocity of light. The radiative
the conservation of momentum and energy. lifetimes of the excited states vary from 10- 10
There are also the improper Lorentz trans- to 10- 1 sec depending on the identity of the
formations in which the space and/or the time luminescent substances whereas A/e is approxi-
coordinates are inverted. Space inversion corre- mately 10- 14 sec for visible radiation. At ordi-
spondsto nary densities of excitation, the spontaneous
xi' = -Xi, i = I, 2, 3 (22) transition probability predominates so that the
luminescent radiation is incoherent; under con-
while time inversion corresponds to ditions of high densities of excitation in suit-
able luminescent substances, the induced transi-
X4' = -X4· (23) tion probability may predominate, the emitted
radiation is coherent, and laser action is at-
Both of these improper Lorentz transforma- tained.
tions obey the invariance condition (I). They The initial persistence of luminescent emis-
are discussed in detail in the article PARITY. sion following the removal of excitation de-
RAYMOND W. HAYWARD pends on the lifetime of the excited state. This
emission decays exponentially and is often
called fluorescence. In many substances, there
References is an additional component to the afterglow
Jackson J. D., "Classical Electrodynamics," Second which decays more slowly and with more com-
Edition, New York, John Wiley & Sons, 1975. plex kinetics. This is called phosphorescence.
Pauli, W., "Theory of Relativity," London, Pergamon For many inorganic crystals, the emission
Press, 1958. spectra for fluorescence and phosphorescence
LUMINESCENCE 670

are the same; the difference in afterglow arises


from electron traps from which thermal acti-
vation is prerequisite to emission. For organic
molecules, the emission spectra for fluores-
cence and phosphorescence are often different:
the former occurs from an excited singlet· the
latter, from a triplet state. '
Luminescence of Gases The simplest lumi-
nescent substances are monatomic gases. The 2
electronic states are characteristic of the iso- 1
lated atoms; therefore, the excitation and emis-
sion spectra depend only on the differences in
energy of the stationary electronic states of the
many-electron atom, and the spectral lines are
broadened only by the lifetimes of the excited
states or at higher pressures, by collisions. The
transitions are to a good approximation one-
-
electron transitions. Resonance fluorescence is
photoluminescence in which the exciting radia-
tion is the exact frequency or wavelength for R
the transition from the ground to the excited FIG. 2. Configuration coordinate model.
state and emission occurs with the same fre-
quency. Resonance fluorescence is shown dia-
grammatically in Fig. I for low pressure alkali as shown in transitions I and 2 of Fig. 2. In
metal vapor. The well-known 2537A line of most cases the emission will involve a smaller
tran~ition energy and occur at longer wave-
mercury vapor is another example of reso-
nance fluorescence. This emission can also be length than the photo-excitation. This is re-
excited by electrons accelerated by 5 or more ferred to as Stokes' emission. In some cases, for
volts. The simplest case of sensitized fluores- example at high temperatures when the higher
cence (photoluminescence in which absorption vibrational levels of the ground electronic state
of the exciting radiation is by one substance are populated thermally, anti-Stokes' emission
and the excitation is transferred to another is also observed. Additional structure in the
which emits radiation) occurs with mixtures of photo excitation and emission spectra arises
m~n~tomic gases. For example, the charac-
from rotational states of the molecule. Iodine
tensilc fluorescence of thallium is observed is a typical diatomic luminescent molecule
when mixtures of Tl and Hg vapors are illu- excited by green light with visible emission at
minated. with.the 2537A radiation of Hg. slightly longer wavelengths. Benzene and aniline
~or dIatomIc and polyatomic gases, the en-
are typical polyatomic molecules which are
ergies of the electronic states are dependent on luminescent as vapors.
the interatomic distances of the molecule. This Luminescence of Organic Materials The elec-
dependence is shown in Fig. 2 for the Iround tronic states of most organic luminescent mate-
and excited states. For a diatomic molecule the rials in the liquid or solid phase, either as pure
coordinate R is the distance between the two materials or as solutes in dilute concentration
atoms. For each electronic state there is a in inert solvents, are to a good approximation
series of vibrational levels which ar~ also shown describable in terms of the electronic states of
in Fig. 2. Optical transitions occur between the free molecule in the gaseous phase. In other
individual vibrational levels of one electronic words, the intermolecular forces are weak com-
state and individual vibrational levels of another pared to the intramolecular forces. The photo-
electronic state. These transitions occur in ac- excitation and luminescent emission spectra of
~ordan.ce ~ith the Franck-Condon principle, these substances in condensed phases are similar
I.e., wIth fIxed nuclear coordinates, vertically to the spectra of the vapors. The intermolecular
forces are, h~wever, great enough to bring
about broademng of absorption and emission
lines and in some cases to bring about elec-
tronic energy transfer between molecules be-
fore intramolecular, vibrational relaxation with
the accompanying Stokes' shift can occur. On
the other hand, in a viscous or rigid medium
collisional, nonradiative de-excitation is reduced.
Many of the organic luminescent materials are
aromatic molecules related to dyes. The sodium
salt of fluorescein in dilute aqueous solution is
well known as an efficient fluorescent material.
Other organic substances luminesce efficiently
FIG. 1. Resonance fluorescence of atoms, e.g., Na. when dissolved in organic solvents. Terphenyl
671 LUMINESCENCE

- -absorption of the spin selection rule governing radiative


-fluorescence transitions, the triplet has a long lifetime and
the oscillator strength for direct excitation to
the triplet is negligible. In suitable systems,
I' polarized excitation and emission arising from
I ' the anisotropy of the organic molecules can be
1 \ observed.
I \ Chelates involving organic molecules as li-
I \ gands bound to a metal atom or ion are a class
I ", of substances with members which luminesce.
/
I " ' ..... ..... The fluorescence and phosphorescence of chlo-
rophyll both in vivo and in vitro have been in-
vestigated for many years. On the other hand,
16000 18000 20000 rare earth chelates with organic ligands have
.v). ) c m~1 been intensively investigated quite recently as
lasers. The photoexcitation occurs in the broad
FIG. 3. Mirror symmetry of absorption and emission. absorption bands of the ligand; the energy is
transferred to the localized 4f shell of the rare
earth by a mechanism related to the transfer
in xylene is a liquid {3- and "y-ray scintillator process occurring in the scintilla tors; lumines-
with emission in the near ultraviolet. Some or- cent line emission characteristic of the rare
ganic molecules luminesce most efficiently in a earth occurs as coherent radiation with high
rigid medium. A solid solution of I per cent excitation intensities. A fluorinated Eu-ace-
anthracene in napthalene is a scintillator with tonate dissolved in acetonitrile has been an-
blue emission. For these solutions, energy is nounced as a liquid laser operating near room
absorbed by the solvent molecules and trans- temperature.
ferred to the solute where the luminescent Luminescence of Inorganic Crystals Inor-
emission occurs. Crystals of some pure organic ganic crystals which luminesce are often called
substances luminesce, particularly at low tem- phosphors. Their luminescence in most cases
peratures. originates from impurities or imperfections.
The fluorescent emission and the long-wave- Exceptions include the luminescence of alkaline
length absorption of organic materials are often earth tungstates which is characteristic of the
simply related as mirror images of each other. W0 4 2- group perturbed by the crystal field,
This is shown in Fig. 3 for rhodamine in etha- the luminescence of some rare earth salts, and
nol and can be explained on the basis of the radiative recombination of conduction elec-
configuration coordinate model in Fig. 2. with trons with valence band holes in semicon-
the force constants for the two electronic states ductors. The impurities and imperfections re-
approximately equal. For organic molecules, sponsible for luminescence in inorganic crystals
the configuration coordinate R is interpreted are of diverse atomic and molecular types
as representing, schematically, all the intra- whose characteristics depend on the structure
molecular nuclear coordinates. In addition to of the defect and on the electronic structure
the fluorescent emission, many organic sub- of the pure crystal. In some cases, the elec-
stances exhibit phosphorescent emission. This tronic states involved in the luminescence can
arises from nonradiative relaxation from the be described in terms of energy levels of the
excited singlet to the triplet state followed by impurity ion perturbed by the crystal field;
radiative decay from the triplet to the ground in other cases, in terms of the crystal band
singlet state, as illustrated in Fig. 4. Because structure perturbed by the impurity. The exis-
tence of conduction bands in inorganic crystals,
particularly in semiconducting crystals, intro-
duces additional mechanisms for the excitation
of luminescence and for phosphorescence. For
example, suitable impurities can be excited by
~"r---T alternate capture of injected conduction elec-
trons and valence band holes, thus providing
one mechanism for electroluminescence; on the
fl. other hand, an excited luminescent impurity
ph. may lose an electron to another defect via the
conduction band, and the thermal activation
necessary for return to the luminescent im-
purity is responsible for phosphorescence.
S --l.---'-_-.&-_
The alkali halides are simple ionic crystals
which become luminescent when doped with
suitable impurities. Thallium substituted in di-
FIG. 4. Fluorescence and phosphorescence of or- lute concentration at cation sites in potassium
ganic molecules. chloride has the absorption and emission shown
LUMINESCENCE 672

- -absorption of interest as lasers and as phosphors for fluo-


rescent lamps and for color television. Crystal
--emission field theory can be used to explain the optical
(\ absorption and luminescent emission of the 4f
I \ transitions of rare earth ions in crystals and
I \ glasses. Phosphors doubly activated with rare
I \ earths, e.g., YF 3 : Yb, Tm, have been found to
I \ be capable of large anti-Stokes' emission. This
I \ occurs by multiphoton infrared excitation and
with visible emission.
v \ '"'
, ,
The zinc sulfide phosphors, which are widely
\ / \ used as cathodoluminescent phosphors and
/
'- / \...
well-known for their electroluminescence, are
2000
now recognized as large band gap, compound
semiconductors. Two impurities or imperfec-
tions are essential to the luminescence of many
FIG. 5. Spectra of KCL:ll. of these phosphors: an activator which ~eter­
mines the emission spectrum and a coactlvator
which is essential for the emission but in most
in Fig. 5. The absorption bands involve the cases has no effect on the spectrum. Activator
IS -+3 pO, 1 pO transitions of the free ion per- atoms such as Cu, Ag and Au substitute at Zn
turbed by crystal interactions; the principal sites and perturb a series of electronic states
emission band, 3pO -+ 1 S, is similarly perturbed. upward from the valence band edge. In a neu-
The spectra can be understood qualitatively tral crystal containing only these activator impu-
with the aid of Fig. 2, modified with a second rities, the highest state is empty, i.e., it contains
excited state and with the configuration co- a positive hole and can accept an electron from
ordinate interpreted as symmetric displacement the valence band; therefore, in semiconductor
of the six nearest-neighbour Cl- from the Tl+. notation, the activator is an acceptor. In a simi-
It is this interaction which is most dependent on lar way, coactivators such as Ga or In at Zn
the electronic state of the Tl+ and is, therefore, sites or CI at S sites are donors in ZnS. The
largely responsible for the band widths and simultaneous introduction of both types of
Stokes' shift. impurities results in electron transfer from
Many inorganic crystals become luminescent donor to acceptor lowering the energy of the
when certain transition metal ions are dissolved crystal and leaving both impurities charged. The
in them. The luminescence involves intercom- coulomb attraction of the donor and acceptor
bination transitions within the 3d shell; there- leads to a departure from a random distribu-
fore, crystal field theory can be used to inter- over lattice sites and to pairing. The electronic
pret the absorption and emission spectra. states and some of the transitions of acceptors,
Divalent manganese is a common activator i?n. donors and donor-acceptor pairs are shown in
Zn z Si0 4 , ZnS and 3Ca3(P04h . CaF z are Im- Fig. 6. The spectrum of ZnS:Cu, Ga is shown
portant phosphors a~tivated w.ith Mnz~. !he in Fig. 7. The longer-wavelength emission band
last activated also WIth Sb 3+, IS the pnncipal involves the transition from the lowest donor
flu~rescent lamp phosphor. The excitation at state to highest acceptor state (transition 3) in
2537A from the Hg discharge occurs at the approximately fifth nearest-neighbor pairs; the
Sb 3+ whose energy level structure is similar shorter-wavelength emission corresponds more
to that of Tl+; part of the energy is radiated in nearly to transition 1 of Fig. 6. Luminescent
a blue band due to Sb3+ and part is trans- emission from donor-acceptor pairs has been
ferred to the Mnz+ which is responsible for an more clearly seen with gallium phosphide crys-
orange emission band. It has been shown that tals. In addition to luminescence due to donors,
the Sb 3+ at a Ca z + site is locally charge-com-
pensated by an Oz- at a halide site and that
the blue emission is in part an eJectron transfer
transition. The ruby laser involves the lumines-
cence of Cr 3+ in Al z 03. Excitation occurs in a condv.chon band
broad absorption band, and the system relaxes
to another excited state from which emission
occurs in a narrow band.
Rare earth ions, particularly trivalent, in solid
solution in inorganie crystals and in glasses
exhibit the emission characteristic of transitions
in the 4f shell. For examples, samarium, euro-
pium, and terbium give visible emission; neo- valence band
dymium, infrared; and gadolinium, ultraviolet.
Because of their narrow emission bands and FIG. 6. Band model for acceptor, donor, and pair
stability, the rare earth activated phosphors are transitions.
673 LUMINESCENCE

-300 k systems involves to some extent all the particles


and thus the exact description of the excitation
- - 80 K is a collective excitation.
Under some conditions of luminescent excita-
.... -, tion (for example, very high excitation inten-

/
,,
"" "' sity), a more complete formulation is necessary,
including the radiation field of the excitation,
I
I the detector of emission, and the source of any
I applied stresses to the luminescent body. Even
I at low intensities of excitation the coupling of
I
I
ths: electromagnetic probe to transverse excitons
I must be included to describe excitonic lumi-
I nescent transitions, that is, excitonic polaritons.
/
Another example is the recent analyses of the
" effects of hydrostatic pressure on luminescent
4000 $000 6000 7000 spectra in which a type of double adiabatic
AJl approximation is used, with the electrons,
nuclei and pressure apparatus approximated
FIG. 7. Spectra of ZnS: Cu, Ga. respectively as fast, intermediate, and slow
components of the total system.
The usual luminescent transitions depend on
acceptors and their pairs, emission bands due to thermalization of the excited molecule or center
transition metals are well known for zinc sul- among the vibrational levels illustrated in Fig. 2.
fide as noted earlier. In zinc sulfide crystals, the Emission which occurs before thermalization
donors which are unassociated with acceptors is termed "hot" luminescence. It is well-known
serve as electron traps and are responsible for for molecular dopants such as O2- in alkali
long-persistent, temperature-dependent phos- halides. This phenomenon is investigated more
phorescence. Uncharged or isoelectronic dop- widely by nanosecond and picosecond time-
ants, such as N at phosphorous sites in GaP, resolved spectroscopy.
have been shown to be efficient radiative re- Hot electrons are responsible for the excitation
combination centers, particularly in indirect of luminescence with high electrical fields, that
band gap semiconductors. The III-IV semicon- is, high field electroluminescence. This phenom-
ductors with isoelectronic dopants or with enon is investigated in thin film devices consist-
donor-acceptor pairs are currently the princi- ing of a phosphor sandwiched between insula-
pal electroluminescent light-emitting diodes. tors, all films prepared by evaporation or
Trends in Luminescence Research Both mo- sputtering, and subjected to a.c. voltages. The
lecular and solid-state luminescence research mechanism is well-established for ZnS: Mn and
have become increasingly sophisticated in ex- ZnF2: Mn as collision excitation of Mn 2 + by
perimental techniques and theoretical concepts; hot electrons, and cross sections have been
the materials investigated, more diverse and measured and calculated theoretically. These
structured. Low temperatures, high electrical and rare earth doped components are used in
and magnetic fields, intense excitation and thin film displays.
short times are among the conditions used to For systems exhibiting inhomogeneous spec-
investigate luminescent phenomena. Lumines- tral broadening, site selection spectroscopy is
cent excitation is most generally formulated used to resolve components of the spectra. Ions
theoretically as a collective excitation of a or molecules at specific sites are selectively ex-
many-body problem encompassing the source cited to fluorescence or phosphorescence with
of excitation and its probe, as well as the lumi- laser sources at low temperatures. Organic com-
nescent body itself. Inhomogeneously doped pounds in solutions, crystals, and amorphous
crystals, amorphous materials and man-made materials are investigated by site selection
superlattices are now investigated as luminescent spectroscopy. "Hole burning," that is, optical
materials, as well as are homogeneous solutions, depletion of selective ground state molecules
crystals and powders. . followed by optical probing of these molecules
The configuration coordinate model shown m is another valuable tool for studying the origin
Fig. 2 for diatomic and polyatomic mo~ecules of inhomogeneous broadening and also to
is also applied to luminescent centers m the investigate energy transfer.
condensed phase. The configuration coordinate
represents the local and/or normal atomic
coordinates of the luminescent system. The FERD WILLIAMS
model is based on the adiabatic approximation
to - the Born-Oppenheimer expansion for a Reference
system of many electrons and many nuclei,
for which the electrons are assumed to move Pringsheim, P., "Fluorescence and Phosphorescence,"
rapidly compared to the nuclei. Howe~er, New York, Interscience, 1949.
luminescent excitation of these many-partlcle Curie, D., "Luminescence in Crystals" New York,
LUMINESCENCE 674

J. Wiley & Sons, 1963 (translated by G. F. J. Gar- DiBartolo, B. (Ed.), "Luminescence of Inorganic Sol-
lick). ids," New York, Plenum Press, 1978.
Goldberg, P., Ed., "Luminescence of Inorganic Sol- Curie, D., Mattler, 1., and Parrot, R. (Eds.), "Proceed-
ids," New York, Academic Press, 1966. ings of the 1978 International Conference on Lumi-
Crosswhite, H. M., and Moos, H. J., Eds., "Optical nescence," Amsterdam, North Holland, 1979.
Properties in Crystals," New York, Wiley-Intersci- Williams, F. (Ed.), "Workshop on the Physics of Elec-
ence,1967. troluminescence," Journal of Luminescence 23 (1,2)
Lim, E. C., Ed., "Molecular Luminescence," New (1981).
York, Benjamin, 1969. Broser, I., Gumlich, H.-E., and Broser, R. (Eds.), "Pro-
Williams, F., Ed., "Proceedings of International Con- ceedings of the 1981 International Conference on
ference on Luminescence," Amsterdam, North Hol- Luminescence," Amsterdam, North Holland, 1981.
land, 1970.
Birks, J. B., "Photophysics of Aromatic Molecules,"
New York, Wiley-Interscience, 1970.
Williams, F., Ed., "Luminescence ofCrystaIs, Molecules,
and Solutions," New York, Plenum, 1973. Cross-references: COLOR CENTERS; CRYSTAL-
Shionoya, S., Nagakura, S., and Sugano, S. (Eds.), LOGRAPHY; ENERGY LEVELS; LASERS; PHOTO-
"Proceedings of the 1975 International Conference CONDUCTIVITY; RADIATION, THERMAL; RESO-
on Luminescence," Amsterdam, North Holland, NANCE; SEMICONDUCTORS; SOLID-STATE PHYSICS;
1976. SOLID-STATE THEORY; SPECTROSCOPY.
M
MAGNETIC FIELD the conductor axis. The same is true for an axial
current in any axisymmetric conductor, e.g.,
Magnetic field is generated by the passage of the toroidal conductor of Fig. I (b). An in-
electrical currents through conductors or by the finitely long circular cylindrical conductor. [Fig.
circulation of microscopic charges within mag- 2(a)] carrying an azimuthal current denslty It:
netic materials. The magnetic field exerts a per unit length contains an axially directed
mechanical stress on its sources. magnetic induction of strength. B = 41T~~. ~he
Basic Equations To separate electromagnetic same expression holds for a stralght cylmdncal
theory from the theory of the solid state, Max- conductor of arbitrary cross section [Fig. 2(b)],
well's equations can be written in terms of the or for an infinite plane current sheet. A cirCUlar
magnetic induction or flux-density B and the cylinder like that of Fig. 2(a), but of finite
total current density J: length L and radius R, has a central mafnetic
V"B=O (1) induction of strength B = 41TIbL(L2 + 4R rl/2.
In terms of the vector potential A,
V' X B = 41TJ (2)
B=V'XA (4)
where, for purposes of the present article, the
displacement current (l/c 2 )aE/at is neglected and the gauge V" A = 0, one ha~
relative to 41TJ (see ELECTROMAGNETIC THEORY V'2 A = -41TJ (5)
and MAGNETISM).

_f
The solution of Eqs. (1) and (2) is and
3 J(rd X (r - rd
B(r) - d r1 1r - r1 13 (3) (6)

where the integral includes all current-carriers, for A vanishing at infinity.


and B vanishes at infinity. Magnetic lines of force are defined by dr ex B
From Eq. (3) it follows, for example, that an
infinite straight conductor [Fig. l(a)] carrying
a total axial current Ie gives rise to an azimuth- B B
ally directed external magnetic induction of
strength B = 2Ie/r, where r is the distance from

-
I I

+- I

-
I
I
I,c lc
I I
I
-+-1
1 I

,...
I
I
I
1
"7
1
I I
(0 ) (b) (b )
(0 )
FIG. 1. Axisymmetric configurations with axially FIG. 2. Infinitely long cylindrical configurations
directed current Ie. with axially directed magnetic induction B.

675
MAGNETIC FIELD 676

variable of the magnetic material (Le., it is con-


stant in uniform samples and constant fields).
This condition follows automatically from the
interpretation of M as a magnetic-moment den-
sity per unit volume:
M=Nmo (12)
The theoretical molecular magnetic moment
mo (with number density N) is derived from J m
by evaluation of Eq. (9) over the molecular
volume.
For macroscopic purposes, the total current
A density J of the preceding section is now speci-
fied by
J = J e + JM (13)
The component J e flows in conductors of resis-
tivity 1/ in accordance with Ohm's law:
1/Je = E (14)
The component JM is derived from M.
In the analysis of configurations involving
magnetic materials, the magnetic field H is a
convenient vector
FIG. 3. Magnetic flux tube.
H = B - 41TM (15)
and are endless, by virtue of Eq. (1). The mag- Then Eqs. (1) and (2) take the form
netic flux <P through a surface S, bounded by a
closed curve Q(Fig. 3), is given by 'V . H = - 41T'V . M (16)

<P = f dSB . n = f dQ· A (7)


'V X H =47Tle
At the interface between two magnetic ma-
(17)

terials, Eqs. (1) and (17) imply continuity of


where n is the normal to S. The flux tube de- the normal component of B and of the tangen-
fined by the field lines passing through Q con- tial component of H.
tains constant flux, independent of S. Across a sheet-current of density N per unit
At a large distance from a localized current length, the tangential component transverse to
distribution at r = 0, Eq. (6) gives the dipole J e of both Hand B undergoes an increment
potential 41TI~. The other components of Hand Bare
unaffected.
A= mX r (8) The field patterns set up by a magnetized
,3 sphere are illustrated in Fig. 4. The magnetic
induction (a) and magnetic field (b) are identi-
where cal outside the sphere, but differ inside it, be-

m=t I
cause of the magnetization (c). The same pat-
tern of magnetic induction could be generated
d 3 'lrl X J(rl) (9) in the absence of magnetization by a surface
current (d). The source of the magnetic field in
Equations in the Presence of Magnetic Ma- Eq. (16), that is to say the quantity 'V . M, is
terials A macroscopic current density JM can also referred to as the magnetic pole density.
be defined by local averaging of the microscopic The north and south polar regions are indicated
current density J m within magnetic materials in (c).
For weakly magnetic materials, Eq. (15) can
JM = (Jm)av (10) generally be written in terms of a scalar mag-
netic permeability 11
More conveniently, a magnetization vector M
can be introduced, where IlH =B (18)
JM = 'V X M (11) For ferromagnetic materials, one can still write
In experiments, only JM can be measured di- IlH =B - 41TMo (19)
rectly (via measurements on B), and M is then
uniquely derivable from Eq. (11) only with the where Mo is a permanent magnetization, but 11
added condition that it is to be a local state now depends on the time history as well as the
677 MAGNETIC FIELD

[2(r) = - f d3rl
iji' M(rl)
Ir - rl I
(22)

In the presence of current-carrying conductors


Eqs. (20) and (21) still hold in the region where
J c = 0, but Eq. (17) now implies a multivalued
potential

(23)
lo) ( b)

where the integral is taken around a loop en-


closing the total conductor current Ie. To keep
the potential single-valued, so that the solution
47TM
of Eq. (22) remains valid, one may adopt the
"magnetic-shell" approach: J e is replaced with
North ~ South an equivalent M, in analogy with Eq. (11).
p a l e . Pole Magnetic Force and Energy From Maxwell's
stress tensor, we find the volume force
(c) (d)

FIG. 4. Magnetic induction (a) and magnetic field f = - Ij ( B2)


- + -I (B . Ij) B (24)
8n 4n
(b) arising from sphere with magnetization pattern
shown in (c) or with surface current pattern shown = J XB
in (d).
which agrees with the summation of the Lorentz
forces on the moving charges composing J. The
magnitude of H. The typical relation between B "magnetic pressure" against a current sheet
and H for ferromagnetic materials is illustrated bounding a region of finite B (as in the Meissner
inFig.S. effect or ordinary skin effect) is thus B2 18n,
When J c is zero everywhere, one can define a evaluated at the surface. The force and torque
scalar potential [2, such that on a body localized in a nearly uniform field are
H = - 1j[2 (20) F=(m'Ij)B (26)
1j2[2 = 4nlj . M (21 ) N=mXB (27)
If the boundary condition on [2 is simply that From the microscopic point of view underly-
it vanish at infinity, the solution is ing Eq. (2), the magnetic energy density is
B2
w=- (28)
8n
B
In the presence of magnetic materials, one is
more interested in the electrical input energy
required to go from Bo to B, and this is given by

.::lw = ~ rB H' dB
4n J Bo
(29)

H For H =pB, with constant p, this becomes


1
.::lw = - - (B2 - B 2) (30)
8np 0

The derivation of Eqs. (28) and (29) depends


on the complete set of Maxwell's equations.
Units and Magnitudes The equations used
FIG. 5. As magnetic field H is initially raised to HI, here are based on the emu system. If the cur-
magnetic induction 8 rises to 8 1 , Cyclical pattern rents are expressed in amperes, they must be
shown is typical of ferromagnetic materials: 8 saturates divided by 10 to give magnetic inductions in
as H becomes large; 8 remains finite as H returns to gauss or fields in oersteds. A magnetic induc-
null; 8 goes to -8 1 as H goes to -HI' Double- tion of 10 kilogauss (one weber per square
valued ness of 8(H) is known as hysteresis. meter) can exert a maximum stress B2/8n of
MAGNETIC FIELD 678

about 4 atm, and contains an energy density of an elementary particle or a nucleus may take
0.4 joules/cm- 3 • on values which are integral multiples of fi/2.
The strength of the earth's magnetic induc- The nuclear equivalent of the Bohr magneton,
tion is about 0.2 gauss. For typical ferromag- based on a classical model of the proton as
netic materials, the maximum value of 47TM is a rotating sphere of charge is the nuclear
20 kilogauss; at this point of saturation, the magneton
permeability /1 approaches unity. At magnetic
fields below one gauss, permeabilities of 1000 /1N = 5.050824 X 10-27 J rl.
or more can be reached. The strength of ma-
terials limits the magnetic induction obtainable The actual intrinsic magnetic moments for
nondestructively with laboratory electromag- the free electron and proton are respectively
nets to peak values well below a million gauss. -1.001160 /1B and +2.792846 J1.N. The g
factor is a dimensionless measure of magnetic
HAROLD P. FURTH moment (in units of - /1B/2) which is tradi-
tionally used for electron-associated moments.
References A particle possessing both angular momentum
J and a proportional magnetic moment p.. = r1 =
Stratton, J. A., "Electromagnetic Theory," New York, rl1f will precess gyroscopically when placed in
McGraw-Hill Book Co., 1941. a magnetic field Bo in a manner similar to that
Jackson, J. D., "Classical Electrodynamics," New York, of a spinning top precessing about the Earth's
John Wiley & Sons, 1962. gravitational field. The proportionality constant
r is known as the magnetogyric ratio. The
Cross-references: ELECTRICITY, ELECTROMAG· precession rate in radians sec-I, the Larmor
NETIC THEORY, FERROMAGNETISM, FIELD frequency, may be derived by a classical calcu-
THEORY, MAGNETISM. lation to be
Wo =rBo. (1)
MAGNETIC RESONANCE
Coincidentally, an exact quantum calculation,
The magnetic resonance phenomenon is the using a Hamiltonian H =- p..' Bo , yields an
resonant interaction between an oscillating energy separation (Zeeman interaction) be-
magnetic field and an orthogonal static mag- tween adjacent energy levels of a particle with
netic field mediated by the presence of objects spin angular momentum quantum number I
possessing both angular momentum and a of Eo =fzwo. A total of 2I + I states with
magnetic moment. The objects in question are energies -fzwolz occur, I z being the z compo-
normally microscopic in character (molecules, nent (magnetic) angular momentum quantum
atoms, atomic nuclei or subatomic particles: number and taking on the values -I, -1+ I,
protons, neutrons, electrons, muons, etc.), al- ... , +1.
though historically the effect was first demon- This beautiful correspondence between the
strated with magnetized iron rods. quantum and classical descriptions is made even
In atoms and molecules, a magnetic moment stronger for isolated spins by the fact that a
may arise from the orbital motions of electrons, quantum dynamic calculation shows that the
in the same manner as an electric current flow- expectation values (Ix>, (Iy>, and (Iz> behave
ing in a loop of wire generates a dipole-like with time in precisely the same manner as the
magnetic field. According to quantum theory, components of angular momentum of a wholly
angular momentum for such orbital or rota- classical particle. Thus, we are entitled to for-
tional motions is restricted to values which are mulate and visualize descriptions of magnetic
integral multiples of -fi (-Ii = h/27T, where h = resonance phenomena in terms of classically
6.626176 X 10-34 1 s is the Planck constant). precessing vectors.
Thus, we may calculate the magnitude of the The vector description of magnetic resonance
magnetic moment /1 due to the orbital motion is due to Rabi. If a spin experiences simul-
of an electron in the smallest (n = I) orbit of taneously a static magnetic field and a field
the classical Bohr atom using Ampere's law rotating in a plane perpendicular to the static
(/1 = current X area) and the fact that the angu- field, resonance becomes possible. (In practice,
lar momentum L =mvr =nh. The result, linearly polarized transverse oscillatory fields
are used; these may be decomposed into two
J1.B = .!!!... =9.274078 X 10-24 1 T- 1 , counterrotating fields, one of which moves
along with the precessing spin, the effect of the
2m
other being unimportant.) The detailed analysis
where elm is the magnitude of the electron of this situation is conveniently accomplished
charge to mass ratio, is known as the Bohr by a coordinate transformation to a frame of
magneton, and is a convenient unit in which reference rotating synchronously with the
to denote atomic and molecular magnetic rotating field (Fig. 1). By convention, the
moments. z-axis of the laboratory and rotating reference
The spin, or intrinsic angular momentum, of frames are collinear with the applied static
679 MAGNETIC RESONANCE

Rotallng Frame
Laboratory Frame

.-.I'----'7--- - - - yLAB ~-------YROT

FIG. 1. Rotating coordinate transformation. The rotating reference frame rotates in


synchronism with the applied 81 RF magnetic field. Within the rotating frame, the 81
field is static, the original 80 field is replaced by the resonance offset field .:lB, and the
magnetization M precesses about the net field Be.

field Bo =BokLAB' The rotating field ampli- ing an effective field given by LlB = (wo - w)/'y,
tude is B I , and the laboratory and rotating which acts along kROT.
frame vectors are defined as Now the resonance effect becomes clear and
simple to analyze. Upon application of aBI field,
a net effective field Be =B 1 iROT + LlBkROT
results which is no longer parallel to the z-axis.
and The spin now precesses at rate rBe about the
BlkROT effective field direction (often denoted by an
effective field angle 0 = tan -I [B 1/ LlB]) and
respectively, w being the angular rotation may suffer large excursions in direction. When
frequency. the frequency of the applied BI field is far
Thus, in the absence of a BI field a spin with above or below Wo [Fig. 2(b) I the effective
true Larmor frequency Wo appears in the rotat- field direction is essentially the same as that of
ing frame to precess at Wo - w == Llw. In order the laboratory field, and no significant effects on
to preserve the form of the dynamical equa- the spin are observed. The maximum effect
tions upon transformation into the rotating occurs when the frequency of the applied BI
frame, we adopt the idea of the spin experienc- field exactly matches Wo [LlB -+ 0, the reso-

Fer Off R •• onenee

B,

(a) (b)

FIG. 2. Magnetic resonance-development of transverse magnetization. (a) On or near


:s
resonance (I.:lB I B I) the magnetization M may be nu tated from its thermal equilibrium
position along z into the xy plane (90 0 pulse), inverted (180 0 pulse), and so forth. (b)
When the applied RF is far from resonance (I.:lB I » B I) the magnetization is not signifi-
cantly perturbed.
MAGNETIC RESONANCE 680

nance condition, Fig. 2(a)]. In this case it is component for sufficiently large values of B I
possible to nutate the spin into the transverse near resonance).
direction and leave it there (a "900 pulse") or The Bloch equations also suggest ways of
invert its orientation entirely (a "1800 pulse," experimentally observing magnetic resonance
etc.). phenomena. The CW (continuous wave) tech-
The phenomenological description of the col- nique is based on the steady-state solution. The
lective behavior of a system of weakly interact- static transverse component of M in the rotat-
ing spins was provided by a set of equations ing frame corresponds in the laboratory frame
given by Bloch. Although they fail quantitatively to an oscillating transverse magnetization at the
in a number of specific instances (most notably frequency of the applied field. It may be ob-
for spin systems in solids or other strong-inter- served as a net power absorption by the sample
action situations), they offer an enormously or the appearance of a coupling between two
useful and generally applicable conceptual orthogonal transverse radiofrequency (RF) coils
framework: containing the sample as field and applied RF
are slowly brought into resonance according
dM x Mx to Eq. (l) (either field or frequency may be
-=r(MX B) - - swept). The transient response may be observed
dt x Tz
directly after a suitable (e.g., 90 0 ) pulse has
dMy My been applied. The resulting oscillatory decaying
- - = 'Y(M X B) - - (2) magnetization induces a voltage in a coil sur-
dt Y T2 rounding the sample (which may be the same
coil used to apply the pulse). This signal is
dM z = reM X B)z _ (Mz - Mo) . known as the free induction decay (FlO), and
dt TI contains components from all spins which were
significantly affected by the pulse. The CW
Basically, the Bloch equations describe the spectrum may be recovered by Fourier trans-
time behavior of the components of the magne- formation of the FlO (see FOURIER ANALYSIS).
tization M (summation over the individual The CW method is traditional for observation
microscopic magnetic moments) in the rotating of nuclear magnetic resonance (NMR) and is
frame. M 0 is the magnitude of the magnetiza- still almost exclusively employed for electron
tion in thermal equilibrium with the "lattice" spin resonance (ESR, or electron paramagnetic
(i.e., the remaining motational degrees of free- resonance, EPR). Over the past decade (1970s),
dom of the material in which the spin system pulsed, or Fourier transform, techniques have
resides) given by the Curie law gradually supplanted CW in chemical NMR
applications because of the higher potential
CBo sensitivity for a given measurement time (all
M 0- -
T ' (3)
- of the spectral information is "captured" in
one short time interval) and the convenience
C being the Curie constant and T the absolute with which computer-based signal averaging
temperature. In thermal equilibrium, the mag- may be adapted. (See Figs. 3-5.)
netization must be parallel to the static field The great preponderance of magnetic reso-
(transverse components zero). nance measurements are in NMR and ESR.
The cross-product terms in Eq. (2) represent Every chemical element has at least one observ-
the precessional or nutational behavior discussed able isotope. Common examples are listed
above. The remaining terms are damping terms in Table 1. ESR measurements may be made on
which describe the tendency of the system to any substance with net unpaired electron spin
return to thermal equilibrium. The time-depen- density. Examples are organic free radicals,
dent solution of the equations starting with an high-spin metal ions, conduction electrons in
arbitrary M yields a z-component Mz which metals, some doped semiconductors, as well as
decays exponentially toward Mo with a time systems with long-range collective interactions:
constant TI (the longitudinal or "spin-lattice" ferromagnetic, antiferromagnetic, and ferrimag-
relaxation time). The transverse components netic materials. The spontaneous polarization
Mx and My decay exponentially to zero with of ferromagnetic systems allows the occurrence
time constant T z (transverse or "spin-spin" of NMR and ESR without an external Bo field .
relaxation time). Longitudinal relaxation in- Several types of interactions affect NMR and
volves . exchange of energy with the lattice, ESR spectra. The presence of the Bo field in-
while transverse relaxation involves energy duces a circulation of electrons in materials
exchange between spins or dephasing of pre- which in turn gives rise to a small opposing field.
cessing components; hence the differentiation This field-proportional shielding effect depends
in relaxation times. It must always be true that sensitively upon the chemical environment and
TI #T2 • A low power (B I « ('Y 2 T I T2 )-1/2 orientation at the spin under observation. In
steady-state solution gives a nonzero transverse NMR the effect is called the chemical shift,
magnetization parallel to Be which may exhibit and is always reported with respect to a refer-
saturation (disappearance of the transverse ence compound of the isotope in question;
681 MAGNETIC RESONANCE

a
a
' H caD MHz eWI I:iC 12 2 . a;) MHI. c ro •• ~ol.rh.tlon) ~
l-Butllnol at , . , I(

( CH J,';» COH
-eH ,

O':-J ~
~
_ - -- - _ 200 ppm. 14!!i00 Hz) _ _ _ _ _~

- 1 - - -- -&ppm (3eo HzJ - - - - -_

b
1 3 C 17 5 . 43 MHz FT, protgn deCQupl ecU

~2aIl;Hz--_

FIG. 4. Ty~ical solid state Fourier transform NMR


spectra. (a) 1 C spectrum of tertiary butanol, employ-
ing proton decoupling [see Fig. 3(b)]. The widths and
_ _ __ _ _ ,eo ppm {12 . 1 IlriHd - _ _ _ _ _ _. . shapes of the lines are due to the anisotropy of the
chemical shift. (b) 2H spectrum of deuterated hexa-
FIG. 3. Typical solution NMR spectra. (a) Proton methylbenzene. The width and shape of this spectrum
60 MHz (Bo = 1.409 T) continuous wave spectrum of are due to the anisotropy of the quadrupole splitting
diethyl ether. Positions of spectral patterns are due to in teraction.
the chemical shift, line splittings within patterns are
due to J couplings. Integral of each spectral pattern is
proportional to the number of protons giving rise to
that resonance. (b) 13C 75.43 MHz (Bo = 7.046 T) of the same isotope (homonuclear dipole-
Fourier transform spectrum of ethylbenzene. Couplings dipole), between different isotoptes (hetero-
to protons are removed by continuous irradiation of nuclear dipole- dipole), between nuclei and
protons simultaneous with acqusition of carbon free unpaired electrons, or between electrons.
induction decay. Carbon-carbon couplings do not These interactions may occur directly through
appear because the low concentration of the isotope space, or may be "conducted" through chemi-
about 1% of 12C) makes the occurrence of 13C pairs cal bonds as a slight bias in spin polarization
rare. Solvent is deuterochloroform, CDCI3. Chemical (indirect or J coupling when speaking of NMR,
shift reference, tetramethylsilane (TMS), (CH3 )4Si, hyperfine coupling when speaking of couplings
has been added. to nuclei in ESR spectra).
Couplings between molecular rotation-induced
moments and nuclei or electrons can also occur
(spin-rotation coupling). These are usually most
typical ranges of chemical shifts vary from apparent in their effect on relaxation (see next
10 ppm (parts per million of field or fre- paragraph). Nuclei with I >! have nonspherical
quency) for the three hydrogen isotopes to charge distributions, and therefore interact with
250 ppm for carbon to several thousand ppm electric field gradients in materials. Quadrupole
for the heavier elements or for shifts arising couplings can range from zero to small pertur-
from conduction electrons in metals (these are bations on NMR spectra to values which com-
called Knight shifts). This exquisite sensitivity pletely dominate the nuclear Zeeman energy.
to chemical effects has made NMR one of the In the latter case, the quadrupole interaction
most powerful and ubiquitous techniques of can serve as the source of nuclear polarization
chemical analysis and research. Shiel dings in rather than the Bo field, making possible "zero-
ESR are reported as g values, which range from field" NMR (normally called nuclear quadrupole
roughly 1.9 to 2.2. Shifts arising from bulk resonance, NQR).
magnetic susceptibility are also observed. All of these interactions may be time depen-
Interactions also occur between all the mag- dent due to atomic or molecular motions, or
netic dipoles in systems, such as between nuclei due to natural or experimentally induced mo-
MAGNETIC RESONANCE 682

. - (9.535 GHz, X-bend)

" ~'.- I
H H

--~

__- - -- -- - - -- -- -- 20 Gau •• - - - - - - - - - - -- -_ _

FIG. 5. Typical solution ESR spectrum. Most ESR spectra are obtained by a con-
tinuous wave method in the derivative mode. Spectrum of parabenzosemiquinone radical
anion in alkaline ethanol. Line splitting (hyperfine coupling to the four protons) is 2.368
gauss, g-value is 2.005.

TABLE 1. MAGNETIC RESONANCE DATA FOR SELECTED PARTICLES.

Larmor Electric Relative


Magnetic Frequency Quadrupole Natural Relative Receptivity
Moment in 2.3487-T Moment,b Isotopic Receptivity at Natural
Particle Spin in Units of .uNa Field, MHz X10-28 m 2 Abundance per Partic1ec Abundance d

e 1
;: 1.7340593.uB 65,821.07 0 1.000 2.08 X 10 8 2.08 X 108
.u 1
;: 1.7340706.u.u 318.33 0 0 32.3 0
n 1
"2 -3.313670 68.51 0 0 0.322 0
1H .!
2 4.873505 100.00 0 1.000 1.000 1.000
2H 1.2125 15.35 2.73 X 10-3 1.5 X 10- 4 9.65 X 10-3 1.45 X 10-6
3H 1
;: 5.1595 106.66 0 0 1.21 0
13c 1
;: 1.2166 25.15 0 1.11 X 10-2 1.59 X 10-2 1.76 X 10- 4
14N 1 0.5706 7.22 0.016 0.996 1.01 X 10-3 1.01 X 10-3
1SN 1
;: -0.4900 10.13 0 3.7 X 10-3 1.04 X 10-3 3.85 X 10-6
19F 1
;: 4.5509 94.08 0 1.000 0.833 0.833
23Na 3
;: 2.8610 26.45 0.12-0.15 1.000 9.25 X 10-2 9.25 X 10-2
29Si 1
"2 -0.9612 19.87 0 4.70 X 10-2 7.85 X 10-3 3.69 X 10- 4
31p 1
;: 1.9581 40.48 0 1.000 6.63 X 10-2 6.63 X 10-2
43Ca 7
"2 -1.4914 6.73 0.2 ± 0.1 1.45 X 10-3 6.40 X 10-3 9.28 X 10-6
121Sb 5
;: 3.9537 23.93 -0.5 to -1.2 0.573 0.160 9.17 X 10-2

aMagnitude of magnetic moment vector .u = 'Yff'[J(J + 1»)112. Positron and muon moments in units of their
respective magnetons. Values for all nuclei except 1 H are those observed in specific chemical compounds, uncor-
re 6ted for shielding.
Electric quadrupole moments are often known with only poor accuracy, due to certain assumptions which
must be made in their experimental determination, and due to conflicting results from different experimental
techniques. The table values represent a range from several sources.
cApproximate detection voltage signal to noise ratio, assuming equal Bo fields, line shapes, relaxation times,
measurement bandwidths, and total measurement noise, relative to protons; proportional to w03J(J + 1).
dproduct of natural isotopic abundance and relative receptivity per particle.
683 MAGNETISM

tions of some of the magnetic dipoles in the MAGNETISM


system. Depending on the type of interactions,
such motions can be the source of relaxation Magnetization Magnetic fields are produced
(e.g., Tl and Tz in the Bloch equations) between both by macroscopic electric currents and by
specific types of spins or between spins and the magnetized bodies. The first observed mani-
lattice. festations of magnetism were the forces be-
Aside from the chemical and physical research tween naturally occurring permanent magnets,
and measurement, other applications of mag- and between these and the earth's field. North-
netic resonance include measurement of mag- and south-seeking poles could be identified.
netic fields (proton magnetometers) and the Poles were observed to be localized near the ends
use of ferrites (high RF resistivity ferrimagnetic of long rods magnetized by contact with natural
ceramics of general formula MOFez03, M a magnets or by a current-carrying coil. From the
divalent cation) as magnetic field-controllable observed attraction and repulsion of unlike and
microwave switches, phase shifters, attenuators, like poles with an inverse square law came the
filters, circulators and isolators. concept of pole strength and the definition of
Most recently, magnetic resonance has been the unit pole, that which acts on another in
employed for noninvasive clinical imaging of vacuum with a force of one dyne at a distance
the human body (zeugmatography). Lauterbur of one centimeter. The unit magnetic field, the
first demonstrated that the NMR spectrum of oersted, could then be defined as that in which
a compound with a single spectral line (e.g., a unit pole experiences a force of one dyne.
protons in intracellular water) is the projection The magnetic moment of a long, uniformly
of the distribution of the compound in the magnetized rod of length I with a pole strength
object if a linear Bo field gradient, rather than of m unit poles at each end is defined as ml, the
a uniform Bo field, is applied across the object. largest couple that the sample can experience
Multiple projections in several directions may in a field of one oersted. The magnetization, M,
be used to reconstruct an image using algorithms is defined as the magnetic moment per unit
similar to those employed in x-ray or emission volume, mllal, where a is the cross-sectional
computed tomography. area, and is thus also equal to the pole strength
The formal description of magnetic resonance per unit area, mla.
given above has been applied (by Feynman, Magnetic Induction The induction, or flux
Vernon, and Hellwarth) to the generalized two- density, B, is numerically equal to the field H in
level quantum system, and has proved extremely free space and is described as one line of flux
useful in understanding coherence phenomena per square centimeter for a field of one oersted.
in microwave, infrared and optical spectros- Its direction is that of the force on a unit north
copies. pole. If magnetic material is present, the flux
density is equal to H + 47TM, since 47T lines of
JEROME L. ACKERMAN force emanate from the unit pole at each end of
a dipole equivalent to a specimen of unit mag-
References netization. Magnetic poles are observed to occur
in pairs. Lines of B are continuous, i.e., div B =O.
Carrington, A., and McLachlan, A. D., "Introduction If a material becomes strongly magnetized in a
to Magnetic Resonance with Applications to Chem- small field, the lines of flux can be considered
istry and Chemical Physics," New York, Harper and to crowd into the material, leaving their original
Row, 1967. locations and reducing the field there. This is
Slichter, C. P., "Principles of Magnetic Resonance," how magnetic shielding is accomplished. Changes
Berlin, Springer-Verlag, 1978. of B within a coil induce voltages which can be
Abragam, A" "The Principles of Nuclear Magnetism," measured and form the basis of galvanometer
Oxford, Oxford Univ. Press, 1961. and fluxmeter measurement methods. For a
Becker, E. D., "High Resolution NMR," New York, coil of N turns of cross-sectional area a, in
Academic Press, 1980. which the flux is changing at dBldt gauss/sec, E
Helszain, J., "Principles of Microwave Ferrite Engi- in volts is given by
neering," London, Wiley-Interscience, 1969.
Kaufman, L., Crooks, L. E., and Margulis, A. R. (Eds.), dB
E=-IO-8Na-
"Nuclear Magnetic Resonance Imaging in Medicine," dt
New York, Igaku-Shoin, 1981.
Lee, K., and Anderson, W. A., "Nuclear Spins, Mo- Forces on magnetic bodies in field gradients are
ments and Magnetic Resonance Frequencies," in proportional to M.
"The Handbook of Chemistry and Physics," (R. C. Types of Magnetic Behavior In general a field
Weast, Ed.), Cleveland, The Chemical Rubber Com- H will produce a magnetization M in any ma-
pany. terial. If M is in the same direction as H, a
Steinfeld, J. I., "Molecules and Radiation," New York, sample will be attracted to regions of stronger
Harper & Row, 1974. field in a field gradient. It will be repelled if M
is in the opposite sense. This experiment, as
Cross-references: COHERENCE, ELECTRON SPIN, first performed by Faraday, is the basis for the
FERRIMAGNETISM, FOURIER ANALYSIS, LA- broad classification of materials into paramag-
SERS, MAGNETISM, MASERS, RESONANCE. - netic, diamagnetic, and ferromagnetic. The sus-
MAGNETISM 684

ceptibility K is defined asM/H. The force Fx on Atomic Magnetic Moments There are two
a small specimen of volume v in a field Hy and a possible sources for the moments of individual
field gradient dH y/dx is atoms. They are electron orbital motion and
electron spin (see ELECTRON SPIN). In most
dH y ferromagnetic materials, most of the moment
Fx = (K2 - KdvHy-- comes from spin rather than orbital motion, a
dx
fact that is revealed experimentally by gyromag-
where K2 and Klare the volume susceptibilities netic measurements (see FERROMAGNETISM)
of the specimen and the surrounding medium, and by magnetic resonance experiments (see
usually air. MAGNETIC RESONANCE). Orbital motions are
For paramagnetic materials, K is small and quenched by the electric fields of the neighbor-
positive, usually between 1 and 1.001 at ordinary ing atoms in the crystal lattice. In the rare earth
temperatures. These substances contain atoms metals the unfilled shell is deep within the
or ions with at least one incomplete electron atom, orbital motion is not quenched, and the
shell, giving them a non-zero atomic or ionic orbital contribution to the magnetic moment is
magnetic moment Ila. Many salts of the iron- observed. The unit of atomic moment is the
group and rare-earth metals are paramagnetic, Bohr magneton, IlB' which is the moment asso-
as are the alkali metals, the platinum and ciated with one electron spin, numerically equal
palladium metals, carbon, oxygen, and various to 0.9274 X 10-20 erg/oersted. The spin
other elements. Antiferromagnetic substances quantum number, S, is one-half the number of
also have small positive K, as do ferromagnetics unpaired electrons. The moment per atom is
above their Curie temperatures. In the classical SgllJ3 where g is the gyromagnetic ratio, close
theory of paramagnetism, the orientations of to 2 for most materials. The moment in Bohr
the moments are considered to be initially magnetons of an isolated atom or ion of the
thermally randomized in space. An applied first transition series is equal to the number of
field produces a net magnetic moment in its unpaired d electrons, considering the first five
direction, as described by the classical Langevin electron spins to have one orientation and the
function next five the opposite (Hund's rule). The Ni++
ion, with eight d electrons, has the expected
M = coth (lla H)_ kT moment of 21lB in ferrites, in which the ionic
Ms kT llaH spacing is great enough so that the d levels are
not disturbed (see FERRITES). In metallic nickel,
where k is the Boltzmann constant. Ms is the however, the d levels overlap considerably, and
value of M attained for very large H/T. Under the moment corresponds to only 0.6 IlB per
most conditions, only the initial portion of this atom . Similarly the Bohr magneton numbers
curve is observed, with the corresponding con- for metallic iron and cobalt are 2.2 and 1.7
stant K. The conduction electrons at the top of respectively.
the Fermi distribution in a metal can also give Ferromagnetism Ferromagnetism can only
rise to a temperature-independent Pauli para- occur in a material containing atoms with net
magnetism. The quantum-mechanical analogue moments. Also, quantum-mechanical electro-
of the Langevin function is called the Brillouin static "exchange" forces must be present, hold-
function (see PARAMAGNETISM). . ing neighboring atomic moments parallel below
For diamagnetic materials, K is small and the Curie temperature. These are much greater
negative. Diamagnetism is a universal phe- than the Lorentz force due to the average mag-
nomenon but is often masked by paramagnetic netization and are, in fact, equivalent to an
or ferromagnetic effects. Net diamagnetic be- effective field on the order of 106 oersteds.
havior is observed in a number of salts and Such an effective "molecular field" was postu-
metals, and in the rare gases, in which there is lated in 1907 by Weiss in extending the Langevin
no net moment. The effect can be regarded as theory of paramagnetism to include ferromag-
the operation of Lenz's law on an atomic scale netic behavior. The Langevin function predicts
(see DIAMAGNETISM). a temperature dependence of magnetization,
Ferromagnetic materials show a value of M for small M, of
which may be of the order of 10 3 in small fields.
Thus Y. can be very large. It is common to CH
describe their properties in terms of the perme- M=-
T
ability Il = B/H. Since M saturates in ordinary
fields, K and Il are not constant. M is not neces- where C is a constant. The susceptibility is then
sarily in the same direction as H, so K and Il are, CIT, which is Curie's law. Weiss pointed out
in general, tensors. Furthermore, ferromagnetics that if the field H were augmented by an addi-
generally exhibit hysteresis in the dependence tional field NM proportional to the magnetiza-
of M on H, and the details are very structure- tion, the temperature dependence became
sensitive. Still another distinction is the rather
abrupt disappearance of ferromagnetism at a CH
characteristic temperature, the Curie tempera- M=--
ture, Te. T- Te
685 MAGNETISM

where Te = NC. This is the Curie-Weiss law, aries move considerable distances, they experi-
approximately obeyed by ferromagnetic sub- ence a net drag from impurities and irregularities
stances above their Curie points. Below T e , the in the material, causing hysteresis in the de-
presence of the molecular field produces an pendence of B on H. There will in general be a
alignment of the atomic moments correspond- remanence B r , the flux density remaining after
ing to the spontaneous magnetization Ms even saturation when the field is reduced to zero,
when no external field is present. However, and a coercive force He, the reverse field re-
ferromagnetic samples can have any net ex- quired to reduce the flux density to zero. A
ternally measured value of magnetization, in- loss associated with the irreversibility of mag-
cluding zero, which seems to contradict this netization changes also occurs in rotating fields.
result. Weiss therefore postulated the existence This loss becomes zero in very large fields, ex-
of domains separated by boundaries. In each cept in a few special cases.
domain the atomic moments are parallel, the Even in very slowly changing fields, a wall
domain magnetizations having different orienta- characteristically moves in jumps, each giving a
tions. The net external magnetization is then sudden change in B. This irregularity has been
the vector sum of the domain magnetizations known for a long time as the Barkhausen effect,
and can be varied by a rearrangement of the and its physical origin is the irregularity of wall
domain structure, which may happen in very motion through various inhomogeneities in the
small applied fields. This prediction has been material. Usually a very large number of these
completely verified by experiment. The motion small jumps takes place. In special circumstances,
of domain boundaries as observed under the however, the material may remain at B r , until,
microscope has been directly correlated with in a sufficiently large field, a single wall will be
external changes in magnetization. Domain nucleated and sweep all the way across the
boundaries in iron are on the order of 1000 A specimen, leaving it at Br in the other direction.
thick. Within a boundary, neighboring mag- Such a materIal has only two stable states, +Br
netic moments are not quite parallel. The and - Br , a useful behavior in some applications.
change in orientation of the magnetization Direct microscopic observation of domains,
from one domain to another is distributed e.g., by the Faraday effect or the magnetic Kerr
through the thickness of the boundary. effect, is an important research tool. Under-
Within a domain, the magnetization will in standing and control of domain structures has
general preferentially lie along some particular progressed to the point that under appropriate
crystallographic direction. The energy difference conditions large numbers of tiny cylindrical
between magnetization in the easiest and hard- domains can be deliberately produced and con-
est direction may exceed 108 erg/ cm 3 . This trollably moved in certain single-crystal ma-
anisotropy is described in an appropriate trigo- terials, enabling the development of memory
nometric series with coefficients K j • Usually devices utilizing this ability.
only a few terms are necessary. Often a material It is also necessary to consider the behavior
is described by a single K; this implies a uni- in rapidly varying fields, discussed below.
axial anisotropy energy of the form K sin 2 (J. Antiferromagnetism Exchange forces can
The Ki may pass through zero and change sign operate to hold neighboring moments anti-
with changing composition or temperature. Al- parallel, rather than parallel. Materials whose
though such details cannot in general be pre- magnetic moments are arranged in this way
dicted, the magnetocrystalline anisotropy will show no external permanent moment and are
have the same over-all symmetry as the crystal called antiferromagnetic. The sign of the ex-
structure. Anisotropy is best investigated in change force may depend, among other things,
single crystals, by analysis of magnetization on the atomic spacing. Metallic manganese, for
curves in various directions or from the rela- example, is antiferromagnetic, while many alloys
tionship between the measured torque and the of manganese, in which the average Mn-Mn dis-
direction of the applied field (see FERROMAG- tance is greater, are ferromagnetic. In some anti-
NETISM). Dimensional changes are also asso- ferromagnetic compounds, the exchange inter-
ciated with the position of the magnetization action appears to be of a next-.nearest-neighbor
vector relative to the lattice (see MAGNETO- type, taking place through an intervening atom
STRICTION). such as oxygen. This type of interaction is
There are two mechanisms available for chang- termed superexchange. Antiferromagnetic ma-
ing the externally measured magnetization of a terials, having no net external moment, show
ferromagnetic material: domain boundary mo- small positive susceptibilities that reach a maxi-
tion, and domain magnetization rotation. mum at the temperature above which the ex-
Broadly speaking, in magnetically soft materials, change forces can no longer hold the moments
boundary motion accounts for most of the aligned against thermal agitation. This tempera-
changes in low applied fields, leaving the mag- ture, TN, the Neel temperature, corresponds to
netization in each domain in the easy direction the Curie temperature of a ferromagnet. Mag-
nearest the applied field. Then rotation against netocrystalline anisotropy exists for anti-
anisotropy produces the remaining change in ferromagnets just as for ferromagnets (see
higher fields. In very low fields, boundary mo- ANTIFERROMAGNETISM ).
tion is practically reversible, but when bound- Ferrimagnetism With more than one type of
MAGNETISM 686

magnetic ion present, in certain compounds, crystal lattice but are randomly packed as in
antiferromagnetic coupling may lead to a net a glass. The saturation magnetizations and Curie
external moment corresponding to a Bohr mag- temperatures of these materials, although
neton number equal to the difference in ionic generally somewhat less than those of their
moments. Other more complicated cases occur. crystalline counterparts, are substantial. The
Ferrites, insulating oxides with the spinel struc- demonstration of ferromagnetism in a glassy
ture, are important examples of this class of metallic structure is of great fundamental
material, called ferrimagnetics (see FERRIMAG- interest. Furthermore, in such a structure there
NETISM). is no macroscopic magnetic anisotropy. As a
Exchange Anisotropy A ferromagnetic phase result, magnetization changes can take place
may be in exchange coupling with an antiferro- readily in small fields. Thus these materials
magnetic phase, as in a cobalt particle covered can show high permeabilities and narrow hys-
with CoO. This leads to new phenomena, in- teresis loops, giving them potential usefulness in
cluding non-vanishing high-field rotational hys- various devices.
teresis. Such a material cooled in a field through Permanent Magnets A useful permanent
the Neel temperature, if Tc > TN, may exhibit magnet material should have as large a hysteresis
a hysteresis loop that is permanently displaced loop as possible. In the early magnet steels, wall
from the origin. This is equivalent to a unidirec- motion was made difficult by a heterogeneous
tional (not uniaxial) anisotropy and will appear alloy structure. A different approach is based
in a torque curve as a sin () term. Ferromagnetic on the theory that sufficiently small particles
and antiferromagnetic regions in a single-phase should find it energetically unfavorable to con-
alloy may also lead to these effects. tain domain boundaries. The critical size is
Other Configurations Atomic moments need proportional to KI/2/Ms . Reversal must then
not necessarily be either parallel or antiparallel. proceed by the difficult process of rotation
In a few materials they may be arranged in against shape, strain-magnetostriction, or crystal
a triangular or spiral configuration. In some anisotropy. Fine-particle ("'IOOOA) iron and
circumstances, an antiferromagnetic material iron-cobalt materials utilizing shape anisotropy
may shift to a configuration having a large have been developed. The Alnico permanent
ferromagnetic moment in the appropriate com- magnet alloys have very fine precipitate struc-
bination of fields and temperatures (metamag- tures and are probably also best regarded as
netism). fine-particle materials. A magnetic oxide, BaO .
Rare Earths The rare earth elements have 6Fe2 03, utilizes magnetocrystalline anisotropy
magnetic moments originating from unpaired in fine-particle ("'1 f.lm) form. A new class of
electrons in the 4f shell. These electrons are permanent magnet materials based on Cos -(rare
close to the nucleus and are shielded by the 5 s earthh intermetallic compounds shows by far
and 5p electrons. Thus direct exchange does the highest permanent magnet properties of any
not occur in the rare earths. However, several material. These originate in the extremely high
of them exhibit ferromagnetism at low temper- magnetocrystalline anisotropy of these ma-
atures, originating in indirect exchange via the terials.
three 5 d-6s conduction electrons. The atomic Thin Films Since a surface atom's surround-
moments can be large in the heavy rare earths, ings are different from those in the interior, the
in which the spin and orbital moments add. magnetization and Curie temperature of thin
In fact, Dy and Ho have a moment per unit films should yield important information about
volume, at low temperatures, half again as large the range of ferromagnetic interactions. Ex-
as that of iron. The rare earths often exhibit perimental difficulties, primarily with purity,
complex magnetic ordering structures. have beclouded the subject to some extent, but
The rare earths form many solid solutions it now appears that any surface layer on nickel
among themselves, in which the variation of having substantially different magnetic proper-
moment and Curie temperature have been in- ties from the bulk cannot be more than a few
vestigated. They also form many intermetallic Angstroms thick.
compounds with other elements. Often a rare There have been many investigations of flux
earth and another element will form several reversal in films, usually vapor-deposited on
discrete binary compounds, sometimes showing glass, which have been motivated by computer
extraordinary magnetic properties. A number technology needs. Such films show a uniaxial
of compounds, including RCo s , have extremely anisotropy associated with fields present during
high magnetocrystalline anisotropy. TbFe2 and deposition or sometimes with geometric effects
DyFe2 exhibit the highest magnetostrictive such as the angle of incidence of the vapor
strains known, on the order of 1%, thousands beam.
of times higher than values typical of other Dynamic Behavior of Ferromagnetic Materials
materials. Changes in flux in a conductor induce emf's re-
Amorphous Materials It is possible by rapid sulting in current flows whose fields tend to
quenching from the molten state to prepare oppose the change in flux. For various time
metallic samples whose atomic structure is rates and geometries these can be calculated,
amorphous. The atoms are not arranged in a leading to expressions for phase relationships
687 MAGNETO·FLUID-MECHANICS

and skin depth in conductors (see ELECTRO- Morrish, A. H., "The Physical Principles of Magne-
MAGNETIC THEORY). These expressions have tism," New York, John Wiley & Sons, Inc., 1965.
often been applied to magnetic materials at Berkowitz, A. E., and Kneller, E. (Eds.), "Magnetism
power frequencies by simply replacing H by B. and Metallurgy," New York, Academic Press, 1969.
This is in general not a good approximation and Nesbitt, E. A., and Wernick, J. H., "Rare Earth Per-
leads to erroneous results. It is more nearly manent Magnets," New York, Academic Press,
correct to recognize that highly localized eddy 1973.
currents around moving domain boundaries are Bobeck, A. H., Bonyhard, P. I., and Geusic, J. E.,
the entire source of loss under these condi- "Magnetic Bubbles-An Emerging New Memory
tions. For a given dB/dt, the loss calculated in Technology," Proc. IEEE 63,1176-1195 (1975).
this way is much greater, decreasing to the Wohlfarth, E. P. (Ed.), "Ferromagnetic Materials,"
classical value as the density of domain bound- Vols. 1 and 2, New York, North-Holland Publishing
aries increases. Co., 1980.
In bulk metals, domain wall velocities are
usually determined by the damping associated Cross-references: AMORPHOUS METALS, ANTI-
with local eddy currents. In ferrites and thin FERROMAGNETISM, DIAMAGNETISM, FERRI-
films, other types of damping may predominate. MAGNETISM, FERROMAGNETISM, MAGNETIC
These and many other aspects of the dynamic FIELDS, MAGNETIC RESONANCE, MAGNETOM·
behavior of magnetic materials of all types have ETRY, RARE EARTHS, THIN FILMS.
been investigated through resonance methods
(see MAGNETIC RESONANCE).
Superparamagnetism For particles whose vol-
ume v is on the order of 10- 18 cm 3 or less, the MAGNETO-FLUID-MECHANICS
direction of the entire particle moment Msv
may fluctuate thermally. An assembly of such Magneto-fluid-mechanics is the subject that
particles will exhibit the Langevin function deals with the mechanics of electrically con-
magnetization curve of a paramagnetic with the ducting fluids (such as ionized gases and liquid
extremely large moment Msv; thus it may be metals) in the presence of electric and magnetic
easily saturated with ordinary fields and tem- fields. Magneto-hydrodynamics is another name
peratures. Such magnetization curves can be used extensively, but it suffers from the less
used to study particle sizes and size distributions.
general meaning of the words "hydro" and
Magnetic Bubbles A remarkable application "dynamics." Other names used are: magneto-
of the principles of domain structure has been
hydro-mechanics, magneto-gas-dynamics, mag-
realized in the fabrication of computer memory neto-plasma-dynamics, hydromagnetics, etc.
components utilizing tiny cylindrical magnetic
The fundamental assumptions underlying
domains. In thin monocrystalline plates of a
magneto-fluid-mechanics are those of contin-
material such as a garnet or in thin amorphous
uous media. In this respect, magneto-fluid-
films of certain alloys, having an appropriate mechanics is related to plasma physics (see
com bination of magnetization and anisotropy, PLASMAS) in the same way that ordinary fluid
it is possible to establish stable cylindrical mechanics is related to the kinetic theory of
magnetic domains passing completely through
gases. More specifically, such phenomenological
the material. These domains, universally referred coefficients as viscosity, thermal and electrical
to as bubbles, can be generated, erased, moved, conductivities, mass diffusivities, dielectric con-
and sensed by overlays of conducting strips and stant, etc., are assumed to be known functions
Permalloy guide patterns. Each bubble is a "bit" of the thermodynamic state, as derived from
of information. The dimensions of these bubbles microscopic considerations or experiments.
and the associated patterns are on the order of From electromagnetic theory, we know that
a few microns. The storage density and access the "Maxwell stresses" give rise to a body force
times of these memories appear to fit them to made. up of the following components: elec-
an important range of applications. (See FER- trostatic (applied on a free electric space
RIMAGNETISM. ) charge); ponderomotive (the macroscopic sum-
JOSEPH J. BECKER mation of the elementary Lorentz forces ap-
plied on charged particles); electrostrictive
References (present when the dielectric constant is a func-
tion of mass density); a force due to an
Bozorth, R. M., "Ferromagnetism," New York, Van inhomogeneous electric field and its magnetic
Nostrand Reinhold, 1951. counterpart; and the magnetostrictive force. For
Kneller, E., "Ferromagnetismus," Berlin, Springer- any fluid the last two forces are negligibly small
Verlag, 1962. at normal temperatures, whereas the ones as-
Rado, G. T., and Suhl, H. (Eds.), "Magnetism," New sociated with the behavior of the dielectric con-
York, Academic Press. Vol. 1,1963; Vol. I1A, 1965; stant, although normally small, are of the same
Vol. lIB, 1966; Vol. III, 1963; Vol. IV, 1966. order of magnitude as the buoyant forces under
Chikazumi, S., "Physics of Magnetism," New York, certain conditions. On the assumption that we
John Wiley & Sons, Inc., 1964. deal with electrically neutral but ionized fluids,
MAGNETO-FLUID-MECHANICS 688

the only substantial force that remains is the Nondimensional Parameters and Some Im-
ponderomotive force. Indeed, what is today portant Theorems In order to study the nature
called magneto-fluid-mechanics deals almost ex- of the solutions as they emerge from different
clusively with this force. problems, we shall form a number of non-
Fundamental Equations The equations that dimensional parameters that can be extracted
govern magneto-fluid-mechanics are the follow- from the different equations. The order of
ing: magnitude of the inertia force per unit volume
(1) Equation of conservation of mass, which is given by p V 2 IL where p is the mass density,
is the same as in ordinary fluid mechanics. V the velocity, and L a characteristic length;
(2) Equation of conservation of momentum, the order of magnitude of the ponderomotive
which is altered by the forces enumerated force J X B, after using Ohm's law, is equal to
above. In particular, the ponderomotive force OB2 V. Also, the order of magnitude of the
per unit volume is given by J X B where J is the viscous force is: f..l VIL 2 • The ratio of the typical
vector current density and B the magnetic inertia force over the viscous force is called the
induction, both measured in the laboratory. Reynolds number (Re) and, from the above, is
(3) Equation of energy conservation; the found to be: Re = p VLlf..l. The ratio of the
same as in ordinary fluid mechanics with the ponderomotive force over the inertia force is
addition of the Joulean dissipation E'· J'. The given by ~ = oB 2 LlpV. The ratio of the pon-
primes indicate that the electric field and cur- deromotive force over the viscous force is equal
rent density are measured in a frame of refer- to (Re) . ~ and is defined in the literature as the
ence moving with the fluid. In the nonrela- square of the "Hartmann number.," denoted by
tivistic case and for zero space charge, we have M. We have M = BL Val";;;.
E' = E + q X Band J' = J . The barycentric The distortion of the magnetic field due to
stream velocity is indicated by q. the hydrodynamic field can be studied best
(4) Equation describing the thermodynamic with the help of the following two equations:
state.
(5) Conservation of electric charge.
(6) Ampere's law. -dQ =(Q • V)q + -f..l V2 Q
(7) Faraday's law. dt P
(8) Statement that the magnetic poles exist
in pairs only. dH I
(9) Ohm's phenomenological law . - =(H' V)q+ - V2 H
(10) Constitutive equations linking the elec- dt Of..le
tric field with the displacement vector and the In the above f..le is the magnetic permeability
magnetic intensity field with the magnetic and H the magnetic field intensity. The first
induction. Equations (1) to (3) are the conser- equation describes the diffusion of vorticity
vation equations. Equations (5) to (8) are Q = V X q, whereas the second can be obtained
Maxwell's equations. For a large number of by a combination of Ampere's and Ohm's laws
problems, the phenomenological coefficients of after elimination of the electric field by using
electrical and thermal conductivity, viscosity and Faraday's law. In the ordinary fluid mechanic
the like are assumed to remain unaffected by the case, the streamlines obtained after solving the
magnetic field. This implies that the collision fre- inviscid problem are distorted in regions of high
quencies among the particles are much higher vorticity through the mechanism of viscosity
than the cyclotron frequency associated with the (last term in first equation). Similarly the mag-
property a charged particle has to rotate around netic field calculated in the case of ideal, non-
a magnetic line under the influence of the dissipative flow with 0 =00 is distorted by the
Lorentz force. This means that the transfer of finite electrical conductivity (last term in sec-
electric charge, mass, momentum, and energy is ond equation). The nondimensional number
not realized in a preferential direction. describing the influence of viscosity is the
Physically, the magneto-fluid-mechanic system Reynolds number, and in perfect analogy as
of the above equations is coupled in the follow- indicated by the above two equations, the
ing sense: A velocity field q cutting magnetic magnetic field distortion is described by the
lines of flux B gives rise to an induced current number, (Re)m = f..le0VL and is called the mag-
whose magnitude is given by J = a(q X B). At netic Reynolds number. When (Re)m is zero,
the same time the fluid feels an induced body the magnetic lines remain undisturbed, whereas
force equal to J X B. On the other hand, in the limit (Re)m -+ 00 , the magnetic lines are
the electric currents induced by the motion frozen into the fluid in exactly the same way
create, according to Ampere's law, a magnetic that vorticity is frozen according to Helmholtz's
field which distorts the original applied mag- theorem. Mathematical similarities apart, the
netic field. The basic mechanism of this distor- freezing of the magnetic lines with the motion
tion is the one created by the irreversibility in- is evident in the case of 0 -+ 00 from the follow-
troduced by the finite electrical conductivity, ing physical considerations: An observer moving
the same way that the distortion of the inviscid with the barycentric (stream) velocity in a me-
streamlines in ordinary fluid mechanics takes dium of infinite electrical conductivity can mea-
place by the action of viscosity. sure only a zero electric field and hence he does
689 MAGNETO-FLUID-MECHANICS

not cut magnetic lines, which means that the small velocities, the interaction is too weak to
magnetic lines must move along with his speed. be useful. However, in the laboratory, with a
From this argument it also follows that the total mutually perpendicular magnetic field, flow,
change in the magnetic flux through a given and induced current density fields, a large inter-
surface moving with the stream must be zero action is possible when hot ionized gases are
for an infinitely conducting medium. Finally, used in conjunction with strong magnetic fields.
the remark should be made that except for This area of research is called magneto-hydro-
stellar and interspace applications where the dynamic power generation, and its popularity
velocities and (especially) characteristic lengths emerges from the fact that mechanical energy
are high, (Re)m is a small number. On the other can be converted to electrical without ther-
hand, the assumption (Re)m = 0 is a rather dras- mally stressed rotating parts. As a consequence,
tic one since it permits the uncoupling of higher temperatures can be imparted to the
Maxwell's equations from the conservation working medium with better thermal effi-
equations. In thermonuclear plasma physics, ciencies. This scheme, under development now,
much of which is analyzed by MHD models, seems to be limited by losses due to heat
the approximation of frozen field lines is very transferred from the hot gas to the out-
useful , especially in dealing with phenomena side, corrosion of the electrodes, and Hall
which occur over brief time intervals, such as current losses. (When the gyrofrequency of the
in MHD instabilities. ionized particles is high compared to their col-
For the calculation of the ponderomotive lisional frequency, the particles drift in a direc-
x
force we can use Ampere's law ell H =J) to tion parallel to the flow, and as a result, the
find that J X B =(IJ X H) X B. Through regular current to be collected by the electrodes in
vector operations, we can show that (IJ X H) X the direction perpendicular to the flow, di-
B = - grad (B 2 /2J.1.e) + div (BB/J.l.e). One can minishes. The Hall effect can be turned to some
identify the last term as representing a ten- advantage if it is designed to be substantial and
sion equal to B2/2J.1.e acting along the lines of if the current in the direction of flow is the one
force, whereas the first one corresponds to an to be collected.)
equivalent hydrostatic pressure equal to B2 /2J.1.e. One of the earliest astrophysical applications
This term is frequently called "magnetic pres- of magneto-fluid-mechanics lies in the area of
sure" and in different problems is found to solar physics and in particular the sunspots.
behave precisely as the static pressure does. Fur- Sunspots were seen and studied for the first
thermore, one can show that if the magnetic time with the help of a telescope by Galileo
lines are lengthened, the magnetic field intensity about 1610. Three hundred years later, Hale
is increased . discovered, through the Zeeman effect, that
Consider now the propagation of small distur- the magnetic field in the sunspots is very
bances in the form of acoustic waves for which high (of the order of several thousand gauss).
the speed of sound for an ideal gas is propor- It was, however, only in the middle 1930s
and in particular after the last world war
tional to -../PiP. Now one can show through a that an explanation was sought in which the
linearization of the equations of conservation, magnetic field was involved. At the writing of
assuming the presence of the magnetic pressure this article, there is no complete sunspot theory.
alone, that a small disturbance (for a gas of However, the majority of workers in this area
infinite electrical conductivity) will be prop- agree on the following rough picture. Because
agated, in perfect analogy, with a speed equal of mechanical equilibrium considerations, the
to v'B2 /PJ.l.e. This is the so-called Alven speed, pressure is the same at a given distance from the
and these waves are called magneto-fluid me- center of the sun in the sunspot proper or in
chanic waves. Of interest also are combinations the photosphere which is free of a magnetic
of several mechanisms of propagation which field. This means that the magnetic pressure
might include sound and gravitational waves. plus the static pressure in the sunspot region
Because of the property of the magnetic lines must balance the static pressure in the photo-
to increase their tension when lengthened, along sphere, a fact that implies that the static
with the additional ones of distortion and prop- pressure in the sunspot region is smaller. If we
agation of disturbances, their properties are pre- picture the sunspot magnetic lines to be radial,
sented in loose terms as resembling very much the pressure gradient in this direction is indepen-
those of rubber bands. dent of the magnetic field and balances exactly
Applications There are both astrophysical the gravitational force per unit volume pg. Hence
and terrestrial applications of magneto-fluid- P is constant inside and outside the sunspot.
mechanics. One of the earliest ones was perhaps Since the static pressure is proportional to den-
suggested by Faraday, who thought to harness sity and temperature, the above arguments force
the river Thames with electrodes on its banks us to accept a lower temperature inside the spot
that would collect the induced electric current with a resulting darkening. The only question
resulting from the flow of the river as it cuts the that rises is whether the order of magnitude of
earth's magnetic field perpendicularly. Because the magnetic pressure is enough for the effect to
of the small electrical conductivity of water, be significant. This seems to be so. If we assume a
the small magnetic field of the earth and the magnetic field of 1500 gauss (typical in a sun-
MAGNETO-FLUID-MECHANICS 690

spot), the magnetic pressure is about 0.1 of an column, such that the rings in the concave
atmosphere which is the typical pressure in the side are pressed together, whereas the rings
photosphere. in the convex side are separated. As a result,
An explanation for the bipolar nature of sun- the magnetic flux density (and hence the
spots and the difference in the sign of their magnetic pressure) will be higher on the concave
polarity has been offered. The differential rota- side resulting in a force tending to increase
tion of the sun is invoked. The torroidal mag- the concavity. We say that this configuration is
netic lines of the sun's field lying on its surface unstable, since the force induced by the imposed
are twisted, since for very high electrical con- disturbance acts in a destabilizing direction.
ductivity, they are frozen with the motion. As Note that in this configuration, the center of
a result, the magnetic intensity is amplified and curvature of the undistorted plasma boundary
so is the magnetic pressure. Simple considera- cross section falls inside the plasma. One can now
tions based on the observed kinematics of the create another example in which the curvature of
differential rotation establish the location in the confining undistorted boundary of the
latitude with time where the intensities will be plasma is opposite (the center of curvature falls
high enough to give rise to sunspot activity. in. the vacuum) and show that the configuration
The result compares favorably with observa- wIll be stable. We can then state that a sufficient
tions. In fact, it can be shown that the sunspot condition for stability is met when the magnetic
activity migrates, time-wise, from the higher lines are everywhere convex towards the plasma.
latitudes towards the equator as observations If the magnetic lines induced by the currents
show. Because the twisted field is symmetric going through the plasma are in an unstable
with respect to the equatorial plane, this model configuration, externally imposed magnetic
describes correctly the symmetry of the activity fields can be used in order to "stiffen" the
in the north and south hemispheres along with configuration.
the fact that the polarity between two sym- The "aurora borealis" can be explained in
metric sunspot pairs is opposite in sign. terms of the interaction of the solar wind (due
to the continuous expansion of the solar corona
Efforts have been made to discover the mecha- with a velocity of about 500 km/sec) with the
nism for the generation and maintenance of geomagnetic field. The inertia associated with
cosmic magnetic fields, such as fields in stars, this "wind" will penetrate the magnetic lines of
the earth, and galaxies. The most promising the earth, only up to the point where the
direction seems to lie in the so called "dynamo induced magnetic pressure is smaller than these
theories." Here, some general magnetic field is inertial forces. The earth's magnetic field falls
assumed (not necessarily strong), which upon off with the inverse third power from the center
interaction with the motion of a conducting of the earth. Knowing the mass density and the
medium (convective, or motion due to Coriolis velocity of the solar wind, we can locate the
forces), induces currents which reinforce the remotest magnetic line from the earth that is
original magnetic field. As the magnetic field is
reinforced, the ponderomotive force suppresses strong enough to stop the penetration of the
the motion until some kind of a steady state for s?lar co~pusc~es. When this happens, these par-
both the motion and the magnetic field is ticles will glide along this magnetic line and
reached. eventually will come to the foot of this line at
th~ surface of the earth. An elementary compu-
Magneto-fluid-mechanics also studies problems tation shows that the latitude of this line is the
related to magnetic confinement of plasmas and one where the "aurora borealis" is observed.
their stability to small disturbances. Consider (see AURORA AND AIRGLOW).
for instance the so-called "pinch effect." Here, Convective motions can be effectively subdued
a strong current is passed through a cylindrical by ~he presence of a magnetic field. Consider,
column made up of a plasma. The axial current for mstance, the convection in a thin horizontal
filaments create an azimuthal magnetic field layer due to heating from below. Convective
(the magnetic lines are then rings with the cells will be formed when the buoyant force is
cylinder axis as the locus of their centers,) and enough to counterbalance the viscous force of
as a result, a ponderomotive force is induced the motion. (These were formerly called Benard
which compresses the plasma radially *. Through cells because it was believed that Benard had
this confinement, it is hoped that temperatures observed them. However, the name convective
of the order of 10 6 to 10 7 K will be created so cells is preferred.) At the same time balance
that thermonuclear FUSION can take place. of energy dictates that the heat conv~cted up-
Such configurations are normally subject to wards be equal to the heat conducted from the
instabilities. Consider, for instance the case in hot source at the bottom. The ratio of these
which a small distortion in the' form of a two energies is called the "Rayleigh number"
"kink" is formed in a cylindrical plasma and ~~r a given geometry, it must be higher th~n
a cntIcal value for the convective cells to ap-
pear. However, when a magnetic field is present
*Pinch-effect devices are also useful in metallurgy the ponderomotive force in general inhibits th~
where molten metals can be confined away from solid motion and at the same time changes the geom-
boundaries in order to remain pure. etry of the cell. The extent of this inhibition is
691 MAGNETOMETRY

given by the Hartmann number (defined earlier) Cross-references: ASTROPHYSICS, AURORA AND
so that the critical Rayleigh number is higher AIRGLOW, CONSERVATION LAWS AND SYM·
for higher Hartmann numbers. Available labo- METRY, FLUID DYNAMICS, FLUID STATICS,
ratory experimental results reconfirm the find- HALL EFFECT AND RELATED PHENOMENA,
ings of this theory. On a cosmic scale, it has IONIZATION, PLASMAS, SOLAR PHYSICS.
been hypothesized that the roll-like granulation
in the sunspot penumbra is the result of the
magneto-fluid-mechanic inhibition of the mo- MAGNETOMETRY
tion inside regular photospheric convective
cells. Magnetometry is the art of determining ac-
Magnetic fields are also known to inhibit · the curately magnetic fields and the magnetic
onset of turbulence. For instance, consider the properties of matter. Both applications are of
flow of mercury in a channel. Experiments have interest to pure science as well as to technology.
shown that the flow can be laminar well above The principles employed for measurements are
the critical Reynolds number of 2000 or so, if based on the magnetostatic interaction between
a coil is wrapped around the pipe thus creating fields and moments, on voltages induced by
an axial magnetic field. The small disturbances flux changes, on the deflection of charge car-
perpendicular to the direction of the main riers in fields and on the precession of nuclear
stream will be damped out through the action and electronic spins in a field.
of the induced retarding ponderomotive force. Matter exposed to a magnetic field of strength
Many other cosmic scale phenomena seem to H (as produced by a current-carrying solenoid)
be explainable through magneto-fluid-mechan- is magnetized. This phenomenon is described
ics. To list but a few, there are the solar flares by the vector of magnetization M =N( m), where
and filaments, the spiral structure of some N is the number of atoms per unit volume
galaxies, the heating of the solar corona, ex- having a mean dipole moment (m). The mag-
plosion of magnetic stars and many others. netic induction (flux density) is then given by
Although order-of-magnitude analyses have been B = Mo(H + M). One describes the magnetic
suggested to explain some of these phenomena, response of a material by its susceptibility X
there are no complete self-consistent theories. or its permeability Mr using M = XH and B =
Such theories seem to demand a simultaneous MOMrH. It follows that Mr = I + X. In magnetic
satisfaction of all the conservation and electro- materials the susceptibility X is a function of
magnetic equations-a formidable, if ever pos- temperature. Most often it will depend also on
sible, task. On the terrestial scale, many ap- the magnitude and direction (relative to crystal-
plications have been undertaken, and some of line axes) of H. Equations are given in SI units,
them are dependent upon technological develop- which form a rational system. In all systems of
ment rather than fundamental physical under- units Mr and X are dimensionless numbers. Mr is
standing. To give a few more examples, in the same in SI and emu, but Xis smaller by I I( 41T)
addition to those already mentioned, we list in emu. Note also that X is related to the num-
magneto-fluid-mechanic liquid metal pumps ber of atomic dipoles per m 3 in SI and per cm 3
and flow meters, propUlsion devices based on in emu.
the acceleration of a neutral plasma through Magnetometry is a still expanding field,
which a current and a normal magnetic field mainly because of the many practical aspects
from the outside are supplied (an area called of magnetism. New magnetometer designs ap-
"plasma propulsion"), or a device in which pear constantly, but especially for absolute
positive ions (such as the ones easily produced measurements the classical systems are still in
by alkali metals) are accelerated with an electric use with only minor modifications.
field (ionic propulsion). Other examples are Originally magnetometry developed from the
devices to reduce the heat transfer in reentry interest in geomagnetism, with its importance
objects by using the decelerating action of a for navigation . Nowadays exact measurements
magnetic field carried by the vehicle or to use of variations in the earth's field are important
the ponderomotive force as a control force for questions like the dynamics of the inner
when needed for the navigation of space crafts. core of our planet and of the surface of the sun
(via changes in the magnetosphere by sun spot
PAUL S. LYKOUDIS activities).
The classical method, devised by· Gauss, de-
termines the horizontal intensity of the earth
References field H - absolutely, with a relative accuracy
better than 10- 5 . Two measurements must be
Ferraro, V. C. A., and Plumpton, C., "An Introduction performed. First the torsional frequency of a
to MagnetO-Fluid-Dynamics," London, Oxford Uni- small standard magnet having the magnetiza-
versity Press, Second Edition, 1966. tion M and being suspended by a torsion fiber
Alfven, H., and Falthammar, C. G., "Cosmical Electro- is recorded. It is proportional to MH -. Then a
Dynamics," Oxford, The Clarendon Press, 1963. small magnetic needle is suspended by the same
Bateman, G., "MHD Instabilities," Cambridge, Mass., fiber and its deflection under the action of
MIT Press, 1978. the stam:'lrd magnet placed at a well defined
MAGNETOMETRY 692

position is measured. From this M/H- can near the probe. The system may thus also be
be determined. In modern systems the stan- used for measurements of susceptibility. The
dard magnets are replaced by precision coils sensor probe can be as small as 10-3 m!. NMR
activated by exactly measured currents (sine magnetometers lend themselves for telemetric
galvanometer ). readout and for incorporation into a field
With the earth inductor, devised by Weber, stabilizing circuit.
one obtains the inclination of the earth field. In so-called volume averaging NMR mag-
The induced voltage in a rotating coil is sensi- netometers the protons (e.g., of water or alco-
tively measured. If the rotation axis is parallel hol) are flowing through a tube in a strong B-
to the field direction the induced voltage field where their spins are longitudinally polar-
vanishes. Angles of a tenth of a minute of arc ized. The fluid then flows through a region with
can be resolved. Portable systems are available unknown B', where the spins are turned. The
as survey instruments, particularly for prospect- remaining longitudinal spin polarization is
ing of iron ores and related uses. monitored by standard NMR. The method can
Highly accurate field measurement (to 10-8 ) be used between 0.1 IlT and 1 T with a typical
are needed for the design of particle beam op- relative accuracy of 10- 7 around 1 mT.
tical systems such as high energy accelerators A very sensitive device for measuring ex-
or particle spectrometers. Similarly, such de- tremely weak fields is the Zeeman magnetom-
vices require a high temporal constancy of the eter. One of its applications is probing inter-
field which can only be achieved by high gain planetary fields in space. In the presence of a
feedback loops. field the 2S 1/2 ground state of an alkali atom
Rotating coil gaussmeters are modern descen- (e.g., Cs vapor) splits into a doublet. The popu-
dants of the earth inductor. A small coil is lation of the upper state is increased over the
wound on a nonmagnetic core and driven by a thermal equilibrium by optical pumping with
high speed electromotor. Such systems allow circularly polarized light. The transition to the
absolute measurements, but the probe averages Zeeman ground state is forbidden. It can be
over the volume covered by the coil. The me- stimulated by application of an rf field. This
chanics of the electrical contacts can also give will bring the population ratio back to thermal
rise to problems. equilibrium. The absorption of the pumping
A convenient, simple, rather pointlike field light in the vapor is dependent on the popula-
probe (which has to be calibrated) is the Hall- tion ratio of the two Zeeman levels. By tracing
effect gaussmeter. A constant current is sent the absorption maximum as a function of rf
through a small conductor or semiconductor frequency one obtains the energy separation
(2 X 2 X 0.1 mm 3 ). The field component per- of the Zeeman levels and hence the acting field
pendicular to the current flow will deflect the since the atomic moment of the alkali is pre-
charge carriers via the Lorentz force and a volt- cisely known. Sometimes a feedback circuit is
age perpendicular to field and current is gen- used to keep the system at exact resonance.
erated. It is proportional to the induction of the Its sharpness allows measurements down to
field. Most sensitive are semiconductor probes 0.1 nT.
like InSb. Typically a Hall voltage of 5 mV It should be mentioned that most informa-
is generated per T at lOrnA current. Such de- tion on stellar or interstellar fields comes from
vices are now common laboratory equipment. the observation of the Zeeman splitting of spec-
They can be used over a fairly wide tempera- tral lines of certain elements present in stars or
ture range. interstellar matter. However, the width of
An even simpler and cheaper method for spontaneously emitted optical lines is orders
coarsely measuring or controlling fields in the of magnitudes wider and thus the sensitivity is
laboratory is the measurement of the magneto- down.
resistance of a semimetal wire or film or of the Next we mention the magnetometer most
forward diode bias of commercial semicon- widely employed in technical application, the
ductor diodes at liquid helium temperatures. flux gate (or Forstersonde). It consists of two
They have moderate sensitivity and an overall soft magnetic cores in parallel orientation.
nonlinear response. They are driven to saturation by ac currents of
A highly precise field measuring device is the fixed frequency applied to a primary coil
NMR magnetometer. One determines the pre- wound around each core; the two windings are
cession frequency of the nuclear magnetic of opposite direction. Thus in a secondary
moment of protons (or 7Li) with a standard sensing coil wound around both cores nominally
nuclear magnetic resonance circuit. The extreme no signal will be present. A superimposed, am-
sharpness of the resonance line (e.g., of pro- bient dc field will produce a signal with twice
tons in water) allows detection of variations in the ac frequency in the sensing coil. The ampli-
induction of 10-8 in 1 T. Measurements of tude of this second harmonic is proportional to
broadening of the resonance line give informa- the dc field strength, its phase is related to the
tion on the presence of small field gradients. field direction. Modern systems use an addi-
An example is the field distortion caused by the tional field coil excited by the sensing coil
presence of weak para- or diamagnetic impurities signal in a feedback circuit. It compensates the
693 MAGNETOMETRY

ambient field and brings the sensing circuit netic material experience a force parallel to the
back to zero. direction of the gradient of the external field.
Uses range from airborne survey of mineral The sample is usually suspended on a thin fiber
deposits to minesweeping, submarine detection, from one side of the arm of a microbalance
treasure hunting (in archeology), and security mounted some distance above the field-produc-
checks. In these applications often the difference ing magnets. In modern designs a feedback sys-
signal between two flux gates separated by a tem is used which keeps the equilibrium by
certain distance is monitored. Three devices electromagnetic or electrostatic forces acting
mounted mutually perpendicular to each other on the other side of the balance arm. The mea-
are used to determine the vector components surement is absolute. Sensitivity is about
of a field. Magnetometers of this type were 10- 6 -1 0- 7 A/m in good systems. Fields are
flown to the moon during the Apollo mission. produced either by an electromagnet with in-
The sensitivity of flux gates can be better than clined pole caps or by a system of supercon-
10- 5 A/m. ducting coils. .
The most widespread use of magnetometry is Both the vibrating sample magnetometer and
the study of magnetic properties of matter. the magnetic balance allow the variation of
In pure science the quest is for the basic prin- sample temperature rather straightforwardly.
ciples of magnetism and thus the electronic Sample and vibrating rod (or suspending fiber)
structure of matter. In the foreground of tech- are mounted inside a small dewar system or
nological applications stands the design of new oven. Complete systems are available com-
permanent magnets, of magnetic cores of trans- mercially in highly advanced designs.
formers and inductances and recently mainly Another system which makes use of the force
for magnetic storage and recording devices. exerted on a magnetic material in a nonuniform
These technological applications favor mag- field is the pendulum magnetometer. The sample
netically ordered materials. For example, new is fixed at the upper end of a pendulum rod
technologies appear on the horizon through the which is suspended in the middle. A counter-
use of amorphous magnets. These types of weight is mounted on its lower end. For small
measurements are less concerned with the mag- amplitudes the force on the magnetic dipole
nitude and spatial distribution of fields but acts as a restoring force and will thus change
rather with the magnetic parameters of ma- the frequency of oscillation. Measurements of
terials such as the susceptibility or the mag- pendulum frequency are made with and without
netization. A widely used instrument for such field. The same basic principle is used in the
applications is the vibrating sample or Foner vibrating reed magnetometer. The sample is
magnetometer. The specimen is vibrated per- attached to one end of the nonmagnetic, me-
pendicular to a uniform magnetic field. Two tallic reed (e.g., Au), while the other end is
signal coils wound in opposite direction and rigidly fixed. The reed is forced into oscilla-
connected in series are placed around the tions by an inhomogeneous ac field super-
sample with their axes parallel to the direction imposed on the uniform magnetizing dc field.
of motion. The dipole field of the specimen The mechanical vibrations are converted by a
induces an ac signal in the coils. It is compared piezoelectric transducer to an ac voltage which
after amplification with the signal excited in is proportional to the magnetization of the
a second pair of coils by a ferromagnetic sample. Using look-in techniques one can re-
reference sample (located usually near the vi- solve moments down to 10- 13 A m 2 .
brator) which is moved together with the un- The classical astatic magnetometer is still
known specimen. Both samples are mounted used to determine the magnetization of rod-
on a nonmagnetic shaft set into oscillations of shaped samples. Two small, identical magnets
~80 Hz by a loudspeaker system. The mag- are mounted horizontally and antiparallel to
netic field may either be generated by an elec- each other at the ends of a long, nonmagnetic
tromagnet or superconducting coils. Careful rod suspended vertically by a weak torsion fiber.
design of pickup coils and phase-lock noise This is the measuring system which is unaf-
reduction allow the detection of changes in fected by the earth field. Two opposing coils
magnetization down to 10- 6 A/m at fields of are placed in the plane of rotation of one of
10 6 A/m. The principle of the vibrating coil the magnets. The aligned axes of the coils stand
magnetometer is very similar. The sample stays perpendicular to the axis of the magnet. Their
fixed in a homogeneous magnetic field and the fields cancel exactly at the position of the
signal coils oscillate perpendicular to the field magnet. The rod shaped sample is inserted into
along the axis of the dipole moments of the one of the coils. The balance of fields is dis-
sample. Demands on the homogeneity of the turbed by its magnetization and the resulting
field are extremely high. torsional deflection can be read out optically.
The other workhorse for studies of mag- Moments down to 10- 9 A m 2 can be determined
netic properties of matter is the magnetic (or a bsolu tely.
Faraday) balance. Superimposed on the mag- Critical parameters for commercial mag-
netizing uniform field is an inhomogeneous nets are the coercive force and the remanence.
field. The dipole moments induced in a mag- They are determined by recording the hys-
MAGNETOMETRY 694

teresis loop (i.e., the B vs H curve for rising and ropy energy density with respect to a reference
decreasing field intensity) in a simple inductance crystalline axis.
magnetometer. The sample is inserted into a On a similar principle operate rotating sample
gap of the core of a toroidal solenoid. The cur- magnetometers. The specimen (often a sphere)
rent through the solenoid gives H. One obtains rotates slowly (0.02-0.1 Hz) within a dc field
B by electronically integrating the voltage applied perpendicular to its axis of rotation.
induced in a concentric pickup coil. Auto- The variation in flux with rotation is sensed
matic systems of this type are common in ma- by search coils oriented parallel and perpen-
terials testing laboratories. The sensitivity can dicular to the dc field. The anisotropy con-
be improved by first balancing the signals stants are derived from the two measured flux
from two toroids with empty gaps. After in- components.
serting the sample into one gap the difference Information similar to the magnetic anisotropy
in outputs is measured. can be obtained by measuring the magneto-
For comparisons of the high field suscepti- strictive changes in dimensions of single-crystal
bilities of ferromagnets the orbiting sample samples as a function of magnitude and direc-
magnetometer is an advanced modern system. tion of the applied field. X-ray techniques are
Several specimens are mounted on a disk which usually too insensitive for this purpose, and
rotates with constant angular frequency of strain gauges have found the widest use. Capaci-
some 10 Hz. The uniform magnetizing field is tive read out has also been reported.
directed along the axis of rotation. The samples A predecessor of this system is the spinner
pass successively by a sensing coil. Its output is magnetometer. It rapidly rotates a ferromagnetic
stored in phase with the rotation. Digital sig- sample without external field. It is available as
nal averagers can be used to reduce noise. a survey instrument for anisotropy studies of
In single-crystal samples there are directions rock samples.
along which the material is more easily mag- Finally we discuss briefly a very modern sys-
netized. In general they coincide with the tem used for special magnetic measurements,
principal crystal symmetry directions. The free the superconducting quantum interference de-
energy of a crystal thus contains a term de- vice (SQUID) which is rapidly gaining im-
pendent on the direction of magnetization rela- portance. Basically a fluxmeter, it does not,
tive to the crystallographic axes. It is called however, measure the flux itself but rather,
the magneto crystalline anisotropy (energy) and with extreme accuracy, minute variations in
is usually expressed in a set of parameters flux. In fact, sensitivity has reached about 10-5
referred to as anisotropy constants. They of a single flux quantum (CPo = hj2e = 2.07 X
yield basic information on the anisotropy of 10- 15 Wb).
magnetic interaction of the atomic magnetic Applications of the SQUID in physics range
moments which in turn is often caused by the from ac measurements of very small magnetic
anisotropic electron distribution of the orbital moments (e.g., magnetization in extremely
ground state. weak fields) to a search for the elusive mag-
Magnetic anisotropy can be detected by all netic monopole. Measurements of volume sus-
static methods for the measurement of magneti- ceptibilities down to 10- 10 for 1 fJ.g samples
zation if provision is made that the axis of the have been reported. The SQUID has also be-
magnetizing field can be turned with respect come the central tool for magnetomedical
to crystal orientation. Also used are ac bridges and -biological research. Examples are magneto-
and ferro-, ferri-, or antiferromagnetic reso- cardiography and the study of fields generated
nance. The most commonly employed system by the action of the human brain.
is, however, the torque magnetometer. The The SQUID is based on two macroscopic
sample is suspended by a torsion wire between quantum effects in superconductors. The first
the poles of an electromagnet which can be is flux quantization: The flux trapped inside a
moved around the sample. The magnetizing superconducting loop must be an integral
field tends to align the sample magnetization multiple of CPo. The second is the dc Josephson
along an easy axis, and a torque is thus exerted effect: A superconducting ring is interrupted
on the sample, which is read out by optical or at one point by a weak link. Examples are a
capacitive methods or by the variation in re- thin (:::; I mm) oxide layer or a point contact.
sistance of a set of strain gauge wires. A set The thickness of the insulating layer is such
of data is taken by varying the original orien- that electrons can tunnel through it. In a super-
tation of the specimen relative to the external conductor a current of Cooper pairs is flowing
field. which requires no voltage across the barrier.
In the ripple field magnetometer the magneti- Only when a critical current (which can be kept
zation of the sample is modulated by an ac as low as some 10-5 A) is exceeded, a voltage
(~I 00 Hz) "ripple" field superimposed parallel proportional to the tunnel current appears at
to the dc magnetizing field. A sensing coil this so-called Josephson junction. The critical
oriented at 90 degrees to the field measures current drops rapidly when a field is applied
the perpendicular magnetization, which is pro- perpendicular to the flow of Cooper pairs.
portional to the angular derivative of the anisot- After this field has increased so much that an
695 MAGNETOSPHERIC RADIATION BELTS

additional flux quantum can be brought into NMR Magnetometer ... ," Rev. Sci. [nstr. 50,535
the ring, the critical current jumps back to a (1979).
higher value. These periodic discontinuities 8. Parsons, 1. W., and Wiatr, Z. M., "Rubidium Vapour
occur because the ring momentarily ceases to Magnetometer," Scient. Instr. 39,292 (1962).
be superconducting, so that one quantum of 9. Romani, G. 1., Williamson, S. J., and Kaufman, 1.,
flux can enter. In a wire loop, placed inside "Biomagnetic Instrumentation," Rev. Sci. Instr.
the ring, a voltage pulse is induced each time a 53,1815 (1975).
flux jump occurs. By counting these pulses the
total change of field through the ring can be Cross-references: GEOPHYSICS; HALL EFFECT
calculated. For ~1 mm diameter a change in AND RELATED PHENOMENA; MAGNETISM; MEA-
flux density of ~ I nT causes a jump. SUREMENTS, PRINCIPLES OF; SUPERCONDUC-
A practical SQUID arrangement is the two- TIVITY; ZEEMAN AND STARK EFFECTS.
hole system: Inside an Nb cylinder a bore shaped
like a dumbbell is drilled. The weak link is an
Nb screw placed across the bar of the dumb-
bell. If the flux in one hole increases, an equal MAGNETOSPHERIC RADIATION BELTS
decrease of flux in the other hole must follow.
The current through the Josephson junction de- The sun emits continuously a fully ionized gas,
pends on the flux difference between the holes. the solar wind, which flows radially outward
One uses an external search coil placed in the throughout the solar system. The solar wind
field the variations of which are to be measured. plasma is primarily made up of protons and elec-
It is connected in series to a wire loop inside trons, and its properties although variable, have
the first hole. This whole circuit works as a average values at earth orbit of bulk velocity
flux transformer. The second hole is inductively V ~ 350 km/sec, number density N ~ 5 cm- 3 ,
coupled to an LC circuit tuned at some 10 and temperature T ~ 15 eV. Because of its high
MHz. The appearance of a voltage drop across conductivity, the solar wind carries with it an
the junction due to flux brought into the first embedded magnetic field, which on average is
hole can be regarded as a change in inductance parallel to the ecliptic plane and traces an
of the superconducting ring which mistunes Archimedean spiral back to the sun in this
the oscillator. The voltage across the circuit plane due to solar rotation. At the orbit of earth
as a function of field acting on the search coil the interplanetary magnetic field is highly
is saw tooth like with a period corresponding variable in direction and magnitude, having an
to trapping one flux quantum inside the two- average value of ~ 5 nT (10- 5 gauss).
hole ring. The sensing coil can be placed at a The interaction of the supersonic solar wind
convenient distance from the SQUID. By giving with the intrinsic dipole magnetic field of the
it the appropriate shape one may also detect earth forms a region, the magnetosphere (Fig.
variations in the spatial derivatives of fields 1), whose boundary, the magnetopause, sepa-
with extreme sensitivity. Furthermore, the rates interplanetary and geophysical plasma
SQUID has a fast response. Rates of 10 7 <Po /s and magnetic field environments.! Upstream
have been recorded. of the magneto pause is a bow shock formed in
the solar wind-magnetosphere interaction pro-
F. JOCHEN LITTERST cess. At the bow shock the solar wind becomes
G. MICHAEL KALVIUS thermalized and subsonic and continues its
flow around the magnetosphere as magneto-
sheath plasma, ultimately rejoining the un-
References disturbed solar wind. The bow shock is of
interest because of its collisionless character,
1. Kohlrausch, F., "Praktische Physik" 22nd Ed., and much work is presently being done to un-
Vol. 2, p. 341, Stuttgart, B. G. Teubner, 1968. derstand the nature and the development of the
2. Lark-Horovitz, K., and Johnson, V. A., "Methods electric and magnetic field configurations
of Experimental Physics," Vol. 6, "Solid State required to establish a shock front in a collision-
Physics," Part B, p. 171, New York, Academic less medium. 2
Press, 1959. A rough estimate of the position of the day-
3. Kalvius, G. M., and Tebble, R. S., "Experimental side magnetopause is obtained by balancing
Magnetism," Vol. 1, John Wiley & Sons, Chichester, the solar wind pressure against the geomagnetic
1979. field with the resistive pressure of the geo-
4. Foner, S., "Vibrating Sample Magnetometer," magnetic field itself:
Rev. Sci. Instr. 27548 (1956).
5. Gallop, J. C., and Petley, B. W., "SQUIDs and
Their Applications," 1. Phys. E: Scient. Instr. 9,
t p V2 =B2 /8n,
417 (1976). where p = solar wind mass density, V = solar
6. Prim dahl, F., "The Fluxgate Magnetometer," wind velocity, and B = 0.34/R3 gauss is the
1. Phys. E: Scient. Instr. 12,241 (1979); 15,221 earth's field at the magnetic equator, with R =
(1982). geocentric distance in units of earth radii. Use
7. Pendlebury, J. M., et aI., "Precision Field Averaging of the average solar wind values given above
MAGNETOSPHERIC RADIA nON 'BEL TS 696

to the magnetopause), causing a two-way trans-


fer of particles and energy between the inter-
planetary medium and the solar wind. Neutral
points (lines) are formed on the magneto-
pause and, by magnetic flux conservation, in
the geomagnetic tail in the neutral sheet region.
It is not known if sites of field line intercon-
nection are responsible for large scale par-
ticle acceleration, but they are probably ef-
fective in altering the shape and size of the
magnetosphere.
Electrostatic, induced, and polarization elec-
tric fields playa fundamental role in determining
charged particle entry to and subsequent
motion and acceleration within the magneto-
sphere. Much present work in magnetospheric
physics is aimed at identifying these fields and
deconvolving the subsequent currents respon-
sible for sustaining the magnetospheric con-
figuration and causing its variations.
FIG. 1. An outline of the earth's magnetosphere. The in situ phase of magnetospheric research
The lines represent magnetic field lines. began dramatically with the discovery by Van
Allen and his colleagues3 of a permanent,
intense, trapped energetic particle population
gives a dayside magnetopause distance of -10.2 (the Van Allen radiation belts) residing in the
earth radii, as compared to an average observed geomagnetic field-a discovery made with data
distance of -10.8 earth radii. More exact fluid from the first United States satellite, Explorer
and kinetic theory models give good agreement I. Following this initial discovery, data obtained
with the observed average latitudinal and longi- from the trapping regions showed the geomag-
tudinal shape of the dayside magnetosphere. netic field to be at least at altitudes ::s several
In the antisolar direction, observations show earth radii, a very efficient and vast magnetic
that the earth's magnetic field is stretched out mirror machine. 4
in an elongated geomagnetic tail (analogous to The most useful approach for describing the
a cometary tail) to distances of several hundred motion of charged particles in the earth's
earth radii. The geomagnetic tail field lines magnetic field has been the guiding center ap-
emanate from high geomagnetic latitudes from proximation and subsequent development of
the vicinity of the auroral ovals to the geo- adiabatic invariant concepts. 5 ,6 In the guiding
magnetic pole. Topologically the geomagnetic center approximation, the instantaneous posi-
tail consists of roughly oppositely directed tion r of a particle moving in a magnetic field is
field lines separated by a "neutral" sheet of broken down into its circular motion of radius
nearly zero magnetic field. Surrounding the p, and the motion of the guiding center R:
neutral sheet is a plasma of "hot" particles, the
plasma sheet, having a temperature of 1-10 r = R + p.
keV, a density of 0.01-1 particle/cm 3 and bulk A general expression for the motion of the guid-
flow velocity of a few tens to a few hundreds of ing center can be obtained by substituting the
km/sec. A definitive physical explanation of above into the equation of motion
the extended geomagnetic tail has yet to be
obtained. d2 r e dr
Figure I is a schematic of the overall mag- m - - = mg +- - X B + Ee
netospheric configuration. There are seasonal dt 2 c dt
variations due to the _12 0 tilt of the earth's
magnetic dipole axis relative to its spin axis. where m = particle mass, e = electronic charge,
More important, variations in solar wind pa- c = velocity of light, g = acceleration of gravity,
rameters cause large perturbations to the pic- B = magnetic field, and E = electric field. This
ture shown in Fig. 1. These perturbations are yields the nonrelativistic guiding center equation
observed to have scale variations much larger
than the V- 1/3 and p-1/6 dependences pre- - R = g + - e { E + -I -dR
d 2- X B} - - fJ. VB
dicted by the simple pressure balance discussed dt 2 m c dt m
earlier. Therefore, physical mechanisms other
than simple pressure balance are required to
explain observed magnetospheric variations.
For example, geomagnetic field lines are known
to interconnect with interplanetary magnetic where fJ. = particle's magnetic moment due to
field lines (magnetic field component normal gyration, p = cyclotron radius, Q = scale length
697 MAGNETOSPHERIC RADIATION BELTS

over which the magnetic field changes appreci- in the northern and southern hemispheres
ably, and O(p/a) = terms of order pIa. In this (cL, Fig. 2).
approximation the particle's motion in the The second adiabatic invariant, obtained from
earth's field is broken down into three com- the action integral and associated with the par-
ponents: gyration about a field line, bounce ticle's bounce back and forth along a field
back and forth along the field line between line, is given by

L
mirror points, and a slow longitudinal drift
around the earth. While these motions are not M*
strictly separate from one another, the vast J= 2 Pil ds
difference in the time scales associated with
them makes such a separation possible and
leads directly to the consideration of adiabatic where Pil = mUll is the particle momentum along
invariants. These motions are illustrated in the field line and the integral is taken along the
Fig. 2. field line between the two conjugate mirror
The adiabatic invariants may be considered points M and M *.
constants of the particle's motion provided that Forces due to the gradient of the earth's
magnetic field variations are small compared magnetic field and field line curvature cause
with the time and spatial scales associated with a longitudinal drift across field lines with
the particle's motion. The first of the adiabatic electrons drifting eastward and protons drift-
invariants is the magnetic moment generated ing westward. In an ideal dipole field this ef-
by the particle as it gyrates around the field line, fect produces a drift surface which is simply
/1 = mUl 2 /2B = mu 2 sin 2 al2B = WliB the figure of azimuthal revolution of a line of
force. Associated with this drift motion is the
where a = angle between the field line and the third adiabatic invariant, the flux invariant
velocity vector.
This leads directly to the mirror equation and
definition of the particle's mirror point: <PM = B dS.

W sin 2 a <PM, the magnetic flux linked in the drift orbit


/1=--- = constant. of the particle, is the weakest of the three in-
B variants (/1, J, <PM) since it has associated with
In a static field, W (the particle energy) is con- it the largest spatial and temporal scales. There-
stant and fore the conditions for adiabatic invariance are
most easily violated for <PM.
sin 2 a1 IB 1 = sin 2 a21B 2 = constant. In Table I we show for reference character-
istic times associated with charged particle mo-
Using the earth's equatorial magnetic field as tion in the magnetosphere.
a reference, the mirror point B value on a given Radiation belt particles represent significant
line of force is determined by the particle's energy storage in the magnetosphere (2 X 10 22 _
pitch angle at the equator: 2 X 10 23 ergs). Their gradient and curvature
drifts establish a current encircling the earth,
BM =Beq/sin 2 O!eq. the ring current, which is responsible for world-
In a dipolelike field configuration, such as the wide depressions of the earth's surface mag-
earth's, having a minimum B value at the netic field. During times of enhanced radiation
equator, particles will simply bounce back and belt intensities, particle energy densities sig-
forth between conjugate mirror points located nificantly greater than the ambient magnetic
field energy density are observed and can cause
surface field variations up to several hundred
nT. The bulk energy density of the ring current
Mirror point particles is contained within the energy range
(Pitch angle of helical trajectory = 90· ~1-200 keV with a mean energy of ~85 keV.
This high {3 plasma (! pu 2 > B2 I 8rr) decays
primarily through charge exchange and ion-
cyclotron wave generation.
Protons, helium, and oxygen together form
the ring current but their relative contributions
are unknown. Thus the ultimate source of ra-
Magnet ic field line
diation belt particles, the solar wind or the
ionosphere, is still uncertain. It is expected
that strong energy and spatial dependencies
will be evident in the source mixture for the
bulk of the radiation belt particles. The very
FIG. 2. Illustration of the motion of a charged high energy (2: several tens of MeV) protons
particle trapped in the earth's magnetic field. observed at low altitudes CS 1.8 earth radii)
MAGNETOSPHERIC RADIATION BELTS 698

TABLE 1. CHARACTERISTIC TIMES ASSOCIATED


WITH PARTICLES TRAPPED IN THE EARTH'S
MAGNETIC FIELD.

Energy Gyr~eriod Bounce Period Drift Period


Particle (keV) (rxR )(sec) (rx R)(sec) (rx l/R) (hr)

Electron 10 9.4 X 10-6 0.64 36.7


100 13 X 10-6 0.23 4.1
1000 80 X 10-6 0.13 0.54
Proton 10 17 X 10-3 27 36.7
100 17X10-3 8.6 3.4
1000 17 X 10-3 2.7 0.35

Values correspond to "'e = 1T/2 and R = 2Re.

are supplied by neutrons, generated in the at- lished. Radiation belt particles having a solar
mosphere by cosmic rays, which leave the wind source obtain an initial heating in the
earth's atmosphere and decay in the geomag- geomagnetic tail where gradient and curvature
netic field. drive across electric field equipotentials can
Radiation belt particles are accelerated to increase particle energies by amounts up to
their final energies via E X B convection across several tens of ke V. If the particles are from
field lines and betatron and Fermi acceleration the ionosphere they are accelerated to energies
processes due to slow diffusion across mag- of -1-10 ke V by electric fields parallel to mag-
netic field lines under conservation of the first netic field lines emanating from the auroral
two adiabatic invariants. The relative importance zones. In either case a radiation belt source,
of these mechanisms has not yet been estab- most likely the plasma sheet, can be formed at

Mercury

..... I- --I I-
3.5 x 103 km -5 x 104 km 4.3 x 106 km

FIG. 3. MagnetosphereJike systems are probably common


throughout the universe, with a large range in scale sizes. For
example, the subsolar magnetopause distance for Mercury is
3.5 X 10 3 km; for the radio galaxy NGC 1265, the analogous
distance is roughly 10 18 km.
699 MAGNETOSPHERIC RADIA TION BELTS

altitudes ~ 6.5 earth radii which then can be tion, Fermi acceleration, and acceleration by
accelerated as discussed above to form the electric fields parallel to magnetic field lines.
trapped particle population. It remains to be shown that field line inter-
The second large energy storage region in connection can directly transfer energy from
the magnetosphere is the extended geomagnetic the magnetic field to charged particles (field
tail (3 X 10 22 -3 X 10 23 ergs). The plasma sheet line merging, reconnection) or if plasma turbu-
particles and the stretched geomagnetic field lence effects are inportant as acceleration
lines contribute roughly equal parts to this processes.
energy storage. The relationship between the Magnetospheric systems, while similar, often
plasma sheet, the aurora, and the radiation belts have their own unique characteristics. For ex-
is an intimate one but not yet fully understood. ample, Jupiter and Saturn have moons in the
As discussed above, the earthward portion of heart of the charged particle populations which
the plasma sheet is a likely source for the radia- are effective absorbers creating distinctive
tion belts, but the relative contributions of the features in their radiation belts. At Jupiter, the
solar wind and ionosphere are unknown. It volcanic moon 10 is a copius source of sulfur
is also likely that diffuse auroral forms are due and oxygen, both of which have been detected
to plasma sheet electrons scattered into the loss at all energies throughout the Jovian magneto-
cone by electrostatic waves. On the other hand, sphere. Jupiter's high spin rate (period = 9 hours
discrete auroral forms are most probably caused 55 minutes 29.7 seconds) can produce effects
by electric fields parallel to auroral magnetic to accelerate particles in addition to those
field lines. It is not known whether these fields found in the earth's magnetosphere. For ex-
are due to very narrow electrostatic field geom- ample, low energy plasma corotating with
etries called double layers or the observed shear Jupiter's magnetic field will exceed the Alfven
flows in the high altitude plasma (V . E < 0) speed and become supersonic well within the
coupled with ionospheric current continuity Jovian magnetosphere (30-40 Jovian radii).
restrictions. There appears to be no require- Even tiny Mercury, with neither atmosphere
ment for anomolous resistivity mechanisms in nor ionosphere, possesses a magnetosphere
auroral processes. capable of accelerating large numbers of par-
Associated with the parallel electric fields ticles to high energies.
responsible for discrete auroral forms is an in- Work remains to be done to understand how
tense electromagnetic radiation in the 100- the laws of physics operate in interacting mag-
1000 kilohertz band, auroral kilometric radia- netized plasma systems which display the range
tion. The intensity of this radiation,. normalized of boundary conditions seen throughout the
to a planetary radius distance scale, is compa- solar system and in the universe.
rable to Jupiter's emissions. In fact, integration
over respective radiating solid angles may make DONALD J. WILLIAMS
the earth a radio source of the same order as
Jupiter in total power output.
If we consider the earth's magnetosphere in References
a general sense as a rotating magnetized plasma, 1. For a detailed discussion of the magnetosphere and
we find that such objects are plentiful through- its interaction with the solar wind see, for example,
out our solar system and perhaps the universe. Akasofu, S. I., and Chapman, S., "Solar Terrestrial
This is not surprising, since most of the universe Physics" 901 pp., London, Oxford Univ. Press,
is filled with plasma and the basic interactions 1972, and Williams, D. J. (Ed.), "Physics of Solar
between plasmas, electric fields, and magnetic Planetary Environments," 1038 pp., Washington,
fields being uncovered in the earth's magneto- D.C., American Geophysical Union, 1976. For a
sphere are present in the development of cosmic brief summary of other magnetosphere systems see
regions from small interstellar clouds to entire Stern, D. P., and Ness, N. F., "Planetary Magneto-
galaxies. spheres," Annual Review of Astronomy and
Interplanetary spacecraft have identified mag- Astrophysics 20 (1982).
netospheres around Mercury, Saturn, and 2. See collection of papers on recent bow shock re-
Jupiter. 7 Astronomers have detected similar sults, Journal of Geophysical Research 86, 4319
structures around rotating neutron stars (pul- (1981).
sars) and radio wave-emitting galaxies. Figure 3. Van Allen, J. A., Ludwig, G. H., Ray, E. C., and
3 illustrates the scale sizes observed for these McIlwain, C. E., "Observation of High Intensity
various magnetospherelike systems. Radiation by Satellites 1958 Q and 'Y," Jet Pro-
In all these cases it is evident that nature has pulsion 28,588 (1958).
been able to accelerate charged particles to very 4. See for example, Roederer, G., "Dynamics of
high energies. In the earth's magnetosphere Geomagnetically Trapped Radiation," Heidelberg,
there are at least four established methods of Springer-Verlag, 1970, and Williams, D. J., "Charged
accelerating particles. The most general of these particles Trapped in the Earth's Magne+ic Field,"
is magnetic field gradient and curvature drift Advances in Geophysics 15,137 (19 7 1).
across electric field equipotential surfaces. 5. Alfven, H., "Cosmical Electrodynamics," 1st Ed.,
More specific mechanisms are betatron accelera- London and New York, Oxford Univ. Press, 1950.
MAGNETOSPHERIC RADIATION BELTS 700

6. Spitzer, L., "Physics of Fully Ionized Gases," directions. This gives the saturation magneto-
New York, Wiley-Interscience, 1956. striction
7. For Mariner-l0 Mercury results see Science 185,
141 (1974). A 2AlOO + 3 A111
For Voyager-l Jupiter results see Science 204,
=---=~-~ (2)
s 5
945 (1979).
For Voyager-2 Jupiter results see Science 206, The assumption undt)rlying the averaging that
925 (1979). yields Eq. (2) is that before the field is applied
For Voyager-l Saturn results see Science 212, the material is unmagnetized and all polycrystal
159 (1981). orientations are equally likely. This is in fact
For Voyager-2 Saturn results see Science 215, rarely the case; there is often both some rema-
499 (1982). nent magnetization and some preferential
orientation of crystallites. Measurements of
Cross-references: ELECTRON, GEOPHYSICS, IONO- All - Al, the difference in distortions parallel
SPHERE, PLANETARY ATMOSPHERES, PROTON, and perpendicular to the field direction, are
SPACE PHYSICS. more reproducible and significant, in that they
are independent of the distortion in a fiducial
"unmagnetized" state.
Magnetostriction coefficients vary greatly, de-
MAGNETOSTRICTION pending upon the material, temperature, and
magnetization state. For pure iron at room
When a polycrystalline nickel sample is placed temperature, the saturation magnetostriction
in a magnetic field, it contracts along the field constants are AIOO ""' 20 X 10-6 ; Al11 ""' -20 X
direction by about 30 parts per million and 10- 6 , while for alloys near 80Ni-20Fe (weight
elongates in the transverse direction by about per cent) these constants are almost zero. The
half that amount. There is also a small volume cobalt ion causes a large magnetostriction; for
change. Such changes in dimension of magnetic cobalt ferrite AIOO ""' - 500 X I 0- 6 while for
materi/lls with variation of magnetic field nickel ferrite AIOO ""' - 30 X 10- 6 • The largest
strength or direction, are termed magnetostric- known magnetostriction is that of dysprosium
tion. They are measured by strain gages, optical metal. 2 As a magnetic field is rotated in the
dilatometers, capacitance variation, and x-ray basal plane of this hexagonal crystal, there is a
analysis. basal plane distortion of almost one per cent, at
Below the Curie temperature, magnetostric- liquid nitrogen temperatures and below. At
tion in weak fields is caused by domain rotation, room temperature, TbFe2 shows a magneto-
becoming appreciable at fields near the knee of striction 5 times larger than does any other
the B-H curve. materia1 3 ; (A ""' 2 X 10- 3 ).
In saturating fields there is still a small linear The source of magnetostriction is the depen-
dependence of magnetostriction on magnetic dence of magnetic energy on strain. Because the
field strength, and above the magnetic ordering elastic energy is quadratic in strain while the
temperature magnetostriction is, except in rare magnetoelastic energy is linear in strain, the
instances, quadratic in magnetic field strength. minimum free energy occurs at nonzero strain.
Field strength dependent distortions in the satu- For example, in a cubic crystal the equilibrium
rated and paramagnetic regions, designated shear strain €xy is given by
forced magnetostriction, are due to the para-
process, the induction of a moment by the field. B 2 (T,H)
The saturation magnetostriction of single crys- €xy = O:xO:y (3)
C44
tals depends upon the direction of the (sublat-
tice) magnetization, a, and the direction of Here C44 is the elastic constant, the a's are mag-
measurement, {3, with respect to the crystal netization direction cosines, and B2 (T, H) is a
axes. In a cubic crystal (with collinear sublat- magnetoelastic coefficient representing the vari-
tices), to lowest order, ation of magnetic energy (magnetic anisotropy,
dipolar, anisotropic exchange) with strain.
-011 =Ao + ~3 AIOO [alI-'I
2 {./ 2
+ a2 2 1-'2
{./ 2 Quantum mechanical calculations of the mag-
netoelastic coefficients are in a somewhat more
satisfactory state in the case of nonconductors
+a32~32 - 11 + 3 A111 [aIO:2~1~2 than for metals. Extensive calculations by Tsuya
+ O:2O:3~2~3 + O:3O:I~3~11 (1) of the B coefficients of the spinels are reviewed
by Kanamori. 4
ClarkI gives higher order expressions for cubic The temperature dependence (and "forced"
and hexagonal symmetry. field dependence) of the magnetostriction coef-
For cubic crystals the fractional change in ficients is due to statistical averaging as the in-
length along the field (and the magnetization) dividual spins fluctuate around the average mag-
direction induced by a saturating magnetic field netization direction a. For some materials this
is, in principle, found by averaging Eq. (1) over temperature dependence can be expressed en-
701 MANY·BODY PROBLEM

tirely in terms of a known function of the system can be explained on the basis of the law
(sublattice) magnetization. For ferrimagnets S of interaction between the particles and the laws
of dynamics. The latter may be classical or
MT, H) ='L;Ajn(O)fj[mn(T, H) 1 (4) quantum mechanical according to the demands
n of the situation. At each level of refinement we
refrain from asking about the internal structure
That is, the magnetostriction coefficient Ai(T, H) of the particles. This is to achieve a natural
is the sum over sublattices of temperature inde- simplicity of description; but still, at each such
pendent sublattice magnetostriction coefficients level, we have a rich variety of natural phe-
[Al11 (0) = B 2n (0)/C44 1 times a functionfi of nomena to explain.
the sublattice magnetization Mn(T, H)/Mn(O). The many-body theory is not concerned with
At sufficiently low temperatures this function any fundamental or complete explanation of
reduces to nature. Its chief aim is to formulate schemes
according to which calculations of certain
Mn(T, H)] 3 physical quantities can be performed theoreti-
f[mn(T, H)] = [ Mn(O) ; T<f. Tc (5)
cally and the results can be compared with ex-
perimental measurements. It is inherent in its
for both A1IlO and Am. methods that the number of particles is con-
Clarki, Bozorth6 and Callen 7 give references. sidered as being large, and no attempt is made
to find all the details of motions of the particles
EARL CALLEN -a characteristic which distinguishes it from
the so-called one-, two- or three-body problems.
References The main approaches to the theory of quan-
tum-mechanical many-body systems, such as
1. Clark, A. E., "Magnetostrictive Rare Earth-Fe2
nuclei, solids, and fluids, were worked out in a
Compounds," in "Ferromagnetic Materials" (E. P. period of approximately ten years starting in
Wohlfarth, Ed.}, New York, North Holland Publish- the early 1950s. This led to a great deal of
ing Co., 1980, Chapter 7, pp. 531-589. activity and attracted much attention. As a
2. Legvold, S., Alstad, J., Rhyne, J., Phys. Rev. Let-
ters 10,509 (1963); Clark, A. E., Bozorth, R. M.,
result, the term many-body problem has come
and DeSavage, B., Physics Letters,S, 100 (1963). to mean, almost exclusively, the theory of such
3. Clark, A. E., Belson, H. S., AlP Conf. Proc. 5, 1498 systems at or near the absolute zero of temper-
(1972); Clark, A. E., Belson, H. S., Tomagawa, N.,
ature. The latter qualification serves to distin-
and Callen, E., Proc. Internat. Conf. on Magnetism, quish the many-body problem as such from the
Moscow, August 1973. closely related field of STATISTICAL MECHAN-
4. Kanamori, J., "Magnetism" (Rado, G. T., and ICS. (By convention many-body scattering
Suhl, H., Eds.), Vol. I, p. 127, New York, Academic theory is a separate subject). The new develop-
Press, 1963. ments were based on the observation that when
5. Callen, E., Clark, A. E., DeSavage, B., Coleman, W., the number of particles is so large that it may
and Callen, H. B., Phys. Rev., 130, 1735 (1963). be considered effectively infinite, then the
6. Bozorth, R. M., Ferromagnetism, New York, Van system becomes very similar to that of inter-
Nostrand Reinhold, 1951. acting fields-except for the nature of inter-
actions considered-and the general formal
7. Callen, E., J. Appl. Phys. 39,519 (1968). methods of quantum field theory and quantum
electrodynamics may be used with advantage.
Cross-references: FERRIMAGNETISM, FERROMAG· There is only one general theorem in many-
NETISM, MAGNETISM. body theory; it is known as Poincare's theorem.
Roughly speaking, it states that any given initial
state of a finite many-body system will be re-
MANY-BODY PROBLEM peated provided one waits long enough. The
quantum mechanical form of this theorem states
Scope and Definition A large part of the ex- that all observables in a finite system are almost
perimental data of physics is concerned with periodic functions of time. This theorem has not
natural objects which may be looked upon as had much practical use but has played an impor-
being made up from smaller bodies. For example, tant role in discussions concerning the founda-
we may think of the solar system as an object tions of statistical mechanics.
composed of the planets and the sun; ordinary The so-called many-body theory is mainly a
matter, in solid, liquid or gaseous form, as com- collection of special approximate methods devel-
posed of molecules and atoms; atoms and mole- oped for particular problems. The chief com-
cules themselves as made up from nuclei and mon features of some of the methods, especially
electrons, the nuclei as composed from neutrons the ones connected with modern developments,
and protons, and so on. We shall call the com- will now be described.
posite object the system, and its constituents, Reduction to an Equivalent System of Non-
the particles; and note that it seems most interacting Particles The very fact that we can
reasonable to suppose that the properties of the recognize some constituent particles leads us to
MANY-BODY PROBLEM 702

believe that in the lowest approximation we may nuclei, Wigner-Seitz cell model in solid state
neglect their interactions. This approximation theory, and several others.
is already quite successful in derivation of per- Density Functional Method This -method is
fect gas laws and electron theory of metals. A based upon a theorem of Hohenberg and Kohn,
slightly different form of this assumption occurs which states that if the ground state of a many-
in the case of atoms, which are treated as sys- particle system is degenerate then the corre-
tems of non-interacting electrons moving in the sponding wave-function is a unique functional
field of force of the nucleus. For planetary of the particle density. The theorem implies the
systems a similar approximation is used. existence of a universal functional of the exter-
The normal mode analysis of a lattice provides nal potential and particle density which is a
an example where a transformation of coordi- minimum for the true particle density. Others
nates is used to achieve such a reduction. In- have generalized the theorem to nonzero tem-
stead of considering the coordinates of individual peratures, nonlocal external potentials, rela-
particles which interact with each other through tivistic systems, and spin-dependent systems. In
harmonic forces, one considers certain linear the latter case a space and spin-dependent den-
combinations of displacements, the modes. In sity is used. The universal functional of the
terms of the new variables there are no inter- theory is inferred from the basic many-body
actions and the solution is immediately obtained. quantum mechanical description of the system.
This is an example of a transformation which Given the functional, the ground-state energy
introduces a collective description of the system. and particle density are obtained from a varia-
Another type of situation occurs in nuclear tional principle involving the particle density
theory, where it is found that a shell-model of alone. Appropriate approximations reproduce
the nucleus, built in analogy with the atomic Thomas-Fermi theory and its known generaliza-
shell-model, is very successful. The non-inter- tions, but the density functional approach can
acting particles of this model are called neutrons go much further. Extensive applications to
and protons, but interaction between them atomic, molecular and solid state problems have
which must be used in this model is vastly dif- been made. (Unfortunately no comprehensive
ferent from that observed in two-body scattering review of these works exists; we give references
experiments. As a first approximation one can to the first basic papers.)
completely ignore the mutual interaction and Collective Motion Theory In some phenom-
assume that the particles move in a common ena, such as propagation of sound and plasma
one-body potential. This circumstance suggests oscillations, it is clear that many particles are
that what are called neutrons and protons in this performing coordinated movements. To study
model are not the same as the free ones but are such cases, one introduces some collective
only some quasi-particles which are appropriate variables in addition to the usual ones, and the
to the model and happen to have many prop- Hamiltonian is re-expressed in terms of these
erties in common with actual particles. An mixed variables. Subsidiary conditions have to
analogous situation occurs in some solids where be imposed upon this extended system of
electrons as observed by means of cyclotron variables to preserve the original number of
resonance experiments possess an effective mass degrees of freedom. The collective variables
different from the mass of free electrons. In should be such that there is no appreciable
fact, one may even say that many-body theories interaction between these and other degrees of
always deal with quasi-particles. That is true of freedom. When quantum mechanics is applica-
most existing theories, but from such a point ble, the collective motions are also excited in
of view one loses sight of one of the basic quanta which for all practical purposes may be
motivations of many-body theory. treated as new (quasi) particles. The stability of
Effective Field Method This is one of the collective motions is then expressed in terms of
methods of taking into account the mutual inter- the lifetime of quasiparticles. Solid-state physics
actions of the particles. One starts with a given is particularly rich in exhibiting collective
motion of particles, e.g., from an approximation motions. Quasiparticles associated with some of
of the type described in the last paragraph, and them are: the phonons (sound, lattice vibration);
calculates the field of force experienced by one the polarons (electron and its polarization field
of the particles under the influence of all the in dielectric); and the excitons (electron-hole ex-
others. As a further refinement the field may be citations in insulators). Collective motions in
made self-consistent, that being the situation nuclei can also be interpreted in a similar man-
when the motion produced under the influence ner. Superconductivity and superfluidity are also
of the field is the same as that which generated examples of collective motion. The quasiparti-
it. But for the approximations made in the cles responsible for superconductivity are elec-
course of calculation, such as omission of the tron pairs with equal and opposite momenta
effects of correlations among the particles, a and spin.
fully self-consistent theory would be a complete Use of Techniques of Field Theory With these
theory. techniques it is possible to obtain formal expres-
Examples are : Hartree-Fock theory, Fermi- sions which represent the effect of interparticle
Thomas approximations, Brueckner theory of interactions to any order in perturbation theory.
703 MASER

By carrying out rearrangements and partial Many-Body Problem," Amsterdam, North-Holland


summations of terms in perturbation series it is Publishing Co., 1962.
possible to see that, as far as the motion inside Khilmi, G. F., "Qualitative Methods in Many-Body
the system is concerned, the relationship be- Problem," New York, Gordon and Breach, 1961
tween the coordinates and momenta and the (for classical mechanics).
potential and kinetic energies is changed in such March, N. H., Young, W. H., Sampanthar, S., "The
a way that it has to be described in terms of an Many Body Problem in Quantum Mechanics," Cam-
effective mass and an effective interaction, bridge, Cambridge University Press, 1967.
which differ from the original quantities in a Ziman, 1. M., "Elements of Advanced Quantum
known way. In certain cases these effects can Theory ," Cambridge, Cambridge University Press,
be calculated and are finite. 1969.
Brueckner's theory of nuclear matter is an
example of this type. The effective mass is found Cross-references: EXCITON, FEYNMAN DIA-
to depend on the momentum of the particle GRAMS, FIELD THEORY, KINETIC THEORY,
inside the system, and the effective interaction, NUCLEAR STRUCTURE, PHONON, PLASMAS,
the so-called t- or K-matrix, is given by an QUANTUM ELECTRODYNAMICS, QUANTUM
integral equation involving the original inter- THEORY, SOLID-STATE THEORY, STATISTICAL
action. A self-consistent calculation of the MECHANICS, SUPERCONDUCTIVITY, SUPER-
properties of the system (nuclei or atoms) can FLUIDITY.
be based on this understanding.
Similar techniques can be used for studying
collective >motions. An example is the treatment MASER
of electron gas by Gell-Mann and Brueckner. The term "maser," coined by Townes and co-
Perhaps the greatest advance has been made in workers who pioneered this field, stands for
the theory of superconductivity, where varia- m(icrowave) a(mplification by) s(timulated)
tional and canonical transformation methods e(mission of) r(adiation). "Microwave" has
have been used. proved restrictive; stimulated emission ampli-
A combination of all these methods is needed fiers have operated in the UHF (-300 MHz),
to study the difficult problem of relationship and at infrared, visible, and ultraviolet frequen-
between various excitations, i.e., the interaction cies (see LASER). The principal advantage of the
between various quasiparticles of a many-body maser amplifier is its small intrinsic internal
system. One of the most useful tools in these noise: the equivalent noise input temperature is
calculations is the representation of matrix but a few degrees Kelvin. The theoretical mini-
elements by means of diagrams, first introduced mum noise input temperature is hfs/k, where h
by Feynman. Many of these methods were first is Planck's constant, k is Boltzmann's constant,
developed in connection with the theory of and fs is the signal frequency. This is 0.48 K at
interacting fields and they are usually employed fs = 10 Ghz (Giga Hertz) or 10 X 10 9 Hz. Maser
in many-body problems for the limiting case of oscillators can generate exceedingly monochro-
an infinite number of particles, but these matic radiation, e.g., the ammonia maser has a
restrictions are not essential; in fact, they are short-term frequency stability of -5 parts in
quite general methods for treating arbitrary 10 12 , and the atomic hydrogen maser has a
quantum mechanical systems. short-term stability of better than 1 part in 10 13 •
KAILASH KUMAR Because "quasi-optical" techniques are being
employed increasingly in the millimeter and
submillimeter regions, the distinction between
References LASER and maser in these regions is becoming
eroded. Historically, maser oscillators used reso-
ter Haar, D., "Introduction to the Physics of Many- nant systems of dimensions comparable to a
Body Systems," New York, Interscience Publishers, cubic wavelength, (A 3 ); laser oscillators used
1958. resonators with dimensions exceeding A3 by
De Witt, B., "The Many-Body Problem," London, many orders of magnitude.
Methuen, 1959. Stimulated Emission of Radiation Because
Thouless, D. J., "The Quantum Mechanics of Many- its energy is quantized, a molecule (here a ge-
Body Systems," New York, Academic Press, 1961. neric term) can exchange energy with the elec-
Fetters, A. L., and Walecka, J. D., "Quantum Theory tromagnetic radiation field only in discrete
of Many Particle Systems," New York, McGraw- amounts (quanta). The emission or absorption
Hill, 1971. of a quantum (photon) is associated with a
Hohenberg, P. and Kohn, W., "Inhomogeneous Elec- transition between molecular energy states. For
tron Gas," Phys. Rev. 38, 864-871 (1964); Kohn, two states, 1m>, In> of energies Wm , Wn ,
W., and Sham, L. J., "Self-Consistent Equations (W m > Wn ), the frequency fmn of the radiation
InclUding Exchange and Correlation Effects," Phys. accompanying the (permitted) transition be-
Rev. 4A, 1133-1138 (1965) (for density functional tween them satisfies the Bohr condition
method).
Kumar, K., "Perturbation Theory and the Nuclear hfmn = Wm - Wn . (1)
MASER 704

A molecule in state In>, exposed to radia- Conditions for Amplification Suppose radia-
tion of frequency fmn and energy density u, has tion of frequency fmn is incident on an assembly
a probability per unit time u X Bnm (Bnm is a of molecules with an allowed transition at this
constant) of absorbing a photon hfmn and frequency [Eq. (1)]. Let the number of mole-
reaching state 1m>. There is also a probability cules in the upper state 1m> be N m , and in the
u X Bmn that a molecule in the upper state lower state In> beNn. If the incident radiation
1m> will emit a photon hfmn and return to the energy density is u, the power absorbed by the
lower state In>. The upper state molecule is molecules will be
stimulated to emit radiation of frequency fmn
by the radiation field at this frequency. Stimu- PA =NnuBmnhfmn (4)
lated emission, like absorption, is a process and the power emitted will be (see equation 2)
which is phase coherent with the incident radia-
tion. Thermodynamical arguments by Einstein (5)
(1917) showed that
Since at microwave frequencies spontaneous
Bnm = Bmn. (2) emission is negligible, the condition for amplifi-
cation is
A molecule in the upper energy state 1m> PE >PA;i.e.,Nm >Nn . (6)
may also revert to the lower state In> by
spontaneously emitting radiation of frequency There must be an excess of molecules in the
fmn. This spontaneous emission is a random upper energy state of the transition associated
process, which is phase incoherent with any with the signal frequency.
incident radiation, and is therefore a source of For thermal equilibrium at temperature T,
noise in a maser. Boltzmann statistics give
The spontaneous emission probability Amn is
given by (Nm/Nn)=exp(-(W m - Wn)/kTl
=exp (-hfmn/kT)'.:::.I- (hfmn/kT) (7)
Amn = Bmn X hfmn X PI (3)
at microwave frequencies, where hf ~ kT. Clearly
where PI is the number of wave modes per unit a molecular system in thermal equilibrium is
volume per unit frequency range open to radia- thus always absorptive. Equation (7) allows the
tion of frequency fmn. Table I shows values of definition of an "effective temperature" T m for
PI under various conditions; c is the velocity of an emissive system; Eq. (6) and (7) show that
light, Ug is the group velocity of radiation. T m will be a "negative" temperature, and that
ITm I-? 0 for (Nm/Nn ) -?OO. Obtaining an emis-
In the microwave region (say, I to 100 GHz), sive condition, obtaining a "negative tempera-
Amn ~ Bmn; spontaneous emission is therefore ture," and obtaining "population inversion" are
negligible except as a source of noise. However, thus synonymous. The excitation of a molecular
maser spontaneous emission noise is usually assembly to an emissive condition is perhaps
exceeded by noise arising from losses in ancil- the crux of the maser problem. The schemes
lary microwave circuit elements. used depend on the conditions and on the
Molecular transitions are excited by either the molecular system. Discontinuous methods (pulse
electric or magnetic component of the radiation inversion, adiabatic fast passage) can be used,
field, depending upon whether the change in but the account here is confined to the princi-
molecular energy is primarily electric or mag- ples of continuous methods. In a gas, actual
netic in character. Each radiative transition has separation of the upper-state molecules may be
associated with it an effective oscillating electric possible. For example, the upper-state mole-
or magnetic moment, usually dipplar. The prob- cules for the 23.87 GHz ammonia maser transi-
ability Bmn given above depends directly on tion tend to increase their energy in a static
this dipole moment and inversely on the fre- electric field, while the lower-state molecules
quency spread (line width) [) of the transition. tend to decrease their energy (Quadratic Stark
effect). In an inhomogeneous electric field, the
wanted upper-state molecules will therefore
drift to the low-field regions. An electrode sys-
TABLE I tem (with geometrical axial symmetry) which
gives a low-field region along the symmetry axis
Environment will therefore confine the upper-state molecules
in a beam along this axis while rejecting the
Enclosure large compared with the lower-state ones.
wavelength clfmn In the atomic hydrogen maser, a state-selector
Single mode resonant cavity, magnet is used, with alternating north and south
volume V, width of half-power poles, again in an axial arrangement. The elec-
response t;.f tron and nuclear spins can either be "parallel"
Waveguide, cross section A (F = 1) or "antiparallel" (F = 0). The energy
of the atoms with F = 0 decreases with the mag-
705 MASER

netic field as they go off axis, while that of the magnetic field (Zeeman effect). When the steady
atoms with F = 1 increases. Hence the atoms magnetic field is applied at an angle to the
with F = 1, those in the higher energy state, are major symmetry axis of the crystal field, and
focused by the magnet system along its axis, the resultant Zeeman splitting is comparable
while the lower-state atoms are lost to the beam. with the initial Stark splitting of the levels, the
The upper-state atoms enter a teflon-coated usually forbidden "leap-frog" transitions neces-
quartz bulb inside a microwave cavity resonant sary for 3- or multiple-level excitation become
at the transition frequency. The teflon coating allowed. In crystal fields of low symmetry,
minimizes the chance of an atom emitting its "leap-frog" transitions may be allowed at very
energy because of a collision with the wall of low or even zero magnetic fields. Clearly, ions
the bulb. having three or more energy levels are wanted,
Most masers operate on the multilevel excita- and any processes competing with radiative pro-
tion scheme, requiring an input of energy cesses-e.g., the interaction of the "spins" with
("pumping") at some frequency other than the the lattice-are usually required to be small.
transition frequency; forms of energy other Spin-lattice interaction can usually be reduced
than electromagnetic may also be used. The by cooling the lattice to a low absolute tem-
principles of the scheme will be illustrated by perature; and indeed most solid-state paramag-
reference to a molecule having 3 levels with netic masers operate at liquid nitrogen (77 K) or
energies WI < W2 < W3, such that all transi- liquid helium (4.2 K) temperatures. Some ions
tions between levels are allowed. (The transi- and host lattices with which maser action has
tions other than the signal transition need not been achieved are listed in Table 2.
radiate electromagnetically). In thermal equi- A "spin-spin" interaction process, known as
librium the number densities (ni)e of the parti- cross-relaxation must also be taken into account,
cles in the different states (i) will satisfy since it may either aid or inhibit maser action.
Cross relaxation is dependent on spin concen-
(nl)e > (n2)e > (n3)e. tration, but not on temperature. Consequently,
The frequencies 132, hi ,hi are defined from maser action may be achieved at comparatively
high temperature (77 K) but not at low tem-
imn = (Wm - Wn)/h. perature (4.2 K) where the considerably longer
spin-lattice relaxation time might be expected
Suppose now by some means, that the transi- to give better maser action. Rearrangement of
tion 1-+ 3 is saturated, i.e., nl ~ n3. (This the level populations occurs because of single or
might be achieved by a sufficiently strong elec- multiple quantum transitions between the levels,
tromagnetic field at frequency hi -known as in which energy is "almost" conserved on the
the "pump" frequency). Under these conditions, microscopic scale, any differential being ex-
it may happen either that n2 > n I, or that changed with the energy of the macroscopic
n3 > n2. In the first case, amplification will be spin system (total magnetic moment).
possible at f21; in the second case, at f 32, pro- Amplifier Systems Maser amplifiers may be
vided that the appropriate transition is electro- of either traveling-wave or resonant circuit
magnetically radiative. (cavity) form. Their performances are ex-
There are many variants of the simple scheme pressed in terms of a molecular Q-factor, Qm,
just described. The frequency 131 may lie in the defined over unit length for the traveling-wave
optical region (OPTICAL PUMPING); the excita- maser and over the resonator volume for a
tion may be by collision processes in a gas cavity maser. At the signal frequency is,
discharge; or more than three levels may be in-
volved, and pump frequencies lower than the Energy stored in the structure
signal frequency can sometimes be used. Qm = -2rris X - - = . . : . . . . - - - - - - - -
Maser Materials Maser action has been Power emitted by the molecules
achieved in gases (e.g., ammonia, formaldehyde, (8)
hydrogen, rubidium vapor) and liquids (e.g.,
protons in water) but the most important maser since the Q's similarly defined for losses are
materials are the solid-state ones, since these positive.
have a high concentration of active centers in a
small space. Present emphasis is on the use of TABLE 2
certain paramagnetic ions diluted in a host crys-
tal lattice. Three-level excitation, or some vari- Ion Effective Spin Host Lattice
ant, is usually employed.
PARAMAGNETISM is associated with ELEC- Cr 3 + A1 2 03, alumina (ruby)
TRON SPIN. The directional quantization of Cr 3 + Ti0 2 rutile
angular momentum leads to the quantization of Cr 3 +
the energy of the ionic magnetic moments in a Be3Al2(Si60IS) emerald
steady magnetic field. In general, the ground- Fe 3 + Al203, alumina
state multiplet of these ions is split by the crys- Fe 3 + Ti0 2 rutile
tal field of the host lattice (Stark effect), and Fe 3 + Al2 SiO s andalusite
the levels are completely separated by steady
MASER 706

For a magnetic dipole transition, The noise input temperature Tin of a traveling
wave maser is given approximately by
IQm I ex: S(N*prfi 11)-1 (9)
Tin~ITml+TI(IQmIIQe) (12)
where S is the frequency width of the transi-
tion at half-intensity, N* is the excess upper where T m is the effective negative temperature
level population, Pm is the effective dipole mo- of the maser material, Qm is the molecular Q
ment for the transition, and 11 is the ratio of the (negative), Qe is the similarly defined ohmic loss
magnetic el1ergy coupled to the molecules to factor, and TI is the actual temperature of the
that stored in the microwave circuit. waveguide (and contents). In this approxima-
Traveling-wave Maser The active maser ma- tion, I Qm 1< Qe. The bandwidth bm of the
terial is placed in a waveguide carrying a pure amplifier is approximately equal to, but less
traveling wave. The gain coefficient am is de- than,S.
fined such that the power gain G for a length I The Resonant circuit Maser may be of either
of amplifier is given by transmission (two-port) or reflection (one-port)
type: only the reflection type is considered
G = exp (2a m l). (10) here, since it is superior in performance to the
It can be shown that transmission type. A reflection cavity maser and
necessary ancillary equipment are illustrated
am = (21Tfs)/(IQm IVgI) (11) schematically in Fig. 2. Assuming that the un-
loaded resonant circuit (cavity) losses are negli-
where Vg is the group velocity of radiation in gible, the coupling to the external circuits will
the guide. Because Pm is typically of the order give rise to a Q-factor Qe, say. The power gain
of a Bohr magneton, and the active centers are G of the reflection cavity maser is then given by
diluted, it is necessary to use slow-wave struc-
tures (Vg ~ cl 100) in order to keep I to a G=(Qe+IQmD 2 (Qe-IQmD- 2 (13)
reasonable value (a few centimeters). Suitable
values of Vg are readily achieved by the resonant The bandwidth bc depends on the gain in such
slowing obtained in periodic structures. Systems a way that
such as the Karp structure, comb structure, and G1/2 bc -:::=' 2lQml/Js(for G > 10, say)
meander line are favored, since these support
waves with the magnetic field circularly polar- The noise input temperature is given by Eq.
ized in a plane containing the direction of prop- (12) above, where now
agation and perpendicular to the plane of the
periodic elements. A comb-structure traveling- Qil = Qil - 10m I-I
wave maser is illustrated schematically in Fig. 1.
The sense of circular polarization is reversed on
crossing this plane and is opposite in any re-
flected wave to that in the forward wave. The Matched Termination
nonreciprocal gyromagnetic properties of para-
and ferrimagnetic materials may then be em-
ployed to obtain forward gain and reverse at-
tenuation with these slow waveguides.

Output

~ Input

Circulator

Resonant
Cavity

I-+---+-Active Medium
FIG. 1. A magnetic field is applied parallel to the
"teeth" of the comb. FIG. 2.
707 MASER

It is necessary to have some nonreciprocal de-


vice to separate the reflected amplified output
Wn = mc 2 [I +2(n +! )hno/21TmC2] 1/2

from the input signal; the ferrite circulator is


- mc 2 +p2/2m
most commonly used. The bandwidth and gain
stability of the cavity maser are inferior to that
of the traveling-wave maser, but the cavity where no is the rest electron gyrofrequency
maser is more easily constructed. If three-level eB/m, and p is the unquantized momentum
excitation is used, it is clear that any maser sys- along the direction of B. Transitions between
tem must support both "pump" and signal states In + 1) and In) occur at angular frequency
frequencies. wn =(1- nhno/21TmC2)no for nhno/21T«
Maser Oscillators Equation (13) indicates mc 2 ; note that Wn decreases as n increases. If
that if IQrn I is small enough, G becomes infi- a system has a greater population in the state
nite; i.e., oscillation occurs when the stimulated In + I) than in In), photons of angular frequency
emission is large enough to overcome all losses. Wn will induce more (downward) transitions
The width of the signal emitted by a maser os- In+I)--*ln) than (upward) transitions In)--*
cillator is very much less than 0, so that for In + 0, because of the unequal level spacing;
narrow 0 an extremely pure oscillation signal hence stimulated emission exceeds absorption.
results, and a molecular transition which is rela- If the width of the level In + I) exceeds that
tively insensitive to external influences will thus of In>, a similar effect can occur. This situa-
give oscillations of high stability in frequency. tion can be obtained by causing the radiating
The ammonia maser and the atomic hydrogen atoms to suffer elastic phase-interrupting colli-
masers are two examples. sions (e.g., with neutral atoms) if the energy-
Two hydrogen maser oscillators have been dependent collision cross section is sufficiently
compared against each other, and the relative strong.
frequency over several hours was stable to In a typical device, the gyrotron, a solenoid
1 part in 10 14 • This stability has allowed the creates an axially symmetric magnetic field
determination of the level separation of atomic about a gently tapering waveguide system,
hydrogen and its isotopes with greatly increased whose different, sections act as interaction
precision (e.g., 0.1 Hzin 1,420,405,751.768 Hz), space (open or "quasi-optical" cavity) and
and has thus allowed a further test of theoreti- output (and if necessary, input) apertures.
cal quantum electrodynamics, and given a mote The electrons are produced at a cathode with
precise value for the fine structure constant. a large emitting surface, and accelerated to-
Hydrogen masers have also been flown in mis- wards a collector. The magnetic field increases
siles and aircraft for the detection of small in intensity from the cathode to the interaction
relativistic effects of the motion. space, which has an almost uniform magnetic
The Electron-Cyclotron Maser Relativistic field. In the nonuniform field region, the elec-
free electrons gyrating in a static magnetic field tron orbital velocity v1 grows from the initial
undergo free-free transitions. If there is an cathode orbital velocity according to V1 2 /B =
energy-dependent level width, or an energy- constant: the orbital energy is drawn from that
dependent level spacing for the quantized free of the longitudinal motion, and from the ac-
electron states, and a population inversion is celerating electrostatic field. The electrons
also present, stimulated emission and hence deliver up the RF energy in the interaction
amplification can occur. space, then pass through a section of decreasing
The classical picture of cyclotron maser action magnetic field to an extended surface collector.
can be obtained by considering the phases of Such devices are capable of powers of many
electrons gyrating about the magnetic field. kilowatts in the millimeter and sub millimeter
Each charge will radiate as an electric dipole; wavelength regions.
a multipolar contribution also occurs when Applications Maser amplifiers are now in use
relativistic effects are properly taken into ac- wherever the requirement for a very low noise
count. Consider a system of monoenergetic amplifier outweighs the technological problems
electrons, initially distributed randomly in of cooling to low temperatures. They have been
phase. If a phase bunching mechanism exists, used in passive and active radioastronomical
coherent emission will take place. Because of work, in satellite communications ("Project
the relativistic mass change, this bunching does Echo") and as preamplifiers for microwave
in fact occur. Electrons absorbing radiation spectrometry. The "deep-space tracking" sta-
become more massive, and go back in phase; tions around the world use ruby maser pre-
electrons emitting radiation become less "mas- amplifiers for the reception of signals from
sive" and advance in phase. The ultimate phase planetary probes. The ammonia and the atomic
distribution favors emission over absorption, hydrogen masers are being studied as frequency
thus increasing the intensity of the incoming standards and have been used in a new accurate
electromagnetic wave. test of special relativity. Sources and amplifiers
From the quantum-mechanical viewpoint, a in the submillimeter, micron, and optical wave-
free electron in a uniform static magnetic field length regions are being studied and developed
B is an anharmonic oscillator with quantizerl (see LASER). Maser theory has been used to
energy levels (neglecting spin) explain numerous atomic and molecular emis-
MASER 708

sion lines observed in radio astronomy ('celestial gravitional mass). Secondly, all matter resists
masers'). any change in its motion (inertial mass).
The gravitational role of mass is quantified by
G. J. TROUP Newton's universal law of gravitation,

References Gmlm2
=-
A
F 2 Uy (1)
Andronov, A. A., Flyagin, V. A., Gaponov, A. V., r
Gol'denberg, A. L., Petelin, M. I., Usov; V. G., and
Yulpatov, V. K., "The Gyrotron: High-Power Source
wherein the force F of attraction (designated
of Millimetre and Submillimetre Waves," Infrared
by the negative sign) between masses m 1 and
Physics 18,385 (1978). m 2 is inversely proportional to the square of
Hershfield, J. L., and Granatstein, V. L., "The Elec- the distance r between their centers of mass,
tron Cyclotron Maser-An Historical Survey," IEEE and directly proportional to the magnitudes of
Transactions on Microwave Theory and Techniques m 1 and m2. F is called a central force because
it acts in a straight line (the direction of which
is given by a unit vector uy ) which originates at
MTT-25(6), 522 (1977).
Microwave Journal Staff, "Low Noise Maser for Radio
Astronomy," MicrowaveJ. 21(3),52 (1978).
the center of mass m 1 and terminates at the
Ramsey, N. F., "Hydrogen Maser Research," in "Fun-
center of mass m2.
damental and Applied Laser Physics" Proceedings of
This force is independent of the physical di-
Esfahan Symposium (Feld, M. S., Javan, A., and
mensions of ml and m2. Its strength, compared
Kurnit, N. A., Eds.), New York, John Wiley & Sons,
with forces of other kinds such as that of elec-
tricity, is determined by the size of the gravi-
1973.
tational constant G. In view of the high degree
Weber, J., Rev. Mod. Phys. 31,681 (1959).
of accuracy over centuries of application inher-
ing in Eq. (1), which does not require any
Books specification as to which of the two masses
Cook, A. H., "Celestial Masers," Cambridge Mono- is the active and which is the passive gravita-
graphs on Physics, Cambridge, U.K., Cambridge tional mass, no quantitative difference is
Univ. Press, 1977. expected between these two states of gravita-
Ishii, T. Koryu, "Maser and Laser Engineering," Hun- tional mass. Further, Newton's third law of
tington, N.Y., Krieger, 1980. action and reaction, to be considered below,
Orton, J. W., Paxman, D. H., and Walling, J. C., "The implies their equality.
Solid State Maser," London, Pergamon, 1970. The inertial property of mass is described by
Siegman, A., "Microwave Solid State Masers," New Newton's three laws of motion. According to
York, McGraw-Hill, 1964. the first law, a body will continue in its state
Siegman, A., "An Introduction to Lasers and Masers," of rest or in a state of uniform motion (un-
New York, McGraw-Hill, 1971. changing speed along a straight line) unless
Troup, G., "Masers and Lasers," 2nd Edition, London, acted upon by a net external force. This means,
Methuen and Co., 1963. for example, that the earth, instead of orbiting
Weber, 1. (Ed.), "Masers," International Science Re- about. the sun, would move in a straight line
view Series, New York, Gordon and Breach, 1967. and thus escape from the solar system. The
"escape" is prevented by the force of gravita-
Cross-references: COHERENCE, ELECTRON SPIN, tional attraction between the sun and the earth.
FERRIMAGNETISM, LASER, LIGHT, MICROWAVE It also means that motion of mass can exist
SPECTROSCOPY, MICROWAVE TRANSMISSION, without force. It is through Newton's second
OPTICAL PUMPING, PARAMAGNETISM, QUAN- and third laws however, that an operational
TUM THEORY, ZEEMAN AND STARK EFFECTS. definition of inertial mass is achieved, one
based on the concept of force which can be
a push or a pull (such as that inherent in the
MASS AND INERTIA force called weight). The second law states that
the net force F is equal to the change (or deriv-
Mass is, along with length and time, one of the ative) of momentum with time, momentum
three fundamental undefinables of Newtonian being the product of mass m and its velocity v:
dynamics (the explanation of matter in motion).
These three undefinables combine to form the d(mv)
important operational quantity called force. F=--. (2a)
One in turn may consider force as a measure of dt
the interaction of mass with its environment.
Force is defined in such a way that it is de- Taking the derivative of the momentum with
pendent on eitlier one of two intrinsic proper- respect to time leads to a two-term expression
ties of matter, called gravitation and inertia. for the force F:
First, all matter exerts an attractive force on all dm
other matter (active gravitational mass) and is F =m·a+--v. (2b)
in turn attracted by all other matter (passive I dt
709 MASS AND INERTIA

For most conventional situations the inertial concept of electromagnetic fields as well as
mass is constant, so that the second term with the quantum of action. This success has
(dm/dt)v is equal to zero, thereby leaving culminated in the extraordinarily precise disci-
F =mia. The acceleration a, which is the deriv- pline of quantum electrodynamics.
ative of velocity v with respect to time (dv/dt), It should not be concluded, however, that the
represents a time-dependent increase or decrease concept of force is a unique prerequisite in
in velocity of the mass or a change in its direc- Newtonian mechanics for the definition of
tion of motion such as that encountered in inertial mass. Mach proposed an experiment
rotation. Velocity is speed in a specific direc- for determining the relative inertial mass free
tion, and is an example of a vector, namely it of any consideration of force including that of
has magnitude (i.e., speed) and a specific direc- weight. Two masses M and m on a frictionless
u
tion. In the case of the unit vector y , the mag- surface are held together against a compressed
nitude is one. Operationally, Eq. (2b) states that spring between them. When released, they fly
when the force F is fixed, mass and acceleration apart under the influence of the equal and
are inversely proportional to each other. The opposite forces exerted by the spring. The
interdependence of the concepts of force and second law states that the product of the mass
mass reflected by F = mja emphasizes the lack and the magnitude of the acceleration for each
of a unique definition of what inertial mass is. object should be identical, or
The latter, however, may not prove ultimately
to be a drawback, for the following reason. At mA =Ma. (4a)
least three of the hierarchy of four forces, i.e.,
gravitation, weak, electromagnetism, and strong, If we replace M by one standard unit of mass,
the kilogram, i.e., M = 1, then
contain empirical constants, such as the per-
mittivity of free space £0, the gravitational a
constant G, or the Fermi coupling constant GF. m=-. (4b)
A delineation of the origin of these constants A
would provide a deeper insight into the nature
of the interaction of mass with its environment, Mass m is therefore operationally defined as a
as well as a potentially richer understanding of ratio of two accelerations, i.e., that of the
the nature of mass itself. standard a and that of the object A ; the ratio
For example, Einstein in banishing the con- of the two accelerations is constant for a par-
cept of absolute velocity by focusing attention ticular body and thereby quantifies an intrinsic
on the role of the relativity of the velocity term property of matter. Its validity requires that
in Eq. (2b), enlarged the Newtonian operational Newton's third law hold, namely that for any
view of the concept of inertial mass. In his force of action there must exist an equal and
theory of special relativity, the amount of mass opposite force of reaction.
associated with an object was shown to be a From the time of Galileo, there has existed
function of its relative velocity, reason to believe that the two intrinsic proper-
ties of mass might have a common origin. Since
(3a) the mass of a falling object on earth is simul-
taneously inerting m j and gravitating m G, one
a dependence quantified by the Lorentz trans- can equate Eqs. (l) and (2b), where, on earth,
formation 1, Me is the mass of the earth and r is the distance
from the center of the earth to the falling
(3b) object:
GMemG A

Here e, the speed of light in a vacuum, is con- mja = - r2 Ur · (5)


sidered independent of any motion of its source.
The total mass of an object m was now con- To a surprising degree of accuracy (about 1 part
sidered to be the sum of a constant, intrinsic in 10 12 ) the acceleration for all falling bodies in
rest mass mo and an increment of variable mass the vicinity of the earth is equal to -GMe/r 2 ,
depending upon its velocity relative to some more commonly designated asg ~ 9.8 Newtons/
reference frame. This latter effect negates kilogram. It is the variation in r in the factor g
defining mass as a measure of the quantity of and not a change in mass that causes the weight
matter in a body. Significantly, mass could now of a given object to be location dependent. This
be shown to be equivalent to energy via the re- means that to the extent that all objects fall to
lationship E = me 2 • It is worth noting that this the ground with the same acceleration, the ratio
energy expression for mass does not reflect the mG/mi, at fixed r, in Eq. (5) is constant so that
fact that it is considerably easier for stable mass mj can be set equal to mG by redefining the
to be converted into radiant energy than the constant G. Einstein seized upon this apparent
reverse. Significantly, the special relativistic experimental equality of inertial and gravita-
Newtonian operational definition of mass has tional mass and made it absolute, thereby
been found to be highly compatible with the adopting the suggestion of Mach that inertial
MASS AND INERTIA 710

mass is simply the gravitational attraction of Eisenberg, R. M., and Lerner, L. S., "Physics: Founda-
all the mass in the universe for the matter under- tions and Applications," Vol. I, New York, McGraw-
going a change in its motion. This enabled him Hill, 1981.
to reduce the number of fundamental undefin- French, A. P., "Special Relativity," M.I.T. Introduc-
abIes of dynamics to two, namely length (three tory Physics Series, New York, W. W. Norton and
dimensions) and time. If we multiply the latter Company, Inc., 1966.
by the postulated invariant speed of light e, Dicke, R. H., "The Solar Oblateness and the Gravita-
we obtain a fourth length or the fourth dimen- tional Quadrupole Moment," Astrophysical Journal
sion. Thus, dynamics is reduced to four-dimen- 591-24 (January 1970).
sional geometry and the existence of mass is
indirectly reflected by the warpage of this Cross-references: DYNAMICS, FRICTION, IMPULSE
four-dimensional geometric structure. (The AND MOMENTUM, MECHANICS, RELATIVITY,
unusual behavior of black holes can be at- STATICS.
tributed to optical effects encountered under
conditions of extreme warpage.) From a
Newtonian standpoint, general relativity ban- MASS SPECTROMETRY
ishes absolute acceleration and quantifies the
relativity of acceleration. Mass spectrometry is based on observations of
General relativity, in eliminating mass as a the behavior of positive rays by Thomson and
fundamental undefinable through the removal Wien. In 1919, Aston demonstrated the exis-
of any distinction between inertial and gravita- tence of isotopes by introducing neon gas into
tional mass, redefines mass as a measure of the a mass spectrograph. Prior to 1940, mass spec-
curvature of space-time. Despite many experi- trographs and spectrometers were used primarily
mental confirmations involving matter and light; for isotopic studies in university laboratories.
general relativity has resisted accommodation Analytical spectrometers became commercially
with electric charge, as well as with the related available during the early years of World War II
quantum of action h. Further, recent measure- when their use for the rapid analysis of hydro-
ments of solar oblateness by Hill, if confirmed, carbon mixtures was recognized.
would suggest that a small portion (~1.4%) of Mass spectrometry provides information con-
the advance of the perihelion of Mercury (43" / cerning the mass-to-charge ratio and the abun-
century), until now a stunning, unique and dance of positive ions produced from gaseous
exact prediction of general relativity, may have species. There are several techniques for the
to be attributed to some other mass effect. It production and measurement of the ions, and
therefore remains an open question whether a the design of an instrument is determined by
description of mass based upon the elimination its proposed application. The mass spectrograph,
of all distinction between inertia and gravita- using a photographic plate for ion detection,
tion can be maintained. had been used primarily for isotopic studies but
later was used for the analysis of trace constitu-
B.A. SOLDANO ents in solids. The mass spectrometer uses an
electrical detection and recording system giving
References a metered output that provides a more accurate
General Relativity measure of the abundance of the ions than the
photographic plate provides. The mass spec-
Klein, H. Arthur, "The New Gravitation: Key to In- trometer is used primarily for the quantitative
credible Energies," Philadelphia and New York, analysis of gases, liquids, and a limited number
J. B. Lippincott Company, 1971. of solids.
Sciama, P. W., "The Unity of the Universe," Garden The five basic components of the instrument
City, N.Y., Anchor Books (Doubleday and Co.), are the sample introduction system, the ion
1959. source, the mass analyzer, the ion detector, and
Lieber, L. R., and Lieber, H. G., "The Einstein Theory the recorder. A sample pressure of approxi-
of Relativity," New York and Toronto, Rinehart mately 5 X 10-5 torr is generally required for
and Company, Inc., 1945. a satisfactory analysis. An elevated temperature
Black Holes inlet system or other means of converting the
Taylor, J. G., "Black Holes," New York, Avon Books, sample into a gaseous state is required for less
1973. volatile species. Direct insertion probes for in-
Penrose, Roger, "Black Holes," Scientific American, troducing samples directly into the region of
PPM 38-46, May 1972. ionization are available on many instruments.
Ruffini, Remo, and Wheeler, John A., "Introduction The most common methods of producing
to Black Holes," Physics Today 24(1), 30-41 positive ions are electron impact, thermal ion-
(January 1971). ization, spark, field emission desorption, and
chemical ionization. The electron impact source
Newtonian Physics and Special Relativity is the most widely used. Positive ions are pro-
Hecht, E., "Physics in Perspective," Reading, Mass., duced by removing one or more electrons from
Addison Wesley Publishing Company, 1980. the molecules. Thermal ionization produces
711 MASS SPECTROMETRY

positive ions by vaporizing a material directly A major advantage is the increased capability
into the ion source from a filament coated with to detect individual organic compounds in
the sample. With a spark source, the material complex mixtures.
under investigation must be a conductor, or The two types of ion detection and recording
else suitable means must be provided for initiat- systems are the photographic plate and the elec-
ing and maintaining a spark. Ions produced in trical detector. The photographic method is
the spark are taken directly into the mass commonly used with double-focusing instru-
analyzer. In the field emission source, a high ments such as the Mattauch-Herzog design that
potential is applied between the sample- focuses all ions simultaneously in one plane.
generally deposited on the tip of a tungsten The photographic plate records a complete
wire-and another electrode. Ionic species spectrum (mass range - 36: 1) in a time interval
representative of the sample are removed by a of a few seconds to 10 minutes. However, the
high-intensity electric field. Chemical ionization response of the plate to the ion intensity is
involves the reaction, directly in the ion source, nonlinear and quantitative results are more
of ions from a reactant gas such as methane difficult to obtain than by electrical detection.
with molecules of the sample. Electrical detection systems use an ion collector,
The three most widely used types of mass amplifier (commonly an electron multiplier),
spectrometers are (1) the single-focusing, and recorder.
magnetic deflection, (2) the double-focusing, Positive and negative ions and neutral species
and (3) the quadrupole. These three types of are produced by the electron bombardment of
instruments differ primarily in the method used molecules. The mass spectrum of a compound
for mass separation. The single-focusing analyzer is a record of the positive ions collected. Posi-
achieves direction but not velocity focusing of tive ions are produced by the removal of one
the ions. Ions of the same mass-to-charge ratio, or more electrons from the molecule and by the
having slightly different velocities resulting from rupture of one or more bonds, fragmenting the
different kinetic energies imparted in the ion- molecule. While the majority of the positive ions
ization process, will not be focused simulta- are singly charged, doubly and triply charged
neously, thus producing a broadening of the ions are 0 bserved in many instances. Certain
peak. The resolution of commercially available mass ions produced from organic molecules
instruments of this type is generally limited to must be attributed to the rearrangement of
about one part in 500. That, is, mass 499 can hydrogen atoms during the ionization and frag-
be separated from mass 500 with about a 10% mentation processes. Metastable ions, formed
valley. With double-focusing instruments, an when ions decompose while traversing the
electric sector and a magnetic analyzer are path to the collector, are also frequently ob-
placed in tandem to produce both velocity and served. Metastable ions generally appear at
direction focusing of the ions. Several commer- nonintegral mass units and produce broad,
cial models of the double-focusing design are low-intensity peaks.
available having resolutions in excess of one .• In the electron impact source, the electron
part in 50,000 with an electron impact source, energy is usually adjusted to 50-70 eV, which
and greater than one part in 3000 with a spark is considerably above the appearance potential
source. The double-focusing geometry is neces- for molecular and fragment ions. For simplifica-
sary with spark source operation because of the tion of a complex spectrum, the bombarding
wide energy spread of ions produced in the energy can be reduced to provide sufficient
spark. With both single and double focusing, energy to ionize the molecule but not enough
the resolution varies directly with the radius of to rupture bonds, thus achieving a spectrum
the analyzer tube and inversely with the width consisting .primarily of molecular ions. Mass
of the slits located in the ion source and ion ions appearing in the normal mass spectrum
collector regions. Sensitivity, the abundance of correspond to the various atoms and combina-
the ions collected per unit sample charge, varies tions of atoms in the original molecule. The
inversely with slit width, and a compromise pattern of mass-ion intensities observed is
must be made between resolution and sensitiv- independent of pressure. Differences in the
ity. Combined gas chromatography-mass spec- patterns obtained for various compounds. can
trometry with appropriate data processing units be used as the basis for the analysis of complex
is a popular type of instrumentation. Separation mixtures. Quantitative analysis is based on the
of complex mixtures into individual components ion current varying linearly with the partial
as well as analysis is accomplished by mating pressure of the gas.
these two types of instrumentation. Recently, Some of the common uses for mass spectrom-
two mass analyzers have been used in tandem etry include analysis of petroleum products,
for the technique termed MS/MS. The first identification of drugs, determination of the
analyzer is followed by a reaction region and structure of organic molecules, determination
then by a second analyzer. Chemical ionization of trace impurities in gases, residual vacuum
is commonly used for the source of positive studies and leak detection in high-vacuum sys-
ions. In this technique, a spectrum is obtained tems, geological age determinations, tracer
of ions at a single m/e in the initial spectrum. techniques with stable isotopes, determination
MASS SPECTROMETRY 712

of unstable ionic species in flames, identifica- cent historical example of mathematical physics
tion of compounds separated by gas chroma- we may cite to contrast with the classical atomic
tography (combined gas chromatography-mass theory. Here mathematical abstractions, Min-
spectrometry), trace element analysis in metals kowski space 4-vectors, Riemannian tensors,
and other solids, and microprobe studies of etc. were invented or adopted from the stock-
surfaces and of surface composition of various in-trade of pure mathematicians, with analytical
materials. properties that were seen to match those of the
A. G. SHARKEY, JR. data of experimental physics-velocities, forces,
field variables, etc. Then the logical (i.e., mathe-
matical) consequences of relations among these
References abstractions predicted new and unexpected
1. Duckworth, Henry E., "Mass Spectroscopy," Cam- relations among either already known or as yet
bridge, U.K., Cambridge Univ. Press, 1958. undiscovered data of experimental physics. The
2. Beynon, J. H., "Mass Spectrometry and is Appli- construction of a self-consistent mathematical
cations to Organic Chemistry," Amsterdam, Elsevier description of all physical phenomena, without
Publishing Co., 1960. the use of hypothetical building blocks of any
3. Biemann, K., "Mass Spectrometry: Organic Chemi- kind, is the aim of mathematical physics as
cal Applications," New York, McGraw-Hill Book distinct from theoretical physics.
Co., 1962. As a more recent example of this same con-
4. McLafferty, F. W. (Ed.), "Mass Spectrometry of cept, one may cite the deduction of conserva-
Organic Ions," New York, Academic Press, 1963. tion laws from generalized symmetry principles.
5. Hill, H. C., "Introduction to Mass Spectrometry," The classically familiar conservation laws: ener-
2nd Ed., London, Heyden and Son, Ltd., 1973. gy, momentum, angular momentum, have for
6. Hamming, Mynard, C., and Foster, Norman G., some time now been recognized as logical con-
"Interpretation of Mass Spectra," New York, Aca- sequences of the homogeneity and isotropy of
demic Press, 1972. time and space. Interpreting these homogene-
7. McLafferty, Fred W., "Tandem Mass Spectrom- ities and isotropy as meaning the invariance of
etry," Science 214 (1981). physical laws under translation and rotation,
8. Cooks, Graham R., and Glish, Gary L., "Mass Spec- one is led naturally to the theorem that each
trometry/Mass Spectrometry," C & EN, November invariance principle corresponds to an appro-
30, 1981. priate conservation law. This theorem has yield-
ed very significant discoveries in the study of
Cross-references: FIELD EMISSION, IONIZATION, elementary particles and nuclear structure where
ISOTOPES, SPECTROSCOPY. the most cunningly devised classical models
have been not only fruitless but actually mis-
leading. A non-technical account of this subject
appears in Chapter 27 of R. K. Adair's text
MATHEMATICAL PHYSICS "Concepts in Physics," New York, Academic
The term "mathematical physics" is almost Press, 1969.
synonymous with "THEORETICAL PHYSICS," The activities of mathematical physicists have
but their difference is significant. It is like the resulted in the invention of new mathematical
difference between the descriptions of the abstractions some of which were at first rejected
electromagnetic field by Maxwell and by Fara- by pure mathematicians as illogical, only later
day respectively. The theoretical (nonmathe- to be granted a respectable status in the vocabu-
matical) description draws on analogies between lary of pure mathematics. Examples include
elements of the field and familiar mechanical Oliver Heaviside's operational calculus, J. Willard
models-stretched strings, compressed fluids, Gibbs' vector analysis, and P. A. M. Dirac's
vortex motion, etc.; the mathematical descrip- delta-function techniques. On the other hand,
tion made use of the abstract analytical proper- many branches of pure mathematics which
ties of the elements of the field to set up a initially had been regarded as so abstract as to
purely symbolic description without mechanical be entirely "useless," have been found by
models. Classical theoretical physics was largely mathematical physicists to serve as remarkably
mathematical in content, but was nevertheless useful tools in describing physical phenomena.
based on mechanical models in the spirit of Examples include non-Euclidean geometry in
Faraday's theory of the electromagnetic field. the problems of COSMOLOGY; function space
The atom and interactions between atoms were in modern QUANTUM MECHANICS; spin or analy-
regarded as the "real," "external" objects in sis, or the theory of binary forms, in quantum
terms of which all physical phenomena could FIELD THEORY. Again collaboration between
be explained. The mathematical formalism was mathematical physicists and mathematicians has
merely a handy tool or language in terms of in recent years resulted in the construction of
which to set up the explanation. The atoms new disciplines of great value, examples being
themselves were not explained, but regarded as group theory, operations analysis, the theory of
the fundamental "building blocks" of the phy- random functions, information theory and CY-
sical world. BERNETIcs. The names of many contemporary
Einstein's theory of RELATIVITY is a magnifi- scientists are involved here, including Eugene P.
713 MA THEMA TICAL PHYSICS

Wigner, John von Neumann, C. E. Shannon, shock formation in high-speed phenomena. For
Norbert Wiener and many others. these purposes there are now several libraries of
On closer examination it becomes difficult to program subroutines useful in the study of
distinguish clearly between mathematical phy- literally thousands of unexplored problems.
sics and applied mathematics; very frequently Computational projects that were never even
the same individual may be responsible for dis- contemplated because they would have taken
coveries in both areas. Classical examples of this decades to complete can now be undertaken
may be cited: Isaac Newton, Laplace, Carl and completed in hours or even minutes. The
Friedrich Gauss, Henri Poincare, David Hilbert, other side of the coin is: such masses of numeri-
Ernst Mach, A. N. Whitehead. Evidently our cal data are generated that results can take years
attempt to define mathematical physics is de- to digest and interpret. This difficulty is amelio-
generating into a simple catalog of items with rated in the simpler cases by the recently devel-
only a vague hint of general characteristics oped graphics display capability.
common to all particulars. Physics has some- The most significant advance to date would
times been defined as what physicists do, and seem to be the possibility of programming com-
one is expected to recognize the physicist with- puters to perform abstract algebraic analysis
out need for further definition than his own without the use of purely numerical input. This
affirmation. Mathematical physics may then be mayor may not help eliminate the one most
defined as what physicists do with mathematics, serious weakness of computer science: computer
or what mathematicians do with physics, or programming is so absorbing an occupation that
some superposition of the two. As the history an expert can hardly be expected also to under-
of mathematical physics unfolds it becomes stand the historic significance, or even relevance,
apparent that activity tends to cluster in a few of the major problems at the frontiers of knowl-
fruitful directions at anyone time. Current edge in mathematical physics. Perhaps the most
interests can be judged from the contents of the serious error committed by schools of computer
leading journals devoted to the subject; among science has been to exempt their students from
these the reader should consult the Journal of a solid course on mathematical physics. Physi-
Mathematical Physics, and the Physical Review, cists who wish to keep in touch with advances
published by the American Institute of Physics; in computer science should watch the continu-
The Proceedings of the Cambridge Philosophi- ing series of annual volumes "Methods in Com-
cal Society; Comptes Rendus (French Academy putational Physics" (Alder, Fernbach, Roten-
of Sciences); Progress of Theoretical Physics berg, Eds.; Academic Press).
(Japan); Nuovo Cimento (Italy);Jndian Journal A philosophy of mathematical physics has
of Theoretical Physics; Zhurnal Eksperimental' gradually evolved with all this creative activity.
noy i Teoreticheskoy Fiziki (USSR) (in English For current thinking in this area, the reader is
Translation "JETP"); and other translations referred to two pertinent journals that have
published by the American Institute of Physics. appeared in the past few years: "Foundations
Probably the most popular fields in recent years of Physics, and International Journal of Theo-
have been in the wide application to solid-state retical Physics, both from Plenum Press. Semi-
physics and statistical mechanics of quantum popular expositions are available in a number of
field theoretical techniques introduced initially recent texts in addition to the one by Adair,
to deal with the phenomena of high energy cited above: Kenneth W. Ford, "The World of
physics-nuclear interactions, creation and des- Elementary Particles, 1967; F. A. Kaempffer,
truction of particles. etc. "The Elements of Physics," 1967; and Kenneth
During the last decade the explosive growth W. Ford, "Basic Physics," 1968; all from Blais-
in the availability of miniature computers and dell Publishing Co.
the wide accessibility of large computer facili- As for college level texts on mathematical
ties have had a revolutionary impact on the physics, there are very few that are organized
thinking of mathematical physicists, if not on around physical concepts. We may refer to the
mathematical physics itself. Exceptionally com- classic series of volumes by Arnold Sommerfeld,
plicated problems that were easily prov~n and those by Slater and Frank, as prototypes.
"soluble in principle" have become soluble m The standard modern work is the two-volume
practice, and this fact alone has freed creative set by Morse and Feshbach, "Methods of Theo-
thinkers to ask ever more elaborate and useful retical Physics," New York, McGraw-Hill, 1953.
questions- e.g., finding the eigenvalues of very A less ambitious volume, William Band, "An
large matrices, solving nonlinear differential Introduction to Mathematical Physics," New
equations and adjusting parameters to make the York, Van Nostrand Reinhold, 1959, served as
solutions match experimental data, simulating an overall survey for advanced undergraduates.
the behavior of systems with many degrees of There is now a multitude of excellent texts
freedom * and the resulting understanding of whose major emphasis is the various mathemati-
*A good example: Abraham, F. F., "Computer Sim- cal techniques employed in theoretical physics,
ulation of Diffusion Problems ... ," IBM Data Pro- and we cite only three: Margenau and Murphy,
cessing Division, and The Materials Science Depart- "The Mathe~tics of Physics and Chemistry,"
ment, Stanford University, Palo Alto, California, New York, Van Nostrand Reinhold, 2 volumes!
1971. 1956 and 1964; George Arfken, "Mathematica
MATHEMATICAL PHYSICS 714

Methods for Physicists," New York, Academic form a maximal set, and if the position of the
Press, 1966; and James T. Cushing, "Applied particle is known, it is in a pure state; similarly,
Analytical Mathematics for Physical Scientists," the momentum components (Px, Py, pz) form
New York, John Wiley & Sons, 1975. a maximal set and a particle whose momentum
is known is also in a (different) pure state.
WILLIAM BAND A pure state in which the observables of a
maximal set S have known values is termed an
Cross-references: COSMOLOGY,
CYBERNETICS, eigenstate of S and the observable values are
FIELD THEORY, MATHEMATICAL PRINCIPLES called their eigenvalues in the eigenstate. If
OF QUANTUM MECHANICS, QUANTUM THEORY, (Po, qo, ... , wo) are the eigenvalues, the eigen-
RELATIVITY, THEORETICAL PHYSICS. state is sometimes denoted by the symbol
Ipo, qo, ... , Wo >; in this article, pure states
will be denoted more concisely by Greek letters.
The eigenstates of S are represented by mutu-
ally orthogonal vectors, called the base vectors,
MATHEMATICAL PRINCIPLES OF defining a frame of rectangular axes F in an ab-
QUANTUM MECHANICS stract representation space. The number n of
Classical Newtonian mechanics assumes that a such eigenstates is usually infinite, but for
physical system can be kept under continuous simplicity, it will first be assumed that each ob-
observation without thereby disturbing it. This servable has only a finite number of eigenvalues
is reasonable when the system is a planet or and hence that the number n is finite; the repre-
even a spinning top, but is unacceptable for sentation space is then a straightforward gener-
microscopic systems such as an atom. To ob- alization of ordinary space to n dimensions,
serve the motion of an electron, it is necessary with the additional requirement that the com-
to illuminate it with light of ultrashort wave- ponents of a vector will be permitted to take
length (e.g., 'Y-rays); momentum is transferred complex values. Any pure state of the system
(not necessarily an eigenstate of S) is repre-
from the radiation to the electron and the parti-
cle's velocity is therefore continually disturbed. sented by a vector a in the representation space.
If (ai, a2,· .. ,an) are the components of a
The effect upon a system of observing it can
never be determined exactly, and this means with respect to F, and these components are
that the state of a system at any time can never arranged as a column matrix, this matrix is said
be known with complete precision; this is to provide an S-representation of the state. The
Heisenberg's uncertainty principle. As a conse- state, vector, and column matrix are all denoted
bya.
quence, predictions regarding the behavior of
microscopic systems have to be made on a The scalar product of two vectors a, {3 is de-
probability basis and complete certainty can noted by (a, (3) and is defined in terms of their
n
rarely be achieved. This limitation is accepted
and is made one of the foundation stones upon components in F to be L at{3i, where OIt is the
which the theory of quantum mechanics is i=1
constructed. complex conjugate of OIj. v(a,OI) is called the
Any physical quantity whose value is mea- norm of a and corresponds to the length of an
sured to determine the state of a physical sys- ordinary vector. All vectors representing pure
tem is called an observable. Thus, the coordi- states are taken to have unit norms and the S-
nates of a particle, its velocity components, its eigenstates are accordingly represented in the
energy, or its angular momentum components S-representation by the columns (1, 0, ... , 0),
are all observables for the particle. A pair of ob- (0, I, ... ,0), etc.
servables of a system are compatible if the act Suppose a system is prepared in the state 01 at
of measuring either does not disturb the value time t. The probability that the system can be
of the other. The cartesian coordinates (x, y, z) observed in the state {3 at an instant immediately
of a particle are mutually compatible, but the subsequent to t is taken to be 1(01, (3) 12. This
x-component of its momentum px is incompati- event is termed a transition of the system from
ble with x (similarly, y, py are incompatible, 01 to {3. If {3 is identical with 01, the probability of
etc.). (p, q, ... w) constitute a maximal set of the transition is unity, as we expect. The proba-
compatible observables if they are compatible bility of a transition from 01 into the eigenstate
in pairs and no observable is known which is (1,0, ... ,0) is found to be 101112. This pro-
compatible with everyone of them. In quan- vides a physical significance for the components
tum mechanics, the state of a system at an in- of 01 in the frame F.
stant is fully specified by observing the values If the system is prepared in a state a on a
of a maximal set of compatible observables. number of occasions and an observable a is mea-
Such a state is called a pure state. The act of sured immediately after the preparation on each
observing a system in a pure state disturbs the occasion, the values obtained will usually differ.
system in a characteristic manner; the system is If 01 is an eigenstate of a with eigenvalue ai,
accordingly said to be prepared in the pure then a will take this value with complete cer-
state by the observation. The cartesian coordi- tainty. If, however, this is not the case, a will
nates (x, y, z) of a spinless (see below) particle take all its possible eigenvalues aj with associ-
715 MA THEMA TICAL PRINCIPLES OF QUANTUM MECHANICS

ated probabilities Pi, and its mean or expected representation of the states and observables of a
value ii in the state a is given by ii = L, Piaj. In system can be developed. If a, a' are column
j
matrices representing the same state in the two
the S-representation, an n X n matrix is associ- representations and a, a' are square matrices
ated with every observable a. This is said to representing the same observable, the transfor-
represent a and is also denoted by a. Then, mation equations relating the two representa-
tions take the forms
ii = at aa, where a is the column matrix specify-
ing the 'state and at is its conjugate transpose. a' = ua, a' = uau- l ,
If a is a real observable, the matrix a is Her-
mitian (i.e., at =a); this ensures that is real. a where u is a unitary n X n matrix (i.e., u- l = ut)
If a is one of the observables belonging to S, its characteristic of the two representations. In
matrix is diagonal, and the ith element in the particular, the Hamiltonian H transforms as an
principal diagonal is the eigenvalue aj of a in the observable.
ith S-eigenstate; in this case, a=L,ata;a;= The type of representation we have been de-
; scribing is called a Schrodinger representation.
It is also possible, by rotating the frame in the
L,la;1 2 a;, which is clearly correct, since lail 2 representation space appropriately, to keep a
; constant as t increases. If this is done, the ma-
is the probability the system will be observed in trices a, b, ... representing the observables of
the ith S-eigenstate. the system necessarily become functions of t
A necessary and sufficient condition that the and are not constants as assumed previously.
state a should be an eigenstate of a in which a This type of representation is called a Heisen-
takes the eigenvalue ao is that a should satisfy berg representation. The Schrodinger equation
the matrix equation aa = aoa. a is said to be above ceases to be valid and is replaced by
sharp in the state a. equations of motion for the observable ma-
Observables are often introduced as functions trices a, b, ... taking the form
of other observables, e.g., the kinetic energy T
of a particle of mass m is defined in terms of its da L
momentum by T = (p:2 + pj + pI )/2m. If a is dt = t; [H, a],
defined in terms of u, v, ... by the equation
a = ct>(u, v, ... ), then this equation also defines
the matrix representing a in terms of the ma- where [H, a I =Ha - aH is called the commuta-
trices representing U,V, etc. There is, however, a tor of Hand a. If H and a commute, [H, al = 0
proviso: wherever a product uv occurs in ct>, this and a is a constant of the system as already
must be replaced by! (uv + vu) before matrices stated.
are substituted, whenever the matrices u,v are If the number of basic eigenstates of a repre-
such that uv =1= vu; this is called symmetrization. sentation is infinite but enumerable (i.e., they
If uv =1= vu, we say that u,v do not commute can be placed in a sequence el, e2, e3," .), the
and it is then found that the observables u,v are matrices appearing in the theory will have an
incompatible. infinite number of rows. This creates conver-
Thus far, the evolution of a system with time gence difficulties in most formulae (e.g., that
has not been considered. Suppose a system is for ii), but otherwise, the form of the theory is
prepared in a state a(O) at t = 0 and is not there- not affected. If, however, some of the observ-
after interfered with by further observation. Its abies upon which the representation is based
state aCt) at a later time t is then determined by have eigenvalues which are spread continuously
the Schrodinger equation, over real intervals, the basic eigenstates will not
be enumerable and the matrix-type representa-
da tion must be abandoned. Such an observable is
Ha = ttl dt' L= y'(=T) said to have a continuous spectrum of eigen-
values; examples are provided by the coordi-
where 1i = h/2rr (h is Planck's constant). Using nates (x, y, z) of a particle. In these circum-
the S-representation, a is the column matrix stances, the discrete sequence CXn of vector
(aI, a2, ... ,an) and H is an n X n matrix, components is replaced by a function I/;(p, q, ... )
characteristic of the system, called its Hamil- of the continuous eigenvalues p, q, ... of the
tonian. H is Hermitian, and in the case of an observables bf the representation set S. I/; is
isolated system, represents the energy observ- called a wave function. If the system is in the
able of the system. An important property of H state specified by 1/;, the probability that if
is that, if a is the matrix representing some ob- p, q, etc. are measured they will be found to
servable and a and H commute, then a is a con- have values lying in the intervals (p, p + dp),
s'tant of the system; this means that, if a has a (q, q + dq), etc., respectively is I/; *I/;dp'dq ....
sharp value at one instant, it keeps this sharp The representation space is now a function
value for all t, and otherwise, its probability space called a Hilbert space in which each vec-
distribution over its eigenvalues never changes. tor corresponds to a wave function and the
If S' is a maximal set of compatible observ- scalar product of the vectors corresponding to
abies different from S, an alternative S'- the wave functions I/;(p, q, ... ), ct>(p, q, ... ) is
MATHEMATICAL PRINCIPLES OF QUANTUM MECHANICS 716

defined by Q(Q + l)h 2 , where Q = o,!.


1, ~, ... ,and of Mz

J
aremh wherem=-Q -Q+l -"£+2 ... Q-l Q
Part ~f the angular' mome~tum ~f a ~ysten: i~
(I/I,I/J) = I/I*qxJpdq···, due to the orbital motion of its particles about
o and the remainder is contributed by the in-
the integration being over all possible eigen- trinsic angular momentum or spin of the parti-
values of p, q, etc. cles. Let (sx,Sy,sz) be the components of spin
In this type of representation, observables are of a particle and S2 the square of its miJgnitude;
represented by linear operators which can oper- then, if the particle is a fundamental one (e.g.,
ate upon the functions of the Hilbert space, an electron, proton, or vector meson), s 2 will
transforming them into other functions belong- have only one eigenvalue Q(Q + l)fz 2 and the
ing to the space. For example, if the system particle is then said to have spin Qfz. For an elec-
comprises a single particle and the representa- tron or a proton, Q=! and for a vector meson
tion being used is based on the particle's coordi- Q = 1. The eigenvalues of a spin component are
nates (x, y, z), its state is specified by a wave then - Qfz, (- Q+ l)/i, ... , Qfz. Thus, any spin com-
function 1/1 (x , y, z) and its momentum compo- ponent of an electron has but two eigenvalues,
nents are represented by operators Px = -! 11, ! fl; a spin component of a vector meson
(fl/t)(a/ax), py = (fl/t)(a/ay), pz = (fl/t)(a/az). has three eigenvalues -fl, 0, fl.
This representation permits the immediate deri- If all aspects of the state of a fundamental
vation of the important commutation rules for particle except its spin are ignored, (s2 ,sz) con-
the coordinates and momenta, namely [Px,x] = stitute a maximal set of compatible observables.
fl / t, [p x, y] =0, etc. Counterparts of all the re- Employing a representation based on this set,
sults given earlier for a matrix representation the number of basic eigenstates for a particle of
can now be written down. Thus, if a is the spin Qtz will be (2Q + 1) and the general spin
operator representing some observable of a sys- state of the particle will be represented by a
tem which is in the state I/I(p, q, ... ), the ex- column matrix (O:Q, O:Q-l ,O:Q-2, ..• ,O:-Q); for a
pected value of a is given by particle in this state, 100k 12 is the probability of

J
measuring Sz to take the value kl!. In the special
case of an electron or a proton, Q =! and its
(i= I/I*al/ldpdq···, spin state is specified by a column (0:1/2,0:-1/2)
called a spinor; in this representation, the three
components of spin are represented by 2 X 2
where a operates on the wave function 1/1 on its matrices, thus:
right. The necessary and sufficient condition
for 1/1 to be an eigenstate of a with eigenvalue
ao is al/l = ao 1/1. Finally, the Schrodinger equa-
tion remains valid, but H is now an operator
and 0: is replaced by the wave function 1/1; for a
single particle moving in a conservative field in The three 2 X 2 matrices appearing in these
which its potential energy is V(x, y, z), em- formulae are called the Pauli matrices and are
ploying the coordinate representation, the total denoted by ax, ay, az.
energy H = V + (Px2 + pj + pi )/2m, Schroding- In recent years, much thought has been given
er's equation becomes to the problem of constructing Schrodinger
equations for the fundamental particles, which
tz2 1/1
--'IPI/I+VI/I=tfi-.
a remain unchanged in form when subjected to
2m at the group of Lorentz transformations of special
relativity theory. Dirac's equation for the elec-
The simplest example of an observable with a tron is one such equation. The fields governed
discrete spectrum of eigenvalues is the angular by these equations have themselves been treated
momentum of a system about a point O. If as physical systems and quantized, thus leading
(Mx , My, Mz) are the three components of to a quantum theory of fields. The symmetry of
angular momentum and M2 = M} + Mj + Ml these fields under rotations and other transfor-
is the square of its magnitude, the following re- mations has been fully exploited by the applica-
lations hold between the matrices or operators tion of group theory and the properties of Lie
representing these observables alge bras. Details will be found in Ref. 10 of the
bibliography below.
[My,Mzl =tItMx , [Mz,Mxl =tfiMy ,
D. F. LAWDEN
[Mx , My] =ttzMz .
These are the angular momentum commutation References
rules. M2 commutes with each of the compo-
nents. It follows that the components are mutu- 1. Bohm, A., "Quantum Mechanics," Berlin,
ally incompatible, but each is compatible with Springer-Verlag, 1979. r
M2. Simultaneous eigenstates of M2 and Mz 2. Cunningham, J., and Newing, R. A., "Quantum
exist in which the eigenvalues of M2 are Mechanics," Edinburgh, Oliver and Boyd, 1967.
717 MATRICES

3. D'Espagnat, B., "Conceptual Foundations of n


Quantum Mechanics," Reading, Mass., W. A. Cij = L aik bkj.
Benjamin, 1976. k=1
4. Jauch, J. M., "Foundations of Quantum Me-
chanics," Reading, Mass., Addison-Wesley, 1968. Generally, multiplication is not commutative,
5. Lawden, D. F., "Mathematical Principles of but it is always associative. The transpose of a
Quantum Mechanics," London, Methuen, 1967. product is given by (ABC)T = CTBT AT.
6. Matthews, P. T., "Introduction to Quantum Me- If A is square, then A is symmetric if A = AT,
chanics," New York, McGraw-Hill, 1974. while if A = - AT it is skew-symmetric. A qua-
7. Merzbacher, E., "Quantum Mechanics," New dratic form SA in the n variables contained in
York, Wiley, 1970. the column x = {Xl X2 ... Xn} may be written
8. Pauli, W., "General Principles of Quantum Me- as SA = x T Ax, where A is symmetric.
chanics," Berlin, Springer-Verlag, 1980. Let det A == IA I denote the determinant of
9. Pilkuhn, H. M., "Relativistic Particle Physics," the square matrix A. If det A =f= 0, A is non-
Berlin, Springer-Verlag, 1979. singular. Then the definition of matrix multi-
10. Schweber, S. S., "Relativistic Quantum Field plication ensures that
Theory," New York, Harper, 1961.
det (AB) = det A det B
Cross-references: MA THEMA TICAL PHYSICS, MA· where A and B are square matrices of the same
TRICES, QUANTUM THEORY, SCHRODINGER ordet.
EQUATION, THEORETICAL PHYSICS, WAVE The cofactor of aij in the square matrix A
MECHANICS. equals (-I )i+j times the determinant formed by
crossing out the ith row and jth column in A.
The sum of the n elements in any row (or col-
MATRICES umn) multiplied respectively by their cofactors
Matrix notation and operations are introduced equals det A; the sum of the n elements in any
into theoretical physics so that algebraic equa- row (or column) multiplied respectively by the
cofactors of another row (or column) equals
tions and expressions in terms of rectangular zero. We have
arrays of numbers can be systematically
handled. det AT = det A; det QA = an det A.
An m X n matrix A = (aij) possesses m rows
and n columns, having in double suffix notation The adjoint of A, denoted by adj A, is the
the mn elements arranged to the form transpos\! of the matrix formed when each
element of A is replaced by its cofactor. We
... a ) have

.•. a~~ .
A adj A = (adj A)A = (det A)I
and det(adj A) = IA In-I. The unique reciprocal
or inverse of a non-singular matrix A is given by
The general element aij may be a complex num-
ber. If all elements are zero, A is the null A-I = (adj A)/det A.
matrix 0 or O. When n = I, the matrix is a col-
umn vector v. If m = n, A is square of order n; This has the property that AA -1 = A-I A = I. It
if all elements not on the leading diagonal all, follows that
a22, . . . , ann are zero, the matrix is a diagonal
matrix D, while D is the unit matrix I if
all =a22 = ... =ann = 1.
and
(A -1 )T = (AT)-l.
The sum or difference of two m X n matrices
A and B is an m X n matrix C = A ± B, where Linear equations relating n variables Xi to n
Cij = aij ± bij. The elements of QA are Qaij- variables Yi may be expressed as x = Ay; if
The transpose of A is denoted by AT; this is det A =f= 0, the unique solution for the Yi in
an n X m matrix whose ith row and jth column terms of the Xi is y = A-I x.
are identical respectively with the ith column The rank of an m X n matrix A is the order of
and jth row of A. Hence v T is a row matrix the largest non-vanishing minor within A; of the
with m elements. For convenience, the column m linear expressions Ax, the rank gives the
v is often printed as a row with braces, number that are linearly independent. The m
{VI V2" ·vrn}. linear equations in n unknowns Ax = d, where
The product C = AB is only defined when the d is a column with m elements, are consistent if
number of columns of A equals the number of the rank of A equals the rank of the augmented
rows of B; A and B are then conformable for matrix (A d).
multiplication. If A is m X nand B is n X p, If the m linear equations Ax = d are incon-
then C is m X p, with sistent in the n unknowns Xl, X2,' .. , Xn, where
MATRICES 718

n < m and rank A = n, there the "best" solu- But if x' = Ax, where A is not orthogonal,
tion for these n unknowns in the least squares f' = Af defines a contravariant vector, but
sense is given by the normal equations g' = AT-l g defines a covariant vector. The
product gTf is now an invariant.
If A is square of order n, then the n homo-
geneous equations Ak = Ak require
AT A being non-singular when rank A = n.
An n X n matrix A is orthogonal if AT A = I, det(A - AI) = 0
that is, if A-I = AT. Clearly, IA 1= ± 1. If Ci de-
notes the ith column of A, then CiT Ci = 1 and for non-trivial solutions. This characteristic
CiT Cj = 0 if i =F j; similar results hold for the equation possesses n characteristic or latent
rows. The transformation x' = Ax represents a roots; if they are all distinct, n corresponding
rotation of rectangular Cartesian axes in three characteristic or latent vectors exist. The vector
dimensions; IA I = +1 if the right-handed char- ki corresponding to the root Ai may consist of
acter is preserved. The element Aij equals the the n cofactors of any row of A - Ail; at least
cosine of the angle between the xi' and Xj one non-trivial row exists.
axes. If N is skew-symmetric, then The Cayley-Hamilton theorem states that a
square matrix A satisfies its own characteristic
(I + N)-1 (I - N) equation.
The following properties are important. If A
is orthogonal. is real and symmetric, and if Ai and Aj are dis-
Matri(;es with complex elements are manipu- tinct, then kiTkj = 0; these two vectors are or-
lated according to the same rules. A square thogonal. Again, if A is real and symmetric,
matrix H is Hermitian if H*T = H, and skew- the n values of A are real, but if A is real and
Hermitian if H*T = - H, a star denoting the com- skew-symmetric, these n values are pure imagi-
plex conjugate. A unitary matrix U satisfies nary. For a real orthogonal matrix A, IAil = 1
U*T = U-l, the columns (and rows) enjoying for all i. The characteristic roots of A-I are
the properties Ci*T Cj = I if i = j and 0 if i =F j. If I fAr·' ki still being the corresponding vectors. If
N is skew-Hermitian, then (I + N)-1 (I - N) is IA 1 is the largest of the moduli of the n roots,
unitary. then as r ~ 00, Arx ~ k 1 , where x is an arbitrary
First- and second-order tensors, arising in column.
many physical problems, may be expressed in If A is symmetric, n mutually orthogonal
matrix notation. If x' = Ax denotes a rotation characteristic vectors may be found even if the
of rectangular Cartesian axes, A being orthog- roots are not all distinct. If each vector ki is
onal, then f' = Af and F' = AF AT define normalized, i.e., divided by y'(kiTki), then the
Cartesian tensors of orders I and 2 respectively. matrix
Evidently, if u and v are vectors or tensors of
order I, then uTv is the invariant scalar product, A = (kl k2 ... k n )
Fu is a tensor of order I and uv T is a tensor of is orthogonal, and the product AT AA equals D,
order 2. For example, the vector product u X v the diagonal matrix consisting of the n roots
may be written as Uv, where arranged down its leading diagonal in order.
The matrix A is said to be diagonalized, and D
U =( ~3 -~3 _:2). is the canonical form of A.
More generally, if A is a general square matrix
of order n, then n independent vectors ki may
-U2 Ul 0 be found corresponding to the n roots if the
latter are distinct. Then
U is a tensor of order 2 if u is a vector; UT is
the dual of u.
The differential operator (column)
transforms A into diagonal form, thus:

V == { a! a! a!
l' 2' 3 }
T-IAT=D.
If some of the characteristic roots are identical,
is a vector or tensor of order I ; namely it mayor may not be possible to find n corre-
sponding independent columns (though it is
V=AV. always possible when A is symmetric). If it is
possible, T is non-singular, and as before
Thus if cp is a scalar and f a vector,
T-IAT=D.
grad cp == Vcp is a vector
If this is not possible, there still exists a distinct
div f\== VT f is a scalar non-singular matrix T such that
curl f == Vf is a vector. T- 1 AT = C,
719 MATRICES

where C, no longer diagonal, is the standard then TT AT and TTBT are both diagonal. The
canonical form of A, containing submatrices transformation x = Ty yields the two sums of
of the form squares SA = yT(TT AT)y and SB = yT(TTBT)y.
In particular , if SA is positiv~ definite, TT AT
will eq!!,al I if new columns kj are used in T,
where k i = kdv(ki T Aki) .
. Necessary and sufficient conditions for the

corresponding to the repeated roots.


for all real x '*
real quadratic form SA to be positive definite
0 are that the n determinants
In each case , respectively, a11, Ia11 al21 ' a11 al2 a13 , ... ,detA
An = T DnT-I or TenT-I. a21 a22
This enables us to define functions of square
matrices whose canonical forms are diagonal,
namely, if A = T D r l . If should be positive. This ensures that the n char-
acteristic roots of A are all positive.
co Finally, matrices may often usefully be parti-
f(x) = L ar xr tioned employing matrices within matrices.
r=O Multiplication may still be performed provided
each individual matrix product is permissible.
is convergent for Ix I < R, then define For example,
co

f(A) =L arAr ( a bT)(e fT) = (ae + bTg afT + bTH)


r=O c D g H ce + Dg efT + DH

T(f arDr) T-
where a, e are scalars , b, c , f, g are I X 3 col-
= 1 umns and D, Hare 3 X 3.
r=O Applications Differential Equations If

dx/dt + Ax = f,
(fO'l) 0 ...) A being constant and
= T\ ~ f(~2): :: T-I f= {fl(t), fz(t), .. . , fn(O},
then if T diagonalizes A, x = Ty yields n non-
simultaneous equations dy/dt + Dy = T-I f. If
provided IA I I, IA21, ... , IAn I < R. Thus, for yo(t) is a particular integral,
example, if

X(t)=T(-~lt::: 0 )

o .. . e- Ant
then

eJ =(
COS e sin e) . X [T-Ix(O) - Yo(O)] + Tyo(t).
Geometry. In three-dimensional Cartesian co-
-sin e cos e ordinates,
Similar remarks apply to Hermitian matrices aTx + d =0
H. If Hk = Ak, all the n values of A are real and
n vectors can always be found such that represents a plane, the perpendicular distance
from x I being
kj*Tkj = Oij.
(aTxI + d)/V(aTa).
The unitary matrix U = (k l k2 ... k n ) trans-
forms H into diagonal form. The equation x T Ax = d represents a central
TW0 quadratic forms SA = x T Ax, SB = xTBx, quadric. If A diagonalizes A, the rotation
where A and B are symmetric and of the same
order, may be reduced simultaneously to sums x=Ax'
of squares . The equations Ak = ABk demand
det(A - AB) = 0; this possesses n roots Ai and n yields x'TDx' = d. The vectors k l , k 2 , k3 spec-
corresponding vectors ki. If ify the three principal axes, of semi-lengths
y(d/Ai) when d/Ai O. >
T = (k l k2 ... k n ) Dynamics. The rotational equations of mo-
MATRICES 720

tion of a rigid body with respect to moving axes When k is large, approximate solutions are pos-
fixed in the body and with the origin fixed in sible if the terms of R-l(dR/dz)g arising from
space or at the centre of mass are the non-diagonal elements of R-l(dR/dz) are
much smaller in magnitude than the correspond-
g=Jw+ruw ing elements of kDg. Then
where g = couple, w = angular velocity, Q T =
dual w. J denotes the inertia tensor - ~mXX, dgj ~ -ikDj gj - ~R-l -dR) gj
where XT = dual x. Explicitly, dz dz jj

=(-~ -; ~;).
and
J
-G -F C
The rotational kinetic energy is ! w TJw. When
principal axes of inertia are chosen, J is diagonal,
yielding Euler's equations.
Small oscillations about a position of equi- giving rise to the characteristic waves propagat-
librium are investigated by considering the ing in the medium.
second order approximations Special Relativity. If x = {ict, x,y, z} refers
to an inertial frame S, and if a second parallel
K.E. = c'J.TAq, P.E. = qTBq, frame S' has uniform relative velocity U along
Ox, the Lorentz transformation is x' = Aux,
q containing n generalized coordinates measured where
from their equilibrium values. A and B are con-
stant symmetric matrices. If the n roots of -iU~/c 0
det(A + AB) = 0 are considered, and if q =Tx,
where T = (k 1 k2 ... k n ) reduces A to the unit ~ 0
matrix I, the equations of motion are
o
Xi + (l /Ai )Xi = O. o 0
The elements of x are the normal coordinates;
each individual solution Xi in terms of the q's is A is orthogonal, and ~ = l/VO - U2 /c 2 ). We
a normal mode of period 21"./Ai. have Av Au = A w , where
Electromagnetic Theory. Maxwell's 3 X 3
stress tensor in matrix notation is W = (U + V)/O + UV/c 2 ).

T= ! [2eee T + 2~hhT - e(eTe)I - ~(hTh)Il For the general velocity v relating parallel frames,
in mks rationalized units. The field exerts a ~ -i~vT/c )
force across an area element n5S equal to (
Tn 5S. A = i~v/c I + (~- l)vvT/v2 .
When electromagnetic waves are propagated
in an ionized medium the equation The operator 0 = {a/icat, a/ax, a/ay, a/az}
is a four vector satisfying 0' = AD; so are the
curl curl e = k2(I + M)e four-current i and the four-potential b,
arises, where

in the usual notation with collisions neglected;


here, n =unit vector directed along the external where a is the vector potential. They satisfy
magnetic field, NT = dual n. These equations OTj = 0 (conservation of charge), DTb = 0 (the
may be rearranged in terms of the matrix Lorentz relation). Maxwell's equations in mks
units in free space take the form
f= {Ex, -Ey,ZoHx,ZoHy}
oTF = -iT/eoc2
giving df/dz = -ikTf, where T is a 4 X 4 matrix.
If the characteristic roots Ai(Z) of T are found, oTG=O
and if R diagonalizes T, then the transforma-
tion f = Rg yields where

dg dR 0 ieT/C)
- = -ikDg- R-l - g. F = Ob T - (obT)T =(
dz dz -ie/c -~oH
721 MEASUREMENTS, PRINCIPLES OF

and ence. For example, how many handspans wide


is my desk? This is an elementary example, but
0 J.LOhT) even such a simple question reveals many of the
G= ( essential and fundamental characteristics of
-J.Loh -iE/c measuring processes. First, the search for im-
are tensors of order 2 under a Lorentz trans- proved precision quickly makes desirable the
formation. ET and HT are the respective 3 X 3 acceptance of a standard reference quantity.
duals of e and h. All tensor equations are in- Second, the primitive process of comparison
variant in form in all frames of reference. makes it clear that we can make statements
The tensor of order 2 about the measurement only within certain
limits. (My desk is between 6 and 7 hands pans
T= ! Eo c 2 F F - ! J.Lo -1 G G wide). Our fundamental inability to make exact

= !(EO eTe + J.LohTh -2ie T H/c )


measurements leads to the concept of uncer-
tainty, and a very extensive theory of the
uncertainty of measurement exists.
- 2iEh/c EoeeT+EoEE+J.LohhT+ Measurement Standards Standard, defined
J.LoHH units of measurement have been in use for
many thousands of years and the situation has
contains the energy density, Poynting's vector, been in continuous flux up to the present day.
the momentum density and Maxwell's stress Every country maintains standards of measure-
tensor in partitioned form. ment, not only of the basic quantities such as
Applications may likewise be made to circuit mass, length and time, which we normally think
theory, to elasticity where 3 X 3 stress and strain of in this context, but also of a large number
tensors are defined, and to quantum mechanics, of other items (such as the optical reflectivity
embracing, for example, matrix mechanics and of paper) which have been identified as impor-
the Dirac wave equation of the electron. tant for trade and commerce. We shall restrict
ourselves here to a few physical quantities which
JOHN HEADING are important in scientific work. It will turn out
that it may be a relatively simple matter to
References define a unit; it is usually a much more difficult
problem to realize that unit in the laboratory
Gourlay, A. R., and Watson, G. A., "Computational so as to make possible the calibration of other
Methods for Matrix Eigenproblems," London and instruments.
New York, John Wiley. (a) Mass. The kilogram was defined in 1889
Heading, 1., "Matrix Theory for Physicists," London, as the mass of a certain piece of metal (plati-
Longmans, Green & Co. num-iridium) still preserved at the International
Heading, J., "Electromagnetic Theory and Special Bureau of Weights and Measures near Paris,
Relativity," Cam bridge, University Tutorial Press. France. Copies of this prototype kilogram,
Jeffreys, H., and Jeffreys, B., "Methods of Mathe- compared with the original by carefully refined
matical Physics," Cambridge, The University Press. beam balance techniques, are kept in most
Liebeck, H., "Algebra for Scientists and Engineers," countries to serve as that country's definition
London and New York, John Wiley. of a kilogram. The normal process of compari-
Perlis, S., "Theory of Matrices," Reading, Mass., son using beam balances allows a precision for
Addison-Wesley. mass standards of around one part in 10 8 , mak-
Williams, I. P., "Matrices for Scientists," London, ing mass standards substantially less precise
Hutchinson. than those of length and time.
(b) Length. The original 1889 definition of
Cross-references: DIFFERENTIAL EQUATIONS IN the meter (chosen to be one ten millionth part
PH-YSICS, ELECTROMAGNETIC THEORY, of the quadrant of the earth's surface at the
LORENTZ TRANSFORMATIONS, MATHEMATI- longitude of Paris) was realized initially using
CAL PRINCIPLES OF QUANTUM MECHANICS, engraved marks on a platinum-iridium bar.
QUANTUM THEORY. However, it very soon thereafter became clear,
on the basis of the pioneering work of A. A.
Michelson on optical interferometry, that a
MEASUREMENTS, PRINCIPLES OF much much more precise, stable, and easily
realizable standard was available using the wave-
Measurement is the process of quantifying our length of carefully selected spectrum lines.
experience of the world around us. It can be as Despite Michelson's early suggestions to that
simple as an elementary event-counting process effect it was not until 1960 that an international
(the number of automobiles passing a certain standard of length was defined in terms of the
point per day or the number of ~-particles wavelength of a certain line in the spectrum of
entering a certain radiation detector in a certain krypton-86. Recently, however, it has become
time interval) but it is usually a more complex clear that even the precision available from the
process involving comparison with some refer- krypton-86 line has been surpassed by precision
MEASUREMENTS, PRINCIPLES OF 722

in two other areas, measurements of the velocity which is defined in terms of the force between
of light and the standard of time. As a conse- adjacent current-carrying conductors. In prac-
quence it is now possible to define a unit of tice it is too difficult to implement this defini-
length in terms of the unit of time and the mea- tion with sufficient precision, and the practical
surement of the velocity of electromagnetic standards are those of potential difference and
waves. As accepted by the General Confer- resistance. The unit of potential difference (the
ence of Weights and Measures in 1983, the unit volt) is defined in terms of the Josephson effect,
of length is the distance traveled in a time a phenomenon occurring in superconducting
interval of 1/299,792,458 of a second by plane junctions, which provides an extremely sensi-
electromagnetic waves in a vacuum. Such a unit tive, precise, and stable connection between
will be realizable in any laboratory with precise potential difference and frequency. The prac-
time standards, and will be translatable into tical realization of the volt is a bank of care-
actual distance measurements using standard fully preserved electrolytic cells which can be
techniques of interferometry. moved for international comparison and which
(c) Time. For hundreds of years the most are checked periodically for drift using a
precise measurements of time were made by the Josephson source. The standard of resistance
astronomers, whose observations served to de- (the ohm) is realized using a bank of 1 ohm
fine the basic unit of time, the second, in terms resistors which can be compared internally and
of the axial rotation of the earth. When ter- internationally.
restrial clock systems achieved the precision Measurement Uncertainty As was mentioned
required to show that the earth's rotation rate earlier the primitive act of comparison between
is not constant it became clear that replacement an object and a reference quantity leads to a
of the defined standard was necessary. Observa- value which is known only within a certain
tions on the radiation frequency in atomic tran- interval. No matter how sophisticated the mea-
sitions can be made with very high precision and suring process we cannot evade this fundamen-
the present definition of the second (adopted tal limitation; we can be confident about mea-
in 1967) is the duration of 9,192,631,770 peri- surements only within a certain interval. The
ods of the radiation associated with a certain way in which we handle these uncertainties
transition in the cesium-l 13 atom. Cesium beam mathematically depends on the way in which
clocks now provide a realizable standard of our confidence varies along the scale.
time, not only for time-keeping, but also for (a) Estimated Uncertainty. If we are making
practical purposes such as radio navigation our measurements by a personal, visual method,
systems and for experimental work like long the outcome of the measuring process should
baseline interferometry in radioastronomy. be an interval, outside which we are certain the
(d) Temperature. The definition of tempera- value does not lie. If, by careful examination of
ture is based on the triple point of water, which a scale, we feel confident that the value we seek
was defined in 1954 to have a temperature of does not lie below 24.6 and does not lie above
273.l6 K. The unit of temperature, the kelvin, 24.8, we can state that we are confident that
is thus defined (since 1967) is as 1/273.16 of the our desired value lies inside the interval 24.6-
temperature of the triple point of water. The 24.8, although we can say nothing about its
realization of a temperature scale" particularly location within the interval. The interval is
over wide temperature ranges, is a much more usually renamed 24.7 ± 0.1, and we call the
difficult matter. In practice the standards labo- quantity ± 0.1 the uncertainty of the reading.
ratories maintain a working scale called the Frequently it is instructive to compare this
International Practical Temperature Scale, 1968 uncertainty with the reading itself, and we
(abbreviated IPTS-68). In this, thirteen fixed call the ratio 0.1/24.7, usually expressed as a
points have temperatures assigned to them and percentage, the precision of the measurement.
specified methods of measurement are used to This question of the uncertainty in reading a
provide intermediate temperatures for calibra- scale is only one contribution to our lack of
tion purposes. Above the temperature of freez- exact knowledge of the value. There may be
ing gold (1337 K) optical pyrometry is used; other effects, like calibration errors in the
between 903 K and 1337 K the standard mea- instrument, which affect all the readings in a
suring device is a thermocouple of platinum and similar way and constitute systematic errors in
an alloy of platinum with 10% rhodium; between addition to reading uncertainty. To make satis-
14 K and 903 K it is a platinum resistance ther- factory measurements we must always be aware
mometer. Standard temperatures are also avail- of the presence of the reading uncertainty and
able in the liquid helium temperature range also alert to the possible presence of systematic
between 0.5 K and 4.2 K using the vapor errors. These last must be identified and cor-
pressure of helium. rected if possible but, at the end of the whole
(e) Electrical Quantities. Once again we have process, it is important to express the reading in
a distinction between the fundamental defini- such a way that the quoted value provides a
tion of a standard quantity and its practical real- realistic appraisal of the complete range of
ization. In principle the fundamental electrical uncertainty.
quantity is the unit of current (the ampere), It is rarely sufficient to make such a measure-
723 MEASUREMENTS, PRINCIPLES OF

ment of a single quantity and we are more chance of falling within x ± 2S m . The mean
commonly faced with the problem of calculat- and the standard deviation of the mean thus
ing some final quantity z as a function of a provide us with measures of probability which
number of measured quantities x, y, etc., each take us as far as randomly fluctuating phenom-
of which has its own uncertainty ~x, ~y, etc. ena allow.
If The statistically determined interval serves as
the measure of uncertainty for processes gov-
z =f (x, y, . . .), erned by random fluctuation so that, irrespective
the value of ~z will be calculated from of the type of measurement, the outcome of
the measuring process is an interval which has
~z = (af/ax)~x + (af/ay)~y + .. . . a certain probability of containing the value we
seek. The measurements are now in suitable
If we were confident that the values of x and condition for the next step in a complete mea-
y lay within the measured intervals Xo ± ~x, suring process.
Yo ± ~y, etc., then we can be equally confident Systems and Models A measuring process is
that the value of z lies within the calculated almost never a primitive process of simply com-
interval Zo ± ~z, where Zo = f(xo, Yo, .. . ). paring an object with a scale (unless we are
This general method of calculating uncertainties satisfied simply to measure something with a
will be found to be useful for a wide range of ruler or read a temperature on a mercury-in-
functions . glass thermometer); the situation is almost
Statistical Uncertainty Circumstances fre- invariably more complicated. Even if we are
quently do not permit the subjective estimation doing something as simple as comparing the
of an uncertainty interval as considered in the weight of an object with that of a standard
preceding section. Many measurement processes mass using a beam balance we have to make
give results which are influenced by random the assumption that the balance beam arms
fluctuations and we must resort to statistical are of equal length. Our measuring process
treatment of the observations. Instead of identi- almost invariably involves some complete sys-
fying an interval within which we are confident tem and the result of our measuring process is
our quantity lies we must be content with state- dependent on the properties of the system.
ments about probabilities. To make this possible Since we can never know these exactly we are
we must build up experience of a fluctuating dependent on the set of assumptions we make
phenomenon by repeatedly making the measure- about the system (like the equality of the
ment. We shall thereby obtain a sample of read- balance beam arms), and this set of assump-
ings whose characteristics will give us as much tions constitutes a model of the system. Our
information as is available from the system. process of measurement almost invariably
This sample will have a certain frequency dis- requires us, therefore, to consider the model
tribution along the scale of values and the of the system, and it is an integral part of all
uncertainty of the measuring process is related satisfactory measuring processes to check, not
to the breadth of the distribution. One suitable that the model is "correct" or "true," because
measure of the breadth is the standard deviation all models are in principle oversimplified, but
of the sample, defined to be that the correspondence between the model
and the system is good enough, at least at the
S= V[ECx - IN]. Xi)2 level of precision under consideration.
Sometimes this process of testing the model
where x is the mean of the sample, Xi are the of a measuring system will have been done for
individual readings, and N is the number of us if we buy an expensive piece of apparatus.
readings in the sample. By making suitable as- If we buy a good quality slide-wire potentiom-
sumptions about the basic distribution from eter from a reputable manufacturer it may be
which the sample was taken (often assumed to satisfactory to assume that the set of assump-
be Gaussian), we can now make numerical tions which constitute the model of the system
assertions about the sample. It turns out that (such as linearity of the slide wire resistance)
anyone reading has a 68% probability of falling has been adequately tested. But if we are mak-
ing up our own measurement system, it is vital
within the interval x ± S and a 95% probability to include in the process adequate provision
of falling within x ± 2S. Rather than make for testing the model on which our work with
assertions about single readings it is more useful
to be able to make statements about probabili- the system is predicated.
ties for the sample mean. We calculate the It is common to do the model testing graph-
standard deviation of the mean ically. For example a measurement of the
resistance of an electrical component is incom-
plete without a study of the complete variation
of the current through and the potential dif-
and we are then able to assert that the value we ference across the component. Only if the
seek in the measuring process has a 68% chance plotted values of V and i turn out to be com-
of falling within the interval x iSm and a 95% patible with a straight line can we assert that
MEASUREMENTS, PRINCIPLES OF 724

the resistance of the component is constant, using strain gauges. Alternatively, a piezoelec-
with a value specified by the slope of the line. tric crystal may be used to obtain an electrical
Usually the model of the system is more output directly.
complicated than V = iR and more complicated (iii) Temperature sensors use a variety of
graphical methods of checking the model phenomena. Gas thermometers use the change
against the system are needed. In compensation, in pressure with temperature of a gas, often
however, the value we seek as the objective of helium. Thermocouples produce an electrical
the experiment will usually be obtained from emf directly but require careful calibration.
the graph, so that drawing the graph serves the From low temperatures up to 500 or 600 K
dual purpose of checking the compatibility the most commonly used junction materials
of the system and the model and of providing are copper and constantan (an alloy of copper
a computational procedure for obtaining the and nickel). At higher temperatures tungsten
answer. Even when the measurements in an and tungsten alloys are used. Resistance ther-
experiment are processed completely analyti- mometers use the change in electrical resistance
cally (using a least squares procedure, for ex- with temperature of metals or semiconductors,
ample, to fit a function to the observations), the choice of material depending on the tem-
it is important not to forget the basically graph- perature range. Pure platinum gives high preci-
ical nature of the process. sion over a wide temperature range; germanium
Measurement Systems The demands of mod- or ordinary carbon resistors are widely used at
ern experimenting and the availability of auto- low temperatures; and thermistors, in which
matic data processing methods have recently a sintered powder, generally of metallic oxides,
revolutionized measuring methods. It is now provides rapid variation of resistance with
common to think 'of a complete measuring temperature, can be made to suit a wide range
system containing: (a) a conversion stage which of temperatures. The change in resistance which
may be desirable to convert the basic quantity provides a measure of the temperature is com-
under investigation into some other form more monly measured using some kind of bridge
amenable to measurement; (b) a sensor, detec- circuit. Temperatures too high for normal sen-
tor, or transducer stage to provide for conver- sors are measured by pyrometers in which an
sion into an electrical signal; (c) a signal pro- absolute measure of the intensity of radiation
cessing system to perform on the signal any at some fixed wavelength is interpreted using
necessary mathematical computations; (d) an Planck's radiation equation to give values of
output stage for display, storage, or telemetry temperature. Liquid-in-glass thermometers, al-
of the information. Let us consider each of though commonly used as an indicator of
these stages in turn. temperature, only rarely qualify for precise
(a) Conversion Stage. This is not always pres- measurement and cannot be used as a transducer
ent but is frequently necessary if it is impossible to supply an electrical output.
or inconvenient to work directly with the (iv) Optical sensors may be of several different
phenomenon under investigation. For example types. The vacuum photodiode contains a sur-
a gas thermometer bulb converts temperature face, generally of some cesium compound, which
into pressure, a moving coil ammeter converts reacts to illumination by emitting electrons
currents into angles, a slide wire potentiometer which are collected by an anode. The photo-
converts values of potential difference into multiplier tube uses successive stages of second-
lengths, a prism, diffraction grating, or crystal ary electron emission in an avalanche process to
converts the wavelength of electromagnetic provide amplification of the current. Solid state
radiation into angles, a digital voltmeter uses a devices may be of the photoconductive type, in
ramp method to convert a potential difference which the resistance of a semiconducting ma-
into a time measurement, a mass spectrometer terial such as selenium or cadmium sulfide
converts atomic masses into magnetic field changes with illumination, or the photovoltaic
values, etc. type in which a semiconducting junction pro-
(b) Sensor, Detector, or Transducer Stage. duces its own emf in response to the light. The
Almost invariably we wish to process our signal familiar "solar cell" is of this variety and usually
by electrical methods and so some process for contains a junction between silicon and a metal.
converting the basic phenomenon into an elec- Infrared detectors may be of the photoconduc-
trical signal is required. Sensors are found in tive or photovoltaic type. They normally use
enormous variety depending on the type of semiconducting materials or juncticns involving
physical phenomenon involved. compounds like indium antimonide or alloys
(i) Strain is measured using strain gauges, a containing such materials as lead, tin, and tel-
small length of metallic or semiconducting lurium. For wavelengths which do not excite
material glued to the component under stress. photoelectrons or for cases in which absolute
Changes in length of the material are detected measurements of intensity are required, a
from the consequent alteration in electrical bolometer can be used. This device is con-
resistance. structed to absorb all the incident radiation,
(ii) Force or pressure transducers may rely regardless of wavelength, and convert it into
on some elastic component (cantilever beam for a temperature increase which can be mea-
force, membrane for pressure) to convert the sured using a thermocouple or resistance
force into a displacement which is then detected thermometer.
72S MEASUREMENTS, PRINCIPLES OF

(v) Acoustic transducers take various forms of the use of a scintillation counter lies in the
depending on the frequency of the radiation. fact that the magnitude of the current pulse is
For ultrasonic applications at frequencies of closely proportional to the energy of the x-ray
tens of kilohertz up to megahertz a piezoelec- or '}'-ray photon. These crystals thereby permit
tric transducer, commonly of quartz, is used to the analysis of such a beam into a spectrum of
convert the pressure fluctuation in the sound its energy components. X-ray and '}'-ray beams
wave to an electrical signal. At the lower fre- can also be detected directly and analyzed using
quencies of the auditory region microphones semiconducting crystals. Lithium-drifted ger-
can use the piezoelectric effect in materials manium and silicon crystals, although requiring
like lead titanate ceramic or the electrostatic cooling to liquid air temperature, provide cur-
properties of electrets, materials possessing rent pulses from individual x-ray or '}'-ray
permanent electric polarization. photons which are accurately proportional to
(vi) Transducers for magnetic field measure- the photon energies.
ments frequently use the Hall effect, in which Signal Processing Systems Signal processing
a current-carrying conductor exhibits a trans- procedures have been available since the intro-
verse potential difference when placed in a duction of electronic circuitry but were initially
magnetic field. Hall effect probes make a sturdy restricted to such elementary operations as
component for many common measurements amplification or heterodyning (the generation
of magnetic field but for other applications of a beat frequency to expedite the tuning
higher sensitivity may be required. For example, and amplification of radio-frequency signals).
airborne surveys of the earth's magnetic field Rather more sophisticated processing became
are commonly carried out using a flux-gate available with the development of analog com-
magnetometer, which detects the out-of-balance puters but was still limited to relatively simple
signal produced by an external field in a care- arithmetic operations or integration and dif-
fully balanced magnetic circuit. Still higher ferentiation. The real revolution came with the
sensitivity is available from the proton preces- development of digital data processing, and the
sion magnetometer. This relies on the relation- availability of fast and powerful computers now
ship between the frequency of proton precession makes it possible, in "real time," to perform
(often in a water sample) and the magnitUde of almost any desired mathematical operation on
the surrounding magnetic field, and can supply the observations as they are made. If, as is fre-
a sensitivity as low as 10- 5 of the external field. quently the case, the output of the sensor stage
Still higher sensitivity is available from the is in analog form, it is necessary to pass the
magnetic dependence of optical transitions in signal through some form of analog-digital (a-d)
rubidium and cesium atoms in the form of the converter. Examples of on-line signal processing
optically pumped rubidium or cesium magne- procedures are given below.
tometer. For the very lowest values of magnetic (a) Basic Arithmetic Operations. Division, for
field the Josephson effect in superconductors example, can be used to calculate resistance di-
has extended the range of magnetic field mea- rectly as VIi or velocities from values of distance
surements by many orders of magnitude. Such and time.
devices require cooling to liquid helium tem- (b) Statistical Operations. These include cal-
peratures but make possible the measurement culation of such distribution parameters as
of magnetic field changes of the order of 10- 12 mean, standard deviation, and correlation coef-
tesla. ficient. Comparison with a prescribed function,
(vii) The various forms of ionizing radiation linear or otherwise, can be carried out by least
encountered in nuclear physics are detected by squares methods, or, in the absence of a speci-
a large variety of techniques, of which some of fied function, a generalized polynomial fit can
the more common are described below. be obtained. In all cases progressive improve-
Gas-filled tubes containing electrodes can be ment of the accuracy of the calculations will
used under various conditions of pressure and result from continued revision as new observa-
potential difference to detect the ionization tions become available.
produced by fast particles or high-energy radia- (c) Time A veraging. This is a very powerful
tion photons. An ionization chamber can pro- method to improve signal-to-noise ratios. If
vide steady monitoring of particle flux densities some phenomenon such as a resonance peak or
and a similar gas-filled tube operated at higher a spectrum line is hidden in noise, we can carry
potential constitutes a Geiger counter which out a process of repeated scans over the range
provides a pulse of current for a single particle of variable containing the signal. If we have
entering the counter. Scintillation counters some way of storing the information and aver-
detect individual photons of '}'-rays or -x-rays aging the results of the repeated scans, we shall
from the flash of visible light which is produced find that the random noise will give, ultimately,
by passage through certain solid crystals or an average of zero. The desired signal, on the
liquids. The solid materials are generally single other hand, will add positively on every scan
crystals of organic materials such as anthracene and eventually appear free of the noise which
or inorganic crystals of sodium iodide (doped formerly masked it.
with thallium). The light pulses emitted by the (d) Fourier Analysis. It is very frequently
scintillation material are normally detected desirable to analyze a time-varying phenomenon
using photomultipliers. An enormous advantage into a frequency spectrum by the method of
MEASUREMENTS, PRINCIPLES OF 726

Fourier analysis. Such computation has recently versatile instrument for the display and study
been greatly facilitated by the development of of oscillations or time-varying phenomena over
the fast Fourier transform methods and is now a wide range of frequencies and times.
commonly used in a wide range of applications. Digital data processing makes possible the
Typical is the Fourier transform spectrometer, presentation of output information using digital
in which optical spectra are processed in an in- numerical displays, based often on neon tubes
terferometer. The resulting variation of fringe or light-emitting diodes. These offer the con-
intensity with order of interference is analyzed venience of direct access to output information
to yield the frequency components of which without need for further interpretation, e.g., a
the original beam was composed. range scale on a laser range finder. Similar con-
(e) Optical Image Enhancement. A two-di- venience is available from the two- or three-
mensional picture can be converted into digital dimensional cathode-ray tube displays in which
data by a scanning process, thereby becoming the output of computer processing of observa-
susceptible to computer processing. Photographs tions can be viewed directly in pictorial form.
showing such defects as lack of contrast, out- For cases in which later use of the results is
of-focus, or smearing from camera movement intended, digital storage methods using magnetic
can be analyzed to determine the precise nature tape and disc or punched paper tape permit vast
of the defect and subsequently corrected to quantities of information to be stored and easily
construct an improved image. Many of the most retrieved. If onward transmission of the infor-
spectacular results of space exploration among mation is required, the use of digital techniques
the planets would not have been visible without makes possible the rapid and accurate transfer
various processes of image enhancement to of observations over interplanetary distances.
emphasize detail in particular ways, and similar
improvement is available for microscopic images D. C. BAIRD
of biological material. Computer processing can
also be used to create "false-color" images of
normally invisible phenomena such as infrared
emission, accoustic signals, and others. These
are now familiar in many applications, such as References
the thermographs which identify poorly insu- Klein, H. A., "The World of Measurements," New
lated houses or human breast cancer and the York, Simon and Schuster, 1974.
satellite-based photography which is used to Rossini, F. D., "Fundamental Measures and Constants
study earth resources such as crops. for Science and Technology," Cleveland, CRC Press,
(f) Pulse-Height Analysis. This is a process in 1974.
which separate storage is provided for pulses Baird, D. C., "Experimentation," Englewood Cliffs,
of differing height. The resulting display can N.J., Prentice-Hall, 1962.
then give a direct picture of, for example, a "The International System of Units (SI)," Washington,
spectrum of 'Y-ray energies from a scintillation D.C., National Bureau of Standards, and London,
counter or x-ray energies from aLi-drifted Her Majesty's Stationary Office.
silicon counter. This very powerful tech- Janossy, L., "Theory and Practice of the Evaluation
nique can be used to record any phenomenon of Measurements," Oxford, Oxford Univ. Press, 1965.
in which the quantity in which we are interested Plumb, H. H. (Ed.), "Temperature: Its Measurement
can be converted into pulses with height depen- and Control in Science and Industry," Instrument
dent on the original variable. For example, Society of America, 1972.
lifetime studies on excited atoms will use a Quinn, T. J., "Temperature," Academic Press, New
time-to-pulse height converter to provide a York, 1983.
direct picture of the decaying radiation. Levi, L., "Applied Optics," New York, Wiley, 1980.
(g) Others. Other examples of on-line signal Keyes, R. J. (Ed.), "Optical and Infrared Detectors,"
processing too complex to be described here Springer-Verlag, New York, 1981.
but of too great importance to omit include Peterson, A. P. G., "Handbook of Noise Measure-
pattern recognition and the various procedures ment," Concord, Mass., GenRad Inc., 1980.
for clinical examination by tomography. In Zijlstra, H., "Experimental Methods in Magnetism,"
these a signal, derived usually from x-ray ab- New York, Wiley, 1967.
sorption but occasionally from other phenom- Knoll, G. F., "Radiation Detection and Measurement,"
ena, is analyzed to provide a picture of a cross- New York, Wiley, 1979.
section of the human body as an aid to diagnosis. Oppenheim, A. V., "Applications of Digital Signal
Display and Storage Systems The tradItional Processing," Englewood Cliffs, N.J., Prentice-Hall,
methods of needle and scale are now less fre- Inc., 1978.
qut!ntly encountered, except as indicators. For
time-varying phenomena the strip-chart recorder Cross-references: ASTRONOMY; COSMIC RAYS;
remains useful at low rates of change and the ELECTRICAL MEASUREMENTS; MAGNETOME-
cathode ray oscilloscope is abSOlutely irreplace- TRY; NOISE, ACOUSTICAL; NUCLEAR INSTRU-
a'ble for rapidly changing phenomena. The addi- MENTS; OPTICAL INSTRUMENTS; PHOTOGRA-
tion of digital processing and memory to the PHY; PHOTOMETRY; TELEMETRY; THEORETI-
CRO has made it a uniquely powerful and CAL PHYSICS.
727 MECHANICAL PROPERTIES OF SOLIDS

MECHANICAL PROPERTIES OF SOLIDS described as a homogeneous stress if, for an


element of fixed shape and orientation, the
When a material is in the solid phase, its con- value is independent of the position of the ele-
stituent particles, which may be atoms, ions, or ment in the body. Usually the term stress is
chemical molecules, vibrate about fixed equilib- taken to mean stress at a point, and is the
rium positions in which the interparticle forc~ is limiting value of force divided by area over
zero. In most solids composed of small constItu- which the force acts as the area tends to zero.
ent particles, e.g., metals and ionic soli~s, these If a force of acts over a surface of area oA and
interparticle interactions produce 3? I!1ternal makes an angle rp with the normal to the sur-
atomic or molecular arrangement whIch IS regu- face, the normal stress a is
lar and periodic in three dimensi?ns over i~ter­
vals which are large compared WIth the umt of
periodicity. Such solids are called crystals. a-
_ bm
. (OF oA
cos rp )
Solids composed of larger units, e.g., polymers, 5A~O

can be crystalline, though the crystallinity is and the tangential or shearing stress T is
usually rather imperfect, or they can be
amorphous.
When a solid is deformed by external forces,
_ hm
T -
. (OF ~Asin rp)
the constituent particles have their separations 5A~O u
changed from the eqUilibrium values. The re- The stress is, of course, transmitted through the
sultant of the interparticle forces acting on a solid.
particular particle is then no longer zero, but The change in the separation of the constitu-
acts to restore the particle to its original posi- ent particles of the solid produced b.y the ap-
tion relative to its neighbors. When the solid is plied forces is seen on the macroscopIC scale as
in equilibrium under the action of external a change in the size and shape of the body.
forces, the interparticle (or internal) forces Since the deformation of different bodies of a
must be in equilibrium to give continuity of the given material subjected to a particular load is
material and must also be equal to the external a function of the size of body, comparisons are
forces, i.e., any element of the body must be in made using the relative deformation, or strain,
equilibrium. These internal forces are m~in­ defined as
tained as long as the external forces are appbed.
When the external forces are removed the inter- change in dimension
nal forces restore to the constituent particles strain = " I d'
ongma ImenSlOn
. .
their original separations.
If after unloading, the body returns exactly A strain is homogeneous if, after deformation,
to its former size and shape its behavior is called lines of the body that were originally straight
perfectly elastic. If it retains completely its remain straight and lines that were originally
altered size and shape it is a perfectly plastic parallel remain parallel.
body. In general, the behavior of real bodies lies The following strains are found to be con-
between these two extremes. venient in describing the behavior of a body in
Stress and Strain Two types of forces may various states of stress.
act on any element of a body: (a) surface forces, When a rod of unstretched length £0 has its
exerted by the surrounding material, which are length increased to £ by the application of ex-
proportional to the surface area of the element ternal forces, the conventional, engineering, or
and (b) body forces, which are proportional to nominal tensile strain € is defined as
the volume of the element, e.g., gravitational
forces. The effects of body forces are usually £ - £0
negligible compared with those resulting from €=~.
surface forces.
For a body to be deformed and not merely Sometimes it is more convenient to use the
accelerated when forces are applied to it the true, natural, or logarithmic strain €*, defined
body must be in statical equilibrium u!1~er the as
action of the applied forces. The condItIOns of
equilibrium are (a) there must be no unbalanced
applied forces and (b) there must b.e no unbal-
anced applied couples. Further, the mternal and
external force equilibrium can be equated. Clearly
The effect produced in a given material by €* = loge (l + €)
forces of given magnitudes depends on the size
of the body to which they are applied, and If as the result of the application of a uni-
hence to enable a comparison to be made of for~ hydrostatic force, the volume of a solid
the r~action to external loading of bodies of changes from Vo to V, the bulk strain () is
different size, the concept of stress is defined as
introduced.
The stress in an element is defined as force
divided by area over which the force acts. It is
() = ----v;;-
V- Vo
.
MECHANICAL PROPERTIES OF SOLIDS 728

When a solid is sheared by the application of havior the change in cross-sectional area is very
couples, the angle of shear is taken as a measure small, so the difference in the stress calculated
of the strain, in this instance a shear strain. using the original area of cross-section and using
Elastic Behavior For very small strains that when the load is applied is negligible.
« -0.1 per cent) the behavior of many solid Elastic shear deformation, in constrast, takes
materials is almost perfectly elastic. In this place at constant volume. The elastic moduli
strain range a Ilpecimen will exhibit a linear re- are not independent and it can be shown that,
lationship between the magnitude of the ap- for an isotropic solid,
plied forces and the deformation produced.
This relationship is known as the Hooke law E=3K(1- 2v)
and in terms of stress and strain it may be
stated in the form and
E
stress = constant X strain. G = 2(1 + v)'
The constant in this equation is called a modu-
lus of elasticity. Each strain has a correspond- Plastic Behavior When a solid is deformed
ing modulus of elasticity. These moduli are under an increasing stress, a stage is reached
temperature-dependent, and in general, depend when the further deformation produced by a
on the direction of measurement, but if elasti- slight increase in stress, though still elastic, does
cally isotropic solids are considered, the value not obey the Hooke law. The stress at which
of a particular modulus is independent of the the departure from linearity of the stress-strain
direction in which it is measured. A solid is ef- curve first occurs is called the proportional limit
fectively isotropic if it is composed of grains or elastic limit. If the stress is increased beyond
whose size is small compared with the smallest the elastic limit, a value is reached at which
dimension of the solid and if the orientations permanent deformation occurs, i.e., the speci-
of the grains are randomly distributed. men does not recover completely its original
Consider a bar of uniform area of cross- size and shape on unloading. The stress at which
section A and unstretched length £0 acted upon permanent deformation is first detected has,
by forces F applied uniformly at the ends. If £ for very many materials, a value characteristic
is the length when this load is applied, the stress of the material at that temperature and is called
a is given by a = F/A and the strain € is the yield stress. The corresponding point on the
€ = (£ - £0 )/£0. (Tensile stresses are counted stress-strain curve is the yield point. For many
positive.) When the Hooke law is obeyed, F/A = materials the elastic limit and yield stress have
E(£ - £0 )/£0 where E is a constant for a given almost the same value and are not readily dis-
material at a given temperature and is known as tinguished. The deformation not recovered on
the Young modulus of the material. unloading is called the permanent set and the
When a solid has a hydrostatic stress a ap- specimen is said to have suffered plastic
plied to it, the volume changes from Vo to V so deformation.
that if the Hooke law is obeyed Plastic Deformation of Simple Crystalline
K(V- Vo ) Solids, e.g., Metals and Ionic Solids . The sim-
a= plest mechanical test that can be performed on
a solid is the tension test, and measurements
made during such tests are often used to charac-
where K is a constant at a given temperature terize particular materials.
and is known as the bulk modulus of the Many simple crystalline solids are ductile at
material. temperatures greater than about 0.3 to 0.4 of
When a solid is deformed by couples produc- the melting temperature in kelvins. The plastic
ing a shear stress 'T, the angle of shear 'Y is taken deformation of such materials takes place at ap-
as a measure of the strain so that, if the Hooke proximately constant volume, and hence, for a
law is obeyed, specimen tested in tension, the cross-sectional
area decreases as extension proceeds. This
'T = G'Y change in cross-sectional area with strain neces-
where G is a constant at a given temperature sitates a more careful definition of stress. Two
and is known as the rigidity modulus for the definitions are in common use, namely, conven-
material. tional stress ac, sometimes called the nominal
The axial deformation of a prismatic bar with or engineering stress, defined by
unloaded prismatic surfaces is accompanied by load
a change in the cross-sectional area. Experiment
shows that the rado ac = original area of cross section

lateral strain/axial strain and true Stlt:SS at, defined by


is a constant known as the Poisson ratio v. For load
the small strains encountered in pure elastic be- at = area of cross section under that load
729 MECHANICAL PROPERTIES OF SOLIDS

,,
a ~
~ at
~
~
~
/
~
~

~
, ~

,
~
D
~

,
~

.... .... " E

o L -_ _ _ _ _ _ _ _ _ _ _ _ _ _ ~ ______ ~ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ __

C F
FIG. 1

When a tensile test is carried out on a fine- observed), but before B is reached the curve
grained sample of a ductile material the conven- bends towards the strain axis and then con-
tional stress ac vs engineering strain € graph has tinues to rise as if unloading had not taken
the form shown by the solid line in Fig. 1. The place. ac continues to rise until D is reached
actual shape of the curve depends on many and then starts to fall. The load corresponding
variables, e.g., purity of the material, tempera- to D on the ac vs € curve is the maximum load
ture of testing, and rate of straining. that the specimen can withstand in tension, and
Over the region OA the graph is a straight line the value of ac corresponding to this load is
passing through the origin, the behavior is per- called the ultimate stress or ultimate tensile
fectly elastic, and the Hooke law is obeyed. strength au.
When the stress exceeds that at A macroscopic For deformations represented by OD on the
permanent deformation occurs and the curve ac vs € graph the extension is homogeneous,
bends towards the strain axis; the stress at A is i.e., on the macroscopic scale the deformation
the yield stress ay and A is the yield point. is the same for all cross sections. At D, however,
However, the stress needed to produce further a neck forms in the specimen, all subsequent
plastic deformation increases with strain and plastic deformation is restricted to this neck,
the material is said to work-harden or strain- and the load needed to produce further exten-
harden. If the specimen is unloaded when B is sion falls. The neck gets progressively narrower
reached, the unloading path is BC, which has until fracture occurs at a strain corresponding
almost the same slope as OA; the elastic proper- to E.
ties of the material are little affected by plastic When at is plotted instead of ac, the curve has
deformation . The elastic strain FC is recovered, the form shown by the dotted line in Fig. 1. If,
but the material retains the plastic strain OC, when the neck forms, at is measured in the
which is the permanent set. The plastic strain neck, at continues to increase with strain up to
becomes an increasing fraction of the total fracture.
strain as the latter increases. When the test is Mild steel and some other materials show a
continued the stress rises along CB (ignoring a different behavior. The elastic range is termi-
small amount of hysteresis which is sometimes nated when the stress reaches a value known as
MECHANICAL PROPERTIES OF SOLIDS 730

TABLE I

E K G ay au
Material (Nm- 2 ) (Nm- 2 ) (Nm- 2 ) 1) (Nm- 2 ) (Nm- 2 )

Al 71 X 10 9 75 X 10 9 26 X 10 9 0.33 26 X 106 60 X 10 6
Cu 130 X 10 9 138 X 109 46 X 10 9 0.34 40 X 10 6 160 X 10 6
Steel 210 X 10 9 168 X 10 9 83 X 10 9 0.28 0.4 X 10 9 460 X 10 6
(aUYS)
0.3 X 10 9
(aLYS)

the upper yield stress a UYS . There is an abrupt behavioro having a Young's modulus of about
partial unloading and macroscopic plastic de- 10 9 -10 1 N m- 2 and breaking at a strain of
formation occurs locally in regions called Liiders about 5%. When tested at temperatures close
bands. These bands spread along the specimen to the melting point th'e deformation of both
and the value of ac oscillates about a relatively types of material is dominated by the sliding
constant value known as the lower yield stress of polymer chains over each other, large irre-
a L YS. When the Liiders bands cover the whole versible strains are produced, and the behavior
speCImen, further deformation is macroscopi- is termed viscofluid.
cally homogeneous and ac rises as the material Between these extremes of behavior is an
work-hardens. intermediate temperature range, the glass tran-
The stress-strain curves usually plotted use the sition range, in which the mechanical behavior
engineering strain, but it should be noted that if is strongly time dependent and is called visco-
true stress is plotted against true strain the re- elastic. One manifestation of the glass transition
sulting curve is the same for both compression is a fairly abrupt change in volume expansivity,
and tension tests. and this can be used to define a glass transition
Some typical values of elastic moduli and temperature Tg. At temperatures above Tg the
yield stresses are given in Table 1. These values polymer chains have a certain freedom of move-
refer to measurements on fine-grained wires at ment relative to each other, whereas at tem-
room temperature. peratures well below Tg there is a complete
The Deformation of Solid Polymers Polymer locking of the polymer chains and their in-
molecules consist of very long chains of atoms, dividual segments.
often containing short side groups at regular At temperatures up to Tg amorphous poly-
intervals. In many of the common polymers the mers deform elastically under tension until the
linking between neighboring chains is weak. so-called yield stress ay is reached, when the
This type of polymer is rigid at low tempera- stress drops to a lower value ad (the draw stress)
tures and soft and rubbery at high tempera- and a neck appears in the specimen (see Fig. 2).
tures, the transition being reversible. Such long- With further deformation this neck propagates
chain polymers, whose properties are strongly along the specimen and the stress only rises
temperature-dependent, are called thermo- again when the complete gauge length has been
plastics. drawn down to a neck. In this process the
Polymers in which there are frequent strong polymer chains are oriented in the direction
links between neighboring long chains are said of the applied stress and the material becomes
to be crosslinked. Crosslinked polymers have
properties that are rubberlike.
Crosslinking is also found in the thermoset- a
ting plastics, in which nonlinear structures are
formed. When these materials polymerize, a
process accelerated by raising the temperature, T«~
the monomers group themselves into a rigid
framework that is not softened when the tem-
perature is raised again. These materials show
brittle behavior under an applied stress.
Thermoplastics can be either amorphous (or
glassy), in which state the polymer chains are
randomly oriented and intertwined, or crystal-
line, in which small regions of the structure
exhibit a definite arrangement of the polymer
chains.
At very low temperatures both amorphous
and crystalline polymers show essentially brittle FIG. 2.
731 MECHANICS

TABLE 2

State at E Tg
Material Room Temperature (MNm- 2 ) (K)

Polyethylene partially crystalline 70-280 153


Polyvinylchloride amorphous/slightly 2500-3500 353
crystalline
Polymethylmethacrylate amorphous 2500-4000 380
Nylon 6 crystalline 2000-3000 323
Phenol formaldehyde thermosetting glass 7000
resin

a Honeycombe, R. W. K., "The Plastic Deformation of


Metals," London, Edward Arnold, 1975.
Closs-references: ELASTICITY, POLYMER PHYSICS,
SOLID-ST ATE PHYSICS, VISCOELASTICITY.

MECHANICS
"Give me matter and motion," proclaimed
Rene Descartes, "and I will make the Universe."
And what the renowned 17th-Century philoso-
pher meant by that bold and somewhat cryptic
FIG. 3. remark was that at the very primal heart of
things physical, there is matter in motion.
Understanding the marveolus subtleties, the
stronger. At temperatures above Tg large strains unity of that fundamental reality, is essential
develop from the start of the test and no yield to knowing the Universe on any level. In its
stress drop is observed. In both these regimes broadest sense mechanics is just that: it is the
the strain can be recovered completely by heat- study of the relative movement of objects
ing the material at a temperature above Tg. (actual and impending), or, if you will, the
Crystalline polymers show rather similar study of motion and rest, the latter merely be-
curves (Fig. 3), but brittle behavior is observed ing a special case of the former.
for temperatures up to T~. Above Tg a neck The subject developed historically along sev-
is produced in tensile deformation, but it eral different lines driven by very different
results from recrystallization of the polymer practical concerns. For the most part, the de-
chains in the direction of the stress, since the scriptive aspects of the study evolved more suc-
melting point of the aligned chains is higher cessfully earlier on, followed only later by an
than that of the unaligned chains. Conse- effective explanatory capability. The result was
quently, no recovery of the strain takes place an almost natural partition of mechanics into
when the material is heated. At temperatures several broad subdivisions which are usually
around Tg the propagation of the neck is usually (though not universally) designated as kinemat-
terminated by flaws in the material, but well ics, dynamics, and statics.
above Tg the neck propagates along the entire The description of every sort of motion, with-
length of the specimen and the stress rises when out regard to either the cause thereof or to the
the aligned polymer chains become strained. physical nature of that which is moving, is
Some data for common polymeric materials known as kinematics. Insofar as it deals with
are given in Table 2. the changing locations of objects in space and
time, it can be regarded as the geometry of
M. T. SPRACKLING motion.
By contrast, dynamics (sometimes referred to
References in part as kinetics) is the study relatirig motion
and the changes therein with the corresponding
Benham, P. P., "Elementary Mechanics of Solids," causative interactions. Dynamics seeks to ex-
New York, Pergamon Press, 1965. plain the motion described by kinematics.
Calladine, C. R., "Engineering Plasticity," New York, As a special case of dynamics, statics relates
Pergamon Press, 1969. specifically to the conditions governing con-
Hall, C., "Polymer Materials," London, Macmillan stant relative motion (including "rest"). His-
Press, 1981. torically the discipline evolved simply as the
MECHANICS 732

study of objects at rest and that is still the If, instead of the distance traveled, we con-
quintessence of the business; but insofar as we sider the displacement (s), which is the vector
now know that absolute rest is a fiction, the drawn from some origin to the moving body,
purview of statics is specified more appropri- we can express in a single concept both the
ately as that of objects in unchanging motion. speed and the direction of motion. Accordingly,
Loosely speaking, statics deals with systems velocity (v) is the time rate of change of the dis-
that can be imagined to be motionless (like the placement, v = ds/dt, and the magnitude of the
Brooklyn Bridge). It is the study of the balance velocity is the speed.
of interactions operating on and within any ma- Variations in motion are commonplace; change
terial system which results in its effectively be- is the rule rather than the exception and the
ing at rest. Unlike dynamics, wherein time is measure of this change is called acceleration.
central, the imagery of statics is quite indepen- Aristotle considered the concept, hinting at
dent of time-presumably nothing changes. it in his book Physics, but never quite grasping
Kinematics The primary goal of kinematics it clearly. His follower Strato (ca. 340-270 B.C.)
is to provide a quantified description of motion, seems to have been the first person to appreci-
one in which the necessary concepts are in- ate the real-world importance of the idea. He
herently measurable. The formulation begins suggested that a body was accelerating when it
naturally enough with the familiar ideas of traversed equal increments of distance in shorter
space and time. Even so, it is an illusion to and shorter times. During the 12th-Century re-
think that we can satisfactorily define these vival of science, the alternative formulation was
most basic underlying concepts. Pragmatically set forth that acceleration obtained when a
we must content ourselves with measuring in- body traveled greater and greater distances in
tervals of space and time with meter sticks and successive equal intervals of time. Only later, in
clocks, relying on intuition for conceptual the 14th Century, did the realization begin to
meaning. emerge that variations in the speed itself were
Clearly the faster an object moves, the farther the essence of the concept. Today we define
it will travel in a given amount of time. That's acceleration (a) as the time rate of change of
the crucial insight that leads to a definition of velocity: a == dv/dt.
speed, the measure of "how fast." The oldest Along with the notions of displacement and
surviving thoughts on the subject are those of time, velocity and acceleration complete the
Aristotle (384-322 B.C.) and, like his fellow vernacular of kinematics. Constructing the
Greeks of the era, he specified speed as the equation of motion of the system (the expres-
distance traversed in a given amount of time. sion of displacement as a function of time) be-
And that's just the way it was framed for well comes the primary task of the discipline.
over a thousand years-a thing traveled with a Dynamics The explanations of dynamics go
speed of "so many miles in so many hours." beyond the descriptions of kinematics, requir-
Nowadays we say almost, but not quite, the ing additional concepts which reflect the phys-
same thing defining average speed (vav ) as the ical nature of mover and moved. Perhaps the
interval of distance traveled (t.l) divided by the richest and, at the same time, most elusive of
interval of time (t.t) it took to do the travel- these notions is that of mass.
ing: vav == t.l/t.t. However close the ancients The idea that there had to be another measure
got to the idea, they never actually carried of matter, in addition to the old standbys of
through the division, apparently because of a weight and volume, was first proposed by the
hesitance to divide the "unlike" notions of theologian Aegidius Romanus (ca. 1247-
space and time. 1316). In an effort to resolve some complex
The scholars of the mid-l300s, especially at religio-philosophic questions concerning the
the University of Paris and Merton College, Eucharist, he suggested that the true measure of
Oxford, dealt quite successfully with the idea a substance, the "how much" of a material en-
of constant speed. But when they attempted tity, was its quantity-of-matter. Though in the
to define the speed of an object at any given end it had little or no influence on theology,
moment, the instantaneous speed, they failed. the new insight, however undefined, found a
That was not surprising, for they lacked the welcome place in medieval dynamics. The
mathematical imagery of motion, a calculus of Parisian physicist Jean Buridan (ca. 1300-1385)
change, something Newton would create just utilized the conception of quantity-of-matter
for the purpose centuries later. As the time in his highly influential impetus theory. By the
interval t.t over which Vav is determined is 17th Century, the phrase quantity-of-matter
made smaller and smaller, the ratio of t.l/t.t and the term mass (already long in common
approaches a value known as the instantaneous unscientific usage) had become synonymous-
speed, or what we nowadays just call the speed Newton used them interchangeably.
(v). That limiting process actually defines the Buridan is responsible for conceiving one of
derivative and so in the notation of calculus, the most important dynamical ideas to come
v = dl/dt: speed is the time rate of change of out of the Middle Ages. He began with a ques-
distance, the derivative of distance with respect tion which is essentially equivalent to this:
to time. "Given that both are traveling at the same
733 MECHANICS

speed, which would you rather get hit by, a per unit time (the measure of motion being the
firefly or a fire engine?" Obviously the firefly, quantity-of-motion or the momentum).
but why? What aspect of motion, above and If mass is taken to be constant, F = mdv/dt
beyond just speed, is involved? He suggested and so F = mao This is the hallmark of Newton's
that the essential measure of motion was pro- theory and yet it does not appear anywhere in
portional to both quantity-of-matter and his work. In fact, it actually was introduced
speed, that is, it depended on their product. decades later by the Swiss mathematician
This new metaphor of motion would soon Leonhard Euler. It is a real tribute to Newton's
come to be known as the quantity-of-motion vision that his formulation in terms of momen-
and ultimately it would be reinterpreted and tum is in perfect accord with modern relativity
renamed momentum. Momentum (mv) has theory, whereas F = ma is not. Mass is a func-
proved to be one of the fundamental character- tion of speed and therefore not constant in
istics of the movement of all things. time-but that would not be shown until 1905
Since the time of Aristotle, it had been widely and Einstein.
assumed that the sustained motion of an object Sir Isaac's third axiom, his third law of mo-
required the action of a sustained force. It was tion, completes the logical picture of force
Galileo Galilei (1564-1642) who most con- which is the very pillar of his dynamics. An iso-
vincingly challenged this seemingly reasonable lated body follows the law of inertia. It cannot
view. Still, it should be pointed out that there alter its own motion; that requires some outside
were others before him who had thought, if intervention called force. And when two bodies,
somewhat tentatively, along similar lines. The like billiard balls, interact, it's only reasonable
Tuscan master performed a series of experi- to assume that both will be affected, both mo-
ments which led him to conclude that an object tions will be altered, both will experience a
once set in motion and left alone will continue force. Leonardo de Vinci (1452-1519) had
in motion all by itself forever. That was the law pointed the way long before. "An object offers
of inertia, one of the first grand insights into as much resistance to the air," he wrote, "as the
the long hidden workings of the Universe. On a air does to the object. You may see that the
sizable planet like Earth, gravity and all sorts of beating of an eagle's wings against the air sup-
friction conspire to obscure this all-important ports its heavy body in the highest and rarest
underlying principle-that is why it remained atmosphere .... " Whatever the source of in-
unrevealed for 2000 years. Had we been dwell- spiration, and others had grasped the essence of
ers on a far smaller vessel in space, the natural it, too, Newton provided the final link: "To
tendency for things in motion to continue in every action there is always opposed an equal
motion would have been quite obvious. As it is, reaction: or the mutual actions of two bodies
we are not, and it is not, and it took the genius upon each other are always equal, and directed
of Galileo to see beyond the Scholastic fiction, to contrary parts." The interaction of two en-
the age-old error that the natural tendency of tities always occurs via an equal action-reaction
matter was to rest. pair: force and counterforce. There is no such
When Isaac Newton (1642-1727) came to thing, as a single force; force is a thing of pairs.
codify motion in his masterpiece, the "Mathe- The second and third laws combine to reveal
matical Principles of Natural Philosophy," he one of the guiding principles of modern phys-
lIJegan with a series of definitions. The first was ics, the law of conservation of momentum. If
a rather unsatisfactory attempt at defining two objects interact, the forces acting on each
quantity-of-matter, while the second was a clear will be equal and opposite and so, too, will be
statement of quantity-of-motion framed as the- their resulting changes in momentum. In terms
product of mass and velocity. Newton then set of the system as a whole, these paired opposite
out the three "axioms or laws of motion" which momentum changes cancel each other, leaving
form the basis of dynamics (and statics) even to the net or combined momentum unaltered. The
this day. The first of these was the law of in- total momentum of a system of interacting
ertia: Every body continues in its state of rest, masses must remain unchanged provided that
or of uniform motion in a straight line, except no net external force is applied. As Newton
insofar as it is compelled to change that state put it, "the quantity-of-motion ... suffers no
by forces impressed upon it. Force is the agent change from the action of bodies among them-
of change; it does not sustain motion, it selves." Amusingly, the idea had been specula-
changes it. tively anticipated by Rene Descartes (1596-
Newton's second law is a quantified recasting 1650), who wrote of the Creator: "He con-
of the first law. Modernized somewhat in its serves continually ... an equal quantity-of-
language, it reads: Thc rate of change of the motion" in the Universe.
quantity-of-motion (i.e., the momentum) of a To the kinematical ideas of time, displace-
body is equal to, and occurs in the same direc- ment, velocity, and acceleration had been added
tion as, the net applied force, F = d(mv)/dt. the dynamical concepts of mass, momentum,
From the first law we have that that which and force, all united in the credo of the three
changes motion is force and now, more spe- laws. Brilliant physicists would spend the next
cifically, force equals the change in the motion two hundred years mathematically honing the
MECHANICS 734

fine edge of Newton's force-dynamics. And all and in its place stood its look alike , kinetic
the while another complementary piece to the energy.
scheme, another powerful vision, was slowly Suppose that there is some sort of driving
evolving-the concept of energy. force constantly exerted on a body, like its
The necessity to formally quantify work weight. To move against that pull requires the
came out of the practical needs of the engineers application of a counterforce and the doing of
and scientists of the late 18th Century at the work. The crucial point is that the force-be it
start of the Industrial Revolution. Work == elastic, electric, magnetic, gravitational, what-
force X distance; work equals the force applied ever-continues to act even after the displace-
to an object multiplied by the distance through ment. Once let loose, that force will drive the
which it moves, a quantity easily measured with body back, imparting kinetic energy in the
scales and meter sticks and just as easily bought process. Clearly it is possible to do work and
and paid for. Power, the amount of work done not have it immediately appear as kinetic en-
per unit time, was also a practical measure dic- ergy, and yet the potential for generating that
tated by the demands of the new machine age. energy is there. This retrievable stored energy,
In "The Two New Sciences," Galileo had long energy by virtue of position in relation to a
before shown some grasp of the key idea. He force, is known as potential energy, a name
talked about the physics of pile drivers and given it by William Rankine (1820-1872).
recognized that the weight of the hammer and When we consider the kinetic and potential
the distance through which it fell determined energy of a body as a whole, it is understood
its effectiveness-force and distance related in a that all of its atoms act together in an organized
crucial way. fashion. Alternatively, it is possible to impart
Suppose we do work on a hammer, exert a motion to the individual constituent atoms
force on it and cause it to accelerate, and then which is disorganized, motion .not of the body,
slam it down on a nail bringing it to rest, but within the body. A pendulum swings until
thereby doing work on the nail; we have work- it ultimately comes to rest-organized kinetic
motion-work. And if we recognize that work is energy is transformed into disorganized kinetic
the changer of energy, then the hammer ap- energy or, as it's called nowadays, thermal en-
parently has some sort of energy of motion. ergy. The ubiquitous agent of that transforma-
Interestingly, the underlying insight to all of tion is known as friction.
this actually began to evolve roughly a hundred One of the great revelations of the previous
years before it reached maturity in the 19th century was the law of conservation of energy
Century. Christian Huygens (1629-1695) never opened out to include both thermal and me-
cared much for Descartes's reliance on the idea chanical processes: Energy can neither be
of quantity-of-motion. Momentum to be really created nor destroyed, but only transformed
meaningful must be framed as a directional from one form to another. Whatever energy is
quantity, a vector, like force. A body at rest (and we have no satisfactory conceptual defini-
could explode into two pieces violently flying tion of that underlying quantity, although we
in opposite directions and yet the total mo- know its various manifestations) it is conserved.
mentum would remain zero throughout. That To the force-time-momentum imagery of the
bothered Huygens, who suggested a different 18th Century was added the force-space-energy
measure of the motion, one which would be vision of the 19th Century. And then in the
independent of the direction of the velocity, 20th Century Albert Einstein (1879-1955),
one which would only vanish when all the mo- questioning the very basic understanding of
tion actually ceased. He subsequently decided space and time, profoundly recast kinematics
on the prodUct of the mass and speed squared. and dynamics in his speCial theory of relativity.
Gottfried Leibniz (1646-1716), Newton's bitter Newtonian mechanics turned out to be the low-
rival, picked up the idea, calling mv 2 the vis speed approximation of the new vision whose
viva or living force. The great and meaningless real significance becomes apparent only at
vis viva controversy between the followers of speeds that are appreciable in comparison to
Descartes and Leibniz roared on for decades as the speed of light.
each side claimed the fundamental notion. The theory builds from two basic postulates.
It was not until the beginning of the 1800s The first is known as the principle of relativity,
that Thomas Young (1773-1829) shifted the which states that all the laws of physics are the
imagery and spoke of mv 2 as energy. "Labour same for nonaccelerating observers. The second,
expended in producing any motion," he wrote, the principle of the constancy of the speed of
"is proportional .... to the energy which is light, maintains that light propagates in free
obtained." Then Gaspard de Coriolis (1792- space with a speed (c ~ 300,000,000 m/s) that
1843), using Newtonian mechanics, showed is independent of the motion of the source (and
that the work done on a system was equivalent of the observer). The speed of light in vacuum
to a change in the quantity t mv 2 . Lord Kelvin is absolute.
(1824-1907), years later, dubbed this the Among the many surprises provided by the
kinetic energy. Vis viva was forgotten, drowned new vision was the realization that rest, motion,
out by the roar of the Industrial Revolution, simultaneity, time, length, and mass are not
735 MEDICAL PHYSICS

absolutes as had long been thought. Instead Envision a force F acting on a body which is
these fundamental quantities are relative, they pivoted at a point 0: The lever or moment-arm
depend on the motion of the observer. of the force F with respect to the axis passing
The result, which Einstein himself thought through 0 is the perpendicular distance (r 1)
was "the most important," was the equivalence drawn from 0 to the line of action of F. The
of mass and energy. These two seemingly dif- moment of the force about 0 is then defined as
ferent concepts are actually manifestations of the product of the magnitude F and r1. Nowa-
one single entity: mass-energy, E = mc 2 , the days it is common practice to symbolize this
total energy E of an object equals its mass m quantity by the Greek letter tau, T, and refer to
multiplied by the speed of light squared. it as torque (from the Latin, torquere, to twist).
Statics The central problem that was never The law of the lever is then simply a require-
far from the surface in the early days of the de- ment that the two opposing torques be equal.
velopment of kinematics and dynamics, was the Formulated as a vector, torque becomes T =
motion of the heavens. Aristotle had woven his rX F.
theory of motion into his cosmography to form In the case of a rigid body (viewed as a collec-
a single fabric that would stand for two thou- tion of interacting point masses) Newton's sec-
sand years. By contrast, the motivation behind ond and third laws lead to the second condition
the development of statics was far more mun- of equilibrium: For a rigid body in equilibrium
dane and pragmatic. The ancients, who weighed the sum of the torques about any point (due to
out their goods on balances, who raised stone, all the externally applied forces acting on it)
hauled ships and pitched tents were all prac- must be zero, ~T = O.
titioners of statics, even before the formal body The two conditions of equilibrium provide a
of knowledge evolved. basis for the analysis of the forces at work in all
An object experiencing no change in its mo- sorts of mechanical systems from trusses and
tion, no acceleration, is said to be in equilib- bridges to the muscle-bone structure of the
rium. Hence it follows from Newton's second human body.
law that translational equilibrium (a = 0) ob- Mechanics, the tap root of physics, is the
tains when the sum of the forces acting on a seminal discipline, whether it is general rel-
body is zero, ~ F = O. If the two teams in a tug ativity on the cosmic scale, quantum mechanics
of war pull equally hard in opposite directions, in the micro-domain of the atom, or Newtonian
the net force is zero and the rope remains mo- mechanics in the macro-world of baseballs and
tionless. This is the first of the two conditions ballistic missiles. "Give me matter and motion
of equilibrium. and I will make the Universe."
The simple equal-arm balance was already in
widespread use well over 4000 years ago. It is EUGENE HECHT
not surprising then, that Aristotle, in his
"Mechanica," attempted to analyze the impor- References
tant practical problem represented by balances,
levers, and seesaws (which are just variations on Hecht, E., "Physics in Perspective," Reading, Mass.,
one theme). When equal downward forces are Addison-Wesley Pub. Co., 1980.
applied to each end of the centrally pivoted Cajori, F., "A History of Physics," New York, Dover
rod, the system remains motionless, balanced Pub., Inc., 1962.
horizontally. Any tendency to rotate clockwise Jammer, M., "Concepts of Mass," Cambridge, Mass.,
is canceled by an equal tendency to rotate Harvard Univ. Press, 1961.
counterclockwise, and we say that the system is Toulmin, S., and Goodfield, J., "The Fabric of the
in rotational equilibrium. In fact, the word equi- Heavens," New York, Harper and Brothers, 1961.
librium derives from the Latin aequus, even or
equal, and libra, a scale or balance. Cross·references: DYNAMICS, ELASTICITY, GRAV·
It was the great Archimedes of Syracuse (287- ITATlON, MASS AND INERTIA, MECHANICAL
212 B.C.) who, in his treatise "On Equilibrium," PROPERTIES OF SOLIDS, STATICS.
framed the law of the lever in a satisfactory
way. Unequal forces (of magnitudes FI and
F 2 ) acting perpendicularly on a bar at unequal MEDICAL PHYSICS
distances from the pivot (rl and r2, respec-
tively) balance each other provided that F I rl = That physics has an important place in medicine
F2r2' It is not enough to be concerned only can scarcely be denied. A physician's first move
with the sizes of the forces; the distances from in examining a patient is to measure his tem-
the pivot at which they act are crucial, as well. perature, count his pulse, listen to his heart
Da Vinci, aware that the human body achieved sounds and take his blood pressure. Only much
its mobility via a system of various kinds of later does the physician get around to chemical
levers, set himself to the study of these simple and laboratory tests. Yet every hospital of any
machines. He was among the first to recognize stature has a laboratory or a department of
the significance of the idea of the moment of a clinical chemistry. Laboratories of clinical phys-
force. ics are virtually nonexistent. While physics
MEDICAL PHYSICS 736

plays a large role in medical diagnosis and treat· heart, and intensive-care physiological monitors
ment, physicists have largely neglected the field. and recorders are examples of the impact of
Some of the earliest applications of the princi- electronics on medicine. While these fields
ples of physics to problems in medicine were in border on medical physics, none are concerned
the fields of optics and sound. An early contrib- primarily with the application of physical prin-
utor was H. L. F. von Helmholtz; a physician as ciples to clinical problems. Yet they compete so
well as a physicist. His work in physiological op- effectively with medical pbo/sics that it is
tics and that on the sensations of tone are con- difficult to delineate the boundaries of the latter.
sidered classics. Even earlier, J. L. M. Poiseuille, The discovery of X·RAYS by Roentgen in 1895
a French physician and physicist, seeking a bet- had an immediate impact upon medicine. With·
ter understanding of the flow of blood, studied in a few months, the new rays were used both
the flow of water in rigid tubes. His work not diagnostically and therapeutically. Indirectly,
only contributed to physiology but also estab- their application set the stage for the develop-
lished an important relation in the physics of ment of medical physics. Therapeutic applica-
viscous fluids. D' Arsonval, a French physicist, tion of x-rays raised questions concerning their
pioneered in the therapeutic use of high fre- quality and quantity-both of which are im-
quency electric currents and measuring instru- portant in accurate dosimetry. Evaluation of
ments. Much earlier, that unusual artist, inven- early successes and failures indicated the im-
tor and physicist, Leonardo da Vinci, had portance of the proper distribution of dose
shown a keen interest in the fascinating mechan- between neoplasm and normal tissue. The
ic.~ of human locomotion. physician turned to the physicist for assistance.
With the intensive development of the sciences The late Otto Glasser was one of the early radio-
of physics and medicine in the latter part of the logical physicists; he and Fricke in 1924 con-
nineteenth century, the two drew further apart. structed an air wall ionization chamber for the
This period also saw rapid development in the measurement of radiation dose. 2 Their construc-
science of physiology which is concerned not tion eliminated some of the nonlinear effects
only with chemical but also with physical pro- due to quality, i.e., photon energy distribution,
cesses in the body. Clinical physiology abounds in the evaluation of biological response. Other
with such concepts as the pressure-velocity re- early workers in America were Edith Quimby
lationships in the flow of blood, the mechanics and G. Failla. In England, L. H. Gray and
of the cardiac cycle, the work of breathing, gas W. V. Mayneord were active. In 1936, Gray
exchange in the lungs, voltage gradient in cellu- proposed the Bragg-Gray formula for determin-
lar membranes, and cable properties of nerves, ing the absolute amount of energy delivered to
to name but a few. These concepts have, of a medium from ionization measurements. 3 The
necessity, been worked out by scientists with work of Fricke, Glasser, and Failla along with
training and experience in the basic biological that of L. S. Taylor4 and others contributed to
and clinical procedures. Physicists have been in- the establishment in 1928 by the Second Inter-
active in the field and have made very little con- national Congress of Radiology of the roentgen
tribution to its development. But there is a as a unit of radiation dose based on the amount
growing awareness among physiologists of the of ionization generated in a standard volume of
importance of physical principles and the need air. The use of higher energies and ionizing
for precise statement of physical law. An ex- radiations other than x-rays led during the
ample of this conviction is Howell's Textbook 1950's to the abandonment of the roentgen as
of Physiology in which editions since the eigh· a unit of absorbed dose. Dissatisfaction with the
teenth have carried the title, "Physiology and roentgen was also due to a growing realization
Biophysics. "1 that biological response was more nearly related
A phenomenon of the mid-twentieth century to the energy absorbed in a medium. The Bragg-
has been the development of interdisciplinary Gray formula permitted the calculation of ab-
fields of science. BIOPHYSICS combines the most sorbed dose in a medium, and the work of J. S.
fundamental of the biological and physical Laughlin 5 established the dosimetry of high-
sciences. It has had an extremely rapid growth, energy radiations in energy units by calorimetric
with something like 30 to 40 university Depart- methods. While radiological physics is clearly
ments of Biophysics in America alone. Its a part of the broader discipline of medical
emphasis has been on the application of physical physics, it included in the early days practically
principles to all aspects of biology-cellular, all that was organized of the later subject.
botanical, zoological as well as clinical. For many years there were no organizations
An even more recent phenomenon has been of workers in the field of medical physics. In
the development of biomedical engineering. Its America, radiological physicists were associate
basis has been the application of the tremendous members of the Radiological Society of North
developments in electronics to medical measure- America, naturally dominated by radiologists.
ments and instrumentation. In fact, the field is First in Britain (The Hospital Physicist's Asso-
frequently referred to as biomedical electronics. ciation) and later in America specialty groups
Such recent developments as vector electrocar- were organized. The American Association of
diography, implantable pacemakers for the Physicists in Medicine (AAPM) brings together
737 METALLURGY

those physicists working in hospitals and med- at or in America, Fermilab, has a large and con-
ical schools, and interested in an understanding tinuing program for cancer treatment with high-
of the physical side of medical problems. The energy neutrons. The award of a Nobel prize in
membership has been largely drawn from those Physiology and Medicine in 1977 to Roslyn S.
working in radiological physics, but an interest Yalow 7 for the development of radioimmuno-
in all areas of the application of physics to assay served notice to the world that physicists
medical problems is rapidly developing. The are contributing to the solution of problems in
A.A.P.M. became affiliated with the American medicine. This was followed two years later by
Institute of Physics in 1958 and is now showing the award of Nobel prizes to a physicist, Allan
the most rafid growth of its nine member or- M. Cormack,s and an engineer, Godfrey N.
ganizations. In 1973, the A.A.P.M. began pub- Hounsfield,9 for the development of computed
lication of its own journal, Medical Physics. A tomography. Certainly, one can now state that
further indication of the developing awareness the new discipline of Medical Physics is healthy,
of this field are the International Conferences growing and showing unmistakable signs of sur-
on Medical Physics, which meet every three vival. And one can find few areas of physics
years (the Sixth met with the International where the challenge is greater, or the rewards
Conference on Medical and Biological Engineer- more satisfying, than in making accurate evalua-
ing in Hamburg, Federal Republic of Germany tion of physical variables in the living human
in 1982). patient.
One last word about a related field: Radiation
protection was in the early days a part of radio- LESTER S. SKAGGS
logical physics. In America, 1. S. Taylor was
active for many years at the Bureau of Stan- References
dards in setting up guidelines for protection from
radiation. During World War II, the Manhattan 1. Ruch, T. C., and Patton, H. D. (Eds.), "Physiology
Project required large numbers of workers in and Biophysics," 20th Ed., W. B. Saunders, Phila-
the field of protection, and the term HEALTH delphia, 1973.
PHYSICS was introduced. Since the war, the 2. Fricke, H., and Glasser, 0., "Standardization of the
field has grown with the growth of the area of Roentgen Ray Dose by Means of the Small Ioniza-
atomic energy. The Health Physics Society is a tion Chambers," Am. J. Roentgenol. 13, 462
large and growing group with many local chap- (1925).
ters and an international organization. The field 3. Gray, L. H., "An Ionization Method for the Ab-
seems, though, to be becoming more closely solute Measurement of X-Ray Energy," Proc.
aligned with the area of public health than with Roy. Soc., London Ser. A., 156,578 (1936).
clinical medicine. 4. Taylor, L. S., and Singer, G., "An Improved Form
of Standard Ionization Chamber," J. Res., Natl.
When the first edition of the "Encyclopedia Bur. Std. 5, 507 (1930).
of Physics" was issued, the viability of medical 5. Genna, S., and Laughlin, 1. S., "Absolute Calibra-
physics as a profession was uncertain. It ap- tion of Cobalt-60 Gamma Ray Beam," Radiology
peared that any breakthrough in the cure of 65,394 (1955).
cancer, eliminating radiation therapy as a mo-
6. Porter, B. F., "AlP Member Societies Entering the
dality of treatment, would also eliminate the
1980's," Physics Today 34, 27 (1981).
livelihood of medical physicists. But much was 7. Yalow, R. S., "Radioimmunoassay: A Probe for
already happening and the results that have the Fine Structure of Biologic Systems," Science
changed the picture are now apparent. Elec-
tronics and nuclear physics have made many 206,1236 (1978).
previously unmeasurable variables in medicine 8. Cormack, A. M., "Early Two-Dimensional Recon-
accessible to quantification. The introduction struction and Recent Topics Stemming from It,"
Science 209, 1482 (1980).
of the concept of modulation transfer function
has made a science of the evaluation of diagnos- 9. Hounsfield, G. N., "Computed Medical Imaging,"
Science 210, 22 (1980).
tic imaging quality. Nuclear medicine has had
wide applications and attracted many physicists
into the field. Ultrasound and computed to- Cross-references. BIOMEDICAL INSTRUMENTA-
mography have had spectacular successes in TION, BIOPHYSICS, HEALTH PHYSICS. MOLEC-
improving medical diagnostics. And now nu- ULAR BIOLOGY, RADIOACTIVITY, X-RAYS.
clear magnetic resonance imaging is beginning
to be applied. During the 1970s, with shrinking
financial support for physics, many recent PhDs METALLURGY
entered the field of medical physics as post-
doctoral trainees. Physicists in general became The metallurgical industry is one of the oldest
more aware of the opportunities and the need of the arts, but one of the youngest of the sub-
to apply physical principles to the many prob- jects to be investigated systematically and con-
lems in medicine. Many large physics accelera- sidered analytically in the tradition of the pure
tor installations developed programs for the sciences. It is only in comparatively recent
treatment of cancer. Even the largest acceler- times that any fundamental work has been
METALLURGY 738

carried out on metals and alloys, but there


are now well-established and rapidly growing
branches of science which are related to the
metallurgical industry.
Extraction Metallurgy Extraction metallurgy,
or the science of extracting metals from their
ores, is broadly divided into two groups.
Ferrous . This branch is concerned with the
production of iron (normally from iron ore,
with coke and limestone in a blast furnace) and
its subsequent refining into steel, by oxidizing
the impurities either in an electric arc furnace
by means of an appropriate slag on the surface
or in a "converter," by blowing oxidizing gas
through the molten iron. The most striking
recent developments in this field have been the
increasing use of pure gaseous oxygen in steel-
making, and the increasing size of furnaces, with FIG. 1. The microstructure of gray cast iron: the
a resultant improvement in efficiency, rate of black lines are flakes of graphite.
production, and quality of product. Over 500
million tonnes or 70% of the worlds's annual
steel production is now made using oxygen metal is soft (hot-working), or at lower tempera-
converters. tures (cold-working) where deformation leads
Nonferrous. Some metals such as chromium, to progressive hardening of the metal (work-
cobalt, and manganese, for example, are prin- hardening). In contrast with casting, forgings
cipally produced as alloying elements to improve usually exhibit differing physical and mechan-
the properties of steels. The nonferrous metals ical properties in different directions, due to
manufactured in greatest quantity include alu- the directional nature of the shaping operation .
minum, copper, nickel, zinc, magnesium, lead, Much modern research in physical metallurgy
and tin, with titanium being an important new- is concerned with investigating the plastic flow
comer in view of its low density, high melting and work-hardening behavior of metals and
point (1943 K) and resistance to corrosion. The alloys . Metal crystals yield plastically at stresses
precious metals, and the "refractory metals" of several orders of magnitude lower than the
very high melting point (e.g., tungsten and theoretical value for the deformation of perfect
molybdenum) are other important families. crystals. This discrepancy is accounted for by
Shaping of Metals This may be carried out the presence of linear imperfections known as
in three main ways. "dislocation lines" within the crystals. Plastic
Casting. Most metals are initially cast into flow takes place in metal crystals by "slip" or
ingots, which may be subsequently forged to "glide" in definite crystallographic directions on
shape. The technique of continuous casting certain crystal planes, due to the movement of
is increasingly used in this context to improve dislocation lines under the applied stress. Dis-
efficiency and to increase the rate of produc- locations multiply and entangle as deformation
tion. By 1990 it is expected that one-half of proceeds, thus making further flow increasingly
the world's steel output will be continuously difficult (work-hardening)-the densitr of dis-
cast. Many alloys are designed to be cast into locations rising from about 105 mm - in soft
their final shape by pouring the molten alloy (annealed) metal to about 1Olo mm -2 in work-
into an appropriate mold. These may be sand hardened material. These and other types of
molds if only a small number of objects are crystal defect (such as stacking faults, which are
re~uired, and very massive castings (e.g., over planar in geometry) can be studied by x-ray
10 Kg in mass) may also be produced in this diffraction and also by means of the electron
way. A permanent mold, or die casting, is microscope and the field-ion microscope (q.v.).
employed if large numbers of the object are Powder Metallurgy. This is a method of shap-
required (particularly in alloys of low melting ing by pressing finely powdered metal into an
point, such ljS zinc-based alloys), and high appropriately shaped die. The "green compact"
dimensional accuracy can be achieved by these thereby produced is of low strength and is sub-
means. Cast iron is the cheapest metallic ma- sequently heated in an inert atmosphere ("sin-
terial. The microstructure of "gray cast iron" tered"); the pressing and sintering may be re-
is shown in Fig. 1. It consists of flakes of graph- peated until strong, dense products are obtained.
ite in a two-phase matrix of iron and iron The technology was first developed for metals
carbide (Fe3C)' The brittleness of this material which were of too high a melting point for con-
arises from the weakness of the graphite flakes, ventional casting and forging methods, and
which act like cracks in the structure. tungsten lamp filaments were first produced by
Forging. This entails shaping of the metal this means. Other refractory metals and hard
by rolling, pressing, hammering, etc., and may metal-cutting alloys may thus be shaped, and
be carried out at high temperatures, when the some magnetic and other special alloys are pre-
739 METAI.U:RGY

pared in this way by suitable blending of and welding (in which the metal itself is fused).
powders, which avoids any contamination that Weldability is often the critical factor in the
may be associated with the melting process. The selection of an alloy for a given purpose, since
pressing and sintering conditions may be the metallurgical changes produced by localized
arranged to leave some residual porosity in the heating are often associated with the develop-
structure of, for example, bronze bearing alloys. ment of deleterious properties at, or adjacent
The pores are filled with oil, thus producing the to, the weld.
so-called oil-less bearings which can operate Alloy Constitution Phase equilibria in alloy
without further lubrication. systems are represented on phase diagrams,
Joining. The three important methods of which represent the temperature ranges of
joining metals are riveting, soldering or brazing phase stability as a function of composition.
(in which metal components are joined by An example of such a diagram is given in Fig. 2;
means of a layer of alloy of lower melting point), they are experimentally established by, e.g.,

1100 A

900

800

U
0
700

;
~ 600
........
:::>

Il:
a.
a. 500
......
~

a.+b
0400 Z
6+£

0 P
300
(
+
"1
0.+£

0
0 10 20 30 40
PERCENTAGE OF TIN

Poillt .-\ Il ( . 1> ~; V ( :


; ....,:)
H ., K L :'1 ~'
o
Sn. "
C.

"
III. :\
0 ,
"iH
:\ ' :;
'j!)~
:!~,n
iH '
:,!,." :,
'i :;.~,
:,!.-, . !l
;:),.,
:!j · 1I :W'li
:l :--:ti
1,-,' S
:I.'-tli
~ ~ ' Ii
.-,SH
~.-, . ~
. .)~o
t .-} · ~
.-):!(I
:!i·tJ
:;~{l
:11 · ~
ZOO
I ·~

Point ~ () I' 1" {J Il :..: T l' \' \\ :\ Y Z


" C. 3':;11 :\:,11 :\,-,11 H ~O [i jli 1i~1I !i~1I ,-)!)l1 ,-,!In .-,nn ' '':' t l :,! ,-'x:! (i~n ~Iii
Sn. 0
0 , I'll :~:! ' .-,,-) :\, · S :\1 .•~ :1. ,:1 :I~' ~ :1.-, .:! :11 '1; :I~' :\ :n · ' :I~· !l :I~· , :1\)· .-. 38·3

FIG. 2. The copper-tin phase diagram. (G. V. Raynor, "Institute of Metals Annotated Equilibrium
Diagram," Series No.2, 1944.)
METALLURGY 740

thermal analysis, dilatometry, microscopical, eral principles have begun to appear, although
and x-ray diffraction methods. Phase diagrams there is yet little theoretical knowledge enabling
can also be calculated by computation of the one to calculate properties or structures of
Gibbs free energy minimum, if the thermo- alloys from fundamental principles. The pro-
dynamic functions of observable phases are pensity to form deleterious phases in nickel-
determined by experiment. Diagrams such as based materials has been related empirically to
that in Fig. 2 are invaluable in the interpreta- the average number of electron vacancies (Ny)
tion of the structures of alloys observed under in a given alloy. Computer calculations, using
the microscope. a system known as PHACOMP, have identified
The microstructure of an alloy (and hence its the critical value of Ny above which such phases
properties) will be determined not only by its form.
composition, but also by its thermal and The Effect of Environment Upon the Behavior
mechanical history. Of particular importance is of Metals Low Temperature. Some metals and
the metallurgical control of the mechanical alloys exhibit a spectacular change in mechanical
properties of an alloy by heat-treatment, which behavior with decrease in temperature . Many
affects the distribution of the phases present. metals of body-centered cubic crystal symmetry
Hardness, for example, will depend upon the (e.g., iron lmd mild steel) which are tough and
state of deformation (i.e., the density of dis- ductile at ordinary temperatures become com-
locations) and upon the composition of the pletely brittle at subzero temperatures, the
alloy. Pure metal crystals can be hardened by actual transition temperature depending upon
other atoms in solid solution (solute hardening) the metallurgical condition of the alloy, the
as well as by finely dispersed particles of a hard state of stress, and the rate of deformation .
second phase (precipitation, or dispersion hard- Some metals of hexagonal symmetry (e.g., zinc)
ening) which are effective in impeding the exhibit this dfect, bu,t metals of face-centered
motion of dislocations when the crystal is cubic symmetry (e .g., copper) remain ductile to
stressed. Fig. 3 is an electron micrograph show- the lowest temperatures. This transition in
ing dislocations on the slip plane of a copper behavior is clearly of critical importance in the
alloy crystal, and it illustrates how the presence selection of materials for . low-temperature ap-
of hard particles has caused local entanglement plication.
of the dislocations . The relationship between High Temperature. Apart from problems of
the microstructure and properties of metals and oxidation (discussed below), metals tend to
alloys is of fundamental importance and is a deform under constant stress at elevated tem-
field of intense scientific activity. peratures (the deformation is known as "creep"),
Although many common alloys were not and creep-resistant alloys are designed to pro-
developed scientifically, a considerable theory vide strength at high temperatures. These are
of alloys is developing, springing from empirical essentially alloys in a state of high thermo-
rules and principles (notably those due to W. . dynamic stability, usually containing finely
Hume-Rothery) which have generalized the dispersed particles of a hard second phase which
facts and enabled predictions to be made . The impede the movement of dislocations. Grain
early theories of the metallic state, due to boundaries may be a source of weakness at
Drude and Lorentz, and later to Sommerfeld, elevated temperatures, and turbine blades in
were developed and discussed by N. F . Mott and the form of alloy single crystals have recently
H. Jones in their book "The Theory of the been employed in engines of advanced design.
Properties of Metals and Alloys." A great in- Fatigue. Metals break under oscillating stresses
crease in our knowledge of transition metals and whose maximum value is smaller than that
al10ys has taken place, and some signs of gen- required to cause rupture in a static test,
although many ferrous alloys show a "fatigue
limit," or stress below which such fracture
never occurs, however great the number of
cycles of application. The phenomenon is
associated with the nucleation of submicro-
scopic surface cracks in the fatigued component
early in its life, which initially grow very
slowly. Eventually a crack grows until the ef-
fective cross section of the piece is reduced to
such a value that the applied stress cannot be
supported, and rapid failure occurs. A typical
fatigue fracture surface is shown in Fig. 4, in
which two distinct zones are apparent. These
correspond to the period of slow growth (left-
hand side) and final failure, respectively.
FIG. 3. A deformed copper alloy crystal containing Oxidation and Corrosion. With the exception
hard particles. Electron micrograph showing inter- of the "noble metals," which are intrinsically
action of dislocation lines with the second phase. resistant to attack by the environment, metals
741 METEOROLOGY

Smithells, C. J., "Metals Reference Book," London,


Thornton Butterworth Ltd., 1983.
Moffatt, W. G., "Handbook of Binary Phase Dia-
grams," Schenectady, N.Y., The General Electric
Co., 1981.
General Reading
Street, A., and Alexander, W.O., "Metals in the Ser-
vice of Man," London, Pelican, 1973.
Martin J. W., "Elementary Science of Metals," London,
Wykeham Publications, 1969.
West, J. M., "Basic Corrosion and Oxidation," New
York, Wiley, 1980.
Cross-references: CR YST ALLOGRAPHY, CR YST AL
STRUCTURE ANALYSIS, ELECTROCHEMISTRY,
LATTICE DEFECTS, MECHANICAL PROPERTIES
OF SOLIDS, SOLID-STATE PHYSICS.
FIG.4. A fatigue fracture surface upon a large steel
shaft. (Courtesy of British Engine Insurance Ltd.)
METEOROLOGY
in general owe their oxidation resistance, when Meteorology is the study of the atmosphere.
they are heated in air, to the presence of im- The word is derived from the classical Greek
pervious oxide films on their surfaces. Those meteoros meaning "things lifted up." As early
which develop porous oxides (e.g., the refractory as 250 B.C. Aristotle wrote extensively about
metals tungsten and molybdenum) oxidize very these topics in his treatise "Meteorologica."
rapidly at high temperatures. Oxidation resistant Meteorology, as practiced today, is broadly
alloys are designed to maintain a protective interdisciplinary, drawing extensively from the
film under these conditions. fields of hydrodynamics, thermodynamics,
Corrosion occurs under conditions of high optics, chemistry, and mathematics. It has
humidity or immersion in aqueous media. The applications in industry, agriculture, transporta-
phenomenon can be interpreted electrochemi- tion, economics, resource management, and
cally-local anodes form at the region of metal many other human activities.
dissolution, and local cathodes form where the Almost every branch of physics and chemistry
electrons are discharged. "Galvanic corrosion" is in some way involved in atmospheric phe-
is encountered where dissimilar metals are in nomena. This article is confined to brief over-
electrical contact under these conditions. Of views of modern methods of weather prediction,
particular importance is the conjoint action of climatology, cloud physics (including attempts
stress and corrosion, where "stress corrosion" or at weather modification), and boundary-layer
(under fluctuating stresses) "corrosion fatigue" processes.
cracking may be encountered, in situations Atmospheric Circulation and Weather Pre-
where no failure would occur under the action diction The basic state of the atmosphere is
of the stress or the corrosive environment applied described by seven variables: three components
separately. Electrochemical principles are ap- of velocity; temperature; pressure; density; and
plied in the protection against corrosion. the water-vapor concentration (usually called
Materials Technology The scientific princi- the mixing ratio). The changes of these variables
ples which govern the behavior of metals are, are governed by seven well known equations.
of course, applicable to a wide range of other The accelerations in the three principal direc-
technologically important materials, such as tions are given by three differential equations
polymers, ceramics, and glasses. In recent years based on Newton's second law. These were
many centers of metallurgical research both in written in their hydrodynamic form by Euler
industry and in universities have broadened their about 200 years ago. Temperature changes
approach in this way and are often described as are governed by the first law of thermody-
Departments of Materials Technology. namics. The other equations are the equation
of state and the equations for the conservation
JOHN W. MARTIN of mass of the dry air and water vapor. The
complete set of equations has been established
References for well over 100 years, but because of their
Metallurgical Data complexity, their application to weather predic-
tion has become a reality only during the last
The series of "Metals Handbooks," published by the 25 years as modern computers have become
American Society for Metals, Metals Park, Ohio. available.
METEOROLOGY 742

The numerical problems involved in solving recent of these is the Global Atmospheric
these equations for the future values of weather Research Program (GARP), planning and co-
elements are far from trivial. Essentially, the ordination of which is done jointly by the
changes must be calculated for periods of a few International Union of Geodesy and Geophys-
minutes at a time based on the global distribu- ics (IUGG) and WMO. The program seeks to
tion of the elements derived from observation provide a better understanding of the large
or calculated during earlier time steps, and scale atmosphere, to define better the data
moved forward step by step. This method was requirements of atmospheric prediction models,
first studied by L. F. Richardson in the early and to exploit the fullest observational capabili-
1920s, but had to be abandoned because of ties of modern technology.
inadequate computational capability. The prob- Climatology Climate may be defined as the
lems were successfully confronted by John von statistical summary of past weather at a fixed
Neumann and his colleagues in Princeton about place. It is most often thought of as a running
1950. In 1955 the U.S. Weather Bureau began average using a fixed time interval, the length
numerical weather prediction on a twice-daily of which may be as little as a month in some
basis. Today, numerical weather prediction is conceptions or indefinitely large in others.
being carried out routinely from greatly im- Usually a period of about thirty years is pre-
proved global models at several national and ferred in order to filter out annual and shorter-
international centers. The use of models has term fluctuations while still allowing longer
brought about a considerable increase of pre- period variations to be studied. Strictly speak-
dictive skill (measured by the degree to which ing, one cannot observe the climate of the
accuracy exceeds that attainable using clima- present. However, the study of past climates
tological averages). The skill achieved for two- and their changes provides a basis for projec-
to three-day forecasts by computer models tions into the present and the future, and such
is about that formerly attainable one day in studies are receiving intensive effort and sup-
advance by subjective methods. Demonstrable port at the present time.
skill has been achieved out to ten or twelve Research in climatology may be subdivided
days, but five days is a more realistic limit in into three main categories: (1) the reconstruc-
temperate latitudes on a routine basis. tion of past climates -from weather observa-
The use of circulation models in weather pre- tions and proxy data; (2) the understanding
diction requires upper-air observational input of the physical and mathematical bases of
over at least the entire Northern Hemisphere climate and its changes; and (3) assessment of
(current models are now global in extent). The the impact of climate on society. Contributions
data are provided each 12 hours by several to the study of climate have been and are being
hundred soundings of pressure, temperature, made by scientists in a wide range of disciplines.
humidity, and winds observed from balloons Instrumental records of weather go back for
and telemetered to ground stations. The specifi- about two centuries at most. Therefore, the
cations for these data and their distribution are reconstruction of climates depends mainly on
coordinated by the World Meteorological Or- the analysis and interpretation of proxy data,
ganization (WMO). Data from several thousand such as tree rings, glacier ice cores, and ancient
surface stations and ships are also used. descriptions of the distribution of plants.
Meteorological satellites have been employed Geological evidence long ago indicated the
extensively to supplement surface based data. occurrence of large variations of climate during
Geostationary satellites, hovering at a height the history of the earth. At least four major
of 35,000 km above a fixed point on the equa- glaciations occurred in Europe and North Amer-
tor provide quasi-hemispheric visible and infra- ica during the Pleistocene. Within the last ten
red images every half hour. These are valuable years, a reasonably reliable chronology began
for short-range prediction of thunderstorms, to become available from isotope analyses of
tornados, and other localized phenomena. They cores extracted from the deep ocean bottom.
also provide important surveillance of tropical From these and other data is emerging an inter-
and extra tropical cyclones in data-sparse areas, esting pattern. The mean temperature of the
particularly in the southern hemisphere. Winds earth declined significantly during Tertiary
derived from satellite cloud-image motions have times and there may have been as many as 17
made important inputs to numerical prediction major glaciations during the last 1.7 million
models. Also for this purpose, much effort has years. A period of about 100,000 years be-
been devoted to developing methods of deriving tween major episodes is quite conspicuous, and
temperature soundings remotely from polar- the spectrum over the last 400,000 years also
orbiting satellites. The inversion technique for shows significant peaks near 41,000 and 22,000
reconstructing the temperature distribution years. The last glaciation reached its climax
from infrared fluxes observed at different wave- about 16,000 years ago and was followed by
lengths has been solved, but the technique can a warm extreme about 6,000 years ago. Since
be used only in cloud-free regions. then the mean temperature has gradually de-
Because of the global extent of the atmo- clined, with numerous short-term fluctuations
spheric circulation, much of the research is on many time scales.
carried out in international programs. The most Rapid strides have been made during the last
743 METEOROLOGY

decade in understanding the physical and math- It is estimated that about half of the carbon
ematical basis of climate. This understanding is dioxide given off from the burning of fossil fuels
greatly complicated by the large number of resides in the atmosphere and the rest is dis-
interactions that occur between different com- solved in the upper layers of the ocean. This
ponents of the atmosphere-ocean-earth system atmospheric residue interacts with the infrared
and by the response of such a complex system radiation in the atmosphere in such a way as
to perturbations imposed on it. As an example to increase the surface temperature. At present
of such interactions or feedbacks, picture the the atmospheric content is increasing about
effect of a temperature perturbation resulting 0.4% a year, and it is estimated that this in-
possibly from some minor disturbance of the crease, acting alone, would raise the mean tem-
sun's atmosphere. An increase of surface tem- perature about 0.3 C over the next 50 years.
perature would increase evaporation from the The observational verification of carbon dioxide
ocean's surface, affecting in turn the mean warming of the atmosphere has not been pos-
cloudiness, snow cover, and probably the sible because of numerous other factors that
circulation pattern; these changes would in turn cannot be controlled.
affect the distribution of absorption of solar Cloud Physics and Weather Modification The
radiation and thereby result in further changes water vapor saturation required for the forma-
of the temperature distribution. The complexity tion of clouds and precipitation normally comes
of the climate system has prompted the devel- from upward motion and accompanying adia-
opment of computer models capable of in- batic cooling of the air. Condensation actually
corporating them. Such models are currently begins at humidities slightly less than lOO% on
severely limited by the insufficient understand- small, suspended hygroscopic nuclei derived
ing of the physics and chemistry of some of from such sources as sea spray and combustion
these interactions, and also the difficulty of products. Because of the strong curvature of
evaluating the covariances that appear when the the surface of the growing droplets, a slight su-
atmospheric equations are averaged over long persaturation is required for continued growth.
periods of time. One of the interesting sugges- Freezing nuclei are rare, and cloud droplets
tions emerging from mathematical and com- normally remain in the liquid phase at sub-
putational studies of climate models is the freezing temperatures down to - 40 C.
possibility that the system may spontaneously Cloud droplets range in diameter from 1 to
oscillate between two or more quite different 50 J1 depending mainly on the size of the origi-
states without any external forcing. nal nucleus and the extent of coalescence with
Many possibilities exist for both external and other droplets. An average raindrop contains
internal forcing of climate changes. Observa- about a million times the water mass of a single
tions of solar energy output over the last 50 cloud droplet and cannot be formed in a rea-
years have shown it to be remarkably steady; sonable time by the ordinary droplet growth
nevertheless, it is scientifically plausible that mechanism. Two processes are believed to act
significant variations of this output rate may naturally if the required circumstances are met.
have occurred over longer time scales. It is also The first of these is coalescence of colliding
likely that the prevalence of suspended dust droplets that fall at different speeds; such a
from volcanoes has varied widely. The impact process requires a broad spectrum of droplet
of such dust on the terrestrial heat budget is sizes. The second mechanism is the Bergeron-
not clear and may not have been great. It is Findeisen process, which requires the presence
becoming rather well accepted that on a time of a few ice crystals in a predominantly liquid
scale of tens of millions of years, continental cloud at subfreezing temperatures. The dif-
displacements are effective causes of change; ference between the saturation vapor pressures
the major glaciations that occurred in India over liquid and ice at low temperatures causes
and Australia during Permian times, and proba- a rapid transfer of water from the liquid to the
bly also the Tertiary cooling, can be attributed ice. Because the requirements for these pro-
to this cause. The presence of 22-, 41-, and cesses are rather stringent, most clouds do not
100-thousand-year periods in the Pleistocene precipitate.
chronology gives strong support to the astro- In principle it should be possible to trigger
nomical theory proposed originally by Milan- the first process by artificially injecting into the
kovitch. According to this theory, changes in cloud liquid water masses, which fragment
the orbital elements of the earth caused by immediately into drops of a wide range of sizes.
the Moon and planets give rise to variations of Each of these should then grow by coalescence
solar insolation with precisely these periods. and fracture into a large number of new seeds,
The amplitude of these variations is rather effecting a chain reaction that can ultimately
small, and the mechanism by which they would produce beneficial amounts of precipitation.
produce the observed climatic response is not The second process has been stimulated in
understood. However, on the basis of past rela- natural clouds by seeding with freezing nuclei
tionships, it is possible to project that in the such as silver iodide or with "dry ice," which
absence of anthropogenic influences, the mean produces ice crystals by the sudden chilling of
temperature of the earth will decline over the droplets.
next 60,000 years. Both of these methods have been employed
METEOROLOGY 744

widely in rain-making operations and experi- (along with products derived from oxides of
ments. These techniques appear to have been nitrogen) in what is commonly called acid rain.
effective in a few experiments, but the amounts Natural rain is slightly acidic due to the presence
of augmentation have usually been too small of carbon dioxide. Rain downstream of pollu-
to demonstrate conclusively under statistically tant sources sometimes reaches pH values of
controlled conditions. In most (but probably 4.0 or less.
not all) cases where there is sufficient liquid Dilution of pollutant concentrations by tur-
water in the cloud to produce a significant bulent diffusion has been intensively studied
amount of precipitation and sufficient vertical for more than 50 years, both in the laboratory
motion to sustain it once it begins, the precipi- and in field experiments. Turbulence in the
tation forms naturally. In a few conditions the atmospheric environment is more complicated
artificially created latent heat of sublimation than in the laboratory, and the challenge of
may provide enough additional buoyancy to these complications has attracted the attention
trigger a large amount of additional cloud growth of a number of prominent hydrodynamicists.
and precipitation. During the last 25 years the emphasis has been
Injection of silver iodide into cumulonimbus directed mainly at observing the distribution
clouds has also been attempted for the purpose of mean wind and turbulence on towers within
of preventing hail. By increasing the number of the lowest 300 meters above the surface. The
ice seeds that compete for the available cloud main efforts are now shifting toward computer
water it is hoped that one can substitute for a modeling of turbulence throughout the entire
small number of large hail stones a vastly in- boundary layer.
creased number of small stones that melt before Research and Publications Research is mete-
reaching the ground. orology is carried out in many countries and
Boundary Layer Meteorology The atmo- results are published in a variety of specialized
spheric boundary layer comprises the lowest journals. A partial list of these includes the
kilometer (more or less), where the air proper- Journal of the Atmospheric Sciences, Journal
ties are directly influenced by interactions with of Climate and Applied Meteorology, Journal
the surface of the earth. The layer is intensively of Atmospheric and Terrestrial Physics, Quar-
studied for several reasons. Nearly all of the terly Journal of the Royal Meteorological Soci-
heat and water vapor that generate kinetic ety, Tellus, !zvestiya-Atmospheric and Oceanic
energy and precipitation pass through this layer Physics, Journal of Geophysical Research, and
and are controlled by processes near the earth's Atmospheric Environment. A complete list of
surface. The greater part of the kinetic energy publications in this field can be found in
that is generated by storms is dissipated within Meteorological and Geoastrophysical Abstracts
this layer. Finally, pollutants enter the atmo- published by the American Meteorological
sphere near the surface, and they are transported Society.
by the winds in this layer, undergo chemical
transformations with other species, and are ALFREDK.BLACKADAR
diffused by turbulence in varying degrees. References
Boundary layer processes must be included in
numerical prediction models. Without the inclu- Anthes, R. A., Cahir, J. J., Fraser, A. B., and Panofsky,
sion of friction, storms tend to overintensify. H. A., "The Atmosphere," 3rd Ed., Columbus, Ohio,
It has also been found that longer period pre- Charles E. Merrill Publishing Company, 1981 (531
diction of precipitation requires the inclusion pp.).
of evaporation and the transfer of water vapor Berger, A. (Ed.), "Climatic Variations and Variability:
through the boundary layer into the free Facts and Theories," D. Reidel Publishing Co., 1981
atmosphere. (795 pp.).
Legislation has made it necessary to find and Butcher, S. S., and Charlson, R. J., "An Introduction
use working theories for relating ambient air to Air Chemistry," New York, Academic Press, 1972
quality to the industrial emissions that affect it. (241 pp.).
The pressure imposed by regulation authorities Byers, H. R., "Elements of Cloud Physics," Chicago,
and the requirements of legislation have been a Univ. Chicago Press, 1965 (191 pp.).
strong stimulus for research to develop better Munn, R. E., "Descriptive Micrometeorology," New
theories for transport, modification, and diffu- York, Academic Press, 1966 (245 pp.).
sion in natural environments. The problems are Pruppacher, H. R., and Klett, J. D., "Microphysics of
compounded by the fact that many industrial Clouds and Precipitation," D. Reidel Publishing Co.,
operations are situated near coastlines or in 1978 (714 pp.).
complex terrain that defy the application of Wallace, J. M., and Hobbs, P. V., "Atmospheric Sci-
simple theories. Moreover, the chemical trans- ence: An Introductory Survey," New York, Aca-
formations involve a large number of possible demic Press, 1977 (467 pp.).
reactions under very low concentrations, often
involving heterogeneous phases. Of particular Cross-references: AIR GLOW, COMPUTERS, PLAN·
importance is the conversion of S02 to sulfate ETARY ATMOSPHERES, TELEMETRY, TEMPERA·
and the precipitation of the resulting acidic ions TURE AND THERMOMETRY.
745 MICHELSON-MORLEY EXPERIMENT

MICHELSON-MORLEY EXPERIMENT* the velocity of light traveling in two mutually


perpendicular directions. The experiment, when
Introduction The revival and development of completed in 1887, gave a most convincing null
the wave theory of light at the beginning of the result and proved to be the culmination of the
nineteenth century, principally through the long nineteenth century search for the ether. At
contributions of Young and Fresnel, posed a that time, the definitive null result of the
problem which proved to be of major interest Michelson-Morley experiment was a most dis-
for physics throughout the entire century. The concerting finding for theoretical physics, and
question concerned the nature of the medium indeed for many years repetitions of this ex-
in which light is propagated. This medium was periment and related ones were performed with
called the "aether" and an enormous amqunt the hope of finding positive experimental
of experimental and theoretical work was ex- evidence for the earth's motion through the
pended in efforts to determine its properties. ether. These later experiments, however, have
On the experimental side, a long series of elec- all been shown to be consistent with the original
trical and optical investigations were carried out null result obtained by Michelson and Morley.
attempting to measure the motion of the earth In the years following 1887, their experiment
through the ether medium. For many years, the led to extensive and revdlutionary develop-
experimental precision permitted measurements ments in theoretical physics, and proved to be a
only to the first power of the ratio of the speed major incentive for the work of FitzGerald,
of the earth in its orbit to the speed of light Lorentz, Larmor, Poincare, and others, leading
(vic ~ 10-4 ), and these "first-order experi- finally in 1905 to the special theory of relativity
ments" uniformly gave null results. It became of Albert Einstein.
the accepted view that the earth's motion The optical paths in the Michelson-Morley
through the ether could not be detected by interferometer are shown in plan in Fig. 1.
laboratory experiments of this sensitivity. With Light from a is divided into two coherent
the development of Maxwell's electromagnetic beams at the half-reflecting, half-transmitting
theory of light, and especially with its exten- rear surface of the optical flat b. These two
sions by Lorentz in his electron theory, theo- beams travel at 90° to each other and are multi-
retical explanations for the null results obtained ply reflected by two systems of mirrors d - e
in the early ether drift experiments were pro- and dl - el. On returning to b part of the
vided . These results were in harmony with the light from e - d is reflected into the telescope
Galilean-Newtonian principle of relativity in at f, and light from el - dl is also transmitted
mechanics, which explains why the essential to f. These two coherent beams of light produce
features of all uniform motions are independent interference fringes. These are formed in white
of the frame of reference in which they are ob-
served. In Maxwell's electromagnetic theory,
however the situation was altered when quanti-
ties of the second order in (vic) were considered.
According to the Max well theory, effects de-
pending on (vlc)2 should have been detectable
in optical and electrical experiments. The pres-
ence of these "second-order effects" would in-
dicate a preferred reference frame for the phe-
nomena in which the ether would be at rest.
At first, this feature of Maxwell's theory imply-
ing observable ether drift effects of the second
order in (vic) raised a purely hypothetical ques-
tion, since the accuracy needed for such experi-
ments was a part in a hundred million, and no
experimental techniques then known could
attain this sensitivity.
Michelson pondered this problem and it led
him to invent the Michelson interferometer,
which was capable of measurements of the re-
quired sensitivity, and to plan the ether drift
experiment which he carried to completion in
collaboration with Edward W. Morley at Cleve-
land in 1887. This famous optical interference
experiment was devised to measure the motion
of the earth through the ether medium by ~
means of an extremely sensitive comparison of 0./
/

*The author passed away on March 5, 1982, after FIG. 1. Optical paths in the Michelson-Morley
preparing this article. interferometer.
MICHELSON-MORLEY EXPERIMENT 746

only when the optical paths in both arms are accurate readings of fringe positions could
exactly equal, a condition produced by moving easily be made while the apparatus rotated.
the mirror at el by a micrometer_ c is an optical The experiment to observe "the relative mo-
compensating plate. The effective optical path tion of the earth and the luminiferous ether"
length of each arm of the apparatus was in- for which this instrument was devised, was
creased to 1100 em by the repeated reflections planned by Michelson and Morley as follows.
from the mirror system. When the interferometer is oriented as in Fig. 3
Figure 2 is a perspective drawing of the with the arm LI parallel to the direction of the
Michelson-Morley interferometer showing the earth's velocity v in space, the time required for
optical system mounted on a 5 foot square light to travel from M to MI and return to M in
sandstone slab. The slab is supported on the its.new position is,
annular wooden float, which in turn fitted into
the annular cast-iron trough containing mercury
which floated the apparatus. On the outside of
LI
(11(1)=--+--=---
LI 2LI I v)
( ~=-
e- v e + v e l - ~2 e
this tank can be seen some of the numbers I to
16 used to locate the position of the inter- The time for light to make the journey to and
ferometer in azimuth. The trough was mounted from the mirror M2 in the other interferometer
on a brick pier which in turn was supported by arm L2 is,
a special concrete base. The height of the ap-
paratus was such that the telescope was at eye (1 (1) = [2L2(1 + tan 2Q:)I/2/el
level to permit convenient observation of the
fringes when the instrument was rotating in the and since tan 2Q: = v 2f(e 2 - V2)
mercury. While observations were being made, (1) _ 2L2 _ _ __
the optical parts were covered with a wooden
box to reduce air currents and temperature (1 - e (1 - ~2) 1/2'
fluctuations.
This arrangement permitted the interferometer When the interferometer is rotated through
to be continuously rotated in the horizontal 90° in the horizontal plane so that the arm L2
plane so that observations of the interference is parallel to v, the corresponding times are,
fringes could be made at all azimuths with re- 2L2 I
spect to the earth's orbital velocity through (11(2) = - - -
space. When set in motion, the interferometer e I - ~2
would rotate slowly (about once in 6 minutes) 2LI
for hours at a time. No starting and stopping ( (2)=-----
was necessary, and the motion was so slow that 1 e (1 - ~2 ) 1/2

FIG. 2. Michelson-Morley interferometer used at Oeveland in 1887.


747 MICHELSON-MORLEY EXPERIMENT

evening of the days of July 8, 9, 11, 12 of


1887. Instead of the expected shift of 0.4 of a
fringe they found "that if there is any displace-
ment due to the relative motion of the earth
and the luminiferous ether, this cannot be much
greater than 0.01 of the distance between the
fringes."
The result of the Michelson-Morley experi-
ment has always been accepted as definitive and
formed an essential base for the long train of
theoretical developments that finally culminated
in the special theory of relativity. The first im-
s portant suggestion advanced to explain the null
result of Michelson and Morley was G. F. Fitz-
Gerald's hypothesis that the length of the
interferometer is contracted in the direction of
its motion through the ether by the exact
FIG. 3. The Michelson-Morley experiment. amount necessary to compensate for the in-
creased time needed by the light signal in its
to-and-fro path. This contraction hypothesis
was made quantitative by H. A. Lorentz in
Thus, the total phase shift (in time) between further development of his electron theory in
the two light beams expected on the ether which he introduced the formalism which has
theory for a rotation of the interferometer since been known as the "Lorentz transforma-
through 90° is, tion" for the analysis of relative motions.
2L 1 [
t::..t = -c- I={32 -
I (1 -
1]
~2 )1/2
H. Poincare also contributed greatly to both
the philosophical and mathematical develop-
ments of the theory. As early as 1899, he asserted

2L 2
+ -c- 1 - ~2
[1 - (I -
1]
~2 )1/2
that the result of Michelson and Morley should
be generalized to a doctrine that absolute mo-
tion is in principle not detectable by laboratory

=
2(L 1 + L 2) [
c 1 - ~2
1 - (1 -
1]
~2) 1/2
experiments of any kind. Poincare further
elaborated his ideas in 1900 and in 1904 and
gave to his generalization the name "the principle
of relativity." He also completed the theory of
Lorentz and it was he who named the essential
For equal interferometer arms, as used in this transformation "the Lorentz transformation."
experiment, In 1905 Einstein published his famous paper
L1 =L2 =L, and, since ~ < I, on the "Electrodynamics of Moving Bodies" in
which he developed the special theory of rela-
2L tivity from two postulates: (I) the principle of
At~-~2 relativity was accepted as the impossibility of
c
detecting uniform me tion by laboratory experi-
The observations give the positions of the ments, and (2) the constancy of the speed of
fringes, rather than times, so the quantity of light was generalized to a postulate that light is
importance for the experiment is the change always propagated in empty space with a ve-
in optical path in the two arms of the locity independent of the motion of the source.
interferometer. Both postulates have a close relationship to the
Michelson-Morley experiment, which Einstein
A = cAt = 2L(vlc)2 knew through his study of the work of Lorentz.
Einstein's paper is generally considered as the
This is the quantity of second order in (vic) definitive exposition of the special relativity
referred to above. principle, and the climax of the century-long
With the Michelson-Morley interferometer, developments which had begun with Young
the magnitude of the expected shift of the and Fresnel to explain the electrical and optical
white-light interference pattern was 0.4 of a properties of moving bodies.
fringe as the instrument was rotated through an At all times the Michelson-Morley experiment
angle of 90° in the horizontal plane. Michelson continued to have great interest and was repeated
and Morley felt completely confident that many times throughout more than a half cen-
fringe shifts of this order of magnitude could tury. In 1904 Morley and Miller4 showed
be determined with high precision. that the Fitzgerald-Lorentz contraction is the
In July of 1887, Michelson and Morley were same in several materials. All repetitions after
able to make their definitive observations. The 1887 failed to find the full expected "aether-
experiments which gave their final measure- drift," although Dayton C. Miller's trials on
ments were conducted at noon and during the Mount Wilson (1921-1926) gave a small effect,
MICHELSON-MORLEY EXPERIMENT 748

later shown to be due to temperature gradients. 5


The most certain null result was that obtained [j~ = ~ [~ + (1Tgh)2]
by J oos6 using an interferometer built by Zeiss
of Jena. Finally, experiments by Townes 7 with g=~l/d
(1)
very sensitive laser techniques, gave definitive
confirmation of Michelson and Morley's work. ~l = (d 2 - d 2 )l/2,
In 1922 at the height of his fame for rela-
tivity, Einstein lectured widely in Japan. A where d is the distance between atoms of neigh-
reprint and discussion of these lectures has boring netplanes orthogonal to their surfaces
recently become available. s In his lecture and ~l the variance of these distances d. Figure
on December 14, 1922 at Kyoto University, 1 shows a two-dimensional paracrystalline lat-
Einstein referred several times to the inter- tice, where 3% larger coins are mixed statistically
ferometer experiment, stating that he "had with 97% smaller ones. At the center of the lat-
thought about the result even in his student tice there exist some crystalline-like domains.
days." In 1950 Einstein told the writerl that Near the boundaries the netplanes become more
after 1905 he and Lorentz had discussed the and more distorted until finally they are de-
Michelson-Morley experiment many times while stroyed at some places. Similar phenomena
he was working on the general theory of exist in atomic dimensions. From statistical
relativity. Today, both the experiment and the laws it is known that the distance variance ~N
theories are among the prized achievements of increases with v'N orthogonal to the surface
Physics. between atoms of the first and (N + 1)th net-
planes. When ~N reaches 100a*% of d, then
R. S. SHANKLAND the valence bonds between atoms or molecules
within the Nth netplane are strongly overstrained
References so that the netplane suffers a break. This leads
to the so-called "a*-relation:"
1. Shankland, R. S., "Conversations with Albert Ein-
stein," A mer. J. Phys., 31, 47 (1963); 41, 895 ~N = v'N ~l = a* d, hence v'N = a* /g
(1973); 43, 464 (1975).
2. Shankland, R. S., "Michelson-Morley Experiment," a*=0.15±0.03. (2)
Amer. J. Phys., 32, 16 (1964).
3. Shankland, R. S., "Michelson-Morley Experiment," In Fig. 2 is plotted v'N against l/g for numerous
Sci. Amer., Nov. 1964, p. 107-114. different colloids. a* always has values between
4. Morley, E. W., and Miller, D. C., Phil. Mag. 9,680 0.12 and 0.l8. Another example is given by the
(1905). steel-ball model (cf. the article DIFFRACTION
5. Shankland, R. S., et aI., Rev. Mod. Physics 27,167 BY MATTER AND DIFFRACTION GRATING,
(1955). dotted line, Fig. 11). The a*-relation was firstly
6. Joos, G., Ann. Physik 7, 385 (1930); Naturwiss, published in 1967. 2 At that time its fundamen-
38,784 (1931). tal importance for the whole world of non-
7. Townes, C. H., Phys. Rev. Letters 1,342 (1958). crystalline matter was not recognized. F. J.
8. Ogawa, T., Japanese Studies in the History of Sci- Balta-Calleja 3 stimulated the publication of
ence, Nov. 18, 1979. diagrams like Fig. 2. Nevertheless, the impor-
tance of the a* = relation is rarely understood
Cross-references: INTERFERENCE AND INTERFER- nowadays. On the right-hand side of Fig. 2, for
OMETRY; LIGHT; OPTICS, GEOMETRICAL; OP- instance, an example is given of the ammonia
TICS, PHYSICAL; RELATIVITY. contact catalysts. Their micro para crystals are,
on account of relation (2), very small compared
with crystals and therefore build up a large
thermostable "inner" surface of some 100 m 2
per 1 cm 3 • This large surface is important in
synthesizing ammonia (NH 4 0H) from hydrogen
MICROPARACRYSTALS (H 2 ), nitrogen (N 2 ), and water (H 2 0) in a
rational technical way:
Most noncrystalline matter consists of micro-
para crystals (mPCs). They can be identified by 3H 2 + N2 + 2H 20 = 2NH4 0H.
X-ray, neutron, and electron diffraction pat-
terns. The reflections of microparacrystals show In Figure 12 of the article on PARACRYSTALS
a characteristic broadening (see, for example, it is shown that the microparacrystals of this
Figs. 8, 9, and 11 of the article DIFFRACTION catalyst consist of a-Fe lattices into which
BY MATTER AND DIFFRACTION GRATINGS. FeAl 20 4 molecules are statistically imbedded
Plotting for instance the integral width [j~ of and destroy the crystalline order of the Fe
the reflections (hOO) against h we can calculate atoms. Here g - 1% and f!! - 15. At tem-
the number N of netplanes and the paracyrstal- peratures higher than 400 C some FeA1 2 0 4
line distortion g by means of the following molecules slowly begin to emigrate, the g-value
theoretical relationships: 1 therefore becomes smaller, and ;;iii reaches
749 MICROPARACRYSTALS

FIG.!. Two-dimensional model of a paracrystalline lattice


with quadratic lattice cells. 97% smaller coins are mixed statis-
tically with 3% larger ones.

values up to Vii = 20. With higher annealing


VN
Conventional
Crystals I temperature the microparacrystals more nearly
approach the crystalline state in the direction
• 02 I
0. •
of the arrow until the catalyst loses its favor-
," 0 0. ~ ,0 I( able properties. A new example of the existence
20 of microparacrystals was found recently in the
NiAl 2 0 3 catalyst. 4 Adjacent to the catalyst in
Fig. 2 are plotted mpes ("single crystals" and
z bulk material) in polymers. Their g-values lie
15
"0 / between 2 and 5%. The example for "crystal-
(5 / PRD49 + line" polymers is given in Fig. 9 of the article
e /
/ Catalysts 0
on PARACRYSTALS, where some micropara-
~ 10 / Polymer
crystals can be recognized. They are linked
c
:J / / Single Crystals 0 together by long molecules and work therefore
C"
<11 Bulk Polymers ® as knots of a three-dimensional network.
/
Spiral Paracrystal"
During strain the chains glide along each other,
5 / each moment building up new micropara-
Graphite crystals which, step by step, become smaller
, / / 6

V
Melts 0 until finally every microparacrystal is disinte-
/ Gas • grated into 30 smaller ones [Fig. 3(b)]. The
50 most disordered microparacrystals are to be
100 1 150 found in melts (left corner oLYig. 2). There
Yg they have values g ~ 8% and yN ~ 2. Figure 4
rec i procal 9 -value shows as an example a microparacrystal of
molten Fe or Pb which builds up a cubic face-
FIG. 2. The a*-law of colloidal systems. The rela- centered lattice similar to that below the melt-
tion of Eq. (2) gives the theoretical background for ing point. In contrast to the solid state, the
noncrystalline matter independent of its chemical atoms have a high mobility and can move over
composition. to other nuclei building up icosahedra in some
MICROPARACRYSTALS 750

a o, lfil 300
, A b
FIG. 3. Microparacrystals in polymers are the knots
of a three-dimensional network: (a) before and (b)
after stretching eight-fold.

4. Wright, C. J., Windsor, C. G., and Puxley, D. c.,


Nat. Phys. Div. A.E.R.E. Harwell, Oxfordshire,
U.K., MPD/NBS/189, Feb. 1982.
5. Steffen, B., and Hosemann, R., Phys. Rev. B13,
3232 (1976).

Cross-references: DIFFRACTION BY MATTER AND


DIFFRACTION GRATINGS; MICROPARACRYS·
TALS, EQUILIBRIUM STATES OF; PARACRYS-
TALS.

MICROPARACRYSTALS, EQUILIBRIUM
STATE OF
The physical m~aning of the a*-law is that the
FIG. 4. Microparacrystals in molten iron, lead, and mean number N of netplanes in an ensemble of
silver. The arrows indicate probable movements in microparacrystals (the so called mPCs) depends
tangential direction forming an icosahedron. only on the relative distance fluctuation g of
atoms belonging to neighboring netplanes (see
also the article MICROPARACRYSTALS. Here,
cases. S Equation (2) is not only an empirically
detected relation but the fundamental law of
colloid and surface science. It manifests an E(N/M) 1

•I 0,8
equilibrium state of microparacrystals which
is unknown in conventional solid state physics
of condensed matter. For details see the article ooo~ 1M
MICROPARACRYSTALS, EQUILIBRIUM STATE 0,6
OF. 0

0
ROLF HOSEMANN 0,4

References 0,2
1. Hosemann, R., Ergeb. Ex. Nat. Wiss. 24, 142
(1951).
2. Hosemann, R., Lemm, K., Schonfeld, A., and 2 3 4 5 6 7
Wilke, W., Kolloid-Z. u. Z. Polymere 216-217, 103
(1967). - N/M
3. Balta-Calleja, F. J., and Hosemann, R., 1. Appl. FIG. 1. The probability E(N) of further growth of
Cryst. 13,521 (1980). a microparacrystal consisting of N netplanes.
751 MICROWAVE SPECTROSCOPY

planes is given by
K(N) =£(1) £(2) £(3) . ..
(1)
. £(N - 1) (1 - £(N».
K (N) is plo!1;ed in Fig. 2 for N = 11 (open
=
circles) and N 62 (open squares) and can be
approximated by Maxwellian functions. 2 The
crosses in Fig. 2 show the K (N) distribution
directly observed 3 from high-precision trans-
I.!!ission-electron-microscopic diagrams, with
N = 11 (Fig. 3). This agreement proves directly
that the equilibrium state of microparacrystals
is of utmost importance for the understanding
FIG. 2. Frequency distribution K (N) of micropara- of noncrystalline condensed matter. 4
crystals with N + 1 netplanes. Open circles and squares ROLF HOSEMANN
plot K (N) by computer calculation; solid lines drawn
by Maxwell approximation. Crosses indicate distribu-
tion as directly observed for N = 11 microparacrystals References
in DuPont PRD 49 fibers. 1. Hosemann, R., Schmidt, W., Lange, A., and Hent-
schel, M., Colloid & Polymer Sci. 259,1161 (1981).
2. Hosemann, R., Colloid & Polymer Sci. (in press).
3. Dobb, M. G., Hindeleh, A. M., Johnson, D. J., and
Saville, B. P.,Nature 253, 189 (1975).
4. Hosemann, R., Physica Scripta (in press).

Cross-references: COLLOIDS, THERMODYNAMICS


OF; MICROPARACRYSTALS; PARACRYSTALS.

MICROW AVE SPECTROSCOPY


Microwaves are electromagnetic waves which
range in length from about 30 cm to a fraction
of a millimeter or in frequency from 10 9 to
0.5 X 1012 CpS . This corresponds to the rota-
tional frequency range of a large class of mole-
cules. Thus, microwave radiation passing through
a gas can be absorbed when the rotating electric
dipole moment of the molecule interacts with
the electric vector of the radiation. Likewise,
absorption can take place if the rotating mag-
netic moment of the molecule interacts with
the magnetic vector of the radiation.
Most microwave spectroscopy is based on a
study of transitions induced by interaction of
the molecular electric dipole with the incident
radiation.
A microwave spectrometer consists basically
FIG. 3. High-precision transmission-electron-micro- of a monochromatic microwave source (kly-
scopic picture of DuPont PRD 49 fibers. stron), an absorption cell, and a detector. The
absorption cell must transmit the microwave of
interest and in the centimeter region may have
cross-sectional dimensions of 1 X 4 cm and may
from the probability £(N), what amount of be a few meters in length. Normally a metal
microparacrystals with N - 1 netplanes aggre- strip is inserted along the length of the cell and
gate the next netplane (see Fig. 1), can be cal- is insulated from the cell. In this way, an auxil-
culated by statistical mechanics. As in biology, iary, spatially uniform electric field may be
where the expectation of surviving the next day established in the absorption cell without af-
decreases continuously day by day, here £(N) fecting the microwaves. The Stark effect thereby
begins with a plateau £ = 1 which suddenly produced splits the molecular energy levels into
changes to a smooth decay. The frequency a series of levels and enables one to identify the
K (N) of microparacrystals meeting N + 1 net- transition.
MICROWAVE SPECTROSCOPY 752

The Hamiltonian for a rotating rigid asymmet- The solution l/I is characterized by three quan-
ric molecule including possible fine and hyper- tum numbers : J, the total angular momentum;
fine structure terms is given in Eq. (1). It is K the component of angular momentum along
assumed, as is most commonly so, that the the symmetry axis of the molecule; and M, the
molecule is in a 1~ state, i.e ., that there is no magnetic quantum number or projection of J
net electronic angular momentum and no net along an arbitrary space axis . The energy does
electron spin (singlet state). The Hamiltonian not depend on M in the absence of external
written is quite general and in many cases not electric or magnetic fields. The energy levels for
all of the terms shown in Eq. (1) need be in- the symmetric top have the form
cluded to account for spectra observed under
normal resolution . EJ.K = BJ(J + I) + (C - B)K2
A brief description will be given of each term
in order that one may most simply understand The rotational constants A =Band C may typ-
the kinds of interactions which may occur and ically range from 2000 to 300 000 MHz for'
which may be pertinent to an understanding of presently observable spectra. A hertz is one
the spectra of rotating molecules. cycle per second. A megahertz (MHz) is one
million cycles per second .
H = HR + Hdist. + Hs + HZe + HQ Selection Rules In all but the accidentally
symmetric top, the permanent dipole will be
+Hzi+HD (1) along the symmetry axis. In this case, the
selection rules for absorption of radiation
1. HR is the framework rotational kinetic through rotation are: 4
energy and may be written
J-+J± I,K-+K
fz2 [la 2 lb 2 le 2 ]
HR=- - + - + - For a component of the dipole moment perpen-
81T 2 Ia Ib Ie dicular to the symmetry axis, the selection rules
where la , lb, and Je are the components of the are
total angular momentum in units of fz referred !:J..J =± 1, 0 and !:J..K =± I
to body-fixed principal axes. la, Ib , and Ie are
the moments of inertia about the respective
principal axes. The Hamiltonian may be written In both cases, !:J..M = ± I, O.
The wave functions of the asymmetric top
HR = AJi + BJb 2 + CJe2 are expressed as linear combinations of sym-
metric top functions. The energy remains diag-
or displaying the total angular momentum J onal in J but not in K. One must, therefore,
arbitrarily label the energy levels and determine
the selection rules. This will not be done here.
In order to do this, however, one needs to know
A, Band C are rotational constants with only the non vanishing matrix elements of the
A > B > C. In this form, units can be chosen so three components of the dipole moment for the
as to give energy levels directly in megacycles symmetric top given above.
per second. HR describes a rigid symmetric top The selection rule for diatomic molecules is
if I a is equal to lb. For a diatomic molecule, simplyJ-+J± I,M-+M,M± 1.
Ia = Ib and Ie ~ o. (11Ie becomes very large, and 2. Hdist. describes centrifugal stretching cor-
the rotational levels about the c axis are too far rections to the energy levels which for an
apart to become excited by microwaves). For asymmetric molecule can be quite complicated.
the spherical rotor,Ia = Ib = Ie, but this implies Corrections for a symmetric top molecule are
no dipole moment and therefore no observable easily derived, and the framework energy in this
rotational spectrum. case including centrifugal stretching is given by·
Energy levels for the symmetric top may be
determined by analytical methods,1 by factori-
zation methods 2 , or by using the commutation EJ.K =BJ(J + I) + (C- B)KL DJJ2(J + 1)2
properties of the angular momenta. 3 In Eulerian - DJKl(l + I)K2 - DKK··
coordinates, the wave equation separates, and
the wave function has the form
For a non-rigid diatomic molecule or linear
polyatomic molecule, K = 0, and the energy is
given by :
where 8JKM is the solution to the differential
equation in the polar angle 8 which results after EJ = BvJ(J + 1) - D v J2(J + 1)2
separation of the simple terms in the azimuthal
angle I/> and the angle l/I which defines the direc- and since J -+ J + I, for absorption the line fn."-
tion of the line of nodes. The equation for 8JKM quencies are:
with appropriate change of variable becomes
the equation for the Jacobi polynomials. Vr = 2Bv(J + I) - 4Dv(J + 1)3
753 MICROWAVE SPECTROSCOPY

where Bv is the "effective" spectral constant, moment will have a component only along the c
h(87r 2 Iv), for the particular vibrational state axis so that Hs consists of the single term
for which the rotational spectrum is observed, Azclh z when lhx = lh y = O.
and where Dv is the centrifugal stretching con- In this case, the energy associated with Hs is
stant for that state. diagonal in all three quantum numbersJKM and
In terms of the constants Be and De for the has the form
hypothetical vibrationless state, Bv and Dv for
diatomic molecules are:
Bv =Be - a(v+1) where M, the "magnetic quantum number,"
Dv = De + ~(v + 1) measures the component of J along lhz and can
take the values M = J, J - I,' .. , - J. The selec-
where a and ~ are interaction constants which tion rules for Mare M -+ M; M -+ M ± 1, de-
are very small in comparison with Band D, re- pending on the polarization of the microwaves.
spectively, and where v is the vibrational quan- For asymmetric molecules, l1e will in general
tum number. For linear poly atomic molecules, have components along the A and B axes as well
there is more than one vibrational mode, and as C. The A and B components give rise to matrix
the above equations must be written in the more elements off diagonal inJ,K and M . For a dipole
general form moment of I debye and an electric field of 1
volt/cm, l1elh is 0.5 MHz.

(Vi + -2di)
4. Hze is the contribution to the Hamiltonian
Bv =Be + ~ aj due to the interaction of the external magnetic
field with the magnetic moment which is created
by rotation of the molecule.
~ ~i (Vi + ~~)
We also include the interaction of the external
Dv = De + magnetic field with the dipole moment of indi-
vidual nuclei. For a molecule with two nuclear
spins, II and 12 , this may be written
where the summation is taken over all the funda-
mental modes of vibrations . The -subscript i =L + I1ngl (II . H)
refers to the ith mode, and di
represents the
Hze
j. k
I1n(J)jkJjHk

degeneracy of that mode. + I1ng2(I2 . H)


In analysis of spectra, the variations of the
stretching constantsDJ and DJK with vibrational where g(J) is in general a tensor, Jj are compo-
state are customarily neglected. It is seldom pos- nents of J along axes to which H is referred, Hk
sible to obtain sufficient data for the evaluation are the components of the field H usually
of these effects upon B and for determination referred to the space-fixed axes, and gl and g2
of Be even for the simpler symmetric tops. are called the nuclear magnetic g factors. The
Centrifugal stretching constants mal typically interaction between J and H is the same order
range from 8.5 to 0.002 MHz or less. as that between the nuclear spin and H. There-
3. The third term Hs is the contribution to fore, we introduce the term I1n so that g coef-
the Hamiltonian arising from the Stark effect ficients are of the order of unity . Thus for a
and may be written as field of one gauss, the quantity I1ngH is 0 .7 kHz.
For molecules with electronic angular momen-
Hs = l1e . lh tum, I1n in the first term is replaced by 110, the
where l1e is the vector dipole moment and lh is Bohr magneton which is 1836 times larger than
the external electric field . I1n. Thus, in this case I1ngH is - 1.4 MHz for a
If the dipole moment lies along the "e" body- field of one gauss . For a discussion of the prob-
fixed principal axis, Hs has the form lem of determining molecular g values, as well as
magnetic susceptibility anisotropies and mole-
Hs =l1e(A x Clh x +A/lhy +AzClhz) cular quadrupole moments, see Ref. 5.
S. HQ is the energy of interaction of the
where Axc, Ayc, Azc are the direction cosines nuclear electric quadrupole with the gradient of
of the e principal axis with space-fixed axes xyz. the electric field produced by the electrons in
lhx, lh y , and ll,z are the cOI?ponents of th~ .elec- the molecule at nucleus with spin I. For a
tric field along the space-fixed axes. Additional nucleus on the axis of a symmetric top, the
terms will be added to this expression if the quadrupole operator is ordinarily considered to
dipole moment has components along the re- be of the form :
maining two principal axes. In order to obtain
the contribution to the energy from this part of - eQq
the Hamiltonian, one must evaluate matrix HQ = 21(21 - 1)(2J - 1)(2J + 3) 1 - J(J + 1)
{3K 2 }

elements of the direction cosines with respect


to symmetric top wave functions. Methods
described in reference 5 enable one to do this. [3(J %
+ 1)2 + (J . 1) - J2 J2 ]
In the case of a symmetric top, the dipole
MICROWAVE SPECTROSCOPY 754

This operator yields only those matrix ele- This operator, like that given for the quadrupole
ments of the quadrupole interaction which are interaction above, and usually quoted in the
diagonal in J. The diagonal contributions are literature, will yield only those matrix elements
sufficient for most cases. which are diagonal in the quantum number J.
In the expression above, eQ is defined by The coefficientgl g 2 /.l n 2/R3 may be of the order
of a kilocycle, This correction is observed only
eQ = (I,Ifpn[3Zn2 - rn2]dTnII,I) in very rare cases. 9
Matrix elements for all of the above-mentioned
components of the Hamiltonian may be evalu-
where Pn is the nuclear charge density at a dis- ated by the methods in references 11. The matrix
tance rn from the center of charge of the nucleus elements themselves are too lengthy to be tabu-
and dT is the differential volume element for the lated here.
nuclear volume. Zn is the position coordinate There is an additional interaction which, for
along the direction of the nuclear spin I. The completeness, should be mentioned. The nuclear
matrix element considered is that for which spins may interact with one another through
M/=I. mutual coupling with the surrounding electron
The quantity q is defined as cloud. This gives a correction of the form
CII • 12 , The coefficient C may be larger than
q = [aac vl
2

2 J(rn = 0)
that in the dipole-dipole interaction term. In
Til, C has the value of 6.57 kHz.7
The preceding discussion emphasizes the in-
where V is the electrostatic potential due to the terpretation of microwave absorption spectra.
electronic cloud and other nuclei surrounding One is thereby led to a knowledge of the struc-
the nucleus and c is the axis in the body-fixed ture of the molecule, the value of nuclear spins,
system which is parallel to the symmetry axis and various coupling constants. For a short
of the molecule. The quantity eqQ varies from review which emphasizes the experimental as-
- 1000 to 1000 MHz a1thoulZh the intermediate pects of microwave spectroscopy see Ref. 12.
values are more common. 6 A tabulation of Microwave spectroscopy is also an effective
matrix elements for quadrupole interaction is technique for determination of barrier heights
given in Ref. 7. For a general discussion, see the associated with internal rotation. 13 , 14, 15 It may
book, Ref. 8. also yield the barrier to ring puckering and also
6. HZi represents the interaction between the the barrier to inversion, the earliest example of
magnetic field caused by rotation of the charged the latter being the inversion of nitrogen through
particles which make up the molecule and the the plane of the three hydrogens in ammonia;
nuclear magnetic moments of the nuclei. For the see Rudolph review. 23
case of 2 nuclei, this takes the form For a discussion of and references to micro-
wave pressure broadening, line shape, and inten-
HZi =L C(l)jkJjIlk + L, C(2)jk JjI2k
sities see G. Birnbaum.16 Among other things
line width is related to the rate of energy trans-
j. k j. k
fer between molecules. The shape of a micro-
C( 1) and C(2) represent the internal magnetic wave line is broadened as pressure increases and
moment tensors for the two nuclei. The C co- this can camouflage fine structure.
efficients are of the order of 1O- 2 MHz.9 J and I We have not discussed electron spin resonance
are pure numbers. This correction will therefore (ESR), also called electron paramagnetic reso-
be unimportant for the large majority of mole- nance. 17 (See RESONANCE and MAGNETIC
cules. For values of the coefficients as deter- RESONANCE.) In this case transitions occur
mined by molecular beam work, see Ref. 7 and between energy levels created by unpaired elec-
10. tron spins in the presence of an external mag-
7. HD is the dipole interaction between the netic field. Absorptions of this type are 0 bserved
two nuclei which may be written in the form in molecules, free atoms, radicals, and solids.
For a review of properties of high-tempera-
H =glg2/.l n2 [I .1 _ 3(11 'R)(I2 'R)] ture species as studied by microwave absorption
D R3 I 2 R2 spectroscopy see Ref. 18.
The role of transient effects in microwave
where gl and g2 are the nuclear gyromagnetic spectroscopy is a new area of research. Tran-
ratios of the nuclei, /.In is the nuclear magneton sients may be achieved by switching the applied
and R is the distance between the two nuclei. Stark field in a microwave spectrometer. The
The operator which is usually used to represent main usefulness of the technique appears to be
this in teraction is in the determination of the rates of energy
transfer between gas molecules. For details on
this subject and many references to other
reviews and papers including the same when
applied to infrared laser spectroscopy, see the
[3(11 'J)(12 'J)+3(12 'J)(11 .J)- 211 '12 ]2] review by R. H. Schwendeman. 19
Microwave spectroscopy has been extended
(2J - 1)(2J + 3) to the submillimeter infrared region; however,
755 MICROWA VE SPECTROSCOPY

this extension has been limited by the ability to 7. Stephenson, D. A., Dickinson, 1. T., and Zorn, 1.
generate harmonics and mix signals in point C.,l. Chern. Phys., 53(4),1529 (1970).
contact devices. With the advent of the laser, 8. Lucken, E. A. C., "Nuclear Quadrupole Coupling
coherent sources have become available in the Constants," New York, Academic Press, 1969.
infrared region. This related subject has been 9. Thaddeus, P., Krisher, L. C., and Loubser, J. M.
reviewed by V. I. Corcoran. 20 N.,/. Chern. Phys., 40,257 (1964).
Although less common than the Stark effect 10. English, T. C., and Zorn, 1. C., l. Chern. Phys.,
technique, the Zeeman effect, that is, exposure 47(10),3896 (1967).
of the molecules to very high magnetic fields 11. Condon, E. U., and Odabasi, Halis, "Atomic Struc-
(up to 30 kg) may also be used to obtain molec- ture," Cambridge, U.K., Cambridge Univ. Press,
ular information. The molecules investigated by 1980; Landau, L. D., and Lifshitz, E. M., "Quan-
this technique are in a "nonmagnetic" ground tum Mechanics," New York, Pergamon, 1977.
state. The Zeeman effect then is due to the very 12. Strandberg, M. W. P., "Microwave Spectroscopy,"
small molecular rotational magnetic moment in "McGraw-Hill Encyclopedia of Science and
which results from the rotation of the unequally Technology," New York, McGraw-Hill, 1977.
distributed positive and negative charge. The 13. Gordy, W., and Cook, R. L., "Microwave Molecu-
main effect is first order in the magnetic quan- lar Spectra," New York, Wiley, 1970.
tum number M, but the slight nonlinear com- 14. Lin, C. C., and Swalen, J. D., Rev. Mod. Phys. 31,
pression of the splitting pattern yields magnetic 841 (1959).
susceptibility anisotropies that permit calcula- 15. Burkhard, D. G., l. Opt. Soc. Am. SO, 1214
tion of the molecular quadrupole moment (1960).
tensor. For the effect on the Hamiltonian and 16. Birnbaum, G., "Intermolecular Forces," in "Ad-
reference to orifnal papers see the review by vances in Chemical Physics," Vol. 12, New York,
H. D. Rudolph.2 Wiley, 1967.
For further details and discussion of topics 17. Squires, T. L., "An Introduction to Electron Spin
omitted, the reader is referred to the book 13 Resonance," New York, Academic Press, 1963;
by W. Gordy and R. L. Cook. For other books Alger, R. S., "Electron Paramagnetic Resonance:
see Refs. 6 and 21. For a review of microwave Techniques and Applications," New York, Inter-
spectroscopy see the article by D. R. Lide, science, 1968; Carrington, A., Levy, D. H., and
Ref. 22; also the reviews by H. D. Rudolph Miller, T. A., "Electron Resonance of Gaseous
(1970), Morino and Hirota (1969), and Flygare Diatomic Molecules," in "Advances in Chemical
(1967).23 Physics," Vol. 18, New York, Interscience, 1970.
The Hamiltonian required to explain the 18. Lovas, F., and Lide, D. R., Adv. High Temp.
results of molecular beam electric resonance Chern. 3 (1972).
(MBER) experiments involving radio frequency 19. Schwendeman, R. H., Ann. Rev. of Phys. Chern.
transitions contains the same kind of inter- 29 (1978).
action terms listed above. Therefore the litera- 20. Corcoran, V. J., App. Spectroscopy Revs. 7
ture on MBER may be referred to for further (1974).
theoretical discussion. See, for example, English 21. Guillory, W. A., "Introduction to Molecular Struc-
and Zorn, Ref. 22. ture and Spectroscopy," Boston, Allyn and Bacon,
For a review of beam maser spectroscopy see 1977. Wollrab, J. E., "Rotational Spectra and
Ref. 24. Molecular Structure," New York, Academic Press,
For a comprehensive computation of micro- 1967. Sugden, T. M., and Kenney, C. N., "Micro-
wave spectra including measured frequencies, wave Spectroscopy of Gases," New York, Van
assigned molecular species, assigned quantum Nostrand Reinhold, 1965; Ingram, D. J. E.,
number, and molecular constants determined "Spectroscopy at Radio and Microwave Frequen-
from such data, the reader is referred to the cies," 2nd Ed., New York, Plenum, 1967; Hedvig,
multivolume work "Microwave Spectral Tables" P., and Zentai, G., "Microwave Study of Chemical
prepared by rersonnel of the National Bureau Structures and Reactions," CRC Press, 1969;
of Standards. 5 Svidziniskii, K. V., "Soviet Maser Research," New
York, Plenum Press, 1964.
DONALD G. BURKHARD 22. English, T., and Zorn, J. C., "Molecular Beam
Spectroscopy," and Lide, D. R., "Microwave
References Spectroscopy," in "Methods of Experimental
1. Dennison, D. M., Phys. Rev., 28, 318 (1926); Physics," Vol. 3, 2nd Ed., New York, Academic
Reiche, F., and Rademacher, H., Z. Physik, 39, Press, 1972.
444 (1926), and 41, 453 (1927). 23. Ann. Rev. Phys. Chern. 21 (1970); 20 (1969); and
2. Burkhard, D. G., l. Mol. Spectry., 2, 187 (1958); 18 (1967), resp.
Shaffer, W. H., and Louck, J. D.,J. Mol. Spec try ., 24. Laine, D. C., Repts. Prog. Phys. 33,1001 (1970).
3,123 (1959). 25. "Microwave Spectral Tables," Superintendent of
3. Klein, 0., Z. Physik, 58,730 (1929). Documents, U.S. Govt. Printing Office, Washing-
4. Dennison, D. M., Rev. Mod. Phys., 3, 280 (1931). ton, D.C. 20402.
5. Flygare, W. H., and Benson, R. C.,Mol. Phys, 20, Cross-references: ATOMIC AND MOLECULAR
225 (1971). BEAMS, MAGNETIC RESONANCE, MICROWAVE
6. Townes, C. H., and Schawlow, A. 1., "Microwave TRANSMISSION, SPECTROSCOPY, ZEEMAN AND
Spectroscopy," New York, McGraw-Hill, 1955. STARK EFFECTS.
MICROWAVE TRANSMISSION 756

MICROWA VE TRANSMISSION width becomes 50 cm. Thus, waveguides are


reasonable in size for the transmission of micro-
That portion of the electromagnetic spectrum waves. Furthermore, when used with ferrites,
adjacent to the far-infrared region is commonly thin metallic films and magnets, waveguide
referred to as the microwave region. It is components can be designed to function as
bounded by wavelengths in the vicinity of 10 isolators, circulators, modulators, discrimina-
centimeters (10 cm) and 1 millimeter (1 mm). tors, or attenuators.
The longest wavelength of 10 cm corresponds The propagation of electromagnetic waves in
to a frequency of 3 X 10 9 cycles per second free space between transmitting and receiving
(abbreviated 3000 megahertz or 3 kilomegahertz antennas can be characterized in terms of
or 3 gigahertz or 3 GHz). The shortest wave- ground waves, sky waves, and space waves.
length of I mm corresponds to a frequency of At microwave frequencies, ground waves
3 X lO ll cycles per second (abbreviated 300,000 attenuate completely within a few feet of
megahertz or 300,000 MHz or 300 kilomega- travel, sky waves are influenced by the iono-
hertz or 300 gigahertz or 300 GHz). sphere and can penetrate through into outer
The development of microwave transmission space, and space waves travel through the
on a major scale was initiated in 1940 with the atmosphere immediately above the surface of
advent of the magnetron, an electronic gen- the earth. At microwave frequencies, space
erator of high-power microwaves. The magne- waves behave like light waves and travel in a
tron spearheaded wartime radar at approxi- direct line of sight. They follow many of the
mately 3 GHz and led to the utilization of rules of optics. They can be reflected from
waveguides for the efficient transmission of smooth conducting surfaces and can be focused
microwaves from the generator of the trans- by reflectors or lenses.
mitting antenna and from the receiving an- If a space wave is radiated from a point an-
tenna to the detector. tenna, the radiated energy spreads out like an
In essence, a waveguide is a hollow metal tube ever-expanding sphere, and the amount of
capable of propagating electromagnetic waves energy per square foot of wave front decreases
within its interior from its sending end to its inversely with the square of the distance from
receiving end. Unlike waves in space which the antenna. The power that can be extracted
usually propagate outward in all directions, from a wave front by a similar point antenna
waves in waveguides are fully confined while varies inversely with the square of the fre-
they propagate. quency. Thus, a point antenna receives power
An electromagnetic wave is comprised of an which is inversely proportional to both the
electric field and a magnetic field. In free space, square of the distance from the source and the
these fields are always perpendicular to one square of the frequency. The ratio of the power
another and to the direction of wave propaga- received to the total power radiated is known as
tion at any instant in time. However, when a path attenuation.
wave travels through a waveguide, the confine- When the receiving antenna is a parabola-
ment forces one of the fields, but never both, shaped dish, power extracted from the wave
to have a component that is parallel to the front is greatly increased. The ratio of the power
direction of wave propagation. received by such an antenna to the power re-
In a waveguide, there are a number of pos- ceived by a theoretical point antenna is defined
sible field configurations. Each configuration as antenna gain. The gain of a parabolic antenna
is known as an operating mode and is deter- increases with the antenna area and the operat-
mined by the operating frequency or wave- ing frequency. Thus, for a given microwave
length and the lateral dimensions of the wave- transmission with fixed-sized antennas, the path
guide. There are two fundamental classes of attenuation increases with frequency, the an-
modes that may propagate in a waveguide. In tenna gain increases with frequency, and the
one class, the electric field is everywhere per- overall result is that one tends to offset the
pendicular, or transverse, to the direction of other.
propagation and the magnetic field has a In radio broadcasting, the signal power radi-
longitudinal component. It is referred to as ates equally in all directions, and a receiving
the transverse electric mode or TE mode. In antenna picks up only a tiny fraction of the
the other class, the magnetic field is transverse signal power. To overcome this low efficiency,
and the electric field has a longitudinal com- the broadcast station must transmit a large
ponent. This configuration leads to the trans- amount of power. By contrast, a point-to-point
verse magnetic mode or TM mode. microwave system radiates only a small amount
Propagation of either the TE mode or the of power, but it uses a directional transmitting
TM mode is limited by the cross-sectional antenna to concentrate power into a narrow
dimensions of the guide. For example, in a beam directed toward the receiving antenna.
rectangular waveguide, the longest wavelength Consequently, such systems are characterized
that can be propagated is equivalent to twice by high efficiencies.
its width. Therefore, to transmit a 3 GHz signal Because microwave transmission in free space
in a rectangular waveguide, the width of the follows essentially a straight line, reflectors are
guide must be at least 5 cm. At 300 MHz, this utilized to redirect a beam over or around an
757 MODULATION

obstruction. The simplest and most common solar system. Another famous facility is the
reflector system consists of a parabolic antenna 2l0-foot GOLDSTONE antenna at the NASA
mounted at ground level which focuses a beam Deep Space Institute (California).
on a reflector mounted at the top of a tower.
This reflector inclined at 45° redirects the beam ANTHONY B. GIORDANO
horizontally to a distint site where a similar
"periscope" reflector system may be used to References
reflect the beam down to another ground level.
If two sites are separated by a mountain, it may Nichols, E. J., and Tear, J. D., "Joining the Infrared
be necessary to use a large, flat surface reflector and Electric Wave Spectra," Astrophys. J. 61, 17-37
referred to as a "billboard" reflector. In a typi- (1923).
cal system, a billboard reflector might be lo- Carter, S. P., and Solomon, L., "Modern Microwaves,"
cated at a turn in a valley, effectively bending Electronics (June 24, 1960).
the beam to follow the valley. Many arrange- Southworth, G. C., "Survey and History of the Pro-
ments are possible which, in effect, resemble gress of the Microwave Art," Proc. IRE (May 1962).
huge mirror systems. Wheeler, G. J., "Introduction to Microwaves," Engle-
Microwaves are ideally suited for communica- wood Cliffs, New Jersey, Prentice-Hall, Inc., 1963.
tion systems where a broad frequency band- Evans, J. V., and Hagfors, T. (Ed.), "Radio Astron-
width of the order of several megacycles is omy," New York, McGraw-Hill, 1968.
required for the rapid transmission of signals Yeh, P., "Satellite Communications and Terrestrial
which contain a large amount of information, Networks," Dedham, Massachusetts, Horizon House,
such as, in television signals. Most of the major Inc., 1977.
cities of the United States are serviced by micro- Topol, S., "Satellite Communications-History and
wave television links so that they can receive Future," Microwave Journal (November 1978).
television programs which originate from other Cuccia, C. 1., "Satellite Communications and the
cities. These systems can also accommodate Information Decade," Microwave Journal (January
thousands of telephone channels. 1982).
In 1960, experiments were initiated aiming
toward communicating over transoceanic dis- Cross-references: ANTENNAS, ELECTROMAG-
tances via microwaves by utilizing balloons as NETIC THEORY, MICROWAVE SPECTROSCOPY
reflectors. Echo I and Echo II were attempts in PROPAGATION OF ELECTROMAGNETIC WAVES:
this direction as passive satellites. The first RADAR.
active repeater satellite (Telstar I) was launched
in 1962 and resulted in live telecasts between
Europe and the United States in addition to
teleprint and other signals. Telstar II added
more data to accent the value of satellite
communications.
In 1962, Congress authorized the formation MODULATION
of Communications Satellite Corporation (Com-
sat). In 1964, Comsat took the lead in forming Modulation is defined as the process, or the
the International Telecommunications Satellite result of the process, whereby some character-
Organization (Intelsat) to coordinate interna- istic of one wave is varied in accordance with
tional developments in the use of satellites. some characteristic of another wave (AS A).
Over 100 nations are now members of this Usually one of these waves is considered to be a
extraordinary effort. Today, all transoceanic c.arrier wave w?ile the other is a modulating
"live" TV broadcasts and two-thirds of all SIgnal. The vanous types of modulation such
transoceanic telephone and telegraph communi- as am~litude, frequency, phase, pulse ~idth,
cations are via Intelsat satellites. They operate pulse tIme, and so on are designated in accor-
in the 12/14 GHz band in addition to the 4/6 dance with the parameter of the carrier which is
GHz band. being varied.
Microwaves are broadly used for radar navi- Amplitude modulation (AM) is easily accom-
plished and widely used. Inspection of Fig. I
gation, and for the launching, guidanc~, and shows that the voltage of the amplitude modu-
fusing of missiles. A typical defense project lated wave may be expressed by the following
which uses microwave techniques is the DEW equation
radar line which protects the United States
from external enemy attacks. v = Vc(l + M sin Wmt) sin wet,
The HAYSTACK facility which has been in
operation since 1966 at Millstone Hill in Massa- where Ve is the peak carrier voltage, We and
chusetts is the first Western radar built for Wm are the radian frequencies of the carrier and
spacecraft tracking, space communications, and modulating signals, respectively, and t is time in
radar astronomy. Through radar astronomical seconds. The modulation index 1M may have
techniques, the multipurpose HAYSTACK 120- values from zero to one. When the trigonometric
foot paraboloid antenna reflector has greatly ~dentity ~ina sin b = ~ cos (a - b) - ~ cos (a + b)
enhanced our knowledge of the galaxy and IS used III the equation above, this equation
MODULATION
758

(I+M)Vc ,- -----------
Vc
(I-M) Vc
II O~~I+++HH~I#+.L-

t~

(a) Carrier
t--... ·t
(b) Modulating signal (c) Amplitude-modulated Carrier
FIG. 1. Amplitude modulation.

become~ components of the rectangular modulating


signal and then adding to the carrier frequency
MVc a pair of side frequencies for each of those
v = Vc Sin wet + - - cos (We - wm)t components. The spacing of these components
2
is 1fT, which is the fundamental frequency, and
MVe the amplitude of these components vary as
- - - cos (We + Wm)t shown in Fig. 2(b). The envelope of this ampli-
2 tude variation follows the familiar pattern of a
(sin x)/x function. An infinite bandwidth would
This equation shows that new frequencies, be required to either produce or reproduce a
called side frequencies or side bands, are gener- perfectly rectangular pulse, which of course is
ated by the amplitude modulation process. impossible to obtain. Bandwidths in the neigh-
These new frequencies are the sum and differ- borhood of 2/td, where td is the pulse duration
ence of the carrier and modulating frequencies. in seconds, are commonly used to transmit a
Amplitude modulation is accomplished by double-sideband pulse-modulated signal.
mixing the carrier and modulating signals in a Frequency modulation (FM) is illustrated by
nonlinear device such as a vacuum tube or tran- Fig. 3. The frequency variation, or deviation, is
sistor amplifier operated in a nonlinear region of proportional to the amplitude of the modulating
its characteristics. The nonlinear characteristic signal. The voltage equation for a frequency
produces the new side-band frequencies. Fre- modulated wave follows.
quency converters or translators and AM de-
tectors are basically modulators. The various
types of pulse modulation are actually special v = Vc sin (Wet + Mf sin wmt)
types of amplitude modulation.
A special type of amplitude modulation known The modulation index Mf is the ratio of maxi-
as pulse modulation is commonly used in digital mum carrier frequency deviation to the modu-
communication and other applications. In pulse lating frequency. This ratio is known as the
modulation, the modulating signal abruptly deviation ratio and may vary from zero to values
changes the carrier amplitude from zero to some of the order of 1000. FM requires a broader
maximum amplitude Vm (or vice versa) as shown transmission bandwidth than AM but may have
in Fig. 2(a). superior noise and interference rejection capabil-
Therefore, the modulation index is I, or 100%, ities. A large value of modulation index provides
at all times. The side frequencies produced in excellent interference rejection capability but
the modulation process are determined by first requires a comparatively large bandwidth. The
using Fourier analysis to find the frequency approximate bandwidth requirement for a fre-

-1 r- M= 11T
o I I I I I I I I IIIII I I I I I

"
" - 21td " - 1/td " + 1/td " + 21td
" = carrier frequency
I---- T----j

(a) Pulse-modulated carrier (b) side frequencies generated.

FIG. 2. Pulse (amplitude) modulation.


759 MOLE CONCEPT

t-- t- t-
(a) Carrier (b) Modulating signal (c) Frequency-modulated Carrier
FIG. 3. Frequency modulation.

quency modulated wave may be obtained from MOLE CONCEPT


the following relationship
The mole (derived from the Latin moles = heap
Bandwidth = 2 (Modulating frequency) (Mf + 1) or pile) is the chemist's measure of amount of
pure substance. It is relevant to recognize that
The noise and interference characteristics of the familiar molecule is a diminutive (little
FM transmission are normally considered satis- mole). Formerly, the connotation of mole was
factory when the modulation index or deviation a "gram molecular weight." Current usage tends
ratio is five or greater. more to use the term mole to mean an amount
Phase modulation is accomplished when the containing Avogadro's number of whatever units
relative phase of the carrier is varied in acc?r- are being considered. Thus, we can have a mole
dance with the amplitude of the modulatmg of atoms, ions, radicals, electrons or quanta. This
signal. Since frequency is the time rate of change usage makes unnecessary such terms as "gram-
of phase, frequency modu.latio~ occurs w~en atom," "gram-formula weight," etc. .
the phase modulating techmque IS used and Vlce A definition of the term is: The mole IS the
amount of (pure) substance containing the same
versa. In fact, the equation given for a frequency-
number of chemical units as there are atoms in
modulated wave is equally applicable for a phase- exactly twelve grams of 12 C. This definition
modulated wave. However, the phase-modulat-
involves the acceptance of two dictates- the
ing technique results in a deviation ratio, or scale of atomic masses and the magnitude of the
modulation index, which is independent of the gram. Both have been established by interna-
modulating frequency, while the frequency tional agreement. Usage sometimes indicates a
modulating technique results in a deviation ratio different mass unit, e.g., a "pound mole" or
which is inversely proportional to the modul.at- even a "ton mole"; substitution of "pound" or
ing frequency, assuming invarient modulatmg "ton" for "gram" in the above definition is
voltage amplitude in each case. implied.
The phase-modulating techniques can be used All stoichiometry essentially is based on the
to produce frequency-modulated waves, pro- evaluation of the number of moles of substance.
viding the amplitude of the modulating volt~ge The most common involves the measurement of
is inversely proportional to the modulatmg mass. Thus 25.000 grams of H 2 0 will contain
frequency. This inverse relationship can ~e 25.000/18.015 moles of H 2 0; 25.000 grams of
obtained by including in the modulator, a cu-
cuit which has a voltage transfer ratio inversely sodium will contain 25.000/22.990 moles of Na
proportional to the frequency. (atomic and formula masses used to five signifi-
cant figures). The convenient measurements on
CHARLES L. ALLEY gases are pressure, volume and temperature .. Use
of the ideal gas law constant R allows duect
calculation of the number of moles n = (P X V)/
References (R X T). T is the absolute temperature;R must
be chosen in units appropriate for P, V and T
Alley, C. L. and Atwood, K. W., "Electronic Engineer- (e.g., R = 0.0820 liter atm mole-I deg K-I). ~t
ing," Third Edition, New York, John Wiley & Sons, may be noted that acceptance of Av?gad~o s
1973. principle (equal volumes of gases under Identical
Comer, David J., "Modern Electronic Circuit Design," conditions contain equal numbers of molecules)
Reading, Massachusetts, Addison Wesley, 1978. is inherent in this calculation. So too are the
DeFrance, J. J., "Communications Electronics Cir- approximations of the ideal ~as law. Refined
cuits," Second Edition, San Francisco, 1972. calculations can be made by usmg more correct
equations of state.
Cross-references: MICROWAVE TRANSMISSION, Many chemical reactions are most conven-
PROPAGATION OF ELECTROMAGNETIC WAVES, iently carried out or measured in solution (e.g.,
PULSE GENERATION, RADAR, WAVE MOTION. by titration). The usual concentration conven-
MOLE CONCEPT 760

tion is the molar solution. (Some chemists prefer Lewis, G. N., and Randall, M., "Thermodynamics,"
to use the equivalent term formal). A 1.0 molar Second edition, revised by Pitzer, K. S., and Brewer,
solution is one which contains one mole of L., New York, McGraw-Hill Book Co., 1961.
solute per liter of solution. Thus the number of
moles of solute in a sample will be Cross-references: CHEMISTRY, ELECTROCHEMIS-
TRY, GAS LAWS, MOLECULAR WEIGHT.
n = Volume (liters) X Molarity (moles/liter)
The amount of chemical reaction occurring at MOLECULAR BIOLOGY
an electrode during an electrolysis can be ex-'
pressed in moles simply as n = q (coulombs}/z 1" Molecular biology, the study of biologically im-
where z is the oxidation number (charge) of the portant molecules and their interactions, is the
ion and 1" is the faraday constant, 96 487.0 result of the progression of biology from the
coulombs/mole. Thus the faraday can be con- classical study of whole organisms to the more
sidered to be the charge on a mole of electrons. recent study of individual cells and their com-
This affords one of the most accurate methods ponents. Its beginnings are usually dated from
of evaluating the Avogadro number (6.0220 the announcement of the double-helical struc-
X 1023), since the value of the elementary ture of DNA molecules made by Watson and
charge is known with high precision . Crick in 1953. During the early years of molec-
Modem chemistry increasingly uses data at the ular biology, most attention was focused on
atomic level for calculation at the molar level. bacteria and their viruses, since they were the
Since the former often are expressed as quanta, most easily studied systems. For example,
appropriate conversion factors must involve the under appropriately controlled conditions,
Avogadro number. Thus the einstein of energy is hundreds or thousands of liters of bacteria can
that associated with a mole of photons, or be prepared in which every cell is essentially
E=Nhv. Thus light of 2537)\ wavelength will identical. Most of our fundamental knowledge
represent energy of
6.02 X 1023 (photons/mole) X 6.62 X 1Q-27(erg-sec} X 3.000 X 10Io(cm/sec}
E=------------------------------------------------~~~
2.537 X 10-5 (cm) X 4.184 X I 0 7(erg/cal) X 103(cal/kcal)
E = 113 kcal/mole
If the SI system of units is used
6.022 X 1023(mol- l ) X 6.626 X 10-34 (1 . s) X 3.000 X 1Q8(ms- l )
E=--------~--~----------~--~--------~--~
2.537 X 1Q-7(m)
= 4.740 X 10 5 (J mol-I)
Another convenient conversion factor is I eV/
particle = 23.05 kcal/mole. about the ways in which cells synthesize and
The chemist's use of formulas and equations use their macromolecules was originally derived
always implies reactions of moles of material, from the study of bacterial systems.
thus HCI(g) stands for one mole of hydrogen At the present time, however, the trend is in
chloride in the gaseous state. Thermodynamic the opposite direction. A concerted effort is
quantities are symbolized by capital letters underway to apply the models developed for
standing for molar quantities, e.g., CIJ (heat molecular biologic processes to multicellular
capacity at constant volume in cal mole- I organisms. Such organisms present a real chal-
deg- I ), G (Gibbs function in cal/mole), etc. At lenge to biologists, since most complex organ-
times it is more convenient to convert an exten- isms contain more than one kind of cell (the
sive property into an intensive expression. This cells have "differentiated"), and the interac-
is especially true in dealing with multicompo- tions between these groups of cells within an
nent systems. These are referred to as "partial organism are carefully controlled. Moreover
molal quantities" and are given a symbol em- cells from multicellular organisms differ in fun-
ploying a bar over the letter. Thus the partial damental ways froIll those of bacteria.
molal volume, VI = (a V/anl) is the rate of Examples of these differences can be seen by
change of the total volume of a solution with referring to Figs. I A and 1B, which show trans-
the amount (number of moles) of component 1. mission electron micrographs taken of thin sec-
WILLIAM F. KIEFFER tions of the two types of cells. Figure IA is a
bacterial cell which exhibits typical features
such as a central cluster of DNA; basically fea-
tureless cytoplasm (the liquid portion of the
References
cell) surrounded by a lipid bilayer (cell or unit
Kieffer, W. F., "The Mole Concept in Chemistry," membrane); and a rigid cell wall around the
Ed. 2, New York, Van Nostrand Reinhold, 1973. entire organism. This type of cell is considered
761 MOLECULAR BIOLOGY

FIG. 1. Examples of biologic organization as seen in the transmission electron microscope.


(A) A thin section of the prokaryote Bacillus sphaericus prepared by Dr. Elizabeth W. David-
son, Arizona State University. The length of the bar is 0.1 micrometer. (B) A thin section of a
eukaryotic cell (from rat liver) prepared by Dr. Candice J. Coffin, Arizona State University. The
length of the bar is 1.0 micrometer. (C) An intact bacterial virus, PBSl, negatively stained with
potassium phosphotungstate by E. A. Birge. The length of the bar is 0.1 micrometer.
The labeled structures are: C, cytoplasm; CW, cell wall; E, endoplasmic reticulum; H, head;
M, cell membrane; Mi, mitochrondria; N, nucleus; Nd, nucleoid; T, tail; Tf, tail fiber.

ancestral, in an evolutionary sense, to the Both DNA and the related molecule ribonu-
type of cell shown in Fig. I B, and is designated cleic acid (RNA) are polymers of nucleotide
as prokaryotic. Figure 1B shows a eukaryotic bases composed of certain nitrogenous bases
cell with its typical nucleus surrounded by a (adenine, cytosine, guanine, thymine and uracil;
unit membrane; cytoplasm containing energy- abbreviated A, C, G, T, and U, respectively)
producing mitochondria and the membranous coupled to a pentose (either deoxyribose or
structures known as endoplasmic reticulum; ribose) and then to phosphate, as shown in
and a unit membrane surrounding the entire Fig. 2. The polymeric chain is formed by alter-
cell. Animal cells do not have cell walls, al- nating pentoses and phosphates with the ni-
though plant cells do. This type of cellular or- trogenous bases projecting to one side. RNA is
ganization is typical not only of plants and generally single-stranded, incorporating the
animals but also the unicellular protozoa, fungi, bases ACGU, while DNA is generally a double-
and true algae. stranded molecule incorporating the bases
Despite the differences noted above, there ACGT. The structure of DNA molecules is
are basic similarities between prokaryotic and somewhat variable and dependent upon such
eukaryotic cells. The basic materials from factors as the temperature, salt concentration,
which the cells are made are identical, as are and base composition, but a typical helical
many of the macromolecules within the cells. structure for DNA is shown in Fig. 3. The struc-
In both cases, the genetic material is deoxyribo- ture is formed by pairing bases from the parallel
nucleic acid (DNA), which must be synthesized strands according to the following rule: A pairs
in a semiconservative manner (replicated) prior with T and G pairs with C. When RNA is in-
to each cell division. This process is facilitated volved, U is substituted for T. During the repli-
by the double-stranded nature of the DNA mol- cation process mentioned above, this base pair-
ecule itself. ing is utilized to spontaneously line up the
MOLECULAR BIOLOGY 762

ph01Phll.

/
FIG. 2. Pairing of DNA strands to make a helix. In the diagram the backbones
of the two DNA strands are shown along the right and left margins. The bases pro-
ject into the space between and are held together by hydrogen bonds (dotted line).
(Reproduced from Waiter, W. G., McBee, R. H., and Temple, K. 1., "Introduction
to Microbiology," New York, D. Van Nostrand Company, 1973.)

precursor nuc1eotides so that a polymerase


enzyme can join them together. Polymerases
are extremely fast acting and may join as many
as 250-1000 nuc1eotides per second under the
appropriate conditions.
The genetic information for any cell is en-
coded in redundant form within its DNA base
sequence due to the specificity of base pairing.
This information is not, however, directly avail-
able for use. Instead an RNA copy of one of
the two DNA strands (the "sense" strand) is
made by an RNA polymerase following the
usual base pairing rules in a process called tran-
scription. Each coding region of the DNA is ca-
pable of producing specific RNA molecules
whose functions are predetermined. Some are
used as part of the subcellular structures called
ribosomes and are designated rRNA. Others are
used as highly specific carriers of the amino
acids, the subunits of the polymers called pro-
teins, and are designated transfer or tRNA. Still
other molecules, the rarest class, contain the
actual code which determines the sequence of
amino acids used to construct a protein. These
molecules are designated as messenger or
mRNA molecules and tend to be unstable. They
code for only one protein if isolated from a-
eukaryotic cell but may code linearly for as
many as ten discrete proteins if isolated from a FIG. 3. The winding of a DNA double helix. A less
prokaryotic cell. To a first approximation, each magnified view of a DNA molecule than shown in Fig.
region of the DNA coding for an rRNA, tRNA, 2, this diagram shows the B form structure. There are
or individual protein molecule represents a gene. ten base pairs per turn of the helix, and each pair is
Overlapping genes are rare but not unknown. rotated 36° with respect to the preceding pair. Re-
The next step in utilization of genetic infor- produced from Walter et al.
763 MOLECULAR BIOLOGY

mation is the translation of the genetic code karyotic cells the entire virus enters, and then
into the appropriate sequence of amino acids. the protein coat is removed.
Each triplet of bases (codon) codes for a spe- Once inside the cell, viruses follow one of two
cific amino acid or punctuation signal, and all patterns. In a "lytic" infection, the viral nucleic
possible triplets are meaningful. Therefore acid immediately uses the host cell enzymes to
ribosomes must always attach to mRNA at spe- produce more viral nucleic acid and the pro-
cific sites to ensure the proper "reading frame." teins necessary to encase it. Other viral specific
During translation, codons on the mRNA are enzymes may be produced to facilitate this pro-
matched to corresponding anticodons on the cess by disrupting host cell functions not neces-
tRNA by the usual base pairing rules to assure sary for the production of viruses. New virus
delivery of the correct amino acid. particles are produced not by division of a pre-
In prokaryotes, which have no compart- existing entity, as is the case with cells, but
mentalization within their cytoplasm, transla- rather by a sequential assembly process which
tion occurs as soon as the mRNA is formed. In shows remarkable similarities to formation of
eukaryotes, however, the DNA and initial RNA chemical crystals from a supersaturated solu-
transcripts are found in the nucleus, while the tion. In most cases, the host cell is destroyed or
translation machinery is found in the cytoplasm. "lysed" during release of the viral particles. This
As a result the RNA must be exported from the observation has given rise to an alternative
nucleus. Before this can happen, the RNA must name for bacterial viruses-bacteriophages or
be processed to remove certain noncoding or simply phages.
intervening sequences which are present in most The second mode of viral infection is called a
eukaryotic genes. The processed mRNA is "temperate" infection. In this case the viral nu-
passed through pores in the nuclear membrane cleic acid establishes a semipermanent relation-
into the cytoplasm. In either type of cell, trans- ship with the cell which preserves both virus
lation occurs in a processive fashion on a com- and host cell. If the viral nucleic acid is RNA, it
plex of a single mRNA molecule and multiple is converted to DNA. The DNA then sets up a
ribosomes called a polysome. Although pro- stable association with the host DNA such that
karyotic and eUkaryotic ribosomes are similar the host cell replicates the viral DNA at the
in function, they are somewhat different in same time as it replicates its own DNA. The viral
structure, with the prokaryotic ribosomes being DNA, instead of producing coat proteins and
smaller than their eukaryotic counterparts. lytic factors, produces a protein repressor which
Interestingly certain organelles within eu- acts to prevent the synthesis of mRNA mole-
karyotic cells, the chloroplasts (sites of photo- cules coding for the lytic functions. The quies-
synthesis) and the mitochondria, contain small cent viral DNA is now designated as a provirus
DNA molecules coding for ribosomes of the or prophage, since the appropriate stimulus will
prokaryotic type. It is now considered likely cause it to revert to the lytic form and destroy
that at least chloroplasts are descended from its host cell. Not all viruses are capable of the
primordial prokaryotic cells which colonized temperate response, and those which are ca-
eukaroytic cells. pable of the temperate response do not neces-
As protein molecules are produced, they fold sarily use it.
spontaneously into three-dimensional config- Proviral DNA has been observed to exist in
urations appropriate to their function. In the two forms. It may integrate itself into the host
case of proteins intended for use outside the cell DNA and become an actual physical part
producing cell, they are generally produced in of the cell's genetic material. Alternatively it
an inactive configuration which is altered by may exist as a small independent circular DNA
removal of a portion of the amino acid chain molecule which replicates side-by-side with the
during transport through the cell membrane. host DNA. The latter form of DNA is called a
As is true for most substances, movement of plasmid and is more commonly found in
proteins across the cell membrane is an energy- prokaryotes.
requiring process. The term plasmid actually encompasses a
All of the normal cellular processes men- much larger group of DNA molecules than just
tioned above can be subverted by small obligate viruses. Nonviral plasmids have been observed
intracellular parasites called viruses. These en- in most bacteria and many of the simpler eu-
tities represent the boundary between living karyotes such as yeast. These extra pieces of
and nonliving matter. When not in a cell, they DNA are considered dispensable to the host cell
have the appearance of complex crystals of even though they may, under the appropriate
protein and nucleic acid, as shown in Figure 1C. conditions, increase the cell's chances for sur-
Viruses are generally rod-shaped or polyhedral vival. Examples of this include plasmids which
structures consisting of a protein "coat" and a make bacterial cells resistant to certain anti-
highly condensed nucleic acid molecule which biotics or which allow them to break down cer-
may be RNA or DNA. Infection of a cell con- tain complex molecules such as xylene for food.
sists of movement of the nucleic acid across the Recent discoveries in molecular biology are
cell membrane. In the case of prokaryotic cells, having a profound effect on our understanding
only the nucleic acid enters the cell. For eu- of molecular genetics and on the way in which
MOLECULAR BIOLOGY 764

Donor DNA

© ~ Fragment Only

0
EeoRI
or
AATTC G
G CTTAA ~ Vector Plus
~
One or More Fragments
AATTC G
G CTTAA

t or
EeoRI

0 Vector Only

0
~

FIG. 4. A diagrammatic representation of gene splicing. The double circle at


the bottom left of the diagram represents a double-stranded plasmid DNA mole-
cule, while the double-stranded donor DNA at the top may be from any source.
The EcoRI enzyme is the prototypical restriction endonuclease which always
leaves identical single-stranded ends on the cut DNA. As shown on the right-hand
side of the diagram, the single-stranded regions may pair so as to reform the orig-
inal molecules or to form new constructs. (Adapted from Birge, E. A., "Bacterial
and Bacteriophage Genetics: An Introduction," New York, Springer-Verlag,
1981.)

biologic problems can be solved. It is now the result is a clone of cells all of which carry
known that while most portions of DNA mole- the particular DNA segment of interest.
cules are stable over long periods of time, in Application of these techniques has led to a
both prokaryotes and eukaryotes certain small true biologic revolution. It is now possible to
regions within the DNA molecules are naturally splice purified DNA into the middle of plasmids
highly unstable. These unstable regions, called or transposons and to insert the spliced DNA
transposons, consist of special terminal ele- into living cells. This is genetic engineering in
ments with a wide variety of coding elements the fullest sense of the term. The results that
such as antibiotic resistance located between have been obtained from the process have in-
them. The terminal elements have the ability to cluded such oddities as bacteria which produce
cause the entire transposon to simultaneously human insulin or growth hormone or animal
replicate itself and insert itself into a new posi- cells which carry bacteriophage DNA. Con-
tion on the same or a different DNA molecule. structs like these may some day permit us to
Transposons thus represent "jumping genes" and understand precisely how cells regulate their
provide a way to move bits of DNA around in internal processes as well as their interactions
the cell in a more or less random fashion. with neighboring cells. They certainly promise
The greatest impact on modern biology has to revolutionize the study of biology.
been made by combining the information pre- As the preceding discussion indicates, it is ap-
sented above with the new techniques which parent that a cell is constantly involved in many
have been developed for producing artificial re- activities which require the movement of mol-
arrangement of DNA molecules-techniques ecules. The extent of this feat becomes more
known as gene splicing. The procedures all de- apparent when a few size comparisons are made.
pend upon the action of certain restriction A typical bacterial cell may be a rod approxi-
endonucleases produced by various bacteria. mately one micrometer in diameter and several
These enzymes attack all "foreign" DNA, mol- micrometers in length. The DNA molecule of
ecules which have not been suitably modified the same cell is a circular structure approxi-
by the addition of small substituents such as mately one millimeter in length. In the case of
methyl groups at specific sites. They cleave the a human cell, the diameter of the nucleus is
unmodified DNA at base sequence-specific sites about 10 micrometers and the 46 linear DNA
to produce variably sized fragments. Since all molecules represent a length of about one
DNA fragments produced by a given enzyme meter. Clearly the DNA cannot exist as a ran-
will have identical ends, it is a comparatively dom coil within the cell and still allow space for
easy job to rejoin the fragments in new com- other activities.
binations and permutations in a manner such as Prokaryotic and eukaryotic cells have solved
shown in Fig. 4. When the spliced DNA is in- this problem in different ways. In prokaryotic
serted into a cell and the cell is allowed to grow, cells the circular DNA molecule is formed into
765 MOLECULAR SPECTROSCOPY

Wang, 1. C. "DNA Topoisomerases," Scientific Amer-


ican 247(1),94-109 (1982).
Watson, J. D., "Molecular Biology of the Gene," Menlo
Park, California, W. A. Benjamin, Inc., 1976.

Cross-references: BIOMEDICAL INSTRUMENTA-


TION, BIOPHYSICS, ELECTRON MICROSCOPE,
MEDICAL PHYSICS, PHOTOSYNTHESIS.

MOLECULAR SPECTROSCOPY
Molecular spectroscopy encompasses the broad
range of efforts to understand and utilize the
interaction of gas-phase molecules with electro-
FIG. 5. Supercoiling of DNA. The two DNA mole- magnetic radiation. Spectroscopy is the basic
cules have been spread onto a thin plastic film; stained tool for exploring the internal structure of
with uranyl acetate; and then shadowed with platinum molecules, and spectroscopic studies are of
and palladium. The molecule on the left has retained fundamental importance for understanding the
its supercoils, while the one on the right has one or microscopic properties of matter. Molecular
more broken phospho diester bonds and has therefore spectroscopy is also of considerable practical
lost all of its superhelicity. (Electron micrograph by value, since the spectrum of a molecule pro-
E. A. Birge.) vides a characteristic "fingerprint" by which
that molecule may be identified.
The spectrum of a molecule may be measured
either by determining the wavelengths absorbed
a series of loops, each of which is supercoiled by the molecule (absorption spectroscopy) or
by a family of enzymes called topoisomerases. by observing the wavelengths emitted by a
These enzymes break either one or two strands sample of excited molecules (emission spectros-
of the DNA helix and can pass other strands of copy). In either case, the spectrum is far more
DNA through the nicked region. Therefore they complex than an atomic spectrum. Emission
also have the interesting property of being able and absorption are found to occur from radio-
to tie and untie knots within the DNA mole- frequency wavelengths through the infrared
cule. An example of the difference between and visible regions of the spectrum and far
supercoiled and "relaxed" DNA molecules can into the ultraviolet. Under conditions of low
be seen in Fig. S. The highly supercoiled DNA resolution, the visible spectrum is observed to
molecule comprises the nucleoid seen in Fig. I A. consist of numerous bands, hence the designa-
Eukaryotic cells literally coil their linear DNA tion band spectra. Higher resolution demon-
around cylinders of protein called histones. strates that each band is composed of numer-
There are four histone proteins, three of which ous closely spaced lines. The microwave and
are used to make cylinders and one of which infrared portions of the spectrum are much
covers the DNA which connects adjacent cylin- less congested and easier to analyze. Herz-
ders. The net effect is to take a single DNA berg's three-volume work l on molecular spec-
molecule and coil it into a sort of "beads on a troscopy is the major reference. The fact
string" structure. Such a structure is called a that Volumes I and II have been in print
chromosome and may exist in an extended for over 30 years testifies to their enormous
state or in an even more condensed form dur- success. A more modern presentation is given
ing cell division. by Steinfeld. 2
Further information on packaging problems Those wavelengths which are present in a
and many other topics discussed in this article molecular spectrum are governed by the law of
can be obtained from the references. quantum physics which states that a photon
of frequency f = (£2 - £ 1 )/h is emitted or ab-
EDWARD A. BIRGE sorbed whenever the molecule undergoes a
transition between energy levels £ land £2'
References Here h is Planck's constant. Understanding the
spectrum, then, is equivalent to understanding
Adams, R. L. P., Burdon, R. H., Campbell, A. M., the energy levels of a molecule. All of the
Leader, D. P., Smellie, R. M. S., "The Biochemistry essential features of molecular spectra are
of the Nucleic Acids," Ninth Edition. New York, present in diatomic molecules, to which the
Chapman and Hall, 1981. discussion below is restricted. The energy level
Glover, D. M., "Genetic Engineering: Cloning DNA," structure of polyatomic molecules can be ex-
New York, Chapman and Hall, 1980. plained by extending the concepts developed
Primrose, S. B., Dimmock, N. J., "Introduction to for diatomic molecules. A recent compilation
Modern Virology," Second Edition, New York, John of data for diatomic molecules is given by
Wiley & Sons, 1980. Huber and Herzberg. 3
MOLECULAR SPECTROSCOPY 766

It is customary in molecular spectroscopy to by


express frequencies in units of wave numbers
11k
= 2; V;
(cm- 1 ). The wave number of a photon of
frequency I is I/c, where c is the speed of light lose
in vacuum. Since the frequency I and wave-
length A of an electromagnetic wave are related where k is the force constant and J.l. = M 1 M 2 /
by AI = c, it is seen that the wave number is (M 1 + M 2) is the reduced mass. Solution of the
the reciprocal of the wavelength. Using I = Schrodinger wave equation for the quantum
{E 2 - E 1 )/h, the photon's wavenumber is harmonic oscillator leads to energy levels given
{E 2 - E1)/he. The values E\/hc and E2 /hc, by
which also have units of cm - , are known as
the term values of the energy levels Eland E2. Evib{V) =hlose{v + t)
Spectroscopists find it convenient to refer to
energy levels by their term values, since a transi- where v = 0, 1,2,3, ... is the vibrational quan-
tion between two levels involves a photon tum number. Transforming to term values, by
whose wave number is given simply by the dif- dividing by he, the vibrational energy levels of
ference in the term values of the levels. a harmonic oscillator molecule are
+ t)
The electrons in a molecule are much lighter
and move much more rapidly than do the G{v) = w{v
nuclei. Consequently, it is possible, to a high where w is the vibrational frequency expressed
level of accuracy, to separate the problems of in cm- 1 .
electronic motion and nuclear motion. This Although the harmonic oscillator approxima-
procedure is called the Born-Oppenheimer tion displays the essential features of the vibra-
approximation. Each electronic state is charac- tional motion, the actual potential in which the
terized by the value of its electronic angular nuclei vibrate deviates rather sharply from a
momentum projected onto the internuclear harmonic potential. A more complete expres-
axis. This component of the electronic angular sion for the vibrational energy can be developed
momentum is conserved by virtue of the as a power series in (v + !) and is given by
fact that a diatomic molecule is symmetric
for rotations about the internuclear axis. G{v) =We{v + t) - WeXe{V + t)2
In analogy with the atomic physics notation of
S, P, D, F, ... , an electronic state is labeled~, + WeYe{V + t)3 + ....
II, .::l, <1>, ... if its projected angular momentum,
in units of h/2rr, is 0, l, 2, 3, .... The energy Here the subscript e refers to the equilibrium
of an electronic state depends not only upon position, and the coefficients for each higher-
the electron configuration but also upon the order term (WeXe, WeYe, . . . ) become suc-
internuclear separation R . Theorists are still cessively smaller. It is rarely necessary to go
challenged by the difficult problem of calculat- beyond the cubic term when analyzing experi-
ing accurate electronic energies. The transitions mental data. The energy difference between
between different electronic states are respon- adjacent vibrational states is typically 0.1 elec-
sible for visible and ultraviolet molecular spec- tron volt, a factor of 100 less than the energy
tra since, as in atoms, energy differences between difference between electronic states.
the states are generally several electron volts. A first approximation for the end-over-end
The appearance of bands rather than distinct rotational motion is to consider the molecule
lines is a consequence of the nuclear motion. to be a rigid rotor. Quantizing the angular mo-
A physical model for the nuclear motion is mentum leads to energy levels which are given
obtained by considering the molecule to be a by
dumbbell which can vibrate along the inter-
nuclear axis as well as rotate end-over-end. The h 2 J{J+ 1)
vibrational energy G and the rotational energy Erot(J) = 8rr 2 I
F must be added to the electronic energy Te
to give the total molecular energy: T = Te + G + where I is the molecule's moment of inertia and
F. Nuclear vibration occurs in the potential J = 0, 1, 2, 3, ... is the rotational angular mo-
well formed by the negative electronic binding mentum quantum number. Expressing these as
energy, which is a function of R, and the posi- term values, the rotational energy levels of a
tive energy due to Coulomb repulsion of the molecule are
nuclear cores. The potential reaches a minimum
at a particular value re of the internuclear F{J) =BJ{J + 1)
separation. This is the equilibrium internuclear
distance, and the nuclear separation oscillates where
about this equilibrium value. For small displace-
ments from equilibrium, the vibrational motion h
B=--
can be approximated by that of a simple har- 8rr 2 eI
monic oscillator. The classical oscillation fre-
quency for nuclei of mass M 1 and M 2 is given is called the rotational constant.
767 MOLECULAR SPECTROSCOPY

Again, the actual nuclear motion is more temperature gas can be populated, and each is
complex than this simple model. Centrifugal allowed by the /j.J selection rule to undergo a
distortion of the molecule has the effect of transition to three levels in the final state.
introducing a term proportional to f2 (J + 1)2. Hence each band in a spectrum is comprised of
In addition, the rotational constant depends 100 or more distinct lines. It is not surprising,
slightly upon the vibrational quantum number, then, that the analysis of molecular spectra
since vibration changes the average moment made little progress before the advent of quan-
of inertia. These considerations lead to a more tum physics.
general formula for the rotational energy, Spectral lines in the infrared occur when a
namely, molecule undergoes a transition between two
different vibrational levels within the same
electronic state. The /j.J = ± I or 0 selection rule
still holds, so P, Q, and R branches can again
where the subscript v indicates a dependence be identified (although the /j.J = 0 Q-branch
upon the vibrational quantum number. The transitions are forbidden in ~ ele·;tronic states).
separation between adjacent rotational energy If the vibrational motion were exactly that of a
levels is typically 0.00 I electron volt . harmonic oscillator, a selection rule /j.v = ± I
Division of the molecular energy into elec- would apply to the vibrational quantum num-
tronic, vibrational, and rotational energies has ber. This rule is not rigorous since, as was noted,
been quite successful for understanding the the harmonic oscillator model is not perfect.
primary features of molecular spectra. At the Nevertheless, /j.v = ± I transitions are usually
very highest levels of resolution, however, it the strongest.
is observed that each line in a band splits into Transitions between the rotational levels of
several very closely spaced lines. This fine a given vibrational level are characterized by
structure is a result of interactions, or couplings, frequencies in the far infrared and microwave
between the various types of motion which, regions of the spectrum. Only /j.J = ± I are pos-
until now, have been considered separately. A sible for this case, so only a P and an R branch
typical example, known as A-doubling, is a appear. Rotational spectra appear very simple
coupling between the molecule's electronic and and regular when compared to visible band
rotational motions . The existence of fine struc- spectra. Transitions between two rotational
ture emphasizes the limitations of the Born- levels can often be detected by the absorption
Oppenheimer approximation. After the various of microwaves. The high accuracy with which
couplings are included, all known aspects of microwave frequencies can be measured allows
molecular spectra can be understood. rigorous tests of the theory of molecular struc-
Each spectral line is a consequence of the ture. This aspect of molecular spectroscopy is
absorption or emission of photons which occurs discussed in a well-known text by Townes and
when molecules undergo transitions between Schawlow. 4
two energy levels. Comparison of an observed Another important feature of a molecular
spectrum with theoretical energy levels requires spectrum is the intensities of the lines. Even
knowing which transitions are allowed and casual observation of a spectrum reveals that
which are forbidden. Information of this sort some lines are quite intense while others are
is codified into selection rules. Most selection barely perceptible. This can be understood by
rules can be understood by considering the pos- noting that not all of the possible transitions
sible symmetries of a molecule. from a particular initial state are equally
Visible and ultraviolet spectra result from probable. Each molecular transition is charac-
transitions between two different electronic terized, then, not only by a wavelength but also
states. The primary selection rule specifies that by a transition probability. The larger the
the molecule's angular momentum quantum transition probability, the more intense the
number J can change only by /j.J = ± I or O. spectral line. In emission spectroscopy, where
Three groups of lines appear, each associated the initial state is an excited state, the transi-
with a particular value of /j.J, which are called tion probabilities determine the average amount
P, Q, and R branches. Changes in the vibra- of time that a molecule spends in the excited
tional quantum number are not restricted by state before emitting a photon. This quantity,
any selection rules. However, the Franck- known as the lifetime of the excited state, is
Condon principle, which states that the inter- typically I X 10- 7 second. The measurement of
nuclear separation cannot change during the excited state lifetimes is an important part of
emission or absorption of a photon, makes molecular spectroscopy.
transitions between some pairs of vibrational Energy levels in polyatomic molecules may
levels far more likely than between other pairs, also be understood by a consideration of elec-
A band is formed from the combined P, Q, and tronic, vibrational, and rotational motions.
R branches associated with a transition be- Instead of a single mode of vibration, an N-atom
tween a particular vibrational level of the upper polyatomic molecule will have 3N - 6 possible
electronic state and a particular vibrational level modes of vibration; 3N - 5 for a linear mole-
of the lower electronic state. Thirty or more cule. Rotation in a poly atomic molecule is
rotational levels in the initial state of a room possible about three separate axes, although
MOLECULAR SPECTROSCOPY 768

a useful simplification occurs for symmetric- Resolution with a Fourier transform spectrom-
top molecules, where two of the axes become eter can exceed that with a large grating spec-
equivalent. As a consequence of the additional trometer. Infrared spectroscopy has made the
vibrational and rotational motions, polyatomic most use of this technique, with high resolution
molecules have many more energy levels than studies of, among many other molecules, H2 0,
diatomic molecules. The spectrum which is CO 2 , CH 4 , and C2 H4 • Fourier transform
actually observed is again dependent upon se- spectroscopy has been valuable as well for visible
lection rules which govern the allowed transi- spectra, especially for mapping the complex
tions between energy levels. Even moderate- spectrum of 12 , which extends throughout
size molecules (3 or 4 atoms) exhibit spectra much of the visible region of the spectrum. 6
which are exceedingly complex, and full rota- Because of the high absolute precision with
tional resolution is, in general, obtainable only which they were measured, 12 wavelengths are
at the highest possible resolution. Spectra of now routinely used as standards in many spec-
larger molecules are usually analyzed only in troscopy experiments.
terms of electronic and vibrational motions, ro- The recent development of lasers, especially
tational analysis not being possible. The study tunable lasers, has awakened a new interest in
of polyatomic molecular spectra is aided con- molecular spectroscopy and motivated a rapid
siderably by the use of group theory. proliferation of new experimental techniques. 7
Experimental molecular spectroscopy his- Although absorption spectroscopy with laser
torically has been pursued with the use of grat- sources has been highly successful, particularly
ing spectrometers. While other techniques are in the infrared, the most innovative techniques
now available, grating spectrometers continue have utilized laser-induced fluorescence. Spectra
to playa major role in contemporary research, are obtained in these experiments by detecting
especially for investigations of ultraviolet spec- photons which are emitted from excited states
tra. The wide spectral range of spectrometers, (fluorescence) as the laser wavelength is varied.
from the far infrared into the vacuum ultra- Excitation to the higher energy level occurs
violet, makes them exceptionally versatile tools. whenever the laser's wavelength coincides with
Instruments range in size from portable table- a spectral line, enabling those molecules in the
top models, with resolution of 1 part in 10 3 , proper lower energy level to absorb laser pho-
to room-size giants, with resolution of 1 part in tons and thus undergo the upward transition.
10 6 . Spectrometers designed for use in the Significant advantages of laser spectroscopy in-
ultraviolet or far infrared are evacuated, to pre- clude high resolution, low background and
vent atmospheric absorption, and utilize special noise, and exceptionally high sensitivity. Laser-
optical materials. induced fluorescence experiments have ob-
Both absorption and emission spectra can be tained spectra from molecules at densities as
measured with a grating instrument. For ab- low as 10 4 cm -3. This has been especially valu-
sorption, light from a continuum source is able for the study of free radicals (chemically
passed through an absorption cell (sometimes unstable molecules) and molecular ions.
the spectrometer itself) and then dispersed. Ex- Laser spectroscopy techniques have been de-
citation spectra are obtained by dispersing the veloped for both the near-infrared and far-infra-
light from a discharge which contains the mole- red spectral regions. Tunable diode lasers and
cule of in terest. color center lasers now cover the entire near-
Photographic plates have long been the tra- infrared region from I to 30 /lm. Wavelengths
ditional means of detecting the dispersed light, for transitions between different vibrational
but they are rapidly being replaced by photo- energy levels of a single electronic state typically
electric detectors, which are more sensitive as fall within this range, and laser absorption
well as more compatible with computers. spectroscopy, because of the narrow laser line-
Fourier transform spectroscopy5 represents widths, provides good resolution of the rota-
an alternative to grating spectroscopy. The basis tional structure even for quite large molecules.
for this technique extends back to Michelson, Extensive investigations have been made for
who noted that the output of an interferometer, molecules such as NH 3 , C2 H2 , CF 4 , and SF 6.
as a function of time, is the Fourier transform Far-infrared lasers have wavelengths which are
of the light source spectrum, as a function of well-matched to rotational transitions in many
frequency. Practical realization, however, has molecules. These lasers, however, operate at
awaited the appearance of high-speed, inex- fixed frequencies. Spectroscopy can nonethe-
pensive computers. In a typical experiment, less be performed by "tuning" the molecule.
light from a continuum source passes through This is accomplished by applying a strong
an absorption cell, containing the molecule of magnetic field which shifts the molecule's
interest, and then through a Michelson interfer- energy levels (Zeeman effect) until the transi-
ometer. The interferometer's output intensity, tion frequency between the two levels matches
as measured by an appropriate detector, is digi- the laser frequency. Laser magnetic resonance,
tized as the mirror moves and then transferred as this procedure is called, is rapidly increasing
to a computer, which calculates the Fourier in usage as more and more far-infrared laser
transform of the data to produce the spectrum. lines are discovered.
769 MOLECULAR SPECTROSCOPY

Tunable laser radiation from dye lasers is Another promising' technique, which avoids
available throughout the visible region of the problems due to the rapid recombination of
spectrum. Nonlinear optical techniques, such as molecular ions, is the use of laser spectroscopy
frequency doubling, extend the range of tun- to study ions which are stored in an ion trap.
ability to wavelengths as short as 200 nm. This Spectroscopic data for molecules are valu-
is the wavelength region for band spectra able far beyond the walls of the laboratory.
associated with transitions between different Spectra are used in a wide range of applications,
electronic states, and both laser absorption and from the routine industrial analysis of chemicals
laser-induced fluorescence, as well as more to the identification of atmospheric pollutants.
exotic laser techniques, have been used to study Perhaps the most important application in re-
a large number of molecules. In most laser cent years of molecular spectroscopy has been
spectroscopy experiments, the resolution is in the field of interstellar chemistry.1O Nearly
limited not by the laser's linewidth but rather 60 molecules in interstellar molecular clouds
by the Doppler width of the transition, a conse- have been identified on the basis of their
quence of molecular motion. The limiting reso- molecular spectrum, and the number continues
lution of around I part in 10 6 , often inadequate to grow. While astronomers have observed and
to resolve rotational details in polyatomic identified some visible spectral lines, most of
molecules, is poor when compared with the the molecules have been detected by radio
limit of I part in 10 9 or better which is set by astronomers on the basis of microwave-emitting
the laser's linewidth. This has spurred consider- rotational transitions. Identification of these
able interest in techniques of Doppler-free interstellar molecules has been possible only
spectroscopy. One such technique, which has because of the extensive collection and tabula-
been especially fruitful for ultra-high resolu- tion of molecular spectroscopy data which has
tion spectroscopy, is optical-optical double been going on for many years. In a few cases,
resonance. It has been used for studies of suggestions by astronomers that some of the
molecules such as BaF 2 , CaF 2 , and N0 2 . Other observed features were due to "exotic" mole-
Doppler-free techniques have been applied to cules have prompted laboratory workers to
a variety of molecules. produce and measure the spectra of these
Ultrahigh resolution alone is often inadequate species. The interchange between astronomy
for analysis of the highly complex spectra of and molecular spectroscopy has been beneficial
polyatomic molecules. A new experimental for both sides, but with over 200 interstellar
technique, introduced to grapple with this microwave lines still unidentified, much work
problem, is laser spectroscopy of supersonic remains to be done.
molecular beams. 8 Molecules of interest, often Both fundamental and applied spectroscopy
seeded into a rare gas, are forced at high pres- have witnessed remarkable growth in the last
sure through an expansion nozzle into vacuum. decade. Increasing demand for spectroscopic
The expansion process cools the molecules' data and continuing advances in technology
vibrational and rotational motions down to will undoubtedly keep molecular spectroscopy
temperatures of only a few degrees Kelvin, vigorous for many years to come.
leaving the molecules in only a handful of the
lowest energy levels. This greatly simplifies RANDALL D. KNIGHT
the spectrum and facilitates analysis. The small
molecule N0 2 has a remarkably complex visible References
spectrum which for many years had stubbornly
resisted attempts at analysis. New efforts with L Herzberg, Gerhard F., "Molecular Spectra and
supersonic beams and Doppler-free laser spec- Molecular Structure," VoL I, "Spectra of Di-
troscopy have, however, finally succeeded in atomic Molecules (2nd ed., 1950); Vol. II, "Infra-
establishing a basis for understanding this red and Raman Spectra of Polyatomic Molecules"
molecule. (1945); VoL III, "Electronic Spectra and Elec-
Recent advances in the spectroscopy of free tronic Structure of Polyatomic Molecules," (1966),
radicals and molecular ions,9 difficult to pro- New York, Van Nostrand Reinhold.
duce in large quantities, stem from the develop- 2. Steinfeld, Jeffrey I., "Molecules and Radiation
ment of such high-sensitivity techniques as laser Cambridge, Mass., MIT Press, 1978.
spectroscopy. The unpaired electron found in 3. Huber, K. P., and Herzberg, G. F., "Molecular
most of these species adds complexity as well Spectra and Molecular Structure," VoL IV,
as interest to their spectra. The number of radi- "Constants of Diatomic Molecules," New York,
cals and ions studied, however, remains quite Van Nostrand Reinhold, 1979.
small in comparison with stable, neutral mole- 4. Townes, C. H., and Schawlow, A. L., "Microwave
cules. High resolution studies of several mo- Spectroscopy," New York, McGraw-Hill, 1955.
lecular ions have been performed by laser 5. Becker, E. D., and Farrar, T. c., "Fourier Trans-
spectroscopy of ion beams. The infrared spec- form Spectroscopy," Science 178,361 (1972).
trum of the one-electron molecule HD+ was 6. Gerstenkorn, S., and Luc, P., "Atlas du Spectre
measured with sufficient accuracy to test rig- d'Absorption de la Molecule d'Iode (14 800-
orously the foundations of molecular theory. 20000 cm- I )," Paris, Editions du C.N.R.S.,
MOLECULAR SPECTROSCOPY 770

1978. A correction to the atlas is given in Gersten- several millions for some virus molecules and
korn, S., and Luc, P., Revue de Physique Ap- certain polymeric compounds. Molecular dimen-
pliquee 14, 791 (1979). Subtraction of 0.0056 sions accordingly range from a diameter of
cm -1 from all wavenumbers in the atlas results about 4A for the hydrogen molecule to several
in an absolute accuracy of 0.002 cm- 1 and a rela- thousand angstroms-which has permitted view-
tive accuracy of 0.0007 cm -1 . ing single large molecules in the electron micro-
7. See papers in Hall, 1. 1., and Carlsten, J. 1. (Eds.), scope. Molecular sizes are generally much small-
"Laser Spectroscopy," Vol. III, Berlin, Springer- er and are not measured directly, but are de-
Verlag, 1977 ; and in Walther, H., and Rothe, duced from x-ray diffraction studies of ordered
K. W. (Eds.), "Laser Spectroscopy," Vol. IV, groups of molecules in the crystalline state or
Berlin, Springer-Verlag, 1979. from the physical properties such as hydrody-
8. Levy, Donald H., "Laser Spectroscopy of Cold namic behavior of molecules in the gaseous or
Gas Phase Molecules," Ann. Rev. Phys. Chern. 31, liquid state.
197 (1980). Many methods for determining molecular
9. Saykally, R. J., and Woods, R. C., "High Resolu- weights which are described below depend
tion Spectroscopy of Molecular Ions," Ann. fundamentally on counting the number of mole-
Rev. Phys. Chern. 32,403 (1981). cules present in a given weight of sample. How-
10. Green, Sheldon, "Interstellar Chemistry: Exotic ever, any usable sample contains a very large
Molecules in Space," Ann. Rev. Phys. Chern. 32, number of molecules: at least ten trillion of the
103 (1981). largest known molecules are present in the
smallest weight measurable on a sensitive bal-
Cross-references: ABSORPTION SPECTRA; ATOMIC ance. Therefore, an indirect count is made by
SPECTRA; ENERGY LEVELS, ATOMIC; FOURIER measuring physical properties which are propor-
ANALYSIS; LASER; RAMAN EFFECT AND RA- tional to the large number of molecules present.
MAN SPECTROSCOPY ; SCHRODINGER EQUA- A consequence of the large number of molecules
TION; SPECTROSCOPY. sampled is the averaging of any variations in
content of atomic isotopes in individual mole-
cules, so that normal isotopic fluctuations lead
MOLECULAR WEIGHT to no measurable deviation of molecular weight
values. Abnormally high concentrations of iso-
The molecular weight of a chemical compound topes in radiation products may, however, pro-
is the sum of the atomic weights of its constit- duce altered molecular weights.
uent atoms. The molecule is the smallest weight The term molecular weight is properly applied
of a substance which still retains all of its to compounds in which chemical bonding of all
chemical properties. By convention, each atomic atoms holds the molecule together under nor-
weight, and therefore molecular weights, are mal conditions (see BOND, CHEMICAL). Thus,
expressed relative to an arbitrary standard (see covalent compounds, as represented by many
below). For example, the molecule of acetic organic substances, usually are found to have
acid, CH 3 COOH, contains two atoms of carbon, the same molecular weight in the solid, liquid,
four of hydrogen, and two pf oxygen, so that and gaseous states. However, substances in
its molecular weight is the sum of 2( 12.0 1) + which some bonds are highly polar may exist as
4(1.01) + 2(16.00), which totals 60.06. This un-ionized or even associated molecules in the
molecular weight value is clearly in arbitrary gaseous state and in nonpolar solvents, but they
units, but a related quantity, the gram-molecu- may be ionized when dissolved in polar solvents.
lar weight or mole, is the molecular weight For example, ferric chloride exists in the gaseous
expressed in grams. One mole of any compound state as FeCh at high temperatures, as Fe2 Cl 6
has been found to contain 6.022 X 10 23 mole- at lower temperatures as well as in nonpolar
cules, and this number is called the Avogadro solvents, but reverts to FeCl 3 in solvents of
constant. moderate polarity, and becomes ionic in water
For many years, the standard used for atomic solutions-as chloride ions and hydrated ferric
weights was the exact value 16 for the naturally ions. Similarly, acetic acid and some other car-
occurring mixture of isotopes of oxygen. An- boxylic acids associate as dimers in the vapor
other system of atomic weights, based on the state and in solvents of low polarity, but exist
value of 16 for the most abundant (99.8 per as monomers with progressive ionization as the
cent) oxygen isotope, came into use for com- solvent polarity increases.
parisons involving single atoms or molecules Truly ionic compounds, such as most salts,
where isotopic differences were important. A exist only as ions in the solid and dissolved
conference of the International Commission on states, so that the term molecule is not appli-
Atomic Weights in 1961 adopted as the standard cable and is not commonly used. Instead, the
a value of exactly 12 for the carbon-l 2 isotope, term, formula weight, is used; this denotes the
and since then all atomic weights in use have sum of the atomic weights in the simplest for-
been based on this standard. mula representation of the compound. If a broad
The weights of molecules range from a value definition of a molecule as an aggregate of
of about two for the hydrogen molecule to atoms held together by primary valence bonds
771 MOLECULAR WEIGHT

is adopted, then salts in the crystalline state which often disagree by a factor of two or more.
would appear to have a molecular weight which This disagreement arises from the different bases
is essentially infinite and limited only by the of the methods-for example, some methods
size of the crystal, since each ion is surrounded yield so-called number-average molecular weights
by several ions of opposite polarity to which it by determining the number-concentration of
is attached by ionic bonds of equal magnitude. molecules in a sample, while other methods pro-
A further complication in the definition of duce weight-average molecular weights which are
molecular weights occurs with inorganic poly- related to the weight-concentrations of each
mers, such as the poly phosphates and poly- species. Another common value is the viscosity-
silicates, whose polymeric nature is clearly average molecular weight, which is related to
evident in both their crystal structure and their the viscosity contribution of each species. Other
highly viscous behavior in the molten state. bases are of importance for certain methods of
However, the magnitude of their molecular study, and some of these are complex functions
weights often cannot be found by conventional involving several averages. For some purposes,
methods because they are either insoluble or the determination of a single average molecular
react with solvents, with consequent degrada- weight is sufficient for establishing relations be-
tion. These examples indicate that the molecular tween molecular weight and the behavior of
weight often depends on the conditions used polymers, but the type of molecular weight
for measurement and must be specified where average must be so chosen as to have a close
compounds subject to association, dissociation relation to the behavior property of interest. A
or reaction are studied. more detailed knowledge of the constitution of
The history of the clarification of molecular a sample is sometimes required, particularly if
weight concepts is of considerable interest, since several properties are to be considered, or if
this was so intimately related to other develop- unusual forms of molecular weight distribution
ments in chemical knowledge. Although Dalton curve are present.
had published a table of atomic weights in 1808, The problem of completely defining the mole-
and by 1825 molecular formulas, derived from cular weight nature of poly disperse materials is
combining weights, were in use, many miscon- most accurately solved by determining the fre-
ceptions of these formulas remained until about quency of occurrence of each molecular species
1860. Then evidence from chemical reactions and representing the results as a frequency dis-
and from measurements of vapor densities firm- tribution curve. Such a study is generally quite
ly established the formulas of many inorganic tedious, though there are a few methods which
and simple organic compounds as they are re- provide muc-h or fhe required information in
presented today. The vapor density method, one experiment. The method currently most
based on Avogadro's hypothesis, was thus the used for determining molecular weight dis-
first molecular weight method and continues to tributions of polymers is size exclusion chroma-
be useful for compounds that can be easily vola- tography, which includes gel permeation chro-
tilized. It was not until 1881 that Raoult showed matography and gel filtration. This involves
that the depression of freezing points was pro- measurement of the differences in extent of
portional to the molar concentration of solute. permeation of molecules of different sizes into
In 1884, van't Hoff related the osmotic pressure pores of a solid or gel matrix. The distribu-
of solutions to the vapor pressure, boiling point, tion of molecular sizes found is converted into
and freezing point behavior, and these methods a distribution of molecular weights by calibra-
were quickly put into use for determining mole- tion with standard polymer samples. The method
cular weights. The abnormal physical properties is rapid and applicable to many polymer types.
of salt solutions were explained in 1887 by the Alternatively, polymers can be separated by
ionization theory of Arrhenius, and the very fractional precipitation or fractional solution
careful measurements of many of these proper- into a series of fractions each of which contains
ties furnished the strongest confirmation of the a fairly narrow distribution of molecular
theory. While these measurements provided the weights. Each fraction can then be character-
most precise determinations of the extent of ized by one of the methods described below to
dissociation of weak electrolytes, they also con- yield an average molecular weight. Finally, the
tributed to the development of the Debye- molecular weight distribution curve can be
Hiickel theory for strong electrolytes. constructed by summation of these results.
Molecular Weight Distributions Most syn- While the curve derived is somewhat inexact, it
thetic and many· natural polymeric substances is the best approach to samples which are not
are mixtures of molecules having various susceptible to analysis by the chromatographic
chain lengths, and thus of different molecular methods. The ultracentrifuge is less com-
weights-so-called poly disperse systems. In such monly used for determining molecular weight
cases, molecular weight values have an ambigu- distributions in a single experiment, partly
ous meaning, and no single such value will com- because of high instrumentation costs and
pletely represent a sample. Various techniques partly because of the complexity of methods
for measuring molecular weights, when applied needed to analyze the data.
to one of these materials, will produce values Uses. Molecular weight measurements, in
MOLECULAR WEIGHT 772

conjunction with the law of combining propor- peratures up to 2000°C have been used to study
tions, have enabled the atomic weights of ele- substances which are volatilized only with diffi-
ments in compounds to be determined. When culty, provided decomposition can be avoided.
the atomic weights are known, molecular weight Solids Measured by ColIigative Methods It
measurements permit the assignment of molecu- has been shown that nonvolatile molecules dis-
lar formulas. Other applications to compounds solved in a solvent affect several physical proper-
of low molecular weight allow determination of ties of the solvent in proportion to the number
the extent of ionization of weak electrolytes, of solute molecules present per unit volume.
and the extent of association of some uncharged Among these properties are a decrease of the
compounds which aggregate. The study of mole- vapor pressure of the solvent, a rise in its boiling
cular weights is becoming increasingly valuable point, a decrease in its freezing point, and the
in assessing the effects which various molecular development of osmotic pressure when the
species of a polymer sample have on the physi- solution is separated from the solvent by a semi-
cal properties of the product. Through such permeable membrane. Properties such as these
knowledge, the synthetic process may be modi- which are related to the number of molecules in
fied to improve the properties of polymers. a sample rather than to the type of molecule are
Methods of Measurement Many physical and called colligative properties. They are the basis
certain chemical properties vary substantially for some of the most useful techniques for
with the molecular weight of compounds, and molecular weight determination. The magnitude
these properties are the bases of all molecular of the effects and the ease of measurement
weight methods. The summary given in this sec- differ greatly, so that certain of the colligative
tion includes principally the methods which are properties are preferred for this purpose. For
most frequently used or have general applica- example, an aqueous solution containing 0.2
bility. The choice of the most suitable method gram of sucrose (molecular weight 342) in 100
for a given sample depends on its state (gas, ml has a vapor pressure 0.01 per cent less than
liquid, or solid), the magnitude of the molecular that of the solvent, a boiling point 0.003°C
weight and the accuracy required in its determi- greater, and a freezing point 0.011 °c lower than
nation, as well as on the stability of the com- the solvent, but will develop an osmotic pressure
pound to physical or chemical treatment. Some of 150 cm of water. Since the effects are related
mention of the applicability of the methods in to the number-concentration of solute mole-
these regards is given wherever possible. cules, each method leads to a number average
Gases and Liquids Avogadro's hypothesis molecular weight if the sample consists of a
(1811) that equal volumes of different gases mixture of molecules of different sizes. Accu-
contain the same number of molecules under rate results with any of the techniques are ob-
the same conditions made it possible to find tained only when measurements at a series of
how many times heavier a single molecule of concentrations are extrapolated to infinite di-
one gas is than that of another. Thus, relative lution where the system becomes ideal, i.e. is
molecular weights of all gases could be estab- not affected by interactions between molecules.
lished by comparing the weights of equal vol- Direct vapor pressure measurements with a
umes of gases. The significance of the idea and differential manometer are generally limited to
utilization of this method were first clearly the larger depressions produced by low molecu-
demonstrated by Dumas in 1827, but it was not lar weight solutes, while refined techniques such
until 1860 that the results were accepted by as isothermal distillation require the most exact
most scientists when Cannizzaro showed that a control of conditions. Isopiestic methods allow
consistent system of atomic weights resulted. the comparison of the vapor pressure of so-
With the additional information from chemical lutions of an unknown with those containing a
experiments on the number of atoms of each known substance, and several modifications have
kind present in each molecule, the relative been used more than other vapor pressure
weights of each atom were obtained. The as- methods. Ebulliometric techniques which de-
sumption of the integral value, 16, for the pend on the elevation of the boiling point of a
atomic weight of oxygen (to give a value close solvent are often used for solutes of low molecu-
to unity for the lightest element, hydrogen) lar weight and find some use for large molecules.
then enabled molecular weights of all gaseous Since boiling points are highly sensitive to the
compounds to be determined. The method atmospheric pressure, it is either necessary to
obviously can be applied to other molecules control pressure very precisely, or more com-
which normally occur in the liquid state but monly to measure the boiling points of both
can be volatilized by heating. The Dumas and the solvent and solution simultaneously. Often
Victor Meyer methods are most used for mo- a differential thermometer is employed to
lecular weight determinations with liquids in determine only the difference of the two tem-
this way. These methods have been refined so peratures, and these devices have been made so
that gas densities can now be determined with an sensitive that molecular weights as large as
accuracy of 0.02 per cent, and extremely small 30 000 have sometimes been studied. Tech-
weights of material (about 1 p.g) can be similarly niques involving the lowering of the freezing
studied with somewhat less accuracy. High tem- point of a solvent (cryoscopic methods) are
773 MOLECULAR WEIGHT

much used for rapid approximate determina- X-ray Diffraction X-ray diffraction analysis
tions of molecular weights in the identification is a powerful method for determining exact
of organic compounds. For this purpose a sub- molecular weight and structural characteristics
stance such as camphor, which is a good solvent of compounds in their crystalline state. However
for many organic compounds and has a large the method is complicated and slower than many
molar depression constant, is often chosen to techniques which provide molecular weights of
magnify the difference in freezing point of the accuracy sufficient for many purposes and so is
solvent and the solution of the unknown. Since usually employed only when the additional
freezing-point depressions are not sensitive to structural information is needed. The sample to
atmospheric pressure, they are easier to measure be examined must have a high degree of crystal-
accurately than the methods described above, line order and is preferably a single crystal at
and much use has been made of them for precise least 0.1 mm in size; such samples are prepared
studies of solutes having low molecular weights. fairly readily from many inorganic and non-
The possibility of association or ionization of polymeric organic compounds. Alternatively,
the solute must be considered with any of these crystalline powders of certain crystal types may
methods, since these effects will greatly in- provide suitable results. Diffraction patterns are
fluence the result. then obtained by one of several methods, and
Osmotic pressures are so much larger than any the angular positions of the reflections are used
other colligative property that they are most to calculate the lattice spacings, and thus the
widely used for molecular weight measurements, size of the unit cell. This unit cell is the small-
particularly for long-chain polymers where the est volume unit which retains all geometrical
high sensitivity of the method is required. For features of the crystalline class, and it contains
accurate measurements, a membrane is required a small integral number of molecules. A rough
which permits the flow of solvent through its estimate of the molecular weight of the com-
pores but completely holds back solute mole- pound is needed from a determination by an
cules. This condition is best satisfied where there independent method in order to obtain this
are large differences in size of the solute and integral number. Finally, the resultant molecu-
solvent molecules or of their affinity for the lar volume is multiplied by the exact bulk den-
membrane. Membranes made from cellulose sity of the crystal and by the Avogadro number
compounds are often successfully used for poly- to yield the molecular weight (see X·RAY DIF·
mers which contain little material with molecu- FRACTION).
lar weights below about 10 000. Below this Light Scattering Measurements of the in-
molecular weight the pore size of the satisfactory tensity of light scattered by dissolved molecules
membranes is so small that solvent flow is very allow the determination of molecular weights.
slow, and thus a very long time is required to Most commonly the method is used for poly-
reach constant osmotic pressure. In spite of this mers above 10 000 units, though under optimum
handicap, some of the most precise osmotic conditions molecular weights as low as 1000
pressure measurements have been obtained with have been determined. Since the intensity scat-
aqueous solutions of sucrose and similar small tered by a given weight of dissolved material is
solutes by the use of membranes prepared by directly proportional to the mass of each mole-
precipitating such materials as copper ferrocya- cule, a weight-average value of the molecular
nide in the pores of a solid support. The upper weight is obtained for a poly disperse system.
limit of molecular weights satisfactorily mea- An average dimension of the molecule can also
ured by osmometry is usually about 500 000, be obtained by a study of the angular variation
which is fixed by the lowest pressures that can of scattered light intensity, provided some dim-
be measured precisely and by the maximum ension of the molecules exceeds a few hundred
concentrations of material which still give satis- angstroms. The interaction between dissolved
factory extrapolations to infinite dilution. In molecules substantially affects the intensity of
comparing various colligative properties for the scattered light so that extrapolation to infinite
characterization of polymers, osmometry has dilution of data collected at several polymer
the advantage that it is unaffected by the pres- concentrations is required. The method has been
ence of impurities of very low molecular weight so well developed in the last decade that it is
which will diffuse through membranes able to now probably the most used method for deter-
retain the polymer, whereas the other proper- mining absolute molecular weights of polymers.
ties are greatly affected by the same impurities. In addition, it provides information on sizes
Modern instrumentation has provided com- which is furnished by few other methods. The
mercial instruments utilizing several of these greatest problem encountered is in the removal
colligative properties for routine, accurate meas- of suspended large particles which otherwise
urements in very short time and with small would distort the angular scattering pattern of
samples. This is true for boiling point, vapor the solutions. This is rather easily accomplished
pressure, and freezing point measurements of by filtration in some cases, but it may be a
molecular weights up to several thousand, and formidable difficulty for particles which are
for membrane osmotic pressure measurements highly solvated or are peptized by the molecules
of high molecular weight samples. to be studied. Auxiliary information is required
MOLECULAR WEIGHT 774

on the refractive index increment of the sample, Archibald method, permits the determination of
i.e., the change in refractive index of the solvent weight-average molecular weights of polymers
produced by unit concentration of the sample. by analysis of the concentration gradient near
This information is supplied by a differential boundaries soon after sedimentation begins.
refractometer using the same wavelength of Chemical Analysis When reactive groups in a
light as that employed in measurements of the compound may be determined exactly and
intensity of scatter. easily, this analysis may be used to determine
The Ultracentrifuge. The sedimentation of the gram equivalent weight of the substance.
large molecules in a strong centrifugal field This is the weight in grams which combines with
enables the determination of both average mo- or is equivalent to one gram-atomic weight of
lecular weights and the distribution of molecu- hydrogen. This equivalent weight may then be
lar weights in certain systems. When a solution converted to the molecular weight by multiply-
containing polymer or other large molecules is ing by the number of groups per molecule which
centrifuged at forces up to 250 000 times gra- reacted (provided they each are also equivalent
vity, the molecules begin to settle, leaving pure to one hydrogen). If the number of reactive
solvent above a boundary which progressively groups in the molecule is not known, then one
moves toward the bottom of the cell. This of the physical methods for determining molec-
boundary is a rather sharp gradient of concen- ular weight must be used instead. The chemical
trations for molecules of uniform size, such as method is convenient and often used for the
globular proteins, but for polydisperse systems, identification of organic substances containing
the boundary is diffuse, the lowest molecular free carboxyl or amino groups which can readily
weights lagging behind the larger molecules. An be titrated, and for esters which can be saponi-
optical system is provided for viewing this fied and determinations made of the amount of
boundary, and a study as a function of the time alkali consumed in this process. The equivalent
of centrifuging yields the rate of sedimentation weights of ionic substances containing, for ex-
for the single component or for each of many ample, halide or sulfate groups may also be
components of a poly disperse system. These determined by titration or by gravimetric analy-
sedimentation rates may then be related to the sis of insolu ble compounds formed with reagents
corresponding molecular weights of the species which act in a stoichiometric fashion. In the
present after the diffusion coefficients for each titration of acids, the "neutral equivalent" is
species are determined by independent experi- the weight of material which combines with one
ments. Both the sedimentation and the diffusion equivalent of alkali, and a similar definition
rates are affected by interactions between mole- applies to the "saponification equivalent" of
cules, so that each must be studied as a function esters. If only one carboxyl or ester group is
of concentration and extrapolated to infinite present in the molecule, these values equal the
dilution as is done for the colligative properties. molecular weight of the compound.
The result of this detailed work is the distribu- In a similar way, if the terminal groups on
tion of molecular weights in the sample which polymer chains can be determined by a chemi-
is available by few other methods. At present, cal reaction without affecting other groups in
this method is only partly satisfactory for mole- the molecule, the equivalent weight or molecu-
cular weight determinations with linear poly- lar weight of the polymer may be obtained in
mers because of the large concentration depen- certain cases. For polydisperse systems, a num-
dence of the diffusion coefficients. Difficulties ber-average value of the molecular weight is
have been found in reliably extrapolating dif- obtained because the process essentially counts
fusion coefficients beyond the lowest polymer the total number of groups per unit weight of
concentrations which are experimentally attain- sample. Since the method depends on the effect
able at present. of a single group in a long chain, its sensitivity
A modification of the sedimentation method decreases as the molecular weight rises, and so
which avoids the study of diffusion constants is is seldom applicable above molecular weights of
the sedimentation equilibrium method in which 20 000. Particularly at high molecular weights,
molecules are allowed to sediment in a much the method is very sensitive to small amounts of
weaker field. Under these conditions, the sedi- impurities which can react with the testing
menting force is balanced by the force of dif- reagent, so that careful purification of samples
fusion, so that after times from a day to two is desired.
weeks molecules of each size reach different It is also important to know that impurities or
equilibrium positions, and the optical measure- competing mechanisms of polymerization do
ment of the concentration of polymer at each not lead to branching or other processes which
point gives the molecular weight distribution may provide greater or fewer reactive groups per
directly. However, again extrapolation to infinite molecule. The analysis for end groups must be
dilution must be used to overcome interaction carried out under mild conditions which do not
effects. The chief difficulty here is the long degrade the polymer, since this would also lead
time of centrifuging required, and the necessary to lower molecular weight values than expected.
stability of the apparatus during the period. Labeling of end groups either with radioactive
A newer and somewhat faster technique, the isotopes or with heavy isotopes which can be
775 MOLECULES AND MOLECULAR STRUCTURE

analyzed with the mass spectrometer provides a transfer, as in (a), to give an electrovalent bond,
rapid and convenient analysis for end groups. resulting from Coulombic attraction between
This labeling can be accomplished with a labeled the oppositely charged ions
initiator if this remains at the chain ends, or
after polymerization is complete, by exchange R
of weakly bonded groups with similar groups in "+" -
:CI:CI: R:N:O:
a labeled compound. Molecular weight determi-
nations by end group analysis are often used R
for condensation polymers of lower molecular (a) (b) (R =Gb)
weights and are especially valuable in studying (c)
degradation processes in polymers.
(W. Kossel, 1916); or by electron sharing, as in
GEORGE L. BEYER (b) and (c), to give a covalent bond (G. N.
Lewis, 1916). In (b), each chlorine atom
donates one electron to form a homopolar bond,
References
which is written Cl-Cl where the bar denotes
Daniels, F., Williams, J. W., Bender, P., Alberty, R. A., on this theory one single bond, or shared elec-
Cornwell, C. D., Harriman, J. E., "Experimental . tron pair. In (c), the nitrogen-oxygen bond is
Physical Chemistry," Seventh Edition, New York, formed by two electrons donated by only the
McGraw-Hill Book Company, 1970. nitrogen atom, giving a semipolar, or coordinate-
Billmeyer, F. W., Jr., "Textbook of Polymer Science," covalent bond, which is written R3 N ~ 0, and
New York, Wiley-Interscience, 1971 . which is electrically polarized. Double or triple
Scholte, T. G., in "Polymer Molecular Weights," bonds result from the sharing of four or six
(p. E. Slade, Jr., Ed.), Part II, New York, Marcel electrons between adjacent atoms, as in ethyl-
Dekker, 1975. ene (d) and acetylene (e) respectively.
Wells, A. F ., "Structural Inorganic Chemistry," Fourth
Edition, New York, Oxford Univ. Press, 1975. H H
Cross-references: ATOMIC PHYSICS; BOND, CHEM- \ C=C/ H-C=:C-H
ICAL; CENTRIFUGE ; LIGHT SCATTERING; MOLE-
CULES AND MOLECULAR STRUCTURE; OSMO-
H
/ \H
SIS; POLYMER PHYSICS; VAPOR PRESSURE AND
EVAPORATION; X-RAY DIFFRACTION. (d) (e)

However, difficulties arise in describing the


MOLECULES AND MOLECULAR structures of many molecules in this fashion.
STRUCTURE For example, in benzene (C6 H6 ), a typical
aromatic compound, the carbon nuclei form a
A molecule is a local assembly of atomic nuclei plane regular hexagon, but the electrons can
and electrons in a state of dynamic stability. only be conventionally written as forming alter-
The cohesive forces are electrostatic, but, in nate single and double bonds between them.
addition, relatively small electromagnetic inter- Furthermore, an electron cannot be identified
actions may occur between the spin and orbital as coming specifically from any of these bonds
motions of the electrons, especially in the neigh- upon ionization. Such difficulties disappear in
borhood of heavy nuclei. The internuclear the quantum-mechanical theory of a polyatomic
separations are of the order of I to 2 X 10-10 molecule, whose electronic wave function can
metres, and the energies required to dissociate a be constructed from nonlocalized electron orbit-
stable molecule into smaller fragments fall into als extending over all of the nuclei. The concept
the I to 5 eV range. The simplest diatomic of valency is not basic to this theory, but is
species is the hydrogen molecule-ion H2+ with simply a convenient approximation by which
two nuclei and one electron. At the other the electron density distribution is partitioned
extreme, the protein ribonuclease contains 1876 in different regions in the molecule.
nuclei and 7396 electrons per molecule. Molecular compounds consist of two or more
Historically, molecules were regarded as being stable species held together by weak forces. In
formed by the association of individual atoms. clathrates, a gaseous substance such as S02,
This led to the concept of valency, i.e., the HCI, CO 2 or a rare gas is held in the crystal
number of individual chemical bonds or link- lattice of a solid, such as ~-quinol, by van der
ages with which a particular atom can attach Waals-London dispersion forces. The gas hy-
itself to other atoms. When the electronic theory drates, e.g., Cl 2 • 6H 2 0, contain halogen mole-
of the atom was developed, these bonds were cules similarly trapped in ice-like structures.
interpreted in terms of the behavior of the The hydrogen bond, with energy'" 0.25 eV, is
valence, or outer shell, electrons of the com- responsible not only for the high degree of
bining atoms. Each atom with a partly filled molecular association in liquids such as water
valence shell attempts to acquire a completed (O-H---O-H---) but also for such
octet of outer electrons, either by electron molecules as the formic acid dimer
MOLECULES AND MOLECULAR STRUCTURE 776

O-H---O the ion X+ and pass this through a mass spectro-

H-C
/ \ C-H graph to determine the molecular weight.
(2) The spatial distribution of the nuclei in
\ O---H-O/ their mean eqUilibrium or "rest" positions. At
an elementary level, this is described in geomet-
rical language. For example, in carbon tetra-
chloride, CCI4 , the four chlorine nuclei are
which contains two hydrogen bonds indicated disposed at the comers of a regular tetrahedron,
by dashed lines. Molecular complexes vary and the carbon nucleus is at the center. In the
greatly in their stability; in donor-acceptor [CoCl4 ] 2- ion, the arrangement of the chlorine
complexes, electronic charge is transferred from nuclei about the central metal nucleus is also
the donor (e.g., NH 3) to the acceptor (e.g., tetrahedral, whereas in [PdCl4 ] 2- it is planar.
BF 3), as in a semipolar bond. The BF3 . NH3
complex has a binding energy with respect to At a more sophisticated level, each molecule is
dissociation into NH3 and BF3 of 1.8 eV. The classified under a symmetry point group. Most
bond here is relatively strong; the electron trans- nonlinear molecules possess only 1, 2, 3, 4 or
fer can occur between the components in their 6-fold rotation axes, and belong to one of the
electronic ground states. On the other hand, in 32 crystallographic point groups. For example,
weaker complexes such as C6H6 - 12 , with bind- the pyramidal ammonia molecule NH3 has a
ing energy of about 0.06 eV, there is only a threefold rotation axis C3 through the nitrogen
fractional transfer of charge from benzene to nucleus and three reflection planes av inter-
iodine. The actual ionic charge-transfer state lies secting at this axis, and belongs to the C3v (3m)
at much higher energy than the ground state of point grouJ2.: Tetrahedral molecules CX 4 belong
the complex. to the Td(43m) point group. Linear diatomic
The discovery of XePtF 6 by Bartlett (1962) and polyatomic molecules belong to either of
has been followed by the synthesis of many the continuous point groups Dooh or Coov accord-
other rare gas compounds whose existence was ing to whether a center of symmetry is present
not predicted by classical valency theories. or not.
Compounds such as XeF 2 , XeF4 , XeF6 and The symmetry classification does not define
XeOF 4 are quite stable, the average Xe-F bond the geometry of a molecule completely. The
energy in the square planar molecule XeF 4 be- values of certain bond lengths or angles must
ing 1.4 eV. also be specified. In carbon tetrachloride, it is
A molecule X is characterized by: sufficient to give the C-Cl distance (1.77 X
(1) A stoichiometric formula AaBbCc'" 10- 10 meters) since classification under the T d
where a,b,c,' .. are the numbers of atoms of point group implies that all four of these bonds
elements A,B,C, ... that it contains. The ratio have e9ual length and the angle between them
a : b : c : ... is found by chemical analysis for is 109 28'. In ammonia, both the N-H dis-
these elements. The absolute values of a, b, c, tance (1.015 X 10-10 meters) and the angle
... are then fixed by determination of the molec- HNH (107°) must be specified. In general, the
ular wei$.ht of X. For a volatile substanc~ the lower the molecular symmetry, the greater is
gas denSIty of X and of a gas of known molecular the number of such independent parameters
weight are compared at the same temperature required to characterize the geometry. Informa-
and pressure. The molecular weights are in the tion about the symmetry and internal dimen-
ratio of the gas densities, since Avogadro's prin- sions .of a molecule is obtained experimentally
ciple states that equal volumes of gases at the by SPECTROSCOPY, ELECTRON DIFFRACTION,
same temperature and pressure contain the same NEUTRON DIFFRACTION, X-RAY DIFFRAC-
numbers of molecules. For a nonvolatile sub- TION, and MAGNETIC RESONANCE. (See these
stance, a known weight can be dissolved in a topics for details.) Nuclear magnetic resonance
solvent, and the resultant lowering of vapor (NMR) is widely used to study molecular struc-
pressure, elevation of the boiling point, or de- ture since it gives information about both the
pression of the freezing point of the solvent can chemical environment of a given nucleus in a
be measured. Each of these properties depends molecule and also the disposition of neighbor-
upon the number of molecules of solute present, ing nuclei. While commonly employed on pro-
so the number of molecules per unit weight of tons, its use is increasing for other nuclei with
X is found and, hence, the molecular weight. nonzero spin angular momentum.
For substances of high molecular weight such (3) The dynamical state is defined by the
as proteins (molecular weight '" 34 000-200- values of certain observables associated with
000) or polymers, the molecular weight is found orbital and spin motions of the electrons and
from osmotic pressure measurements or the rate with vibration and rotation of the nuclei, and
of sedimentation in a centrifuge. The molecular also by symmetry properties of the correspond-
weight of a molecule in crystalline form is de- ing stationary-state wave functions. Except for
termined when the density of the crystal and cases when heavy nuclei are present, the total
the dimensions of the unit cell from x-ray electron spin angular momentum of a molecule
analysis are both known. Finally, for stable is separately conserved with magnitude Sfz, and
volatile compounds, it is often possible to form molecular states are classified as singlet, doublet,
777 MOLECULES AND MOLECULAR STRUCTURE

triplet, ... according to the value of the multi- >


Ia) or an oblate (Ie h =Ia) symmetric top; if
plicity (2S + I). This is shown by a prefix super- Ie> h > la, the top is asymmetric. Symmetric
script to the term symbol, as in atoms. top molecules have Cn symmetry axes with
The Born-Oppenheimer approximation per- n ;;;;. 3 and belong to point groups with degene-
mits the molecular Hamiltonian H to be sepa- rate representations. The component K1i of
rated into a component He that depends only on rotational angular momentum about the top
the coordinates of the electrons relative to the axis is conserved and the rotational levels are
nuclei plus a component depending upon the also characterized by the value of the quantum
nuclear coordinates, which in turn can be number K = 0, 1, 2, ... J. A symmetry classifi-
written as a sum Hv + Hr of terms for vibration- cation is made for 'I1r under the rotational sub-
al and rotational motion of the nuclei (we may group of the molecular point group. Finally,
ignore translation here). The eigenfunctions '11 each eigenstate is described as + or - according
of H may correspondingly be factorized as the to the parity of '11 under inversion in a space-
product 'I1e 'I1v 'I1 r of eigenfunctions of these fixed coordinate system.
three operators, and the eigenvalues E decom- (4) In order to distinguish between different
posed as the sum Ee + Ev + Er. In general, we electronic states '11 e of the same symmetry and
find Ee > Ev > Er. spin multiplicity, a further classification is ob-
Electronic states of molecules are classified tained by expanding 'I1e as a product of n single-
according to the symmetry properties of 'I1e electron wave functions cfJi, each a function of
(which forms a basis for an irreducible represen- the coordinates of one of the n electrons in
tation of the molecular point group). Thus 3 B lu the molecule.
is a term symbol for benzene (D6h point group)
that denotes a triplet electronic state whose
wave function transforms like the Blu represen-
tation of the group. In the case of diatomic and where (n !)-1/2 is a normalization factor. Each of
linear polyatomic molecules, the term symbol the molecular orbitals (MO's) cfJi is constructed
shows the magnitude of the conserved compo- to transform like an irreducible representation
nent of orbital electronic angular momentum of the molecular point group and is usually
M about the axis, states being classified as ~, formed by linear combination of atomic orbitals
II, A,'" according to A =0, I, 2, .. '. The (LCAO) Xi ce'ntered upon the individual nuclei
superscript + or - shows the behavior of'l1e for
a linear molecule upon reflection in a plane
containing the molecular axis; for centrosym-
metric linear molecules (Dooh point group) the
subscript g or u shows the parity +I or - I re-
spectively for '11 e with respect to inversion at the The MO's are written in order of decreasing
center. energy necessary to ionize the electrons which
The vibrational wavefunction 'I1v can be ap- occupy them, and electrons are assigned to the
proximated by a product of 3N - 6 harmonic MO's in accordance with the Pauli principle.
oscillator wave functions t/Ji, each a function of For example, the electronic ground state of
a normal displacement coordinate Qi, ammonia (C 3V point group) is written

n
3N-6
'I1v = t/Ji(Qi)
i =1 where the superscripts show the distribution of
the ten electrons among three MO's of al sym-
The product is (3N - 5), for a linear molecule; metry and one of e symmetry, the electrons in
N is the number of nuclei. Each oscillatory the (3al) orbital being most readily ionized. The
mode can be excited with quanta Vi = 0, I, 2, symmetry of the resultant molecular wave-
.... When Vi = 0, t/Ji transforms like the totally function 'I1e is found by taking direct products
symmetrical representation of the molecular for each orbital occupied by an electron. Here
point group; when Vi = 1, t/Ji transforms like Qi. 'I1e belongs to the totally symmetrical represen-
The symmetry of 'I1v under the molecular point tation (and is also singlet). Excited electronic
group is found from the direct product for all states are obtained by promoting electrons into

°
the t/Ji. The vibrationless ground state with VI =
V2 =... = is always totally symmetrical.
Each rotational state is characterized by a
orbitals with higher energies, but the molecular
symmetry in such states often differs from that
in the ground state, as a result of changes in
value for the quantum number J, where J(J + I) geometry.
fj2 is the squared angular momentum for rota- In calculations of molecular properties, the
tion of the nuclei (apart from spin). If la, Ib MO's cfJi can be improved by variational methods
and Ie denote the moments about the principal which make them satisfy the Hartree-Fock
axes of inertia of the molecule, then a spherical equations. This gives self-consistent field (SCF)
top has I a =I b =Ie; a molecule with two princi- MO's, yielding a better wavefunction 'I1e. How-
pal moments equal is either a prolate (Ie =Ib > ever the latter is still, in practice, constructed
MOLECULES AND MOLECULAR STRUCTURE 778

from an incomplete set of basic functions. was much larger than the natural width r, where
Further improvement is achieved by configur- r is related to the half-life TII2 of the excited
ation interaction (CI), in which 'lie's of the nuclear level by the expression r TII2 = 4.55 X
same symmetry are allowed to mix in linear 10- 16 e V sec. In fact, techniques had been devel-
combination. oped to compensate for the recoil energy loss
by applying large Doppler shifts with an ultra-
G. W. KING centrifuge or through thermal motion. These
methods necessarily broaden the intrinsically
References narrow lines thereby reducing the absorption
Burdett, J. K., "Molecular Shapes," New York, John cross section.
Wiley & Sons, 1980. Mossbauer discovered that in some cases these
Drago, R. S., "Physical Methods in Chemistry," Phila- difficulties may be removed by embedding the
delphia, W. B. Saunders Co., 1977. source and absorber nuclei in a crystal. Being
Gillespie, R. J., "Molecular Geometry," London, Van part of a quantized vibrational system, these
Nostrand-Rheinhold, 1972. nuclei interact with the lattice by exchange of
King, G. W., "Spectroscopy and Molecular Structure," vibrational quanta or phonons only. If the
New York, Holt, Rinehart and Winston, Inc., 1964. characteristic phonon energy is large compared
Levine, I. N., "Molecular Spectroscopy," New York, to the recoil energy R for a free nucleus, the
John Wiley & Sons, 1975. probability for the emission of a gamma ray
without a change in the vibrational state of the
Cross-references: BOND, CHEMICAL; ELECTRON lattice is large. For such a zero phonon transi-
DIFFRACTION; INTERMOLECULAR FORCES; tion, the lattice as a whole adsorbs the recoil
MAGNETIC RESONANCE; MOLECULAR WEIGHT; momentum and the recoil energy is negligibly
NEUTRON 9IFFRACTION; QUANTUM THEORY; small. At the same time, the emission and
SPECTROSaOPY; X-RAY DIFFRACTION. absorption lines achieve the natural width r.
For an atom bound by harmonic forces, the
fraction f of events without recoil ener~ loss is
given by f = exp( - 4rr2 (x 2 )(A 2 ). Here (x ) is the
MOSSBAUER EFFECT mean square displacement of the radiating atom
The Mossbauer effect is the phenomenon of taken along the direction of the photon with
recoilless resonance fluorescence of gamma rays wavelength A. In an environment of lower than
from nuclei bound in solids. It was first dis- cubic symmetry, (Xl), and therefore f, may be
covered in 1958 and brought its discoverer, anisotropic. A large recoilless fraction may be
Rudolf L. Mossbauer, the Nobel prize for physics obtained when (Xl) is small and A large. The
in 1961 . The extreme sharpness of the recoilless former condition implies small vibrational am-
gamma transitions and the relative ease and plitude and thus low temperature, high vibra-
accuracy in observing small energy differences tional frequency and large mass M, while the
make the Mossbauer effect an important tool in latter implies low photon energy, £. Both condi-
chemistry, solid-state phYSiCS, nuclear physics, tions imply small recoil energy R.
biophysics, metallurgy, and mineralogy. Recoilless transitions can also occur in amor-
Resonance fluorescence involves the excitation phous substances like glasses and high-viscosity
of a quantized system (the absorber) from its liquids. For the latter, the diffusive motion
ground state (0) to an excited state (1) by superimposed on the thermal vibration results
absorption of a photon emitted from an identical in a broadening of the Mossbauer line.
system (the source) decaying from state (I) to For all Mossbauer isotopes, the nuclear half-
(0). Not every nucleus has a suitable gamma life T 112, typically 10-8 second, is very long
transition; however, the Mossbauer effect has compared to the period of the lattice vibrations,
been observed in more than 60 different iso- typically 10- 13 second. A conceivable first-order
topes. The parameters characterizing the nuclear Doppler shift of the Mossbauer line due to the
resonance process for some typical isotopes are thermal motion will therefore average out to
illustrated in Fig. I. zero. The second-order Doppler effect, however,
To conserve energy and momentum in the leads to an observable shift, sometimes called
emission and absorption processes, each system, the temperature shift. The photons emitted by
the source and absorber, must acquire a recoil a source nucleus moving with a mean square
energy R equal to £2 /2Mc l , where £ is the velocity (vi) are lower in energy by a fraction
photon energy, M is the mass of the recoiling (vi)/2c 2 as compared to the photons emitted at
system and c is the speed of light. The energy rest. Similarly the transition energy of a vibrating
available for the excitation of the absorber is absorber nucleus appears lower to the incident
thus reduced by 2R, and resonance fluorescence photon by a fraction (vi)/2c 2 • In principle, the
can be achieved only if the missing energy 2R is two shifts may be different whenever the source
not larger than the widths of the levels involved. and absorber are of different composition
Before 1958, it was thought that for all gamma and/or temperature.
transitions the width required to get overlap Mossbauer performed his original experiment
between the emission and the absorption line with 1911r at 88 K, obtaining a recoilless frac-
779 MOSSBAUER EFFECT

SCALE OF ENERGIES AND RELATED QUANTITIES IN MOSSBAUER EFFECT

ENERGY (ev)
1"10 I()' 10 "' 10
"7
"'
10 10 ", "4
10
"5 "Z
10
"I
10

10'
I
10'
I
li/
I
10'
I f IplO I~" HALFLIFE Tl;z (SEC)
FOR NUCLEAR LEVEL
WAVELENGTH ). (1) 10
FOR PHOTON OF I
OF WIDTH r(eV) ENERGY Ey(eV)
,,«to I:' ,.,"I-

Iff
~ ~~

INTRINSIC LINE WIDTH r (eV) PHOTON ENERGY E yleV)


r·~ln2JTl!z Ey&ch/).
I-----------Eyl r z 3.2 x 10 IZ
_ _ _ _ _ _ _ _ _ _ _ _-1 I
MAXIMUM RESONANCE 1Oil 0"11 p" zo
CROSS SECTION ero (cmz) 1-~-ll--Illf--Ilf--Il
FOR PHOTON OF ENERGY
LARGEST OBSERVED
Ey(eV), 00" ).z/2."
HYPERFINE
SPLITTING lIE MAX (eV)

FREE RECOIL ENERGY R (eV)


RECOILLESS FRACTION f AND
R • EI /2MC z SECOND-ORDER DOPPLER SHIFT
r (y2) 12c 2 AS GIVEN BY THE
DEBYE MODEL

CHARACTERISTIC PHONON ENERGY


f :exp {_§1i [1..+ (~)2!.~~ X]}
8• DEBYE TEMPERATURE k8 4 8) e'-I
o elT
(y 2) : 9k a [I + (T)4 ( x3 dx ]
~ 2Mc 2 '5 ti 'j " - I
TEMPERATURE T (0 K) o
EQUIVALENT TO
ENERGY E' kT(eV)

FIG. 1

tion of 1 per cent. Since the natural line width gamma ray energy l!.'. The Mossbauer effect
in 191 Ir, as in most other Mossbauer nuclides, is therefore enables one to detect extremely small
extremely narrow, Mossbauer was able to alter changes in this energy. One of the earliest appli-
the degree of overlap between the emission and cations of this great precision was the laboratory
absorption lines by simply moving the source verification of the gravitational red shift by
relative to the absorber at speeds u of the order Pound and Rebka. According to Einstein's
of 1 mm/sec. Thus the gamma rays were slightly theory, photons have an apparent mass m =
shifted in energy via the first-order Doppler E/e2 • Thus if they fall toward the earth through
effect by an amount AE = Eu/e. By plotting the a distance H, their ener~ increases by AE =
transmission through the absorber as a function mgH, so that AE/E = gH/e ~ 10- 16 per meter.
of the relative source-absorber velocity, one thus Using 57 Fe, which has a large recoilless fraction,
obtains the characteristic Mossbauer velocity and for which r/E = 3 X 10- 13 , the desired ef-
spectrum which exhibits the shape of the reso- fect was observed when the photons were sent
nance curve. From such a plot, one can deter- down the 2 2-meter tower at Harvard University.
mine the recoilless fraction, the lifetime of the It is well known from optical and high-fre-
excited state and any possible energy differences quency spectroscopy that a nucleus interacting
between the emission and the absorption line. with its environment through its charge distri-
With extreme care, it is possible to determine bution and magnetic moment can give rise to
energy differences of the order of 1/1000 of hyperfine shifts and splittings of the order of
the line width r. The latter t{JJically varies with 10- 9 eV to 10-5 eV" In Mossbauer experiments,
isotope from 10- 10 to 10- times the actual such energy differences can readily be measured
NUCLEAR HYPERFINE INTERACTION FOR 57Fe :::
0'
<Il
<Il
MULTI POLE ORDER ELECTRIC MONOPOLE: ISOMER SHIFT MAGNETIC DIPOLE: ZEEMAN SPLITTING ELECTRIC QUADRUPOLE
.-=
c::
!O\
NUCLEAR PROPERTY CHANGE IN CHARGE RADIUS !B MAGNETIC MOMENT ,... ELECTRIC QUADRUPOLE MOMENT Q
R !O\
"..,..,
ATOMIC PROPERTY s- ELECTRON DENSITY 1+(0)1 2 INTERNAL MAGNETIC FIELD H(o) ELECTRIC FIELD 'GRADIENT q !O\
(")
...
_ ,... H(o) I Z 2 3d-l0+11
INTERACTION ENERGY l,.S." Ea-Es: 4; ze2R2(.y~t(O)I~-lt(o)l: E M- I EO: eqQ 41(21-1)

...
-l z
SOURCE ABSORBER ± 3tz
ENERGY LEVEL ./ 1 EXC' 3,12,/ 1 EXC' 3/ 2 . /
EXCITEO (I~MERI~ < ± ~2 I

DIAGRAM STATE: Ruc


I
EO
ES

WITH GAMMA
~IItrtl :l 1 !
GROUND
STATE:R~
I I ~2 I ONO' 1/2 /
I I H2
TRANSITIONS / (nIH!
POINT NUCLEUS fiNITE NUCLEUS I. NO , 1 / 2 / _1/2 ISOMER SHIFT + EL. FIELD GRADIENT
ALLOWED BY ISOMER SHIFT + MAGNETIC FIELD
EXAMPLE; 57 Fe IN Pt VS. 57Fe IN KFe F3 EXAMPLE: 57Fa IN IRON EXAMPLE· 57Fe IN Fe SO. ' 7 H2 0 AT 78' k
SELECTION RULES SR. RoC-RONo'.OOOI \+(OI\~-lojlloll~"17 16' ""G_o' 0 090,.... , 1-'0c'·0 1551-'_, H(el' 330kOe Que' 0.2 x 10'24 cm Z, eq' 22 I 10" Vlcm z
"If R ., i\jl(ol\Z . •

VELOCITY (em/sec) VELOCITY (em/sec) VELOCITY (em/s.c)


·0.2 -0.1 0 0.1 0.2 ·0.6 ·0.4 -0.2 0 0.2 0.4 0.6 0.1 a 0.1 0.2 0.3 0.4
TYPICAL 1. S. I/z/QQ
z z z
0 0 1 s. I
Ui 1.0 r- ....L.. .' Ui 1.0 ~ 1.0
III III III
MOSSBAUER ~~j
" III
i . .. i
IJ)
~
III ." . i I"" \
~0.9 r- ~ 0.9 ~ 0.9
SPECTRA a:
...
",0.8
.... •
a:
...
",0.8~
[\ (i t .. ~\ r
a:V
... 0.8
V
> ~ ~
...
.. '"> SOURCE: 57 Co IN STAINLESS STEEL
~-,0.7 : SOURCE 57eo IN PLATINUM ~0.7~ ~ 0.7
a::: -'
a:::
'"
f-
ABSORBER: KFe F3
l( SOURCE. 57Co IN STAINLESS STEEL '"a: f-
ABSORBER: FeSO, .7H z O AT 78'K

....
ABSORBER' 57 Fe IN IRON
:l:

FIG. 2
781 MOSSBAUER EFFECT

since the line width of the recoilless transitions a measurement of the intensity distribution,
is of the same order of magnitude. Perhaps, one therefore obtains the orientation of the
therefore, the most useful feature of the Moss- internal magnetic field. The temperature depen-
bauer effect is that it may be used to obtain dence of the splitting can yield Neel and Curie
nuclear properties if the fields acting on the temperatures and also relaxation times.
nucleus are known, and conversely, it is a Whenever one of the nuclear levels possesses
powerful tool for probing solids once the a quadrupole moment and an electric field
various interactions are calibrated, i.e., the gradient exists at the position of the nucleus,
nuclear properties have been determined. Some quadrupole splitting of the Mossbauer spectrum
representative results obtained with 57 Fe are may be observed. If the quadrupole moment is
illustrated in Fig. 2. known either for the ground state or for the
The most basic of these interactions is the excited state, then a Mossbauer measurement
effect of the finite nuclear size which, in general, will readily yield the parameters of the field
is different for the ground state and the excited gradient tensor. Usually, however, the quad-
state. The electrostatic interaction of the nuclear rupole moment is not known, and the field
charge with the s-electrons overlapping it raises gradient tensor must be determined from other
the nuclear energy levels by an amount depend- work or else calculated from first principles.
ing on the charge radii and s-electron density at This tensor exists whenever the symmetry of
the nucleus. Therefore under proper conditions, the surrounding charge distribution is lower
there appears a shift in the Mossbauer resonance, than cubic, and it is generally specified by two
the isomer shift, which is proportional to (oR/R) independent parameters. This tensor is easiest
011/1(0)1 2 , where oR/R is the fractional change to calculate for cases of axial symmetry, in
in the nuclear radius during the decay and which it is characterized by one parameter, the
011/1(0)12 is the difference in s-electron density field gradient, q. For simple ionic systems, it is
between source and absorber. To determine the possible to estimate q with some degree of cer-
quantity oR/R, one compares the isomer shifts tainty, and thereby determine the quadrupole
of two chemically simple absorbers, for which moment. Once this is done, the Mossbauer effect
the s-electron density can be calculated. In the may be used to measure field gradient tensors in
case of 57 Fe an isomer shift exists between com- more complicated systems. Such measurements
pounds containing ferric ions, Fe 3 +(3d S ), and yield information about crystalline symmetries,
ferrous ions, Fe 2 +(3d 6 ). Although the number crystalline field splittings, shielding due to closed
of s-electrons is the same for both ions, a detailed shell electrons, relaxation phenomena and chem-
calculation shows that the shielding through the ical bonding. In addition, with single crystals, a
additional 3d electron changes the 3s density at study of the relative intensity of the various lines
the nucleus. For 129 1, the isomer shifts observed of the resonance spectrum as a function of an~e
among different alkali iodides can be related can yield information about the orientation of
quantitatively to the known transfer of 5p the crystalline field axes and, thus, the orienta-
electrons to the ligands which affects the 5s tion of complexes in solids.
density at the nucleus. Once calibrated, the In cases where both magnetic and quadrupole
isomer shift is a tool for measuring s-electron splittings are present the spectrum depends
densities and is therefore of use in studying markedly upon the relative orientation between
chemical bonding, energy bands in solids, and the hyperfine magnetic field and the axes of the
also in identifying charge states of a given atom. electric field gradient tensor. Paramagnetic
One of the early successes of the Mossbauer complexes of lower than cubic symmetry and
effect was the observation of the completely many magnetically ordered compounds are of
resolved nuclear Zeeman splitting arising from this type. Careful quantitative analysis can then
the magnetic hyperfine interaction of s7Fe in yield the magnitude as well as the relative orien-
ferromagnetic iron. For this isotope, as well as tation of the hyperfine interactions. There are
for most other Mossbauer isotopes, the magnetic also disordered systems such as alloys, amor-
moment of the nuclear ground state is known phous solids, and especially spin-glasses, in which
from magnetic resonance experiments, and the each atom has a slighty different environment.
calibration is therefore straightforward. Care- In such cases s7Fe or other Mossbauer nuclides
ful analysis of the velocity spectrum for mag- have been used to probe the distribution of
netic samples is sufficient in general to reveal isomer shifts, local magnetic fields, and/or elec-
both the desired magnetic moment and internal tric field gradients. Moreover, the hyperfine
magnetic field. The latter yields important in- interactions may be nonstationary, for instance
formation about the unpaired spin density at as a result of spin fluctuations, diffusion, or
the nucleus, which in turn is related to the other time dependent processes. Such systems
exchange interaction in crystals, molecular have been successfully treated using dynamical
complexes, metals and alloys. For single crys- models of the Mossbauer line shape.
tals or magnetized samples, the intensities of This article has only covered the basic features
the individual lines of the Mossbauer spectrum of the Mossbauer effect and the phenomena
depend on the angle between the direction of which affect the Mossbauer velocity spectrum
the internal field and the emitted photon. From in a general way. The actual application of the
MOSSBAUER EFFECT 782

effect is extremely far reaching, embracing not


only almost all areas of physics but also the
fields of chemistry, biology, geology, metal~
lurgy, and engineering. The reader is advised to
consult the references for more information.
R. INGALLS
P.DEBRUNNER

References CURRENT CURRENT


Mossbauer, R. L.,Science, 137,731 (1962). DOWN, INTO Up, OUT FROM
"Mossbauer Effect: Selected Reprints," New York, PAPER PAPER
American Institute of Physics, 1963. FIG. 1. Direction of force due to current in a mag-
Stevens, J. G., and Stevens, V. E. (Eds.), "Mossbauer
netic field.
Effect Data Index," New York, IFI/Plenum, 1966-
present.
Goldanskii, V. I., and Herber, R. H. (Eds.), "Chemical F = Bil newtons. Fig. 1 shows the directions
Applications to Mossbauer Spectroscopy," New of current and force.
York, Academic Press, 1964.
Greenwood, N. W., and Gibb, T. C., "Mossbauer Spec- (3) Magnetic structures tend to move to the
position of minimum reluctance (maximum
troscopy," London, Chapman and Hall, Ltd., 1971. inductance) with force F = dw/dx, where w is
Gonser, U. (Ed .), "Mossbauer Spectroscopy," Berlin,
stored magnetic energy and x is distance.
Springer-Verlag, 1975. (4) The force between two coupled circuits
Shenoy, G. K., and Wagner, F. E. (Eds.), "Mossbauer is
Isomer Shifts," Amsterdam, North-Holland, 1978.
Cohen, R. L., "Applications of Mossbauer Spectro-
scopy," Vol. 2, New York, Academic Press, 1980.
Gonser, U. (Ed.), "Mossbauer Spectroscopy II: The
Exotic Side of the Method," Berlin, Springer-
Verlag, 1981. Frequently L 1 and/or L2 are constant and they
make no contribution to the force. Here i 1 and
Cross-references: CONSERVATION LAWS AND i2 are currents, Ll and L2 are total self-induc-
SYMMETRY, DOPPLER EFFECT, ISOTOPES, LU- tances, and M is mutual inductance.
MINESCENCE, PHONONS, RADIOACTIVITY, ZEE-
MAN AND STARK EFFECTS. Symbols
B = flux density in Tesla
F = force in direction x in newtons
MOTORS, ELECTRIC i, ii, i2 = instantaneous currents in amperes
History Power conversion was discovered by Ia,h = dc and effective currents in amperes
M. Faraday in 1831; the commutator, by L 1 , L 2, M =inductances in henrys
J. Henry, Pixii, and C. Wheatstone (1841); the s = slip = (Syn. rpm - rpm)/Syn. rpm
electromagnetic field, by J. Brett (1840), Wheat- u = velocity in meters per second
stone and Cooke (1845), and W. von Siemens w = energy in joules
(1867); drum armatures, by Siemens, Pacinotti, 1, x =length, distance, in meters
and von Alteneck; ring armatures, by Gramme rpa = useful flux per pole in webers
(1870); and disc armatures, by Desroziers (1885) rpm = maximum flux per pole in webers
and Fritsche (1890). Ring and disk types are Motor Types Of many hundreds, the most
now seldom used. Revolving magnetic fields used are:
(1885) and ac theory were discovered by G. (1) Direct Current (a) Series. The field coils
Ferraris; polyphase motors and systems, by are of heavy wire in series with the armature.
N. Tesla (1888); the squirrel-cage rotor, by The torque and current are high at low speeds,
C. S. Bradley (1889); and ac commutator and low at high speeds. Torque and speed vary
motors, by R. Eickmeyer, E. Thomson, L. inversely as a square-law function. These motors
Atkinson, and others. will run away at light loads unless a speed-
Principles These are explained by the laws limiting device is used.
of Ohm, Kirchoff, Lenz and Maxwell; specif- (b) Shunt. The field coils are of fine wire in
ically: parallel with the armature. The speed drops
(1) Moving a conductor of length I across a slightly, and the current and torque increase
magnetic flux field of density B with a velocity with load. Many small motors (up to 15 hp) use
v generates in the conductor an electromotive permanent magnets for the field, particularly
force (emf) e = vBI volts. In motors, e opposes where high acceleration rates are necessary.
the current i and decreases as load increases. (c) Compound. Both shunt and series fields
(2) The force on such a conductor equals are used in the same motor. Behavior is inter-
783 MOTORS, ELECTRIC

mediate between (a) and (b). The armature velocity in the same direction as the rotor and
current in all dc motors is described as Ia = pulsates between the limits rJ>m and rJ>m (1 - s).
(VT - Ea)/R a, where Ea = prJ>aZaS/(ma X 60); The stator current is the resultant of (stator +
VT = terminal voltage, Ea = counter electro- cross axis) magnetizing + load + loss currents.
motive force (cemf), Ra = armature resistance, Analysis of this type of motor is less simple
ma = number of paths, and Za = number of than that for a polyphase motor.
conductors, all for the armature. In addition, Starting is with a line switch, reduced voltage
p = number of poles, S = rpm, rJ>a = useful flux (auto-transformer or "compensator," wye-delta,
per pole. rJ>a may be nearly constant or it may series resistors, or chokes), wound rotors and
be a function of Ia. resistors, part-windings, or more elaborate
Speed control is achieved by adjusting field schemes. For single-phase motors, auxiliary
current and/or armature terminal voltage, tapped start windings (split-phase or capacitor), or
series coils, or (more rarely) field reluctance or repulsion-start are used and are removed at
double commutator. To avoid damage from approximately 65% of full speed. For very small,
excessive currents during starting, either start- low-torque motors, shaded poles are used.
ing resistances or voltage controls are needed, Speed control has historically been by reduced
except for very small motors. voltage, wound rotor, or elaborate degenerative
(2) Alternating Current (a) Polyphase in- feedback schemes. State-of-the-art is variable-
duction: These are usually three-phase, with voltage, variable-frequency electronic inverters.
phase windings distributed equally in slots The motor speed tracks the applied frequency.
around the periphery of the stator to produce (c) Synchronous. Commonly these have a
alternate Nand S poles. When fed with poly- stationary phase-wound armature and revolving
phase current a revolving field is set up which dc or permanent magnet field structure. When
turns at S = 120f/p rpm. This revolving field at full speed and synchronized, the revolving
induces a counter emf E = 2.22frJ>m Zkw volts armature mmf stands still relative to the dc
per phase, where rJ> is flux per pole, kw is the field. When the angle (la, Ea) = 0, the armature
winding factor, and Z is the number of series mmf poles stand midway between the field
turns. poles. When Ia lags E a, the armature mmf as-
The rotor is made of a number of short cir- sists the field; it opposes the field when Ia leads
cuited conductors. If these conductors are open, Ea. Armature current Ia adjusts to a value and
or the rotor is running at synchronous speed S, time phase position such that the counter
each rotor phase behaves as an inductance and emf Ea and current Ia are correct to meet the
draws an exciting current lagging nearly! cycle existing load. Important modeling relations
behind the emf. Shaft load reduces the speed are Ia = (V - Ea)/Za; power converted = mlaEa
from S to SO - s), where s is slip. When re- . cos (la, Ea); input = mla V cos (la, V); and
ferred to the stator mounted armature (pri- armature copper loss = mla2ra. Where Ea is a
mary), s causes the emf in each rotor phase to function of (la, If); angle (la, Ea) and Ea are
be sE (in volts). Each rotor phase sees current in opposition; Ia increases with load; and m =
(in amperes): number of phases.
When connected directly to the utility power
system, synchronous motors are used where the
rpm must be fixed, for power factor correction,
regulating transmission line voltages, or speeds
too low for good induction motor performance.
They are not self starting unless special means
are provided, usually squirrel-cage or phase
where r2 is the resistance andx2 is the rotor leak- windings in the pole faces. Precautions against
age reactance in ohms per phase. high ac voltages in the dc field coils are needed
The current 12 produces the needed torque. when starting. The motors will carry some load
It creates a new magnetomotive force (mmf) with the dc field winding open. Small sizes
which turns at synchronous speed S in the same (reluctance types) operate without field wind-
direction as the revolving stator field, and lags ings. Single-phase motors are less satisfactory
the stator field in space by the electrical angle because of lower efficiency, tendency to severe
90° + tan- 1 (X2/r2)' To balance this mmf the hunting, and problems in starting.
stator draws additional current sufficient to (d) ac Commutator. These are mostly single-
produce the load torque. Performance can be phase series, repulsion, or combination of these
calculated (as for a transformer) if r2 /s is taken types. Single-phase series motors are used in
as the independent variable. Three-phase motors large quantities for portable tools, vacuum
require less material than one- or two-phase sweepers, garbage disposals, household appli-
motors. ances, etc. The ac series motor is similar to the
(b) Single-phase induction. With the motor at dc motor except for a laminated field and
rest there is no revolving field; the motor will precautions against low power factor and poor
start in either direction only if given a push. commutation. The runaway speed is high.
Rotation sets up an elliptical revolving field (e) Brushless dc. These are ac synchronous
which turns synchronously at nonuniform and induction motors which receive their power
MOTORS, ELECTRIC 784

from electronic inverters. The inverters are fed The attributes of musical sound and their
from a dc source. By proper inverter control, subjective correlates are described briefly. The
the entire system responds the same as a dc number of cycles per second, frequency, is a
motor. Uses include industrial drives, electric physical entity and may be measured objec-
cars, railways, and aircraft control actuators. tively. Pitch, however, is a psychological phe-
(f) Stepper Motors. ac and dc motors specially nomenon and needs a human subject to per-
designed for high torque and low inertia. Nor- ceive it. In general, as the frequency of a single
mally electronically controlled to discrete posi- sinusoidal vibration of a sonorous body (pure
tions, or steps. These are used extensively for tone) is raised, the pitch is higher. However,
industrial positioners, computer line printers, pitch and frequency do not bear a simple linear
robots, etc. relationship. To define the relationship human
Probable Future Trends These include less subjects are used to construct a pitch scale so
expensive, higher energy product permanent that one note can be judged to be two times
magnets; electrical insulators with better heat the pitch of another and so on. The unit of
conduction properties; and superconductors. pitch on this scale is called the mel, and a pitch
Superconducting windings will require motors of 1000 mels is arbitrarily assigned to a fre-
to be built of nonmagnetic materials, since the quency of 1000 Hz. In general, it is observed
magnetic fields will exceed the saturation levels that the pitch is slightly less than the frequency
of magnetic materials. Rapid advances in power when the frequencies are higher than 1000 Hz,
electronics will influence motor designs. and slightly more than the frequency at fre-
quencies less than 1000 Hz. Pitch also depends
FREDERICK C. BROCKHURST on loudness; for a 200 Hz tone if the loudness
is increased, the pitch decreases, and the same
References happens for frequencies up to 1000 Hz. Be-
tween 1000 and 3000 Hz pitch is relatively
SIemon, G. R., and Straughen, A., "Electric Machines," independent of loudness, while above 4000 Hz,
Reading, Mass., Addison-Wesley Publishing Com- increasing the loudness raises the pitch. Small
pany, Inc., 1980. pitch changes with loudness do occur with
Say, M. G., "Alternating Current Machines," New complex tones and whether the pitch goes
York, Halstead Press, John Wiley & Sons, Inc., 1976. up or down with loudness seems to depend on
the harmonic structure of the complex sound.
Cross-references: ALTERNATING CURRENTS, ELEC- A rapid variation in pitch when the variation
TRICITY, INDUCED ELECTROMOTIVE FORCE, occurs at the rate of from two to five times
INDUCTANCE, MAGNETISM, TRANSFORMER.
per second is called vibrato. The pitch variation
in mels may be large or small but the rate at
MUSICAL SOUND which the pitch is varied is rarely greater than
five times per second. Violinists produce vibrato
Musical sound may be characterized as an aural by sliding their fingers back and forth a minute
sensation caused by the rapid periodic motion distance on a stopped string. A variation in
of a sonorous body, while noise is due to non- loudness occurring at the rate of two to five
periodic motions. The above statement, origi- times a second is called tremolo. Singers often
nally made by Helmholtz, may be modified produce a combination of tremolo and vibrato
slightly so that the frequencies of vibration of to give added color to their renditions.
the body fall into the limits of hearing: 20 to The ability to discriminate between pitches
20,000 Hz. This definition is not clear cut; depends on other variables beside the acuity of
there are some noises in the note of a harp (the the listener. The just noticeable differences
twang) as well as a recognizable note in the (jnd's) also depend on the frequencies of the
sqeak of a shoe. In other cases it is even more pure tones. Jnd's generally get larger as the
difficult to make a distinction between music frequency increases. Duration of a pure -tone is
and noise. In some modern "electronic music" also a requirement for recognition of pitch.
hisses and thumps are considered a part of the Some pure tones of even 3 millisecond duration
music. White noise is a complex sound whose can still be recognized as having a definite pitch.
frequency components are so closely spaced The duration of time required to recognize
and so numerous that the sound ceases to have pitch depends on the frequency and to some
pitch. The average power per frequency of extent on the loudness of the tone. It takes
these components is approximately the same longer to recognize the pitch of a low fre-
over the whole audible range, and the noise has quency pure tone than for one of a high fre-
a hissing sound similar to that one gets from quency. In all cases if the time is too short one
FM radio that is tuned between stations. Pink hears a click rather than a clear pitch. The pitch
noise has its lower frequency components rela- of a complex musical tone depends on the spec-
tively louder than the high frequency compo- trum of the complex tone. If the complex t~ne
nents and this is accomplished by keeping the is composed of the fundamental and exact over-
average power the same in each octave (or in tones, the ear recognizes the pitch as that of the
each! octave) band from 20 to 20,000 Hz. fundamental, even if the fundamental is weak
785 MUSICAL SOUND

or is missing. Manufacturers of small portable level at 1000 Hz is about 5 dB lower than an


radios take advantage of this when they install equally loud sound at 2000 Hz, for 30 Hz about
small loudspeakers in these radios. These speak- 70 dB lower, and at 10,000 Hz about 20 dB
ers are not capable of producing the fundamen- lower. In general, as the intensity levelincreases,
tals of certain low frequency musical sounds loudness levels tend to be more alike at all
but the ear seems to fill in the "missing funda- frequencies. This means that as a sound gets less
mental." intense at all frequencies, the ear tends to hear
Like frequency, intensity is a physical entity the higher and lower portions of sound less
defined as the amount of sound energy passing loudly than the middle positions. Some high
through unit area per second in a direction fidelity systems incorporate circuitry that auto-
perpendicular to the area. It is proportional to matically boosts the high and low frequencies
the square of the sound pressure, the latter as the intensity level of the sound is decreased.
being the rms pressure over and above the con- This control is usually designated a loudness
stant mean atmospheric pressure. Since sound control.
pressure is proportional to the amplitude of a At times it is necessary to have a scale of ab-
longitudinal sound wave (see WAVE MOTION) solute perceived loudness. The unit of this
and to the frequency of the wave, intensity perceived loudness is the sone. It is arrived at
since it depends on energy, is proportional to by a set of complicated procedures involving
the square of the amplitude and to the square human subjects usually placed in a free field
of the frequency. Sound intensity is measured situation (e.g., anechoic chamber). One sone of
in watts per second per square centimeter and, loudness is defined as the sound pressure at the
since the ear is so sensitive, a more usual unit ear of 40 dB for a 1000 Hz pure tone. Two
is the microwatt per second per square centi- sones is perceived in these experiments as twice
meter. By way of example, a soft speaking as loud, three sones three times, etc. In many
voice produces an intensity of 0.1 micromicro- situations, musical as well as noisy, the figures
watt/cm 2 sec, while fifteen hundred bass voices for sones of multiple sounds seem to add up in
singing fortissimo at a distance 1 cm away an arithmetical way; figures for db do not.
produce 40 watt/cm 2 sec. Because of such a Some consumer testing groups use the sone
large range of intensities, the decibel scale of scale for evaluating the noise of various con-
intensity is normally used to designate intensitr sumer products.
levels. An arbitrary level of 10- 16 watts/em The entity which enables a person to recog-
sec is taken as a standard for comparison at nize the difference between equally load tones
1000 Hz. This level is very close to the thresh- of the same pitch coming from different musi-
old of audibility. At this frequency, other cal instruments is called timbre, quality, or tone
sound levels are compared by forming the color. A simple fundamental law in acoustics
logarithm of the ratio of the desired sound to states that the ear recognizes only those sounds
this arbitrary one. Thus log 1/10- 16 is the due to simple harmonic motions (see VIBRA-
number of bels a sound of intensity I has, com- TION) as pure tones. A tuning fork of frequency
pared to this level. Since this unit is incon- f, when struck, causes the air to vibrate in a
veniently large, it has been subdivided into the manner which is very nearly simple harmonic.
decibel, one-tenth its size. Thus 10 log l/l 0- 16 The sound that is heard does, in fact, give the
equals the number of decibels (dB) the sound impression that it is simple and produces a pure
has. A few intensity decibel levels are listed: tone of a single pitch. If one now strikes simul-
taneously a series of tuning forks having fre-
db quencies f (the fundamental), 2f, 3f, 4f, 5f,
Quiet whisper 10 etc. (overtones), the pitch heard is the same as
Ordinary conversation 60 that of the fork of frequency f except that the
Noisy factory 90 sound has a different timbre. The timbre of the
Thunder (loud) 110 sound of the series can be changed by altering
Pain threshold 120 the loudness of the individual forks from zero
loudness to any given loudness. Another way to
While intensity levels can be measured physi- alter the timbre is to vary the time it takes for
cally, loudness levels are subjective and need a composite sound to grow and to decay. A
human subjects for their evaluation. The unit of slow growth of an envelope, even though it
loudness is the phon, and an arbitrary level of contains the same frequencies, makes for a
zero phons is the loudness of a 1000 Hz note different timbre than one which has a rapid
which has an intensity level of 0 dB. Sounds of growth. The difference in timbre between a
equal loudness, however, do not have the same B-flat saxophone and an oboe is almost entirely
intensity levels for different frequencies. From due to the difference in growth or decay time.
a series of experiments involving human sub- A fundamental theorem discovered by the
jects, Fletcher and Munson in 1933 constructed mathematician Fourier states that any compli-
a set of equal-loudness contours for different cated periodic vibration may be analyzed into
frequencies of pure tones. These show that for a set of components which has simple harmonic
quiet sounds (a level of 5 phons) the intensity vibrations of single frequencies. If this method
MUSICAL SOUND 786

of analysis is applied to the composite tones of quencies of the original wave. This altered wave
musical instruments, it is seen that these tones can later be transformed into the corresponding
consist of a fundamental plus a series of over- sound wave. In this way sounds having desired
tones, the intensity of overtones being different rise and decay times, plus the required funda-
for instruments of differing timbre. Rise and mental and overtone structure, can be made as
decay times will also differ. The reverse of desired.
analysis is the synthesis of a musical sound.
Helmholtz was able to synthesize sound by JESS J. JOSEPHS
combining sets of oscillating tuning forks of References
various loudness to produce a single composite
steady tone of a definite timbre. Modern syn- Rayleigh, J. W. S., "The Theory of Sound," New
thesizers are more sophisticated. Electrical York, Dover Publications, 1945.
oscillators of the simple harmonic variety are Hehnholtz, H., "On the Sensations of Tone," New
combined electrically and then these electrical York, Dover Publications, 1954.
composite envelopes are electronically modified Stephens, R. W., and Bate A. E., "Acoustics and Vi-
to produce differing rise and decay time. A brational Physics," London, Edward Arnold, 1966.
transducer changes the electrical composite Josephs, J. J., "The Physics of Musical Sound," New
envelope into an acoustical one so that a sound York, Van Nostrand Reinhold, 1967.
of any desired timbre, rise and/or decay time Winckel, F., "Music Sound and Sensation," New York,
can be produced. An alternative way to produce Dover, 1967.
similar effects is to use an oscillation known as Benade, A. H., "Fundamentals of Musical Acoustics,"
the square wave. When this oscillation is anal- London, Oxford Univ. Press, 1976.
yzed by the method of Fourier, it is shown to Backus, J., "The Acoustical Foundations of Music,"
consist of a fundamental plus the odd harmon- New York, Norton, 1977.
ics or overtones. Another kind of oscillation, a Rossing, T. D., "The Science of Sound," Reading, MA,
saw-tooth wave, then analyzed, is shown to Addison-Wesley, 1982.
consist of the fundamental and all harmonics-
even and odd. A square wave or a sawtooth Cross·references: ACOUSTICS, ARCHITECTURAL
wave produced by an appropriate electrical ACOUSTICS, NOISE, FOURIER ANALYSIS, RE-
oscillator can be passed through an electrical PRODUCTION OF SOUND, VIBRATION, WAVE
filter which can attenuate any range of fre- MOTION.
N
NEUTRINO relativistically invariant forms, vector and axial
vector, were found to be sufficient to account
The neutrino is an elementary particle postu- for most of the known characteristics of the
lated by W. Pauli 1 in 1930 to explain the appar- weak interactions, was then formulated by Mar-
ent non-conservation of energy and momentum shak and Sudarshan s , and Feynman and Gell-
in that class of nuclear radioactivity known as Mann 6 .
beta decay. A quantitative theory of beta decay In 1962, an experiment at the Brookhaven
incorporating the neutrino hypothesis was for- National Laboratory by a Columbia-Brook-
mulated by E. Fermi2 in 1933 in analogy with haven group, 7 using a heavily shielded 10-
the quantum theory of radiation and served to ton spark chamber array, showed that the
predict the nature of the neutrino and its ex- neutrino most frequently associated with the
tremely weak interaction with matter. Accord- decay of the n meson differed from the neutrino
ing to the Pauli-Fermi ideas, the neutrino (in produced in nuclear beta decay, thus enlarging
Italian, "little neutral one") is a particle of the class of neutrinos to four; Ve, Ve, vIJ., vIJ.. It
vanishingly small and possibly zero rest mass, now appears that any decay or inverse process
no electrical charge, with spin 1/2, and the involving an electron has associated with it an
ability to carry energy and linear and angular electron type neutrino while any such process
momentum. Its interaction with matter is so involving a mu meson occurs in association with
weak that a 3-MeV antineutrino is predicted to a mu-type neutrino. The discovery by Perl 8
be capable of penetrating an astronomical thick- (1975) of a heavy (1.9 times the proton mass)
ness of matter, e.g., 100 light-years of liquid weakly interacting particle called a tau suggests
hydrogen . In 1956, a group of Los Alamos that a third pair of neutrinos exists (VT, VT).
physicists3 succeeded in making a direct obser- An interaction of the neutrino which did not
vation of the neutrino, ve, emitted from beta- change the electrical charge of the target-a
decaying fission fragments produced in a power- neutral current-was observed 9 at CERN (1973)
ful reactor at the Savannah River Plant operated in accordance with the theoretical prediction
by the du Pont Company for the U.S. Atomic of Salam and Weinberg lO (1967-72) . The exis-
Energy Commission. These investigators used tence of the reactions vlJ. + e- ~ vlJ. + e- and vlJ. +
giant liquid scintillation detectors to observe N ~ vlJ. + N I supported the theoretical picture
the inverse beta decay reaction which resulted from unification of the weak
and electromagnetic forces. More recent work 11
Ve + P ~e+ +n shows that, as expected, the neutral current re-
action Ve + D ~ ve + n + p also occurs. A further
where p is the target proton and e+ and n are the prediction of the theory is the existence of car-
product positron and neutron. The experiment riers of the weak interaction called intermediate
consisted of observing the distinctive delayed bosons W±, ZO having masses in the range some
coincidence between the prompt annihilation of 90 times that of a proton. The existence of
the positron and the capture of the neutron by these particles was demonstrated at the CERN
a cadmium isotope dissolved in the scintillator. Ii > p colliding beam accelerator (1983 ).12
In 1957, following the "overthrow" of parity Recent experiments suggest that the neutrino
conservation in weak interactions as a result of may be more complicated than hitherto be-
the work of Lee and Yang4 , the character of the lieved - in particular it may not have a vanish-;
neutrino was further elucidated. Two kinds of ingly small rest mass. Theoretical models for
neutrinos were accepted: the neutrino, Ve, pro- such a possibility13 advanced by Nakagawa et
duced in beta decay in association with posi- al. (1963) and Pontecorvo (1969) assume that
trons, and the antineutrino, ve, produced in the observed neutrinos are linear combinations
beta decay in association with negative elec- of massive base states. Such combinations would
trons. The neutrino emerged as completely pol- give rise to observed states, e.g., Ve, vIJ.' VT ,
arized with the spin angular momentum parallel which differ in an energy-dependent manner
(antiparallel) to the linear momentum for the with distance from a neutrino source. Such
antineutrino (neutrino). A theory of weak inter- variations, called oscillations by analogy with
actions encompassing the neutrino, in which the the composite motion of coupled pendula,

181
NEUTRINO 788

have been sought using reactor neutrinos ve It is seen that neutrino physics encompasses
(1980) and higher energy neutrinos from accel- low-energy reactions using fission reactors,
erators, and hints of an effect 14 are being sub- work in the structure of the weak interaction
jected to further tests. 15 using giant electronuclear machines, and the
The implications of finite neutrino mass beginnings of various studies of neutrino
range from a mixing of neutrino types to conse- sources such as the sun and high-energy cosmic
quences of an equally profound character in rays22 as well as the nature of the neutrino
the formation of galaxies and the universe itself.
itself. A neutrino mass ~20 eV would result
in a closed universe. F. REINES
A search for Ve from the sun 16 using a 400-
ton perchloroethylene (C 2CI 4 ) target located References
~ 1 mile below the earth's surface to diminish
the background due to cosmic rays has, thus 1. Pauli, W., Jr., "Rapports Septieme Conseil Phys·
far, yielded unexpectedly negative results. In ique, Solvay, Bruxelles, 1933," Paris, Gautier·
this experiment the 37 Ar produced by solar Villars, 1934.
neutrinos via the reaction ve + 37 Cl ~ 37 Ar + 2. Fermi, E., Z. Physik 88, 161 (1934).
e- was sought by collecting the 37 Ar from 3. Reines, F., and Cowan, C. L., Jr., Phys. Rev. 92,
the C2Cl 4 container and counting in a tiny 830 (1953); Cowman, C. 1., Jr., Reines, F., Har-
low background proportional counter. The rison, F. B., Kruse, H. W., and McGuire, A. D.,
limits set, a factor of ~3 below expectation, Science 124, 103 (1956); Phys. Rev. 117, 159
are difficult to reconcile with the current solar (1960).
model. 17 4. Lee, T. D., and Yang, C. N., Phys. Rev. 105,
Because of the threshold for 37 Cl reaction 1671 (1957).
(0.8 MeV) the solar Ve signal is dependent on 5. Marshak, R. E., and Sudarshan, E. C. G., Phys.
the details of the solar model. An experiment Rev. 109, 1860 (1958); Proceedings of Padua-
which is insensitive to the solar model is in Venice Conference on Mesons and Newly Dis-
process of development by Davis et al. 16 (1980). covered Particles, Italy, September 1957.
The reaction Ve + 71 Ga ~ 71 Ge + e- has a thresh- 6. Feynman, R. P., and Gell-Mann, M., Phys. Rev.
old of 0.24 MeV and hence can be caused 109,193 (1958).
by Ve from the essentially model independent 7. Danby, G., Gaillard, J. M., Goulianos, K., Leder-
p + p reaction which is responsible for the bulk man, L. M., Mistry, N., Schwartz, M., and Stein-
of the sun's energy: berger, J., Phys. Rev. Lett. 9,36 (1962).
8. Perl, M. L., et aI., Phys. Rev. Lett. 35, 1489
p + p ~ d + e+ + Ve (0.42 MeV endpoint). (1975).
9. Hasert, F. J., et al. Phys. Lett. 46B, 138 (1973).
At this stage the experiment appears to be 10. Weinberg, S., Phys. Rev. Lett. 19, 1264 (1967);
feasible, assuming the availability of the re- Salam, A., Proc. 8th Nobel Symposium (N.
quired 25 tons of gallium. Svartholm, Ed.), 1969.
Two groups, one 2 miles underground in a 11. Pasierb, E., et aI., Phys. Rev. Lett. 43,96 (1979).
South African gold mine,IS and the other in the 12. Armison, G. et al. (VAl Collaboration) Phys. Lett.
Kolar gold fields 20 at a slightly lesser depth, 126B, 7 July 1983. See also CERN Courier, Nov.
have detected high-energy muon neutrinos 1983 for a discussion of these important dis-
produced in the atmosphere by the interaction coveries.
of cosmic ray primaries. A measure of the 13. Nakagawa, M., Okonogi, H., Sakata, S., and
meager flux and weakness of the interaction is Toyoda, A., Prog. Theor. Phys. 30, 727 (1963);
indicated by the small number of neutrino- Pontecorvo, B., Zh. Eksp. Tear. Fiz. 53, 1717
induced events collected: the Indian, Japanese, (1967).
British Kolar group recorded ~ 17 in two years 14. Reines, F., Sobel, H. W., Pasierb, E., Phys. Rev.
of operation; the American, South African Lett. 45, 1307 (1980).
group detected ~ 100 with somewhat larger 15. Mossbauer, R. L., Contribution to v-82 Confer-
equipment and four years of operation. ence. Reines, F., Nucl. Phys., A396, 469 (1983).
Ambitious proposals are being formulated to 16. Davis, R., Jr., Rogers, 1. C., and Radeka, V.,
study these "natural" neutrinos by means of Bull. Am. Phys. Soc. 16,631 (1971).
giant (10 6 -10 9 tons) water Cerenkov detectors 17. Bahca1l, J. N., and Sears, R. L., Ann. Rev. Astron.
to be located deep (~5 kilometers) in the Astrophys. (1972).
ocean. As in the case of the gold mine experi- 18. The use of Gallium was suggested as a solar Ve
ments the neutrinos would be detected via target by V. A. Kuzmin, Soviet Physics-JETP
product muons. Of special interest is the 22,1051 (1966).
sensitivity of such an imagined detector to very 19. Reines, F., Kropp, W. R., Sobel, H. W., Gurr,
-energetic neutrinos (?:: 10 10 eV). If current H. S., Lathrop, J., Crouch, M. F., Sellschop,
conjectures regarding energetic extraterrestrial J. P. F., and Meyer, B. S. Phys. Rev. D4, 80
sources are correct such a deep underwater (1971).
muon and neutrino detector (DUMAND)21 20. Krishnaswamy, M. R., Menon, J. G. K., Narasim-
shows promise of observing them. han, F. S., Hinotani, K., Ito, N., Miyake, S., Os-
789 NEUTRON

borne, J. L., Parsons, A. J., and Wolfendale, A. W., charged, they can and do approach, penetrate,
Proc. Roy. Soc. London A 323 (1971). and interact with nuclei much more readily
21. Several papers and references to DUMAND may than protons, since they have no Coulomb
be found in Vol. II of the 1981 Internat. Conf. on repulsion to overcome. Moreover, they can and
Neutrino Physics and Astrophysics, R. J. Cence, do function as the "binding agents" within a
E. Ma, A. Roberts, Eds. nucleus to hold it relatively stably together
22. Further references and more extensive discussion despite the disruptive electrostatic forces that
may be found in the proceedings of the annual the constituent protons exert on each other.
neutrino conferences. In this capacity they respond to the strong-
interaction internucleon force (they are classi-
Cross-references: ANTIPARTICLES, CONSER VA- fied as hadrons, i.e., "the strong ones," from
TION LAWS AND SYMMETRY, ELEMENTARY the Greek hadros, "strong," a family of particles
PARTICLES, RADIOACTIVITY, WEAK INTER- that includes not only the protons and other
ACTIONS. baryons, i.e., "the heavy ones," but also mesons,
i.e., "the medium ones," when the latter are
susceptible to strong interactions). This nuclear
force is a powerful binding force that acts
NEUTRON attractively between nucleons at short range,
keeping them pinioned within the confines of
Significance The role of neutrons and their ap- a nucleus, whose diameter is on the order of
plications is of central importance to nuclear 10-15 to 10- 14 meters, about a million times
physics and, indeed, to science in general. Some smaller than the diameter of an atom.
indication of their wide-ranging significance in The binding among neutrons and protons in a
the current world is provided by the breadth nucleus is mediated by the exchange of mesons:
of the topics covered in an International Con- principally, pi(1T)-mesons (or pions, the "pri-
ference on The Neutron and its Applications, 1 mary" mesons), but also rho(p )-mesons, both
September 13-17, 1982, commemorating the of which species may carry a positive, negative,
fiftieth anniversary of the 1932 discovery or zero electrical charge, and omega(w)-mesons,
of neutrons by (Sir) James Chadwick 2 in which exist only in the uncharged state. This
Cambridge (UK). The following sample of meson-exchange model of the forces inside a
topics exemplifies the scope: Neutron sources; nucleus also provides an explanation (indicated
properties, lifetime, moments and structure below) for the otherwise puzzling fact that the
of the neutron; ultracold neutrons; neutrons in mean lifetime of many varieties of nuclei con-
astrophysics and neutron stars; nuclear interac- taining bound neutrons can greatly exceed the
tions of the neutron; scattering, diffraction, and mean free-neutron lifetime. From measurements
neutron optics; radiation; fission and fusion; performed on large statistical aggregates of neu-
reactor physics; transmutation, transfer, and trons in the free state, their mean lifetime has
transport theory; magnetism; molecular sci- been determined as T = 925 ± 11 sec, which cor-
ences, polymers, and materials sciences; mo- responds to a "half-life" of T I / 2 = 641 ± 8 sec,
lecular biology, biology, medical, and thera- after which time interval on an average one-half
peutic applications; technological applications; of the aggregate will have decayed into products
theoretical and particle aspects; etc. indicated by the following symbolic decay
This recent overview follows surveys of scheme:
neurton physics in previous international con- n ~ p+ + e- + ii.
ferences that dealt with the interactions of
neutrons with nuclei3 and nuclear structure This expresses the decay of any given neutron
study with neutrons,4,5 supplementing review into a triad of particles comprising a proton
articles,6-8 and topical textbooks. 9 - 14 (p+ = 1H, the positively charged nucleus of the
The neutron is thus a fundamental particle hydrogen atom), an electron (e- = {3-, the par-
of immense versatility, a constituent of all ticle identified as the constituent of (3-rays in
matter (except hydrogen, 1H, whose nucleus radioactivity) and an antineutrino (ii). This
consists solely of a proton) as one of the two particular decay transition, termed "{3-decay" in
principal particle species, collectively termed allusion to the presence of the {3 (electron)
nucleons, that make up the nuclei of atoms. among the decay products, is an instance of a
The protons that form the companion species weak interaction. Thus the neutron in its free
have almost the same characteristic properties state participates in a weak-interaction decay,
as the neutrons, with one important exception: whereas in its bound state in a nucleus, it
they carry a positive electric charge, whereas participates in a strong-interaction binding
neutrons are electrically neutral, i.e., uncharged, among the other nucleons. The heavy particles
and therefore not susceptible to any electro- (n, p) featured in above {3-decay transition are
static repulsive forces. Likewise, for this reason members of the baryon family; the light par-
neutrons cannot attract (negatively charged) ticles (e, ii) are leptons, wherein the presence of
electrons to form neutronic "atoms," nor can the antineutrino is indicative of a weak inter-
they be deflected, focused, or detected directly action. Of the two nucleons, the neutron is
through electric-field devices. But, being un- very slightly the more massive: hence its decay
NEUTRON 790

into a proton and other products does not electric charge). Not only can the presence of
violate the principle of conservation of mass- electrons within the confines of a nucleus be
energy. The neutron's intrinsic angular momen- excluded by quantum-theoretical arguments, but
tum, or spin, is (in units of h/21'(, where h is simple reasoning based upon spin-conservation
Planck's constant, a fundamental physical con- can rule out such a proton-electron configura-
stant in quantum theory that has the dimensions tion in the case of such nuclei as nitrogen-14,
of energy times time) s = !; this "quantum for which measurements indicate the positive
number" is numerically the same for the other nuclear charge to be 7 units (147N, a designa-
end-product particles of f3-decay, which accord- tion in which the superscript on the left denotes
ingly is consistent with the conservation of the mass number A and the subscript on the
angular momentum, schematically expressed as left denotes the atomic number Z) and the
! ~ ! + ! - !. The most recent concensus for the ground state to have an integer spin (of value 1
mass of a neutron is in the conventional h/21'( units): no aggregate
of 14 protons (each having half-integer spin,
mn =(1.6749542 ± 0.0000048) X 10-27 kg s = !) and 7 electrons (each having a single
negative electric charge, and half-integer spin,
= 1.0086650 ± 0.0000028 u, s = !), making up a composite of an odd total
where the atomic mass unit (ud is defined as number (21) of half-integer-spin particles, can
yield a system having integer net spin, whereas
1/12 of the mass of the neutral 1 C atom. When
expressed in energy units (million electron-volts, of course a cluster of 7 protons with 7 neutrons
or MeV), this is equivalent to a rest-mass-energy constitutes an even-number aggregate that can
of 939.5731 ± 0.0027 MeV, as against the pro- give rise to an integer net spin.
ton's rest-mass-energy of 938.2796 ± 0.0027 In adopting the neutron-proton model of
MeV, which corresponds to nuclear structure, with meson exchange to pro-
mote the binding, it is evident that the continual
mp = (1.6726483 ± 0.0000048) X 10-27 kg acquisition and loss of mesons by neutrons
must be the factor responsible for inhibiting
= 1.0072764 ± 0.0000029 u. the neutrons' natural tendency to decay. By
way of a somewhat oversimplified picture, one
The neutron's mass mn is about 1839 times might conceive of the momentary combination
that of the electron, and the proton's mass is of an uncharged neutron with a positively
about 1836 times that of the electron: charged companion meson as akin to an over-
weight, but stable, proton: hence, whenever
me = (9.1095339 ± 0.0000250) X 10-31 kg the bound neutron gains its accompanying 1'(+
=(5.4858027 ± 0.0000015) X 10- 4 u, (or p+) exchange meson it assumes the charac-
ter of a long-lived (pseUdo-proton) particle and
or, in energy units, 0.5110034 ± 0.0000014 so enhances the nuclear lifetime beyond that of
MeV. The rest mass of the antineutrino is zero the normal free neutron. The mesic "clouds"
(or, at all events, vanishingly small relative to in a nucleus thus participate in an ever-ongoing
the above, bearing in mind that the most recent rapid exchange process, binding the nucleons
theories suggest that it may not necessarily be and augmenting the stability of the nucleus.
precisely zero). Thus, the mass excess of the As the number of neutrons (N, the neutron
neutron over the proton, as expressed in energy number) in a nucleus increases, up to and be-
units, is yond the atomic number Z (equal to the num-
ber of protons in the nucleus, and hence to the
mn - mp = 1.29343 ± 0.00004 MeV. number of counterbalancing electrons in the
electrically neutral atom), the nucleus becomes
The decay, after a mean half-life of roughly more stable-up to a reasonable limit, beyond
10.7 min, of neutrons into protons (which are which, with a growing neutron excess (N - Z),
completely stable or, at any rate, have extremely the instability again progressively grows worse.
long mean lifetimes, on the order of 10 31 years For stable nuclei of odd mass number A = N +
if the latest unified theories of particles and Z it is found that to a good approximation the
their interactions prove to be valid) would neutron number is
seem to preclude the existence of neutrons 490A + 7.75As/3
within stable or long-lived nuclei. On the other
N(moststable) = 90
hand, their presence in conjunction with pro- 9 + 7.75A 2/3 •

tons as the ingredients that make up a nucleus


successfully explains all the known properties Whereas the number of nuclear protons (and
of nuclei and, in particular, overcomes the thus of atomic electrons) determines the par-
numerous, otherwise insurmountable, diffi- ticular chemical element (e.g., Z = 1 for hydro-
culties posed by earlier, pre-1932, models of gen, Z = 2 for helium, Z = 27 for cobalt, Z = 92
nuclear structure, such as the proton-electron for uranium, etc.), it is the number of neutrons
model, in which the nucleus was visualized to that determines the particular isotope (Greek,
be composed of protons (making up the mass) "same place") of that element. For example,
and electrons (serving to neutralise the excess the stable nuclide 59CO has N = A - Z = 59 -
791 NEUTRON

27 = 32, while the unstable isotopes on either details ; studies with polarized neutrons, the
side , S7CO and 6 1 Co, respectively have N = 30 acquisition of explicit information on neutron
and 34 ; according to the above stability equa- matter and neutron stars, and application of
tion , one would expect S9Co to be the most neutrons for technological, biological, and
stable of these, since substitution of A = 59 medical purposes as well as for purely scientific
yields the desired N(most stable) = 32, whereas ends, the development of more intense neutron
with A = 57 one would obtain N(moststable) = sources and more efficient means of detection,
31, and with A = 61 one would find that the potentialities offered by, e.g., ultracold
N(moststable) = 33. nuetrons, or at the other extreme, by ultrahigh-
The transformation of nuclei by neutron energy neutrons: all these have the ability to
capture to form new isotopes was first accom- provide fruitful, valuable insights into the phys-
plished in Rome by Enrico Fermi in 1934 ical world that would otherwise remain unex-
(the year in which he also put forward his plored and unexploited.
detailed theory of {3-decay). This opened up Discovery and Classification No finer "com-
an altogether new branch of nuclear physics ing-of-age present" for nuclear physics could be
(proton-induced transmutation had been dis- imagined than the 1932 discovery of neutrons
covered by John Cockcroft and Ernest Walton by Chadwick,2 precisely twenty-one years
in 1932 , for which they were jointly awarded after the birth of nuclear physics following the
the Nobel Prize in physics in 1951; Chadwick recognition by (Sir) Ernest Rutherford in 1911
had received the Nobel Prize in physics in 1935, that at the central core of atoms there lay a
and Fermi in 1938). The importance and massive , dense "nucleus." To set this neutron
breadth of the subsequent developments hardly discovery in perspective, it should be borne in
need to be stressed ; on the cosmic scale, neu- mind that with the discovery of natural radio-
tron capture processes in stars have profound activity by Henri Becquerel in 1896 and of elec-
astrophysical significance, the discovery of fis- trons as the "{3-ray" component of such radia-
sion in Berlin by Otto Hahn (recipient of the tion) by (Sir) Joseph Thomson in 1897, the
Nobel Prize in chemistry in 1944) and Fritz groundwork was laid for the elucidation of the
Strassmann in 1939 through the capture make-up of atoms. This received its major
of slow neutrons by uranium literally had impetus in 1911 from Rutherford 's interpre-
earth-shaking consequences, and the seemingly tative studies and in 1913 from Niels Bohr's
boundless range of neutron-induced nuclear quantum-theoretical explanation, which ren-
transmutation reactions under widely differing dered an admirable account of many observa-
conditions offered new vistas in the rapidly ex- tions of atomic phenomena. With the definite
panding field encompassed by neutron science. identification of the proton in 1919 , it appeared
The discovery of its conjugate antiparticle, reasonable to contend that nuclei consisted
the antineutron (ii , uncharged) , by B. Cork, of protons and electrons in intimate combina-
G. R. Lambertson, O. Piccioni, and W. A. tion. Upon further scrutiny, however, this
Wenzeps in 1956 (one year after the 1955 model became increasingly difficult to reconcile
identification of the antiproton by O. Chamber- with the findings, and by 1930 it was evident
lain, E. Segre, C. Wiegand , and T. Ypsilantis), that it was untenable. Speculation was rife
the recognition of the neutron's having a com- that the nucleus might possibly harbor a
plicated internal structure (since, as discussed neutral ingredient, a suggestion that Ruther-
below , it has a magnetic moment , and there- ford espoused. With the finding of the neutron,
fore acts like a magnet , there must be internal having all the desired properties, the difficulties
electric charges circulating within it, even though were solved in 1932.
its net electric charge is found from external Stemming from the 1930 observation by
measurements to be zero) , the interpretation of Walther Bothe and H. Becker in Berlin that a
the make-up of the neutron as a baryon in penetrating radiation resulted from the bom-
terms of a triad of quarks (e .g. , the pnn or, in bardment of various target elements by nat-
more recent parlance, udd quark combination, urally produced a -particles (alpha-particles are
comprising an "up" quark and two "down" helium-4 nuclei, emitted spontaneously by a
quarks, the former of which carries a positive radioactive polonium source and by some other
charge, +~ e, and the latter of which each carry radioactive sources), and that the reaction with
a negative charge, - ~ e, where e is the electronic beryllium was particularly intense, investiga-
charge), and other related particle aspects , have tions were undertaken at various centers into
all collectively made this a particularly interest- the nature of this radiation, unusually rich in
ing, if difficult , particle to study in its own energy and of novel penetrative capability.
right. Lately, 1 increasing consideration is being Among these was the Cavendish Laboratory in
given to the possible existence of "neutron os- Cambridge (UK) where, first, H. C. Webster
cillations," namely, the likelihood that free and, subsequently, Chadwick pursued these
neutrons could be represented as a superposi- studies to higher precision than had been at-
tion of nand ii states, with an interconversion tained previously, which ruled out the possibil-
(oscillation) period Tn" in excess of 105 sec. ity of the radiation's being high-energy 'Y-rays
The understanding of neutron-neutron forces (gamma-rays are the third component of the
is still far from complete in its quantitative a, {3, l' triad of radiations observed by Becquerel
NEUTRON 792

and separated by the Curies; they are identified A property of the neutron that has particular
as uncharged photons, i.e., electromagnetic ra- significance is its anomalous magnetic moment,
diation of short wavelength). Rather, Chadwick first accurately measured by Luis Alvarez and
reached the conclusion that the uncharged ra- Felix Bloch.16 The presently accepted value of
diation was composed of neutral massive par- this quantity, 17 expressed in units of the nu-
ticles, whose mass he showed to be comparable clear magnet on (J.lN == eh/4rrmpc =3.1524515 X
with, and slightly larger than, that of the pro- 10- 18 MeV/gauss, where c is the vacuum velocity
ton. Following Rutherford's suggestion, he of light) is
designated these particles as "neutrons" and
concluded his exposition of their discovery J.ln = -1.91304184 ± 0.00000088.
with the following words: "The neutron hy-
pothesis gives an immediate and simple ex- The negative sign implies that the magnetic-
planation of the experimental facts; it is con- moment vector points in a direction opposite
sistent in itself and it throws new light on the to that of the spin (i.e., indicates the magnetic
problem of nuclear structure." polarity), while the nonzero (anomalous) value
Thus was the foundation laid for the neu- implies the existence of inner circulating posi-
tron-proton model of the nucleus in 1932. tive and negative, self-cancelling electric charges
That same year, Werner Heisenberg, drawing on in the interior of the neutron. Also of signifi-
an analogy with chemical bonding, put forward cance is the electric dipole moment (also
his theory of the exchange nature of binding termed the El moment, or the EDM) of the
forces; although his first (incorrect) surmise neutron, which has to be zero if the time-re-
was that neutrino exchange was responsible versal invariance is to hold. The experimentally
for the strong internucleon forces, this was sub- determined value is consistent with zero; in the
sequently emended to meson exchange, putting conventional units of 10- 23 e . em, it is 18
the n-p-(rr) model on a sound footing. Another
contribution by Heisenberg, also in 1932, lay D n =0.04±0.15,
in the assignment of isospin 1 = ! to the n-p
nucleon doublet, treating the neutron and the which implies that, within 90% confidence
proton as merely different isospin states, with limits, Dn/e < 0.3 X 10-23 cm. An estimate 19
respective "projected third components of of this value in Steven Weinberg's model of "CP
isospin" 13 = -! and +!, of the basic "nucleon." violation" (akin to time-reversal violation)
Thus were sown the seeds of particle classifica- invoking contributions from strange quarks
tion schemes, brought to their full flower in yielded the result Dn/e ~ 0.07 X 10- 23 em,
more recent years. which is very close to the current experimental
Properties As well as having the mass stated limit (see under "Ultracold Neutrons" below).
above, the neutron has a spin s = ! (hence is An illustrative instance of its usefulness
subject to the Pauli exclusion principle in its as a probe of interaction characteristics, apart
quantum states, and is governed by Fermi- from its strong binding interaction and weak
Dirac statistics, therefore being termed a /3-decay interaction and, indeed, from its
fermion, as opposed to a particle that has inte- participation in the intermediate-strength elec-
ger spin and so obeys Bose-Einstein statistics, tromagnetic interaction during neutron-induced
termed a boson). As a member of the nucleon radiative capture (n, 'Y) reactions, is exemplified
family it has isospin 1 = ! (and, by convention, by its having been chosen 20,21 as the first mas-
13 = -!), positive intrinsic parity (rr = +), and sive non-self-conjugate elementary particle to
baryon number B = + I (the antineutron has B = test gravitation at the fundamental microscopic
-1). Its strangeness quantum number is zero level: a well-collimated beam of low-energy
(8 = 0: it is not a so-called "strange" particle), neutrons sent along a lengthy horizontal
as is also its lepton number (L = 0). Its hyper- evacuated pipe was observed to evince the
charge quantum number is Y = +1. Its net expected downward deflection as a function
electric charge is zero (Q = 0, or at any rate, of horizontal distance. Here, neutrons were
less than 10-21 of the magnitude of the elec- used in order to avoid electromagnetic compli-
tron's unit charge). As a baryon, it is composed cations and because their path could be well de-
of three quarks, arranged in the udd ("up- termined by Bragg reflection to high precision.
down-down") combination. Multineutron states The successful outcome prompted the proposal
or particles, such as 2n, 3n, or 4n, do not exist. to undertake an analogous long-flight-path
There is only one known decay mode of the experiment with antineutrons 22 ,23 as a means
neutron, namely the (pev) ~-decay mode, with of testing the effect of gravity on antimatter
a mean half-life of Tl{2 = 10.7 ± 0.1 min and at very high sensitivity (a I-MeV beam of anti-
a maximum energy re ease per decay event of neutrons traversing a horizonal distance of I
E~max = 0.78245 ± 0.00007 MeV. The proba- km would be expected to fall a vertical distance
bility (branching fraction) for the charge-non- of only 2.5 X 10-8 m). More recently, however,
conserving, but baryon-conserving and lepton- an alternative arrangement using antiprotons
conserving, decay mode (pvv) has been found (and, for comparison, protons) in a very long
to be less than 9 X 10-24 . drift tube, avoiding some of the inherent dif-
793 NEUTRON

ficulties of the antineutron experiment, has 3H + P -+ n + 3He - 0.764 MeV,


been suggested. 24
Details of other neutron properties and "P-T reaction"
numerical data have been given in handbooks,25
reviews of particle properties,26 and compila- 7Li + P -+ n + 7Be - 1.644 MeV,
tions of measured data.~7 For neutron-induced or
reaction data, especial attention is drawn to
the Evaluated Nuclear Data Files, the ENDF 9Be + d -+ n + lOB + 4.362 MeV.
bank, established and maintained by the
National Neutron Cross Section Center at the Neutrons from the D-D reaction typically have
Brookhaven National Laboratory, Upton, NY energies around 2-4 MeV; those from the D-T
11973. reaction have energies around 14 MeV (the re-
Production Neither neutrons nor protons are sidual nuclei take up some recoil energy), while
emitted spontaneously in the course of nuclear neutrons having energies below 1 MeV can be
radioactive decay, which is confined solely to produced from the endothermic reactions hav-
a, ~, or 'Y-emission. Free neutrons must there- ing a negative energy term on the right-hand
fore be produced artificially, through nuclear side. The D-D and D-T reactions are instances
reaction processes which may be induced by of thermonuclear fusion processes, such as ac-
charged particles (a, d, p principally), 'Y-rays count for energy generation in stars, in the
("photoneutron production"), or other neu- hydrogen bomb, and in controlled thermo-
trons (as in fission chain reactions). A useful nuclear reaction (CTR) devices for power pro-
more recent alternative is the spontaneous- duction. An overview of some accelerator-
fission source, containing a nuclide such as based neutron installations and facilities has
plutonium-240 e 40 pu) or californium-252 been given by S. W. Cierjacks,3 and a survey of
e 52 Cf) which has a reasonably long half-life the high-flux production of neutrons has been
(Tl12 = 6760 yr or 2.55 yr, respectively) and, presented by Lawrence Cranberg. 3 Methods
in undergoing spontaneous fission produces for producing intense sources of fast neutrons
fairly copious amounts of neutrons among the have been reviewed by Henry Barschall;28 in
disintegration products (the more intense particular, the generation of high-intensity
source, 252Cf, produces 3 X 10 12 neutrons/sec (~ 10 13 neutron/sec) beams of 14-Me V neu-
per gram). Other self-contained neutron sources trons for cancer therapy29 has been the subject
employ natural a-emitting nuclei such as 226Ra of increasing attention over the past decade. 3
or 230pu mixed with 9Be to bring about the Of course, intense neutron fluxes can also be
neutron-producing reaction produced in nuclear explosions underground.
Neutrons are being constantly produced in the
9Be + a -+ n + 12C + 5.7 MeV atmosphere as one of the secondary effects of
cosmic rays; ty~icallY approximately 2 X 10-6
within a sealed capsule. In polonium-beryllium neutrons per m are so produced every second
(Po-Be) sources, the a particles emanate from in ,the 1-2 MeV energy range. From pulsed
210pO (which decays to stable lead, 206Pb, with accelerators, such as the Van de Graaff genera-
a half-life Tl12 = 0.3789 yr, and hence has less tor, pulsed neutron beams (which can be
durability). Photoneutron sources employ fairly bunched, using the Mobley principle) can be
energetic spontaneous 'Y-ray emitters such as produced; a text on pulsed-neutron scattering
sodium-24 (24Na) to obtain neutrons from the technology has recently been issued. 30 Reactors
are, likewise, prolific (continuous) sources of
9Be + 'Y -+ n + 2a or 2H + 'Y -+ n + 1H neutrons. However, neutrons of different en-
reactions. In the former, the produced a-par- ergies are produced at different locations within
ticles may initiate additional neutron produc- the assembly, from which they may be extracted
tion from the 9Be content; in the latter, a via "canals" or beam tubes; the energy spread
deuterium target is disintegrated into neutrons of the fission neutrons is considerable and
and protons. although their actual energy can be reduced by
Nowadays, one of the most common means the insertion of "moderator" materials such
of producing neutrons is to direct accelerated as graphite (which also acts as a blanketing
charged particles, such as deuterons (d) or pro- shield), it is impossible without special ancillary
tons (p), from an accelerator onto a gaseous methods to acquire monoenergetic neutron
or solid target (of deuterium, tritium, helium-3, beams from a reactor.
lithium-7, or beryllium-9) to bring about a To get well-defined beams of definite energy
neutron-producing reaction such as one may use velocity-selector devices,8 such as
filters (an Sc-Ti combination transmits only
2H + d -+ n + 3He + 3.269 MeV, 0.002-MeV neutrons, while a 56Fe-Ti combina-
"D-D reaction" tion passes 0.024-MeV beams, and a 28Si filter
passes only 0.144-MeV beams) or "choppers"
3H + d-+n + 4He + 17.590 MeV, (cadmium-steel cylinders, absorptive cylinders
"D-T reaction" containing one or more nonabsorbing, trans-
NEUTRON 794

parent channels, set into rotation at such a ferent neutron classes come into usage. The
speed as to allow only a preselected velocity to principal categories are:
pass through). Alternatively, one may utilize (a) Ultracold neutrons, having energies around
the 'Y-ray flux of a reactor to produce mono- 10- 7 eV, whose velocities are therefore but a
energetic photoneutrons resonantly by directing few meters/sec, the corresponding temperature
it onto a suitable nuclear target material. High- being only 10-3 K, i.e., one-thousandth of a
energy, relativistic, neutron beams would, of . degree above absolute zero (in Kelvin degrees);
course, be produced from the bombardment of (b) Cold neutrons, having kinetic energies
appropriate targets by high-energy charged- :s 0.002 eV;
particle beams from a high-energy accelerator. (c) Thermal neutrons, an important class
Detection The methods used for the detec- whose kinetic energy of about 0.025 eV corre-
tion of neutrons depend upon their energy, and sponds to velocities that are compatible with
in all instances require indirect means of detec- tile velocity of molecules in thermal motion,
tion, since the absence of an electric charge i.e., at "room temperature" (T O:! 20°C O:! 293
prevents the neutron from being detectable K) v O:! 2.2 km/sec;
directly. Such secondary processes include the (d) Epithermal neutrons, having somewhat
observation of charged particles or, e.g., radia- higher energies, in the region ~ 0.5 eV;
tion, emanating from neutron-induced reactions, (e) Resonance neutrons, another especially
the detection of recoil protons or recoil nuclei important class, having energies in the range of
from collisions, the registration of fission frag- 1-100 eV that correspond to the region of
ments, or measurement of the heat produced in strong (resonant) absorption by nuclei.35
a given neutron-initiated reaction. The requisite Neutrons within the categories (a)-(e) are all
instrumentation 31 that responds to these considered "slow neutrons." Others are
secondary charged particles includes ionization (f) Intermediate neutrons, having energies
or fission chambers, Geiger or proportional from 0.001 to 0.5 MeV;
counters, scintilla tors or solid-state surface- (g) Fast neutrons, whose energies lie within
barrier detectors, photographic emulsions, the 0.5 to 15 MeV range;
cloud, bubble, or spark chambers, with ap- (h) Very fast neutrons, from IS to 50 MeV;
propriate shielding. The latter usually takes the (i) Ultrafast neutrons, from 50 to 10,000
form of copper shadow bars, borated paraffin MeV;
wax (sometimes impregnated with lithium (j) The rest, with energies above 10,000 Me V,
carbonate), lead, etc. Air scattering of neutrons constitute highly relativistic neutrons.
is considerable; electronic means of discrimina- Ultracold Neutrons Interesting recent de-
tion and background correction have to be velopments have opened up a new branch of
utilized. As an instance of energy-matching in knowledge and application in neutron phys-
the selection of a detector, fission chambers ics. 1,3,8,12,36 Ultracold neutrons, whose motion
clad with 235U are sensitive to slow neutrons; is comparable with human running speed, have
those clad with 238U respond to fast neutrons. the remarkable property of being totally re-
By modifying the design of a counter, its energy fleeted from solid surfaces, which enables them
region of best response can be changed: for the to be isolated within suitable metal (usually,
detection of slow neutrons, boron counters are aluminum or copper) or glass containers, or
frequently employed; by surrounding these trapped within magnetic storage rings (now
with a thick sheath of paraffin wax or other constructed with superconducting windings),
hydrogeneous material to slow down any fast after traveling along a selective guide tube.
primary neutrons they may, as so-called "long Because of their low energy, they can rise only
counters," be adapted to respond to fast neu- about a meter against the earth's gravity before
trons. Their response is triggered by the a-par- falling back. A proposal to create and trap
ticles that ensue from the lOB + n ~ a + 7Li + them from cold neutrons in refrigerated super-
2.78 MeV reaction which takes place within the fluid helium is currently being tried out.
boron-filled counter (usually containing gaseous The wavelength (in meters) of a neutron is
BF3 enriched in the lOB isotope). The neutron- related to its energy (in eV) as
activation technique also finds many applica- '\ = 2.86 X 10- 11
tions. Since the total neutron energy cannot be 1\
directly registered it is customary to adopt yE
time-of-flight spectrometry methods 32 ,33 in
which subnanosecond time resolutions over and hence if E is equal to 10- 7 eV, the wave-
protracted measuring runs are attainable. length is X = 9 X 10-8 m (i.e., 900 Angstr~m
High-energy neutron detection methods, con- units), about two orders of magnitude larger
stantly beinILimproved, have been discussed by than the spacing between the atoms in solids,
R. T. Siegel. which are thus impervious to a wave of ultra-
Categorization Just as it has become cus- cold neutrons . In effect, the wave is repelled
tomary to distinguish between "hard" and by the solid, as predicted and demonstrated
"soft" X-rays on the basis of their energy (I.e., (for higher-energy neutrons) by Fermi in
wavelength), so has a nomenclature for dif- 1945. The reflection is total, provided the ve-
795 NEUTRON

locity is below a critical value which varies reactions, except at resonance energies, when
from one material to another, being roughly 6 the probability may increase a hundredfold or
m/sec for nickel, iron, beryllium, or copper. more over a narrow incident energy range. For
Ultracold neutrons are defined as those having a example, radiative capture of neutrons on
total velocity below this critical value for a indium-115(1I5In) occurs reasonantly at En =
given material. Thus, v = 6 m/sec corresponds 1.457 eV; the capture probability at just this
to E = 2 X 10- 7 eV, which in turn represents a energy is about 180 times larger than when
temperature T = 2 X 10-3 K (two-thousandths measured at about! eV on either side of this
of a degree above absolute zero). Though only resonance value. For thermal neutrons (En =
sparsely available in the neutron flux from re- 0.025 eV), cadmium has a high capture proba-
actors, they have been successfully extracted bility: natural cadmium is, to 12.3%, comprised
and investigated. An obvious application is in of the isotope 113Cd, which displays a broad,
the determination of the mean lifetime of free powerful resonance for the absorption of neu-
neutrons to a higher accuracy than has hitherto trons about the energy En = 0.176 eV, over-
been feasible. However, unforeseen and as yet lapping with the thermal region. For this reason,
still not fully understood problems in maintain- cadmium is used in the fabrication of choppers
ing the necessary long containment times have and in the safety control rods of nuclear
hampered these measurements, although refine- reactors.
ment of the isolation and storage arrangements Neutrons in Fission and Parity Violation
is leading 38 to the desired conditions for the The general subject of fission is too broad for
experiment to be completed satisfactorily. Simi- coverage here;I,3,40-43 this Encyclopedia dis-
larly, ultra cold neutrons offer an ideal means of cusses it elsewhere (see FISSION). Most reactors
esta blishing finer limits to the value of the elec- operate through fission induced in a core con-
tric dipole moment (EDM) of the neutron. The taining an overcritical amount of 235U or 239pu,
accepted value 18 for Dn was determined at the whose break-up into fission fragments (medium-
ultracold-neutron facility situated in the Institut heavy nuclei, more neutrons, and radiation)
Laue-Langevin in Grenoble (France), while a when bombarded by thermal neutrons (i.e.,
result with even smaller error limits, namely neutrons slowed down by a graphite or deuter-
Dn/e = (4 ± 7.5) X 10- 25 cm, has been ob- ium "moderator" to "thermal" speeds) liberates
tained 39 at the B. P. Konstantinov Institute about 200 MeV of energy per fission event.
(USSR). This indicates that, at the 90% confi- Because some 2.5 neutrons are released from
dence level, IDn/el < 0.16 X 10- 23 cm. each fission event initiated by a single thermal
Because of their comparatively long wave- neutron, capable of bringing about further
length (longer than that of electron beams, fission, a "chain reaction" can occur readily,
albeit shorter than that of visible light), ultra- triggered by this multiplication of neutrons.
cold neutrons lend themselves to use in a neu- Through the use of absorptive cadmium control
tron microscope, complementing the familiar rods, the controlled operation of a fission reac-
electron microscope, or to examination of inter- tor can be maintained. In so-called "breeder
atomic forces and the motions of atoms in reactors," based upon a plutonium or thorium
solids, through the use of a neutron spectrome- cycle, as much or more fissionable material is
ter arrangement. Among other potentialities that produced in the course of operation as is used
they present is the possibility for the first time up from the primary fuel; hence a self-sustain-
of obtaining a pure neutron target for atomic ing power-producing plant, such as that in the
and nuclear experiments, without the need to Superphenix installation in France can be de-
employ subtraction procedures to correct for vised for the more conserving use of nuclear
the presence of impurities or other nuclei. fuel resources.
Slow Neutrons In this category, the most Parity violation, i.e., the inequivalence of
important classes are the thermal neutrons coordinate-system "handedness" in the descrip-
(which initiate fission reactions in, e.g., 235U tion of nuclear processes, evoked much atten-
or 239pu) and resonance neutrons (whose likeli- tion when first identified in weak interactions
hood of being captured by a nucleus in, e.g., a in 1956 . Although the weak interactions were
radiative capture process, peaks sharply at cer- found to manifest maximal parity violation, it
tain resonance energies, rising dramatically to was not considered to play any role in strong
a very high, narrow, maximum). Slow-neutron interactions. Recently, however, some indica-
capture in particular is a widely used process tions of its presence even in strong-interaction
for the production of radioactive isotopes from phenomena has come to light. A mechanism
stable nuclides. Whereas the probability for for parity violation in thermal-neutron-induced
elastic scattering of low-energy neutrons does fission 44 has been worked out by o. P. Sushkov
not change greatly with energy or emergence and V. V. Flambaum. 45 Similarly, an explana-
angle, that for neutron capture, in which radia- tion for parity violation in the radiative capture
tion or particles are emitted, foHows a charac- of polarized neutrons has been provided by
teristic diminishing trend that is inversely pro- D. F. Zaretskii and V. K. Sirotkin,46 while that
portional to the incident neutron velocity: the for neutron scattering at threshold (e.g., on
HI/v law" holds for such exothermic capture 124Sn) has been discussed by G. Karl and D.
NEUTRON 796

Tadic. 47 The first clear evidence for parity- The values of the neutron's electric and mag-
non conserving spin-rotation of neutrons passing netic polarizabilities have not yet been estab-
through matter has been observed in tin isotopes lished definitively;1 theoretical estimates of the
(l24Sns 117Sn, and natural Sn) by M. Forte electric polarizability indicate typically an =
et al. 4 8.5 X 10-49 m 3 but experimental values are in-
Fast Neutrons Above the threshold energy, conclusive, whereas up to the present the mag-
a wide variety of interaction processes is avail- netic polarizability has not been measured at
able for neutrons to undergo. These include all.
elastic and inelastic (Le., excitation) scattering Condensed Matter (Neutron Stars) The ex-
from nuclei, from which the neutrons emerge treme densities associated with nuclear matter,
with effectively unchanged, or diminished, averaging about 10 17 kg/m 3 , are encountered
energy, multineutron emission, in which more in practice only in the highly compressed mat-
than one neutron is ejected by each incident ter of neutron stars, in which the immense in-
projectile neutron, particle or multiparticle ward gravitational attractive force effectively
release from nuclei following neutron capture, "squeezes" atomic electrons onto nuclear pro-
radiative capture (whose likelihood diminishes tons to produce a highly condensed "neutron
with energj;), even fast-neutron fission, induced, fluid" as the stellar material. Such neutron
e.g., in 22Th or 238U. A major branch of stars 52 ,53 constitute the end product of a
nuclear-reaction studies deals with neutron- stellar evolution sequence from protostar to
induced processes which play a vital role in main-sequence object, red giant and variable
nuclear physics and astrophysics. phase, thence to the white-dwarf stage and, via
Ultrafast Neutrons Already at En = 6.32 a supernova explosion if the mass transcends
Me V the (classical) kinetic energy of a neutron the Chandrasekhar limit of 1.44 solar masses,
differs by I % from that calculated relativis- to the neutron-star condition, having a radius of
tically; above about 50 MeV the neutrons are but a few kilometers and rotating rapidly, as
distinctly relativistic. At these high energies, a "pulsar" emitting radio signals.
multiple particle emission is, with increasing Neutron Diffraction and Interferometry Over
frequency, found to occur when neutrons in the energy interval En = 0.001-10 eV, the wave-
the 50-500 MeV range are directed onto length of neutrons, according to the formula
nuclear targets. The mechanism is in general above, ranges from about A. = 0.1 to 10ft..
complicated: in an initial step, knock-on Such a wavelength is well suited to the exam-
nucleons are likely to be ejected directly, ination of the structure of materials, akin to
followed by the redistribution of the surplus investigations pursued with X-ray (Bragg)
excitation energy in the residual nucleus, which diffraction. Neutron diffraction on the atomic
deexcites by the emission of 'Y-rays and one or lattice planes of a crystal can occur by way of
more additional nuclear particles. Spallation is reflection or transmission; many designs of neu-
often caused by ultrafast neutrons. tron spectrometers have been employed in a
Neutron Polarization As spin-! particles wide variety of solid-state and material studies,
with a nonzero spin magnetic moment, neu- making use of interferometry. 1,3,14,54,55
trons can be polarized,16 Le., oriented in the Other Applications Neutron applications are
direction of an applied external magnetic field. too numerous to list exhaustively.l,3, 56 By way
This magnetic moment may, at least in part, of example, one might make mention of neu-
account for the small attractive force that tron radiography in studies of paintings, in
acts between a neutron and an electron, the metal science and in medical investigations.
first evidence of which Fermi discovered in Neutrons have been used extensively for other
1947. Thus, polarized neutron beams can be biomedical applications, including biotherapy,
produced, and used in the derivation of basic in the production of radioisotopes and infra-
information on nuclear forces or othercharac- red detectors or the enhancement of computer
teristics. 1,3-5,12,49-51 Several methods exist for memory devices or food preservation, in many
the production of polarized neutron beams: analytical applications, including activation anal-
partial polarization can be induced by the pas- ysis, and for such diverse purposes as assaying,
sage of a neutron beam through a slab of mag- moisture gauging and well-logging, forensic and
netized iron (the "Bloch effect," first established archaeological investigations, solid-state research
in 1936/37 by Felix Bloch). Neutrons can be and chemical molecular studies. Their use to
partially polarized by scattering: the use of such ends in benefitting mankind "has been
double-scattering experiments (in which the both beneficient and benevolent"56 and has
first scattering causes the polarization, and the far outweighed the potential misuse in explosive
second scattering analyzes it) is particularly fission devices.
noteworthy. The neutrons produced in reac-
tions can also be partly or almost wholly ERIC SHELDON
polarized. 50 The {3-decay of polarized neutrons
has been used to test the time-reversal invariance References
of the weak interaction; polarized neutrons have 1. Schofield, P. (Ed.), "The Neutron and its Appli-
also been employed in other investigations of cations 1982," Proceedings of an International
time-reversal invariance in nuclear reactions. Conference to commemorate the 50th anniver-
797 NEUTRON

sary of the discovery of the neutron, Cambridge shop on Nuclear and Particle Physics at Energies
(UK), Sept. 13-17, 1982; Conference Series Num- up to 31 GeV," Los Alamos National Laboratory
ber 64, Bristol and London, Institute of Physics, report LA-8775-C, p. 499 (1981).
1983. 23. Brando, T., Fainberg, A., Kalogeropoulos, T.,
2. Chadwick, J.,Proc. Roy. Soc. London Ser. A 136, Michael, D., and Tzanakos, G., Nucl. Instrum.
(1932); reproduced in "Foundations of Modern Methods 180,461 (1981).
Physics," (R. T. Beyer, Ed.), p. 5, New York, 24. Goldman, T., and Nieto, M. M., Phys. Lett.
Dover, 1949. 112B(6), 437 (1982).
3. Sheldon, E., (Ed.), "Proceedings of the Interna- 25. Goldberg, M. D., and Harvey, J. A., "Neutrons,"
tional Conference on the Interactions of Neutrons Section 8f, pp. 8-218 to 8-253, in the "American
with Nuclei," Lowell, July 6-9, 1976, Vols. I & Institute of Physics Handbook," 3rd Ed. (D. E.
II, ERDA Conference Report CONF-760715-Pl Gray, Ed.), New York, McGraw-Hill, 1972.
& P2, Oak Ridge, TIC-ERDA, 1976. 26. Particle Data Group (M. Roos, F. C. Porter,
4. Neve de Mevergnies, M., Van Assche, P., and M. Aguilar-Benitez, 1. Montanet, Ch. Walck, R. 1.
Vervier, J. (Eds.), "Nuclear Structure Study with Crawford, R. 1. Kelly, A. Rittenberg, T. G.
Neu trons, Proceedings of an International Con- Trippe, C. G. Wohl, G. P. Yost, T. Shimada, M. J.
ference, Antwerp, 19-23 July, 1965," Amsterdam, Losty, G. P. Gopal, R. E. Hendrick, R. E. Shrock,
North-Holland Publ. Co., 1966. R. Frosch, 1. D. Roper, and B. Armstrong (Tech-
5. Ero, 1., and Sziics, J. (Eds.), "Nuclear Structure nical Associate)), Phys. Lett. I11B, 1 (1982).
Study with Neutrons," Proceedings of the Inter- 27. References in Ref. 25, p. 8-253.
national Conference, Budapest, 31 July-5 Aug., 28. Barschall, H. H., Ann. Rev. Nucl. Sci. 28, 207
1972, London and New York, Plenum Press, 1974. (1978).
6. Feld, B. T., in "Experimental Nuclear Physics," 29. Walko, R. J., Bacon, F. M., Bickes, R. W. Jr.,
(E. Segre, Ed.), Vol. 2, p. 209, New York, John Cowgill, D. F. Jr., Boers, J. E. Jr., Riedel, A. A.
Wiley & Sons, 1953. Jr., and O'Hagan, J. B. Jr., J. Vac. Sci. Technol.
7. Amaldi, E., in "Encyclopedia of Physics/Hand- 18(3),975 (1981).
buch der Physik," (S. Fliigge, Ed.), Vol. 38/2, 30. Windsor, C. G., "Pulsed Neutron Scattering,"
p. 1, Berlin, Springer-Verlag, 1959. London, Taylor & Francis, 1981.
8. Feshbach, H., and Sheldon, E., Phys. Today 30(2), 31. Barschall, H. H., in "Encyclopedia of Physics/
40 (1977). Handbuch der Physik," (S. Fliigge, Ed.), Vol.
9. Curtiss, 1. F., "Introduction to Neutron Physics," 45, p. 437, Berlin, Springer-Verlag, 1958.
New York, Van Nostrand Reinhold, 1958. 32. Cranberg, 1., and Rosen, 1., in "Nuclear Spectros-
10. Marion, J. B., and Fowler, J. 1., (Eds.), "Fast copy," (F. Ajzenberg-Selove, Ed.), Part A, p. 358,
Neutron Physics," Part I: Techniques, 1960; New York, Academic Press, 1960; Cranberg, 1.,
Part II: Experiments and Theory, 1963; New Scientific American 210(3), 79 (1964).
York, Wiley-Interscience. 33. Finckh, E., in "Nuclear Spectroscopy and Reac-
11. Beckurts, K. H., and Wirtz, K., "Neutron Physics," tions," (J. Cerny, Ed.), Part B, p. 573, New York,
Berlin, Springer-Verlag, 1964 (completely re- Academic Press, 1974.
vised English edition of "Elementare Neutronen- 34. Siegel, R. T., in "Encyclopedia of Physics/Hand-
physik"). buch der Physik," (S. Fliigge, Ed.), Vol. 45, p.
12. Gurevich, I. I., and Tarasov, 1. V., "Low-Energy 487 (1958).
Neutron Physics," (R. I. Sharp and S. Chomet, 35. Rainwater, J., in "Encyclopedia of Physics/Hand-
Eds.), Amsterdam, North-Holland Publ. Co., buch der Physik," (S. Fliigge, Ed.), Vol. 40, p.
1968. 373 (1957).
13. Bacon, G. E., "Neutron Physics," London, 36. Luschikov, V. I., Phys. Today 30(6),42 (1977).
Wykeham Publications, 1969. 37. Golub, R., Mampe, W., Pendlebury, J. M., and
14. Dachs, H. (Ed.), "Neutron Diffraction," Berlin Ageron, P., Scientific American 240(6), 134
and New York, Springer-Verlag, 1978. (1979).
15. Cork, B., Lambertson, G. R., Piccioni, 0., and 38. Kosvintsev, Yu. Yu., Kushnir, Yu. A., Morozov,
Wenzel, W. A., Phys. Rev. 104, 1193(L) (1956). V. I., and Terekhov, G. I., JEPT Lett. 31(4),236
16. Alvarez, 1. W., and Bloch, F. Phys. Rev. 57, 111 (1980).
(1940). 39. Altarev, I. S., Borisov, Yu. V., Brandin, A. B.,
17. Greene, G. 1., Ramsey, N. F., Mampe, W., Pendle- Egorov, A. I., Ezhov, V. F., Ivanov, S. N., Loba-
bury, J. M., Smith, K., Dress, W. D., Miller, P. D., shev, V. M., Nazarenko, V. A., Porsev, G. D.,
and Perrin, P.,Phys. Rev. D20, 2139 (1979). Ryabov, V. 1. Serebov, A. P., and Tal'daev,
18. Dress, W. B., Miller, P. D., Pendiebury, J. M., R. R., JEPT Lett. 29(12), 730 (1979); see also
Perrin, P., and Ramsey, N. F., Phys. Rev. DIS, Altarev, I. S., Borisov, Y. V., Borovikova, N. V.,
9 (1977). Brandin, A. 8., Egorov, A. I., Ezhof, V. F., Ivanov,
19. Zhitnitskii, A. R., and Khriplovich, I. B., Sov. J. S. N., Lobashev, V. M., Nazarenko, V. A., Porsev,
Nucl. Phys. 34(1),95 (1981). G. D., Ryabov, V. 1., Serebov, A. P., and Taldaev,
20. McReynolds, A. W., Phys. Rev. 83, 172 & 233 R. R., Nucl. Phys. A341, 269 (1980) and Phys.
(1951). Lett. 102B, 13 (1981), and N. F. Ramsey, Rep.
21. Dabbs, J. W. T., Harvey, J. A., Paya, D., and Prog. Phys. 45,96 (1982).
Horstmann, H.,Phys. Rev. 139, B756 (1965). 40. Loveland, W. D., "Nuclear Fission," Section 8g,
22. Kalogeropoulos, T., in "Proceedings of the Work- p. 8-253ff, in the "American Institute of Physics
NEUTRON ACnV AnON ANALYSIS 798

Handbook," 3rd Ed. (D. E. Gray, Ed.), New York, neutrons produced by a research-type nuclear
McGraw-Hill, 1972. reactor provides the best sensitivities of detec-
41. Vanderbosch, R. and Huizenga, 1. R., "Nuclear tion for most elements, and is the most fully
Fission," New York and London, Academic developed and most widely used form of the
Press, 1973. nuclear activation analysis method-hence this
42. Michaudon, A., "Nuclear Fission," in Adv. in discussion will be limited to NAA, particularly
Nucl. Phys. 6, 1 (1973). with reactor thermal neutrons.
43. Michaudon, A., Phys. Today 31(1), 23 (1978). Neutron Sources Lower fluxes of thermal
44. Danilyan, G. V., Vodennikov, B. D., Dronyaev, neutrons (but still useful for teaching purposes,
V. P., Novitskii, V. V., Pavlov, V. S., and Borovlev, and for the determination of major, minor, and
S. P., Sov. J. Nucl. Phys. 27(1),21 (1978 ). a few trace elements) can be produced with
45. Sushkov, O. P., and Flambaum, V. V., Sov. J. commercially available isotopic sources-(o:, n)
Nucl. Phys. 33(1),31 (1981). sources or 252Cf spontaneous-fission neutron
46. Zaretskii, D. F. and Sirotkin, V. K., Sov. J. Nucl. sources, or with modest-sized Cockcroft-WaIton
Phys. 32(1),54 (1980). or Van de Graaff accelerators. Since all sources
47. Karl , G., and Tadic, D., Phys. Rev. C20(5), 1959 of neutrons produce neutrons of appreciable
(1979). kinetic energies, they must be slowed down to
48. Forte, M., Heckel, B. R., Ramsey, N.-F., Green, thermal velocities (a mean kinetic energy of
K., Green, G. L., Byrne, J., and Pendlebury, about 0.025 eV, at 20 c C) by use of a suitable
J. M., Phys. Rev. Lett. 45(26),2088 (1980). moderator-usually water or some other hy-
49. Breit, G. and McIntosh , J. S., in "Encyclopedia of drogenous material. Where activation by fast
Physics/Handbuch der Physik," (S. Fliigge, Ed.), neutrons is desired, no moderator is used.
Vol. 41/1 , p. 466 , Berlin, Springer-Verlag, 1958. Far higher fluxes of thermal neutrons are
50. Walter, R. L., "Polarization Phenomena in Nu- produced in typical research-type nuclear
clear Reactions Observed in Neutron Studies," reactors. For reactors operating at steady power
in "Nuclear Spectroscopy and Reactions," Part B, levels of 0.1, I, or 10 megawatts, thermal-
(1. Cerny, Ed.), Section VI.C, p. 636, New York neutron fluxes of, respectively, about 10 12 ,
and London, Academic Press, 1974. 10 13 or 10 14 n cm- 2 S-1 are available for the
51. Alfimenkov, V. P. Pikel'ner, L. B., and Sharapov, activ~tion of samples. Most such reactors are
E. I., Sov. J. Part. Nucl. 11(2),154 (1980). of the pool type, in which the reactor core is
52. Irvine, J. M., "Neutron Stars." Oxford, Clarendon located at the bottom of a deep pool of circu-
Press, 1978. lating high-purity water. The water serves as
53. Sheldon, E., Nukleonika 23(12), 1091 (1978). a transparent coolant, neutron moderator, and
54. Bacon, G. E., "Neutron Diffraction," 2nd Ed., biological shield. The neutrons produced by the
Oxford, Clarendon Press, 1962. thermal-neutron fission of 235U have high initial
55. Werner , S. A., Phys. Today 33 (12), 24 (1980). energies (ranging from about 0.5 MeV up to
56. Bromley, D. A., Phy s. Today 36(12), 30 (1983). many MeV), but collisions with water protons
rapidly slow them down to thermal energies.
Cross-references: COLLISIONS
OF PARTICLES, Where irradiation with only epithermal plus fast
CROSS SECTIONS AND STOPPING POWER, ELEC- neutrons is desired, samples are enclosed in a
TRON, FERMI-DIRAC STATISTICS AND FERM- thin cadmium container-the Cd serving to
IONS, FISSION, NEUTRON ACTIVATION ANAL· remove almost quantitatively all neutrons with
YSIS, NEUTRON DIFFRACTION, NUCLEAR energies less than about 0.4 eV. Using a fast-
REACTIONS, NUCLEAR REACTORS, PARITY, transfer pneumatic tube (for production and
PROTON, QUARKS, RADIOACTIVITY, detection of induced radionuclides with half
RESONANCE. lives in the range of seconds to minutes), sam-
ples and standards are activated and rapidly
counted one at a time. Where the interest is in
NEUTRON ACTIVATION ANALYSIS the production and detection of longer-lived
induced activities (haIf lives of an hour or
Neutron activation analysis (NAA) is a powerful longer), longer irradiations of many samples
and sensitive method of elemental analysis and standards are carried out, simultaneously,
based upon the quantitative detection of radio- followed by sequential counting of the acti-
nuclides produced in samples via nuclear reac- vated specimens at selected longer decay times.
tions resulting from the high-flux thermal- Types of Neutron Reactions With only a few
neutron bombardment of the samples. There exceptions, thermal neutrons interact with
are also other types of nuclear activation anal- atomic nuclei by only the (n, 'Y) reaction, pro-
ysis in which the bombarding particles are ducing a nucleus of the same atomic number
energetic charged particles (e.g., protons from (Z) as the target nucleus, but now one unit
a cyclotron), energetic photons (e.g., brems- larger in mass number (A). In most cases (those
strahlung photons from an electron linear of use in the conventional radioactive-decay
accelerator), or fast neutrons (e.g, 14 MeV NAA method), such (n, r) products are radio-
neutrons from a deuteron accelerator). How- nuclides. In most instances, being neutron-rich
ever, activation with the high fluxes of thermal radionuclides, these decay by (3- emission, and
799 NEUTRON ACTIVATION ANALYSIS

in all but a few cases the 13- emission is accom- it is simply:


panied by the emission of gamma-ray photons
of one or more characteristic, sharply defined dN*/dt = -dN/dt =N¢a, (1)
energies. In (n, r) reactions, the r refers to the
prompt gamma rays emitted immediately after in which N is the number of such target nuclei
the neutron capture, as the highly excited in the sample, N* is the number of (n, r) prod-
product nucleus cascades down to its ground uct radionuclide nuclei it produces, and a is the
state. These prompt gammas are quite dif- thermal-neutron (n, r) cross section of the
ferent in their energies from the subsequently target nucleus. If the product nuclei are radio-
emitted radioactive-decay gamma rays. Prompt active, they will be decaying, even during the
gammas are also used, somewhat, in "prompt- irradiation, by the usual first-order radioactive
gamma NAA," with a sample being concurrently decay process:
bombarded with a much lower beam-tube flux
of thermal neutrons and counted on a r-ray -dN* = 'A.N* = In 2 N* = 0.69315 N*
spectrometer. For almost all elements prompt- dt T T'
gamma NAA is much less sensitive than radio-
active-decay NAA, and hence this discussion (2)
will be limited to the latter.
With sample bombardment by fast neutrons in which 'A. is the radioactive-decay rate con-
(e.g., 1-15 MeV neutrons), the cross sections stant of the product radionuclide, and T is the
for (n, r) reactions are very small, whereas the half-life of the radionuclide. The net rate of
cross sections for many (n, n'), (n, p), (n, a), formation of the radionuclide during the
and (n, 2n) reactions become appreciable- irradiation is thus its steady rate of formation
ranging from millibarns to about 1 barn (1 barn (N¢a) minus its ever-increasing rate of decay
is a reaction cross section of 10- 24 cm 2 per ('A.N*):
nucleus). For most target nuclei, fast-neutron
reaction cross sections are 1, 2, or more orders netdN*/dt=N¢a- 'A.N*. (3)
of magnitude smaller than thermal-neutron
(n, r) cross sections. This equation, when integrated by means of an
Of these five main kinds of neutron reactions, integrating factor, yields the basic equation of
the (n, r) reaction is always exoergic (usually NAA:
by 5-15 MeV), has no Coulomb barrier or
threshold energy, and produces a neutron-rich Ao = (-dN* /dt)o =N¢a(1 - e -O.69315tj/T),
product of the same element, one unit larger
in mass number. The (n, n') neutron inelastic- (4)
scattering reaction is always somewhat endo- in which A 0 represents the radionuclide disinte-
ergic and hence has a threshold energy, has no gration rate (in disintegrations per second, dps)
Coulomb barrier, and produces a metastable at the end of the irradiation (Le., at zero decay
isomer of the target nucleus (same Z and A). time), and tj is the duration of the irradiation
Metastable isomers usually decay by isomeric period, expressed in the same units of time as
transition, emitting characteristic gamma radia- the half-life, T.
tion. The (n, 2n) reaction is always consider- The N term may be expressed in terms of the
ably endoergic, has no Coulomb barrier, and weight of that element present in the sample,
produces a neutron-deficient product of the w (the sought-for unknown in the analysis of
same element (same Z) but one unit lower in a sample for that element):
mass number. Neutron-deficient radio nuclides
usually decay by orbital electron capture (EC), waNA
by 13+ emission (if energetically possible), or N=-- (5)
both. (n, p) and (n, a) reactions may be either AW '
endoergic or exoergic, but both exhibit effec-
tive threshold energies because of their Coulomb in which w is the weight of the element present
barriers. The (n, p) reaction forms a neutron- (in grams), a is the fractional isotopic abundance
rich product nucleus of the element one unit of the target stable nuclide (stable isotope)
lower in Z than the target nucleus, but of un- among the various stable isotopes of that ele-
changed mass number. The (n, a) reaction gen- ment (Le., a = exactly 1 for the 20 elements
erally produces a neutron-rich product nucleus- that are monoisotopic in nature), N A is Avo-
that of the element 2 units lower in Z than the gadro's number (6.022 X 10 23 atoms per gram
target nucleus, and 3 units lower in mass atom), and A W is the regular chemical atomic
number. weight of the element, in atomic mass units
Theory of the NAA Method When a sample (Le., the abundance-weighted mean of the
containing a particular kind of target nucleus atomic masses of its various stable isotopes).
(of a particular Z and A) is bombarded with a The parenthetical term in Eq. (4),
steady flux ¢ of thermal neutrons, the steady
rate of formation of (n, r) product nuclei from 1 - e-O.69315tj/T,
NEUTRON ACTIVATION ANALYSIS 800

is called the saturation term S, the only time- in which SOC refers to the PPcps at the start of
dependent term in Eq. (4), other than A o. It is the counting period.
a dimensionless term, ranging only from 0 (for In practice, one seldom uses an "absolute"
tilT = 0) to 1 (for tilT = 00). For tifT values method in NAA work, but instead employs a
of 0, 1.' 2, 3, 41 S~ ... , S a3cquires ~alue~ of, "comparator" method, in which samples and
respectively, 0, ~, 'i, ~, U, j!, ... , I.e., It as- standards of the elements of interest are acti-
ymptotically approaches the limiting value of vated and counted identically. Then, if the
1 with increasing tijT (at tifT = 10, S =10231 basic equations above are written for a sample
1024, or 0.999). The saturation term is very and for a standard of the element of interest,
important in NAA work, since it enables one and one equation is divided by the other, all
to accentuate the production and detection of of the parameters that are common to both
short-lived species by employing very short (Le., cp, a, tj, T, S, a, NA, A W, e, and tc) cancel
irradiation periods, followed rapidly by a short out, leaving only the very simple comparator
counting period. To accentuate the production equation (for the same decay time and same
and detection of medium-lived species, one counting period):
employs a longer irradiation period, followed
by a decay period long enough for the shorter- NPPC of sample peak w in sample
lived species to decay out, prior to a somewhat NPPC of standard peak w in standard
longer counting period-and similarly even
longer irradiation, decay, and counting periods (10)
to accentuate the production and detection
of even longer-lived species. Since sample and standard are counted at some-
All of the above equations apply to each what different decay times (on the same de-
radio nuclide being produced in the thermal- tector), one must be corrected to the decay
neutron irradiation of a multi-element sample, time of the other, via a slight modification of
independently of one another. Once the irradia- Eq. (7). The main advantages of the comparator
tion is ended, each radionuclide present decays method over the "absolute" method are (1) it is
away according to its particular half-life (Eq. much simpler, and (2) it is more accurate, since
(2». When Eq. (2) is integrated, one obtains the it does not depend upon accurate knowledge of
equation for the number of radionuclei of a the parameters common to both sample and
given type (N*) present after a decay period t, standard, many of which are not accurately
compared with their number (N o*) present at known (e.g., a values) or are difficult to mea-
the end of the irradiation: sure accurately (e.g., cp and e values).
Detection Sensitivities for Various Elements
N* =No*e-O.6931St/T. (6) Because of the wide range of each of the vari-
ables involved in NAA, among the different
Since -dN*ldt = AN*, and (-dN*ldt)o = ANo*, elements and their various (n, r) products, the
this equation can also be expressed in terms of lower limits of detection (LOD's) of the dif-
disintegration rates (A) instead of in terms of ferent elements cover a wide range, for any
number of nuclei present: given set of conditions. The LOD's for 68 ele-
A = Aoe-O.6931St/T. (7) ments (in the absence of significant levels of
other induced activities) are summarized in
In practice, one does not measure absolute dis- Table 1. These are LOD's calculated for a fairlr
integration rates, but instead (usually) one mea- typical set of conditions: cp = 10 13 n cm- 2 s-
sures the photopeak counting rate (PPcps) of (thermal neutrons), ti";;; 5 hours, tdecay = 0,
the principal decay gamma-ray photon emitted t c ..;;; 100 minutes with a 40 cm 3 Ge(Li) detec-
by the radionuclide. For counting with a par- tor at a distance of 2 cm. In each case, the LOD
ticular r-ray detector, at a particular counting is based upon the largest photopeak of the
geometry, PPcps = eA, where e is the overall radionuclide formed to the largest extent by
photopeak detection efficiency of the detector, the element, and is based upon a defined mini-
at that geometry, for gamma rays of that energy mum detectable (to a reasonable relative pre-
(also taking into account the fraction of the cision, Le., about ±20% of the value) number of
decays of that radionuclide that produce a NPPC, 30.
gamma ray of that energy). Thus, Eq. (7) can Table I includes all the nonradioactive or
also be expressed in terms of PPcps: nearly nonradioactive (Le., K, Th, and U) ele-
ments for which the NAA method is highly or
PPcps = (PPcps)oe-O.6931St/T, (8)
fairly sensitive, except the inert-gas elements
or, integrating over the counting period t e , since (for which NAA is also very sensitive, except
the number of net photo peak counts NPPC for He)-since the noble gases are not signifi-
accumulated during the counting period is cantly present in most solid or liquid samples.
NPPC = [(PPcps at SOC)T/0.6931S] (l - Noticeably absent from the table are the lowest
e-O.6931Ste/T): atomic number elements (H, Li, Be, B, C, N,
and 0), which are scarcely activated by thermal
NPPC = (NPPC)oe-O.6931St/T, (9) neutrons, and the elements P, Tl, Bi, which
801 NEUTRON ACTIVATION ANALYSIS

TABLE I. The precision of NAA measurement of an


element in a sample can be estimated, if other
LOD (/Lg) Elements sources of variance are carefully minimized,
from the counting statistics. For N observed
10- 7 _10- 6 In, Eu, Dy, Ho counts, the standard deviation (resulting from
10- 6 _10- 5 Mn, Sm, Au, Rh, Lu, Re, Ir the random nature of radioactive decay) is sim-
10- 5 -10- 4 Co, Cu, Ga, As, I, Cs, La, ply ±NI/2. Thus, if a certain number of gross
Er, W, Hg, U, Na, V, Br, photopeak counts (GPPC) are measured, for a
Ru, Pd, Sb, Yb, Th particular 'Y-ray peak, its standard deviation is
Sc, Ge, Sr, Te, Ba, Nd, Ta, ±(GPPC)I/2. If the underlying Compton con-
CI, Se, Cd, Gd, Tb, Tm, tinuum baseline counts (BLC) are estimated by
Hf, Pt summing the counts in channels on either side
AI, Zn, Mo, Ag, Sn, Ce, as, of the photo peak (using as many baseline chan-
K, Ti, Cr, Ni, Rb, Y, Pr nels as the number of channels included in the
10- 2 -10- 1 Mg, Zr peak), its subtraction from the GPPC gives
10- 1 -1 F,Ca,Nb the desired NPPC and its standard deviation:
1-10 Fe, Si ± [GPPC + BLC] 1/2 .
10-100 S,Pb As with any analytical method, errors are
possible. Discussion of these is beyond the
scope of this short summary, but a few such
sources of error deserve mention. They include
form only pure (3- emitters, with no accompa- particularly thermal-neutron self shielding and
nying gamma radiation. sample 'Y-ray self-attenuation. The experienced
Forms of the Method There are two forms activation analyst, however, recognizes situa-
of the NAA method: (l) the purely instrumen- tions in which such sources of error can be
tal, nondestructive form, based upon gamma-ray appreciable, and knows how to avoid them,
spectrometry only, and (2) the radiochemical minimize them, or correct for them.
separation, destructive, form which utilizes Applications A summary of the extensive
post-irradiation radiochemical separations with applications of the NAA method that have been
carriers, of the induced activities of interest. made, and are being made, is also beyond the
Nowadays, instrumental NAA usually involves scope of this short summary. However, it should
the counting of activated samples and standards, be noted that NAA has found extensive appli-
at selected decay times, with a high-resolution cation in essentially every field of physical
Ge(Li) or intrinsic Ge 'Y-ray detector, coupled science, biological science, and industry-as well
to a multichannel pulse-height analyzer (typi- as in such diverse fields as art, archaeology, and
cally of 4096 analysis and storage channels). crime investigation.
The instrumental form of the method has the
advantages of detecting and measuring many VINCENT P. GUINN
elements simultaneously, of not destroying the
sample, and of requiring a minimum of the References
analyst's time. However, in some kinds of
samples high levels of other induced activities 1. DeSoete, D., Gijbels, R., and Hoste, J., "Neutron
can raise the LOD for an element of interest Activation Analysis," New York, Wiley-Intersci-
by one or two orders of magnitude above its ence, 1972, 836 pp.
interference-free LOD (by producing a very 2. Guinn, V. P., "Activation Analysis," in "Treatise
high Compton continuum level in the pulse- on Analytical Chemistry," Part 1, Vol. 9, (I. M.
height spectrum, superimposed upon which Kolthoff and P. J. Elving, Eds.), New York, Wiley,
is the photopeak of interest). In such cases, 1971,pp.5583-5641.
one may have to resort to the more time- 3. Guinn, V. P., "Neutron Activation Analysis," in
consuming, sample-destructive radiochemical- "Physical Methods of Chemistry," Vol. 1, Part III
separation technique, prior to counting. D (A. Weissberger and B. W. Rossiter, Eds.), New
Sample Sizes, Precision, and Sources of Error York, Wiley, 1972, pp. 447-500.
In NAA work, samples (any kind of solid or 4. Guinn, V. P., and Hoste, J., "Neutron Activation
liquid material) can be as small as micrograms Analysis," in "Elemental Analysis of Biological
to as large as grams, depending upon how much Materials," (R. M. Parr, Ed.), Vienna, IntI. Atomic
sample is available or the maximum amount Energy Agency, 1980, pp. 105-140.
that can be used without resulting in counting 5. Kruger, P., "Principles of Activation Analysis,"
rates so high that the pulse-height spectrum New York, Wiley-Interscience, 1971,522 pp.
is markedly distorted by pulse pileup. Since 6. Lyon, W. S. (Ed.), "Guide to Activation Analysis,"
one part per million (ppm) by weight cor- New York, Van Nostrand-Reinhold, 1964, 186 pp.
responds to 1 I1g of element per gram of sample,
it is evident that the I1g LOD's tabulated above Cross-references: ACCELERATOR, PARTICLE; ISO-
are numerically the same as the various ppm TOPES; NEUTRON; NUCLEAR REACTIONS; NU-
LOD's in a one-gram sample. CLEAR REACTORS; RADIOACTIVITY.
NEUTRON DIFFRACTION 802

NEUTRON DIFFRACTION dominate in these pictures and that a one-elec-


tron atom, i.e., hydrogen, can be located and
An experiment by Laue, Friedrich, and Knip- detailed with much less accuracy. On the other
ping in 1912 demonstrated that x-rays were a hand, neutrons are scattered not by electrons
form of electromagnetic radiation, with a wave- but by the neucleus of an atom, and the way in
length of the same order of magnitude as the which the scattering power increases with the
distance apart (10-8 cm) of atoms in crystals. mass of the atom is very far from being a stead-
This meant that beams of x-rays could be dif- ily increasing function. The scattering power or,
fracted by crystals in a rather similar way to more precisely, what we call the "scattering
that in which an optical diffraction grating, in length" arises from the summation of two quite
which the elements are separated by about separate effects. The first of these depends on
10-5 cm, will produce a spectrum for visible light. the size of the nucleus, which has a radius pro-
As a result of Laue's discovery, a technique for portlOnal to the cube-root of the atomic weignt,
studying the underlying structure of solids by so that this effect does indeed increase with
"x-ray diffraction" has grown up. For any given atomic weight, but nevertheless fairly slowly.
solid, the end product of this technique is a Superimposed on this scattering, however, is
specification of the shape and content of the resonance scattering, which depends in a com-
building block, or "unit cell," out of which the plicated way on the actual structure of the
solid is built. The content is specified in terms nucleus and on its energy levels. This addi-
of "electron density," and it follows that the tional scattering often varies quite consider-
various atoms or ions which make up the mole- ably from atom to atom, and sometimes
cule of the substance can be identified. from isotope to isotope, as we advance up
A rather similar, but in some respects a much the periodic table. When we combine to-
more powerful, technique has grown up using gether the two effects, and thus assess the re-
beams of neutrons instead of x-rays. A neutron sultant scattering by a nucleus, we find that it
is often thought of simply as a particle, with a varies quite irregularly from atom to atom and
mass approximately equal to that of a hydrogen this is illustrated for elements at the lower end
atom, but in terms of wave mechanics a beam of the periodic system in Fig. 1. It will, however,
of neutrons can be regarded as a wave motion. be noted that there is a relatively small spread
If the neutrons are moving with velocity u, then of values among these scattering lengths. The
they can be considered to have a wavelength mean value for all the nuclei which have so far
equal to h/mu, where m is the mass of the neu- been measured is 0.62 X 10-12 cm, and practi-
tron and h is Planck's constant. If such a neutron cally all elements have values which lie between
beam is scattered by a solid, it will be distrib- a half and twice this average. As a result of this
uted in space as if it were radiation of this wave- we find that most elements are roughly equally
length. It so happens that for neutrons having "visible" to neutrons, though there are a few
energies equivalent to a temperature of a few very interesting exceptions. The practical out-
hundred degrees centigrade, which are readily come of this is that hydrogen atoms can be lo-
obtainable from nuclear reactors, the wavelength cated quite accurately in whatever environment
is about 10-8 cm, i.e., lA, which, as we have seen they are found, and this has meant important
above, is about equal to the interatomic distance advances in our knowledge of the role of hydro-
in solids. It was shown in 1936 that neutrons, gen bonds and molecules of water of crystalli-
then obtainable only from a radium-beryllium zation in building up the structures of both in-
source, could indeed by diffracted by solids. organic and organic crystals. At the same time,
However, it is only since nuclear reactors have we have often been able to get much improved
produced intense beams of suitable neutrons information on the thermal motion of mole-
that the application of diffraction techniques to cules, particularly in those common cases where
the study of solids has proved worthwhile. hydrogen atoms are found on the outside of
Since high-intensity neutron beams are only molecules and which, therefore, provide a very
available at a limited number of research insti- good index of the molecular movement. The
tutions throughout the world, we shall be con- technique of detection becomes much more
cerned only with their application to problems powerful if we can use deuterated material, in-
which cannot be solved by any other method. stead of ordinary hydrogen. Deuterium has a
In particular, we shall enquire what can be neutron scattering length of 0.65 X 10-12 cm
achieved with neutrons which cannot be found and is, therefore, a "good average" atom,
out by using a beam of x-rays, and we shall see whereas ordinary hydrogen, at 0.38 X 10-12 cm,
the answers to this question by making a com- is somewhat below average. This comparison
parison of the ways in which atoms and solids provides a very good example of a difference be-
scatter x-rays and neutrons. X-rays are scattered tween the scattering behavior of two different
by the outer, extranuclear, electrons in an atom, isotopes of an element, arising from differences
and it is for this reason that x-ray diffraction in the nuclear structures.
studies produce a picture of electron density. Another important field of chemistry to which
It follows that heavy atoms, such as lead and neutron diffraction has contributed some useful
uranium which contain many electrons, will pre- results is in the study of the compounds of ura-
803 NEUTRON DIFFRACTION

1·2 Sc
12
10- cm
Ni
1·0 Cl Fe
N

0·8
0
£01 0·6
~b
...J 0.4
Cr
01 NaAl 5 In
c:

8-
·c Co
GI 0'2 A

If)
0 v
20 40 60 80 100 120
ATOMIC WEIGHT
-0,2 Li

H Ti Mn
-0·4
FIG. 1. The variation of the nuclear scattering amplitude of elements for neutrons, shown as a function of
atomic weight, in units of 10-12 cm.

nium, and post-uranic elements, with nitrogen tains 5 unpaired electrons, in manganese salts.
and oxygen. In the case of x-ray studies, the 92 Such atoms or ions scatter additional neutrons,
electrons of a uranium atom completely over- making an additional contribution to the scat-
shadow the seven and eight of nitrogen and oxy- tering length by an amount which is propor-
gen respectively. For neutrons, however, the tional to the magnetic moment. If the magnetic
value of the scattering length for uranium moments in such a material are not arranged in
(0.85 X 10-12 cm) is actually less than the value any regular single direction, but point haphaz-
for nitrogen (0.94 X 10-12 cm) and is only frac- ardly as in a paramagnetic material, then there
tionally greater than that of oxygen (0.58 X will not be any well-defined diffracted beams
1O-12 cm). but there will be a broadly distributed contribu-
Of intrinsic interest, and having particular tion to the scattered background. This contri-
significance in the growing field of study of bution may be a little difficult to identify but,
chemicals of biological importance is the so- nevertheless, the identification can be achieved
called "anomalous scattering" of neutrons by a and the phenomenon can be confirmed. In other
few elements, such as cadmium and samarium, magnetic materials, however, all the magnetic
for which the nuclear scattering amplitude is a moments in a single domain lie parallel to a sin-
complex quantity, with real and imaginary com- gle direction, and in the particular case of a
ponents. This can provide crucial information ferromagnetic material they all point algebrai-
for determining the structure of large molecules cally in the same sense. In this circumstance
such as insulin if one of these anomalous scat- the magnetically scattered neutrons contribute
terers can be incorporated in the molecule. specifically to the diffracted beams and the in-
So far we have been considering the process tensity of these is observed to vary with increase
whereby the neutron is scattered by the atomic of temperature, falling to a minimum at the
nucleus, and this is a process which occurs for approach of the Curie temperature, above which
all atoms. There is, however, an additional scat- no ferromagnetic alignment takes place. In the
tering which takes place for magnetic atoms, case of antiferromagnetic materials, in which
i.e., for atoms which have a resultant magnetic the moments lie parallel to a single direction
moment on account of the fact that the atoms but alternately up and down with opposite alge-
contain unpaired electrons. Examples of this are braic sense, the neutron data are extremely in-
an atom of iron in metallic iron, which appears formative. In such a material it will be appre-
to contain 2.2 unpaired electrons, and the dou- ciated that, from a magnetic point of view the
bly-charged manganese ion Mn++, which con- repeat distance (considering the alternate -+ and
NEUTRON DIFFRACTION 804

- moments) is twice the repeat distance which tions but with a net balance in one direction.
is apparent when only the chemical nature of Moreover, with further research, it has been
the atoms is considered. This means that extra demonstrated that these structures are only
diffraction spectra will be produced at smaller the simplest examples of a wide range of "mag-
angles of scattering, corresponding to what netic architecture" which it is now possible to
would happen if the inter-line spacing of an opti- draw in detail as a result of study with neutron
cal diffraction grating were doubled. The exis- beams. This later work, devoted to the iron
tence of antiferromagnetism can, therefore, be group of transition elements and the elements
detected very directly by noting the appearance of the rare earth group, has identified a variety
of these extra spectra, particularly if the neu- of noncollinear arrangements of magnetic mo-
tron diffraction pattern is compared either with ments such as the spiral spins in MnAu2, the
an x-ray pattern or with a neutron pattern taken umbrella structure in erSe, composite structures
at a higher temperature at which the regular in holmium and erbium and the complicated
magnetic arrangement has broken down. Such a structures, not yet fully understood, which
comparison of results obtained at two different occur in metallic chromium.
temperatures is illustrated in Fig. 2. Results such In our discussion so far, we have implicitly
as these have established the antiferromagnetic assumed that when a neutron is scattered by
structures of a variety of materials and have dem- an atom it is scattered elastically and does not
onstrated the true nature of ferrimagnetism, as lose any of its energy. This is no more than a
for example in the ferrites in whlch moments first approximation to the truth, because atoms
are directed in both positive and negative direc- are by no means rigidly fixed but are in vibra-
tion about their mean positions because of their
possession of thermal energy. A neutron which
A
makes collision with an atom may, therefore,
lose or gain a quantity of energy. If we keep in
mind the fact that atoms in a solid are not
4K isolated, and that the movement of one atom
c will invariably affect to some degree the motion
of its neighbors, it becomes fruitful to regard
the interchange of energy as occurring between
the neutron and the lattice vibrations of the sol-
id. Indeed we speak of "phonons" which are
the embodiment in particle form, from the point
of view of wave mechanics, of the quanta of
energy among the crystal vibrations. If we could
>-
measure accurately the interchanges of energy,
~ ~IO~----~2~0----~~----~--~~~~--- then we could learn about, and indeed study in
~ ANGLE OF SCATTERING. 28. IN DEGREES
detail, the phonon spectra and the dispersion
I- law for the solid. In fact, for neutron scattering,
z but not for x-rays, such a measurement can be
C made and this gives a quite unique value to the
use of inelastic neutron scattering for studying
solids. The particular supremacy of neutrons
becomes clear if we consider the actual energies
420K
of a neutron and x-ray quantum which possess
the same wavelength. We find in fact that the
latter is roughly 10 5 times larger. Thus, whereas
the energy of a neutron of wavelength 11\ is
about equal to that of a quantum of crystal
energy, yet the energy of I A x-ray is 10 5 times
greater. It follows, therefore, that if a neutron
B gains or loses such a quantum, then its own en-
ergy will be greatly changed; for example, it
30 40 50
could easily be roughly doubled or halved. On
FIG. 2. A comparison of neutron diffraction patterns the other hand such an interchange for an x-ray
taken at 4 and 420 K for an antiferromagnetic alloy, would be quite insignificant and the resulting
AU2Mnt.7Alo.3. At the lower temperature, intense change of wavelength could not be detected. It
magnetic lines A and B appear, but at the higher tem- is in fact possible, therefore, to measure both
perature, where very little magnetic order remains, momentum and energy changes in neutron-
these extra lines have practically disappeared. The pat- phonon interchanges, and this information leads
terns also show the composite nature of the nuclear directly to the details of the dispersion law in
sC,llttering line C at low temperature, which occurs the solid. The full power of these methods can
becat.:~e the crystal symmetry changes from cubic to be achieved only by detailed and extensive ob-
tetragon,,'i when the magnetic order becomes servations with single crystals. Nevertheless a
established. relatively simple technique of neutron spectro-
80S NOISE, ACOUSTICAL

scopy is available for powdered and polycrys- converting systems, noise has become a major
talline samples, which has rather similar aims problem. Some of its harmful effects are inter-
to conventional infrared and Raman spectro- ference with mental and skilled work, impair-
scopy, but with some distinctive advantages. In ment of sleep, creation of emotional distur-
particular, any vibrations in which hydrogen bances, damage to hearing and a deterioration
atoms are involved are greatly enhanced, and of health and well-being. Consequently, the
moreover, the selection rules which limit the control and reduction of noise has become an
observation of transitions in optical spectra do important science.
not generally apply to the neutron spectra of in- Adequate measuring means are a prime re-
elastic scattering. quirement in the scientific control and reduc-
As the intensity of the beams of neutrons tion of noise. Even then, the problem of estab-
available from research reactors has steadily lishing a true relationship between the subjective
increased, with roughly a tenfold increase each and objective properties of noise is difficult
ten years, these techniques have become pro- because of the many different aspects of human
gressively more powerful and determinative, reaction to noise. The first relationship between
leading to a steadily widening view of solids and the subjective and objective measurement of
liquids in the several unique respects which we sound is the simplified rule relating loudness in
have discussed. A neutron flux of 1.2 X 10 15 sones to the loudness level in phons. The loud-
cm- 2 sec- I is now available but it may be diffi- ness scale in sones is proportional to the average
cult to increase the flux beyond this. Further person's estimate of the loudness. Also, the
progress is being made using linear accelerators loudness level, P, of a given sound, in phons, is
from which pulsed beams of protons generate numerically equal to the median sound pressure
neutrons by reactions with heavy-element tar- level, PL' of a free progressive wave at a fre-
gets. However, it is likely that the limitations in quency of 1000 Hz presented to listeners facing
the use of neutron-diffraction techniques will the source, which is judged by the listeners to
continue to be set by the limited availability of be equally loud. The sound pressure level, PL' is
adequate neutron sources. It may fairly be said defined as
that many promising applications have never
been tested.
PL = 20 loglo .!!..- (decibels) (1)
G.E.BACON Po
where P is measured sound pressure, in micro-
References bars, and Po = 2 X 10-4 micro bars.
The relation between sones, S, and phons, P,
Bacon, G. E., "Neutron Diffraction," Third Ed;' is given by
London, Oxford Univ. Press, 1975.
Dachs, H., "Neutron Diffraction," Berlin, Springer- S = 2(P-4 0)/1 0 (2)
Verlag, 1978.
Izyumov, Yu. A., and Ozerov, R. P., "Magnetic Neu- Referring to Eq. (2), a loudness level of 40
tron Diffraction," New York, Plenum Press, 1970. phons produces a loudness of one sone, and
Marshall, W., and Lovesey, S. W., "Theory of Thermal a loudness level of 80 phons produces a loud-
Neutron Scattering," London, Oxford Univ. Press, ness level of sixteen sones, etc.
1971. The simplest means for the measurement of
Squires, G. L., "Introduction to the Theory of Thermal noise is the sound level meter, an instrument
Neutron Scattering," Cambridge, Cambridge Univ. comprising a microphone, an amplifier, fre-
Press, 1978. quency weighting networks, and an output
meter. The characteristics of the frequency
Cross1:eferences: DIFFRACTION BY MATTER weighting networks in the meter are based upon
AND DIFFRACTION GRATINGS, MAGNETISM, the equal loudness contours of hearing for dif-
NEUTRON, PARAMAGNETISM, X-RAYS. ferent levels.
More sophisticated means for measuring noise
include octave band and one-third octave band
sound analyzers that supply information on the
NOISE, ACOUSTICAL sound level in various frequency ranges. These
analyzers are used for research on the reduction
Strictly defined, noise is any unwanted sound, of machine noise, transmission and other areas
whether pleasant or unpleasant. More com- where information on the sound levels in spe-
monly, however, sounds that are unpleasant and cific frequency bands is required. Narrow-band
disturbing, or that mask desired sound, are analyzers may be used to obtain the spectrum
termed noise. Thus, noise, in a general sense, of a noise. The sound-pressure spectrum level is
may be thought of as any sonic disturbance. that level within a frequency band of 1 Hz. This
Depending upon the degree of pitch distribu- level is plotted against frequency to obtain the
tion, intensity, and persistance, noise can range spectrum frequency characteristic of the noise.
from being merely annoying, to hazardous or If the spectrum level of a noise is known, so-
injurious. In our highly industrialized society, phisticated means may be used to relate the
with its rapid growth of energy-producing and objective to the subjective qualities of the noise.
NOISE, ACOUSTICAL 806

As noted earlier, noise abatement has become may provide sufficient insulation under such
an important science. For instance intensive conditions.
research on the quieting of automobiles has There are now federal regulations on permis-
been in progress for three decades, with out- sible noise in industry as given in the reference
standing results; some of the major problems below. Examples of the permissible noise ex-
remaining to be solved involve wind- and road- posures are as follows: for 8 hours per day the
induced noise. Similarly, research has been permissible sound level is 90 dBA, for 4 hours
carried out on the reduction of noise of all per day 95 dBA, for 1 hour per day 105 dBA, etc.
types of household appliances employing mo-
tors, fans, compressors, pumps, gears, and other HARRY F. OLSON
moving parts. Another phase of acoustical en-
gineering involves methods for reducing the References
transmission of sound through the walls, floors, Federal Register, Saturday, May 29, 1971, Vol. 36,
ceiling and partitions in all manner of build- No. 105, Part II, Department of Labor, "Occupational
ings or enclosure by the use of construction Safety and Health Administration." Article 1910.95,
and materials based on fundamental acoustical "Occupational Noise Exposure."
principles.
Goodfriend, Lewis S., "Noise Pollution," CRC Scien-
The noise in typical environments and noise
tific Publications, Cleveland, Ohio, 1972.
produced by various sources are given in Table
Harris, G. M., "Handbook of Noise Control," New
I. York McGraw-Hill Book Company, 1957.
The masking effect produced by noise reduces
the intelligibility of speech. For example, if the Rettinger, M., "Vol. II Noise Control," New York,
speaker and listener are separated by 5 feet, Chemical Publishing Co., 1977.
the levels of noise that will barely permit reli-
able word intelligibility are 51 dB for normal Cross-references: ACOUSTICS; ARCHITECTURAL
conversation, 57 dB for raised speech, 63 dB for ACOUSTICS; HEARING; MEASUREMENTS, PRIN·
very loud speech, and 69 dB for shouting. CIPLES OF; MUSICAL SOUND.
A person subjected to high noise levels for
long periods of time may suffer considerable NUCLEAR INSTRUMENTS
impairment of hearing. The use of ear protectors
Nuclear instruments are detectors of ionizing
radiation and associated apparatus for amplifi-
cation and analysis of the signals. Nuclear in-
T ABLE I. NOISE LEVELS FOR VARIOUS struments are used in measuring the activity
SOURCES AND LOCA TIONS a of a radioactive sample and identifying radio-
isotopes by their characteristic radiations, in
Source or Description of Noise Noise Level (dB) nuclear physics experiments with particle ac-
celerators, control of nuclear reactors, dosim-
Threshold of pain 130 etry for radiation safety and cancer radio-
Hammer blows on steel plate 2 ft 114 therapy, and for medical diagnosis in radiology
Riveter 35ft 97 and nuclear medicine.
Factory 78 Ionizing radiation includes energetic charged
Busy street traffic 68 particles such as electrons, protons, alpha par-
Large office 65 ticles, and fission fragments; uncharged par-
Ordinary conversation 3 ft 65 ticles (neutrons); and short-wavelength elec-
Large store 63 tromagnetic radiation (x-rays and gamma
Factory office 63 rays). The charged particles ionize directly,
Medium store 62 while neutrons must first undergo a nuclear
Restaurant 60 reaction that produces charged particles. X-rays
Residential street 58 and gamma rays interact primarily by the photo-
Medium office 58 electric effect, Compton scattering, or pair pro-
Garage 55 duction, which generate secondary electrons.
Small store 52 It is interesting that a single nuclear particle or
Theatre 42 quantum of electromagnetic radiation can be
Hotel 42 detected, and in many cases its energy and
Apartment 42 time-of-arrival measured.
House, large city 40 Detection depends on interaction of the
House, country 30 radiation with the detecting medium, and trans-
Average whisper 4 ft 20 fer of energy by ionization and excitation. The
Quiet whisper 5 ft 10 signal-generation process may involve genera-
Rustle of leaves in gentle breeze 10 tion of many electrons and ions (or holes in
Threshold of hearing 0 semiconductors) and the movement and collec-
tion of these charges; emission of light (lumi-
aOlson, Harry F., "Acoustical Engineering," p. 256, nescence); induction of chemical changes, or
New York, Van Nostrand Reinhold, 1957. heating.
807 NUCLEAR INSTRUMENTS

Gas ionization detectors include the ion- rent across the insulators. The applied dc po-
ization chamber, proportional counter, and tential Vo may be a few hundred to a couple
Geiger-MUller (GM) counter. Fig. I illustrates thousand volts. Normally the gas is an insulator,
some general features of these detectors. The no current flows, and a steady electric field is
detecting medium is a gas such as argon (or a established between anode and cathode. When
special mixture for the GM counter), often at ionizing radiation interacts in the gas or elec-
or below atmospheric pressure but sometimes trode, many low-energy ions and electrons are
at a few atmospheres pressure for increased in- released in the gas. The energy required to gen-
teraction efficiency. The gas is contained in a erate an ion-electron pair, W, depends on the
vessel fitted with high-quality insulated elec- gas but is typically around 30 electron volts.
trodes in either parallel plate or coaxial cylinder Thus a 3-MeV alpha particle would release
geometry as shown. A grounded guard elec- 100,000 ions and electrons. These charges drift
trode may be added to intercept leakage cur- to the oppositely charged electrode under the

Guard ring

Collector elect rode

(a)

(b)

FIG. 1. General features of gas-ionization detectors: (a) parallel-plate


ionization chamber with guard ring to define field and intercept leakage
current; (b) cylindrical proportional or GM counter. (From Profio, A.
Edward, "Experimental Reactor Physics," New York, Wiley, 1976, p. 244;
reproduced in Profio, A. E., "Radiation Shielding and Dosimetry," New
York, Wiley, 1979, p. 266.)
NUCLEAR INSTRUMENTS 808

influence of the electric field. This motion con- very long, the charge is integrated and read out
stitutes a current which can be amplified, or the at the end of the irradiation. The pulse ioniza-
voltage drop across the load resistor R L can be tion chamber has been replaced by the semicon-
amplified and measured. Eventually (microsec- ductor detector except where radiation damage
onds for electrons, milliseconds for ions) the is a consideration. The mean-level and integrat-
charges are collected on the stray capacitance ing ionization chambers are commonly used
Cs, until drained off through RL. for radiation monitoring and for reactor con-
If the time constant RLC s is small compared trol. The detector is made sensitive to neutrons
to the drift time of the electron or ion, that by coating the wall with boron or boron-IO, or
component of the signal voltage will not reach a fissionable material such as uranium-235, or
full amplitude. Furthermore, a detailed analysis by filling the chamber with boron trifluoride
shows that the contribution of each component gas.
depends on the distance from ionization point The proportional counter and GM counter
to electrode. Only if both electrons and ions are differ from the ionization chamber in that the
collected does the charge and corresponding charge is increased by gas multiplication. The
voltage reflect the energy deposited by ioniza- amount of gas multiplication is a function of
tion in the gas. For simple counting without 8../p, the ratio of the electric field strength to
energy analysis, only the fast (electron) com- the gas pressure. The ionization chamber op-
ponent is needed. If the time constant is long erates at low 8../p, where 8.. is large enough to
compared to the mean interval between ioniz- separate the ions and electrons before they can
ing nuclear particles, the detector will not recombine, but there is no gas multiplication.
resolve individual pulses of charge, but the Thus the ionization chamber operates in region
current will indicate the mean rate of detection 2 in Fig. 2, which plots the number of charges
of the nuclear radiation. If the time constant is collected as a function of applied voltage, for

4 5 6

10 7

"0 106
~
u
~
0
u
~ 105
c:
g
u
~
w 10'

10 3

102

10

o 200 400 600 800 1000 1200 1400


Vo (volts)

FIG. 2. Number of electrons collected as function of applied voltage for energy de-
positions EI = WN I and E2 = WN2 . Region 2 corresponds to ionization chamber,
region 3 to proportional counter, region 5 to GM counter. (From Profio, A. E.,
"Radiation Shielding and Dosimetry," New York, Wiley, 1979, p. 267.)
809 NUCLEAR INSTRUMENTS

two different amounts of primary ionization, semiconductor, usually silicon or germanium.


NJ = 10 3 and N2 = 10. At medium 8./p and (Newer materials such as cadmium telluride and
voltage, the electrons are accelerated suffi- mercuric iodide are under development.) A
ciently between collisions with neutral gas true solid insulator cannot be used, because the
atoms to release more electrons on impact. charges (electrons and holes) produced by ion-
These electrons can in turn release more elec- ization would not move in the applied electric
trons, for an overall typical gain or multiplica- field and generate a signal. On the other hand,
tion factor M of 10-1000. If M is not too large, the device must have high resistivity or the
gain is independent of the initial ionization, noise would be too great from the current of
hence the output signal charge is proportional thermally released electrons. These require-
to the initial ionization and the detector can ments have been met by: (a) doped silicon op-
be used for energy analysis. This corresponds erated as a reverse-biased p-n junction diode
to region 3 in Fig. 2, the region of the propor- at room temperature; (b) p-type silicion or
tional counter. (Actually a proportional counter germanium compensated by drifting an electron
may also have a different gas filling, to stabilize donor, lithium, and operated at low tempera-
the gas multiplication.) In order to achieve the ture as a reverse-biased p-i-n junction diode;
required 8./p at reasonable applied voltage, it is (c) ultra-high-purity germanium operated at low
standard to make the anode a small-diameter temperature as a reverse-biased p-i-n diode; and
wire, because 8. is inversely proportional to (d) mercuric iodide and other room tempera-
radius in cylindrical geometry. Most of the mul- ture diodes. Semiconductor detectors generally
tiplication occurs very close to the anode and have faster response times than gas detectors
collecting the electron component alone is suf- (order of tens of nanoseconds), and higher ef-
ficient. The low-noise gain afforded by gas mul- ficiency because of their density (2.33 g/cm 3
tiplication makes the proportional counter suit- for Si, 5.33 g/cm 3 for Ge). Their biggest ad-
able for detection of low energy x-rays and vantage over other types of detector is the low
electrons. Gain is usually not needed for alpha energy required per hole-electron pair (3.6 eV
particles or fission fragments, and detection for Si, 2.96 eV for Ge), resulting in a larger
efficiency for gamma rays is low although ef- signal amplitude and better energy resolution
ficiencies of a few percent can be obtained with because of smaller statistical fluctuations in the
walls of a dense and high atomic number ma- charge collected. Their main disadvantages are
terial such as bismuth. Neutrons can be detected sensitivity to radiation damage from heavy
by filling the tube with BF3 or helium-3. charged particles or fast neutrons, the relatively
GM counters operate at high 8./p and the small sensitive volumes available, and cost in
gas multiplication may be 10 6 or more, corre- the larger sizes.
sponding to region 5 of Fig. 2. Multiple gas dis- A common detector for alpha particles and
charge avalanches are created and propagated other charged particles is the silicon surface
by ultraviolet photons as well as electrons. The barrier detector shown in Fig. 3(c). This is a
Geiger discharge can be initiated by a single pri- type of p-n junction diode, where the junction
mary ionization, and the amplitude of the re- occurs between the body of n-type (electron-
sulting pulse is no longer proportional to the donor-doped) silicon and a very thin layer of
initial ionization by a nuclear particle: all pulses p-type (electron-accept or-doped) material pro-
have the same amplitude, at a given applied duced by controlled oxidation of the surface.
Voltage. In order to terminate or "quench" the The surface is protected by a thin evaporated
discharge, the GM tube is filled with a mixture layer of gold, but even so the surface barrier
of gases, including either an organic quencher detector is sensitive to light and must be op-
such as ethyl alcohol or ethyl formate, or a erated in the dark. When the diode is reverse
halogen (chlorine or bromine). Quenchers sup- biased (positive polarity to the n-type silicon
press multiple discharges otherwise started by contact), by a hundred volts or so, the ther-
positive ions hitting the cathode, by absorbing mally released charge carriers are swept out and
energy in dissociation of the molecule. An or- a "depletion" region is left which can sustain a
ganic quencher is consumed in some 10 9 counts, large electric field. The depletion region corre-
but halogens are reconstituted and the tube sponds to the sensitive volume of the detector;
lifetime is much longer. Because of the hun- it may be on the order of one cm 2 in area and
dreds of microseconds for the ions to be col- a few hundred micrometers thick. This thick-
lected and the discharge terminated, the GM ness is sufficient to stop alpha particles of a few
counter has a correspondingly long dead time MeV, so their energy can be measured. A small
and is limited to rather low count rates (hun- correction is needed for the loss of energy in
dreds of counts per second or less). With its the "window" or dead layer of gold and p-type
large internal gain, electronics can be simple material. Similar detectors are made with ion-
and cheap, and the GM counter is still used for implanted or diffused junctions. The depletion
simple counting of beta particles, and for mon- region can be made to extend entirely through
itoring gamma rays fields at low detection a detector 1-2 mm thick, but this is still too
efficiency. small for stopping energetic electrons and ef-
A semiconductor detector is similar to a pulse ficiency for gamma rays is poor.
ionization chamber with the gas replaced by a The maximum thickness of p-n junction de-
NUCLEAR INSTRUMENTS 810

pletion regions is limited by voltage breakdown. are suitable for detection and spectroscopy of
The technique of lithium drifting has been electrons, where the low atomic number of sili-
developed to make a compensated or quasi- con minimizes backscattering. Si(Li) detectors
intrinsic resistivity region up to some 10 mm are also used for low energy x-ray and gamma
thick. One starts by thermally diffusing some ray spectroscopy (below 30 keY). Figure 3(b)
lithium into the surface of a crystal of rather illustrates the coaxial design typical of lithium-
pure but p-type silicon or germanium. The p-n drifted germanium, Ge(Li), detectors where the
junction thus formed is reverse biased and the compensated region may have a volume of 30-
small lithium ions caused to drift into the bulk 150 cm 3 • The volume, density, moderately high
of the crystal, at elevated temperature, by the atomic number of germanium (Z = 32), and
applied electric field. After some days or weeks, most of all the excellent energy resolution
nearly exact compensation is achieved auto- (about 0.1%) of germanium detectors have
matically because the space charge remaining made them the instrument of choice for gamma
with imperfect compensation is sufficient to ray spectroscopy, although efficiency is less
drive the lithium ions until the space charge is than can be obtained with the sodium iodide
canceled. Figure 3(a) shows the planar geom- scintillation detector. The Si(Li) and Ge(Li) de-
etry typical of a lithium-drifted silicon diode, tectors are always operated at low temperature,
designated Si(Li), where the compensated re- usually at liquid nitrogen temperature (77 K) to
gion may be a few mm thick. Such detectors reduce thermal noise. A SHLi) detector is usu-

~ < Ie' ",'"..,

(a)

Ox idized surface

Gold electrode n- type silicon

Nuclear Aluminum electrode


rad ia tion -----;~

..
(b) (c)

FIG. 3. Types of semiconductor detectors; (a) planar lithium-drifted; (b) co-


axial cylindrical lithium-drifted; (c) surface barrier. (From Profio, A. Edward,
"Experimental Reactor Physics," New York, Wiley, 1976, p. 220; reproduced in
Profio, A. E., "Radiation Shielding and Dosimetry," New York, Wiley, 1979, p.
261.)
811 NUCLEAR INSTRUMENTS

ally stored at liquid nitrogen temperature, too, lowed to warm up occasionally. It is still usu-
but can survive an occasional warmup. The ally stored at liquid nitrogen temperature to
Ge(Li) detector must always be stored and used minimize contamination of the surface by va-
at liquid nitrogen temperature to avoid redis- pors, and is operated at low temperature to
tribution of the lithium. One warmup is suffi- minimize noise. The crystal is housed in a vac-
cient to ruin a Ge(Li) detector, and while it can uum cryostat, on a good heat conductor or
be redrifted, energy resolution is likely to be "cold finger" dipping into a reservoir of liquid
worse. nitrogen in a dewar. Weekly replenishment is
Figure 4 illustrates the electrical properties of usually required even with a 3D-liter dewar. The
a lithium-drifted p-i-n junction detector, as in- trouble and expense of liquid nitrogen and the
fluenced by the Li concentration and starting bulky dewar and cryostat has led to interest in
material. As shown , after compensation there is materials such as cadmium telluride and mercu-
no net space charge density within the so-called ric iodide, which have large enough band gaps
intrinsic region, but there is a potential gradient to allow operation at room temperature. Den-
and a unif()rm electric field (in planar geometry) sity and their high-atomic-number constituents
which moves and collects the electrons and holes make them candidates for x-ray and gamma ray
released in this region by ionizing radiation. spectroscopy. However, trapping centers and
The ultra-high-purity germanium detector is crystal growing difficulties have so far limited
very expensive, but has the advantage that there the detectors to about I mm thickness if good
is no lithium to redistribute and it can be al- energy resolution is desired. With careful atten-

lithium -drifted p - type


intrinsic region region
To POS' !' e r-------~.~--------~\~
( 1.>" .. vo l ! aq~

Load /

I
I
Concen tr allon ~ LI

Or i~ . -,>-+~ _______ __--.._--I,:, - " ,___


~

f - typ c '1
Imou r i tv I
I

'~I V
I I

'~
:i_
I I

'~
FIG. 4. Basic configuration of a lithium-drifted p-i-n junction detector.
Also shown are the corresponding profiles for impurity concentration,
charge density p , electric potential <p, and electric field th. (From Knoll,
G. F., "Radiation Detection and Measurement ," New York, Wiley, 1979,
p.417.)
NUCLEAR INSTRUMENTS 812

Reflector and light shield Photoelectron

Electron. multiplier Light shield

,, ,
,,, ,,,
/ / /
/ / /

,,
/ / /

,,
/ / /
/ /

,,,
/

,,
/ / /
/ / /

,,, ,,
/ / /
/ / /

,
/ / /
/ / /
/ / /

'(
Particle track Magnetic shield

Photocathode
FIG. 5. Elements of a scintillation detector. (From Profio, A. Edward, "Experimental
Reactor Physics," New York, Wiley, 1976, p. 274; reproduced in Profio, A. E., "Radia-
tion Shielding and Dosimetry," New York, Wiley, 1979, p. 270.)

tion to minimizing noise sources, resolution carbon-14) when the radioactive material is dis-
may approach that of the Si and Ge detectors. solved in the scintillator. NE213 is a proprietary
A scintillation detector consists of a trans- formulation which is useful for separation of
parent substance that fluoresces under ioniza- gamma ray and neutron induced counts by
tion and excitation by a charged particle, op- pulse shape discrimination (PSD). Plastic scintil-
tically coupled to a photomultiplier tube that lators are made of scintillating chemicals in a
detects and amplifies the weak flash of light, as solid solution with polyvinyltoluene (PVT) or
shown in Fig. 5. The photomultiplier is shielded other polymer. Because of their low atomic
from outside light and from stray magnetic number and density, organic scintillators are
fields that could deflect the electrons in the not very good for gamma rays.
multiplier. The scintillator is usually coated The inorganic crystal scintillators, especially
with a white reflective material and contained sodium iodide doped with thallium, NaI(Tl),
in a light-tight can except for the window in are well suited for detection and spectroscopy
contact with the photomultiplier. of gamma rays because of their high effective
Table I summarizes properties of commonly atomic number and density. A drawback to
used scintillators. Organic crystals such as an- NaI(Tl) is that it is brittle and hygroscopic, and
thracene may be used for detection of elec- must be protected from shock and moisture.
trons, but are fragile and available only in thin Fluorescence decay time constant is longer than
sections. Organic liquid scintillators are used for for the organic scintillators (significant in fast
detection and spectroscopy of fast neutrons by timing, coincidence, or high count rate applica-
means of the recoil protons generated by elastic tions) but light output is better. Until the ad-
scattering on hydrogen, and for counting low- vent of the germanium semiconductor detector,
energy beta particles (e.g., from tritium and NaI(Tl) was widely used for gamma ray spec-

TABLE 1. PROPERTIES OF SCINTILLATORS.


Relative Fluor.
Scintillator, Density Output Decay
Type Composition (g/cm 3) (electron) (ns) Remarks

Organic crystal anthracene, C14 H 1o 1.25 1.00 30 68 eV/photon


Organic liquid NE213, CH1.21 0.88 0.78 4,100 PSD
Plastic NE102, CH1.1O 1.03 0.65 4 PVTbase
Inorganic NaI + 0.1% Tl 3.67 2.10 230 hygroscopic
Crystal Bi4 Ge 30 12 7.13 0.17 300 HighZ
ZnS + 0.01% Ag 4.09 3.00 200 poly crystal
Glass NE908, glass + 8% Li 2.67 0.20 5,75 0.1% Ce203
Gas xenon (1 atm) 0.006 1.5 2 UVemission
813 NUCLEAR INSTRUMENTS

troscopy, even though its energy resolution is ment, the halide grains are reduced to metallic
not very good (typically 8% at 662 keY). Some silver, darkening the film. The process may be
32 eV is required to generate a light photon, considered as chemical amplification. The dark-
but only about 10% of the photons generate a ening must be calibrated against known doses
usable electron in the photomultiplier, and with measured by another instrument, and the re-
other inefficiencies, the energy per electron sponse at low energies may be much greater
may approach 1000 eV. Bismuth germanate than in tissue. Other chemical dosimeters exist,
gives much less light than NaI(Tl), but because but are limited to high dose applications. Radia-
of its high density and atomic number, is tion also causes heating, which can be measured
even more efficient in detecting gamma rays in by temperature rise in a calorimeter, but heat-
a small volume. ing is seldom used for radiation dosimetry be-
Silver-activated zinc sulfide, ZnS (Ag), is use- cause more sensitive instruments are available.
ful for counting alpha particles but is available Absorbed dose, defined as energy absorbed
only in poly crystalline form and light scatter- by ionization per unit mass of material, is nor-
ing limits its use to thin layers. The cerium- mally measured by an integrating or mean-level
activated, lithium-loaded silicate glass scintilla- type of cylindrical ionization chamber. The
tors such as the proprietary NE908 are used for walls are made of an electrically conducting
slow neutron detection, but have the drawback plastic whose elemental composition is close to
of also being sensitive to gamma rays. The that of average soft tissue (C s H40 0 18 N), and
xenon gas scintillator has a fast fluorescence the wall is made slightly thicker than the range
decay time, but emits in the ultraviolet requir- of the most energetic electrons involved. The
ing a special photomultiplier tube, and being a gas filling also has soft tissue elemental compo-
gas is not very efficient in stopping radiation. sition, and the gas pressure and electrode spac-
The photomultiplier is a vacuum tube, usu- ing are chosen so that the dimensions are small
ally with a thin, semitransparent layer deposited compared to the range of the electrons. Under
on the inside of a flat glass faceplate or window these conditions, the energy absorbed per kg of
at one end. The photocathode itself may be wall material (and tissue) is equal to the energy
oxidized cesium antimonide, or a multialkali absorbed per kg of gas. The latter is obtained
(K-Cs-Sb). The low-energy photoelectrons re- from the measured charge, the known energy
leased from the cathode are collected, accel- per ionization, and the mass of the gas. The
erated, and focused on the first electrode (dy- special units of absorbed dose are the gray
node) of the electron multiplier by an applied (J Gy = 1.0 J /kg) and the rad (1 rad = 0.0 I J /kg).
electric field. The dynode is treated so that two Depending on dose rate and mass of gas, the
or more electrons are generated per incident current may be 10-a A or less, and a low-noise,
electron, by secondary emission. The electrons high-gain, high-input-impedance dc amplifier
from the first dynode are accelerated and fo- (electrometer) is required to drive a meter.
cused on the second dynode, and so forth for Neutrons, protons and alpha particles are
perhaps 9 to 14 dynode stages. Current ampli- more damaging per unit of absorbed dose than
fication can easily exceed 10 5 . The required electrons, x-rays and gamma rays. This is taken
100 volts or so per stage is usually supplied into account by defining the dose equivalent
from a resistive voltage divider connected to a (measured in sievert or rem) as the absorbed
high voltage supply. The supply must be very dose times a dimensionless quality factor that
well regulated because gain is very sensitive to varies from 1 (for gamma rays) to 10 (for fast
voltage. The photomultiplier is selected for size neutrons). The dose equivalent in a mixed radia-
of the scintillator, spectral response (e.g., to tion field can be derived from a special ioniza-
match the blue emission of NaI(Tl», gain re- tion chamber that gives information on the
quired hence number of stages, gain stability microscopic distribution of ionizing events
for spectroscopy, and pulse risetime for fast along the particle path, the so called LET
timing applications. (linear energy transfer). Particles with higher
Another type of detector emitting light is LET have larger quality factors associated with
the thermoluminescent dosimeter (TLD). Cer- them.
tain materials such as LiF or CaF2 doped with Reactor instrumentation includes several neu-
Mn, trap electrons and holes generated by tron-sensitive ionization chambers and associ-
ionization. Upon subsequent warming the elec- ated dc amplifiers, which can be calibrated in
trons and holes recombine with emission of terms of reactor power. In addition to linear,
light, which is detected by a photomultiplier range-switched amplifiers, it is convenient to
tube. TLD's are widely used for personnel dose have logarithmic amplifiers and meters display-
monitoring, although the materials do not re- ing the power over many decades during reactor
spond exactly as tissue does, especially at low startup. Also, the output of the logarithmic
energies. amplifier can be differentiated to give a signal
X-ray film and other silver halide emulsions inversely proportional to the time constant
are used for imaging and for personnel dose (called reactor period) of the exponential in-
monitoring. The ionizing radiation activates crease in power. Electronic trips are provided
centers in the grains, much as a latent image is to shutdown the reactor, by inserting control
formed by light. Upon later chemical develop- rods, if the reactor power gets too high or the
NUCLEAR INSTRUMENTS 814

r- [~u-;;t
rdte
,--_ _ _ _ _ _--, : meter

Integral I
discriminator I
or differential
discriminator
(SeA)

FIG. 6. Elements of a typical signal chain for pulse counting. (From Knoll, G. F. "Radiation
Detection and Measurement," New York, Wiley, 1979, p. 660.)

reactor period becomes too short (rapid in- Cross-references: LUMINESCENCE; MEASURE-
crease in power). MENTS, PRINCIPLES OF; NUCLEAR RADIATION;
Semiconductor and scintillation detectors and RADIATION, IONIZING, BASIC INTERACTIONS;
the proportional counter are operated as pulse RADIOACTIVITY; SPARK AND BUBBLE CHAM-
instruments. Figure 6 is a block diagram of a BERS.
typical pulse counting system. The small signal
pulses from the detector are amplified in the
preamplifier, transmitted by coaxial cable to
the main linear amplifier for further amplifica- NUCLEAR RADIATION
tion and pulse shaping, and then to the integral
discriminator or single channel pulse amplitude Nuclear radiation results from the transitions of
analyzer (SCA). The integral discriminator gen- atomic nuclei. The two chief types of transition
erates an output pulse only if the input pulse in natural radioactivity are those in which the
exceeds a set amplitude, thus rejecting noise number of constituent particles of a given nu-
and low-amplitude background. An SCA also cleus (nuclide) is changed by the emission of
has an upper-level discriminator and anticoinci- one or more particles such as an alpha or beta
dence circuit, such that pulse amplitude must particle and those in which there is a rearrange-
lie between the discriminator settings for an ment of the particles of a given nuclide such
output pulse to be generated. The number of that the nuclide passes to a lower energy state
pulses in a given timed interval is accumulated with the emission of high frequency electomag-
in an electronic counter (also called a scaler), or netic radiation called gamma rays.
the average count rate derived and displayed by The study of nuclear radiation together with
a count rate meter. the results of high energy bombardment of
Although pulse amplitude hence energy anal- nuclei has become our most important source
ysis can be performed by sequentially stepping of information about the structure of atomic
the lower discriminator of an SCA, it is prefer- nuclei and the fundamental nature of matter.
able to use a multichannel analyzer (MCA) in- And such studies now extend to the internal
stead. An MCA consists of an analog-to-digital processes of our sun and the stars as well as our
converter for randomly arriving pulses, control more immediate problems concerned with
unit, memory, oscilloscope display of the spec- emission from nuclear reactors and nuclear
trum, and usually a printer. The ADC digitizes weaponry.
the pulse amplitude and the control unit adds a The chief sources of nuclear radiation are;
count to the appropriate address or channel in (1) naturally radioactive substances; (2) those
memory, usually for 1024 channels or 512 substances in which radioactivity has been
channels. induced; (3) emissions resulting from collisions
For further information consult the references. with cosmic rays or by particles from high
energy accelerators; (4) nuclear reactors and
A. EDWARD PROFlO wea ponry; and (5) nuclear processes in the
sun and stars. There is even a small amount of
radiation from spontaneous fission of uranium
References in the earth.
In 1896 Henri Becquerel found that a new
Knoll, G. F., "Radiation Detection and Measurement," and penetrating type of radiation from uranium
New York, John Wiley & Sons, 1979. had darkened a photographic plate and he called
Profio, A. E., "Radiation Shielding and Dosimetry," the new phenomenon radioactivity. His associ-
New York, Wiley-Interscience, John Wiley & Sons, ates Marie Curie and husband Pierre after
1979. painstaking search found other far more in-
815 NUCLEAR RADIATION

tensely radioactive substances including radium Alpha and beta rays, because of their particle
and polonium. Soon various radioactive sub- character, have a more sharply defined range
stances were arranged in series showing how than gamma rays, and a significant fraction of
one type would decay into a different but the gamma radiation may pass through a num-
related type. ber of centimeters of metal shield. This is be-
Early studies of rays from different substances cause particles lose speed and kinetic energy
revealed that an alpha (0:) particle was the in both ionizing and non-ionizing collisions
doubly charged nucleus of a helium atom; beta with other particles, whereas a gamma-ray-
({3) particles were identified as the then newly photon always travels at the same speed, and
discovered negative electrons; and gamma ('y) the beam of photons is usually weakened by
rays were found to be similar to x-rays but some process such as scattering or absorption
could be even more penetrating. Alpha rays of photons. And the method of magnetic sepa-
were emitted with speeds up to fs the speed, ration and the use of various absorbing materials
c, of light and had the unusual property of are still basic in our far more complicated ex-
producing scintillations on a fluorescent screen perimental procedures today.
by which they could be counted. Beta rays An alpha ray can produce between 20,000
could have a range of speeds up to a maximum and 70,000 ion pairs per centimeter of path in
speed of 0.96c, a hitherto almost unbelievable a gas depending on the initial energy of the ray,
speed for any particle. Beta and gamma rays maximum ionization being reached near the
could be readily detected by the ionization end of the path. A beta rayon the other hand
they produce in a gas. The rest-masses in kilo- may only produce 200 ion pairs or less per
grams of the two particles and their energy centimeter. Thus such rays lose energy rapidly
equivalents in million of electron volts (MeV) in passing through a gas, and rays of a particular
are energy have a rather sharply defined range (see
IONIZATION) .
Mo(alphaparticle)=6.64S x 1O-27 kg The range of alpha particles in air at 76 cm
mercury pressure (1s0e) varies from 2.7 cm for
= 3727.2 MeV alpha particles from uranium to 8.62 cm for al-
mo (beta particle) = 9.109 x 10-31 kg pha particles from thorium C' (P 0 212). The for-
mer are emitted with energies of 4.2 MeV while
=0.511 MeV the latter, the most energetic of any from a nat-
urally radioactive substance, are emitted with
The three types of rays could be readily energies of 8.6 MeV.
separated by means of a magnetic field (Fig. 1). How these radiations could be emitted from
In crossing the field the alpha rays are deflected an atom was unknown until 1911 , when Ruther-
in a direction that indicates they carry a posi- ford, in his famous alpha-ray scattering experi-
tive charge. The beta rays are deflected in the ment, deduced that nearly all of the mass of an
opposite direction and the gamma rays are not atom resides in a tiny, positively charged central
deflected at all as they have no charge and are region called the nucleus having a diameter in
of the nature of light waves . meters (m) of the order of 10- 14 m whereas the
The penetrating power of any nuclear radia- atom itself with its encircling electrons was of
tion depends on the nature of the ray, the energy the order of 10- 10 m. Newton's idea that atoms
of its emission and the nature of the absorption are solid little bits of mass that last for ever was
process. In general it may be said that alpha overthrown and the study of the nucleus as a
rays usually are stopped by a few sheets of paper complex, highly organized system began.
and beta rays by a few millimeters of aluminum. In 192 9, Rosen blum discovered the fine struc-
ture of alpha rays. That is, alpha rays from a
single type of nuclide may not all have exactly
the same energy and range but often consist of
two or more groups with slightly different but
sharply defined ranges and consequently differ-
ent initial energies. This led to the ultimate rec-
ognition of the existence of different energy
levels in the nucleus.
Now the origin of gamma radiation could be
explained. When an alpha particle is emitted as
for instance from a radium nucleus (Ra 88) it
loses four units of mass and two units of charge
and forms a new nucleus in this instance emana-
FIG. 1. Separation of alpha (0<), beta (13) and gamma tion (Em 86). If the alpha particle does not
(-y) rays from a radioactive source (S) by a magnetic possess enough energy to leave the product
field directed into the paper. This indicates alpha rays nucleus in its lowest energy state (ground state)
possess a positive electric charge, beta rays a negative then it can return to the ground state by emit-
charge, and that gamma rays are uncharged. ting a gamma ray (Fig. 2) of the required energy.
NUCLEAR RADIA nON 816

Ra-88 ground state that other antiparticles could also be involved


in nuclear processes.
When induced radioactivity was discovered,
only some forty naturally radioactive substances
were known and the only known types of nu-
clear radiation were the alpha (a), beta ({3-),
and gamma (1) rays. To these now had to be
Excited state
added the positron ({3+).
In 1933 Pauli suggested that still another new
')'-ray 0.2 MeV particle of zero charge might be required to
explain how a neutron might decay to a proton
Em-86 ground state in the nucleus. The following year Fermi in-
corporated this idea into his theory of beta
FIG. 2. Energy levels showing origin of gamma (-y) emission and assumed that a little neutral
rays from radium (Ra-88) when emission of alpha ray particle which he called the neutrino (v) is
(a d leaves product nucleus in an excited energy state also emitted. Since the beta particle is an
above its ground state. It then returns to the ground electron and the electron had been found to
state by emission of a gamma ray. By comparison, spin on its axis like a top the new particle
emission of a2 carries off enough energy for the nu- seemed requited to possess spin in order not
cleus to go directly to the ground state. to violate conservation of angular momentum
when a beta particle is emitted.
The new particle also seemed necessary to
The discovery of the neutron by Chadwick in prevent violation of the law of conservation of
1932 made it possible to describe all atomic energy. Unlike the more sharply defined ener-
nuclides as simple groupings of protons and gies of alpha particles the beta rays from a single
neutrons. The number of protons would be the type of nuclide have a wide distribution of
same as the atomic number in order to give the ranges and energies, the distribution being
nuclide its required charge, and the number of approximately continuous up to a certain
neutrons would be those required to give the maximum or "end-point" characteristic of a
nuclide its characteristic mass. Heisenberg then particular type of nuclide. It was assumed that
suggested that the proton and neutron be called whenever a beta particle carried off less than its
nucleons and that they be considered different characteristic end-point energy the missing
energy states of the same particle such that in energy must also be carried off by the neutrino.
certain nuclear changes each could decay into This gave the neutrino the properties of spin
the other. and energy but since at that time there appeared
Since there are not beta particles (electrons) to be no measurable loss of mass it was assumed
as such in the nucleus and their existence in so that the new particle had zero mass along with
small a volume would not be in accord with zero charge.
quantum mechanics, it was assumed that the The great difficulty in detecting a particle so
beta particle was produced at the moment of elusive and frustrating that most would pass
emission by conversion of a neutron to a through the earth without hitting anything
proton. Useful as the proton-neutron picture delayed experimental confirmation until 1956
of the nucleus was, it became increasingly when Reines, Cowan, and co-workers at Los
evident that the nucleus was not a single sys- Alamos with an intense beam of neutrinos
tem and that the nuclear forces and modes of (later found to be antineutrinos), (v) finally
excitation must be investigated. Neither protons achieved success. The only difference between
nor neutrons are ever emitted by a radioactive the neutrino and antineutrino is the direction
nuclide, but certain nuclides emit alpha particles of spin.
and each alpha particle consists of two protons In the meantime the discovery of two new
and two neutrons closely bound together. It nuclear particles revealed further complexities
must be assumed that only such a grouping can in nuclear processes. They were called mesons
gain enough energy to escape from an unstable because their masses were intermediate between
type of nuclide. the masses of the electron and the proton. The
With the discovery of what has been called first one, the Il meson or muon (Il) was dis-
"artificial" or induced radioactivity by Irene covered in cloud-chamber tracks by Anderson
Curie-Joliot and her husband Frederick Joliot and Neddermeyer in 1937 while studying effects
in 1933, it was found that positive electron of cosmic rays. The new particle was found to
(positron) emission may occur in nuclides whose have a mass 137 times that of an electron and
instability results from the nucleus possessing it could have either a positive (Il+) or negative
"too much charge for its mass." This is, of (Il-) electronic charge. After the war a closely
course, only another form of beta emission rep- related but still more massive particle the pi
resented as {3+' whereas ordinary electron emis- meson or pion (1T) was detected by the tracks
sion is represented as {3-. Since the positron is it left in nuclear emulsion plates sent aloft in
the antiparticle of the electron it was evident balloons. It had a mass 273 times that of the
817 NUCLEAR RADIATION

electron and could have either a positive or therefore concerned with overcoming these
negative charge, and could even be neutral, as basic nuclear forces.
found later. Both of these new particles are Since hadrons are believed to be composed
unstable and decay with a mean life of 2.2 X of quarks, and since atoms in general can be
10-6 sec for the muon and 2.5 X 10-8 sec described as combinations of leptons and had-
for the charged pion, the neutral pion being rons, then according to one theory, leptons and
somewhat shorter lived. Since these particles quarks may be the ultimate particles from which
are found in the atmosphere as the result of all matter is made. Quarks are not radiated, be-
collision by incoming cosmic rays they may ing bound too strongly to escape (see QUARKS).
be classed as secondary nuclear radiation. They When the muon was discovered it was thought
can now be readily produced by high energy to be the particle predicted by Yukawa that
bombardment in the laboratory along with would hold nucleons together by means of an
many other new particles. exchange force derived from the newly devel-
The close relationship between the electron, oped theory of quantum electrodynamics. But
the muon, and the pion is indicated by the fact muons failed to interact sufficiently with
that pions decay in several ways to form muons nuclei. Than when the pion was discovered
as follows. The positively charged pion 1f+ can it appeared to be the desired particle because
decay to a positive muon. plus a neutrino: it did interact. However, matters soon became
more complicated with the discovery of K
1f+ ..... p.++v mesons, found by Rochester and Butler in 1947
as secondary nuclear radiation in cosmic ray
and the negative pion can decay to a negative experiments. The masses of K particles were
muon plus an antineutrino: found to be a little more than half that of a
1f- ..... p.-+v. proton or neutron, and they could be either
positively or negatively charged or neutral.
The muon of either charge in turn decays to an Along with pions they are our most important
electron of the same charge plus a neutrino (v) particles now classed as mesons. The charged K
and an antineutrino (v): mesons have a mean life of 1.2 X 10-"8 sec
before usually decaying in various ways into
+ + -
p.- ..... e- + v + V. one or more 1f mesons plus a few other particles.
The mean life of the neutral K particle is some-
In 1962, although the two neutrinos seemed what less.
to be the same they were found to not be quite Following the discovery of K mesons came a
identical, and this presented a problem. They great profusion of new particles. Although the
seem to "remember" their parenthood and mean lives of K mesons seem short on our time
react only with their respective closely related scale they actually live an enormously long time
particles. In order to distinguish them we call compared to the many far more evanescent par-
one the electron-neutrino, represented by ve , ticles heavier than a proton that have mean lives
and the other we call the muon-neutrino, repre- as little as 10- 24 or 10- 26 sec. Such particles
sented by vJ.I.. A third neutrino, the tau-neutrino obtained by very high energy bombardment
(vr) is now recognized. Thus, with their anti- may be created with their antiparticles, and
particles, we add six new particles to the list of some may be identified as higher excited states
basic nuclear radiation particles. These particles of other particles.
can now readily be detected in collision pro- The determination of which particles are
cesses or in beams from nuclear reactors. fundamental and which ones are not became
With the increasing complexity of nuclear more complicated and seemed to thwart all
structure had come recognition that two new attempts at a simplified theory of the nature
forces exist that operate only in the confines of matter, until a new development by the
of the nucleus. They are called simply the weak theorist Gell-Mann and others suggested all
force and the strong force and are required to matter may be composed of still smaller basic
explain particle emission. These are now added particles called quarks. Although quarks may
to the well-known electromagnetic force and never be found outside the nucleus they have
the gravitational force of classical physics. The led to much success in theory by showing how
strong force acts to bind nucleons together; it is possible that many observable particles can
it must be charge-independent because it binds have a simple and basic underlying relationship
neutrons to neutrons as well as to protons. The (see QUARKS).
emission of light-weight particles such as the Recent attempts to measure the mass of the
beta particle and the muon (now classed along neutrino have given more positive results than
with neutrinos as leptons), is now recognized earlier ones. The newer experiments now indi-
as connected with the weak force. Those heavy cate the neutrino may possibly have a small
particles such as neutrons and protons that are but measurable mass of a few electron volts in
bound by the strong force are now classed as terms of its energy equivalent. This is to be
hadrons. And the emission of such particles compared with the electron, itself a tiny par-
either spontaneously or by bombardment is ticle, which in the same units has an energy
NUCLEAR RADIATION 818

equivalent mass of 511,000 eV. The neutrino Early Work


mass must be a very small quantity, but if the Rutherford, E., Chadwick, J., and Ellis, C. D., "Radia-
universe is full of neutrinos it could add up to a tions from Radioactive Substances," New York, The
very large amount. Macmillan Co., 1930.
We are continually bathed in the streams of
neutrinos from the sun which result from the Cross-references: ELECTRON; ELEMENTARY PAR·
nuclear processes that sustain the sun's heat, TICLES; FISSION; NEUTRON; NUCLEAR REAC-
and the most noteworthy attempt to measure TIONS; NUCLEAR RADIATION SHIELDING; NU-
some of the billions of billions of neutrinos that CLEAR STRUCTURE; POSITRON; PROTON;
must be radiating from the sun is that of Davis,
using a 100,000 gallon tank of detecting fluid a QUARKS; RADIATION, IONIZING, BASIC INTER-
mile deep in a South Dakota mine. The smaller ACTIONS; RADIOACTIVITY.
than expected result which has so far been
obtained may result from difficulties of detec-
tion or other reasons not yet clear.
Although 90% of the neutrino radiation which NUCLEAR RADIATION SHIELDING
we receive most likely comes from our sun the
number of stars radiating into space could fill Nuclear reactors, based upon the fission of
all of space with neutrino radiation. Such a heavy nuclei, have become an important power
profusion of neutrinos, if they possess appreci- source for the generation of electricity. The
able mass, might even affect our ideas about the great intensity of nuclear radiations produced
expanding universe: it presents the astronomers in the reactors requires radiation shielding.
with another source of mass to consider in Fast neutrons and gamma rays are by far the
figuring if there is enough mass to cause the most penetrating of the radiations produced;
expansion to ever stop and reverse itself. therefore, most shield design is concerned with
Nuclear radiation presents both hazards and reducing their levels by factors of as much
benefits. Radiation from experimental nuclear as 10 15 .
reactors at various centers now furnishes rich A variety of other systems gives rise to
opportunities for further study of the nature shielding problems which are related more or
of matter and the properties of different radia- less closely to those of the fission reactor and
tions. At the same time the hazards of radiation these are treated with many of the same meth-
from nuclear power plants and nuclear warfare ods which are used in reactor shielding. After
are becoming more widely recognized and have discussing these various areas of application,
led to an intense study of such radiation in- the problem of calculating and confirming the
cluding the long-range as well as the short-range design or performance of a radiation shield is
physiological effects. Closely related are the outlined.
dangers resulting from aging and deterioration Areas of Application Fission Reactors. In the
of the materials in a reactor plant itself that are United States and much of the rest of the
continuously exposed to radiation. world, the fission power plants now being in-
The rapid development of nuclear weaponry stalled for electricity generation (or ship pro-
has led not only to the many problems of the pulsion) are based upon the use of water as
physiological effects but to military problems the reactor moderator and coolant. As a shield,
such as the development of a "clean" bomb water has no cracks and it offers simplifications
and to the possible enhancement of neutron following from the fact that it tends to produce
radiation at the expense of explosive power, an equlibrium neutron energy distribution due
or the reverse. With all of this the subject of to the inverse dependence upon energy of the
nuclear radiation has expanded out of the hydrogen neutron cross section. This equilib-
laboratory and has become more and more rium fast-neutron spectrum has allowed the use
deeply involved in our lives. of simple attenuation methods with empirically
derived constants to solve many design prob-
ROGERS D. RUSK lems which do not require much detail in the
answer. The outstanding problem for light-water
References reactors has been the streaming of neutrons in
the cavity outside the reactor pressure vessel,
Feinberg, G., "What Is the World Made Of?" Garden where rather detailed calculations are required.
City, N.Y., Anchor-Doubleday, 1977. A summary of approaches to this problem has
Lapp, R. E., and Andrews, H. L., "Nuclear Radiation recently been made available. l
Physics," Fourth edition, Englewood Cliffs, N.J., Current interest in the development of ad-
Prentice-Hall, 1972. vanced reactor types centers upon the breeders,
Segre, E., "Nuclei and Particles: An Introduction to especially the liquid-metal-cooled fast breeder
Nuclear and Subnuclear Physics," Second edition, reactor (LMFBR) using the 238U to 239pu
Reading, Mass., W. A. Benjamin, 1977. breeding cycle. Breeders are essential if nuclear
Sorensen, 1. A., and Phelps, M. E., "Physics in Nuclear fission is to make a long-term contribution to
Medicine," New York, Grune and Stratton, 1980. our energy needs. The LMFBR operates at high
819 NUCLEAR RADIA nON SHIELDING

temperatures (-500 to 600°C) and with neu- of the shield and require careful study. 5 Valida-
trons of high energy (-100 keV average energy tion of the analytical methods used will be based
for neutrons producing fission). Both condi- upon specifically designed shielding experi-
tions tend to exclude hydrogenous materials ments.
from the vicinity of the reactor, and sodium, Nuclear Weapons Radiations. Nuclear explo-
the liquid-metal coolant usually considered, is sives may produce radiations by both the fission
the dominant medium near the core. The fast- and fusion processes and there are substantial
neutron spectrum does not attain an equilib- overlaps in the radiation-shielding problems
rium after attenuation in any thickness of with the reactors considered above. For the
sodium (to 15 ft) so far as is presently known, protection of military or civilian systems against
and much more detail in the neutron spectra radiations from nuclear weapons, initial atten-
must be predicted for LMFBR shield design. tion must be given to the radiation transport
Fast-neutron streaming through clearance gaps through air from the source to the system.
and sodium-filled pipes, which necessarily sur- As a result of many years of study of this
round or penetrate the LMFBR shields, must basic problem, air-transport methods and re-
be calculated for geometries which become sults are now available in detail with accuracies
quite complex in actual practice. For the (except for problems with the cross sections
coolant pipes, both the sodium and the insula- for the production of secondary gamma rays
tion required by the high temperatures are of by fast neutrons) of the order of 10 to 20
low density and constitute significant gaps in percent: 6
the shield. Specific shielding problems of liquid An overview of the problem of shielding
metal fast breeder reactors have been reviewed military systems against weapons radiations is
by Farinelli and Nicks. 2 given in the multivolume DNA Handbook. 7
Finally, for fission reactors used to propel For the specific consideration of the shielding
or to provide auxiliary power for space vehicles, protection offered by structures against weapons
the weight of the power plant becomes crucial fallout radiation, a comprehensive review has
to the feasibility of the mission. Since the recently been published. 8
radiation shield may account for as much as Accelerators. Radiation of virtually all types
two-thirds of the total power-plant weight, the (including neutrons and gamma rays) and ener-
radiation shield design must be made with the gies can be produced by the many types of
highest attainable accuracy. Efforts are cen- charged-particle accelerators. Over the energy
tered upon methods of optimizing the shield range to -20 MeV there may be quite direct
design with respect to weight. This can now be overlap in the radiation-shielding problems
done directly and accurately insofar as the with reactors or weapons radiations. For higher-
power plant can be considered one-dimensional. energy accelerators additional reactions and
Two-dimensional optimization methods are new particle types such as mesons may be
less developed. . involved. High-energy particles which are slowed
A thorough review of radiation shielding for down (reduced in energy) in the shield even-
fission reactors was published by the U.S. AEC tually pass through or give rise to additional
in 1973. 3 particles in the lower energy range applicable
Fusion Reactors. Potential advantages of to fission and fusion reactors. Thus methods
safety and unlimited fuel supply justify current developed at low energy may be incorporated
efforts to develop controlled thermonuclear as part of the solution of accelerator shielding
reactors (CTR) based upon the fusion of light problems. In any case, the methods of solution
nuclei. The most commonly considered reac- tend to be closely related.
tion of deuterium and tritium produces 14-MeV Space Vehicles. The space environment in-
neutrons which must be absorbed to preve!lt cludes charged particles ranging from the elec-
radiation damage, to breed more tritium as a trons (trapped by the magnetic field of the
fuel, and to gain the energy (heat) which the earth) through protons and alpha particles
fast neutrons carry from the reaction. A major (helium nuclei) to heavier nuclei. The protons
requirement of the radiation-transport calcula- tend to be the most numerous and their inter-
tions for CTR systems is to insure radiation actions the best understood. Nuclear inter-
levels that will not destroy components of the actions of heavy nuclei are little understood
system; e.g., the cyrogenic magnet used to and may constitute a major problem for future
contain the fusion reaction. Radiation-shielding manned interplanetary space travel.
calculations for fusion systems are closely re- All of the charged particles lose energy
lated to the neutronic design of the lithium- dominantly by ionization which creates es-
containing blanket which is used to breed sentially no transport or shielding problems.
tritium. An example of a calculation of this Nuclear reactions do occur, however, at ener-
type is given in Ref. 4 for a Tokomak fusion gies above a few MeV and ignoring these may
reactor, one of the most highly developed types. lead to dosimetry estimates in error by 25
Neutron streaming through necessary penetra- per cent or much more depending upon the
tions of the reactor shield for any type of fusion incident-particle energy spectrum and system
reactor now visualized will limit the attenuation geometry.
NUCLEAR RADIATION SHIELDING 820

The methods employed for space radiation and much-studied distribution in energy of
shielding are largely those also used for accel- neutrQ!ls (-5 MeV/fission total energy release
erator shield design. However, in general, the with E-2 MeV) varies with the isotope under-
requirements in space radiation shielding are going fission and slowly with the energy of
for highly accurate calculations for thin shields. the neutron producing fission. Delayed neu-
For accelerators, less accuracy is usually re- trons, which follow {3-decay of fission frag-
quired but much thicker shields (approaching ments, have an intensity relative to the "prompt"
or surpassing the attenuations of reactor shields) neutrons of the order of I to 2 per cent and
may be required. Space radiation shielding is become important largely for fluid fuel reactors
treated in a comprehensive handbook.9 where they may be emitted beyond part of the
Nature of the Problem Stated generally, the primary shield and thus enhanced in impor-
problem of radiation shielding is to solve the tance. Gamma rays are emitted at essentially
Boltzmann transport equation which describes all times after fission but a major share
the radiation transport. It is possible in princi- (-8 MeV/fission) appear within a few nano-
ple, of course, to make direct measurements of seconds. Gamma rays following isomeric transi-
the necessary radiation attenuations, but in tions in fission-fragment nuclei (important for
practice this is seldom practical. Thus, given times after fission ~ I sec) and following {3-
the necessary nuclear data, radiation-shielding decay (for t> I sec) together amount to an-
analysis reduces largely to (1) developing nec- other -7 MeV/fission.
essarily approximate methods of solving the The 14-MeV neutrons produced in fusion of
Boltzmann equation, (2) testing their validity, light nuclei constitute the dominant radiation
usually against measurements in simplified geo- for a fusion source. For high-energy accelerators
metries, and (3) applying these methods to or space vehicles, the sources of penetrating
design problems. radiations are dominantly due to nuclear inter-
The Boltzmann equation may be considered actions of the primary charged particles. 11-
simply as a balance equation tabulating the mesons, formed from the decay of other par-
sources and disappearances of particles. In this ticles, are extremely penetrating due to their
sense, and ignoring time dependence, it may small interaction cross sections, and they tend
be written as: to dominate parts of the shield for accelerators
IJ . ncf;(r, E, n) + ~t(r, E) cf; (r, E, [2)
with energies> 10 GeV.

ff ~s(r, n' ~[2,E' ~E)X


A broad review of radiation sources is given
in the "Engineering Compendium of Radiation
= S(r,E,n) + Shielding."io
Cross Sections. The probability of interac-
tion or cross section must be treated in great
cf;(r,E',n') dE' dn' detail in radiation-shielding problems, espe-
where cf; is the number flux of particles of cially for neutrons. The cross sections are
strongly energy-dependent, in part with rapid
!Lnergy E, at position r, with angular direction variations or fluctuations with neutron energy.
n. The left side of the equation represents Neutrons may (1) disappear, giving rise only to
all losses including leakage out of the region
of interest and disappearance by reactions of charged particles of no consequence for the
all types with probability ~tcf;. On the right radiation transport; (2) produce fission or
side, S represents all neutron sources such as fusion with the sources these reactions imply;
fission and the integral includes in-scattering (3) scatter, changing energy and direction, thus
into the spatial and energy region of interest introducing "new" particles into the transport
of particles scattered from elsewhere. This equation; (4) scatter as in (3) with the addi-
"scattering" process may in the most general tional production of deexcitation gamma rays;
or (5) be "captured" with the emission of
sense include the production of gamma rays capture gamma rays. For shielding, one requires
upon the interaction of neutrons. the sum of gamma rays produced in fission,
Solving this integro-differential equation re- capture, and inelastic scattering.
quires input for the radiation sources Sand The vast amount of cross-section information
interaction probabilities ~ including details required for accurate shielding calculations con-
about secondary particles which may be pro- stitutes a central problem, and perhaps more
duced. The allowable radiation levels corre- effort is applied altogether in cross-section
sponding to cf;(E) are generally provided as preparation than in transport calculations. Neu-
constraints determined by the radiosensitivity tron cross sections are measured at many
of man, materials, or systems which may be "points" in energy. These detailed point sets
exposed to the radiations. Then one solves for are then commonly reduced to a limited
the minimum shield thickness (or weight or number of energy groups (-30 to -300 groups)
cost) consistent with the constraints. with the energy variation in cross section with-
Nuclear Data Sources of Radiation. Fission in each group represented by a weighted aver-
produces neutron and gamma-ray sources im- age. Weighting schemes approximate the proper
portant for shield design. The characteristic but unavailable weighting by cf>(E) for the prob-
821 NUCLEAR RADIA nON SHIELDING

lem being considered. Since <P(E) is a function methods is available in the revised Chapter 3 of
of position, no single set of weights can be the "Weapons Radiation Shielding Handbook."7
exact. The group structure must be especially Virtually all solutions of problems of practical
selected for difficult problems. The validity interest depend upon use of a digital computer,
of sets of group cross sections must be es- and the type of computer available and its
tablished by comparing the results of calcu- cost of operation may constrain the choice
lations using them with values obtained from of methods. The availability of computer codes
calculations with point cross-section sets or developed by others is very important. Codes
from benchmark experiments. of shielding interest are collected and made
The use of group cross sections allows treating available by the Radiation Shielding Informa-
scattering by introducing group-to-group trans- tion Center,12 which also assists new users in
fer matrices with a large reduction in necessary making them operable.
information storage. Similarly, group cross sec- Discrete Ordinates. The nature of the Boltz-
tions can be readily generated to handle secon- mann transport equation as a balance of particle
dary gamma-ray production. flow is closely matched by the discrete ordinates
As for neutrons, gamma-ray cross sections formulation as a finite difference equation, and
can be considered as point or group-averaged the equivalence has been demonstrated for one-
functions of energy. Gamma-ray cross sections dimensional geometries. Thus, few approxima-
tend to be known more accurately than those tions are necessarily required in the application
for neutrons, and the production of other of this method, which obtains the desired
particle types (e.g., neutrons) by gamma rays flux rf> Cr, E, Q) by successive iterations over the
can usually be ignored for low-energy trans- difference cells. A derivation and a method of
port problems. solution of the finite-difference equations which
Nuclear Data Stores. The wealth of cross- are required for a one-space dimension time-inde-
section detail demands computer-based handling pendent problem are given in Ref. 7.
and efficient storage. A National Nuclear ~ross The adaptation of the discrete ordinates
Section Center (NNCSC) has been established method, which was originally developed for
at the Brookhaven National Laboratory for other purposes, to radiation shielding required
storing neutron and gamma-ray cross sections, special attention to several features of the
and this is the appropriate primary source of method. In particular, the steep flux gradients
information in the United States. l l Very im- characteristic of good shields have required
portant is the fact that a cooperative group modified schemes for iteration within the dif-
(The Cross Section Evaluation Working Group or ference cells. For deep penetration in a shield,
CSEWG) has been set up to identify the "best" the angular distributions of scattered particles
set of data for each reaction type and nuclei. This become very important since the forward-
choice may require choosing among sets of scattered radiation can penetrate most readily.
experimental measurements with different val- To handle the anisotropic scattering in a prac-
ues for the same quantities, or may require tical manner, the angular distributions after
filling gaps covered by no measurements, on the scattering from energy group to energy group
basis of nuclear-model calculations or system- are approximated by a Legendre series. A low
atics. This Evaluated Nuclear Data File (ENDF), order of expansion, P3, usually suffices.
which changes with time as new data become The discrete ordinates methods have been
available, is also available from NNCSC.!1 Simi- expanded to two dimensions and widely applied
lar, but more preliminary, data and sets of to practical design problems. Demands upon
groups of cross sections derived from them are computer time have provided incentives for
available from the Radiation Shielding Informa- developing improved forced-convergence tech-
tion Center (RSIC) at the Oak Ridge National niques including those which concentrate upon
Laboratory. 12 the spatial or energy region of interest. Flux
Methods of Solution Exact analytic solu- aberrations have been frequently observed in
tions of the transport equation are not possible two-dimensional problems for small radiation
for practical shielding problems, but numerous sources. These "ray effects" preserve indications
approximate, basically numerical methods are of the finite angular directions in the flux
available. Two of these have assumed the greatest solution, and various methods have been devised
importance today, the discrete ordinates and to eliminate or control them.
Monte Carlo methods. Appiied in their most Practical computer codes are available 12 which
highly developed forms, the residual approxima- incorporate the discrete ordinates method. One-
tions become unimportant. The effort and cost dimensional calculations are very fast and have
required for such applications may be substan- been used as the basis of iterative shield op-
tial, however. Other methods which are easier to timization procedures which perform a series
use offer approximations to the desired solution of flux solutions in one pass on the computer.
which have more or less vailidity depending Two-dimensional calculations require fast com-
upon the complexity of the problem and its puters with large memory stores. It should be
similarity to those from which the approximate noted that the division of neutron energies
method has been derived. A review of transport into groups allows a convenient method of
NUCLEAR RADIATION SHIELDING 822

handling the production of secondary gamma however, requires considerable effort on the
rays by neutron capture and inelastic scattering. part of the user for solving a given problem.
Proper treatment of these secondary gamma Recent developments in Monte Carlo methods
rays is essential in almost all radiation-shielding have tended toward systems, such as those
problems which include fission or fusion sources. using standard neutron energy groups, which
Overall, discrete ordinates is the method of ease the user's burdens. Using the same energy
choice for most one- or two-dimensional radia- groups as those of the discrete ordinates method
tion-shielding problems. The major complica- greatly facilitates comparisons of results. The
tion which is encountered in its use is that ultimate verification of results from Monte
there is no clear a priori approach to defining Carlo calculations as for all other methods must
the best (or even certainly adequate) sets of rest upon comparisons with benchmark experi-
directions, space meshes, and energy groups for ments. A current review of Monte Carlo meth-
a given problem. Experience is the best guide ods gives most emphasis to shielding applica-
to these choices and it must be based upon tions. 13
detailed comparison of test calculations with For reactor-design problems the attenuations
benchmark experiments which are designed for may exceed those which may be practically
for the purpose. treated with Monte Carlo techniques, but parts
Monte Carlo. Under the pressure of neces- of the geometry may require three-dimensional
sity, many components of radiation shields may analysis. Therefore, coupling codes have been
be approximated in one or two dimensions. developed 12 which enable solving part of the
There remain, however, complicated problems, problem using two-dimensional discrete-ordi-
such as the streaming of neutrons through nates and part in three-dimensional Monte
holes, which require a three-dimensional treat- Carlo.
ment. At present, the Monte Carlo method is Other Transport Methods. Many other meth-
the only approach to such problems which is ods have been considered and used to a limited
free of serious approximations. extell.t for solving the transport equation. In
The Monte Carlo method depends upon one approach, the angle-dependent terms are
stochastic estimation or random sampling from represented by expansions in spherical harmonic
the probabilities describing the stochastic pro- polynomials, Pn. Development of the method
cesses that determine the solution of the trans- has been restricted by the requirement to con-
port equation. This mathematical analog for a sider many terms in the scattering representa-
problem of interest may be made as exact as de- tion and the difficulty of treating multidimen-
sired, limited only by the investment of effort sional shielding problems. Limited use of the P3
and computing time. The size of the sample of approximation has been made for one-dimen-
particles of interest (i.e., those penetrating the sional problems, but no two-dimensional calcula-
shield) determines the statistical accuracy tions are known. Diffusion theory, which corre-
achieved. sponds to the Pi approximation, is widely used
The major determinant of the success of the for reactor criticality calculations because of its
Monte Carlo method for problems whose im- relative simplicity, and multidimensional ver-
portant parameters depend on very unlikely sions are available. Its validity is limited to
events such as penetration through thick shields, nearly isotropic fluxes, however, as opposed to
is the' method of sampling. Through "impor- the highly forward-directed fluxes characteristic
tance sampling," larger numbers of particles of shields. Diffusion theory can sometimes be
can be chosen which contribute to the unlikely used for the inner portions of a reactor shield
events of interest. 7 Thus the sampling is biased or to provide a source for transport shielding
without, hopefully, biasing the result, since calculations, but the current trend is to use the
correction factors are applied. The most power- transport calculation throughout the system.
ful biasing technique uses an importance or Another approach to solving the transport
value function based upon an adjoint solution equation consists of transforming the problem
of the transport equation, using the best avail- by constructing spatial moments of the flux
able method, which will usually be one- or two- and calculating in the transform space. This
dimensional discrete ordinates. Such biasing in moments method has a number of computa-
energy and space is routine but angular biasing tional advantages and it was used widely for
has usually been limited to one-dimensional gamma-ray transport. It is, however, greatly
problems. Estimates of the variance calculated restricted in the geometries that can be consi-
upon the basis of a normal distribution may be dered, and has been applied largely to infinite
grossly in error, since Monte Carlo distributions homogenous media. Furthermore, for neutron
are frequently nonnormal, and unfortunately, transport, difficulties persist in reconstructing
error estimates are usually too low. This diffi- the flux from the moments.
culty must be kept in mind in using the Monte The so-called invariant imbedding, or matrix,
Carlo method for shielding problems. method does not solve the transport equation
Several powerful and general Monte Carlo directly, but rather solves for reflection and
codes are available 12 for a variety of computer transmission functions throughout the medium.
types. The inherent flexibility of the method, A particular (shield) configuration is imbedded
823 NUCLEAR RADIA TJON SHIELDING

in a larger class of configurations, and solutions penalty for error or overdesign tends to be less
are obtained for all in one calculation, an ob- and the costs for meaningful mockups exces-
vious efficiency. Definition of reflection and sive. An obvious exception would be the space
transmission functions and their solution by nu- reactors for which the shield design is crucial.
merical methods do not differ greatly from other Measurements on reactors as built can in prin-
apt;'roaches to the transport equation. Bound- ciple be used to validate calculation methods.
aries are precisely included in the formulation, In practice, however, access to interesting por-
it is well suited for heterogeneous shields, it is tions of reactor or accelerator shields is limited
efficient for thick shields and inefficient for and down time required for performing mean-
thin. The outstanding limitation is that applica- - ingful measurements is very expensive. The
tions have been made only for slab geometries. test is also rather late for judging methods for
Approximate Methods. Before large comput- anything other than serial production of power
ers made transport ·solutions practical, the plants.
kernel technique for radiation transport based The most useful tests of transport methods
upon the use of a Green's function was widely follow from comparisons with experiments
used. The point kernel, K(lr - ,'i), relates the specifically designed for the purpose. Such
desired response of a detector at point' to a experiments attempt to reproduce in the sim-
radiation source at point ,'. Integration over plest arrangement attainable the essential fea-
a source volume yields the solution to problems tures of the system of interest (e.g., a fast-
of arbitrary geometry. However, values for the neutron spectrum for applications to fast
kernels have been obtained from solutions by reactors or large attenuations in the pertinent
methods valid only for infinite homogeneous materials for thick shields). The experimental
media, and these kernels give results which are results must be accurate, reliable, and unambig-
in error when applied to finite media. Scattering uously interpretable above all else. Measure-
may be approximated by adding one of a variety ments include the neutron energy distributions
of build-up factors as a function of energy E penetrating the shield, using one or more of
and separation I, - ,'I. For gamma rays, useful the various neutron spectrometers which have
build-up factors were derived from moments- been developed for the purpose. An example
method calculations. of the test of a two-dimensional discrete-
For neutron attenuation, many shields con- ordinates code against measurements of neutron
tain large regions of water following other spectra serves to illustrate the process.
materials. This water has the effect of filtering Sensitivity Analysis and Channel Theory The
out scattered neutrons, making the material complexity of radiation shielding problems has
appear as an absorber. The absorption or re- made worthwhile developing sensitivity analysis.
moval cross section, as derived empirically from The sensitivity of a desired response R (such as
experiment, was used in defining a kernel a radiation dose outside a complex shield) to
applicable for shields with outer hydrogenous changes in input parameters ax (such as the
regions. Other kernel approaches have been used neutron cross sections) is just the functional
in order to parameterize the results of transport derivative aRlaax. Using generalized perturba-
solutions. tion theory, it is possible to obtain for a single
A logical extension of the removal concept response sensitivities to all input parameters.
was to follow the deep penetration of neutrons This allows identification of those data which
as described by the removal process by modera- are most significant in obtaining accurate atten-
tion in energy using diffusion theory. This uation predictions. When combined with esti-
removal-diffusion method has been highly de- mates of uncertainties in the input data, know-
veloped and widely applied, especially in Europe, ing the sensitivities allows estimating the uncer-
in the design of shields for power reactors using tainty in the desired response. Uncertainties in
fission induced by thermal neutrons. The basic the model (i.e., effects of choice of energy
assumptions upon which the method is based groups or angular quadrature for discrete-
are not fulfilled for fast reactors . The removal ordinates calculations) can also be taken into
process is considered for many energy groups account.
(to 18) from which neutrons can be transferred Examination of the sensitivity profile (sensi-
to other energy groups used to describe the tivity as a function of energy, for example) may
diffusion process. The key step then becomes offer insights into the physical processes (such
derivation of the group-transfer matrices from as "holes" in the cross-section variations with
experiments which determine the penetration energy) which are important or crucial in deter-
of monoenergetic neutrons in various materials. mining shield penetration. Sensitivity analysis
A modern reference to approximate shield- can and should be used in the design of shield-
design methods is available in a survey of nuclear ing experiments to demonstrate that the impor-
reactor engineering. 14 tant parameters of the experiment correspond
Validation of Calculations Mock-up experi- to those in the shield design of interest. Asession
ments, which are universally used to validate was devoted to this topic at the last interna-
reactor-core criticality calculations, are much tional conference on reactor shielding. IS
less common for shielding calculations. The While sensitivity analysis has tended to con-
NUCLEAR RADIA TION SHIELDING 824

sider energy regions of greatest importance in Neutrons and Secondary Gamma Rays in the At-
attenuation, spatial channel theory is used to mosphere," Nucl. Sci. Eng. 46,334 (1971).
locate regions in space that may be of domi- 7. Abbott, L. S., Oaiborne, H. C., and Clifford, C. E.
nant importance. Thus streaming paths for (Eds.), "Weapons Radiation Shielding Handbook,"
neutrons can be identified and visualized, even DNA-1892 (formerly DASA 1892). Chapter 3 has
for the very complicated geometries of fast been issued in revised form as DNA-1892-3, Rev.
breeder reactors. 16 The additional insight into 1 (March 1972). Available from National Techni-
the radiation-transport process thus gained may cal Information Service, Springfield, Virginia
be used to determine weaknesses in shield de- 22151.
signs and/or the optimum placement of shield- 8. Spencer, L. V., et al., "Structure Shielding Against
ing materials. Fallout Gamma Rays from Nuclear Detonations,"
For Further Information The development NBS Special Publication 570, U.S. Government
of radiation-shielding technology is almost Printing Office, 1980.
entirely dependent upon high-speed digital 9. Alsmiller, R. G., Jr., et al., "Shielding of Manned
computers. This dependence of reliable solu- Space Vehicles Against Protons and Alpha Par-
tions to transport problems upon computer ticles," ORNL-RSIC-35, Nov. 1972.
codes with masses of nuclear data required as 10. Jaeger, R. G., et al. (Eds.), "Engineering Com-
input precludes the handbook approach to pendium on Radiation Shielding," Vol. I, "Shield-
practical design problems. Therefore, a Radia- ing Fundamentals and Methods," New York and
tion Shielding Information Center has been Berlin, Springer-Verlag, 1970.
established to assist in providing the necessary 11. National Nuclear Data Center, Brookhaven Na-
tools for solving shielding problems and in tional Laboratory, Upton, L. I., N.Y. 11973
providing means of retrieving the pertinent (516/345-2091).
literature. The Center does not attempt to solve 12. Radiation Shielding Information Center, Oak
problems directly. It does provide coverage of Ridge National Laboratory, Post Office Box X,
the literature for all of the indicated areas of Oak Ridge, Tennessee 37830 (615/574-6176).
application. It also collects, packages, evaluates, 13. "Proceedings of a Seminar-Workshop, A Review
and makes available to users the computer-based of the Theory and Application of Monte Carlo
methods and data which have been developed. Methods, Oak Ridge, TN, April 21-23, 1980,"
Feedback from users regarding their problems ORNL/RSIC-44, August 1980.
and successes in applying the methods is impor- 14. Glasstone, S., and Sesonske, A., "Nuclear Reactor
tant in leading to improvements. The Center Engineering," New York, Van Nostrand Reinhold,
may be contacted at the address given. 12 The 1981.
Center was responsible for publication of the 15. Roussin, R. W., et al. (Eds.), "Proceedings of the
"Proceedings of the Fifth International Con- Fifth International Conference on Reactor Shield-
ference on Reactor Shielding," held in Knoxville ing, Knoxville, Tennessee, April 18-22, 1977,"
in 1977 .15 Included here are a wide range of Science Press, 1977.
relevant papers. 16. Williams, M. L., and Engle, W. W., Jr., "Channel
A general text on radiation shielding and Theory in Shielding Analysis," Nucl. Sci. Eng. 62,
dosimetry was written by A. E. Profio and 92 (1977).
published in 1979. 17 17. Profio, A. E., "Radiation Shielding and Dosim-
etry," New York, Wiley, 1979.
FRED C. MAIENSCHEIN
Cross-references: FISSION, FUSION, NUCLEAR IN-
STRUMENTS, NUCLEAR RADIATION, NUCLEAR
References REACTIONS, NUCLEAR REACTORS, NUCLEAR
STRUCTURE, TRANSPORT THEORY.
1. Lahti, G. P., et al. (Eds.), "Radiation Streaming in
Power Reactors," RSIC-53, February 1979.
2. Farinelli, U., and Nicks, R., "Physics Problems of
Fast Reactor Shielding," Atomic Energy Review NUCLEAR REACTIONS
17,1(1979).
3. Schaeffer, N. M. (Ed.), "Reactor Shielding for Several years after the discovery of the nucleus
Nuclear Engineers," U.S.A.E.C. TID-25951, 1973. by scattering a-particles from gold, Rutherford
Available from National Information Service, and his collaborators noticed that if air were
Springfield, Virginia 22151. exposed to the flux of a-particles, occasionally
4. Santoro, R. T., et al., "Neutronics and Photonics a very penetrating particle was observed. After
Calculations for the Tokamak Experimental Power some nuclear detective work, this phenomenon
Reactor," Nucl. Tech. 37,274 (1978). was explained in the following way: the nitro-
5. Santoro, R. T., et al., "Monte Carlo Analysis of gen nucleus and the a-particle react to produce
the Effects of a Blanket-Shield Penetration on the an isotope of oxygen and an energetic proton.
Performance of a Tokamak Fusion Reactor," In chemical notation, such a reaction may be
Nucl. Tech. 37,65 (1978). written as 14N + a -+ 17 0 + p, where the super-
6. Straker, E. A., "The Effect of the Ground on the scripts are the atomic mass numbers of the ele-
Steady-State and Time-Dependent Transport of ments in question.
825 NUCLEAR REACTIONS

Nuclear reactions may take place only when Neutrons form a special class of projectiles,
a target nucleus and a projectile come close since they are not charged and need not possess
enough together for the nuclear forces to take any large amount of energy to overcome a re-
effect. The range of nuclear forces is very short, pulsive electrostatic barrier. In fact, the slower
about 1.5 fermis (1 fermi = 10- 13 cm). Since the neutron, the more likely it is to interact
nuclei and all the massive projectiles except with a target simply because it spends more
neutrons are charged positively, they repel each time in the vicinity of the target nucleus.
other; and if they are to be brought into suf- Nuclear reactions may also be initiated by
ficiently intimate contact to interact, the en- PHOTONS (electromagnetic radiation quanta)
ergy of the projectile must equal or surpass the of very high energy, greater than 5-10 MeV.
repulsive electrostatic force. Typically, for pro- The photons are produced as x-rays when
tons on light nuclei, the energy required for a high-energy electrons impinge on a target.
reaction to take place is 1 or 2 MeV rising to Since photons are not charged electrically,
15 MeV for protons on very heavy, highly there is no barrier; but all photonuclear reac-
charged nuclei. Thus to produce nuclear reac- tions are endoergic, that is, they require several-
tions by the collision of charged particles, we million-electron-volt x-rays to take place. High
must first accelerate one of them to an energy energy accelerators produce copious amounts
sufficient to overcome the electrostatic repul- of unusual projectiles: 1'(- and K-mesons and
sive force. In all nuclear reactions, energies are other exotic particles with a sufficiently long
usually given in million electron volts (MeV) half-life. While research with these projectiles is
and masses in atomic mass units (amu). One not terribly common at the present time, there
MeV is the energy acquired by a particle of one are indications that their use will increase.
electronic charge as it is accelerated by an elec- Rutherford's original reaction would now be
trical potential of one million volts. An atomic written thus: 14N(a, p) 17 0. The target nucleus
mass unit is defined as one-twelfth the mass of comes first, the projectile and the emitted par-
a neutral carbon-l 2 atom. ticle or particles appear in that order inside the
A nuclear reaction, like its chemical counter- parentheses, and the residual nucleus is last.
part, may be exoergic (kinetic energy is liber- The superscript gives the atomic mass of the
ated) or endoergic (kinetic energy is absorbe8). isotope. Some common abbreviations are p for
For example, Rutherford's original reaction is proton, n for neutron, d for deuteron (2 H), t
endoergic, consuming 1.19 MeV which is con- for triton eH), 'Y for 'Y-ray or photon.
verted into mass of the product particles ac- The projectile energy range over which nu-
cording to Einstein's E = mc 2 . In the case of clear reactions are studied varies from a fraction
endoergic reactions, energy must be supplied in of an electron volt for neutrons to several hun-
the form of projectile kinetic energy to make dred million electron volts per atomic mass unit
the reaction possible. for charged particles. At very high energies,
In the course of a nuclear reaction, the pro- lighter projectiles appear to interact with the in-
jectile and the target may fuse completely or dividual neutrons and protons of the target
parts of nuclear matter, neutrons, protons, or rather than with the nucleus as a whole, and we
clusters of these may be transferred from the leave the domain of nuclear reactions to enter
target to the projectile or vice versa. the field of ELEMENTARY PARTICLE inter-
The projectiles used to produce nuclear re- actions.
actions are usually energetic charged particles. Nuclear reactions are literally the foundation
In the early studies of nuclear reactions, the on which our world is built. The energy of the
most common projectiles were the ISOTOPES sun is nuclear in origin deriving principally from
of hydrogen, protons or deuterons, as well as the fusion of four protons into a helium nucleus
a-particles. Because these projectiles have small in the 'course of which 22.7 MeV are released in
electric charges, they feel the lowest electro- each fusion. The fusion is not a direct four-
static repulsion from the target and need the body reaction but rather proceeds by stages
least amount of energy to accomplish their pur- through many two-body reactions (see SOLAR
pose. Recently, however, increasing emphasis ENERGY SOURCES). Not only is our chief source
is being placed on heavy ion projectiles, from of energy of nuclear origin, but the constituents
lithium to uranium. Because the heavy nuclei of the earth are also the result of long-gone nu-
carry a large charge, they require much more clear reactions, principally a series of (n, 'Y) pro-
energy to approach the target closely enough cesses which served to build up the elements in
for the nuclear matter to interact. Usually the the earth as we now find them. One particularly
energy is expressed in terms of MeV per nu- pleasing success of nuclear reaction theory and
cleon (MeV/A); and for heavy ions the desirable experiment is the fact that the abundance of
energy for nuclear reactions is around 10 the elements and their isotopic ratios as they
MeV/A; thus, for example, the total energy for occur in nature can be calculated simply on the
uranium nucleus would be 2,380 MeV. Another basis of the probability with which neutrons are
field of nuclear reactions that is increasing in captured by various nuclei, these probabilities
popularity uses very high energy electrons (500 having been measured in nuclear research
MeV to 5 GeV) usually produced by linear centers.
accelerators. Not only nature's energy source is of nuclear
NUCLEAR REACTIONS 826

origin. The two mightiest sources of man-made ment of cancer, for metallography, and for
energy, nuclear FISSION and nuclear FUSION, food preservation.
derive from nuclear reactions. By means of nuclear reactions such as 241 Am
In fission, a neutron is captured by a uranium (11 B, 4n)248Fm, scientists have been able to
nucleus which splits (fissions) into, say, a Ba make new elements not found in nature. Some
and Zr nucleus, in the process releasing about of these, for example plutonium and califor-
200 MeV of energy and some neutrons which in nium, have found important uses as reactor
turn split other U nuclei in the vicinity. This fuels or portable power sources. The prepara-
leads to the familiar chain reaction which, if tion of new elements has played a decisive role
controlled, is used to produce power by con- in our understanding of the chemistry of heavy
verting the fission energy (heat) into electricity. elements.
If the chain reaction is allowed to proceed with- We turn now to a description of nuclear reac-
out control, a violent explosion results, i.e., an tions. These may be regarded as proceeding in
atomic bomb. (We note parenthetically that two principal ways. First, the colliding particles
what is commonly known as atomic energy may fuse, their components get thoroughly
should really be called nuclear energy since its mixed in a very "hot" compound nucleus. This
source is not the entire atom, but only its nu- compound nucleus may exist in a heated state
cleus. Burning of coal, on the other hand, is for a period varying between 10- 16 and 10-20
atomic energy since heat is derived from the second, a time we normally consider imponder-
combination of a carbon atom with two oxygen ably short but which is nevertheless long on the
atoms to form CO 2 .) nuclear time scale when compared with the
Fusion as a source of energy derives from a transit time of a nucleon across the nucleus
reaction such as 2H(d, n)3He in which 3.3 MeV 00- 22 second). The compound nucleus boils off
are released. This process is similar to the source fragments, mainly neutrons, protons, a-particles,
of solar energy in that hydrogen nuclei fuse to and 'Y-rays, and in this way cools down to a nor-
produce helium. Controlled fusion reactions mal energy content. In its final state, it is called
that proceed slowly and are contained spatially a residual nucleus; and it may be radioactive or
are being studied at many laboratories, and stable depending on the details of the reaction.
much progress has been made to the ultimate If the target, the projectile, and the energy of
goal: a controlled fusion reactor, which how- the projectile are known, it is possible to
ever is still several decades in the future. An un- predict what the residual nucleus will be; and
controlled fusion reaction has been achieved- thus, if certain isotopes are wanted, one can
it constitutes the energy source of the hydrogen tailor the reaction accordingly. We note here,
bomb. however, that the exact mechanism of these
It is clear that reserves of coal and oil must reactions is very complicated and is not really
someday be exhausted. When this happens, understood in a fundamental way.
mankind will require new sources of energy. In the other kind of reaction, the nuclei do
Two candidates for this are derived from nu- not fuse but only part of the nuclear matter
clear reactions, fission or fusion. With due care, is transferred from one nucleus to the other.
either one could become the source of energy This is often called a direct reaction and in-
for the next several millennia; and in fact, fis- cludes stripping reactions, pickup reactions,
sion energy is likely to be the major source of and transfer reactions. A typical example is
electrical power in many countries by the end 2 7 AI(d, p)28AI, where a neutron is stripped
of the century. from the deuteron and caught by the Al nu-
The terrors of war and the blessings of abun- cleus while the remaining proton stays relatively
dant power both come from nuclear reactions. undisturbed on its original course. The nuclei
But that is not all. Perhaps the most significant in such reactions do not come into intimate
contribution to science has been the use of contact, and the reaction is fast on the time
radioactive elements produced in nuclear reac- scale of 10- 22 second.
tions. Let us take a typical example. Consider In general, light projectiles are used to study
the reaction l3C(n, 'Y)14C. This reaction pro- individual states of nuclei. For example, if a
duces radioactive 1 4C which has a half-life of proton is scattered inelastically, it will deposit
5700 years. The radioactive carbon decays a certain amount of energy in the target nucleus
with the emission of an electron, which can be before it emerges from the collision. By study-
counted with a suitable detector. Thus we are ing the amount of energy deposited, the angle
able to locate individual atoms of carbon and at which the proton leaves its target, and some-
separate them from all others which are not times by measuring the spin of the proton be-
radioactive. Such radioactive atoms are called fore and after collision, physicists are able to
tracers; and by using a variety of them, l3C, obtain detailed information about the energy
18F, 32p, 35S, 131 1, 198Au, all produced by levels in the nucleus such as the spin and parity
some sort of nuclear reaction, unprecedented of a particular state. The most common reac-
advances have been made in biology, geology, tions used for this kind of study are inelastic
chemistry, metallurgy, physiology, and medi- scattering, stripping and pickup reactions such
cine. Furthermore, radioactive isotopes in large as (d, p) or (d, t) reactions. In order to interpret
amounts can be used instead of x-rays for treat- reactions of this type, phvsicists employ an
827 NUCLEAR REACTORS

optical model, in which the nucleus in all its with the nucleus then provides information
complication is replaced by a complex potential about nuclear density at that radius.
containing a real part, an imaginary part respon- In spite of many years of research, a deep un-
sible for nuclear absorption, and a spin-orbit derstanding of nuclei and a theoretical ability.
part that accounts for the fact that many of the to predict their properties still elude us. Nuclear
projectiles have spin which influences the reac- reactions are a prime tool for the investigation
tion process. The optical model and the dis- of nuclei. By careful study of nuclear reactions
torted wave Born approximation calculations in all their multiplicity of detail, including the
for direct reactions have been very successful in precise knowledge of the projectile, its energy
analyzing the details of the nuclear reactions and its spin direction, the nature, as well as en-
and enable physicists to interpret reaction data ergy, angle of emission, and the spin of the re-
and thus learn more about the structure and action product, a good deal of information has
properties of nuclei, as well as about the reac- been gained about this realm. A deep under-
tion process itself. standing of nuclei yet eludes us. We have no
Heavy ion reactions are used to study an en- accurate predictive theory, even though we
tirely different set of nuclear properties. In- have searched for it for many decades. The
stead of learning about details of the nucleus, prize, however, is worth reaching for: its prom-
with heavy ions physicists can study the nu- ise is to reveal the nature of nuclei, the small-
cleus as a whole, learn about its viscosity, about est stable constituents of matter, and the laws
oscillations and deform ability of nuclei, about which govern their interactions to give man im-
the compressibility of nuclear matter and even mense power for war or for peace.
about the possibility of observing nuclear shock-
waves. The nuclear model that is most useful ALEXANDER ZUCKER
here is the liquid drop model invented in the
early 1940s to describe fission. In this model, References
the nucleus is represented by a liquid drop that
rotates, changes shape, has a surface tension, Two advanced books which deal with the nuclear re-
and in other ways mocks up a nucleus. A par- actions are:
ticularly interesting heavy ion reaction is deep Cohen, B. L., "Concepts of Nuclear Physics," New
inelastic scattering, a process in which two nu- York, McGraw-Hill, 1971.
clei just touch then rotate together for a frac- Satchler, G. R., "Introduction to Nuclear Reactions,"
tion of a full rotation while they exchange New York, Wiley, 1980.
between them significant amounts of nuclear
matter and energy. These reactions are inter- Cross-references: ACCELERATOR, PARTICLE; ELE-
mediate between compound nucleus and direct MENTARY PARTICLES; FISSION; FUSION; ISO-
reactions. Another useful property of heavy ion TOPES; MESONS; NEUTRONS; NUCLEAR STRUC-
reactions is the capability to produce isotopes TURE; PHOTON; PROTON; RADIOACTIVITY;
far from stability simply by the appropriate se- SOLAR ENERGY SOURCES; TRANSURANIUM
lection of the target and projectile. For example, ELEMENTS.
nuclides such as tin-IOO are now being looked
for in heavy ion experiments. (Recall that the
most stable tin isotope is 120 Sn, and the lightest
stable tin nuclide is 112Sn.) NUCLEAR REACTORS
Electrons are used almost exclusively for the
detailed study of nuclear structure. Electrons Definition A nuclear reactor is a device in
have the advantage that their interaction with which the fission of a nuclear fuel releases
nuclei is purely electromagnetic and thus under- large amounts of energy in a controlled chain
stood theoretically. In principle, then, when reaction involving neutrons. Reactors serve as
the interaction is understood the information important heat sources for the commercial gen-
about nuclei is much cleaner and more readily eration of electricity and can as well supply
interpreted. There is also hope that with high- neutrons and useful radioactive isotopes.
energy electrons one can probe the subnuclear Fission and the Chain Reaction When a
structure due to quarks in nuclear matter. uranium-235 nucleus absorbs a neutron, an
Exotic projectiles such as 1T- or K -mesons at excited compound nucleus of uranium-236
the present time provide interesting informa- is formed. In about 15% of the cases this re-
tion about the nuclear surface, or what one sults in gamma ray emission, while in 85%
might even call the nuclear stratosphere. It is there is a splitting of the nucleus into two frag-
well known that nuclear densities fall off rapidly ments. These fly apart with high speed, with
with radius, and it is a matter of considerable total kinetic energy around 200 million elec-
controversy just what the structure of the sur- tron volts (MeV). In this fission process beta
face is, whether there are mostly neutrons or particles, gamma rays, and an average of about
mostly protons, and how the density decreases 2.5 high-energy neutrons are emitted. The neu-
for various nuclei. 1T- and K-mesons form mesic trons induce additional fissions, and under
atoms in quasi-electronic states. If the state is proper conditions a neutron chain reaction
known, its radius is known, and its interaction results.
NUCLEAR REACTORS 828

Since only one neutron is needed to continue During the period 1945-1960 many reactor
the chain, extra neutrons are available. Depend- concepts were studied by industry in coopera-
ing on the size and materials of the reactor, tion with the U.S. Atomic Energy Commission.
these neutrons will be captured in non fission- Out of these, three favored types emerged: the
able materials or will escape from the reactor, pressurized water reactor (PWR), developed and
a process called leakage. If uranium-238 is marketed by companies such as Westinghouse
present, neutron capture can lead to the pro- Electric, Babcock and Wilcox, and Combustion
duction of plutonium-239. Engineering; the boiling water reactor (BWR),
Fission fragments consist of a great variety of mainly a product of General Electric; and the
isotopes, with the most likely split in the mass high-temperature gas-cooled reactor (HTGR),
ratio 3: 2. They bear most of the kinetic energy built by General Atomic. In Canada, the heavy-
that can be transformed into thermal energy water reactor CANDU was developed, and in
and then into electrical energy. The heavy par- Great Britain the main power source was the
ticles resulting from fission constitute the col- gas-cooled natural uranium graphite reactor.
lection of material called fission products. Many The number of power reactors in operation
of the isotopes are radioactive, with half-lives throughout the world, as of June 30, 1982, is
ranging from a fraction of a second to millions given in Table 1. The United States has 77 in
of years. Although there are some useful radio- operation and a comparable number on order
isotopes among them, the bulk must be re- or under construction. Worldwide, 23 countries
garded as a waste, to be disposed of in some have 265 reactors in use.
safe manner. Reactor Components The only ingredients
When on the average each neutron reproduces required for a chain reaction are a fissile isotope
another neutron, the assembly is said to be and neutrons. The main fissile materials are
critical. Similarly, if fewer or more neutrons are uranium-235, plutonium-239 and -241, and
produced per cycle, the system is subcritical or uranium-233, produced by neutron capture in
supercritical. These three cases correspond to a thorium-232. Neutrons of high energy (-0.9
neutron multiplication factor of exactly 1, less MeV) are required to cause fission in uranium-
than 1, and greater than 1, respectively. 238. Practical operation demands as well a vari-
The chance of capture or fission of nuclei ety of components for heat removal, radiation
tends to vary inversely with neutron energy. protection, and control.
Low energy neutrons (-0.025 eV) are thus said Fuel. Various combinations of heavy isotopes
to have a higher cross section than high energy comprise the "fuel" of a reactor. This fuel is
neutrons (-1 MeV). A target isotope is classed "burned" in the sense that it is consumed and
as fissile if slow neutrons can induce fission. gives up energy. Examples of fuel are natural
History The discovery of fission in Germany uranium, uranium slightly enriched (-3%) in
in 1939 led to the initiation of a major effort in uranium-235, and highly enriched uranium
the United States in World War II to exploit the (-90%).
new energy source for military purposes. Two Control Absorption. A reactor that is to be
paths of research were followed-(a) to separate operated steadily for long periods of time re-
the desired uranium-235 (0.7% of uranium as quires fuel in excess of the amount required to
found in nature) from uranium-238 and (b) to be critical. Thus, at the start of operation, an
produce plutonium by means of neutron cap- excess of neutron-absorbing material such as
ture in uranium-238. Each of the programs boron must be present. Typically this control
was successful and eventually led to peacetime absorber is in solution or in the form of mov-
applications. able rods.
The first "pile" (nuclear reactor) was built at
the University of Chicago by Enrico Fermi and
his colleagues. The reactor consisted of lumps TABLE 1. WORLD REACTORS
of natural uranium metal and oxide imbedded (As of 30 June 1982).
in a large block of graphite. On December 2,
1942 it "went criticaF' and demonstrated that Number of
a self-sustained chain reaction was possible. Operable Electrical Capacity,
The reactor served as a model for plutonium Reactors Megawatts
production reactors constructed at Hanford,
Washington. Uranium isotope separation was United States 77 57,977
achieved at Oak Ridge, Tennessee by two dif- France 29 19,988
ferent processes-electromagnetic and gaseous Japan 25 16,652
diffusion-leading to uranium enriched to about USSR 31 16,515
90% in uranium-235. Atomic bombs were de- Germany FR 12 9,801
veloped at Los Alamos, New Mexico, in the pe- United Kingdom 32 8,048
riod 1942-1945. One was composed of U-235, Sweden 9 6,400
the other of Pu-239. The war with Japan was Canada 10 5,476
terminated through the use of these weapons, All Others 40 17,492
and a new era of peactime applications of TOTAL 265 158,349
nuclear energy began.
829 NUCLEAR REACTORS

Moderator. The neutron fission cross section Studies are made of both static (steady state)
for uranium-235 for neutrons of energy 0.025 and dynamic (transient) behavior, using neu-
eV is around 580 barns (1 barn = 10-z4 cm Z ). tron diffusion theory based on Fick's law for
At 2 MeV, the initial energy of fission neutrons, approximate calculations, and transport theory
the cross section is only 1.3 barns. Thus to based on the Boltzmann equation for more pre-
maintain a chain reaction with a small amount cise analysis. The quantity of interest is called
of fuel, it is desirable to reduce ("moderate") neutron flux, which is the product of neutron
the Ifeutron energy through collisions with number density and neutron velocity. In com-
nuclei of a light element such as hydrogen or bination with material cross sections, it de-
carbon. Moderators such as light water (HzO), termines reaction rates.
heavy water (D 2 0), graphite (C), and beryllium The simplest mathematical description of a
oxide (BeO) slow neutrons to the energy cor- reactor was devised in the early 1940s. Neglect-
responding to thermal agitation in reactors ing neutron losses by leakage, the number of
called "thermal." In contrast, a reactor without neutrons produced per initial neutron is given
moderator is called "fast" because the neutrons by the (infinite) multiplication factor k, which
remain at high speed. is a product of four components: E, the fast
Coolant. In order to prevent melting and to fission factor, representing the effect of high
extract useful heat, a cooling agent is needed. energy neutrons; p, the resonance escape prob-
Most reactors use ordinary water as coolant; a ability, the chance of not being absorbed during
few use heavy water, liquid sodium, or helium. slowing; f, the thermal utilization, as the frac-
Reactor types are sometimes distinguished by tion of neutrons absorbed in fuel; and 'rI, the
the coolant used, e.g., light water reactors reproduction factor, as the number of neutrons
(LWR's). These are of two types. In pressurized per absorption in fuel. Thus if k = Epf'rl is greater
water reactors (PWR's) the coolant is at pres- than I a chain reaction is possible. If not, as is
sure over 2000 pounds per square inch and at the case of natural unmoderated uranium, a
temperature around 600 degrees Fahrenheit. chain reaction is impossible. The earliest task of
Heat is transferred from a primary circulating researchers during World War II was to find
coolant loop by use of a heat exchanger (steam the combinations and purity of materials that
generator). In boiling water reactors (BWR's) at would yield a k value greater than 1. The ef-
somewhat lower pressure and temperature, forts of those who design, operate, and analyze
steam is generated within the reactor. modern reactors are still directed toward the
Structure. The fuel in typical reactors is optimization of k during the operating cycle.
formed into small cylindrical pellets of UO z Nuclear Fuel Cycle The principal raw ma-
of volume about 1 cm 3 . These are placed in terial for nuclear fuel is uranium. It is mined,
long (3 m) metal tubes to form what are called chemically purified, enriched in the isotope
fuel rods. The tubes provide structural support uranium-235, and fabricated into fuel assem-
and also prevent fission products from escaping blies. These are shipped in special casks by
into the coolant. Fuel rods are spaced to allow truck or rail to the power plant for storage
coolant flow between them to remove fission until needed.
heat generated in the UO z . Around 200 rods A typical reactor is operated on a one-year
form a bundle called a fuel assembly; around cycle, in which it produces steady power except
180 assemblies form the reactor core, the for occasional interruptions for repairs. At the
region of the system in which the chain reac- end of the period the reactor is shut down, and
tion takes place. The core is supported within a about one-third of the assemblies are removed
thick-walled steel container called a pressure from the core as spent fuel. The remaining two-
vessel. thirds of the assemblies are relocated in the
Shielding and Containment. To protect work- core and fresh fuel is added. The loading pat-
ers against radiation released by the reactor, tern is designed to give a power distribution
concrete shielding is erected. The reactor vessel that is as uniform as possible and to obtain
and coolant system are located in a large cylin- maximum energy from the fissile material. The
drical building with steel-reinforced and steel- reactor is brought again to operating pressure
lined concrete walls. This containment is de- and temperature.
signed to withstand pressure in the event of an The spent fuel, which is highly radioactive, is
accident to the coolant system. stored in metal racks in water-filled pools. In
Reactor Physics The specialized branch of some countries such as Great Britain, France,
physics that deals with experiments and theory and Japan, the spent fuel is reclaimed by re-
of reactors is called reactor physics. This con- processing, which consists of cutting fuel rods
sists of measurements and calculations of neu- into small pieces, dissolving out the uranium
tron cross sections, the number and energy oxide, and separating uranium and plutonium
distribution of fission neutrons, multiplication from the fission products by chemical treat-
factors, critical sizes and masses, and power ment. In the United States, spent fuel continues
reactor performance. The latter involves the to be stored.
amount of control absorption required, neutron Reactor Safety The physical processes that
and power spatial distributions, fuel consump- govern the operation of a reactor in themselves
tion and production, and isotope generation. provide inherent safety against accident. One
NUCLEAR REACTORS 830

effect is the delayed emiSSlOn of a small frac- (214,000 years), and iodine-129 (15.7 million
tion of neutrons, as a consequence of beta- years) are of principal concern.
decay governed by a half-life. The time constant Several techniques for disposal of radioactive
of power rise in a slightly supercritical system is wastes obtained by the chemical reprocessing of
very much longer (of the order of seconds) than spent fuel have been studied. The most likely
it would be without delayed neutrons (of the method will involve immobilizing the fission
order of milliseconds or microseconds). An- products by solidifying them in glass and bury-
other is the negative feedback effect of tem- ing them in corrosion-resistant waste con'tainers
perature increase of fuel, as in a power rise. deep in the earth. Favored geologic media are
Increased thermal agitation of fuel molecules rock salt, basalt, or granite.
causes a broadening of neutron resonance cross Purpose and Uses of Reactors The principal
sections in what is labeled the Doppler effect. use of nuclear reactors is in commercial elec-
This has the effect of reducing neutron multi- trical power generation, serving as an alternative
plication, which tends to counteract the initiat- to heat sources using coal, oil, or natural gas.
ing power increase. These reactors typically generate around 1000
Other safety factors are provided by equip- megawatts of electrical power. Figure 1 is a
ment design. If it becomes necessary to reduce schematic diagram of a pressurized water reac-
the power of the reactor quickly, "safety rods" tor, showing the reactor vessel, the primary
are rapidly driven into the reactor core. These loop, the heat exchanger, turbine, generator,
neutron absorbing components are composed and condenser. Figure 2 shows a group of three
of boron carbide or an alloy of cadmium, silver, reactors at a site in South Carolina.
and indium. Reactors are used for propulsion of naval
There is residual heat being produced, how- vessels, including submarines and aircraft car-
ever, by the radioactive decay of the fission riers. Reactors are also used to generate pluto-
products. Although this heat power amounts to nium for military purposes.
only a small percentage of the fission power, it Research/training reactors are generally of
is high enough to melt fuel if adequate cooling low power, up to a few megawatts. They are
is not provided. In the event of a loss-of-cool- used in universities and government laboratories
ant accident (LOCA) in which a pipe in the pri- throughout the world for education and nuclear
mary reactor cooling loop breaks, equipment research, including studies of the effects of neu-
called the engineered safeguards (ES) system trons and gamma rays on materials.
comes into play. Its central feature is the emer- Reactors serve as a major source of radioiso-
gency core cooling system (ECCS) which injects topes for a variety of peaceful and beneficial
cooling water from various sources using special purposes. Neutrons produced by fission are the
pumps. primary agents by which isotopes are produced,
Much attention has been given to an accident either through capture or through fission.
that occurred in March 1979 at the Three.. Mile Uses of Radioisotopes Some examples of
Island reactor in Pennsylvania. The cause was a radioisotopes and their uses are listed. Cobalt-
combination of inadequate design, equipment 60, of half-life 5.3 years, emits two gamma
failure, and reactor operator misinterpretation rays of average energy 1.25 MeV. These rays are
of events and consequent error in handling the alternative to x-rays for medical diagnosis and
situation. As a result, some radioactive material treatment of cancer and for industrial radiog-
was released, the reactor was damaged, and the raphy . The isotope is also a very promising
building was seriously contaminated. The in- source for irradiation of food to eliminate pests
cident prompted corrective action in all nuclear or to sterilize the food for long-term preserva-
power plants by industry and the U.S. Nuclear tion. Plutonium-238, half-life 86 years, is an
Regulatory Commission. . alpha emitter produced by neutron irradiation
Radioactive Wastes from Reactors No use of a neptunium isotope. This isotope serves as
has yet been found for most of the radioactive an excellent source of heat from which electric
materials produced in a reactor. Among the power is produced for space vehicles. Tritium,
fission products are isotopes of short half-life the heaviest isotope of hydrogen (H-3) has a
such as iodine-131 (8 days) which constitute a half-life of 12.3 years. Its low-energy beta
hazard only if radioactive gases are released particles make it a good "tracer" to follow
from the reactor. The longer-lived krypton-85 biological processes by measurement of the
(about 11 years) does not pose a major problem radiation. Reactors are widely used to irradiate
since it is a noble gas, and being essentially inert samples to induce radioactivity and thence to
it disperses throughout the earth's atmosphere. determine chemical composition by measure-
Two isotopes provide the main radioactivity for ment of characteristic radiation. This process,
the first hundred years after nuclear fuel is re- called neutron activation analysis, is valuable
moved from a reactor. These are strontium-90 for studies of environmental pollution because
(~29 years) and cesium-137 (~30 years), it can detect minute amounts of impurities.
which are beta and gamma emitters, respec- Breeder Reactor Most reactors in current op-
tively. For the long term, isotopes such as eration are based on thermal neutrons and con-
plutonium-239 (~24,000 years), technetium-99 sume slightly enriched uranium, obtained by an
831 NUCLEAR REACTORS

tu,.bine
generator

~~:: condenser
cooling
water

FIG. 1. Fluid flow diagram of pressurized water reactor system. (Courtesy of Atomic
Industrial Forum, Inc.)

isotope separation process. Since the main fis- is high (~3). For each neutron used to perpet-
sile material is the U-235 in natural uranium, uate the neutron multiplication cycle, slightly
large amounts of uranium ore are required. more than one neutron on the average is avail-
These reactors are classed as converters, with a able to produce Pu-239 from capture in U-238.
rather small amount of plutonium production. The plutonium then can serve as new fuel. The
In contrast, reactors that do not have a mod- breeding ratio (fissile atoms produced)/(fissile
erator can operate with fast neutrons, where atoms consumed) is greater than 1 and the re-
the number of neutrons per absorption in fuel actor is classed as a breeder. Such a reactor uses

FIG. 2. The three-reactor Oconee nuclear power plant near Seneca, S.C. (Courtesy of
Duke Power Company.)
NUCLEAR REACTORS 832

all the scarce U-235 and indirectly the abun- some reactors already ordered. In contrast, nu-
dant U-238 as fuel. Employment of breeders clear plant construction is going ahead in sev-
instead of converters would reduce the amount eral foreign countries where energy needed
of uranium ore required by a factor of about 50. cannot be met by the use of coal and where the
The length of time before resources are seri- regulation and construction situation permits
ously depleted could be increased from less more rapid completion of facilities than is pos-
than a hundred years to thousands of years. sible in the U.S.
Prototype breeder reactors have been built and It is likely that nuclear reactors will provide a
development is proceeding, notably in France. significant amount of the world's electric power
Safeguards and Proliferation Several nations for several centuries. For the longer term, fu-
have nuclear weapons capability while many sion reactors using deuterium as fuel might be
others do not. On the premise that the more found to be feasible and practical as an alterna-
countries that have weapons the greater the tive nuclear energy source.
chance they will be used, there is general appre-
hension about nuclear proliferation. Reactors RAYMOND L. MURRAY
can be employed to produce plutonium of
weapons grade; plutonium separated from com- References
mercial reactor fuel can be made into a crude
weapon. Other avenues to nuclear arms capa- Henry, Allan F., "Nuclear-Reactor Analysis," Cam-
bility are available, however, including uranium- bridge, MA, MIT Press, 1975.
235 isotope enrichment. It is believed that any Hewlett, Richard G., and Anderson, Oscar E., Jr.,
country can go nuclear if it wishes to dedicate "The New World," Vol. 1, 1939/1946, U.S. Atomic
enough of its resources, but it is generally felt Energy Commission, 1972.
that every effort should be made to discourage Hewlett, Richard G., and Duncan, Francis, "Atomic
countries from doing so. Shield," Vol. 2, 1947/1952, U.S. Atomic Energy
Public Attitudes toward Nuclear Reactors Commission, 1972.
The rate of development of nuclear power has Murray, Raymond L., "Nuclear Energy," Elmsford,
been influenced by diverse public opinion over NY, Pergamon Press, 1980.
the years. The use of nuclear weapons to end Murray, Raymond L., "Understanding Radioactive
World War II was acclaimed at the time as serv- Waste," Columbus, OH, Battelle Press, 1983.
ing to save lives, but in recent years the con-
tinued expansion of nuclear arsenals has be- Cross-references: ATOMIC ENERGY, FISSION, FU-
come a source of great concern. This fear has SION, ISOTOPES, NEUTRON, NUCLEAR REAC-
been carried over to the peaceful use of fission, TIONS, RADIOACTIVITY, NUCLEAR RADIATION
with many people believing incorrectly that a SHIELDING.
reactor can explode like a bomb.
The public generally concedes a need for al-
ternative sources of energy in view of the high NUCLEAR STRUCTURE
cost of oil, but many believe that reactors are
unacceptable because of the chance of accident The basic problem of nuclear structure was
that could release radioactive material. first sharply defined in 1932 when Chadwick
Concern is also expressed about the accumula- discovered the neutron. The nucleus was then
tion of radioactive wastes and the apparent in- recognized to be a system of A particles (nu-
ability of government and industry to imple- cleons) of nearly equal mass (Z protons and
ment a plan for disposing of the wastes. N =A - Z neutrons). The forces binding the
When nuclear energy first became available, it "nucleons" together must be quite distinct
was believed that electrical power would be ex- from the well-known electromagnetic interac-
tremely inexpensive. Over the years, however, tions which will evidently operate to push the
total costs increased because of the introduc- nucleons apart, as well as gravitational forces
tion of additional safety equipment, escalated which appear to be too weak to have any
construction expense, delays in licensing, and influence in the nucleus at all.
changes in regulatory policy. Nuclear power is The nuclear forces are found to be of short
cheaper than power from oil-fired plants but range (_10- 13 cm). Within their short radius of
about the same as that from coal-fired plants. interaction, they become strongly attractive
The high capital cost of modern nuclear power (depths near 100 MeV), but at very small dis-
plants-well over a billion dollars for a 1000 tances the interaction becomes so powerfully
MWe facility-is regarded by many consumers repulsive that nucleons seldom come closer than
of electricity as too large a burden for them to about 0.4 X 10- 13 cm from one another. This
bear. character of the nuclear force produces "satu-
Because of uncertainties in the regulatory sit- ration." The binding energy per nucleon is
uation, public attitude, and economic factors, nearly constant (-8.0 ± 1 MeV) throughout the
new nuclear plants have not been ordered by periodic table, while each nucleon occupies a
utilities in the U.S . for several years and con- volume which may be approximated by a sphere
struction has been delayed or suspended on of radius 10- 13 cm.
833 NUCLEAR STRUCTURE

The forces described above are assumed to The major details which focus attention on these
operate between each pair of nucleons. Their numbers may be summarized as follows:
exact nature is rather complicated and depends (1) Stability and abundance. Nuclei with N
upon the motion, and even the orientation of or Z magic are unusually tightly bound and
the nucleons. The two-body portion of the correspondingly very abundant. If both Nand
nuclear force is now reasonably well deter- Z are magic (2He2, 6C6, sOs, 14Si14, 20Ca20,
mined through studies of nucleon-nucleon 20Ca2S, and S2Pb126), then the nucleus is even
scattering, meson-nucleon scattering, and basic more tightly bound (here, the left subscript is the
quantum field theory. Considerable attention number of protons; the right is the number of
is now being focused on the investigation of neutrons). The last nucleon to complete a
multi-nucleon interaction operators. In particu- magic number has sometimes nearly twice the
lar, since two-pion exchange is now known to binding energy of an average nucleon. Further-
make a very important contribution to the more, nuclei with Z magic possess an unusually
two-body part of the interaction, the three large number of stable isotopes, and nuclei with
nucleon force arising from two-pion exchange N magic have an unusually large number of
must be carefully investigated. stable isotones.
Even if the nuclear force were precisely (2) Neutron capture cross sections. Nuclei
known, the structure of the nucleus would not with neutron number one short of a magic
be immediately understood in its entirety. number have a large neutron capture cross sec-
Methods are not at present available for solving tion, while nuclei with a magic number of
the three-body problem in closed form for neutrons exhibit a small neutron capture cross
simple interactions, and for more than three section.
bodies, drastic approximations must be used (3) Islands of isomerism. Long lived 'Y-active
to make the problems tractable. For this nuclear states (half-life-l second) appear just
reason, considerable attention in physics is prior to the completion of a magic number.
devoted to constructing simplified models of the (4) Electric quadrupole moments. Nuclear
nucleus. The purpose of these models is to quadrupole moments tend to be small near
isolate the salient characteristics of the nucleus, magic numbers and large far from magic num-
and thereby obtain some understanding of its bers. A nucleus with N or Z magic appears,
behavior. therefore, to prefer a spherical shape.
Two very general classes of nuclear models (5) Delayed neutron emission. Delayed neu-
have received considerable attention. The sim- tron emitters (e.g., 36KrSl' S4XeS3, and S09)
plest may be described as "powder models." appear when N is one greater than a magic
Here it is assumed that the motion of nucleons number.
within the nucleus is so complicated that The data clearly point to an interpretation of
statistical mechanics may be employed. In one the magic numbers in terms of shell closures,
version of this model, the nucleus is treated in analogous to the atomic structure of noble
analogy to a drop of liquid (the nuclear forces gases. Thus it appears that the nucleon-nucleon
described above are qualitatively quite similar interaction somehow averages out within the
to forces between the molecules of a liquid). nucleus so that each nucleon, to a reasonable
Such a chaotic state seems to ensue if the first approximation, sees a fairly smooth po-
nucleus is highly excited, so that the nucleons tential. Such a potential should be flat near the
have a great deal of kinetic energy. For this origin (like an harmonic oscillator), and should
reason, powder models have found their great- go rapidly to zero outside the nucleus (like a
est usefulness in describing the nucleus during square well). A suitable form which has under-
a nuclear reaction, and especially in describing gone much study is the Fermi function.
nuclear fission.
In the shell model (sometimes called the inde- V(r)=Vo [l+expa(r-a)]-l (1)
pendent particle model), the motion of the indi-
vidual nucleons is assumed to be much more In order to reproduce the magic numbers one
simple. Each nucleon is, in fact, assigned a must add a spin-orbit term:
definite orbit within the nucleus. The resultant f(r)l-s (2)
motion is then reminiscent of our usual picture
of electrons within the atom, all revolving nearly to this central potential. Here I is the orbital
independently of each other about the nucleus. angular momentum of a nucleon, and s its spin.
The motivation for introducing this model into The effect of this l-s interaction is to couple the
nuclear physics came from the early (1934) ob- orbital and spin angular momenta so that the
servation that nuclei that contained certain states with total angular momentum j = I + !
numbers of protons or neutrons were unusually will be lower in energy than states with j = I + ! .
stable and abundant. These numbers are gener- Shell closures obtained with such a potential
ally referred to as magic numbers. The set we are found to reproduce the observed magic
recognize today as magic are numbers in a simple and striking manner.
The fact that the shell model works at all in
Nor Z = 2, 6, 8, 14,20,28,50,82, and 126 nuclei is somewhat surprising since the theoret-
NUCLEAR STRUCTURE 834

ical motivation that enhances its success in ble to attribute all of the properties of the
atomic structure is lacking. One has no strong nucleus to the nucleons in unfilled orbits. One
central field originally in nuclei, and further- must take into account possible distortions of
more the nucleons are much more closely the orbits of nucleons in the closed shells.
packed. Consequently, collisions, which should This line of thought gives rise to the "collec-
tend to scatter a nucleon out of its orbit, should tive model," in which the nucleons in the core
be more frequent. It is found that the Pauli (closed shells) are treated as an incompressible,
exclusion principle plays a vital role here. When irrotational fluid capable of surface oscillations.
such a collision occurs, the orbit into which a Nucleons in unfilled shells will exert a centrifugal
nucleon would tend to be scattered is actually force on this fluid and tend to deform it into
occupied by another nucleon. This fact makes a nonspherical shape. In regions of the periodic
the shell model a far better first approximation table where several shell model orbitals are
than one would originally suspect. nearly degenerate in energy (90 ~ N ~ 114 and
The main effect of the nucleon-nucleon inter- Z > 88), the nuclear core takes on a rather
action is to produce the average shell model large spheroidal deformation. The formalism of
(single-particle) potential. This, of course, is not the theory is reminiscent of that for diatomic
the sole effect. Many refinements are required molecules. Low-lying excited states are gener-
before an adequate description of nuclei may be ated through a rotation of the entire system
attained. about an axis perpendicular to the axis of sym-
Consider, for example, a nucleus with just metry. This gives rise to a band of energy levels:
two nucleons beyond a double closed shell. One
of the nucleons will go into an orbit j, and the
other into an orbit j'. These two orbits may
orient themselves such that the nucleus may where I is a moment of inertia. The identifica-
have total angular momentum J anywhere tion of spectra in nuclei which could be empiri-
within the range cally fitted to Eq. (4) was a great truimph of the
collective model.
Ij- j'1~J~j+j' (3) It should be noted that I is not the moment of
inertia of the entire nucleus. It is only the mo-
so long as one is consistent with the Pauli ex- ment of inertia of the part of the nucleus
clusion principle. If one only had single-particle which participates in the rotation, and this is,
potentials all states of this configuration would of course, always less than the "rigid body"
have the same energy independent of J. To de- value.
termine the level order actually observed, one We note that the shell model and collective
must recognize that not all of the original model appear to be quite different in their
nucleon-nucleon interaction is used up in con- basic assumptions. The collective model pre-
structing the single-particle potential. Some "re- sumes that the nucleon motions are so closely
sidual interaction" is left over. This residual correlated that we can treat a rotation of the
interaction will still be of the two-body type, nucleus on the whole, while the shell model
and it will remove the degeneracy between begins with independent orbitals. It is very
states of the same configuration. important to remember, in this regard, that the
The most important feature of the residual shell-model orbitals must be properly coupled
interaction is that it produces a pairing force together in the nucleus. The Pauli exclusion
for like orbitals (i.e., j = /). If j = j', and we have principle must be satisfied, angular momentum
like nucleons, the interaction is by far the must be a good quantum number, and so on.
strongest if J = O. This has the important con- These orbitals are, therefore not so independent
sequence that for nuclei with both an even as it might seem. They may very well give rise
number of protons and an even number of neu- to collective effects depending upon the exact
trons, one must have net angular momentum way in which they are coupled together to ob-
equal to zero. This rule is never violated. If one tain the final nuclear wave function .
adds one nucleon to this even-even nucleus, Various coupling schemes have recently been
the result will be a J equal to the j of the examined to investigate this, but these are
odd nucleon. This rule is violated in only a beyond the scope of this article. We present
very few cases. here a simple illustrative example. Consider a
There are a few nuclear phenomena which set of identical particles attracted by means of
the shell model cannot describe at all. The sign identical springs to a common origin. At time
of the static, nuclear, electric quadrupole mo- t = 0 each particle is at the origin and is
ments are generally given correctly by a single- given some arbitrary velocity. Even though
particle model, but the magnitude is frequently each particle has a different initial velocity,
found to be larger than that predicted by more and moves independently of every other par-
than a factor of ten. Similarly, electric q uadru- ticle, the system will undergo a periodic dilation
pole transition rates are frequently much faster and contraction due to the fact that the
than those given by the shell model. It seems that frequency of vibration is the same for each
even in low-lying nuclear levels it is not possi- particle. Thus we have a collective motion
835 NUCLEAR STRUCTURE

exhibited by a set of independently moving References


particles. Baranger, M., "Recent Progress in the Understanding
The next step in the investigation must be to of Finite Nuclei from the Two-Nucleon Interaction,"
attain a quantitative interpretation of the in- Proceedings of the International School of Physics,
dividual particle motion itself. This must come Course 40, p. 511 (Academic Press, 1969).
directly from a full treatment of the many- Eisenbud, L., and Wigner, E. P., "Nuclear Structure,"
body nuclear problem using realistic nucleon- Princeton, N.J., Princeton University Press, 1958.
nucleon interactions. This problem is compli- Elliott, 1. P., and Lane, A. M., "Handbuch der Physik,
cated by the fact that nuclear forces become Yol. 39, p. 241, Berlin, Springer-Yerlag, 1957.
sharply repulsive at very short nucleon-nucleon Feenberg, E., "Shell Theory of the Nucleus," Prince-
separations. Thus the nuclear wave function ton, N.J., Princeton University Press, 1955.
exhibits strong correlations between nucleons Goldhammer, P., Rev. Mod. Phy s., 35, 40 (1963).
which must be accounted for in any final Inglis, D. R., Rev. Mod. Phys., 25, 390 (1953).
theory. The two-body correlations seem well Mayer, M. G., and Jensen, J. H. D., "Elementary
accounted for by a formalism devised by Theory of Nuclear Shell Structure," New York,
Bruechner, Bethe, and Goldstone, but the ef- John Wiley & Sons, 1955.
fects attained when three or more nucleons all Rajaraman, R., and Bethe, H. A., Revs. Modern Phys.,
come into close proximity are yet to be com- 39,745 (1967).
pletely evaluated in finite nuclei.
Cross-references: ATOMIC PHYSICS ; NUCLEAR RE-
ACTIONS; NEUTRON; PROTON; STRONG INTER-
PAUL GOLDHAMMER ACTIONS; WEAK INTERACTIONS.
o
OCEAN ACOUSTICS tering coefficient depends on wind speed and
direction, the direction of propagation of a
Introduction Ocean Acoustics is the study of wave on the surface, and the angular distribu-
the correlation between oceanic variability and tion of ocean wave energy.
its effect on underwater acoustic signals. A vast Acoustic signals propagating in the ocean
range of scales is involved: time scales from scatter from the surface both coherently and
seconds to months, acoustic frequencies from incoherently. Models of this scattering include
a few hertz to several hundred kilohertz, both perturbation methods for small wave height
deterministic and random ocean inhomogene- surfaces, tangent plane or Kirchhoff scattering
ities ranging from centimeter size to circulations for surfaces with large radii of curvature, and
and currents extending hundreds of kilometers. multiple scattering techniques for strongly
It encompasses not only physics but also aspects rough surfaces. Multiscale surface models con-
of biology and chemistry. Signal propagation taining a long-scale periodic component on
ranges can vary from meters to thousands of which is superimposed a small-scale random
kilometers, through a strongly inhomogeneous surface have met with success. The coherent
medium whose effects alter the sound speed of wave propagates in the specular direction and
the acoustic signal in many ways. We separate is formed by averaging the wave field over the
the description, not inconveniently, into the statistical ensemble of random surfaces. For
effects of the ocean surface, its bottom, and the shallow surfaces it is a Gaussian function of the
interior domain on acoustic propagation and Rayleigh roughness parameter. For rougher
scattering. surfaces multiple scattering must be incorpo-
The Ocean Surface The ocean surface is an rated. Backscattered sound can be modeled as
example of a complex wave motion formed arising from the glitter due to a facet model of
through the action of the wind. Its properties the surface. At very high frequencies trapped
can be understood in terms of the statistical air bubbles near the surface are the main mech-
properties of wave heights. Both the wave anism of surface sound scattering. The frequency
height probability distribution and its correla- spectrum of the scattered field is thus composed
tion function, or the Fourier transform of the of a single spectral line (due to specular reflec-
latter, the surface (spatial) spectral function, tion) and a broad background due to the in-
play an important role. The major height param- coherent contributions.
eter is the Rayleigh roughness parameter, L = The Ocean Interior The ocean is a strongly
ka cos e, where k is the wavenumber, a the inhomogeneous medium. The inhomogeneities
e
rms surface height, and the angle of incidence are of two types, regular and random. The fun-
of an acoustic wave (measured from the nor- damental cause of a regular inhomogeneity is
mal). Values of L> I describe a very (strong) the dependence of the ocean sound speed on
rough surface. position. In fact, because of this inhomogeneous
For a fully developed sea (one where the structure, the sound speed (or index of refrac-
wind has blown long enough so that the surface tion) in the ocean acts like a converging lens
is in equilibrium) the surface height and slope to the propagating acoustic field, and results in
are nearly Gaussian distributed, although the the creation of an effective ocean duct or wave-
distribution is skewed due to the creation of guide. This is called the SOF AR channel (for
broad troughs and narrow swells on the surface. sound fixing and ranging) or the underwater
Additional discrete spectral contributions can sound channel (USC). Its net effect is to refract
result from periodic surface components. In and focus the acoustic signal. Sound trapped in
regions where the surface is composed of ice, this duct can propagate for thousands of kilom-
other models must be considered. eters.
For shallow surfaces the surface spectral func- The mean speed of sound in the ocean is about
tion can be directly related to the scattering lSOO mls and can vary approximately ± so m/s.
coefficient. Little direct data is available on Even though changes from the mean are small,
the spatial spectrum itself, but it can be inferred they can significantly alter sound propagation
from frequency spectra and the ocean gravity over long ranges because their effects are cumu-
wave dispersion relation. In addition, the scat- lative. The sound speed c can be parameterized

836
837 OCEAN ACOUSTICS

in two ways. The first way is as a function of Other examples of regular inhomogeneities in
temperature T, salinity S, and pressure P, that the ocean are bottom protrusions such as sea-
is, c(T, S, P). Near the ocean surface, c varies mounts, which can occur on a horizontal scale
primarily due to the seasonal or diurnal tem- of hundreds of kilometers and a vertical scale
perature changes. Near the ocean bottom on the order of kilometers, and, in general, sea
(below about 1;5 km in the Atlantic for ex- depth variability even when more gradual such
ample) the temperature and salinity are nearly as near coasts and shores. Large-scale highly
constant and the major functional dependence energetic variability also occurs in the upper
is on pressure. In the deep ocean this latter ocean regions in the form of circulating spin
yields a stable monotonic increase in c as a offs from large current flows such as the Gulf
function of depth. In general then the sound Stream and Kuroshio current near Japan. Due
speed is largest near the surface and bottom to their size they are generically called meso-
with a minimum at what is referred to as the scale phenomena. Examples include the synoptic
SOF AR axis. Examples are illustrated in Fig. 1. mesoscale eddies and rings which can move as
The second parametrization of c is a func- well-defined rotating water masses for long
tional dependence on spatial coordinates (depth times at speeds of about 10 cm/s with the actual
z, range r, angle 8) and possibily on time t, that circulation speeds about an order of magnitude
is, c (z, r, 8, t). The temporal dependence is larger. Their scale sizes are roughly 100 km in
often ignored since the ocean is usually con- the horizontal and about 1 km in the vertical.
sidered to be temporally frozen on the time Thus they are highly anisotropic, as is much of
scale of most ocean acoustic experiments. Al- the oceanic variability. Their net effect on sound
though surface ducting can occur due to tem- propagation is again due to the fact that they
perature variability and, if an experiment modify the sound speed structure. For example,
occurs over long periods of time, longer-scale counterclockwise rotating eddies pull colder
effects on the system (such as tides) must be water up towards the surface thus lowering the
considered, the main effect arises from the depth effective sound speed and producing a deeper
dependence. There is a slower (adiabatic) range, refraction.
dependence, and a slower still angular varia bility. Additional ocean inhomogeneities can be

1480 1555 14801510 1480 1540 1440 1480 1510 1560

FIG. 1. Examples of typical vertical sound speed profiles c (z). Profile (a) is a two-layer sound speed with each
layer approximated by a linear segment in depth; (b) illustrates a sound speed which can form a surface sound
channel; (c) is a typical profile from the Pacific; (d) is an Arctic profile illustrating a steep gradient and the effect
of colder water; and (e) is a representative northeastern Atlantic profile.
OCEAN ACOUSTICS 838

classed as randomly varying on the acoustic modeled as a large deterministic refraction


time scale. The ocean can be considered as a (due to the depth variation of temperature,
density-stratified fluid. Displacement of a fluid salinity, and pressure), additional multidimen-
particle from equilibrium and the action of a sional deterministic variability due to mesoscale
restoring (gravitational) force on the particle phenomena, and random modifications due to
produces a wavelike motion in the ocean called smaller scale phenomena such as internal waves,
an internal wave. Scale sizes of these waves are fine structure and turbulence. Functionally it
as large as 10 km in the horizontal and 100 m can be written as
in the vertical. They have periods of anywhere
from a few minutes to about a day. For fixed c(x, t) = co(z) + Cl (x) + C2 (x, t).
source and receiver they are a major source of
fluctuations of the sound speed and hence of Here Co is the purely depth-refractive contribu-
the acoustic intensity within a frequency inter- tion; Cl the deterministic spatial mesoscale ad-
val of cycles per day and cycles per hour. Con- dition depending on the spatial vector x and
siderable effort has been expended in attempting smaller than Co by about two orders of magni-
to understand the fluctuation effects of internal tude; and C2 the spatial and possibly temporal
waves on acoustic propagation and extensive variation due to random effects, which is smaller
results are available on the resulting acoustic than Co by about four orders of magnitude.
amplitude and phase-rate spectra. Internal waves The ocean is also a very noisy environment.
contain much less energy than the mesoscale At low frequencies (below 10Hz) the major
phenomena. noise source is geophysical (earthquakes, vol-
For many purposes it is useful to treat the canic activity, etc.), with additional contribu-
ocean as a layered medium. It is sometimes tions from turbulence and nonlinear wave
nearly exactly so. For exa~ple, the vertical interactions. At higher frequencies (40-400 Hz)
profiles of temperature, density or salinity can ship traffic produces a steady background noise
have an almost steplike structure, with layer because of low sound absorption. In the lower
thicknesses from tehs of centimeters to meters kilohertz regime the wind and ocean surface
and horizontal extent on the order of kilom- interactions (such as breaking waves and
eters. This ocean fine structure, as it is termed, trapped air bubble collapse) produce the
affects acoustic propagation, since the vertical majority of the noise. At still higher frequencies
layer gradients can alter the sound speed field. molecular effects can produce some thermal
The trajectories of acoustic rays can be changed noise. In addition there is biological noise
due to the gradients, or wavelike reflection produced at various ocean locations due to the
properties from the layers can be significantly presence of marine organisms.
altered. Currents in the ocean can also con- There are several different mathematical
tribute to these effects. Periods for the fine models for acoustic propagation in the ocean.
structure are on the order of minutes, and fine They depend of the dimensionality of the prob-
structure is less energetic than internal waves. lem and on whether the oceanic variability is
Note that both the internal waves and fine deterministic of random. The acoustic field in
structure as well as the deterministic mesoscale the ocean is the solution of a Helmholtz equa-
phenomena are highly anisotropic. An example tion which can be derived from the perturbation
of random isotropic ocean variability is the solution of the fundamental fluid equations. If
homogeneous isotropic turbulence due to ocean this solution is represented as an amplitude and
microstructure. Here horizontal and vertical phase term and diffraction is neglected, then
scales are about 10 cm with periods of about a the most widely used propagation method, ray
minute. This turbulence is generally confined to theory, can be derived. It is valid at high fre-
the upper mixed layer of the ocean and is due quencies, and the wavelength should be much
to atmospheric effects. Examples include wave less than any significant length scale. It is in-
breaking and bubble cavitation. It is the least valid when the phase curvature does not change
energetic of the inhomogeneities we have as rapidly as the projection of the amplitude
described. gradient on the phase gradient (e.g., shadow
There is an added problem in treating many regions, edges of obstacles, caustics, or focal
of the above phenomena, since they often do points) and when the fractional change in the
not occur with any particular regularity in the refraction index is not small with respect to a
ocean. Not only is there the problem of under- wavelength (as at the ocean surface and bot-
standing the effects of mesoscale, internal waves, tom). Its advantages are that the rays can be
etc., there is the additional problem of identi- found independently of the amplitude, and the
fying the existence of the particular phenome- amplitude derived from the intensity along a
non or combinations of phenomena which may ray where the ray coupling (diffraction) is
be contributing to an ocean experiment or neglected. It is computationally simple to im-
interrupting or modifying the performance of plement and yields at the least a good qualita-
some system. As we have mentioned, all the tive picture of the acoustic field. It also can be
above inhomogeneities affect the sound speed. used multidimensionally.
The sound speed in the ocean can thus be If the sound speed profile is one-dimensional
839 OCEAN ACOUSTICS

(depth-dependent) and the propagation regime longer the propagation range and the lower the
can be modeled as a waveguide, then the z- absorption. The absorption is mainly due to
dependent part of the acoustic field constitutes liquid shear viscosity and oceanic chemical re-
an eigenvalue problem or a normal mode prob- laxation processes. Inhomogeneities (air bub-
lem. The boundary conditions on the acoustic bles, marine organisms) also scatter the sound
velocity potential at the air-water interface are and it is usually the combined effect of absorp-
nearly perfectly reflecting (soft or Dirichlet tion and scattering (called attenuation) which is
conditions) and at the bottom either perfectly measured. The attenuation coefficient is roughly
reflecting (hard or Neumann conditions) or are proportional to frequency squared so that low
varied to include bottom loss. The discrete frequency sound can, as we remarked, propa-
modes can be found numerically or, in certain gate thousands of kilometers. As a comparison,
cases for exactly solvable profiles, analytically. laser energy in the ocean would be totally
Statistically rough boundaries can be incorpo- absorbed in less than a kilometer.
rated using perturbation methods, and, depend- One final ocean topic should be included. We
ing on the model chosen for the bottom, a have mentioned direct propagation problems
continuous spectral contribution can be in- where the sound speed is given and the resulting
cluded. The latter is important at short ranges, acoustic field is computed. Potentially far more
while the discrete spectrum is the dominant important are inverse methods, where from a
contribution at long ranges. If the profile has an knowledge of the source and received signal
additional slow variability in range, an adiabatic (or its travel time in ray theory) one infers the
mode theory has been developed which treats intermediate oceanic properties such as the
the range variation as a succession of static sound speed. This latter research in ocean to-
depth-dependent problems whose solutions are mography is presently in progress and is a pre-
matched at successive range interfaces. cursor of much of the future oceanic monitoring
General range dependence of the profile can effort.
be modeled using the parabolic approximation The Ocean Bottom The acoustic field inter-
to the Helmholtz equation coupled with a fast action at the ocean surface can be generally
Fourier transform algorithm so that the solu- decoupled from the effects in the air. That is,
tion can be computationally marched in range. the major air-water interface effect is a geo-
The approximation consists in neglecting the metric effect. This is not so with the ocean
term involving the ratio of the second to first bottom interaction. Bottom topography, sub-
range derivative of the field divided by the bottom layering, and the elastic properties of
wavenumber. It has been shown that this is a the bottom can significantly alter low frequency
narrow spectral approximation centered about propagation in the ocean. Thus, not only is
the wavenumber term (which is arbitrary). The there the bottom geometry to consider, but
latter can be used to define an equivalent ray also its material properties such as density and
angle about which the parabolic approximation compressibility. Shear as well as compressional
is also a narrow angle approximation in a for- waves are present.
ward direction. In most presently available Bottom geometry varies on the horizontal
codes the ray angle is along the horizontal. It is scale from the size of small ripples due to cur-
also possible to relate the parabolic approxi- rents on sand to hundreds of kilometers (e.g.,
mate solution rigorously to the Helmholtz solu- large ocean ridges). Vertical protrusions occur
tion and from this to correct the parabolic solu- on the scale of kilometers (e.g., seamounts).
tion particularly in the phase term where the The sound speed in the shallow bottom is near
original approximation fares badly. Other cor- that in the water bottom (about 1550 m/s). It
rections can also be included, such as the inclu- can be less (for clay, 1530 m/s) or greater (sandy
sion of the earth's curvature for long-range clays, about 1700 m/s) or much greater (about
propagation. The small dependence on angular 1800 m/s for low-porosity sediment). In the
variability can also be incorporated into a three- sub-bottom substrate the compressional speed
dimensional parabolic code. is 3-4 times greater than in the water, with
When the ocean is random, propagation codes shear speeds roughly twice as large (3000 m/s).
become more complicated. Unlike the atmo- Estimates of the bottom sound speed can be
sphere, the major propagation effect in the obtained using seismic reflection techniques.
ocean is refraction which is deterministic. Thus Sound absorption in the bottom is roughly
perturbation methods for random oceanic vari- proportional to frequency. At low frequencies
ability must be based on ray- or modal-type the material bottom properties dominate the
solutions which include the deterministic depth sound reflection whereas above a few kilohertz
dependence. Random propagation methods the bottom topography plays an important
have also been applied to investigate the effect role.
of internal waves on acoustic propagation, and Forward scattering from the bottom can sig-
to explain acoustic signal fading. nificantly contribute at low frequencies since
Experimental ocean frequencies vary over six the sound can first penetrate the bottom and,
orders of magnitude, from a few hertz up to the at a distance couple back into the water. It is
megahertz regime. The lower the frequency the generally insensitive to correlation lengths of
OCEAN ACOUSTICS 840

the topography. Backscattering at normal inci- would have at 25 cm. The ratio of the angle
dence is quite large, and for larger incidence subtended by the virtual image to the angle
angles roughly follows the cosine squared de- subtended by the object in the first case is
pendence of Lambert's law. called the magnifying power of the lens. It
depends on the focal length of the lens. For
JOHN A. DESANTO example, if the lens has a focal length of 2.5 cm
and if the virtual image is created 25 cm from
References the observer, then the lens has a magnifying
1. Brekhovskikh, L. M., and Lysanov, Yu. P., "Theo- power of 25/f+ 1, or 11. This value is appro xi-
retical Fundamentals of Ocean Acoustics," Berlin, mat ely the upper limit for a single lens provid-
Heidelberg, New York, Springer-Verlag, 1982. ing convenient viewing without major aberra-
2. Clay, C. S., and Medwin, H., "Acoustical Oceanog- tions to the image.
raphy," New York, Wiley, 1977. Compound microscopes employ two lenses
3. DeSanto, J. A. (Ed.), "Ocean Acoustics," Topics in (the objective and the eyepiece) in order to
Current Physics, Vol. 8, Berlin, Heidelberg, New further magnify the object. The objective is
York, Springer-Verlag, 1979. closer to the object and provides a real and
4. FlaWS, S. M. (Ed.), "Sound Transmission Through inverted image that falls within the focal dis-
a Fluctuating Ocean," Cambridge, U.K., Cambridge tance f2 of the eyepiece. In Fig. 1, u is the
Univ. Press, 1979. distance between the object and the objective
5. Keller, J. B., and Papadakis, J. S. (Eds.), "Wave and v is the resulting image's distance. The ratio
Propagation and Underwater Acoustics," Lecture of the size of the image to the size of the object
Notes in Physics, Vol. 70, Berlin, Heidelberg, New· is called the magnification by the objective and
York, Springer-Verlag, 1977. is equal to v/u. The image serves as an object to
6. Kinsman, B., "Wind Waves," Englewood Cliffs, the eyepiece which functions as a magnifying
New Jersey, Prentice-Hall, 1965. glass since it produces a virtual image (the final
image of the microscope). It is this image that
Cross-references: ACOUSTICS; CAVITATION, ELEC- an observer sees. The combined magnifying
TROACOUSTICS; FOURIER ANALYSIS; NOISE, power of the instrument is the multiplication
ACOUSTICAL; PHYSICAL ACOUSTICS; ULTRA- of the magnification of each component:
SONICS; VIBRATION; WAVE MOTION. (v/u) (25/12 + 1) where f2 is the focal distance
of the eyepiece. The magnification is increased
if u is decreased and v increased. However, a
OPTICAL INSTRUMENTS practical limit of 16 cm for the length of the
microscope tube restricts improvement. Also,
Optical instruments create images of objects by resolution is eventually lost. The magnifying
altering the direction of travel of light rays. power of most microscope designs is limited
They are of two types. One type creates a real to about 2000.
image that is projected onto a surface such as Magnification in Refracting Telescopes Re-
a screen or a piece of film. The other type fracting telescopes are normally employed to
creates a virtual image that can be perceived see objects that cannot be brought close to the
only by an observer. The observer's eye further objective. The magnifying power is defined as
redirects the light rays so that a final image of the ratio of the angle subtended by the image
the object falls across the retina. The major provided by the telescope to the angle that
advantage of optical instruments is that they would be subtended in the observer's field of
afford an image of an object that could not be view with normal sight. A refractive telescope
easily perceived with the unaided eye. With employs two lenses as in a microscope: an ob-
microscopes one can see or photograph details jective and an eyepiece. In the usual adjustment
of an object totally inaccessible with unaided of a telescope the objective produces a real
viewing. With telescopes one can see details image at the focal point of the eyepiece, which
of truly distant objects whose image to the then creates a virtual image that is essentially
observer would otherwise be imperceptible. infinitely far away. The magnifying power is
Here simple microscopes and refracting tele- equal to the ratio of the objective's focal length
scopes will be examined. to the eyepiece's focal length.
Magnification in Microscopes The fineness Astronomical telescopes are often classed ac-
of the detail that one can see in an object with cording to a magnifying power that depends on
unaided sight depends in part on how close the the diameters of the apertures through which
object can be brought to the eye. However, the light must pass. The entrance pupil is simply
for a normal eye an object can be no closer the objective lens. The exit pupil is defined, in
than about 25 cm if a clear image is to form on principle, with an experiment in which a diffuse
the retina. If an optical lens is positioned ap- light source (such as an illuminated ground glass
propriately between the object and the eye, screen) is placed in front of the objective. The
the lens produces a virtual image at a distance light rays passing through the eyepiece are
of 25 cm. This image subtends a larger angle in focused on a surface to form an image called
the observer's field of view than the object the exit pupil (Fig. 2). (Often a metal cap with
841 OPTICAL INSTRUMENTS

_ ... - .. -:-
- ---
~ --_c:~_;~}2~-:' ~,:=~: ~- - ~ ----
t..-r""'" -
>

FIG. 1

entrance pupil
p

Q'" - - - - - - - j; - - - - - - - .. -/2 -..


ground glass
FIG.2 FIG.3

~u~ value of sin a is 1. For air J.I. is 1. Thus the


a circular ~lOle lies at the site of the exit pupil.)
The magmfying power is defined as the ratio of hmlt of resolution in air is A/2 or half the
the diameter of the entrance pupil to the diam- ~avelength of. the li~ht employed'. Closer spac-
eter of the exit pupil. In a simple refracting mg of the pomt objects would eliminate one's
telescope the light must pass through the pupil resolving them separately.
of the eye. In. a good design the exit pupil of In practice the resolution is partially limited
by.ho~ close the object can be brought to the
the telescope IS no larger than the diameter of
pupil in the eye. objective so that sin a is a large as possible.
T2herefore <;>bjectives with short focal lengths
For example, at night the diameter of the
pupil in the eye is about j in. For a 6-in tele- (3, ?;, 0: .q
mch) are employed in microscopes.
T~e hml~ of resol?tion can also be improved
scope (telescopes are often classed according to
the diameter of their objective) the magnifying by mcreasmg the mdex of refraction of the
power should be medium betw.een the object and the objective.
Commonly oil such as cedar oil is placed be-
Magnifying power = 6/ t = 18. tween a cover slip lying over the object and the
objective (Fig. 4).
H the exit pupil of the telescope is larger than Another technique for improving the limit is
~he diam~t~r of the pupil, the full light gather- to employ ultraviolet light whose wavelength
mg capablhty of the objective is wasted.
Resolving Power and Limit of Resolution of
Microscopes The limit of resolution of the
objective of a microscope is the least distance
x between two points that can just be resolved.
Abbe calculated this limit as

x=---
2J.1. sin a
where A is the wavelength of the light J.I. is the
refractive index between the points and the ob-
jective, and 2a is the angle subtended by the
objective in the field of view from the points
(FIg. 3). The term J.I. sin a is often called the
numerical aperture of the objective. The maxi- FIG.4
OPTICAL INSTRUMENT 842

is about half that of visible light. With UV the stop plate


resolving power is doubled but a camera is

~4
needed to capture the image since the eye does I

not respond to UV. _____ .. _---1 ______ ~________ _____ ---


Resolving Power and Limit of Resolution of
Telescopes The limit of resolution of a tele- o 1~
scope is given as an angle ¢ subtended at the
objective by two points on an object. The limit
is the smallest angle such that the two points
can just barely be resolved and is given by FIG.5

¢ = 1.22A./a
plate after passing through a lens. The stop can
where A is the wavelength of the light and a is be used to reduce the size of the image to an
the diameter of the objective. Thus telescopes acceptable diameter of 0.025 cm.
with larger objective diameters yield better The depth of focus of a microscope is ap-
resolving power. However, to insure full ad- proximately inversely proportional to the square
vantage of the objective in a simple refracting of the numerical aperature. At a magnification
telescope, one must be certain that the exit of 1000X an objective with a numerical aper-
pupil is not larger than the diameter of the pupil ture of 1.40 has a depth of focus of only
of the eye. 0.0005 mm.
For example the Yerkes telescope has an Field of View The field of view of an instru-
objective of 40 inches in diameter. With 6 X ment is the angle subtended at the observer's
10- 7 m as a typical wavelength for visible light eye by an object when the whole of the object
its resolution limit is given by can just barely be seen. In the case of a simple
refracting telescope having two lenses this angle
1.22 X 6 X 10- 7 is given by aZ/(f1 + [z), where az is the aper-
¢ = - - - - - - =0.15 second of arc. ture of the eye lens and f1 and f2 are the focal
40
lengths of the objective and eyepiece respec-
The resolution limit of the unaided eye is about tively. (See Fig. 6.) If f1 is doubled so as to
1 minuh' of arc. double the magnifying power of the telescope,
Depth of Focus and Depth of Field The then the field of view is nearly halved.
depth of focus for a camera is the distance the Vignetting is the fall off in brightness at the
plate (or film) in the camera can be moved edge of the field of view provided by a tele-
from the object without spoiling the resolution scope or microscope. It is due to the loss of
of the image. The depth of field is the cor- the rays that pass through the objective but
responding distance the object can be moved. which do not pass through the eyepiece. Vig-
In practice these two limits depend on how the netting is reduced by inserting a circular open-
final photograph is to be viewed. Suppose that ing (called a field stop) at the point where the
a point object lies at a distance from the camera image from the objective is formed.
such that it creates a well focused point on the Illumination in Microscopes The final image
plate. Another point object at a different dis- in a microscope magnifying 1000 X has an area
tance creates a blurred circle on the plate. one million times that of the object. The image
Whether or not these adjacent images can be is therefore only one-millionth as bright, assum-
resolved depends on the resolving power of the ing no loss of light in the instrument. To im-
eye, whose limit is about 1 minute of arc. As- prove the illumination on a transparent object
suming that the eye is normal and that the the light is concentrated on it by means of a
closest separation between the print and the substage condenser.
eye is 25 cm, the two images must be no closer Dark ground illumination is used for viewing
than tiny particles or very fine lines. The illumina-
tion is made too oblique for light to pass di-
X 1T rectly from the condenser into the objective.
- = 1X radians The objects are seen by scattered and diffracted
25 180X60. light against a dark background.
x = 0.007 cm Phase contrast illumination is a method of
making the structure of transparent objects
if they are to be at the limit of resolution. Usu- visible. The arrangement consists of an annular
ally a larger value of 0.025 cm is employed in stop at the condenser. Beyond the objective lies
cameras as being a useful limit. However, if the a phase plate consisting of a glass plate on which
print is enlarged, then the compromise on the there is an annular layer of material to increase
resolution becomes apparent. the optical path of the light by a quarter of a
The depth of focus of a lens can be increased wavelength. Interference between direct and
by reducing its aperture with a stop. In Fig. 5 diffracted light augments the diffraction pat-
a point object creates a circular image on a tern of the image.
843 OPTICAL INSTRUMENT

objective
field stop cap
chief ray I
I,
eye lens

a a pupil

f - - - - - - / 1 + /2-------<·1
FIG.6

Interference microscopy is another method of from a source into its constituent wavelengths
making fine structure visible. The transparent while measuring the intensity at each wave-
object is placed between two semi-silvered sur- length. Interferometers are used for fine dis-
faces. The interference between light passing tance measurement and instrument testing.
through the object and light passing by it is Refractometers measure the index of refraction
then observed. of a substance. Radiometers are employed for
Polarized light, obtained by a suitable polar- measurements of high temperatures.
izer below the condenser, is used in microscopes One area of optical instrumentation that has
designed for geological testing of constituents exploded into a tremendous number of applica-
in rock specimens. tions is that using coherent radiation in the
Other Instruments The range of modern op- infrared, visible, and ultraviolet. Lasers have
tical instruments and technology is vast. Here been used in diverse applications ranging from
only a few items can be mentioned. Photography cutting fabric to reattaching loose retinas. Laser
itself is a large area of ongoing research with beams allow one to measure precisely the dis-
continuous improvement in picture quality tance to the moon as a check on the current
and reduction in size of the equipment. (See theories of gravitation. They also allow one to
PHOTOG RAPHY.) Computer- designed lenses have monitor shifts in the tectonic plates as a pre-
revolutionized popular cameras. Highly sophis- warning of earthquakes.
ticated photography is currently employed in Holography is an imaging process using co-
weather monitoring, earth resource research, herent radiation. Once a novelty for its ability
and intelligence gathering. High-speed photog- to present three-dimensional pictures, it has
raphy has captured images of events far too since proved valuable in industrial testing such
fleeting to be seen otherwise. Even a short pulse as in searching for defects in rapidly rotating
of light has been photographed in midflight. tires. Although not yet fully realized, three-
The medical profession employs optical in- dimensional movies may incorporate it.
strumentation to view inside the human body
without surgery. Bundles of optical fibers il- A. E. E. McKENZIE
luminate the interior of a patient. They also N. C. McKENZIE
carry an image of the interior out to a monitor J. D. WALKER
to allow the doctor to see the interior. A num-
ber of ophthalmic instruments now enable a References
doctor to see into and perform various measure-
ments on the human eye. The military employs Cooper, H. J. (Ed.), "Scientific Instruments," Vols. I
telescopic gun sights, binoculars, periscopes and and II, London, 1946 and 1948.
range finders. Many of these designs use infrared Martin, L. c., "Technical Optics," Second' edition,
radiation and lasers in both the visible and Vols. I and II, London, 1961.
infrared ranges. Civil engineers use theodolites Kingslake, R. (Ed.), "Applied Optics and Optical En-
and lasers for surveying. There is a variety of gineering," Vols. IV and V, London, Academic
optical instruments used in the machine tool Press, 1967 and 1969.
industry for angular measurements, aligning, Jenkins, F. A., and White, H. E., "Fundamentals of
and inspection for faults in the products. Optics," New York, McGraw-Hill, 1976.
Especially important for science and tech- Walker, J. (Ed.), "Light and Its Uses," San Francisco,
nology have been optical instruments designed W. H. Freeman, 1980.
to disperse light into its constituent wavelengths.
These spectrometers have been crucial in the Cross-references: ABERRATIONS; HOLOGRAPHY;
study of the fundamental nature of atoms and INTERFERENCE AND INTERFEROMETRY; LA-
molecules and continue to playa role in many SERS; LENS; LIGHT; OPTICS, GEOMETRICAL;
areas of research in chemistry, physics and PHOTOGRAPHY; REFLECTION; REFRACTION;
astronomy. Spectrophotometers split the light SPECTROSCOPY.
OPTICAL PUMPING 844

OPTICAL PUMPING ter. For example, the optical pumping source


may produce a beam rather than isotropic
Optical pumping describes the transfer of order radiation, the light may be polarized, or the
from a beam of light or other electromagnetic spectral profile of the light may consist of a
radiation to matter. Some examples are the use single narrow frequency band rather than the
of optical pumping to produce spin polariza- broad spectral distribution of thermal radiation.
tion of ground-state or excited-state atoms, Light which is effective for producing order
molecules or ions; the optical production of in matter is also useful for detecting the exis-
metastable populations in matter, the optical tence of order in .matter. For example, spin-
excitation of spin-polarized conduction elec- polarized atoms attenuate and scatter resonance
trons in semiconductors, and the optical pump- radiation differently than unpolarized atoms .
ing of masers and lasers. Since light can be detected as single photons,
The antecedents of optical pumping go back the sensitivity of optical pumping experiments
to the original studies of atomic resonance often is limited by photon counting noise
absorption by Bunsen and Kirchhoff in the (shot noise) rather than thermal noise as is the
19th Century. Zeeman, Wood, Hanle , Heyden- case with conventional microwave or nuclear
burg, and others contributed to the early de- magnetic resonance spectroscopy. Thus, optical
velopment of optical pumping. Kastler in 1949 pumping methods can be sensitive to only a few
coined the term pompage optique and sug- atoms in the pumping beam when optical de-
gested the combination of optical pumping tection is used.
with radio frequency or microwave fields which A simple optical pumping experiment to
resulted in the fruitful spectroscopic technique produce spin-polarized excited atoms is il-
of Qptical double resonance, pioneered by lustrated in Fig. I(a). There atoms are con-
Brossel and Bitter. The use of optical pumping tained in a transparent cell C which is located in
to produce large numbers of spin-polarized a static magnetic field H which defines the z
ground-state atoms was also suggested by axis of a coordinate system. The atom has
Kastler, who received the Nobel Prize in 1966 angular momentum J g = 0 in the ground
for his work . state g and J e = I in the excited state e, and the
To be effective as an optical pumping source, energy levels are split by the magnetic field
the light must differ from the normal black- as shown in Fig. I(b) . Resonance radiation,
body radiation at the temperature of the mat- for example the 2537 -A line of mercury, is

PF MAGNETIC FI ELD H

Q] L
H

o
MAGNETIC FI ELD H
( 0)
FIG. 1. The production of polarized excited atoms by optical pumping. Radio-
frequency spectroscopy is used for the Doppler-free measurement of excited-state
energy level splitting with fluorescence monitoring of the resonance.
845 OPTICAL PUMPING

isolated from the sources with the filter F, is double resonance experiments which have been
linearly polarized with the polarizer P along the widely used to determine small energy splittings
magnetic field and is used to pump the atom of excited atoms or molecules. It is important
from the ground state to the sublevel of the ex- to note that the rf resonance curve is not af-
cited state with azimuthal quantum number fected by the doppler broadening of the optical
m = O. Angular momentum selection rules pre- lines used to detect the energy splittings.
vent the light from exciting atoms into sub- Closely related to the optical double-resonance
levels with m = ± I . experiment of Fig. I are the methods of level-
The fluorescent radiation has the character- crossing spectroscopy, light-modulation spec-
istic angular distribution and polarization of an troscopy, decoupling spectrosopy, the Hanle
electric dipole antenna, so little radiation is effect, and various other spectroscopic tech-
received by a detector D located in the null niques which may be described by the term
of the antenna pattern. The pattern can be per- perturbed fluorescence spectroscopy, by anal-
turbed by driving transitions from the excited- ogy with the related nuclear field of perturbed
state sublevel with m = 0 to the sublevles with angular correlations.
m = ± 1 with radiofrequency (rf) radiation of The production of ground-state or meta-
sufficient intensity and appropriate frequency. stable-state polarized atoms is illustrated in
The atoms in the sublevels m = ± I radiate Fig. 2. The atom has angular momenta J = ! in
strongly along the direction of the magnetic both the ground state and in the excited state.
field and the rf transition is therefore detected The atom is situated in a magnetic field Hand
as an increase in the intensity of light received is pumped by circularly polarized light which
by the detector. By observing the detected in- carries one unit of spin angular momentum
tensity as a function of the radio frequency , along the magnetic field direction. Atoms in
a resonance curve can be plotted out which the ground state sublevel with azimuthal quan-
determines the splitting of the energy levels. tum number m = -! can absorb light, but
The arrangement in Fig. I is typical of optical atoms in the sublevel with m = +! cannot be-

w
u
z
w
u
C/)
w
a::
0
:::J
-l
lL.

MAGNETIC FIELD H
CP F
z

0]
0
C/)
C/)

~
en
z
<t
a::
~
MAGNETIC FIELD H

>1
:m=1/2
Wrf

(!)
TIt
,I
-1/2

a::
w
z 112
W

MAGNETIC FIELD H

(0) (b)
FIG. 2. The production of polarized ground-state atoms by optical pumping. The ground-
state energy level splittings are measured by radio-frequency spectroscopy with fluorescence
or transmission monitoring of the resonance.
OPTICAL PUMPING 846

cause of angular momentum selection rules. uct of the quantum mechanical amplitude ai for
When the excited atoms decay they may fall one sublevel with the complex conjugate ampli-
back to the sublevel with m = +! where they no tude aj* of the other sublevel, i.e., Pij = (aiaj*)
longer absorb light. The result of repeated ab- where the average is taken over all components
sorption and reemission of light is to pump the (e.g., atoms) in the system. An example of a
atoms into the ground-state sublevel with m = physical quantity described by coherence is
+! where they. no longer attenuate or scatter the transverse component of the spin of an
light. The pumping can be detected by a de- atom. Coherence will ordinarily oscillate at the
crease in the fluorescence recorded by detector difference frequency Wij = (Ei - Ej)/fz of the
Df. In contrast to the situation shown in Fig. energy levels, and light modulated at the dif-
I, where only a small fraction of the atoms ference frequency, or unmodulated light in
are maintained as polarized excited atoms, conjunction with radiofrequency or other ra-
under favorable conditions substantially all of diation tuned to the difference frequency, is
the atoms in the sample can be polarized for often used to generate coherence. Transient
situations like Fig. 2 where ground-state atoms coherence can also be produced whenever
are pumped. However, the pumpup times in there is a sudden change in the optical pump-
Fig. 2 are limited by the light intensity and can ing conditions. Coherence is conveniently de-
be as long as seconds, while the pump up times tected by observing the intensity or polariza-
in Fig. I are equal to the spontaneous decay tion modulation of the transmitted or scattered
times of the excited atoms and are typically light in an optically pumped system.
some tens of nanoseconds. Optical pumping of gallium arsenide is used
Optically pumped atoms lose their polariza- to produce intense sources of spin-polarized
tion by various relaxation mechanisms. The re- electrons. The conduction band of gallium
laxation is slowest for spin-polarized nuclei of arsenide is S-like in character, while the top of
atoms with no electronic angular momentum in the valence band has P3/2 character. Thus con-
their ground states (e.g., He 3 , Xe 129 , Hg199). duction electrons generated by the absorption
Relaxation times ranging from many seconds to of circularly polarized light near the band edge
hours have been observed for such species, and of GaAs are highly polarized in analogy to the
much of the relaxation is due to hyperfine in- polarization of the photon-excited atoms of
teractions of atoms adsorbed on the container Fig. 2. By appropriate treatment of the semi-
walls. For atoms with electronic spin angular conductor surface, the electron affinity of the
momentum but no orbital angular momentum GaAs conduction electrons can be made nega-
(e.g., H, Na, N), relaxation times as long as tive and the spin-polarized, photogenerated
several seconds are observed. The relaxation is electrons can be efficiently extracted from the
due to weak, collisionally induced spin-orbit semiconductor into a vacuum.
couplings in gas-phase collisions, in weakly Atoms polarized by optical pumping (Rb, Cs)
bound van der Waals molecules, or while the are used as precise frequency standards and
atoms are adsorbed on the container walls. magnetometers. Optically pumped mercury and
For atoms with nonzero orbital angular mo- xenon have been used in experimental nuclear
mentum nearly every collision causes substan- magnetic resonance gyroscopes. Optical pump-
tial changes in the polarization, and relaxation ing is used to provide population inversions
rates are comparable to gas kinetic collision for many masers and lasers, and optical pump-
rates. "'- ing by starlight may drive some celestial masers.
Not all slowly relaxing atoms can be polarized Finally, optical pumping and closely related
easily by optical pumping. An important ex- techniques have been an extremely effective
ample is atomic hydrogen. Since the resonance spectroscopic tool for investigating the fine
line of atomic hydrogen is in the vacuum ultra- structures, hyperfine structures, lifetimes, and
violet region of the spectrum, and since the fine other properties of molecules, atoms, ions, and
structure interaction which transfers the optical centers in solids.
polarization from the photon to the spin is
small, it is very difficult to pump hydrogen W. HAPPER
directly. However, hydrogen atoms can be
readily polarized by spin exchange with optically References
pumped rubidium according to the reaction
Bernheim, R. A., "Optical Pumping; an Introduction,"
Rb(t) + H(.j.) -+ Rb(.j.) + H(t) New York, W. A. Benjamin, 1965.
Bloom, A. L., Sci. Am. 203,72 (1960).
Thus, spin exchange can be used to extend the Carver, T. R., Science 141,599 (1963).
usefulness of optical pumping to many species De Zafra, R. Am. J. Phys. 28,646 (1960).
(e.g., H, N, e-) for which direct optical pump' Happer, W., "Progress in Quantum Electronics," Vol.
ing is impractical. 1, p. 51, New York, Pergamon Press, 1970.
Optical pumping can also produce coherence Happer, W., Rev. Mod. Phys. 44,169 (1972).
between the energy sublevels of matter. The co- Happer, W., and Gupta, R., "Progress in Atomic Spec-
herence Pij between a pair of energy levels i and troscopy," Part A, (W. Hanle and H. Klempappen,
j is defined to be the average value of the prod- Eds.), p. 391, New York, Plenum, 1978.
847 OPTICS, GEOMETRICAL

Kastler, A., and Cohen-Tannoudji, C., "Progress in magnitude is a function of the properties of the
Optics," Vol. 5, p. 3, Amsterdam, North-Holland, two media on either side of the refracting sur-
1966. face (see Fig. l(b». The function is
Kastler, A., Opt. Soc. Am. 47,460 (1957).
Pierce, D. T., Meier, F., and Zurcher, P., Phys. Lett. sin 8 1/sin 8 2 =J.1.12
SIA,465 (1975).
Series, G. W., Rept. Prog. Phys. 23,280 (1959). The constant J.1.12 is the index of refraction for
Skrotskii, G. V., and Izyumova, T. G., Soviet Phys.- the two media when light travels from medium
Usp. (English transi.), 4,177 (1961).
1 to medium 2. If the direction of the ray is
reversed, as in Fig. l(c), it traverses exactly the
Cross-references: ATOMIC AND MOLECULAR
same path so that
BEAMS, ATOMIC CLOCKS, ELECTRON SPIN,
sin 8 2/sin 81 =J.1.21, :. J.1.21 = 1/J.1.12
LASER, MAGNETIC RESONANCE, MAGNETOM-
ETRY, MASER, MODULATION, POLARIZED If the first medium is air, it is usual to drop the
LIGHT, SPECTROSCOPY, ZEEMAN AND STARK suffixes and to replace J.1.12 by n. If the first
EFFECTS. medium is a vacuum then J.1.12 is the absolute
refractive index of the second. For air at NTP
the value is 1.00028 so that, for most practical
OPTICS, GEOMETRICAL purposes the distinction between J.1.12 and n,
with air as the first medium, is negligible for
The radiant energy emitted from a point of a substances, such as glass, commonly used for
luminous source situated in a homogeneous light control. The value of n depends on the
medium free from obstacles travels through the wavelen,gth of the light. For sodium yellow of
medium as a spherical wave front whose velocity 5890.6A, the values of n for a number of media
of advance is a characteristic of the medium. A are:
radius of this wave front is a light ray. If a ray
experiences a change of medium or encounters Crown glass 1.517
an obstacle, it will, in general, deviate from the Flint glass 1.650
rectilinear path defined by the radius. The Quartz 1.544
science of geometrical optics is concerned with Fused silica 1.459
controlled deviations. Water at 20°C 1.333
The Law of Reflection (a) The directions of Diamond 2.42
incidence and reflection and of the normal to For a mathematically plane surface, the co-
the reflecting surface are coplanar. efficient of reflection for normal incidence is
(b) The angle of reflection is equal to the
angle of incidence, both angles being taken with dependent on the value of J.1.12. Fresnel's law
gives for this coefficient
respect to the normal (see Fig. 1(a».
The Law of Refraction (a) The directions of P = [(J.1.12 - I)/(J.1.12 + 1)]2
incidence and refraction and of the normal to the
refracting surface are coplanar. Clean polished surfaces give results largely in
(b) The ratio of the sines of the angles of inci- accord with Fresnel's law. Films of grease or
dence and refraction, both angles being taken slight roughness appreciably reduce the value
with respect to the normal, is a constant whose of p.

NORMAL NORMAL NORMAL

MEDIUM I
~I

1'2

INCIDENT
RAY

(a) REFLECTION (b) REFRACTION (c) REFRACTION RAY TRAVELLING


REFLECTION AND REFRACTION FROM DENSE TO
AT PLANE SURFACES LESS DENSE MEDIUM
FIG. 1. Reflection and refraction at plane surfaces.
OPTICS, GEOMETRICAL 848

LIGHT LIGHT
MEDIUM MEDIUM
4

DENSE
MEDIUM

2 3 4

(0) (b)
FIG. 2. IDustrating total reflections.

Total Reflection Figure 2(a) shows a series of By reversing the directions of the rays, we see
rays passing from air into a denser medium. that an angle of incidence 8e is the maximum
Ray No. I experiences no deviation. The others possible angle for emergence into the air. For
experience progressively increasing deviations angles > 8e there is internal reflection, as in
until, with the tangential ray, No.4, the angle Fig. 2(b), governed by the law of reflection.
of refraction 8e is the greatest possible for that The angle 8e is called the critical angle for the
particular medium. Since the angle of incidence particular combination of media. Figure 3 shows
is 90° a number of practical applications of total
l/sin 8e = n, :.sin 8e = lIn reflection.

(a) (b) (e)

(d)
FIG. 3. (a), (b) and (c) show three methods of using the 45° prism. (d) is a constant deviation prism as used
in spectrometers.
849 OPTICS, GEOMETRICAL

;i
~ NORMALS
B' B o
~"'''''' ........ ~' z
A
I
,,,
i
i
, i
,1 LIGHT MEDIUM
I
!

,J
I. ............ ~ .. D· D
C' c
IMAGE OBJECT

(a) (b) (e)

FIG. 4. Image formation by plane surfaces: (a) point object, (b) extended object, (e) image by diffraction of a
point object.

Image Formation The geometry of Fig. 4(a)


shows that the image in a plane mirror of a point
in front of it is another point which lies behind
the mirror along the normal from the object LIGHT
point, and is as far behind the mirror as the MEDIUM

object is in front. The image of an extended


object is identical in size and shape but experi-
DENSE
ences lateral inversion (see Fig. 4(b)). The image MEDIUM
is virtual in the sense that it cannot be received
on a screen.
The geometry of Fig. 4(c) shows that the image
of a point in a dense medium is another point
situated on the same normal to the surface and
p'
at a distance from the surface of lin of the dis-
tance of the object point. For large viewing
angles, this relationship does not apply, the rays
forming the envelope of a caustic as in Fig. 5(a).
p
If the object is extended, a straight line for ex-
ample, the image is curved and concave towards FIG.5(a). Formation of caustic by refraction. P,
the surface as in Fig. 5(b). point-object. pI, position of point-image for smaIl
Refraction by Prism Figure (6a) shows the angles of incidence.
general case, the incident angle 8 1 being different
from the emergent angle 8 1 I. The deviation D is
given by
D=8 1 +8 1 ' -A
Figure 6(b) shows the conditions for minimum
deviation. 8 1 = 8 2 = 8, say. The deviation is

I
now given by

A +Dmin A
n=sin 2 sin 2'
Thus for a 60° prism for which n = 1.517
A + Dmin
sin =nsinAI2= 1.517 X 0.5 = 0.7585
2
FIG.5(b) Image by refraction of an extended
object.
OPTICS, GEOMETRICAL 850

(0) ( b)
FIG. 6. Deviation by prism: (a) General case, (b) condition for minimum deviation.

A "thin" prism is one for which A:t> 15°, so Reflection at Curved Surfaces The element
that O(radians) ~ sin O. This gives of area round a point on the surface is regarded
as part of the tangent plane at that point. First
Dmin ~ (n - I)A consider a spherical surface, Figs. (7a) and (b),
and let the aperture be small compared with the
Only if the incident ray is monochromatic will radius of curvature. In each Figure, 0 is the
the emergent ray be monochromatic. If the in- center of curvature, OP is a radius, and 0' is the
cident ray is heterochromatic, then the various "pole" of the mirror. A ray parallel to the axis
components in the emergent beam will have is reflected to a point F, the focus. In Fig. 7(a),
different deviations according to their wave- F is in front of the mirror; in Fig. 7(b) it is
lengths. Deviation without dispersion can be
achieved by using a compound prism. The main
prism, of apex angle A, is of crown glass of
refractive index n, and the correcting prism is NORMAL
of flint-glass, of apex angle A I and refractive ............
index n'. Denote the light of minimum wave-
length by the suffix b and that of maximum
wavelength by r, then for the main prism
~~---t----~------X
Db - Dr = (nb - nr)A,

~
and for the correcting prism
Db - D; = (nb - n;)A' .
For zero dispersion we must have
(a)
(Db - Dr) = (Db - D;),
(nb - nr)A = (nb - n;)A ' ,
therefore
A nb - n;
A' nb - nr
If the deviation is to be zero, D =D'. Now ~--~--~~--------------x

D - D' = (n - l)A - (n ' - l)A I,


(n- I)A =(n ' - I)A '
since D - D' = 0; therefore
(b)
A n'-
FIG. 7. Reflection at spherical mirrors. (a) Concave,
A' n- I (b) convex.
851 OPTICS. GEOMETRICAL

B B

A o A

( b)
FIG. S. Image formation in spherical mirrors. (a) Concave, (b) convex.

Concave Mirror Convex Mirror

u and r negative u negative and r positive


u=-S,r=-6,[=-3 u=-S , r=+6,[=+3

:. ~ = -~ - (-~) = - 4~S :. ~ =~ - (- ~) =~~


u =-4 .S (I = +2.1Sm

behind, so that the reflected ray appears to virtual. An image is real when the reflected rays
come from F. For small apertures O'F ~ PF actually pass through points in it ; it is virtual if
and the position of F is independent of 0 within they have to be traced back, so that they only
this limitation. This gives for O'F, the focal appear to come from the image.
length f = r/2. Conjugate Foci In Fig. 9(a) a point object A
Image Formation For an object of small di- produces a point image A'. If A and A' are inter-
mensions, the image can be obtained by drawing changed, the rays take identical paths but the
two rays only: (a) a ray through O-this strikes arrows are reversed. A and A' are called conju-
the mirror normally and is returned along the gate foci. In Fig. 9(b) the image A' in the con-
same path; (b) a ray parallel to the axis-this is vex mirror is virtual . The relationship between
reflected through the focal point F . the angles is as follows:-
In Fig. 8, AB is the object and A'B' the image.
In Fig. 8(a) the image is real, in Fig. 8(b) it is 'Y = (J + {3, {3 = (J + a; :. a + 'Y = 2{3

A o

~------------u----------~

(0) ( b)
FIG. 9. Conjugate foci.
OPTICS, GEOMETRICAL 852

For small apertures Refraction at Spherical Surfaces The geom-


etry of Fig. II(a) gives for a surface of small
a =PO'/O'A, ~ =PO'/O'O, "( =PO'/O'A' aperture
I I 2 I n- I n
'-+-=-=-
.. v
u r f
--=-- -
r v u
This is purely quantitative relationship. For If we put v =00, so that the rays are parallel after
general application it is necessary to ad6pt a refraction and put fl for the corresponding
convention regarding signs. Distances are always value of u
measured from the mirror, and the direction of
the incident light is reckoned positive. As an n- I
example let u = 8 and r =6. (See calculations r fl
under Fig. 8.)
The magnification If we put u = 00, so that the incident rays are
parallel, the corresponding value of v (call it
A'B' v f2) is given by
m=--=-
AB u
n-I n n I
This is positive if v and u have the same sign as -- = +- :. - + - = 0
with the real inverted image of a real object, r f2' i2 fl
Fig. 8(a). It is negative if u and v are of opposite For an object AB, Fig. II(b), we obtain the
sign as with the virtual erect image of a real position of the image by drawing two rays from
object, Fig. 8(b). B; the first parallel to the axis, thereby passing
The axial magnification of an object of finite through F 2 after refraction; the second passing
axial dimension is given by differentiating the through F 1, there by becoming parallel to the
above equation. axis after refraction.
dv du dv v2 fl
----=0 m=--
v2 u2 '
u - fl
This shows that if an object is moved towards the Lenses Figure 12 shows three media separated
mirror a distance du, the image will move by spherical surfaces. Considering each surface
dv = - v2 du/u 2 • Thus the image will move more in turn
rapidly when u is small.
Parabolic Mirror For large apertures for which
the angle (3 is not small, the rays of a parallel
axial beam are not all brought to a focus at F,
but form the envelope of a cusp, as in Fig. 10(a).
A parabolic mirror brings all the rays of such a . n2 nl n2 - nl
-- -=---
beam to the same focus. [see Fig. 10(b)]. w u

(b)
FIG. 10. Reflection by mirrors of wide aperture. (a) Spherical, (b) parabolic.
853 OPTICS, GEOMETRICAL

NORMAL

DENSE
MEDIUM A

(a)

DENSE
MEDIUM B
A' F2
0' FI A

B'

~----~----~~----fl------~

~-------------y----------------~-----------u--------------~
(b)
FIG. 11. Refraction at spherical surface of separation between two media.

with air on either side of the central section,


nl = n3 = 1, and we can put n2 = n.
A

If we put u =00 then v =[, the focal length,


FIG. 12. Lens.
I (I I) *
:. -=(u- 1) - - -
[ rl r2
rl r2
; [=---'0......=...._-
(n - 1)(r2 - rd
Similarly
La t era I magm·f··
Icatlon m = -v or ----
[
n3 /;22
--~=-----
n3 - n2 u u- [
v w
A XI.a1 magm·fIcatlon
·· dv = -
--
adding, du u2

n3 _ ~ = n3 - n2 + n2 - nl *Some authors give this equation with a + sign in the


second bracket. In such a case it is necessary to use
v u r2 rl
different signs for convex and concave surfaces, a con-
fusion which is avoided by the simple convention
In the case of a lens, n3 = nl , giving adopted in this article. .
OPTICS, GEOMETRICAL 854

DOUBLE CONCAVE CONVEXO-CONCAVE

--
DOUBLE CONVEX CONCAVO-CONVEX

+- +-

15 X (-10) -15 X (-10) -15 X (+10) 15 X 10


f= 0.6(-10 -15) f= 0.6(-10 + 15) f= 0.6(10 -(-15» f= 0.6(10 - 15)
=+10 = +50 = -10 = -15

With the convex lenses the focus is on the far thin lens, while increasing divergence brings the
side of the lens from the source; with the con- focus nearer to the lens. Figure 13(c) shows
cave lenses it is on the same side as the source. chromatic aberration-the focal length, even for
With plano-convex and plano-concave lenses, a thin lens, being a function of the wavelength.
l/r for the plane surface is zero. This is because n varies with the wavelength and
Defects in Mirrors and Lenses An aperture increases as the wavelength decreases. Thus the
which is large compared with the focal length focal length for violet is less than for red.
introduces errors in both mirrors and lenses. Curvature of the Field The image of a plane
One of these is the axial spread of the focus as object lies, in general, on a slightly curved field
shown in Fig. 10(a). The amount of spread for (Fig. 14), and this combines with the effects of
a mirror of aperture 28 is astigmatism. Pincushion distortion results when
a lens is used as a magnifying glass; barrel dis-
tortion results when an object is viewed through
a lens at some distance from the eye.
Correction of Spherical Aberration (I) Spher-
Astigmatism results when the object is situated ical mirror. The distortion due to a spherical
off the axis. The reflected pencil is brought to a concave mirror can be corrected by imposing
focus at a point I, also off the axis, and is spread a similar distortion, but in the opposite direc-
into an axial line Ql Qt' as in Fig. 13(a). If the tion, on the incident light. An elegant solution,
mirror is turned througn a small angle, the point used in the Schmidt telescope, is to place at the
I describes a line called the first focal line. center of curvature of a mirror-and therefore
Ql Q2 is the second focal line and is perpendic- between the incident light and the mirror-
ular to the other. In between there is a region a transparent plate having one surface aspher-
where the reflected pencil is the nearest ap- ica!. The contours of this plate are such that the
proach to a circle; this is called the circle of beam transmitted by the plate is distorted from
least confusion. true parallelism by the required amount. The
Figure 13(b) shows longitudinal spherical paths of the extreme rays are indicated in Fig.
aberration in a lens; rays very close to the axis 15. The cone of rays reflected by the mirror is,
pass through the focal point as defined for a by this device without distortion; thereby form-

WH ITE

(0) (b) (e)

FIG. 13. Image defects. (a) Astigmatism, (b) longitudinal spherical aberration, (c) chromatic aberration.
855 OPTICS, GEOMETRICAL

(a) (b) (e)


FIG. 14. Image distortion. (a) Object, (b) pincushion distortion, (c) barrel distortion.

(2) Thick Lens. Close to the axis the two


opposite faces of the lens have only a small
inclination to one another, whereas away from
the axis this inclination is greater. This increase
with distance from the axis brings the trans-
mitted rays to a focus progressively nearer
Aspherical plate
to the lens, as shown in Fig. 13(b). With large
diameter lenses the aberration is corrected by
means of a stepped construction in which the
lens consists of a series of concentric zones.
Figure 16(a) shows a single stepped lens
and also a double lens in which the second com-
ponent is inverted. The double lens allows a
much greater angle of collection. Cheap lenses
can be of moulded glass, but where precision is
FIG. 15. Ray diagram for Schmidt Telescope. essential the surfaces must be polished and the
cost therefore high. Figure 16(b) shows what is
probably the supreme example of light control
ing a very sharp image on the photographic by refraction and internal reflection; it is a sche-
plate P. matic drawing of a lighthouse lens system. In
In conventional reflecting telescopes of the this case every prism has to be optically perfect,
Newtonian kind, uncorrected aberration limits and also the refracting surface of each dioptic
the permissible inclination of rays to axis to a (stepped) lens. For this reason the various com-
few degrees. In the Schmidt telescope the ab- ponents have to be separate, as indicated in the
sence of aberration means that the angular di- figure.
ameter of the effective field is proportional to Systems of Lenses The inverse of the focal
the ratio of the aperature to the focal length of length of a lens is called the power of the lens.
the mirror. If the unit of length is the centimeter, the unit

FIG. 16(a). Stepped or dioptic lens.


OPTICS, GEOMETRICAL 856

For a minimum longitudinal spherical aberra-


tion, the distance d should be

d=f, - h
For minimum spherical aberratior"
f, - h = d.
For minimum achromatism,
fl + f2 = 2d.
Com bining these two gives
h = ~d and h = t d .
These are the conditions for the Huygens eye-
piece. In this eyepiece the combination is
equivalent to a single positive lens. (A single
negative lens can also be used as an eyepiece.)
Consider the case of the compound microscope.
Figure 17(a) gives the ray diagram for a positive
eyepiece. The objective forms an image of the
object behind the eyepiece, that is, within the
instrument tube, and the rays are brought to a
focus at what is called the Ramsden circle out-
side the eyepiece. The magnified image, as seen
by the observer is inverted. Figure 17(b) shows
that the Ramsden circle is between the eyepiece
and the objective and that the image is erect.
FIG . 16(b). Schematic drawing of a lighthouse For the same magnifying power the working
lens. distance, that is, the distance between object
and objective, is larger with the negative eye-
piece. Its disadvantage is that the field of view
is smaller so that a smaller portion of the ob-
of power is the diopter: ject is visible. In order to ensure the largest
possible field of view for given powers of the
1 objective-eyepiece combination the Ramsden
D=- circle must exist between the eyepiece and the
f
eye. The Ramsden eyepiece as actually manu-
If a number of thin lenses are in contact, the factured is a combination of two plano-convex
power of the combination is the algebraic sum lenses of equal focal lengths. Their distance
of the individual powers. As an example, con- apart is decided by the requirements and can
sider two thin lenses for which f, = 10 and vary between the focal length of a single lens
h =- 15 cm. and two-thirds of this distance. According to
the previously determined relationships for two-
1
lens combinations a separation equal to the
D
,
= -10 = 0.1 diopter'
'
focal length of a single lens would require that
one of the lenses would be at the first principal
focus. As this is where the image is formed the
1 . separation has to be reduced in order that the
D2 = - - = - 0.0667 dIOpter focal plane may be removed from the lens
15
surface. The ray diagram for this eyepiece is
D = 0.1 - 0.0667 =+ 0.0333 given in Figure 17(c). If the eyepiece is used as
a micrometer, the cross-wires are placed between
:.f= + 30 cm the two lenses and indicated in the diagram by
AB. The image A'B' is between the lenses and
If two coaxial lenses are separated a distance is upright as with a single double-concave lens.
d, then the focal length of the combination is The shortening of the lens distance introduces
given by a small decrease in achromatism, but this can
be remedied by the use of a combination lens
1 lid for the eye lens.
-=-+ - - - Thick Lens Figure 18(a) shows a thick con-
f f, f2 fd2 verging lens receiving a parallel beam of light.
857 OPTICS, GEOMETRICAL

Eye lens

FIG. 17(a). Compound microscope with positive eye lens.

I
I
I
I
I
I
I
I
I
I
+-I
I
I
I
I
I (~\

I '-/
Ramsden circle
I
I
I
FIG. 17(b). Compound microscope with negative eye lens.
OPTICS, GEOMETRICAL 858

FIG. 17(c). Ramsden eyepeice.

The axial ray 1 passes through the lens without as though the deviation of the rays leaving point
deviation. Any other ray, such as 2, on emerging F all takes place at the plane P; also the deviation
in tersects the axis at some point F'. If we extend of the emergen t rays appears to take place at the
the incident ray forward and the emergent ray plane pl. Image formation is therefore in
backwards, as shown by dotted lines, they will accordance with Fig. 19.
intersect at some point P'. The point F' is called From the geometry of the figure we see that
the second focal point and the plane through
P' to which the axis is perpendicular is called
the second principal plane.
Figure 18(b) shows a system of rays startmg
.
ff' = xx' ... m = - -xf x'
= - -f'
from a point F such that the emergent beam is
parallel. Then point F is the first focal point
and the plane through P, determined as before, also
the first principal plane.
The solution of the thick lens is therefore
dependent on the following: The lens behaves

(a) (b)

FIG. 18. Thick lens: principal focal points and principal planes.

First Second
Principal Principal
Plane Plane
B
~-------t - - - - - -
Object

FIG. 19. Thick lens: image formation.


859 OPTICS, GEOMETRICAL, ADVANCED

First Second S
Focal Focal
Plane Plane

p --<::o----------:-:+-e+-------------~- Q
MeN
F F'

FIG. 21.

~~---d------~

t-----f~ l-f'·---t CS2/2PC + CS2/2QC = (n - I)MN

FIG. 20. 1 I 1 1
- +- = - + - = 2(n - I )MN.
PC QC u v

The focal length of a thick lens is reckoned Considering the points P and Q as the centers
from the second focal plane, namely, the dis- of curvature'l and'2 of the two lens faces,
tance OF' in Fig. 20. In terms of the radii it is
given by I
+ -- = 2MN/CS2 ,
'2
f'=(J.L-I)(~- ~+J.L-I~). giving
'1'2 J.L '1'2

It differs from the expression for the focal length ~ + ~ = (n - 1)(~ + ~).
of a thin lens by the term u v \'1'2
H. COTTON

References
and this, of course, is zero when d = O.
For light traveling from right to left and for Curry, G., "Geometrical Optics," London, Arnold,
which a parallel beam is brought to a focus at F 1953.
Jenkins, F. A., and White, H. E., "Fundamentals of
Optics," New York, McGraw-Hill Book Co., 1957.
f=(J.L-I)(~- ~+J.L-I~) Kingslake, R., Ed., "Applied Optics and Optical Engi-
'2'1 J.L '1'2 neering," Vol. 1, New York, Academic Press, 1965-
1969.
f=- f'. Kline, M., and Kay, E. W., "Electromagnetic Theory
and Geometrical Optics," New York, Interscience
Optical Distance The optical distance be- Publishers, 1965.
tween two points is the minimum possible Longhurst, R. S., "Geometrical and Physical Optics,"
time required for light to travel from one point London, Longmans, 1967.
to the other. Since a ray of light is the normal Martin, L. c., "Geometrical Optics," London, Pitman,
to an element of wave front, this concept in- 1955.
volves velocity. In a homogeneous medium the Smith, F. G., and Thomson, J. H., "Optics," London,
optical distance between two points separated John Wiley & Sons, 1971.
by a distance Q is Qn/c where c is the velocity Welford, W. T., "Geometrical Optics: Optical Instru-
of light in vacuo. Since, in all problems we are mentation," Amsterdam, North-Holland, 1962.
concerned with the paths between two given
points we can give to c the arbitrary value of Cross-references: ABERRATIONS;
LENS; LIGHT;
unity thus giving Qn for the optical path. As OPTICAL INSTRUMENTS; OPTICS, GEOMETRI-
an example consider again the equation for a CAL, ADVANCED; OPTICS, PHYSICAL; REFLEC-
thin lens (Fig. 21). For the axial ray we have TION; REFRACTION. .

PS + SQ = PC + CQ + (n - I) MN OPTICS, GEOMETRICAL, ADVANCED


PS = (PC2 + CS2) =PC + CS2/2PC Geometrical optics deals with light regarded as
rays, an approximation which is valid when the
the approximation being relevant to a thin lens. wavelength of the light is much smaller than the
This gives dimensions of the optical system. (Otherwise,
OPTICS, GEOMETRICAL, ADVANCED 860

diffraction effects are significant; see FOURIER gives


OPTICS and DIFFRACTION BY MATTER AND
DIFFRACTION GRATINGS.) The other physical
principle needed to describe optical systems is (7)
Snell's law, in the form
(1) where the 2 X 2 matrix

where n 1, n2 are the indices of refraction of


two media separated by a plane interface, and
8 1 , 8 2 are the incident and refracted angles,
R1 = G-:J (8)

respectively. (See OPTICS, GEOMETRICAL for a is the refraction matrix for surface 1.
more extended treatment.) Next, consider the ray P 1 P2 • Its position at
Paraxial Matrix Optics An extremely inge- P 2 is
nious way of simplifying the mathematics as-
sociated with ray tracing is the matrix method X2 =Xl'+tl tanO:l'-x1'+tl'O:l' (9)
proposed by Sampson (1913) and Smith (1945),
and extended by Brouwer (1964). Figure 1 and combining this with the identity 0:2 =0:1'
shows a ray leaving an arbitrary object point P gives
and striking the first surface of a typical lens
at Pl' If the angles 0:1, 0:1', 8 1 , and 8 1' shown
are all small (0.1 radian or 5° or less), then
Snell's law (1) in paraxial form (this term means
"close to the axis") becomes where
n181 =n282. (2)
T21 = ( ,I , 0)1
tl In1
(11)
The figure shows that this is
n1(0:1 +</»=n1'(0:1'+</». (3) is the translation matrix for the passage from
surface 1 to surface 2.
But the paraxial approximation also leads to The matrices of (8) and (11) are all that are
needed, for the refraction at surface 2 can be
(4) expressed by a matrix R 2 , the translation to an
image plane or a second lens by T 32 , etc. In
Define the refracting power of the surface with addition, the left-hand side of (7) can be sub-
vertex VIas stituted into the right-hand side of (10) and the
k1 = (n1' - ndlr1. (5) combined translation-refraction process is speci-
fied as a matrix product. Thus, we have a sim-
Combining (3), (4), and (5) with the identity ple, virtually automatic procedure for tracing
paraxial rays through an optical system of
arbitrary complexity. The constants of the

o~~~~---L~~~~~--------+----+~----~~----Z
p C

FIG. 1. Refraction matrix.


861 OPTICS, GEOMETRICAL, ADVANCED

system which must be specified are the indices where T I ' is the actual length of the ray PIP2.
ni, n/ and the radius ri of each refracting sur- We note that when the paraxial approximation
face (i = I, 2, ... ) and the axial distance t/ is valid-Le., sin x = x -Eqs. (12), (13), and (14)
between each pair of surfaces. In addition, it simplify to (5), (7), and (10), respectively, as
is a straightforward matter to derive by matrix they should do.
multiplication the formulas quoted in GEO- One complication associated with these me-
METRICAL OPTICS, and many others which ridional matrices is that T I', unlike t I', is a
are rather difficult when using ordinary algebra; variable; it may readily be determined from the
examples and a more extended discussion of geometry in the form of an auxiliary formula.
sign conventions will be found in Nussbaum We shall refer to a more general form of this
and Phillips (1976). formula in a later section. In addition, the angle
Meridional Matrices Figure 1 incorporates o I requires an auxiliary formula and 0 I' then
the usual convention that the symmetry axis of follows from Snell's law. Another complication
the optical system is taken as OZ. A plane con- is the fact that the matrices are no longer con-
taining this axis is said to be meridional, and stants. All these difficulties are surmounted by
we consider the problem of ray tracing in such using numerical, rather than algebraic calcula-
a plane with the paraxial restriction removed. tions, and the procedure outlined here is easily
It is not difficult to show [again see Nussbaum programmed on hand-held calculators and
and Phillips (1976)1 that the matrix approach computers.
outlined above is preserved, provided some The consequences of the removal of the par-
moderately obvious changes are made. These axial approximation for meridional rays is shown
involve (1) defining a skew power K for surface in Fig. 2. The rays close to the axis meet at the
1 as paraxial focus F', those farther out come to-
gether at an intermediate focus FI, and the rays
Kl =(nl'cosOl'-nl cosOd/rl (12) passing through the edge of the lens (the margin)
meet at the marginal focus FM. The situation of
where 0 I and 0 I' are the Snell's law angles Fig. 2 is what the matrix equations would reveal
before and after refraction, respectively; (2) for a double convex lens; the existence of a set
writing (7) in the form

e :~ al')
of focal points and the resulting blurry image is
called a third-order aberration, where the desig-
l
' = (~ -:, (nl s:~ al } nation refers to the fact that the paraxial ap-
proximation sin x = x implies that we had cut
off the Taylor series for the sine function at the
(13) first-order term and now we are including third-
order terms. [There are also fifth-order and
and (3) writing (10) as higher aberrations which are primarily of inter-

~) e :i~,
l
' al }
est to lens designers; see Cox (1964).1 The sig-
nificant feature of Fig. 2 is that it indicates that
the blurred image is a consequence of the failure
of marginal meridional rays to conform to the
(14) paraxial approximation; this failure is called

F,

FIG. 2.
OPTICS, GEOMETRICAL, ADVANCED 862

spherical aberration. If the object at infinity


producing the parallel rays of light is replaced
by a point-object on the axis, we still find a
similar aberration situation on the image side;
hence, spherical aberration is one of three third-
order point aberrations; the other two will be
introduced immediately below.
Skew Matrices We now remove the meridio-
nal restriction and introduce the possibility of
skew rays-those not confined to the plane of
the paper in Fig. 2-leaving an object. We specify
such rays in terms of their point of origin (a)
(x, y, z) and their direction cosines 'Y, 8, e with
respect to the axes OXYZ. This implies that the
single equations (13) and (14) must each be
generalized to a three-dimensional set of the
form

with equations in 8, y and e, z, and

(n2'Y2\ =( ,1 , 0) (n,,~,,) (16)


\ X2 J T, In, 1 x,
again with two similar equations.
Using the auxiliary formulas and numerical (b)
approach mentioned above, we are now in a FIG. 3.
position to trace skew rays. Consider the two
rays of Fig. 3, generating an intermediate focus.
Then incline the rays (Pig. 3b), shifting the eral produce the same results as the meridional
focus as shown and rotate the rays through an equations (13)' and (14), the location of the
angle of 180 0 about a parallel axis halfway be- focal point is no longer FI'. What happens, in
tween them. All the new rays produced in this fact, is that it must move in a closed path on
fashion are skew, and since the two sets of three the image, returnincf to its original position at
matrix equations (15) and (16) will not in gen- the end of the 180 rotation, since the rays are

x x

Paraxial
focal
plane

FIG. 4.
863 OPTICS, GEOMETRICAL, ADVANCED

once again meridional. This path would be


expected to be a circle in the image plane, since
the rays generate such a path on the front of
the lens, and this is indicated as circle 3 in Fig.
4. A second 180 0 rotation gives a second circle
coincident with the first, and closer pairs of
rays give the smaller circles 2 and 1. This blur-
ring of the image due to failure of the skew rays
to match the behavior of the meridional rays is
called coma, the name coming from the comet-
like shape of the image in Fig. 4. Such a shape
is seldom observed, because we have assumed
that circles are imaged perfectly. For a lens
which is not well-corrected, the double circles
in the image plane will be asymmetrically ar- la)
ranged, as shown in Fig. S, and the resulting
pattern will be a cardiod [Fig. S(b)].
Coma can be associated only with off-axis
object points, since all rays from an axial point
are automatically meridional. Let us assume
that a lens system has been corrected both for
spherical aberration and for coma. Since these
are due to essentially different causes, we have
no guarantee that the image point of the merid-
ional and of the skew rays is identical and this
failure to agree is then the third point aberra-
tion, astigmatism. Figure 6 shows a meridional
or tangential fan of rays leaving an off-axis
object point P and forming a sharp image at
PT'. Another set, the sagittal fan, is at right
angles, and all of these rays (except for one
in the vertical plane) have to be skew. The
sharp image is at Ps', and the tangential fan at Ib)
this point images as a line. In between the two
lines shown, we have the ellipses and circle of FIG. 5.

FIG. 6.
OPTICS, GEOMETRICAL, ADVANCED 864

rr
I-
I
--
1-
FIG. 7. I I

(a) (b)

FIG. 10. Barrel (a) and pincushion (b) aberrations.

Finally, we point out that the matrix equa-


tions as outlined above are applicable only to
spherical lenses (or mirrors), with plane surfaces
as a special case. This treatment, however, can
be readily extended to aspheric surfaces (hyper-
boloids, paraboloids, and ellipsoids, with spheres
as a special case) by a straightforward general-
(a) (b)
ization of the auxiliary relations [Nussbaum
FIG. 8. Astigmatism: (a) object; (b) image. (1979)] . A brief computer program will handle
the numerical aspect of ray tracing in such
systems.
Fig. 7, so that again we have blurring. Note ALLEN NUSSBAUM
that a bundle of arrows [Fig. 8(a)] would have
images going from sharp to blurry, as shown in References
Fig. 8(b); this also explains the use of the term
sagittal, from the Latin sagitta, arrow. This 90° Sampson, R. A., Phil. Trans. Roy. Soc. Lond. A212,
transition explains why cylindrical corrections 149 (1913).
are used for astigmatism of the eye; after the Smith, T.,Proc. Phys. Soc. Lond. 57,286 (1945).
original prescription is determined, this defect Brouwer, W., "Matrix Methods in Optical Instrument
requires further correction in one plane only. Design," Menlo Park, CA, W. A. Benjamin, 1964.
The last two aberrations are associated with Nussbaum, A., and Phillips, R. A., "Contemporary
objects of finite extent. If we correct for astig- Optics for Scientists and Engineers," Englewood
matism by having the tangential and sagittal Cliffs, NJ, Prentice-Hall, 1976.
focal surfaces merge to form a common Petzval Cox, A., "A System of Optical Design," Focal Press,
surface (Fig. 9), this image surface will in gen- 1964.
eral be curved, since the amount of astigmatism Nussbaum, A.,Am. J. Phys. 47,351 (1979).
can depend on the distance of the object point
from the axis. The resulting aberration, associ- Cross-references: DIFFRACTION BY MATTER AND
ated with the OZ axis, is called Petzval curva- DIFFRACTION GRATINGS; FOURIER OPTICS;
ture or curvature of field. A similar effect can LENS; MATRICES; OPTICAL INSTRUMENTS; OP-
occur f?r distances measured in the XOY plane, TICS, GEOMETRICAL; OPTICS, PHYSICAL.
producmg the barrel or pincushion distortion of
Fig. 10. In addition to, and independent of
these five geometric aberrations, the fact that OPTICS, PHYSICAL
the index n is a function of wavelength leads to
chromatic aberrations, which are discussed in Physical optics deals with those phenomena
the article OPTICS, GEOMETRICAL. that are described using the wave nature oflight.
These are interference, diffraction and polariza-
tion. Of these, interference is the most impor-
tant; interference phenomena require the wave
Tangential focal
concept for their explanation. Diffraction and
Sagittal focal surface polarization can, at least in principle, be dis-
V Petzval surface cussed using the particle nature of light.
\ The unique property of a wave is that it is in
two or more places at the same time (a particle,
on the other hand, is well localized). A light
wave has a phase and can either have a positive
or negative value. A wave can be divided into
two or more parts.
The principle of superposition states that
FIG. 9. several individual waves at any location can be
865 OPTICS, PHYSICAL

screen. Yet if we place a card in either arm of


M2 (movable) the interferometer, blocking one beam, we will
see one half of the initial amplitude. With no
card (i.e., using both beams), we get zero be-
1 cause one beam is 1800 out of phase with the
other. This is the essence of interference. To
'2 E2
explain the phenomenon, we have to invoke the
M1 (fixed)

j Beam splitter
wave nature of light.
In the preceding analysis we assumed that the
Eo E1 source was perfectly monochromatic. When
Incident
Wave Michelson used his interferometer to examine
real sources, he found a very interesting phe-
1~---'1---
nomenon. As the path difference between the
two arms of the interferometer was increased,
the visibility of the fringes decreased and finally
To screen disappeared. A simple interpretation of this
FIG. 1. Michelson interferometer. * result is that light from an ordinary spectral
lamp consists of wave trains of finite length.
When the path difference is less than the length
of the wave train, part of the train coming from
superimposed to determine the resultant. Be- the other arm, interference occurs [Fig. 2(a) J.
cause the phase of the wave is important, lead- When the path difference is greater than the
ing to negative as well as positive amplitudes, length of the wave train [Fig. 2(b)J, the light
the summation of several waves can be zero. from one arm is added to the light from the
Interference There are two broad categories other arm which consists of a different wave
of interference: division of amplitude and divi- train. Since the phase relation between two
sion of wavefront. The Michelson interferome- trains is a random variable (the trains come
ter, Fig. I, illustrates division of amplitude from independent atoms) and since many such
and two-beam interference. The incident wave superpositions take place in an observation time,
with amplitude Eo strikes a glass plate at no interference will be observed, and the inten-
45 0 • The back side of the plate is lightly coated sity will be a constant. The length of the wave
with silver so that it serves as a beam splitter; train is related to the temporal coherence length
part of the wave is transmitted to mirror M I of the light. Ordinary spectral-line sources have
and part is reflected to mirror M 2 . The coat- coherence lengths on the order of millimeters,
ing on the beam splitter is made with a density whereas lasers have coherence lengths on the
that will give the two parts of the beam equal order of kilometers.
amplitudes. An alternative but equally correct explanation
After reflection by mirrors M I and M 2, the of the disappearance of fringes can be made if
components are recombined at the beam splitter we consider the spectral distribution of the
where, on average, half the energy is reflected source. If more than one wavelength is emitted,
onto a screen and the other half is transmitted each will form its own interference pattern. At
back toward the source. The path difference A zero path difference, all wavelengths will form
for the two waves, which may be varied by a maximum and the fringes will be very sharp,
changing the position of M 2, is given by A = but at large path differences the fringe maxima
2(11 - 12 ), where II and 12 are the lengths of for the different wavelengths will not coincide
each arm of the interferometer. The phase dif- and the fringes will vanish. Since one may
ference 8 is 21TAj'A, where A is the wavelength. Fourier analyze a wave train and obtain its
For a monochromatic wave the total field at spectral distribution, it turns out that these
the output is: two explanations are equivalent.
After observing the loss in visibility of fringes
E T =E I +E2 with increasing path difference, Michelson went
on to explain this quantitatively, and he ob-
= 0.5 Eo [1 + cos 8 J
= 0.5Eo [1 + cos 21TA/AJ.
Here we see an important property of inter-
ference; when 8 = n1T (where n = 1, 3, 5, ... ) EI '\I\f\fW '\I\f\fW
the two axial components will combine in such

'\I\f\fW '\I\f\fW
a manner that there is zero amplitude on the
E2
*Figures 1-5 and 7-10 adapted from Contemporary
Optics for Scientists and Engineers, by A. Nursbaum (a) (b)
and R. Phillips, © Prentice-Hall, 1976. Reprinted with
permission of the publisher. FIG. 2.
OPTICS. PHYSICAL 866

tained an extremely important result which we Eo

\\t\l\
shall consider next. Let the incoming wave
contain a number of frequencies, rather than
being monochromatic.
Most optical detectors including the eye,
photocells, and photoconductors detect energy,
which is porportional to the square of the am-
plitude, i.e., E2 . In optics, this is also designated
intensity, I. The total intensity for a polychro-
matic wave or group of waves, then, is given by
a summation over frequency. When we compute
intensity by squaring an amplitude, which itself
involves a summation, the cross-product terms
are converted into functions such as cos [(W1 -
W2) t] and these average to zero over one or FIG. 3. Interference in thin films.
several cycles. This will be the case here for all
the products of the form E1 (wdE 2 (wz), since
any physically realistic observation time will hand, in Fourier transform spectroscopy, only
extend over many cycles. The total intensity one-half of the light, on average, is discarded.
IT will then be This gives a better signal-to-noise ratio for a
given detector. This technique is widely used
for high-resolution analysis of weak sources,
particularly in the infrared region.
Thin films illustrate multiple beam interfer-
We assume that the distribution of frequen- ence. (See Fig. 3.) Consider reflection from two
cies is continuous rather than discrete, there- parallel surfaces separated by a distance t. The
fore this sum should be replaced by an integral. index of refraction of the film is n, and that of
References 1 or 2 show that when this expres- the surroundings is no and nz. The amplitude
sion is evaluated the result is of the incident wave is taken as Eo. At the first

=I:
interface, the coefficients of amplitude trans-
mission and reflectance are 01 and P 1 , respec-
IT 1(0) [1 + cos {21TLl(00 +o}] do tively, and at the second interface, they are U2
and P2 , respectively, where

where
1
~= 00 + o. The negative sign in - P 1 means that it is mea-
sured inside the layer of index n 1. (As explained
This extremely important formula states that, below, we want to calculate the transmitted
in a Michelson interferometer, the average in- wave.) The phase difference introduced by two
tensity IT (Ll), a function of spacing, is the consecutive reflections inside the middle layer
Fourier transform of the intensity 1(0), a func- for a wave incident perpendicular to the surface
tion of wavelength (spectrum). We may thus is
regard this instrument as a form of computer
which performs a Fourier analysis of the spec- [) _ [21Tn1~
trum of the source. (We shall see that a lens 1 - \ A J 2t.
behaves in a similar way. In fact, it can instantly
produce a two-dimensional Fourier transform We will calculate the expression for the trans-
of the spatial distribution of a source, and this mission coefficient and then apply conservation
is something which is extremely complicated of energy to obtain the expression for the re-
when done numerically or by an electrical flection coefficient. The amplitudes of the
network.) transmitted components are:
An interferometer can be used as a spectrom-
eter, by measuring IT (Ll) and taking its Fourier E1 =U1U2EO
transform to yield 1(0). One may then ask about E2 = U1 U2P1P2 E O
the advantages of the interferometer as com-
pared to a conventional grating spectrometer. E3 =U1U2P12P22EO,
In the latter, light from the source is dispersed
over a wide area, and a narrow spectral band is
detected. It is difficult to detect a signal from
weak sources and simultaneously attain high and the total transmitted amplitude E t is
resolution because most of the light available
at a given moment is not used. On the "ther E t =E 1 +E2 +E3 +···+En •
867 OPTICS, PHYSICAL

SunlIght

FIG. 4. Young's divided wavefront experiment.

Combining the amplitude and phase expres- slits are usually used because they transmit more
sions and using 10 =! Eo 2 yields light and the resulting pattern is easier to ob-
serve. In the arrangement used by Young (Fig.
10a1 a2 2 2
IT = ----=--=---=----
1 +P1 2P2 2 - 2P1P2 cos 15 .
4) sunlight pass through a pinhole P1, which
acts as a point source. This assures that the
wave on the right-hand side of the pinhole is
In the absence of absorption, the reflection spatially coherent. A screen with two adjacent
coefficient R and the transmission coefficient pinholes, P2 and P3 , divides the wave front into
T add to unity, or two portions, and the openings act as mutually
coherent point sources of spherical waves. At a
R =1 - T spot on the screen where the path difference
where for the two waves is zero, they are in phase, and
a maximum in the intensity exists. If the path
T=hll0' difference is X/2, the two waves are 180 0 out
of phase, and a minimum exists. The alternating
Combining the above three equations with maxima and minima constitute a fringe system,
1 = a2 + p2 gives each of the maxima corresponding to a differ-
ent order of interference.
P1 2 +P2 2 - 2P1P2 cos 15 We shall calculate the interference pattern
R = 2 2 .
produced by a single point source emitting
1 +P1 P2 - 2P1P2 cos 15
monochromatic waves and later generalize this
It can be seen by inspection of this equation to polychromatic and extended sources. In Fig.
that, with proper selection of values for n1 , n2, S a monochromatic point source S is located on
t, and X, R can be zero, i.e., reflection can be the line perpendicular to the midpoint between
eliminated from the surfaces. the two slits, S 1 and S 2. The separation of the
Today multiple-layer thin films are used to slits is a. The intensity distributions of the pat-
achieve many interesting and useful results. For tern which would be observed on a screen at a
examp~e, multiple layer interference filters can distance d from the slits is given by
achieve very low reflection over a broad band,
or increased reflection, or high reflection at one 2 15
IT = 410 cos -
waveband accompanied by low reflection at 2
another waveband.
The two-slit experiment illustrates the other where 15 is the phase difference:
type of interference-division of wavefront. It
was first performed in 1802 by Thomas Young, 15 = 211 ay
who used pinholes. In repeating the experiment, A d'

POInt
t
y
source
S~~______________~~__~~~____~ ~

Observation
screen

FIG. 5.
OPTICS, PHYSICAL 868

For an extended source in place of the pin-


hole source, the contribution of each point on
the source must be summed. A point j is located
a distance u from the center of the source
which is at a distance D from the slits. The phase
difference between the two beams from passing
through the slits is:
8 = 21T fay + au\ .
"A \d dl
Then the contribution to the intensity on the
screen from the jth point is given by
dIj(y) = 2I(u) (1 + cos 8) duo
For incoherent sources, the total intensity at
point y is the sum of the intensities from each

f:
point j on the sources; therefore FIG. 6. Fraunhofer diffraction.

IT (y) = 2 I(u) (1 + cos 8) du o A classical example of Fraunhofer diffraction


is produced by a circular aperture of radius R.
The pattern (Fig. 6) was first calculated by G. B.
Again we see a Fourier transform relationship Airy in 1835 and is often referred to as the
between the intensity distribution of the pattern Airy disk. The derivation can be found in many
and the intensity distribution of the source. As references. 2 The intensity is given by
the source becomes larger the pattern becomes
less distinct. The visibility of the fringes is re-
lated to the spatial coherence of the source just
as the visibility of fringes in the Michelson in-
1=10 t2J1(X~
--
x
2

terferometer is related to the temporal coher-


ence of the source. where J 1 is the first order Bessel function and
Division of wavefront has been used to mea- x = (21T/"A)R sin (). The angle () is the measure-
sure the angular diameter of stars with an accu- ment between the normal at center of the
racy exceeding that attainable from telescopes. aperture and the point of observation. (See
Diffraction Diffraction is the bending of light Fig. 7.)
as it passes an object. Under certain conditions Diffraction is very important in optical imag-
examination of a shadow of an object on a ing. The limit on the performance of many
screen behind it reveals that some light did not optical systems is set by their diffraction
travel in a straight line from source but was properties.
bent. A close examination of the edge of certain Polarization Polarization of light was dis-
shadows reveals that they. are· composed of covered in 1808 by E. Malus, a French engineer,
alternating light and dark bands. The patterns while looking at sunlight reflected from the
can be explained and predicted using wave windows of the Palace of Luxembourg through
theory.
Division of wave front leads naturally into
diffraction, for the two phenomena are closely
1/10 = [2J 1 (xl/x) 2
related. When parts of a wave front come from
different openings and combine, we have inter-
ference; but when parts of a wave front come
from a single opening and combine, we have
diffraction.
There are two categories of diffraction: Fraun-
hofer and Fresnel. Fraunhofer diffraction occurs
when both the source of light and the observa-
tion screen are located at distances from the
aperture which are very large compared to the
size of the opening. It is the simpler of the two
to evaluate and is the condition normally en-
countered. Fresnel diffraction occurs when the
source, observation screen, or both are at dis-
tances such that the ratio of aperture size to
distance cannot be neglected. FIG. 7. Plot of Airy disk function.
869 OPTICS, PHYSICAL

0
a calcite crystal. He saw two images which varied

0
in intensity as he rotated the crystal.
This and other polarization phenomena estab-

I
lish that light waves are transverse rather than
longitudinal. Only a transverse wave i.e., one in
which the amplitude is perpendicular to its
direction of travel, can be two sided.
Waves of perpendicular polarization do not

0
~nterfere. If, for example, polarizers are placed
m each arm of a Michelson interferometer and
oriented at right angles to each other the
interference pattern disappears. '
c ::>
. Light waves are electromagnetic and usually
mteract most strongly with materials via the
elect.ric field. The magnetic interaction is pro- Linear Circular Elliptical
portlOnal to vic, where v is the velocity of the FIG. 8. Polarization.
electrons in the material and c is the velocity
of light. Hence, it is small.
An early demonstration of the polarization of
electromagnetic waves was performed by Hertz. Polarization manifests itself through the inter-
His demonstration can be repeated using a set action of light with matter. To exhibit the po-
of parallel wires. When the wires are oriented larized nature of light effects, the matter must
along the direction of the electric field of a be anisotropic. Crystals such as quartz calcite
radio wave, as determined by the orientation and cordierite can be used to display p~larizing
of the transmitting antenna, currents are set effects.
up along their length. By conservation of en- Double Refraction This phenomenon is ex-
ergy, the wave must be absorbed in the process. hibited by certain materials. A beam of light
When the wires are rotated 90° so that they are can be refracted or deviated into two portions,
perpendicular to the direction of the electric each of which travels in a different direction.
field, the wave is not absorbed . Double. refraction was discovered in 1669 by
This experiment was repeated in 1963 by Barthohnus, who observed two images when
Bir~ and Parrish using light waves. They de-
he looked at an object through a crystal of
posIted gold strips onto a plastic diffraction Iceland spar (calcite, CaC0 3 ) (see Fig. 9). The
grating have 50,000 lines/inch. The gold film effect can be explained by assuming that the
was evaporated onto the grating from the side. crystal had two different indices for this double
Viewed from the top, the gold looked like wires, refraction. Therefore, light is refracted in two
each less than a wavelength wide. When inci- different directions as it enters the crystal
dent light was polarized parallel to the gold (hence the name, double refraction).
"wires," it was absorbed, and when polarized in The indices for this double refraction are
the perpendicular direction, it was transmitted. referred to as ordinary (no) and extraordinary
It should be pointed out that the usual analogy (n e ) (Fig. 10). We then apply Snell's law twice
namely ,
of polarized light with waves on a string and a
picket fence is off by 90° because of the orien- sin e
tation of the electric vector E. When the ampli- --=no
tude of string waves is polarized parallel to the sin cf>o
openings of a picket fence, the waves are trans-
mitted through the fence. But electromagnetic and
waves polarized parallel to wires are absorbed. sin e
The amplitude of a light wave is a vector-it -.-,1.- = ne(cf>e)·
has both magnitude and direction-and the sm 'l'e
simplest type of polarization is linear polariza-
tion. In this case, the amplitude vector points There are many intereJting phenomena which
in a fixed direction normal to the direction of depend on the interaction of polarized light and
propagation k (also a vector). This is also known anisotropic materials. A convenient way to
as plane-polarized light, because the amplitude
and the wave vector define a plane.
Light can also be circularly polarized. In this
case, the direction of the amplitude rotates in
time. The most general state is elliptical polar-
ization, where the amplitude pattern at anyone
location is an ellipse. Both linear and circular
polarization are special cases of elliptical polar-
ization. There are always two orthogonal states
of polarization, as shown in Fig. 8. FIG. 9.
OPTICS, PHYSICAL 870

Extraordinary ray
Ordinary ray

FIG. 10.

categorize them is by the general expansion of the left side of the CRT and increases as the dis-
the material response P in a power series in the play travels to the right . Signals measured by
electric and magnetic fields and the spatial oscilloscopes are usually generated electrically.
derivatives. Other physical signals must be developed me-
chanically and converted to electrical energy
p j W = p i o + XijEjW + Xijl VIEt for display on the oscilloscope through the use
of transducers.
+ XijlEjW1 E l w , + Xijlm EjW1 E l w , Em W3 Today, most oscilloscopes require manual
+ XijlEjW1BIW, + XijlmEjW1B/w'Bm w" operation. Each has a CRT to display data,
a vertical input to condition the signal, a trig-
where the superscripts denote the frequency of ger section to select the signal parameter to
the field and the subscripts denote Cartesian begin the display and a horizontal section to
components. Each term in this series describes provide an accurate time comparison to the sig-
one or more effects; for example, the first term nal being displayed. A description of how these
describes pyroelectricity and ferroelectricity; fundamental sections of the oscilloscope func-
the second double refraction; the third , optical tion together follows. (See Fig. I .)
activity; and the fourth, the linear electrooptic Display The oscilloscope develops a graphic
effect and second harmonic generation. This representation of the signal by moving the
last is the generation of a light wave at twice CRT's electron beam across a phosphor coating
the frequency of an incident wave; it is analo- on the inside of the viewing screen. This results
gous to the generation of harmonics in electronic in a bright, visible display that traces the path
circuits due to the nonlinear response of circuit of the electron beam. (See Fig. 2.) The display
elements. brightness is controlled by the oscilloscope's
intensity control [Fig. I(A) .J Proper focus
RICHARD A. PHILLIPS provides a crisp display . The display is turned
on at the time the trigger section has sensed
References the correct input characteristic that a user
has selected as the starting point (usually a
1. Michelson, A., "Light Waves," Chicago, Univ. Chi- voltage level).
cago Press, 192 7 . Vertical Section The primary functions of
2. Nussbaum, A., and Phillips, R., "Contemporary the vertical sectiod are to supply the display
Optics for Scientists and Engineers," Englewood section with a Y-axis and to condition the sig-
Cliffs, NJ, Prentice Hall, 1976. nal being captured . Conditioning includes dis-
3. Shurcliff, W., and Ballard, S., " Polarized Light," play size, input coupling, display position,
New York, Van Nostrand, 1964. signal delay to allow viewing of leading edge
transcient responses and trigger signals. (See
Cross-references: DIFFRACTION BY MATTER AND Fig. 3.) The vertical section is further broken
DIFFRACTION GRATINGS; FOURIER ANALYSIS; down into key blocks: an attenuator to set dis-
INTERFERENCE AND INTERFEROMETER; OP- play size, input coupling to select fundamental
TICS, GEOMETRICAL ; POLARIZED LIGHT; RE- signal characteristic, trigger pickoff to trigger
FLECTION ; REFRACTION; THIN FILMS. display start, and a delay to allow user to see
leading edge of signal.
The vertical sections's attenuator reduces
OSCILLOSCOPES large signals to a size necessary for display on
the CRT. Attenuator sections also provide three
The oscilloscope is a measurement instrument input coupling sections: AC, DC, and GND . AC
that presents electrical phenomena (signals) in is selected for changing or AC (alternating
the form of an X- Y graph on a cathode ray current) signals. DC is selected for measuring
tube (CRT) for analysis. This CRT, much simi- DC levels like those in power supplies. And
lar in appearance to the one in a small television, GND is selected to identify where ground or
typically presents signals in a voltage (Y-axis) zero voltage is located on the display.
versus time (X-axis) relationship . (See Fig. The vertical amplifier increases the display
2(B).) As voltage increases , the corresponding size for extremely small signals. In addition the
location on the CRT moves upward. Time , amplifier generates a trigger pickoff signal to
with relation to the displayed signal , starts at drive the trigger circuitry.
871 OSCILLOSCOPES

Horizontal
Triggering

(AI

(Bl

FIG. 1. (A) Front Panel Controls for functional blocks. (B) Functional relationship of blocks.

The delay compensates for normal propaga- a retrace signal to move the electron beam back
tion delays in any electronic circuit. With the to the left side of the CRT. An internal holdoff
delay line the CRT is able to display leading time is then generated to reset the trigger cir-
edges of the signal that are lost otherwise. cuit to accept another trigger signal, thus com-
Figure 4 illustrates how a delay line enables pleting the horizontal signal loop. (See Fig. 6.)
the CRT to display the leading edge of a posi- The horizontal section establishes an accurate
tive step response . timing reference at several different possible
Horizontal Section To develop a graphic time selections, trigger holdoff, X-axis signals,
representation of the signal being captured, an and positioning.
oscilloscope needs an accurate time relationship Triggering Section Triggering determines
on the X-axis of the CRT. This is accomplished when the oscilloscope draws the graph. The trig-
by using a very linear sweep generator which ger circuit tells other oscilloscope circuits when
produces a sawtooth (ramp) waveform. Many to start drawing the graphic representation of
different ramp (time/div sweep) rates are avail- the input signal (Fig. 7). Controls in the trigger
able for selection so the user can choose the section that set up the required trigger condi-
one that will develop the required graph of the tion are coupling, slope, level, and source. The
signal. (See Fig. 5.) The horizontal section in- control circuit is much like the vertical input
corporates an extremely linear and accurate coupling with possible AC and DC selections.
timing ramp that turns the display on at the Positive and negative trigger slopes are available
trigger point. As the ramp voltage increases the on almost any oscilloscope. [The trigger slope
X-axis of the signal, it travels left to right on represented in Fig. 7 is positive (+ ).J The level
the display . At the end of the selected sweep control determines the trigger signal amplitude
(ramp) rate the horizontal section generates required to activate the trigger circuit. Most
OSCILLOSCOPES 872

r - DISP LAYSECTI oN -
CRT
DISPLAY

~ } INTENSITY
FOCUS
VERTICAL TRACE
- SECTION
(YAXIS)
ROTATION

'---r--

-4-
TRIGGER I HORIZONTAL i-+-
SECTION
SECTION
I (X AXIS)

(A)
BEAM DEFLECTION PLATES

..-.-..,"-----==-___P~g!il~~R

(8)

FIG. 2. (A) The basic sections of the oscilloscope work together to create a graph. (B) Changes in voltage on
the CRT's deflection plates move the electron beam across the display, causing the phosphor coating to glow.

DISPLAY
1....,..._ _...... SECTION

I
I
I
I ____ _
L

FIG. 3. Blocks of the vertical section.

oscilloscopes provide internal source (from Applications for oscilloscopes are limited only
vertical input signal) or external to trigger on by one's imagination. Researchers in the com-
a different signal than the one displayed. puter field are using oscilloscopes in developing
Those who use an oscilloscope span many smaller, high-speed data processing units.
different industries and several job functions. Also using oscilloscopes, the development of
electric control systems necessary to continue
the exploration of deep space is underway.
Faster, more efficient digital communications
systems are being designed with the help of
I II scopes. Using scopes, designers in the auto-
mobile industry are developing safer cars. Ad-
J vancement in LSI circuitry, new digital high-
speed memories, and A-D converters are aided
by the oscilloscope. Using scopes, medical
A. WITHOUT DELAY LINE B. WITH DELAY LINE
researchers continue to develop methods of
FIG. 4. The horizontal sections's delay line allows restoring eyesight. Work on robotics is advanc-
viewing of a signal's leading edge. ing with the help of scopes, and eventually will
873 OSCILLOSCOPES

INCREASING
VOLTAGE

RAMP

---
I
CRETRACE

HOLDOFF
I

FIG. 5. Horizontal sections sweep ramp .


FIG. 7. Trigger points tell the oscilloscope when to
begin drawing a graph. Also , for repetitive signals the
trigger signal is sent at the same time to produce a
stable "repetitive" display.

ability to capture and display signal shot or low


improve manufacturing productivity in all repetitive rate signals.
industries. Until the 1970s, oscilloscope designs required
The Development of the Oscilloscope The several dozen power-hungry vacuum tubes.
use of CR T's began around the turn of the 20th As many as 20 different oscilloscope types
Century. Their use in advanced oscilloscopes were available. As bandwidth pressure con-
began about 40 years later, with the develop- tinued, the vacuum tube became economically
ment of the first triggered oscilloscope. Since unfeasible. The 1970s saw increased usage of
the late 1940 's demands have increased for semiconductors in the oscilloscope , and even-
higher bandwidth, more accurate voltage and tually vacuum tubes were rendered obsolete.
time measurements , better triggering, and the Semiconductor technology provided the in-

.... VERTICAL
SECTION DISP LAY
.. ... S ECTION
2AXIS
SIGNAL
,.. TO
CRT
..J .. ..

---,
~

r- - - - f--------I-

I I
I I
I

t
TRIGGER SWEEP I
~I I
SECTION GENERATOR

- SEC/DIV POSITIO N
I
I
I
_\j
- TRIGGER HOlDOFF

L ________
- MODE

HORIZONTAL
SECTION
FIG. 6. Blocks of the horizontal section.
OSCILLOSCOPES 874

crease in accuracy and stability and the higher resolution timing measurement of digital logic
bandwidth demanded by users. The semicon- circuits. Other types of display (e.g., state
ductor also provided small size and power con- tables, mapping, etc.) are also available in many
sumption, which made possible more than logic analyzers .
100 different sophisticated types of scope. The Future Prior to the 1980s oscilloscopes
Storage Scopes In addition to advancements were developed primarily for manual operation.
in traditional oscilloscopes that have historically Future developments will, with continued ad-
addressed repetitive signal measureme'nts, new vancements in LSI, A-D memory speeds, and
"storage" oscilloscopes were developed that programmable instrumentation, focus on the
could capture, store , and display single events. oscilloscope's interface with a computer. New
This storage technique was made possible by oscilloscopes of the 1980s will provide im-
the use of special phosphors and processes that proved productivity through the use of pro-
stored the signal for long periods of time on grammability, digital storage, and increased
the inside of the CRT viewing screen in a bi- measurement specialization.
stable storage mode. (Bistable is the name given Oscilloscopes not tied to the external con-
to a storage process that provides a long viewing trollers/computers will offer the user more than
time.) Soon, faster storage oscilloscopes became just graphic CRT displays. Instead, answers
available as variable persistence and fast transfer (e.g., risetimes, falltimes, pulse width, fre-
storage CRT's were invented. People requiring quency, period, etc.) will be available auto-
storage products used the storage speed as a matically with a higher degree of accuracy and
major criterion for selection, second only to repeatability. Vast improvements in the tradi-
vertical bandwidth . tional oscilloscope parameters (e .g., bandwidth,
With the development of storage CRT's, the stored writing speed, accuracy , and timing
oscilloscope market took two basic directions resolution) will occur. Oscilloscope numbers
in the 1970s: (l) improved bandwidth and ac- will expand beyond the 100 types available in
curacy for making repetitive signal measure- 1980.
ments, and (2) improved bandwidth and storage
speeds for those requiring nonrepetitive signal DAVID E. MCCULLOUGH
measurements. Nonstorage oscilloscopes passed
the 1 GHz mark . During the same decade, con- References
ventional CRT storage speeds became sufficient
to capture and display 400 MHz signals. Tektronix, "The XYZ's of Using a Scope," 1981.
Advancements in digital memory speeds, and Tektronix, "The Digital Storage Oscilloscope," 1980.
faster and lower priced A-D converters, resulted
in a new type of storage oscilloscope in the Cross-references: ELECTRON, ELECTRON OPTICS,
late 1970s that did not require special CRT CIRCUITRY, PULSE GENERATION .
storage. In addition, digital storage provides an
effective method of securing signal data and
transferring it to a computer for processing of
long-term documentation. OSMOSIS
Special Purpose Scopes Over the years, spe-
cial purpose oscilloscopes have made possible If, into the bottom of a jar containing water,
totally new measurement methods . The first, a solution of cane sugar is introduced with
the spectrum analyzer, was developed for spe- care so as to avoid mixing, not only will the
cialized measurements made predominantly in molecules of cane sugar diffuse into the water
the communications industry. Instead of pre- but the molecules of water will diffuse into the
seding graphic data in the traditional voltage sugar solution. These processes will go on until
(Y-axis) versus time (X-axis) format, the spec- the concentration of sugar, and of water, is
trum analyzer displays dB (Y-axis) versus fre- the same throughout.
quency spectrum (X-axis). If the solution is placed in a container, whose
A second type, the sampling oscilloscope, walls are relatively impeImeable to the sugar
maintains the traditional oscilloscope graphic while being permeable to the water, and the
display in voltage versus time, but higher band- container is placed in water, the water will
widths (to 14 GHz) are made available using a pass from the outside into the container. The
random sampling method. However, sampling term osmosis is usually restricted to the passage
oscilloscopes are only good for making repeti- of water. If the influx of the water results in
tive signal measurements. an overflow of solution to somewhere other
The latest development is the logic analyzer. than the surrounding water this overflow will
Like the traditional oscilloscope, some logic continue until all the sugar is removed from the
a"nalyzers show time on the X-axis. Instead of container. If the container is closed, water will
voltage on the Y-axis, however, logic analyzers continue to enter until there is sufficient stress
display only digital wave shapes without in the stretched walls to cause a pressure on
reference to voltage. This new type of instru- the solution inside; this will eventually stop
ment has been used primarily to make high- the influx. Of course, if the walls of the
875 OSMOSIS

container are not completely impermeable to pressure on the solution to a value P. This
sugar, then the sugar will be escaping into the sufficient increase, P - pO, is the osmotic pres-
water outside the container and this will go on sure of the solution. In general we cannot state
until the concentration of sugar is the same out- p - pO, the osmotic pressure, knowing only
side and inside. If the walls of the container were N 2 , the ratio of moles of solute to the sum of
impermeable to water but permeable to solute, the moles of water plus solute, the mole fraction
the latter would escape. The cause of this of solute. (N 2 = I - N,).
osmosis, this "pushing," of water into the What we can say is, that if in a solution with
solution is that the tendency of the water a mole fraction N z of solute under a pressure P
molecules to escape from the pure water is the water has the same escaping tendency as it
greater than that of the water molecules in has in pure water at the same temperature and
the solution. Consider water in contact with a at a pressure pO, then dPldN 2 = AlB where
limited volume of air. Of those molecules of dP/dN 2 is the increase of P relative to increase
water striking the surface some will have suffi- of N z to keep the escaping tendency unchanged;
cient energy to escape into the air and this A is the decrease of escaping tendency relative
escape will result in net loss to the air which to increase of N z when P is unchanged; and B
will continue until the concentration of water is the increase in escaping tendency relative to
vapor molecules there is such that the rate of increase in P when N2 is unchanged. For dilute
escape from the air (into the water) equals the solutions A/B approximates to RT/V, and so
rate of escape from the water (into the air). If P - po approximates to N 2R T/ V, where Viis
the volume of the air space is fixed, the pressure the volume of one mole of water, R is constant
will rise. Just as the temperature of all bodies is 82.07 cm 3 atmldeg, and T is the absolute
the same when they are in thermal equilibrium, temperature. For very dilute solutions N zl VI
although their heat content per unit volume approaches nzlV, the number of moles of
varies with their specific heat, so the escaping solute in a volume V of solution and P - po =
tendency of the water is the same in all systems nzRT/V (van't Hoff's equation). This gives an
when they are in aqueous equilibrium, whether osmotic pressure of I atm for one mole of
the system is pure water, solution, gas phase, solute in 22.4 liters at O°C. There is a departure
wettable solid, etc. The same concept can be from both these equations for stronger solu-
applied to any substance, say mercury in pure tions. The fact that one mole of a perfect gas
mercury, in air containing mercury vapor, and in 22.4 liters at O°C exerts a pressure of I
in an amalgam with another metal such as zinc. atmos, coupled with the above, has led some
The term osmosis is usually restricted to the to say that the osmotic pressure is the bombard-
passage of water from a solution where the ment pressure of the solute molecules. It is
escaping tendency is higher to a solution where correct to say that for very dilute solutions the
it is lower. Moreover it is usually restricted to osmotic pressure of a solution is equal in
the passage through a solid or liquid barrier magnitude to the pressure the solute molecules
which prevents the solutions from rapidly would exert if they were alone in the same
mixing. It is not used for the passage of water volume and behaved as a perfect gas, but that
in the form of vapor through the air from a is another matter.
dilute solution to a stronger solution in the To measure the osmotic pressure, a semiper-
same confined space, although the process is meable membrane must be prepared which itself
fundamentally the same. It is sometimes re- can stand sufficient pressure, or it must be
stricted to the case where the barrier is semi- deposited in the walls of a porous pot so that
permeable, that is, lets through water but not the pressure can be sustained. With the solution
solute. being inside and water out, pressure is applied
The escaping tendency of water is lowered by to the former until there is no net movement of
the addition of a solute. If the molecules of the water.
solute have no other effect than to reduce the Observations by Berkeley and Hartley showed
number of molecules of water in unit volume, that for 3.393 gms of cane sugar per 100 gms
then the escaping tendency of the water will be H2 0, the osmotic pressure at O°C is 2.23 atmos
reduced proportionately to the reduction in the while the van't Hoff equation .sives 2.17 atmos
mole fraction of water, N 1, the ratio of the since nz/V= 9.27 X 10- 15 • If N 2 /V, is used
moles of water to the sum of the moles of instead of nz/V, the value of 2.22 is obtained.
water and solute. Such is a "perfect" solution. With stronger solutions the measured osmotic
If, however, there is some attraction between pressure exceeds that calculated: with 33.945
the solute and water molecules, a smaller frac- gros of sugar, 24.55 atmos is the value measured,
tion of the latter will have energy sufficient while van't Hoff's equation gives 18.41 and the
to escape-a "nonperfect" solution. The escap- other 21.8 atmos. The observed value is given
ing tendency is increased by pressure. Hence a if, in calculating N z IN 1 , it is assumed that each
solution in which the water has lower escaping sugar molecule immobilizes five molecules of
tendency than it has in pure water at the water.
same pressure, pO, can be brought to water The solutes in the vacuole of a plant cell are
equilibrium by a sufficient increase in the exposed to the inward pressure of the distended
OSMOSIS 876

cell wall and that of the turgid surrounding For many naturally occurring membranes
cells. Water will pass into the cell vacuole from which are not completely semipermeable, i.e.,
water outside as long as the total inward pres- they let solute molecules through slowly, elec-
sure on the vacuole falls short of the osmotic tro-osmosisis important. If the membrane tends
pressure of the solution in the vacuole. Passage to lose negative charges to, or take negative
of water into the vacuole dilutes the contents charges from, water or solutions, then the water
and lowers the osmotic pressure and increases molecules, in the pores of the membrane, will
the inward pressure by distension. The amount tend to take on an opposite charge to the mem-
by which the inward pressure falls short of the brane. If there is a gradient of electric potential
osmotic pressure is called by some the suction across the membrane, the charged water will
pressure. move in the appropriate direction. If the poten-
A substance such as cellulose or gelatin tends tial difference is established by the use of elec-
to take up water, the tendency decreasing with trodes, this is electro-osmosis.
increase in water content until the stress in If, for any reason such as the greater solu-
the substance causes a sufficient rise in the bility of the solute relative to that of water in
escaping tendency of the water in the substance. the membrane, the volume flow of the solute
This process, which like osmosis is a movement from the side of higher concentration is greater
from higher to lower escaping tendency, is called than that of water in the opposite direction,
imbibition, and the pressure on the substance then there is negative osmosis. This occurs
sufficient to stop the uptake is the imbibitional with some plant cells with lipid-soluble sub-
pressure. Hence, if a plant cell with a cellulose stances of small molecular weight.
wall, after coming to equilibrium with a solu- The rate of osmosis depends, not only on
tion, is transferred to water, the wall takes up the excess of the escaping tendency of the water
water by imbibition and the vacuole by osmo- in the phase from which it moves, over that in
sis. The latter considers only the over-all move- the phase to which it moves, but also upon the
ment from outside to vacuole and does not con- area of surface of interchange and the over-all
sider the movement from cellulose to vacuole, a resistance experienced by the water. The rate
process which is the reverse of imbibition. A of shrinkage of the vacuole of a plant cell when
plant cell in equilibrium with a solution having it is placed in a strong solution at first seems
an osmotic pressure of 25 atmos would also surprisingly high. When allowance is made for
be in equilibrium with air about 98 per cent the fact that the ratio of surface to volume
saturated with water vapor. If the cell were increases as the linear dimension is reduced
transferred to a saturated atmosphere, it would then it is realized that when the vacuole of a
take up water. We lack precise terms for the spherical cell of radius 30 Il shrinks to half its
passage of water from air into the cellulose volume in say 5 minutes the passage of water
and into the vacuole. Condensation, which is only I ml per 10 000 cm 2 per minute although
might be used, ranges more widely. the thickness of the layer between vacuole and
The escaping tendency of water is affected by external solution is of the order of 1 Il in thick-
factors other than concentration of solute and ness. Under other circumstances, this layer
pressure. Increase of temperature increases es- might be said to be relatively impermeable to
caping tendency. This is a complex problem water. It seems probable that much of the re-
involving not only transfer of water but also of sistance resides not in the cellulose wall or cyto-
heat. To a minor extent, the passage of water plasm but in the tonoplast which separates the
from pure water to a solution involves a heat latter from the vacuole.
transfer. G. E. BRIGGS
p
PARACRYSTALS* of the interface with their container exhibit
There are three states of aggregation: solid, anisotropic optical and electrical prop~rties. !he
liquid and gaseous. This categoric division, ~o~­ explanation for this phenomenon IS denved
ever does not explain why there are, WIthIn from the shape and the dipole moment of the
the 'solids category, such different sub~tan.ces molecules of the (mostly organic) liquids. They
as metals minerals, and glasses, or why lIqmds, are oblong or platelet-shaped as in soap solu-
when soiidifying, convert partly into a crys- tions.
talline and partly into a glassy state. At the Crystals Today we have much more know-
same time substances exist which do not be- ledge of the crystalline state. The regular ~nd
long in anyone of these groups. The word plane-facets-limited shape of a polyet~er, whl~h
exhibits many single crystals, has Inspued Hauy
"para crystal" was use~ for ~he ~irst !ime by (8) to speak of "elementa~ building blocks" of
Stanley (I) in connectIOn WIth bIOlo~lcal s~b­ equal size in the crystal, In the shape of cubes
stances and by Rinne (2) in connection WIth or parallelepipeds, which accumulate densely
liquid crystals. It is t~e tas~ ~f.this contribution like a three-dimensional chessboard. In present
to find an atomistic defInItion of the para- terminology these are the "lattice-cells," and
crystal (3) and to demonstrate the fundamental Seeber (9) recognized very early t.hat !hey
importance of this idea relative to concept and represent the space in which the baSIC umt ~f
application. In doing this. one cannot n~glect atoms, ions, or molecules of the crystal IS
discussing at the same time the x-ray Inter-
ferences of all these substances, because with- housed. Since the distance between centers of
out the discovery by von Laue (4) that one gravity (CG) of !he basi~ unit~ is a~sol~tely ,;on-
can gain direct information on shape ~nd order- stant, i.e., 4 A In certaIn lattIce duect1~ns, we
ing of atoms and molecules, everythIng would can safely say that in a range of I /It In these
remain pure philosophy. One ~an und~rstand directions, exactly 2500 building blocks are
the paracrystalline state only If one sll~ul.ta­ situated, or that the distance from one CG to the
neously has knowledge of the other eXIstIng 2501st neighbor is 10004 /l. Thus, because the
modes of solids. building blocks are lined up in a countable
Gases The gaseous state is characterized by. a fashion, one uses the expression "ideal remote
maximum of disorder. The molecules are dIS- order."
tributed in space completely irreg,ula~ly ~cco.rd­ Amorphous Solids Almost a~ glass.es, ce-
ing to purely statistical laws. Theu ~lstnb~tIOn ramics and plastics have properties, whIch can
of velocity and proximity obeys, In the Ideal be explained by the presence of "amorph0,us"
case the Boltzmann statistics. The more com- components, in which th~ at~ms are particu-
pres;ed the gas, the narrower become these larly disordered. How o~e V1s~ahzes t?e order of
distribution functions. the atoms is open to dIScussIon. It IS generally
Liquids The mobility of m<,>lecules in l~q?ids agreed upon, however, that no lattice regions
is considerably less than that In gases; OpInIOnS and no remote order exist here.
on their spatial order, however, are at present The Idea of Paracrystals The status today is
still divided. The school of Bernal (6) advocates that we have lattice structures with remote
the extreme of highest disorder; whereas the order on one hand and amorphous materials,
other extreme of quasicrystalline order is sup- liquids, and melts as structures without lattice
ported by Kaplow, Strong and Averbach (7). and without remote order on the other hand
opposing each other. On the basis of som.e
Liquid Crystals Whereas liquids ge~erall~ ex- fundamental work by J. J. Hermans (10) It
hibit equal properties in all three dlmensIO~s, was shown (3) that the idea of a statis~ic~l
i.e., they are isotropic; so-called "crys~~~ne c1ose-order for all building blocks, as It IS
liquids"t are also known, which in the VICInIty advocated by Zernike (11) and Debye (12),
*The editor is indebted to Mr. Walter Trapp of the definitely allows three-dimensional lattices of
Air Force Materials Laboratory for his careful transla-
tion of this article. *1 A(named after the Swedish physicist Angstrom)
tSee the essay by B. BOttcher and D. Gross in is \l<lual to one ten-thousandth of a micron.
Umschau in Wissenschaft und Technik, 69, 574 t 1 /l(micron) is equal to one ten-thousandth of a
(1969). centimeter.
877
PARA CRYSTALS 878

the CG of the building blocks. These lattices From the building block (p + 1, q , r) to the
consist of rows of countable building blocks adjacent CG (p + 2, q, r), one finds a com-
(running number p), spaces (running number q), pletely different distance vector z, which
and columns (running number r); however, once more obeys the same probability level
these are not straight lattice lines, but some- H(z). According to the rules of probability the
how curved in such a way that, for instance, combination of the quantities y and z occurs
along the rows of building blocks the distances with the frequency H(y) . H(z).
between adjacent CGs (p, q, r) and (p + 1, q, r) If one is interested in the statistics of the
are not constant but adopt different values distance between nearest neighbors (p, q, r) and
y with the frequency H(y) . This means also that (p + 2, q, r), one has to calculate the frequency
this frequency distribution guarantees the same H 2 (x) for the vector y + z = x, taking into
close-order ratios for all building blocks (p, q, r). consideration at the same time all possible

... ....................................................
".:::::;::'
.......................................
...................... .... ....................
.
..::,::::: :'. ':.: : '.: ...... ':. '.:.- .- .. : : :: :.' : ;: .,': : '::::
....
.. . ... .. ... . ...............
..................................... ................ ........
.......
..............
, ...
.
........ . ...
.. ......................................
......... ....... .............
......................
......... ... .. ,...
........
....... .. .......
.. ...............
.................................. ............ .
.
...... .. ....... .....................
.,, .....
....
.. .... ..... ..... ........ ............
. , ............................. ........... , ........ .
. ....... ..... , .....
... ....... ............
" . , ........... .
, ............
......... , ................ .
...............................
"" ..
.......... ... .. . ..................
............................................ .....................
..........
........
............ ... .. ........................
.................. . .............................
. ............
.......................................................
....... .. .......... .. ................ ....
..... ....
....... .. .........
....... .. .. .... ... ....... ............. .... .
.............
, ........................ ............... .
.... ...... ............................... .. ....
..................
................. . ........ ..............................
.... ................................
....... .. .... .............. .... ...... ...............
.. ..... ......................... ........... .. ........ ... .. ....... .
................................ .. . ...... ................
.....................
............. ...................
.. ..... ......... ...
, , ..........
... .
........... , ......... , ... ...... ... . .... . ..... .... .
...................
............ , ........................ .
.. ....... .. ............................... .
. .............................. , ........ ... .... ....... .
........ ......
.............. ............
.. ..... ...' ....
. ..... ... .. ... ......... ....
..... ..... ..., .... ................. ..........
..... ....... .. . . . ..... .............................
. .. ... ... ..................... ...... ..... .. ....... ..
..............
.... .
...... ...
. .. ...
..... ..
.... .. ......................
......................... .. . .................................................... ..
..................... .... ........ .. ....
.................................... . .. ........... ..... .....................................
............... .......................... .. .
..... ............
....... ....
.... ....
.... .. ..
.. ............ ...... . ..
. ....................... ...
.. .... . .. .......... .. . ........ ....... .....
.. .................. . .. .... . .... .......... ........ .... ....
.... ............
.......... .. ... .... ... .... ....
............. ......
..... ...... ...........
... ....... ............ .
........ .......................
.......... .... ................ ............
...........
. ...........
........ .. .................. ... .. .... .... . ........
..... ............................................. ., .................. ................ ... .............. .

(b)
(a)

::::::,": ........ ::::: .. :::.: .. ':::::::;:::::::::::'

.! i)) )1)\))Hi\) \
.................... ":;:;:: :::.:::. ': :::' .. :;;'

.. : .. :::: ... ..

;.:.:.;.:.;.;.: :.:.:::......
•. .. .........
.. . :':::: ::::: ":: :: ::::: : :::::

(c)
(d)
FIG. 1. Two-dimensional point-lattices; computer-drawn, with disturbances increasing from left to right.
(a) Crystal. (b) Fibrous material (fiber axis vertical) in which, especially, the horizontal lines are not distorted
and proceed equidistantly. (c) Metal melt with relative distance variations of 10 percent and without any
partial crystalline residue. (d) Compressed gas, idealized by building blocks sorted in lines and columns.
879 PARACRYSTALS

combinations of y with z as long as they lead


to the same x. This means that one has to
ensure that z= x - y and only y varies. This
leads to the following "convolution integral"*

H2 (x) = JH(y)H(x - y) dy.

In order to derive the distance statistics be-


tween third neighbors one has to apply this
folding process twice, etc. In this manner the
statistics of the whole distorted lattice is derived
from knowledge of the neighbor relationship.
Since the scatter of distance increases with
the distance of the building blocks from each
other, the remote order is lost in the paracrys-
talline lattices. In Fig. I a few two-dimensional
examples are given, which demonstrate that the
paracrystal contains the gaseous as well as the
crytalline state as degenerated exceptions.
Macroscopic Examples Nature, as it is di-
rectly visible to us, offers many more examples
of paracrystalline structures than one would
normally believe. Figure 2, for instance, shows
a paracrystalline laminar lattice created by water
running off of a sand beach. Repeatedly new
waves are introduced; which correspond to
step dislocations in solid-state physics. The
statistical deviations in the direction and in the
spacing of the waves obey the laws of para-
crystals. The same holds for the two-dimen- FIG. 2. Beach at the Tagliamento as an example of a
sionallattices, which are exhibited by the grains paracrystalline laminate lattice, as it appears also in
biological and synthetic fiber structures transverse to
the fiber axis.
*In pure mathematics "convolution integrals" have
been well known for a long time, particularly in the
theory of the Laplace transformation. (compare, i.e., of the corncobs in Fig. 3. Since they are
(15)). Their importance for mathematical physics has statistically of various sizes, only a paracrys-
become obvious only in the last few years, specifically tal1ine lattice can originate. This is clearly
since more and more numerical problems have become evidenced in Fig. 4. by the experiment with
solvable through the use of modern computers. steel balls of two different sizes located in the

FIG. 3. Two corn cobs, wrapped in cellophane, the grains of which build a multitude of two-dimensional para-
crystals with disturbances.
PARACRYSTALS 880

(a)

(b)

FIG. 4. Identical steel balls on a slightly sagging support. (a) In the middle section, a few balls are carefully re-
placed with a pair of tweezers by slightly larger ones, without disturbing the surrounding area. (b) The same after
shaking lightly. The large balls make space for themselves, which is done by way of step dislocations in the sur-
rounding host-lattice. (Two can be seen in upper left corner.) (c) The large balls have reached the support and
thus created a paracrystalline lattice in the center.
881 PARACRYSTALS

center of the structure. Outside all balls have to 1/100 /1 -100)\) supply us with excellent
the same diameter and they form the crytalline examples of paracrystalline laminar structures
lattice in whose center the paracrystal is em- analogous to Fig. 2. As soon as one transforms
bedded. them into fibers, for instance in a spin process,
Electron-microscopic Examples Synthetic one recognizes in the electron microscope that
high polymers in colloidal dimensions (below I /1 they consist of subfibers ("ultrafibers") of 100

FIG . 5. Electron micrograph of a paracrystalline laminar structure, transverse to fiber, in Teflon (14). Chain
molecules are located in direction of arrow. Fibrils are cross-striped. Interval of laminates is approximately 500 A.

I 0./ jJ..

FIG. 6. Electron micrograph (light-field) of a small section of a scraped Teflon surface, where a few ultrafibrils
have splintered off (14). The arrows, in an interval of approximately 500 A, designate the boundaries of the
individual blocks. These blocks each consist of about 8 disks that either are oriented vertically to the fiber (A) or
are partly inclined due to paracrystalline disturbances (B) or bent (C) or buckled.
PARACRYSTALS 882

to 500A. In these, crystalline and amorphous by the structures. As an example, Fig. 7 shows
regions, with a paracrystalline period between the Fraunhofer diffraction patterns of the
50 and I oooA (depending on treatment), follow greatly reduced, transparent negatives of the
each other as in a string of pearls. By the models in Fig. I, which have been produced
assembling of ultrafibers into a microfiber the with monochromatic visible light . Through their
laminar structure originates (Fig. 5). But even study the static parameters of the para crystalline
in isotropic material (cast in the form of plates) lattices can be calculated.
one finds these ultra fibers (14) as is shown in X-ray Interferences In order to generate
Fig. 6. Here , the crystalline ranges, being ap- interference patterns of atomic structures, the
proximately 500 A thick and about equally wavelength used must be smaller than the
high, consist of eight lamellae. These cross length of the edges of the lattice cells. This,
striations can be found in almost all biological for instance, is the case for x-rays which
fibers. However, here one reaches the limit of emanate from a copper anode when it is ir-
resolution of the electron microscope . radiated with electrons. The larger the lattice
Interferences The only route to take from cells the closer the interference points move
here is to investigate the interferences generated together and they finally yield, as for instance

(a) (b)

(c) tel)

FIG . 7. Fraunhofer diffraction pattern made from the reduced transparent negatives of Fig. 1 with a He, Ne
laser (13).
883 PARACRYSTALS

MODEL OF LINEAR POLYETHYl.ENE

FIG. 9. Principal figure of the structure of a synthetic


material consisting of long molecules. The paracrystals
consist of 100 to 500 A thick bundles of parallel-
oriented chain molecules. On the front planes, part of
the molecules is folded back in larger or smaller loops;
some of these run, stretched out, into the next para-
crystallite and some are more or less bent. On cold
FIG. 8. X-ray small angle diagram of (j-keratin of the drawing, chains of paracrystallites split into ultra-
quill of a seagull (23). Observe the similarity with fibrils, separating from each other (left); whereas,
Fig. 7(b). From the angle position of the interferences with heat-treatment they join again close to each
one can calculate that the bent lines parallel to the other, whereby the paracrystallites form a para-
fiber have a distance from each other of about 12 A, crystalline, laminate-super-Iattice (center). When
whereas the points along these lines, i.e., the centers of grown from solution, the portion of the chains which
gravity of the basic building blocks of the proto fibrils, does not fold back decreases with the degree of dilu-
have a distance of 183A. This means that the basic tion (right).
building block is very long and thin.
drawing, where the fibrils separate spatialy. In
in Fig. 8, a so called x-ray angle diagram. The these lateral grain boundaries, as in the grain
one shown here was the first with which it boundaries of a crystal, an increased disorder
could be proven that in biological materials exists. This increased disorder is thought to be
para crystals exist in submicroscopic dimension caused mainly by a high concentration of
(17). Their building blocks, being long, thin "kinks" as displayed in Fig. 10 [according to
structures, have the characteristics of fibers, Pechhold (19) J •
since they can deviate along the fiber only in Influence of Mechanical Properties In the
direction and not in distance from each other. lateral grain boundaries an increased mobility
Superstructures in Polyethylene Similarly, of the chains relative to each other exists,
large lattice cells are exhibited also by the so- because kinks can move along the chain by
called "superstructures" which frequently occur means of the simple flipping-over of some of
in synthetic polymers. As an example, a few the CH 2 groups, and the chain therefore creeps
results from polyethylenes are given which are on like a caterpillar according to Renecker (20).
representative of many plastic products with If one clamps, for instance, both ends of a
long molecules and which have been obtained synthetic fiber and moves them parallel to each
from the analysis of large and small angle x-ray other as shown in Fig. 11, one can see in the
interferences (18). The fundamental Fig. 9 small inserted diagram that the fibrils glide
shows that a lattice cell of the superstructure relative to each other. This explains why metal
consists (in the fiber direction) of many basic fibers are much less flexible, and glass fibers
chemical building blocks C2 H4 (their CG marked can only be bent elastically. In Fig. 12 the
by a dot) and is limited in the fiber direction by influence of heat treatment on the superstruc-
amorphous regions in which the chain-back- ture of stretched polyethylene is shown, by
folding takes place. The lateral extension in the which the mechanical properties are changed.
extreme left is particularly evident during cold With increasing temperature the crystalline re-
PARACR YSTALS 884

I
" • •••• .
I-I • •

I
I.
••
.,'. --·.1
,.
... .._••

• II" II ••, •
I' , • '
1,1," I •••••
'

__ ...••J
·· ..'·.1•••'·1
I • • •
••••••
•..•
1 2

. , .. .
FIG. 10. Model for various kinds of "kinks" in linear ..-......'
. ·1-· -. --•••.
II'

..
., •
.......I.
• ••••
I•
., """
11ft''''''''

.
polyethylene after Pechhold (19). The white half-
spheres represent the hydrogen atoms. They mask the I •
"
black-symbolized Carbon atoms to a great extent. In
-. , -_ •••••• 1
the crystalline state (extreme left) the CH2-gIOUPS are I •• I /t1 ~ ~"
"·1 ••,_,1 ~~ ~
Willi

"
arranged in zig-zag fashion. I •••• lc1
3

- -
..........•..
,.----.---- ....
--_.- ..
~
~

..'."'..
II. •
• 1

•1
............ -

·1,···········1.
- ____ I.
5
FIG. 12. Influence of heat treatment on the super-
structures of polyethylene (18). 1, Cold-stretched.
Splintering into ultrafibrils. 2, 500 hours tempered at
110°C. The empty spaces or vacancies have disap-
peared; the front planes are more distinctly defined.
No other changes have occurred. 3,5 minutes at 120°C
produces additionally an elongation in cross-section of
the paracrystals. 4,10 minutes at 120°C causes growth
of the average value of the lengths at constant density,
but gives a more diffuse front plane. 5,500 hours at
120°C recrystallizes the adjacent ultrafibrils at the
FIG. 11. Bending of a hot-worked piece of poly- same time generating thicker paracrystallites with
ethylene (schematic). From the x-ray small-angle dia- smaller disturbances. The density and the average
gram (at the top) one calculates the superstructures. laminate interval grow, and the front planes are again
On the left, before, and on the right, after, bending of well defined.
both ends of the piece. The crystalline ranges are
marked black. ferences. According to Fig. 4 they grow with
the mixing ratio and the differences in size of
regions grow and the fiber finally loses its the building blocks. Thus, one gets information
desirable properties. on their shape when one knows their concen-
Paracrystalline Atomic Lattices As Fig. 7(c) tration. In this manner, the ammonia catalyst
shows, the widths of the reflexes increase with (enriched with 3 weight per cent aluminum)
growing scatter angle the larger the paracrystal- was investigated. It distinguished itself by the
line disturbances are. * These disturbances can fact that its inner surfaces do not decline even
qualitatively be determined by accurate mea- with long service times, thus keeping the con-
surements of line profiles of x-ray angle inter- nected regions equally small. The result is
shown in Fig. 13: A considerable part of the
*Unlike the widening of reflexes through internal aluminum is built into the cubic-centered a-iron
stresses, these vary not in a linear fashion, but with the lattice in form of FeAl 2 0 4 "motives" or units.
square of the sine of the Bragg angle. One unit displaces seven iron atoms. It has the
885 PARAMAGNETISM

"Investigations and Considerations concerning


Paracrystallinity. "
3. Hosemann, R., Zs. f. Phsy., 128, 1 (1950).
4. laue, M., F. Friedrich und P. Knipping (1912)
Sitzungs Ber. Bayer. Akad. Wiss. Math. Phys. Kl,
303,363.
5. Boltzmann, L. (1895) "Vorlesungen iiber Gas-
theorie," Johann Ambrosius, Leipzig.
6. Bernal, J. D., Nature, 185,68 (1960).
7. Kaplow, F., Strong, S. L., and Averbach, B. L.,
Phys. Rev., 138 A, 1336 (1965).
8. Haiiy, R. J., Journ. de Phys. Paris (1782).
9. Seeber, L. A., Gilberts Annalen, 76, 349 (1824).
10. Hermans, J. J., Rec. Trav. Chim. Pays.-Bas., 63,
5 (1944).
11. Zernike, F., and Prins, J. A., Zs. Phys., 41,184
(1927).
12. Debye, P. P., Phys. Zs., 31, 348 (1930).
13. Hosemann, R., and Miiller, B., Mol. Cryst. and
FIG. 13. Atom model of the ammonia catalyst, Liqu. Cryst., 10,273 (1970).
promoted with 3 weight per cent aluminum. FeAl204 14. O'leary, K. J., "Dissertation," Case Institite,
units are built into the metallic iron atom lattice (.) Oeveland, Ohio (1965).
under the substitution of seven atoms each, which fit 15. Doetsch, G., "Theorie und Anwendung der la-
into the lattice only in respect to their volume but not place-Transformation," Springer Verlag, Leipzig,
their shape. The unit consists therefore of one iron ion 1937.
(0), two aluminum ions (0), and four oxygen ions (0). 16. Hosemann, R., and Bagchi, S. N., "Direct Analysis
of Diffraction by Matter," North. Holl. Publ.
Comp., Amsterdam, 1962.
17. Hosemann, R., "Die Erforschung der Struktur
same volume, but not the same shape as these, a hochmolekularer und Kolloider Stoffe mittels
fact which explains the measured paracrystalline Kleinwinkelstreuung," Erg, d, Ex. Nat. JI'iss., 24,
disturbances quantitatively (21). Similar mea- 142-221, Springer Verlag, Berlin, 1951.
surements on polyethylenes demonstrated that 18. Loboda, J., Hosemann, R., and WIlke, W., Koll.
the disturbances in the transverse direction of Zs. u. Zs. Polym., 235,1162 (1969).
the paracrystalline micro fibrils become smaller 19. Pechhold, W., and Blasenbrey, S., Koll. Zs. u. Zs.
with their extension. In all the cases mentioned Polym., 235, 216 (1967).
here, the distance variations are at the largest, 20. Renecker, D. H., J. Polym. Sci., 59, 39 (1962).
3 per cent, which is hardly recognizable in 21. Preisinger, A., Hosemann, R., and Vogel, W.,
models. Figure l(c), in comparison, with 10 per Ber. Buns. Ges. Phys. Chem., 70, 796 (1966).
cent relative paracrystalline disturbance varia- 22. Lemm, K., Mol. Cryst. and Liqu. Cryst., 10, 259
tions, corresponds to those values which are (1970).
found for the very small lattice ranges in
metallic melts (22). Cross-references: COLLOIDS, THERMODYNAMICS
Outlook The sketched examples may dem- OF; CRYSTALLIZATION; CRYSTALLOGRAPHY;
onstrate that the theory of paracrystals can DIFFRACTION BY MATTER AND DIFFRACTION
be applied to a multitude of groups of sub-
GRATINGS; ELECTRON MICROSCOPE; INTER-
stances, and that it furnishes quantitative criteria
FERENCE AND INTERFEROMETRY; LIQUID
for the definition of these substances through
STATE; MICROP ARACRYSTALS; MICROPARA-
a series of novel statistical structure parameters.
CRYSTALS, EQUILIBRIUM STATE OF.
One resu}J, already of interest in colloid-chem-
istry, is that the lattice ranges are, and remain,
sm8J.ler the larger their paracrystalline distur-
bances are. The large and stable inner surface of PARAMAGNETISM
the ammonia catalyst and the remarkable simi-
larity between many biological and synthetic Classically, paramagnetism is defined as the ac-
super-structures, as well as the similarity of a quisition of a magnetization M that lies along
metal structure above and below the melting the direction of an applied magnetic field H.
point, indicate that new fundamentals have been This definition implies that when no field is
found upon which one could build. applied, the magnetization is zero, which in
essence eliminates ferromagnetism and ferri-
ROLF HOSEMANN
magnetism. In the light of today's knowledge,
at least one caveat must be added; specifically,
antiferromagnetism must be excluded. How-
References ever, ferro-, ferri-, and antiferromagnets above
1. Stanley, W. M., Science, 81, 644 (1935). their ordering temperature behave as paramag-
2. Rinne, F., Trans. Farad. Soc., 29, 1016 (1933), nets. Temperature-dependent and temperature-
-
PARAMAGNETISM 886

the dipoles, then either a classical or quantum


mechanical calculation shows that M is linearly
related to H provided IA • H «kT. The suscep-
tibility, defined as X =MIH, is found to be given
H by X = CIT, where C is a constant. This result is
known as Curie's law, and indeed several sys-
tems are found to follow this relationship. If
there are interactions between the dipoles, an
FIG. 1. elementary and oversimplified approach yields
the Curie-Weiss law, X = C/(T - 8), where 8 is
another constant which is a measure of the in-
independent paramagnetism have different or- teraction strength; some materials have suscep-
igins, and require separate discussion. tibilities that can be fitted to this equation.
Temperature-dependent paramagnetism can If H is very large, or T is very low, then the
be conveniently discussed by considering a col- dipoles will tend toward complete alignment.
lection (ensemble) of magnetic dipoles, as in- The magnetization at the temperature T, M(T),
dicated in Fig. 1. In general the dipoles may compared to its value at absolute zero, M(O), is
have different magnetic moments, but for sim- then given by the Brillouin junction, BJ(x), viz.,
plicity they will be assumed to have the same
moment IA. Then the vector sum over a unit M(T)/M(O) = BJ(x)
volume V yields the magnetization, that is,
kV IA = M. The dipoles may be located at ran- 21+1 21+1
= - - - coth - - - x
dom positions in configurational space, as in a 21 2J
gas or vapor, or at sites with some short-range
order, as in a liquid or an amorphous solid, or 1 x
at sites with translational symmetry, as in a - - coth-
crystalline solid. Thermal energy, kT, where k 21 21
is Boltzmann's constant and T is the absolute
temperature, will then ensure that the moments where J is the total angular quantum number,
are distributed over the possible orientations. and x =IA . H/kT. In the classical limit (J -+ 00),
Then if H = 0, it follows that M =O. However, BJ =L(x) = coth x - l/x, where L(x) is called
if a magnetic field is applied, the dipoles will the Langevin junction. For J = S = where St,
tend to align along the field direction, in order is the spin quantum number, B I/ 2 (X) = tanhx.
to minimize the interaction (Zeeman) energy, Plots of B 1/2 (x), B 5/2 (x), and L (x) are shown
-IA . H. Thermal agitation will normally ensure in Fig. 2. For small x the curves approach
that complete alignment of the dipoles will not straight lines; this is the Curie Law region.
occur. The dipoles of the ensemble can be realized
Classically all dipole orientations are allowed, in a variety of ways. Atoms, ions, or molecules
whereas quantum mechanics requires spatial with unpaired electrons, that is, with nonzero
quantization. If no interactions occur between angular momentum G, (orbital and/or spin),

1.0

x
J
CD
II

-
0
~
.........
0.5

-
I-
~

0.0
0.0 5.0 10.0 15.0 20.0
X
FIG. 2.
887 PARAMAGNETISM

have magnetic dipole moments p.. These moving act with the ligand fields much more strongly
electrons are essentially equivalent to an elec- than the 4f electrons of the rare earths. As a
tric current, and give Ijse to a permanent mag- result, the orbital energy levels are split. Then
netic moment given by the dipole moments are largely determined by
the spin only, and often a Curie or Curie-Weiss
e Law is observed, again if the temperature is not
p.= - - G, too low. Indeed, the susceptibility is a measure
2mc
of the paramagnetic ion's energy levels and the
where e and m are the charge and mass of the number of ions in each level, as determined by
electron, respectively, and c is the velocity of the principles of statistical mechanics. In prin-
light. Since G is quantized in integral or half- ciple, a susceptibility measurement provides
integral units of Planck's constant, 11, it is information on the energy levels of the para-
convenient to introduce a unit of magnetic magnetic ion. In practice, details of these en-
moment, called the Bohr magneton, iJ.B,: ergy levels can be deduced from X only for a
and given by iJ.B = eff/2mc, with a value of few special cases. Instead, the procedure today
9.2741 X 10- 21 erg G-I (9.2741 X 10-24 J T-I is to determine these energy levels by other
in SI). For a free atom or ion the dipole mo- methods, usually electron paramagnetic (spin)
ment is given by p. =giJ.B J, where g, called the resonance, and then to calculate the suscepti-
g-factor or spectroscopic splitting factor, is the bility. Other techniques used to deduce the
Lande formula of atomic spectroscopy. For magnetic energy levels include specific heats,
ions in a solid, the same form of equation may magnetooptics, and Mossbauer spectroscopy.
be used, except that g is essentially a phenom- In some compounds, the paramagnetic ions
enological constant that reflects the splitting of may be covalently bound, that is, electrons are
the orbital and spin states by the internal inter- shared with the diamagnetic ions. The dipole
actions. Atomic and molecular electronic dipole moments are then somewhat spread out spa-
moments are usually of the order of one Bohr tially, and the localized permanent dipole mo-
magnet on. ments of Fig. 1 become somewhat fuzzy. Co-
Examples of paramagnetic gases are sodium valent interactions are usually important for
vapor and nitric oxide. Even though molecular ions of the palladium (Sd) and platinum (6d)
oxygen, O 2 , has an even number of electrons, groups.
its ground state has S = 1, and consequently it Other examples of electronic permanent di-
is a paramagnet. Other molecular paramagnets pole systems include electrons trapped at va-
are organic free radicals and biradicals. cant lattice sites (F-centers), radiation damaged
By far the largest number of substances with bonds, and localized donors and acceptors in
permanent dipole moments are liquids or solids semiconductors. Next, the nucleus may also
containing ions of transition group elements, have a nonzero angular momentum, and hence
that is, those with partially filled 3d, 4d, Sd, a permanent magnetic moment. It is much
4f, or Sf electron shells. The lanthanides, com- smaller (-10- 3) than the electronic magnetic
prising the elements from La to Lu in the pe- moment, and consequently the nuclear con-
riodic table, usually in the trivalent state, have tribution to the susceptibility is usually negli-
0-14 4f electrons. The 4f electrons, which are gible. However, the nuclear susceptibility of
surrounded by outer Ss and Sp electrons, are solid (atomic) hydrogen, water, and C6 (CH 3 )6
situated well to the interior of the ions. Hence, has been detected. Nuclear magnetic energy
the dipoles almost behave as if the ions were levels are usually investigated by nuclear mag-
free, and often obey Curie's Law fairly well, at netic resonance (NMR).
least provided the temperature is not too low. Small particles, about 10 nm in diameter and
However, the electric crystalline fields of the containing transition atoms, may be ferro- or
diamagnetic ions (ligands) in a solid do interact ferrimagnetic; examples are iron or iron oxide
with the 4f electrons, and lead to susceptibility (Fe304 or ,},-Fe203)' Thus, in the context of
changes that are important at low temperatures. Fig. I, the permanent magnetic moment of
In recent years, interest in the actinides or Sf each particle is about iJ. = 10 6 iJ.B. Thermal agi-
group has increased. The trivalent ions, Th 3+, tation may still be sufficient to overcome any
Pa 3+, U3+, Np3+, Pu3+, Am 3+, and Cm3+ have anisotropy in each particle, and hence cause
1-7 Sf electrons, respectively. Complex ion oscillations of the magnetic moment between
groups, such as (U0 2 )2+, (Np02 )2+, and various energy minima. Since the time average
(PU02)2+ are also formed with 0, I, and 2 un- of the moment of a particle is zero, the particle
paired Sf electrons, respectively. Both spin- is said to be a superparamagnet. Application of
orbit coupling and crystal-field interactions a field to the system of particles will then pro-
tend to be more important in the actinides than duce a magnetization given by the Langevin
the lanthanides. function.
The 3d group are usually divalent, but may be Electrons in an unfilled energy band, as in a
in trivalent or other states; this group includes metal or alloy, give rise to a weak paramagne-
Cr2+ Cr3+ Mn2+ Fe2+ Fe 3+ C02+ Ni 2+ and tism that is essentially temperature indepen-
Cu 2+'. The 'relativ~ly exposed 3d electrons inter- dent. These electrons have a magnetic moment
PARAMAGNETISM 888

E magnetic susceptibility must be considered


when the temperature is high, when the relative
numbers of atoms or ions with permanent mo-
ments are small, or when Pauli or Van Vleck
paramagnetism dominates. If 11 = I1B, then the
paramagnetic susceptibility for a mole of di-
poles is about 10- 4 at room temperature, and
10-2 at I K (cgs units). Pauli susceptibilities are
of the order of 10- 6 to 10- 5 emu cm- 3 .
A. H. MORRISH
2f.L.H
o ---...,;~-+--~- N (E) References
Boudreaux, E. A., and Mulay, L. N., "Theory and Ap-
plications of Molecular Paramagnetism," New York,
John Wiley & Sons, 1976.
FIG. 3. Morrish, A. H., "Physical Principles of Magnetism,"
Huntington, N.Y., Krieger, 1980 (reprinted).
Vonsovskii, S. V., "Magnetism," Vol. 1, New York,
t
11 ~ I1B since S = and g ~ 2. Suppose the elec- Halsted Press, 1974. (Translated from the Russian,
tron gas is considered as two subsystems, with "Magnetizm," Moscow, Nauka, 1971.)
half the momehts lying along a given direction,
say up, and half in the opposite direction, Cross-references: ANTI FERROMAGNETISM, CAL-
down. The net moment is of course zero until ORIMETRY, FERRIMAGNETISM, FERROMAG-
a magnetic field is applied. Then the energy of NETISM, MAGNETIC RESONANCE, MAGNETISM,
the electrons with magnetic moments parallel MOSSBAUER EFFECT.
to H is decreased by I1BH and the energy of
those antiparallel is increased by I1BH. Thermo-
dynamic equilibrium then requires that elec- PARITY
trons with antiparallel moments enter the
parallel-moment subsystem until the two sub- The theoretical description of any physical pro-
systems are filled to the same energy levels, that cess in the relativistic domain must remain in-
is, the Fermi levels of the two systems become variant under a Lorentz transformation. In a
equal, as shown in Fig. 3. The susceptibility, classical theory, where the interactions depend
first calculated by Pauli, is on the relative spatial separation of the posi-
tion coordinates and the relative temporal
3 NI1 B2 separation of two instants of time implying a
X= 2EF(O) finite velocity of propagation of the inter-
action, the description of the interaction must
at absolute zero, where N is the num ber of con- be independent of the coordinate frame in
duction electrons per unit volume and EF(O) is which the process is described. A proper
the Fermi energy at T = 0 K. Since the Fermi Lorentz transformation from one frame S to
energy changes slightly with temperature, then another S' is given by
so also does the susceptibility.
Second-order quantum mechanical perturba- XJ.I.' = aJ.l.vxv
tion theory applied to. atomic electron wave where
functions also yields a small temperature-
independent susceptibility. This second-order XJ.I. == (XI, X2, x3, X4) = (x, y, Z, ict),
Zeeman effect was first derived by Van Vleck.
The order of magnitude of the Van Vleck sus- and a summation is to be made over repeated
ceptibility is given by indices. (This summation convention applies
throughout the present article.) Similarly,
NI1B2
X0:: pJ.I.' =aJ.l.vPv
En - Eo
where
where En is the energy of the nth excited state
and Eo is the ground state. PJ.I. == (PI, P2, P3, P4) = (Px, Py, Pz, iE/c).
In addition to the paramagnetic susceptibility,
there is always a diamagnetic one, and it is the The four coordinates xJ.I. are collectively
sum of these two that is measured in an experi- called the space-time coordinates and the four
ment. For permanent moments of about one quantities PJ.I. are collectively called the four
Bohr magneton, the diamagnetic contribution momenta, the first three being the spatial
is small, and may often be neglected. The dia- components of the momentum and the fourth
889 PARITY

being proportional to the energy within a factor ner time reversal. The laws of classical physics,
i/c. The aJ,lV are constants independent of the apart from statistical effects such as frictional
space-time coordinates such that the relativistic forces and electrical resistance, are invariant
interval is to remain invariant: under time reversal where one has the normal
motion in reverse.
XJ,lXJ,I =x/XJ,I' =aJ,lVxVaJ,lTxT Time inversion is actually the product of time
reversal and charge conjugation, an operation
PJ,lPJ,l = p/PJ,l' = aJ,lVPVaJ,lTPT = -m 2 c 2 • where all particles are changed into their anti-
This invariance requires that particles. Charge conjugation is a relativistic
quantum-mechanical concept only, and like
aJ,lVaJ,lT = 8vr space and time inversion, is a discontinuous
operation. In the limit where a nonrelativistic
and, in order to preserve the reality conditions approximation is valid the operation of charge
of the space-time coordinates, aJ,lv with Il, v = 1 conjugation is not meaningful, so only space
2, 3, and a44 are real, while aJ,l4 and a4J,1 are inversion and time reversal have physical mean-
imaginary. ing there.
These transformations are continuous, being In quantum mechanics, the interacting ob-
obtained from a series of infinitesimal trans- jects such as elementary particles, atoms, and
formations from unity, and are characterized nuclei are described by a wave function 1/1 (r, t)
by det(aJ,lv) = 1. They do not change the direc- which is itself a function of the space-time
tion of the time axis. coordinates. Here a symmetry property called
Such a formalism can be used to describe parity is introduced to express the equivalence
proper rotations of the physical system and between left and right. If we denote by 1/1 (-r, t)
boosts to a different velocity of the physical the wave function in the space-reflected system,
system with respect to the space-time co- then
ordinate system. The requirement of equiva-
lence between different Lorentz frames leads VCr, t) = TiP 1/1 (- r, t).
directly to the laws of conservation of four- The space inversion quantum number TiP is
momentum and angular momentum . called parity and has two eigenvalues, ±l. When
In addition, there are the improper Lorentz Tip = I the wave function is said to have even
transformations of space inversion and time in- parity and when TiP = -1 the wave function is
version, which are discontinuous, unlike the said to have odd parity.
proper Lorentz transformations. These im- The supposition that any nondegenerate
proper transformations each have the property stationary quantum-mechanical state of an ob-
that det(aJ,lv) = - I. The transformed system ject, such as an atom or nucleus, will have a
under space inversion is definite parity has led to very powerful conser-
vation laws in the description of physical pro-
X/=-Xj, where i=I,2,3, cesses involving interactions of atoms or nuclei.
and that under time inversion is The principle that the total parity of the system
was conserved was regarded until recently as
X4' = -X4' a fundamental principle of physics.
Stated formally, if P is a unitary operator
Clearly, XJ,I'XJ,I' = XJ,lXJ,I for either case or for that inverts the space coordinates through the
both taken together. It should be noted that origin such that
the reversal of the space coordinates changes a
right-handed coordinate system to a left-handed PI/I(r, t) = I/I'(r', t) = TiP 1/1 (-r, t)
system, or vice versa. This implies that if there
is a physical system in nature that has a definite then the wave equation
handedness it is also possible to have a system
of the opposite handedness obeying the same HI/I(r, t) =EI/I(r, t)
physical laws. Classically, this invariance under can be transformed into
reflections leads to no conservation law in the
way that invariance under rotations leads to the PHp-l I/I'(r', t) = EI/I'(r', t).
conservation of angular momentum . This is not
so in quantum mechanics . The wave equation in the transformed coordi-
In a relativistic theory time inversion is not nate system is unchanged provided that PHP-l =
equivalent to an operation where every velocity H or PH - HP = O. The Heisenberg relation for
is replaced by the opposite velocity so that the a time rate of change of an operator P is
position of the particle at +t becomes the same
as it was, without time inversion, at - t. This lat- dP 11
ter operation may properly be called velocity
- =- (HP- PH).
dt i
reversal or motion reversal, but in quantum
mechanics or quantum field theory it is usually, Therefore P is a constant of motion provided
perhaps illogically, called time reversal or Wig- it commutes with the Hamiltonian operator.
PARITY 890

Operating twice with P on any wave function TABLE 2. TRANSFORMATION OF OBSERV-


gives the same wave function, i.e., p2 = 1, so ABLES UNDER SPACE INVER-
the only possible eigenvalues of Pare +1 and -1. SION, TIME REVERSAL, CHARGE
It is now well established that the strong CONJUGATION, AND TIME
interactions between elementary particles and INVERSION
the electromagnetic interactions are invariant
under space inversion, so any nondegenerate Observable P T C P4
energy eigenstate of an atom or nucleus will be
also an eigenstate of the parity operator P. Sim- Position x -x x x x
ilar statements can be made for the time re-
versal operator T and the charge conjugation Time t t -t t -t
Velocity y -y -y y -y
operator C.
Momentum p -p -p p -p
A quantum-mechanical system such as an
atom or nucleus is usually described by wave Energy E E E E E
Mass m m m m m
functions in the angular momentum representa-
Orbital angular
tion. The one-particle wave functions of the momentum Q Q -Q Q -Q
individual constituents are a product of an Spin angular
intrinsic part that may be also an eigenfunc- momentum s s -s s -s
tion of the spin angular momentum operator Charge e e e
and an orbital part that is an eigenfunction of -e -e
Force f -f f f f
the orbital angular momentum operator Qz. The Acceleration a -a a a a
latter eigenfunction is a spherical harmonic
Y Qm (0, </» having an eigenvalue (- I)Q of the
space inversion operator P. The total parity of
the system is then the product of the parities In Table 2 the transformation properties of
of the constituents. The intrinsic parity of any various physical observables are listed. The
elementary particle cannot be determined transformed observables are independent of the
theoretically, nor can the eigenvalues of the values of Tip, TIT, and Tic, of the fields.
time-reversal operator T and the charge-conju- The phases Tip, TIT, Tic, etc. for those parti-
gation operator C, often called time parity and cles that can be created or destroyed can be
charge parity. found experimentally from the properties of
For a meaningful and satisfactory formula- their mutual interactions, assuming conserva-
tion of charge conjugation and time reversal it tion of P, T, or C in these interactions. For ex-
is essential to consider the field operators in a ample, the 1To neutral meson decays into two
quantized theory, taking into account the photons; an analysis shows that its intrinsic
ordering of operators and their commutation parity TiP' = -1 independently of the TiP" for
relations appropriate to Bose statistics for the electromagnetic field. The 1To, having odd
integer spins and Fermi statistics for half- parity and zero spin, is therefore a pseudo-
integer spins. The wave function for a particle scalar particle, as opposed to a scalar particle
is the one-particle expectation value of the having even parity and zero spin. The charged
field operator. In Table 1 the transformation mesons 1T± also have TiP = -1, but charge con-
properties of various kinds of field operators servation prevents their decay into two photons
are listed for the linear transformations P and C so that an interaction where the 1T± are created
and the antilinear transformation T. These or absorbed through interactions with nucleons
transformation properties are obtained by con- must be considered. The number of nucleons
sidering the invariance of the free-field equa- is conserved, so only the product !Tlp!2 = 1 is
tions of motion. determined for the nucleons. Similarly, the

TABLE I. TRANSFORMATION OF INTRINSIC FIELDS UNDER SPACE INVERSION,


TIME REVERSAL, AND CHARGE CONJUGATION

Intrinsic Field P T C

Scalar 'P(r, t) IIp''P(-r, t) lIT''P(r, t) lIC''P+(r. t)


Dirac spinor Hr, t) IIp'Y41/1(-r, 1) lIT'Y3'Y1 I/I*(r, -t) lIcY21/1 *(r, t)
Four vector A/-I(r, t) 1Ip',(-1)6/-14A/-I(-r, t) lIT,,(-1)6/-14A *(r, -t) lIC"A /-I *(r, t)
Pseudoscalar 1T(r, t) 1Ip'''1T(-r. t) lIT"'1T*(r, -t) lIC"'1T*(r, t)
Electric field &(r, t) IIp&(-r, t) -lIT"&(r, -t) lIc"&(r, t)
Magnetic field !B(r, t) -1lp"!B(-r, t) lIT"!B(r, -t) lIC,,!B(r, t)
Two-component I/I(v)(r, t) does not exist 11 TY3 'Yl 1/1 (v) (r, -t) does not exist
neutrino
891 PARITY

electromagnetic field interacting with an elec- acteristic lifetimes of the other multipoles are
tron current determines 71/' =-1 for the elec- long compared with the typical atomic colli-
tromagnetic field and I71p12 = 1 for the electrons. sion times, an alternative method of deexcita-
In analogous ways all the phases for those tion. The first concept of the use of parity in
particles with integer spin (bosons) can be atomic physics resulted from the experiments
found by experiment, while those with half- of Laporte showing that the initial and final
integer spin (fermions) cannot and, indeed, are states were always of opposite parity. This re-
indeterminate. Although the intrinsic parity sult is predicted in Table 3.
is indeterminate for fermions, the intrinsic In nuclei, collision plays little role, so all
parity for an antiparticle is opposite to that for electromagnetic multipoles can contribute, but
a particle, in contrast to the boson case; e.g., usually the lowest-order multipole comparable
the intrinsic parity is odd for e+ e- in an S state with the conservation of angular momentum
and even for 1(+1(- in an S state. Considering the and parity predominates. These restrictions are
conservation of C, P, and angular momentum J, known as angular momentum and parity se-
the only allowed modes of annihilation of an lection rules.
electron-positron system are a singlet-S system The parity of the probability density 1/I t 1/1 is
decaying into two photons and a triplet-S sys- always even. The expectation value of an oper-
tem decaying into three photons. ator () is 1/I t () 1/1 and vanishes if () is an odd func-
Conventional usage assigns 71P =+1, i.e., posi- tion under the parity operation. From the
tive parity, for fermions. Any fermion state of properties of the multipole operators, all odd
total angular momentum j can be specified by electric multipole moments and all even mag-
1/Ij£s where j = Q + s and has total parity (-1)£ netic multipole moments vanish. The restric-
by convention. Similar argume-ilts define a tion to nondegeneracy is essential and is satisfied
"time parity" and a "charge parity" for each for all atomic and nuclear systems. In molec-
quantum-mechanical state. Likewise, the elec- ular systems there are many observed static
tromagnetic field can be expanded into multi- electric dipole moments. These usually result
poles in an angular momentum representation; from the accidental degeneracy of coulomb
however, here the intrinsic parity is specified. wave functions with different orbital angular
A photon of angular momentum J can have two momenta.
possible parities, characterized by electric One of the best evidences for the conserva-
multipole radiation EJ, with parity (-1)1, tion of parity and invariance under time reversal
and magnetic multipole radiation MJ, with par- in electromagnetic interactions is the lack of
ity (-l)J±l. These different types result from an electric dipole moment in the neutron.
possible vector products of the intrinsic spin A plane wave of definite linear momentum
and the orbital angular momentum to give the can be decomposed into a sum of degenerate,
sameJ. even and odd, angular momentum states and is
Several examples of the use of parity con- not an eigenstate of parity. Localized wave
servation follow. Consider the interaction of packets can be obtained by superimposing
the electromagnetic field with a quantum- many plane waves. Any macroscopic classical
mechanical system, a transition from an initial body, so described, will be degenerate in states
state of definite angular momentum, Ii, and of opposite parities and thus can have such
parity, 1Ti, to a final state of definite lr and 1Tf, properties as electric dipole moments.
with the emission or absorption of electromag- In 1956 T. D. Lee and C. N. Yang, after a
netic radiation of angular momentum J and careful examination of experimental evidence,
parity 1T. The conservation of angular momen- concluded that there is abundant evidence for
tum and parity requires that parity conservation in strong and electro-
magnetic interactions, but could not find any
IIi - If I ~ J ~ Ii + If evidence for conservation in weak interactions,
including nuclear beta decay. They suggested
1Ti 1Tf1T = 1. several experiments to test parity conserva-
The possible transitions are shown in Table 3. tion in weak interactions.
In atoms, the predominant electromagnetic An experiment performed by C. S. Wu, E.
transitions are of electric dipole type. The char- Ambler, R. W. Hayward, D. D. Hoppes, and
R. P. Hudson in 1956 at the National Bureau of
Standards and published in 1957 demonstrated
that parity conservation was violated in nuclear
TABLE 3. ELECTROMAGNETIC MULTIPOLE beta decay. The experiment measured the
SELECTION RULES angular distribution of beta and gamma radia-
tion from radioactive 60Co nuclei that had been

~
0 cryogenically oriented. The angular distribu-
1*0 1 2 3 4 5 tion for the beta radiation obeyed a relation

+ Ml Ml E2 M3 E4 M5 (j) P
W(!,)) = 1 +a-
El El M2 E3 M4 E5 UI Ipi
PARITY 892

where j is the nuclear spin and p the momen- which can be continuously transformed into
tum of the beta particle. The angular distribu- one another. The postulate of space inversion
tion of the gamma radiation is a known func- symmetry allows us to infer the existence of
tion that depends only on even powers of j . k, a right-handed massless neutrino; however,
where k is the momentum of the photon, and there is no reflection symmetry, so the exis-
on numerical factors depending on the spin of tence of both right- and left-handed neutrinos
the nuclear states involved in the gamma in nature would be coincidental. Only if the
transition. neutrino had mass could the existence of both
A measurement of the gamma ray anisotropy, helicity states follow from the existence of a
in turn, determines the amount of nuclear po- single state and from the properties of proper
larization 0)1 Ii I. Inspection of Table 2 ir.dicates Lorentz transformations.
that momenta change sign under the parity At present, three charged leptons are known
operation while the spins do not. Neither the with nonzero mass: the electron, the fJ. lepton,
momenta nor spins change sign under the charge and the T lepton, each having associated with it
conjugation operation. The beta ray angular a left-handed neutrino, Ve, vJ.L, and vr, re-
distribution will change under the operations spectively. The Vr has not been observed di-
P or CP but not C, while the gamma ray angular rectly but may be conjectured from the dynam-
distribution is unaltered under C, P, or CPo The ical properties of the weak interaction.
measured beta ray angular distribution from In all the weak-interaction phenomena, which
60Co gave a value for the coefficient a = -vic, includes all decays of the elementarK particles,
apart from some small coulomb corrections. A except the electromagnetic decays 1T """* 2')' and
subsequent experiment measuring the angular
distribution of positrons from polarized S8CO
°
1: """* A + ')' that were experimentally investi-
gated prior to 1964, the violation of P and C
gave similar results but with the opposite sign invariance and conservation of CP and T in-
for the coefficient a. The nonzero value for the variance was fully established. In 1964, experi-
coefficient a indicates that space inversion in- ments at the Brookhaven National Laboratory
variance is violated. The opposite signs for the showed that the KO meson, which should de-
coefficient a, depending on whether electrons cay only into three 1T-mesons according to CP
or positrons are emitted, indicate that charge invariance, decayed a small fraction of the time
conjugation invariance is violated. An analysis into two 1T-mesons, a process violating CP in-
shows that both the P and C violations are variance. Vigorous experimental research since
maximal but that the product CP is conserved. that time has fully confirmed this CP violation
In the beta-decay process within the nucleus result, but no other process has been observed
we have a neutron making a transition to a pro- where CP is violated. There have been many
ton with the emission of an electron and an theoretical attempts to account for this small
antineutrino. The inverse process has a proton CP violation, but no fully satisfactory explana-
making a transition to a neutron, emitting a tion has emerged. However the small amount
positron and a neutrino. One of the most satis- of CP violating interaction in the decay of the
factory explanations of the violation of C and K mesons may account for the present pre-
P in the beta-decay process can be ascribed to ponderance of particles over antiparticles,
the properties of the neutrino. Rather than assuming that both were created in equal
being a particle which can be described by the amounts initially after the big bang occurring
Dirac equation with a four-component wave at the origin of the universe.
function, its zero mass permits a modification Whether or not there is invariance under the
of the Dirac equation to one that has a two- operations C, P, T, or CP individually, any local
component wave function. This two-component field theory must be invariant under the product
description, although invariant under proper CPT. This invariance is known as the CPT
Lorentz transformations, is not invariant under theorem. The combined operation is equivalent
Cor P. to the product of space inversion and time in-
Experiment shows that the antineutrino version and is often referred to as stronf reflec-
emitted in beta decay is right-handed, i.e., its tion. This transformation in which xJ.L = -xJ.L,
intrinsic spin is parallel to its momentum. although discontinuous, is characterized by
Noninvariance under P implies that there is no det(aj.lv} = I, and an equivalent transformation
left-handed antineutrino, while noninvariance can be achieved by a proper Lorentz trans-
under C implies that there is no right-handed formation, i.e., continuous rotations in space-
neutrino. Invariance under CP implies the time. At present only the CPT invariance re-
existence of a left-handed neutrino as well as mains unbroken experimentally.
the right-handed antineutrino. The symmetry
of right- and left-handedness in matter is broken RAYMOND W. HAYWARD
and replaced by a symmetry where matter has
a characteristic handedness and antimatter the
opposite handedness. References
A left-handed, massless neutrino appears as Sakurai, J. J., "Invariance Principles and Elementary
a left-handed neutrino in all Lorentz frames Particles," Princeton, Princeton Univ. Press, 1964.
893 PERIODIC LAW AND PERIODIC TABLE

DeBenedetti, Sergio, "Nuclear Interactions," New integral values increasing in order of increasing
York, John Wiley & Sons, 1964. average distance from the nucleus. The total
Lee, T. D., and Yang, C. N., "Question of Parity Con- capacity of each shell can be expressed as 2n2 ,
servation in Weak Interactions," Phys. Rev. 104, being 2 for n = 1,8 for n =2,18 for n =3, and
254 (1956). 32 for n = 4; no higher level actually contains
Wu, C. S., Ambler, E., Hayward, R. W., Hoppes, D. D., more than 32 electrons.
and Hudson, R. P., "Experimental Test of Parity These total capacities can easily be accounted
Conservation in Beta Decay," Phys. Rev. 105, for by the several quantum number restrictions
1413 (1957). and the Pauli principle. Within each principal
Christenson, J. H., Cronin, J. W., Fitch, V. L., and energy level are differently shaped regions called
TurJay, R., "Evidence for the 21T Decay of the "orbitals," that can be occupied by electrons.
KwO Meson,"·Phys. Rev. Lett. 13,138 (1964). The shape of each orbital is designated by the
Gatto, R., "A Basic Course in Modern Weak Inter- "orbital quantum number" t, which may only
action Theory," in "Weak Interactions" (M. Baldo have integral values from 0 up to n - 1. The
Ceolin, Ed.), Amsterdam, North-Holland Publish- number of orbitals of each shape that can exist
ing Co., 1979. within a principal quantum level depends on
the fact that an electron in an orbital is a charge
Cross-references: ANTIPARTICLES, CONSERVA- in motion and therefore has magnetic proper-
TION LAWS AND SYMMETRY, ELECTROMAG- ties which influence the orientation of the or-
NETIC THEORY, LORENTZ TRANSFORMATIONS, bital in an external magnetic field. The possible
NEUTRINO, QUANTUM THEORY, RELATIVITY, orientations are designated by the "orbital mag-
STRONG INTERACTIONS, WEAK INTERACTIONS. netic quantum number" m" which may have
values from 0 to plus or minus the orbital quan-
tum number t. Thus when n = I, t can only have
the value 0, which means that only one orbital
is possible, having orbital magnetic quantum
PERIODIC LAW AND PERIODIC TABLE number O. When n = 2, 1 can have the values 0

When the chemical elements are compared in


order of increasing atomic number, many of
m,
and 1. For the value 0, one orbital is possible,
but when t = 1, can have values 0, +1, and
-1, corresponding to three orbitals. Four or-
their physical and chemical properties are ob- bitals are therefore possible in the principal
served to vary periodically rather than ran- quantum level n = 2. When n = 3, the same
domly or steadily. This relationship, recognized kinds of four orbitals are possible, and in addi-
empirically a century ago by de Chancourtois in tion, 1 can equal 2. This gives five possible
France, Newlands in England, Lothar Meyer in values, for m,: 0, +1, +2, -1, and -2, corre-
Germany, and Mendeleev in Russia, is now sponding to five more orbitals for a total of 9.
known to be the logical and inevitable con- When n = 4, the same kinds of 9 orbitals are
sequence of the fundamental periodicity of possible, and in addition I can equal 3. This
atomic structure. The familiar statment of the gives seven possible values for m,: 0, + 1, +2, +3,
periodic law is this: "The properties of the -1, -2, and -3, corresponding to 7 more or-
chemical elements vary periodically with their bitals for a total of 16. No principal quantum
atomic number." A more informative statement level uses more than 16 orbitals even though
of this same law is: The atomic structures of more are theoretically possible when n = 5 or
the chemical elements vary periodically with more.
their atomic number; all physical and chemical One additional property of an electron in an
properties that depend on atomic structure atom needs to be considered. This is its pro-
therefore tend also to vary periodically with perty as a magnet, irrespective of its orbital
atomic number. motion. This is designated by the "spin mag-
The periodicity of atomic structure (see netic quantum number," which can have only
ATOMIC PHYSICS) is described by quantum the values +112 and - 1/2. Since each orbital is
theory as developed through modern wave me- uniquely specified by the first three quantum
chanics. Each successive electron, beginning numbers, n, I, and ml, the capacity of each or-
with the first, that comes within the field of an bital is thus limited to 2 electrons, and these
atomic nucleus, occupies the most stable posi- only if, according to the Pauli principle, they
tion available to it. The number of possible are of opposed spins (differ in the fourth quan-
positions is limited by quantum restrictions tum number). The total capacity of each princi-
which describe each position in terms of four pal quantum level is therefore twice the number
quantum numbers, and by the Pauli exclusion of orbitals within it, because this represents the
principle that no two electrons within the same total number of permissible combinations of
atom may have the same four quantum num- four quantum numbers within that level. For
bers. These electron positions, or energy levels, example, the capacity of the n = 4 level is lim-
are grouped with respect to their average dis- ited to 32 electrons by the fact that only 32
tance from the nucleus as "principal quantum different combinations of the four quantum
levels or shells," designated by the "principal numbers are possible within that level; these
quantum number" n = 1,2,3,4,' .. , successive electrons will occupy 16 orbitals.
PERIODIC LA W AND PERIODIC TABLE 894

The differently shaped orbitals having orbital again adds 10 elements before filling of the
quantum numbers 1= 0, 1, 2, and 3 are com- outermost shell is resumed; this period also con-
monly called s, p, d, and f orbitals. From the tains 18 elements. The sixth period begins as
above discussion, it should be clear that within before with two electrons in the outermost
any principal quantum shell, there can be only shell, but then there is an interruption at lan-
one s orbital, three p orbitals, five d orbitals, thanum (57) to begin filling the 5d orbitals.
and seven f orbitals. The p orbitals do not ap- Here, however, occurs an additional interrup-
pear until n = 2, the d until n = 3, and the fun- tion, in which 14 elements are formed through
til n = 4. Within any given principal quantum filling of the 4f orbitals, before the remaining
shell, the order of decreasing stability, and 5d orbitals can be filled, and in tum before the
therefore the order of filling with electrons, is outermost shell receives any more electrons.
always s-p-d-f Consequently here it takes 32 elements to bring
The periodicity of atomic structure arises the outermost shell to 8 electrons and thus end
from the recurrent filling of new outermost the period. The seventh period is similar but in-
principal quantum levels, but it is complicated complete. In principle it would end with ele-
by the fact that, although the principal quantum ment 118, but artificial element 105 is the high-
levels represent very roughly the general order est in atomic number known at the time of
of magnitude of energy, there is considerable writing.
overlapping. This overlapping is such that the These elements which represent interruptions
outermost shell of an isolated atom can never in the filling of the outermost sand p orbital
contain more than 8 electrons. In the building octet are called "transitional elements" (where
up of successively higher atomic numbers, elec- d orbitals are being filled), and "inner transi-
trons al~ays find more stable positions, once a tional elements" (where f orbitals are being
set of p orbitals in a given principal quantum filled). This is to distinguish them from the
level is filled, in the s orbital of the next higher other, "major group," elements in which under-
principal quantum level rather than the d or- lying d or f orbitals are either completely empty
bitals of the same principal quantum level. or completely filled.
When this s orbital is filled, electrons then go Physical properties of the elements that de-
into the underlying d orbitals until these are pend only on the electronic structure of the in-
filled, before continuing to fill the outermost dividual atom, such as the ionization potential
shell by entering its p orbitals. The building-up and atomic, radius, vary periodically with
of the atoms of successive atomic numbers may atomic number simply because of the recurrent
be represented by the following sequence: Is, filling of the outermost shell. Increasing the
2s, 2p, 3s, 3p, 4s, 3d, 4p, 5s, 4d, 5p, 6s, 5d, 4f, atomic number by increasing the nuclear charge
6p, 7s, 6d, Sf The periodicity of atomic struc- while adding to the outermost shell of electrons
ture thus consists of the recurrent filling of the maintains the electroneutrality of the atom but
outermost shell with from one to 8 electrons has a highly significant effect on that part of
that corresponds to the steady increase in nu- the nuclear charge which can be sensed by
clear charge. outermost electrons. Underlying electrons are
A period is considered to begin with the first quite effective at shielding the nucleus, but in
electron in a new principal quantum shell and the outermost shell, each additional electron
to end with the completion of the octet in this is evidently too involved in avoiding the others
outermost shell, except, of course, for the very for them to be very effective in shielding one
first period, in which the outermost shell is another from the additional nuclear charge.
filled to capacity with only two electrons. From In fact each additional electron is successful
the order of orbital filling given above, it should in blocking off only about one-third of the posi-
be apparent that periods so defined cannot be tive charge that is simultaneously added. There-
alike in length. The first period, consisting of fore, with each unit increase in atomic number,
hydrogen and helium, has only two elements. when electrons are being added to the outer-
The second period, beginning with lithium (3) most shell to form the major group elements,
and ending with neon (10), contains 8 elements, the effective nuclear charge-that part of the
as does the third period, which begins with so- total nuclear charge that is sensed by an elec-
dium (11) and ends with argon (18). The fourth tron in the outermost shell-increases by about
period begins with potassium (19), but follow- two-thirds. This causes the electronic cloud to
ing calcium (20), the filling of the outermost be drawn closer to the nucleus, diminishing the
(fourth) shell octet is interrupted by the filling atomic radius and holding the electrons more
of the d orbitals in the third shell. Thus 10 tightly, as generally indicated by increasing
more elements enter this period before filling of ionization energy. The combined effect of in-
the outermost shell is resumed, making the to- creased effective nuclear charge and shorter
tal number of elements in this period 18. In the radius increases the attraction between nucleus
fifth period, the first two outermost electrons and outermost electron, defined as the electro-
are added in rubidium (37) and strontium (38), negativity. Thus from left to right across the
but then this outer shell filling is interrupted by major groups, within each period, the radius
the filling of penultimate shell d orbitals, which decreases and the electro negativity increases.
895 PERIODIC LAW AND PERIODIC TABLE

Similar but smaller effects are observable for outermost electron per atom and can therefore
the addition of d or f electrons to underlying form but one covalent bond. Each is very low
shells, in the transitional and inner transitional in electronegativity and thus tends to become
series. highly positive when bonded to another ele-
The bonding properties of elements also de- ment. Crossing each period the elements be-
pend on the electronic structure of the individ- come less metallic, higher in electronegativity,
ual atom and therefore likewise vary periodi- and able to form a greater number of bonds un-
cally. For example, each period (except the til limited by the number of outer shell vacan-
first) begins with an alkali metal, lithium (3), cies rather than the number of electrons. The
sodium (11), potassium (19), rubidium (37), halogens, fluorine (9), chlorine (17), bromine
cesium (55), and francium (87), all of which (35), iodine (53), and astatine (85), each of
show similar metallic bonding, crystallizing in a which is next to the end of its period, are all
body-centered cubic lattice. Each has but one nonmetals, highest of their respective periods

MAJ OR GROUPS

N,. MI 1M2 M2' I M3 M4 MS M, M 7 MI


I IH 1 He 2
2 Li 3 Be 4 B 5 C6 N 7 o 8 F 9 Ne 10
3 Na 11 Mg 12 AI 13 Si 14 P 15 S 16 CI 17 Ar 18
4 K 19 Ca Ga 31 Ge 32 Kr 36
Zn As 33 Se 34 Br 35
Rb 37 Sr
,
S
Cd In 49 Sn 50 Sb 51 Te 52 I 53 Xe 54
Cs 55 Ba Hg TI 81 Pb 82 Bi 83 Po 84 At 85 Rn 86
7 Fr 87 Ra

4
/l ~ I 3
Se 21
I 4
Ti 22
I
V 23
S I •
Cr 24
I
Mn 25
7 I •
Fe 26 Co 27
I 9 I
Ni 28
10 I
Cu 29
II

S Y 39 Zr 40 Nb 41 M042 Te 43 Ru 44 Rh 45 Pd 46 AS 47
, La57 Lu IHf 72 Ta 73 W 74 Re 75 Os 76 Ir 77 PI 78 Au 79
7 Ae89 Lr I 104 105 106 107 108 109 110 111

/l~N -..:::::: -
'ICe 58 Pr 59 Nd60lpm 611Sm 621Eu 63 Gd641Tb 65 Dy 661 Ho 671 Er 68TTm 69 Yb 70

71 Th 90 Pa 91 U 921Np 9JIPu 941Am 951Cm 961sk 97 Cf 981 Es 991 Fml00lMd 101 No 102

COMPLETE LONG FORM


.--

.
r-
I

Vous aimerez peut-être aussi