Brehm Mullin
Brehm Mullin
10» 2 T tera
9
10 G g»ga
10
6
M mega
3
10 k kilo
10- 3 m milli
6
1(T M micro
10- 9 n nano
12
10 P pico
5
io-' f femto
10" 18
a atto
£po~»-
477e
.2
e
= 1.440 eV •
run
4we
electron mass m = e
9.109 X 10~ 31 kg = 0.5110 MeV/c 2
proton mass Mp = 1.673 X 10" 27 kg = 938.3 MeV/c 2
M P
=
proton -electron mass ratio 1836
m ,
~ 34
Planck's constant h = 6.626 X 10 J •
s = 4.136 X 10~ 15 eV •
s
he = 1240 eV nm •
h = 1.055 X 10"
34
J •
s = 6.582 X 10-' 6 eV •
s
he = 197.3 eV nm •
-1
Avogadro's number NA = 6.022 X 10
23
mole
Boltzmann's constant kB = 1.381 X 10" 23 J/K = 8.617 X 10~ 5 eV/K
h
electron Cbmpton wavelength = 2.426 X 10
,2
m
m e
c
http://archive.org/details/introductiontostOObreh
INTRODUCTION
TO
THE
STRUCTURE
OF
MATTER
INTRODUCTION
TO
THE
STRUCTURE
OF
MATTER
John J. Brehm
and
William J. Mullin
University of Massachusetts
Amherst, Massachusetts
WILEY
John Wiley & Sons
New York Chichester Brisbane Toronto Singapore
Copyright © 1989, by John Wiley & Sons, Inc.
Brehm, John J.
Introduction to the structure of matter.
Bibliography
Includes index.
1. Physics. 2. Matter — Constitution. I. Mullin,
William J. II. Title.
10 987654321
to
A
first course in modern physics should be
exciting and rewarding for studentand teacher alike. Such a course offers an
opportunity to appreciate the wondrous findings of the 20th century as milestones in
the growth of a whole new science. From this perspective the student can take pleasure
in the discovery of new phenomena and the teacher can draw inspiration from the
revolutionary new ideas.
VII
ww Preface
questions, and yet extensive enough to cover many areas. We also believe that, within
reason, the text should offer every topic desired by every teacher, even if the sum of all
the topics is too much for two semesters. This diversity of offerings is necessary if a
given teacher is to have access to every possible pathway through the available
material.
We recommend that four foundation stages be prepared first, and one of two main
routes be chosen afterward, with the adoption of this book as a text. The first four
quantum theory,
areas cover special relativity, introduction to quantum mechanics,
and application to atoms. These topics occupy Chapters 1-9 and should consume
more than one semester of lecture time. The teacher may then choose to go from
atoms to larger quantum systems, such as molecules and condensed matter, or to
smaller quantum systems, such as nuclei and elementary particles. The one route
spans Chapters 10-13, and the other spans Chapters 14-16. Our textbook gives ready
access to either of these alternatives.
Every topic in the book is developed to contain a core of subject matter for the
teacher's lectures and a body of supporting material for the students' readings. We
devote a substantial portion of the text to side-reading because we expect the students
to turn from the lecture to the book for a properly detailed understanding of the
subject. This plan assumes the usual ideal classroom setting, where the teacher
introduces the students to concepts and applications, and the students refine their
grasp of the material by studying on their own.
The book is aimed generally at physics majors in their junior year, although some
flexibility is built into this choice of level. In fact, a large portion of the material has
actually been taught at the University of Massachusetts to second-semester sophomores
and first-semester juniors. We assume an adequate background to be three previous
semesters of physics, a la Resnick and Halliday. We take for granted any topic found
in this preparatory body of coursework, and we occasionally reiterate certain classical
points if they bear directly on new ideas. The new material is always constructed in a
self-contained and thorough manner, consistent with the intended level. Sometimes it
is up the material from the ground and eventually "take off"
instructive to build
toward a rather high level. One of our main assumptions is that interested students
want to " take off" on occasions, with the guidance of the teacher. These instances of
excess occur here and there throughout the book and are easy to bypass if that is the
teacher's wish.
The text emphasizes discoveries and ideas, and incorporates analytical formalism
sufficient for the level of presentation. We regard our treatment of modern physics as
a prelude to a subsequent course in quantum mechanics. Hence, the text is supposed
to build a case for the quantum theory and then supply enough quantum machinery
to give the students an understanding of all the different quantum systems. The text is
not intended as a presentation of quantum formalism, however. Instead, we use only
the Schrodinger equation to convey most of the quantum mechanics in the book.
Thus, many of the essential "words" of quantum mechanics (Hamiltonian, commuta-
tion relation, spinor, matrix element, . .
.
) never actually appear, even though the
concepts lie just beneath the surface in our discussions.
The mathematical demands book are quite consistent with the level intended
of the
for the physics. We assume that all somewhat skilled in the
students at this level are
use of calculus. We also take the view that exposure to new mathematics is a worthy
secondary objective of the course and its accompanying text. Matrices are employed
on a few occasions to elucidate certain algebraic considerations. Complex numbers are
put to constant use in the solution of the Schrodinger equation and the interpretation
Preface ix
exercised when the topics of interest are governed by partial differential equations.
Our main assumption about the introduction of unfamiliar mathematical techniques
is that the students are willing to move quickly into the use of differential equations in
order to grasp the interpretation of the solutions. We promote the development of
these capabilities for a good and healthy cause. The students' analytical powers are
strengthened for the future, and their understanding of the subject is enriched as a
result. Modern physics is ideal in this respect because the subject cannot be properly
supported without a certain amount of mathematical scaffolding and because the new
ideas can be fully brought to life with this support.
History is also included in order to present modern physics as a sort of adventure in
20th century science. We insert these narrative elements whenever we wish to illustrate
the growth of ideas and put separated events in proper context. Modern physics
should be viewed as an ongoing adventure, and historical background can be used as a
suitable vehicle to convey this impression.
Every major subject area has its own chapter in the book. Some of the chapters are
rather long because of the broad scope of the particular areas. This aspect of the book
should be ignored by the reader since every chapter is comfortably divided into
sections of manageable length. The sections themselves should be regarded as the
basic logical units of the text.
At least one example appears at the end of almost every section. We put all
numerical computations and some algebraic manipulations into these examples so that
we can separate illustrative material from the main flow of the text. Every topical area
is further illustrated by a wide selection of problems at the end of each chapter.
The book includes only one appendix containing our table of nuclear properties.
We deviate from the usual practice followed in other books and choose not to relegate
supporting material to a series of appendixes. Instead, we evaluate every fragment of
material on its own instructional merits. In many cases these fragments are important
for their mathematical content and are worthwhile for students to learn. Any item of
pedagogic interest is given its own place along with other pedagogic matter in the
main body of the text, and all items of less instructional value are left out of the text
altogether.
It is sometimes necessary to digress and set aside a certain derivation so that the
accompanying topic can be presented without interruption. We employ these digres-
sions sparingly and call them "details." The occasional detail is inserted at the end of
the relevant section as a sensible alternative to the use of an appendix at the end of the
book. This practice enables the reader to locate useful information out of the main
flow but still near its proper place in the text.
We have argued for a balance between depth and breadth in the teaching of
modern physics. A teacher can strike such a balance with the aid of this book by
exercising selectivity over all the available material. It is clear that every topic cannot
be covered properly in a single year. Our hope for the students is that they find this
book instructive as a presentation of the teacher's choice of topics and also enlighten-
ing as a source of further reading later in their careers.
John J. Brehm
Amherst, Massachusetts William J. Mullin
ACKNOWLEDGMENTS
wisdom and advice to this project. We would like to thank Ian Aitchison, Tom Arny,
Ed Chang, John Donoghue, Bob Gray, Bob Hallock, Ted Harrison, Bob Krotkov,
Francis Pichanick, Kandula Sastry, Janice Shafer, and Mort Sternheim for offering us
the benefits of their expertise.
We should also extend our appreciation to the reviewers who provided constructive
criticisms of the manuscript. All their remarks were given serious attention, and almost
all were acted on in one way or another to improve the quality of the presentation.
We were extremely fortunate to have the assistance of Nellie Bristol in the
preparation of our text. Her ability to convert a disorderly handwritten draft into a
typed manuscript was needed to carry out the project, and her customary good humor
was always greatly appreciated.
We would express our gratitude to our families too, if we could only find the words.
Their encouragement was there at the start when none of us could judge the scale of
the endeavor, and continued to be there throughout as the enormity of the work
became all too apparent.
J.J-B.
W. J. M.
xi
CONTENTS
CHAPTER ONE
RELATIVITY /
1-7 Space-Time 38
1-8 Relativistic Momentum and Energy 44
1-9 Relativistic Dynamics 50
1-10 Collisions and Reactions 53
1-11 Four- Vectors 60
Problems 66
CHAPTER TWO
PHOTONS 73
Problems 'IS
xlii
xiv Contents
CHAPTER THREE
INTRODUCTION TO THE ATOM m
3-1 The Reality of Molecules and Atoms 122
Problems 178
CHAPTER FOUR
MATTER WAVES m
4-1 De Broglie's Hypothesis 182
Problems 216
CHAPTER FIVE
QUANTUM MECHANICS m
5-1 The Schrbdinger Equation 220
Problems 294
CHAPTER SIX
QUANTIZATION OF ANGULAR MOMENTUM soo
Problems 341
CHAPTER SEVEN
THE ONE-ELECTRON ATOM 344
Problems 373
CHAPTER EIGHT
SPIN AND MAGNETIC INTERACTIONS 375
Problems 437
CHAPTER NINE
COMPLEX ATOMS 440
9-3 The Ground States of Atoms and the Periodic Table 449
Problems 498
CHAPTER TEN
MOLECULES 501
CHAPTER ELEVEH
QUANTUM STATISTICAL PHYSICS 538
Problems 571
CHAPTER TWELVE
SOLIDS 575
Problems 631
Contents xvii
CHAPTER THIRTEEN
SUPERFLUIDS AND SUPERCONDUCTORS m
13-1 Experimental Characteristics of Superfluity 635
Problems 665
CHAPTER FOURTEEN
PROPERTIES AND MODELS OF THE NUCLEUS 667
Problems 736
CHAPTER FIFTEEN
NUCLEAR PROCESSES 740
Problems 811
CHAPTER SIXTEEN
ELEMENTARY PARTICLES sis
Problems 909
APPENDIX A
TABLE OF NUCLEAR PROPERTIES a-i
BIBLIOGRAPHY a-b
ANSWERS a-12
PHOTO CREDITS M6
NAME INDEX / -/
The towering achievement of the period was the unification of the laws governing
electricity and magnetism in Maxwell's electromagnetic theory. All the known proper-
ties of charges and currents and all the known behavior of electric and magnetic fields
were described by Maxwell's equations in agreement with existing experimental
evidence. The theory went further and predicted the propagation of oscillating electric
and magnetic fields through empty space. It was also possible to predict the speed of
wave propagation in terms of parameters appearing in Maxwell's equations. Labora-
tory evidence for electromagnetic waves in vacuum was subsequently demonstrated in
the experiments of H. R. Hertz. The prediction for the wave speed has been confirmed
many times in several different experiments during the past century and a half. The
current value
c = 2.99792458 X 10 8 m/s
has become known
as the speed of light, a quantity conveniently approximated as
8
3 X 10 m/s,an accuracy of three significant figures. (In fact, c has recently been
to
defined as exactly 299792458 meters per second, leaving the meter to be determined by
experiment.)
The 19th century was a productive period, when scientists could reaffirm Newton's
laws and verify Maxwell's equations as parallel bodies of doctrine. Together these
classical formulations described physics successfully over an enormous range of
applications. By 1900, however, a confrontation of principle had begun to develop
between Newton's mechanics and Maxwell's electrodynamics. The point at issue
involved the description of physical behavior in moving frames of reference. The
conflict between the two systems of classical laws was addressed and resolved by the
theory of relativity.
To appreciate the problem, we have to recall some basic notions about the use of
coordinate systems for the description of particles. In particular, we wish to examine
how these descriptions change when we pass to different frames of reference, especially
those in motion at constant relative velocity. Newtonian mechanics tells us that an
inertial frame of reference is a coordinate system in which Newton's first law holds.
Any other frame in uniform relative motion is also inertial, and so the whole
Newtonian scheme is valid in all such frames. We find, however, that electromagnetic
effects such as the phenomenon of light appear to select a privileged frame for the
inquiry has a familiar previous theory. In modern physics we are concerned with the
principles of special relativity put forward by A. Einstein in 1905. The propagation of
electromagnetic waves and the speed of light are central concepts in the logic of this
theory. When we follow the logic we find that the theory requires us to alter our naive
attitudes about the treatment of space and time.
The identification of light as an electromagnetic wave led scientists of the 19th century
topresume that light had properties in common with mechanical forms of wave motion.
The mathematical similarity between sound and light was apparent since both types
/- / The Lumini/erous Aether
Albert Einstein
of wave were described by solutions of the wave equation. Of course, there were
distinctions to observe as well. Sound waves were known to be longitudinal oscillations
of the medium, without polarization properties, while electromagnetic waves were
understood as oscillations of electric and magnetic fields, with polarization transverse
to the direction of propagation. The similarities suggested that light, like mechanical
waves, should require a medium for support and propagation. The medium was
known as the aether, an entity that supposedly possessed some very remarkable
properties. It had to fill empty space with a universal tension in order that electromag-
netic waves could propagate in vacuum at the unique speed c. The medium also had
to beinfinitely rigid and incompressible, so that longitudinal disturbances could not
exist, and yet had to offer no obstacle whatsoever to the motion of material bodies.
Thus, the aether was conceived to be a mechanical medium for light with no
mechanical properties in the presence of matter.
Given the advantages of hindsight, we should find it difficult to appreciate the
general acceptance and appeal of such a contrivance. Physicists of the period were
accustomed to the adoption of mechanical models and were not receptive to the
possibility of wave motion without a mechanical medium. Indeed, Maxwell himself
had advocated an "aethereal medium" in his own definitive work on electromagnetic
theory. When the predicted waves were discovered after his death, the scientific
community was predisposed to accept the aether as a working hypothesis and examine
its further consequences.
We should realize that a philosophical position of some depth is rooted in these
arguments. The an aether implies mechanical wave propagation where
existence of
disturbances of the medium travel through the medium. In this context, the speed of
propagation refers to the speed of wave motion relative to the medium. Believers in the
4 Relativity
aether would therefore regard the accepted value of c as the speed of light waves with
respect to their aethereal medium. This aether would then represent an absolute frame of
reference in which all electromagnetic waves travel with the speed c. Furthermore, since
the existence of the waves and the prediction of the wave speed are consequences of
Maxwell's equations, it follows that the aether must constitute the unique frame of
reference in which Maxwell's equations are valid. Electromagnetic theory would then
have to assume an altered form in another frame in motion through the aether so that
light waves would be expected to have a different velocity of propagation with respect
to that moving frame.
The aether was obviously more than just a mechanical convenience for the
propagation of light. It also embodied a logical attitude toward the validity of certain
established physical laws. Manifestations of the aether were presumably subject to
experimental investigation. Aether believers did not presuppose that the Earth was at
rest in the absolute frame of reference and set out to devise an experiment capable of
detecting the motion of the Earth relative to the aether. The required apparatus had
to be sufficiently sensitive to measure an Earth speed presumed to be much smaller
than c. The historic experiment was performed with the aid of an interferometer,
which had been developed by A. A. Michelson for the measurement of lengths to
extraordinary precision. Michelson was joined in the aether measurement by E. W.
Morley, and the results of the famous Michelson-Morley experiment were reported in
1887.
Theinterferometer is sketched in Figure 1-1. Light of a definite wavelength is split
into two beams by a half-silvered mirror; the two beams then travel at right angles
and are reflected back by the two mirrors, and 2, shown in the figure. The beam1
returning from mirror 1 is finally reflected to the observer where it interferes with the
beam returning from mirror 2. The two beams are coherent since they originate from
a single source, and the difference in their optical paths determines how they interfere
at the position of the observer. If the paths are of equal length, the two beams have
the same transit time, arrive in phase, and produce constructive interference. This
statement suggests the following way to measure the speed of the Earth through the
aether, since it is implicit in the statement that the speed of light is the same over the
two paths.
In keeping with the aether hypothesis we let c be the velocity of light and u be the
velocity of the Earth with respect to the aether frame. The velocity of light in the
Earth or interferometer frame is therefore
c = c — u. (1-1)
It is convenient to transfer the analysis to the Earth frame and let the aether move
through the interferometer with velocity — u. Then c' represents an aether-drifted
light velocity, analogous to the velocity of a swimmer in moving water. We take the
aether drift to be from right to left in the figure, and, for simplicity, we let the two
arms of the interferometer have exactly the same length d. Equation (1-1) tells us that
the light speeds to the right and to the left are c — u and c + u, respectively.
Therefore, the transit time to mirror 1 and back is
d d 2dc
c — u c + u c — u
The inset to Figure 1-1 shows how to construct c' for the light traveling to mirror 2
/-/ The Luminiferous Aether
Figure 1-1
Mirror 2
Half-silvered
mirror
© =>-
^=^
d
Light source
Observer Mirror 1
-^ Aether drift
and back. The light speed c' over this path is evidently (c
2
— «
2
)
1/2
each way.
Consequently, the transmit time to mirror 2 and back is
2d
2
(,._„«)./*
Since u/c is expected to be very small, we can use the binomial expansion to write
2 \ -1/2 2
u 1 u
1
"7 = 1 + -z +
C
and 1
c
= !
+ «- +
2
2d
-
2rf/
c \
1 + -
ti
c
and ,
2
=
T
I
1 + --
1u
The difference in transit times causes the beams to arrive at the observer out of phase.
To the same order, the optical path difference is c{t x
— t
2 ). The number of wave-
lengths contained in this distance determines the number of fringes by which the
interference pattern departs from the situation for zero path difference. A rotation of
the apparatus by 90° introduces a change of roles between the times <, and t
2
. The
)
Relativity
c(t, — t., d l u
(1-2)
We emphasize that these assertions are predicated upon the adoption of the aether
hypothesis and the use of Equation (1-1).
The experimenters were confident that a shift as small as ^ of a fringe could be
Morley experiment yielded a null result; no fringe shift was observed. Subsequent
versions of the experiment have reproduced this same result. To the consternation of
aether believers everywhere the conclusions were inescapable. The motion of the Earth
through the aether frame was not detectable, and the speed of light in the interferom-
identical times in the above analysis. This ad hoc suggestion turned out to have its
place in future developments. However, it remained for Einstein to interpret the
problem in its proper light.
Example
11 m 2
8n (10" 4 ) = 0.4,
5.9 X 10" 7 m
as remarked above.
The concept of the aether as an absolute frame of reference was rendered meaningless
by the Michelson-Morley experiment. Einstein was not really influenced by this
result, even though he had previously entertained the desire to perform such an
1-2 Principles of Relativity 7
experiment himself. He had already become convinced that notions of absolute rest
and absolute motion should have no observable consequences and that only relative
motion could have physical meaning. This point of view had a previous history. It was
accepted that the laws of mechanics should make no distinction between a state of rest
and a state of motion with constant velocity. The idea that uniform motion must be
relative, and hence detectable only with reference to an external point, originated with
Galileo. This venerable and self-evident relativity principle assumed a new thrust when
Einstein applied it to the propagation of light in vacuum. The question was part of
the problem of reconciling electromagnetism and mechanics to the same relativistic
viewpoint. Einstein investigated this problem during his period of employment at a
Swiss patent office in an environment isolated from the stimulus of institutional
research.
Einstein recognized a conflict between the classical theories of Newton and Maxwell.
He noted that Newtonian mechanics allowed an observer to move at speed c, since an
accelerating force could cause an object to reach any speed if applied long enough,
and he asked how electromagnetic waves traveling in the same direction, also at speed
c, might appear to the moving observer. Because the wave fronts would be accompa-
nying the observer, the oscillations of the propagating fields would not be detectable.
The electromagnetic wave motion predicted by Maxwell's equations would therefore
not exist in the observer's frame of reference. The argument employed the logic of
Newton's laws to deny a logical consequence of Maxwell's laws. This amounted to an
inconsistency or contradiction in the classical theory. Einstein also saw that his
thought experiment furnished a violation of the relativity principle regarding relative
motion. He argued that a moving observer in a closed chamber could know that the
chamber was in motion at light speed, without reference to any external point, by
noting that light could not be observed in the chamber.
Impossible according to
the principle of relativity
.
8 Relativity
The laws of electromagnetism are valid in all frames of reference in which the laws
of mechanics hold.
The speed of light in vacuum is the same for all observers independent of their
motion or the motion of the source.
The first postulate implies that observers in relative motion at constant velocity must
agree concerning their expression of the physical laws. The second asserts that these
observers must also agree that light waves propagate at speed c with respect to every
frame of reference in which each particular observer is at rest. Einstein's theory sweeps
the aether aside and declares it to be superfluous with this assertion.
The second postulate is alien to our rudimentary common sense. Consider the
situationshown in Figure 1-2 in which a moving observer 0' carries a light source and
sees wave fronts of light propagating away at speed c. A stationary observer sees
those wave fronts moving also at speed c, even though 0' is approaching at a speed
u, which in principle could be almost as great as c. This seems paradoxical if we react
to the kind of common sense that has taught us to use Equation (1-1) as the rule for
the addition of velocities. However, we have just learned that there is experimental
evidence refuting this rule for the propagation of light. The correct method of adding
velocities must yield a different rule so that the observers and 0' in the figure can
agree that the light speed has the value c. Furthermore, a regime of speeds must exist
for things other than light, with velocities v and v' substituted for c and c' in Equation
(1-1), such that the equation is a valid approximation to the exact formula for velocity
addition. The second postulate has survived every experimental test, and so our
common sense has to be revised by giving cautious consideration to the space and time
coordinate systems used by observers in relative motion.
Let us first define what we mean by an observer in more general terms. Imagine a
coordinate frame in which clocks are placed at suitably spaced regular intervals. This
lattice of timers can be synchronized to keep identical times by the use of light signals.
A simple method for doing this is given in the example at the end of the section. Let
each lattice site be equipped with a device for recording events that occur at that
Figure 1-2
Two observers in relative motion. is at rest and 0' moves toward at constant speed u.
and 0' agree on the speed of light coming from the source carried by 0'
Observer O'
1-2 Principles of Relativity 9
the different sites, all with synchronized timers. It is obvious that a single observer who
reads the recorded data is sufficient in a lattice so equipped. The particular location of
that observer in the reference frame is immaterial. In fact, even the one observer is
rest and a frame S' in which observer 0' is at rest. The frame S' moves uniformly
through the frame S at speed u. These two coordinate systems are shown at two
instants of time in Figure 1-3. At the first instant when S and S' coincide, a light flash
is emitted from a source at rest at the origin in S', and at the later instant, the wave
front from this flash reaches a detector at a location fixed in S. This wave propagates
spherically from its point of origin with the same light speed c in both frames,
according to the second principle of relativity. The figure indicates qualitatively thai
the detection of the wave front occurs at a distance Ax in S and at a lesser distance
Ax' in S'. In view of this the elapsed times in 5 and S' are different:
A< = —
Ax
c
in S and A/' =
Ax'
c
in S'
so that A/' is evidently less than A/. We sketch this argument here to introduce a
feature of the second postulate, which we must be prepared to accept. Observers in S
Figure 1-3
Reference frames S and S' in uniform relative motion. S' moves through 5 with speed u, and
so S moves with thesame speed in the other direction through S', A light source at the origin in
S' emits a flash at the instant when S and S' coincide, and the propagating wave front is
observed later at a location fixed in S. Emission and detection are shown in the frame S on the
(*>- ( iy-
-o <>
Emission Emission
S'
Ax
>5>-
A.v
Detection Detection
St later At' later
10 Relativity
and in S' do not agree about time intervals or space intervals in order that they do
agree about the light speed c. We deduce the actual relations for time and space
intervals in different frames in Section 1-3.
Relative motion satisfies a reciprocal property, which we have incorporated in
Figure 1-3. We note that 5" moves to the right with speed u when 5 is the frame at
rest and that S moves to the left with the same speed u when S' is the frame at rest.
Figure 14
Propagation of a light flash from the midpoint of a moving railroad car. Light-triggered buzzers
act as detectors at each end of the car. In S', the rest frame of the car, the buzzers are observed
to go off simultaneously. In S, where the car is moving to the right, the left buzzer is observed
to go off before the right buzzer.
,s" ) (*>
//// ////
\/s
© S' )^- o
yv
//// s ////
////
o ^
VV
1-3 Time Dilation and Length Contraction 1
simultaneously. The right side of the figure shows what happens in S, where the light
source and buzzers are moving to the right. The light flashes and the wave front
propagates outward from the point of origin. It is clear from the figure that the wave
front reaches the left end of the car before it reaches the right end so that the buzzers
do not go off simultaneously in S.
We have begun to describe a revised treatment of space and time as introduced by
Einstein to ensure consistency between mechanics and electromagnetism. We should
be able to see that Maxwell's theory occupies the privileged position in this picture. If
Maxwell's equations are valid in a particular reference frame, they are supposed to
hold in any other frame in uniform relative motion. Otherwise, it would not be
possible for electromagnetic theory to predict the same speed of light c for both
frames. It would therefore appear that Newton's laws must be modified in order to
complete Einstein's picture.
Example
Suppose that we have an array of clocks at rest with respect to each other and
that we wish to synchronize these clocks to read identical times. Let us consider a
linear array in which the clocks are placed along a line at locations separated by
3 m intervals. A light signal traverses a 3 m distance in 10
8
s, or 10 ns. Let the
clocks be triggered to start ticking when a wave front of light passes, and preset
the line of clocks to read sequentially 0, 10, 20, 30, . .
.
, in nanoseconds. A light
flash at the location of the ns clock activates the other clocks in sequence so
that they all keep identical times thereafter.
Figure 1-5
t
d
source
i
.Or
-\3T
detector
t' = ® k_j At
mS'
t = ^)At
mS
x = M u At
A/. We use the universality of the speed of light to express the time intervals as
2d 2(
At' = —c
in S' and At = —
c
in S.
It is obvious from these relations and from the figure that At is greater than At'. If we
examine the light path in 5 we see that
= d2 4 | -A
At'
At = " (1-3)
/l - u
2
/c 2
Figure 1-6
Fixed synchronized clocks showing more elapsed time than the clock in motion. The measuring
rod from A to B is at rest along with the synchronized clocks in S. Its length D is a proper
length determined by the spatial interval in S between points A and B.
0-
"0 D
Measuring rod
The device in the experiment is a kind of clock. The fact that the clock is at rest in
S' distinguishes that frame from S, which may be any other frame where the device is
moving. We have timed the interval between two events, again using the emission and
detection of a light signal. The frame S' is distinguished as the only one in which these
events occur at the same spatial location. The time interval between two such events at
the same point in space is a unique quantity called the proper time interval between the
events. According to Equation (1-3), the elapsed time A/ between the same events in
any other frame is dilated, or enlarged, by the factor (1 — u
2
/c 2 )~ x/1
compared to the
proper time interval A/'.
The identification of a proper time is often the key to the resolution of many
puzzling applications of time dilation. Consider clocks in relative motion as a first
example. We can expect such timekeeping devices to record different times in the
manner illustrated in Figure 1-6. The array of clocks between points A and B in the
frame S are synchronized, and another clock in motion from A to B indicates a
smaller time interval for the journey than that recorded by each of the clocks in the
fixed array. In terms of Equation (1-3), the moving clock reads A/' while the
synchronized array reads A/ for the elapsed time. The moving clock keeps proper
time; the ticks of that clock are events at a fixed location in the moving frame and are
being observed at a sequence of locations in the fixed frame S. The relation between
A/ and A/' depends on the ratio u/c. Time dilation has real effects that are not
readily observed unless the speed u is sufficiently close to c. The phenomenon is
therefore surprising to us because the regime of such large velocities is beyond our
limited experience.
Observers in relative motion must also disagree over the measured length for a
spatial interval when the length is along the direction of motion. To see how their
measurements differ, let us construct another kind of relativity laboratory, building
again on the propagation of light signals. As before, we want a design in which light is
emitted and returned to the same point in the laboratory. Figure 1-7 shows a
construction consisting of two facing walls in which the one wall is mirrored while the
other contains a light source and a detector at the same location. The device is in
motion at constant speed u in the frame S and is at rest in the frame S'. The
wall-to-wall distance depends on the frame and is denoted by L' in S' and by L in S.
We can measure these lengths by timing the passage of a light signal from the source
14 Relativity
Figure 1-7
Mirror
Source
Detector V inS'
L inS
inS'
0<
At'C-*-
inS
t = 0'
M i
C^
At e*-
u M L L + u At
back to the detector after its reflection at the mirror. In S', the light flashes at /' =
and the reflected signal arrives at the detector at t' = At' after traveling the distance
L' each way. The transit time is
2L'
At' = -
in S'.
c
Note that At' is a proper time interval because the emission and detection of the
signal are events at the same location in S'. In S, the light signal propagates as shown
in the figure. The mirror moves to the right with speed u while the signal travels with
speed c and overtakes the mirror. If the flash goes off at / = in S, then at / = A/,
1-3 Time Dilation and Length Contraction 15
the signal reaches the mirror, which has moved to the location L + uAt and at {
,
t = At the reflected signal arrives at the detector, which has moved during the
intervening time to the location «A(. Since the light travels the distance L + u At x
in
the time At l
, we have
I + uAi,
= At l
and so
L
At = .
x
c — u
I + i/Ai,- u At.
A/ - A/,
c
so that
At - At = l
c + u
L L
At = + = —2Lc in S.
c + u c — u c" — u~
We also know that A< and At' are related by Equation (1-3), because we have
identified At' as a proper time interval. The resulting equality is written as
c
2
~u 2
yl-.-'/r- yjc
2
- «-•
L = Z/yi- (1-4)
7 .
called the proper length. Equation (1-4) says that the length L of the same object in any
frame S in which the object is moving is shortened relative to the proper length by the
— 2 x/2
contraction factor (1 u /c 2 ) . Length contraction, like time dilation, is a real
effect that becomes observable for sufficiently large values of u.
Lorentz's assumption that objects contract along their direction of motion because of
the mechanical properties of materials. Instead, Einstein's view holds the observable
contraction to be an intrinsic property of space and time coordinate systems.
The relativity of space intervals can be deduced in another way. Let us return to
Figure 1-6 and assume that an observer travels with the moving clock between the
indicated points A and B. The observer measures the elapsed time A/' on that clock
and then uses the speed u to compute
D' = ubt'
as the distance from A to B. This result represents the length of the array of clocks in
S, observed in the framemoving clock as the array passes by (traveling to the
S' of the
left). The corresponding distance in S from A to B is given by the length D of the
measuring rod shown in the figure. An observer in S reports that the moving clock
travels the distance D (to the right) in time A< so that
D= ubt.
We can use Equation (1-3) and immediately relate the lengths D and D'\
2
u
This conclusion agrees with Equation (1-4), even though the two results may not
appear to be quite the same. It is important to realize that the distance D is the length
of a rod in the rod's rest frame and is The observer on the
therefore a proper length.
moving clock measures and obtains D', the appropriately contracted
this distance
length. Figure 1-8 shows the situation in the frame S' where the moving clock,
mounted by the primed observer, is at rest. In this frame the rod moves to the left with
speed u, and so the length D' is seen by the primed observer as the contraction of the
proper length D.
Some bewilderment over these formulas is inevitable. When circumstances involve
two (or more) relatively moving frames, it is natural to wonder where the primes
should go in the use of Equations (1-3), (1-4), and (1-5). The recommendation would
be to identify wherever possible the proper time or proper length in the appropriate
frame. The time or space interval in any other frame is then dilated or contracted
according to the formulas.
Figure 1-8
inS' [T_
1-3 Time Dilation and Length Contraction 17
Example
The instrument has been synchronized with a clock on the West Coast and is
compared with another clock on the East Coast after the journey. The West
Coast and East Coast clocks remain synchronized throughout. The picture is the
same as the one shown in Figure 1-6. Let the speed of the plane be u = 300 m/s
and let the transcontinental distance be D = 6000 km. A ground-based observer
records the flight time to be
D 6 X 10
6
m
At= — =
2
= 2 X 10
4
s.
u 3 X 10 m/s
1 u- At i u
At - At' = A/ 1-11- = At 1
-
?'
2
(We use the binomial expansion and note that the first nonvanishing term is
~6
enough since u/c = 10 .) The time difference between clocks at the end of the
journey is
4
2 X 10 s ,
At - At' = (l0-
6
r = 10~ 8 s.
2
Time dilation has a very small effect here because jet travel is so slow compared
to the speed of light. The traveler observes the transcontinental distance to be
D' = u At' . This corresponds to the Lorentz-contracted version of the proper
length D, where D and D' are related as in Equation (1-5). The difference
between D and D' is
u(A«-A«') =
At
«y(-J =7(7)
I
="
u\ 2
D I u\* 6
—
X 10
6
m
"(10- 6 r =
,
3x 10~ 6 m,
Example
distance and the 2 /is time interval do not refer to the same space-time frame.
The 2 [is lifetime represents a proper time interval in the muon's rest frame. A
fast muon, traveling at 99.9% of the speed of light, would see only about 600 m
of atmosphere pass by before decay:
~ /l
For u/c = 0.999 the time dilation factor is (1 — u
2
/c '
)
'
= 22. Hence, an
observer on Earth sees these muons live 22 times longer -and travel 22 times
farther. From Equation (1-5) we have
D'
D= .
= 22(0.6 km) = 13 km.
/l - u
2
/c 2
Thus, the Earth observer sees the muon's lifetime dilated, and the muon in turn
sees the distance through the atmosphere contracted.
Example
The relativity of simultaneity is the key to the following famous puzzle, which
we present with numbers first and then with general expressions. Let us use what
we have learned in order to make a 5 m pole fit into a 4 m barn. The obvious
procedure would be to run with the pole at speed u = \c into the barn (which
we assume to be open at both ends), as described in Figure 1-9. The observer
in the figure sees that the 5 m length of the pole is contracted by the factor
y 1 — u and hence can say that the pole just fits in the 4 m length of the
2
/c = 2
i
barn. Of is more to the puzzle than this. The observer 0' sees the
course, there
barn approaching the pole, observes that the 4 m length of the barn is
contracted by the same factor '\, and therefore says that the barn is only 3.2 m
long, too short to contain the entire length of the 5 m pole. We have an obvious
disagreement here between two observers and this would appear to be a
contradiction in relativistic physics. In fact, the disagreement is real and is to be
expected. To appreciate this we must realize that we are describing the obser-
vation of two events. Event B is the arrival of the right end of the pole at the
right end of the barn, and event C is the arrival of the left end of the pole at the
left end of the barn. Observer sees these events as simultaneous, but observer
0' cannot agree since the simultaneity of the two events is relative. Thus, the
lining of the pole in the barn is also relative, and observers in relative motion
are bound to disagree over it. Let us examine their disagreement in general
terms, letting S be the barn frame and S" be the pole frame. For reference, we
identify event A, the arrival of the right end of the pole at the left end of the
barn. Let this event occur at the origin in both frames, when / = in S and
when t' = in .S". Call the proper length of the pole L' (the length of the pole
in ,S"), and call the proper length of the barn <f
()
(the length of the barn in S).
The sequence of events A, B, and C is shown in the figure. In S, the length of
the pole is
L = L'
\l
1 - - 2
1-3 Time Dilation and Length Contraction 19
Figure 1-9
Observer 0'
Jl^
k* Observer O
inS inS'
11
A A
t =
u
1
f-Q
-L = ( e
-«o -L'o
u
B
''
l
f
l
t -'^
C
u
B
< -t'
B~ "
BC "
o e
*~ u
C
fC -&
< u
'
We choose u such that L = tf , so the pole "fits" in the barn in S. Then, when
t = t
BC = tf /u, the events B and C are observed in S as simultaneous. In 5", the
length of the barn is
r = /n \ i
-
r= L\ i = L' \l
20 Relativity
2 2
u / u
t
B = -u
= -v 1-7
u V -'«V
,_
7
and
1 ^BC
u u \ - 2
U /<T
2
^1 _ „2/ f '-'
Thus, there is a time gap in S', by which the events 5 and C fail to be
simultaneous.
14 Relative-Motion Symmetry
Relativity denies the concept of absolute motion and treats relative motion symmetri-
cally. We
have noted the reciprocal symmetry of frames in uniform relative motion in
Figure 1-3. We have also used the symmetry implicitly in Figure 1-5 to claim that
observers in such frames agree on lengths transverse to the direction of motion. This
reciprocity has certain confusing aspects that we now want to consider.
Suppose that two frames S and S" are in relative motion with velocity u. If we can
draw conclusions about S' moving at speed u in one direction through S, we must be
able to draw similar conclusions about S moving at speed u in the opposite direction
through .S". Let us visualize this in terms of a rocket ship traveling past a platform in
deep space as in Figure 1-10. Call the rocket frame S' and the platform frame S, and
assume two frames are synchronized when the
for convenience that clocks in the
origins of the frames coincide. We already know from Figure 1-6 that a moving rocket
clock at the origin in S' runs slower than a fixed array of platform clocks in S.
However, we can equally well say that the platform is moving past the rocket in the
other direction since only their relative motion is physically significant. Relative-
motion symmetry then requires a moving platform clock at the origin in S to run
slower than a fixed array of rocket clocks in 5". A picture of the symmetrical situation
is included in Figure 1-10.
It would appear that we have the possibility of a contradiction in the application of
this symmetry. In fact, no difficulties can arise, but we must be careful to examine
possible problems with questions that are properly posed. Note that each point of view
in Figure 1-10 employs a single clock keeping proper time in a "moving" frame,
compared with an array of synchronized clocks in a " fixed" frame. Note especially
that the proper time of interest refers to the ticks of the single moving clock as events
at the same position in that clock's frame.
The moving clocks have to be examined differently if we want to relate readings
between a single rocket clock and a single platform clock. We might wish to compare
a clock (call it C") at the origin in S' with a clock (call it C) at the origin in 5 by
letting the rocket pass the platform and then return. This procedure allows C" and C
to separate and then come back together for confrontation at the same location. We
must realize, however, that the clock C" parts company with the frame S' as soon as it
stops moving uniformly, as it must in order to reverse direction and return. Thus, is C
at rest in S" a new frame of reference moving through S, when it comes back to meet
,
Figure 1-10
Rocket ship and space platform in uniform relative motion. A rocket clock, at rest in S\ runs
slower than an array of platform clocks in S. A platform clock, at rest in S, runs slower than an
array of rocket clocks in S'.
Ny Ny
Let us apply the symmetry in a different way by letting an observer with one clock
look through a telescope at another receding clock and compare readings. The clocks
maintain uniform relative motion, and so the two frames S and S' suffice in this case.
We recognize first of all that the act of reading the time through a telescope is
accomplished by receiving the light emitted previously from the receding clock. In S,
let C" be moving away and sending its light to the location of C, as described in
Figure 1-11. Light that reaches the origin in .S' at time / = /, is light from C, emitted
earlier in S at time / = t . Since the light travels the indicated distance ut in the time
interval /,
— t , we set
ut.
so that
1 + u/c
22 Relativity
Figure 1-11
t' =
& S'
( =
&
f = t'
t = t o© in,,
t = t, (C
1 - K/f
«o .2 'o
1 + «/f
Thus, when C is read through a telescope and compared with C, it is found that the
time recorded on C is behind the time recorded on C by the factor
(1-6)
c + u
If we reverse the situation so that an observer in S' reads C and compares the reading
with C", we reach the symmetrical conclusion in which the receding clock is observed
to hand by the same ratio r given in Equation (1-6).
run behind the clock in
The Doppler effect for light offers an interesting practical realization of these ideas.
The effect for sound is familiar, and it is instructive to recall the results so that we can
appreciate the distinctions.
Sound waves
are observed to undergo a shift in frequency when the source of sound
is motion through the medium, either approaching or receding from the observer. A
in
different shift is obtained if the observer is moving and the source is stationary in the
medium. The shifts for sound waves are not symmetrical because the medium
provides a frame of reference in which source motion and observer motion can be
distinguished. Formulas for the various cases may be found in any introductory
textbook. We can combine all the possibilities in a single expression by writing the
1-4 Relative-Motion Symmetry 23
In this equation, / is the frequency of the stationary source, w is the speed of sound
in the medium, and u s and u are the speeds of the source and the observer, also with
respect to the medium. The signs of these last two quantities are such that u s is
positive when the source moves toward the observer, and u is positive when the
observer moves toward the source. (As a check, let source and observer move in the
same direction with the same speed so that u = — u s The result from Equation (1-7) .
and observer motion; instead, source and observer are governed by relative-motion
symmetry. Moreover, time intervals for the source and time intervals for the observer
differ in relative motion according to the time-dilation formula. There are several
ways Doppler result for light. Let us analyze the effect in the manner
to derive the
customarily used for sound but adapted suitably to apply to light.
We distinguish light and sound from the outset by adopting different notation. Let
us call the source frequency and the observed frequency v, and consider the case
i>
where the source moves toward the observer with speed u, as shown in Figure 1-12. Of
course, only the sense of the relative motion (approaching rather than receding) is
physically significant. Let S' be the source frame and S be the observer frame, and
consider the situation in the figure where two wave fronts are observed at instants a
full cycle apart. In S', the time interval T' between the emission of these wave fronts
Figure 1-12
inS
u ^
t = ©
Observer
t = T
L
x = uT cT
24 Relativity
is the period
1
T= -
v,
T
T= -7=
2 /„2
At t = 7" the observer notes the current position of the wave front, previously emitted
at t = 0, and measures its distance from the wave front of identical phase coming
from the moving source at the same instant. This measurement determines the
observed wavelength
A = cT- uT
as indicated in the figure. The observed frequency is therefore given by
A (c-u)T'
When we fold the time-dilation equation for T into this, we obtain the desired relation
between v and vn :
c + u
\c — u) 1 V
I 1 " —
c
= V
V c — u
"o-
The result is the Doppler formula for relative motion in the approaching sense. The
receding case is described simply by changing the sign of u. Note how the factor r,
defined in Equation (1-6), makes its appearance in the two situations. The quantity r
determines a red shift since, in the case of a receding source, the observer detects a
lesser red-shifted frequency given by
In astrophysics, the red shift for a receding emitter is expressed by the parameter
z = \/r — 1. The formula obtained above from Figure 1-12 applies to the case of an
approaching source, where the observed frequency is given by
approaching '
V ^/
sound by expanding the relevant formulas in powers of the ratio between the speed of
relative motion and the wave speed. It happens that the light formula in Equation
(1-9) is exactly the same as the sound formula in Equation (1-7) to first order in this
ratio. The differences between the two expressions only begin to appear when the
Example
Let us return to the rocket ship and space platform in Figure 1-10, and suppose
that "space twins" are born at the instant of coincidence between the rocket
origin in S' and the platform origin in S. The "twins" are Chester, whose
location is at the S' origin on the rocket, and Esther, whose position is at the S
origin on the platform. Let the relative speed between frames be u = |c, so that
has reached "age" 8 when he returns to confront his "twin." The famous twin
paradox asks why we cannot reverse the situation symmetrically to Chester's
point of view and argue instead that it is Esther who goes off on her platform
and then comes back as the younger of the two. In response, we can say that
relative-motion symmetry applies between 5 and S', and between S and S", but
does not apply over the whole excursion. Chester's entire motion is not symmet-
rical to Esther's, and so the age difference, whereby Esther stays at home and
The dilation of proper times of proper lengths are the first main
and the contraction
results of relativity. Now thatwe know how these proper intervals are altered as we
pass to other frames, it is appropriate that we generalize and ask next how arbitrary
26 Relativity
descriptions of space and time transform when we move from any one frame of
reference to another. Again, we are guided by the principles of relativity and
particularly by the universal property of the speed of light.
We are concerned with the space-time description of events. The word event has
already been put in use to denote a happening associated with a location in space and
an instant in time. Occurrences at the same spatial location are distinct events if they
happen at different times. In more abstract language, an event defines a point in
space-time. Observers in relative motion make different assignments of space-time
coordinates for such a point. Thus, a given event has space-time coordinates
(x, y, z, t) in S and (*', y\ z' , t') in S', where S and .S" are frames in relative motion.
Our problem is to deduce the transformation rule by which the coordinates in one
frame are determined, given the set of coordinates in the other frame.
Let us restrict our attention to frames in relative motion along the direction of a
common axis. Two such reference systems S and S' are shown in Figure 1-13, where
clocks at = and t' = when the two frames coincide. Then, at a later time / in S,
/
the primed origin at x' = is located at the point x = ut. The event shown in the
figure has coordinates (x, y, z, I) in and (x\ y', z', t') in .S". For reasons that we
.S'
have already discussed, the spatial coordinates transverse to the relative motion are
the same in the two frames, so that y' = y and z' = z in this case. Hence, the
relations at issue are those between the variables (x, t) and (x', t').
In pre-Einstein relativity, the time coordinates / and t' are presumed to be the
same for an event, in keeping with the notion of absolute time. Let us adopt t' = t
temporarily and pursue the consequences before we consider the true relativistic
picture. We can use the fact that x = ut when x' = to establish the equality
Figure 1-13
Reference systems S and S' in relative motion. An event occurs at (x, y, z, t) in S and
(*', y', z', t') in S'. In this view, S' is moving through S.
y'
t' =
x'
e = o
© Event
1-5 The Lorentz Transformation 27
x' = x — ut. The collection of relations among the space-time coordinates may then
be written as
=x—
x' ut,
/ = y,
*' = *,
t' = t. (1-10)
This particular system for relating the space-time descriptions of a given event is
relativity,which says that if the laws of mechanics hold in S, they hold also in S\
where S and S' are related by Equations (1-10). To see this, let the event of interest
denote the instantaneous position of a particle, and consider the velocity of the particle
as seen in the two frames. If we introduce the vector u = uxas the constant velocity of
S' in S and differentiate Equations (1-10) with respect to the time, we get the vector
result
d
v' = —dv'
- = — (r - ut) = v - u. (1-11)
dt' dt
S' We recognize that the components are not altered by the transformation because
.
the orientation of the axes is retained and the time variable is unchanged. We can
express this equality of components by writing F' = F and conclude that, if F = ma
governs the motion in S for a particle of mass m, then F' = ma' holds likewise in S'.
Thus, Newton's law has the same form in any two frames connected by the Galilean
transformation of coordinates.
Note that Equation (1-11) immediately becomes Equation (1-1) when the event of
interestis chosen to be a point on a wave front of light. We can therefore regard
28 Relativity
Figure 1-14
y'
t=
f = ~x'
(?) Event
because the relative speed between frames is u from either point of view and because
y depends only on u.
We can now connect the viewpoints in the two figures and determine y by
considering the propagation of a light signal. Let a light flash be emitted at the
coincidence of origins in the two frames when / = and t' = 0, and let the given
event at the later time represent a point on the propagating wave front. For simplicity,
we choose y' =y= and z' = z = so that the event occurs at a location on the x
and x' axes. The coordinates must satisfy x = ct in S and x' = ct' in S' because of the
universality of the speed of light. When these data are inserted in Equations (1-12)
and (1-13), the expressions become
u\ I
u
t' = y 1 - t and / = y 1 + - I
.
t = y(l + -I
y ' (1-14)
l/l
- u
2
/c 2
x = y (x — ut ) + yut'
2
1-Y
t = yt -\ x.
Y«
Equation (1-14) tells us that (1 - y 2 )/yu = —yu/c 2 and , so the final relation takes
the form
These results fulfill our objective, as Equations (1-12) and (1-15) serve to determine
the transformed variables (*', /') in S' from the given variables (x, t) in S.
The rules for finding the coordinates of an event in S', given those in S, are
cpllected as follows:
y = y,
:'
^3
II
X (1-16)
-A'- 7
t'
This famous set of formulas is known as the Lorentz transformation. Credit is given to
Lorentz because of his discovery, before Einstein, of the curious mathematical
property that the form of Maxwell's equations remains unchanged when the space
and time variables are substituted according to Equations (1-16) and when the fields
in the equations are also transformed by a suitable set of rules. Lorentz's contribution
isthat of a believer in the aether hypothesis and should therefore be regarded as a
mathematical observation without physical foundation. Einstein's theory gives status
to the contributionby providing the proper theoretical setting in which the transfor-
mation emerges logically from the principles of relativity. Einstein's famous paper of
1905 demonstrates over again Lorentz's observation that the transformation rules
leave the form of Maxwell's equations unchanged. Thus, the Lorentz transformation
30 Relativity
supplies the general mathematical language to use for the description of space and
time and for the formulation of dynamical principles.
The speed of light has a central role in all these proceedings. We can see that the
light speed represents a speed limit inasmuch as y in Equation (1-14) becomes infinite
in the limit u —> c. Hence, frames of reference may have relative motion at any speed
u, even u approaching c, as long as u remains smaller than c. Of course, our
experience makes us more familiar with the nonrelativistic regime where u is much
less than c. When we examine the Lorentz transformation in the nonrelativistic limit,
we find y —» 1 and t' -* t. Therefore, absolute time and the Galilean transformation
become valid approximations to Equations (1-16) in this regime.
The Lorentz transformation can be inverted so that we can pass from the frame S'
to the frame Equation (1-13) is evidently one member of such an inverse set of
S.
relations. The whole set may be obtained from Equations (1-16) simply by exchanging
primed for unprimed coordinates and changing the sign of ;/:
x = y(x' + id'),
y = /,
z = z',
Note that we can associate Equations (1-16) with Figure 1-13 and Equations (1-17)
with Figure 1-14. Of course, relative-motion symmetry assures us that both sets of
relations are Lorentz transformations with entirely symmetrical meanings. Frames of
reference are called Lorentz frames if their space and time coordinates possess these
transformation properties.
The Lorentz transformation is a general scheme of relations between any two
Lorentz frames. Let us gain familiarity with some of the transformation strategies by
confirming the relativistic effects that we have already investigated regarding the
relativity of time and space intervals and the relativity of simultaneity.
( x x
, /, ) in S and ( x' , t\) in S'
(x 2 , t
2 ) in S and (x' , t
2 ) in S'
in S'. When we apply Equations (1-17) to the two events, we obtain for the time
coordinates in S
t
2
= y\t'2 + —x'
and
u
l
\
= y\ t'\ + —x'o
1-5 The Lorenlz Transformation 31
A/ = yA/\ (1-18)
Since A<' is a proper time interval in this situation, the result describes time dilation as
in Equation (1-3).
To examine length contraction, measurement illustrated in
let us consider the
Figure 1-8. We want to determine the length of the moving rod
in S' by locating the
endpoints of the rod in that frame, where the two spatial locations are to be found at
the same time t' Associate an event with the simultaneous arrival of each end of the
.
(x[,t' )
and (x'2 ,t' )
for the arrival of the left and right ends, respectively. We can determine the
corresponding spatial coordinates for the two ends in S by applying Equations (1-17)
to each event:
x2 = y(x'2 + ut'Q )
and
*i = y( x {
+ ut'o)-
D
D= yD', or /)'=—, (1-19)
Y
in agreement with Equation (1-5). As in the earlier derivation, D is the proper length
of the rod and D' is its contracted length.
To examine the relativity of simultaneity, we
refer back to the thought experiment
in Figure where the soundings of two buzzers at opposite ends of the moving
1-4,
railroad car represent two spatially separated events, occurring simultaneously in S'
but not in S. Let these events be denoted in S' by the coordinates
(x[,t' )
and (x'2 ,t' )
for the buzzers on the left and right, respectively. Note that both events have the same
time coordinate in S'. When we apply Equations (1-17) to each event, we find the
time when each buzzer goes off in S:
u
{ , ,
and
/ "
t\ = y\ <<>
,
+ —> x
,
M = y^L'. c
(1-20)
32 Relativity
This result formalizes the qualitative description given to accompany Figure 1-4. The
distance L' is the length of the railroad car in S'. Since the car is at rest in S', L' is the
proper length of the car. In S, where the car is moving, the left buzzer is observed to
go off before the right buzzer with intervening time given by Equation (1-20).
Example
assume for convenience that S and S' are coincident at this instant. The two
shots are events, with coordinates
= Y /
8
(3 X 10 m/s)(l(T 8 s) 1
9m 3'
The motion of a particle is described by the time dependence of the particle's position
vector r(/). The velocity vector is then obtained by differentiating:
dx
y(t
dt
Figure 1-15
reference frame S, where the coordinates of the moving particle are (x, y, z, I). In
another Lorentz frame S' in motion relative to .S", the coordinates of the particle
transform into (x', y', z' , and so the description in that frame is in terms of the
t'),
transformed vectors r'(t') and v'(t'). The interesting feature is the transformation of
the time as we pass to the new frame. We have to take this into account when we
determine how v transforms into v'.
dx'
dt"
the limit of Ax '/At' for small space and time intervals. Let these intervals refer to the
space and time coordinates of events 1 and 2 defined by two successive locations of the
particle along its path. Then, from Equations (1-16), we obtain
A*' y(x 2 - ut
2 )
- y(.v, "I i
7
It - YM, - ~7*!
t
2
~ ~^x 2 \
Ax - u At At
u Ax
A/
7 Ji
2
34 Relativity
The result follows when the limit of vanishing time intervals is taken:
w-
-
(1-21)
1 -
One-dimensional velocities transform their speeds from v to v' according to this rather
complicated formula.
Two elementary checks may be made on
the result. Suppose that the particle has
constant speed and that S' moving along with the particle; we then have u = v so
is
that v' = 0, as expected. Suppose instead that the particle is at rest in S; in this case
v = and so !»'= — «, also as expected. A more critical check on Equation (1-21) is
made by applying the formula to the speed of light. If we set v = c we find
c.
1 - uc/c'
Thus, the light speed transforms into itself, in keeping with the property of universal-
ity.
Figure 1-15 also shows the more general case in which v, the particle's velocity
"x ~
-
1
~7
v
,.' >
°y~ uvx \
,(.
Vz
f _
(1-22)
Y|l --£
We can infer the first of these formulas directly from the derivation of Equation
(1-21). The second and third involve identical manipulations, so let us consider
only v' :
y
A/ __ y't
- y\ y-i
- 7.
:
u u
bt' t'- /
-
\
1[
y\t 2
72
x2 -y M.t^x,
/
u k \ . I u A.v
The result given in Equations (1-22) is obtained when the limit defining dy'/dt' is
taken. Note that v' and v are not the same, even though y' = y, because of the
invalidity of absolute time.
1-6 Transformation of Velocity 35
v' + ;/
Vx ~
1 + 2
C
v'y
°y~
>(
(1-23)
uv, *
Y 1
+
The formulas given in Equations (1-22) and (1-23) constitute the correct relativistic
rules for adding velocities. These results take the place of Equation (1-11) and of
course reduce to that equation in the nonrelativistic limit, when both u and v are very
small compared to c.
illustrated in Figure 1-16, inwhich S is the star frame and S' is the Earth frame. A
light ray emitted from the star at an angle 8 in S is received in S" at a greater angle
8', according to the following analysis. The components of the velocity of light in 5
are
c sin 6 + u c cos
v. = indJ
"*" v'
"y
=
u u
i
-sin 6 y 1 + -sin i
<
v[ c sin + u
tan#' = — = y-
oi
1
/
u
tan 6' = -7= tanfl+ . (1-24)
VI - u
2
/c 2
\ c COS0 I
It is clear from this expression that 8' is larger than 8. Therefore, we see that we must
36 Relativity
Figure 1-16
tor y.
ccos e
tip the telescope forward relative to the direction defined by 6, as indicated in the
figure. The classical treatment of the aberration of starlight employs nonrelativistic
velocity addition. This analysis is also sketched in the figure, in an inset taken from the
telescope's point of view. We note that the same answer results except for the presence
of the y factor in Equation (1-24). Consequently, there is no difference between the
classical and relativistic predictions to order u/c.
1-6 Transformation of Velocity 37
Example
u = \c. The problem is to find the speed of rocket 2 relative to rocket 1 and then
to find the time elapsed on rocket l's clock while rocket 2 passes. The figure
shows the motion of the rockets in two Lorentz frames; on the left the passing is
seen in the Earth frame S, and on the right the passing is seen in S', the rest
frame of rocket 1. The speed u' of rocket 2 in S' is obtained by inserting v = —u
and v' = — u' in Equation (1-21):
lu 2(f0 40
c.
2
1 + u /c' 1 + 16/25 11
Let the length of rocket 1 in S' be denoted by / '; this specifies the rocket's
Figure 1-17
Two rocket ships passing each other. The proper length of each rocket is /„'. In S, the Earth
frame, both rockets have speed and contracted length ( Passing takes place in S between
u .
times t = and t — /, = f/u. In S\ the rest frame of rocket 1, oncoming rocket 2 has
speed u' = 2u/{\ + «-' /c~) and contracted length /'. Passing in S' takes place from l' = to
t > = t[ = (^ + /')/»'•
t=
f =
-<3
1 l_S'
-( V
o
t= U =77 f = (
1 "
^-<3
*o- e ' f
- o
38 Relativity
4 + <" 20 +
<;
= m/c = 25 m/c,
hi
n
using meter/c as a convenient unit of time. These results complete the problem.
Let us confirm the answer for t[ by analyzing the motion in S, as shown on the
left side of the figure. In this frame each rocket has the contracted length
t = / Vl - u
2
/c 2 = (20 m)(f) = 12 m,
( 12m
t, = - 15 m/c,
as indicated in the figure. This represents a proper time interval because it gives
the time elapsed between two events at the same point x = in S. Therefore,
the corresponding time inteival in S' is dilated relative to /,:
/; =(l5mA)(f) = 25mA,
- 2 ;
/I u /c
1-7 Space-Time
Relativity dissolves the boundaries of definition between space and time, and mixes
these variables together in the passage between Lorentz frames. We see this most
clearly in the Lorentz transformation, where one observer's time for some event is
related to another observer's time and position for the same event. We have already
adopted a notion of space-time as the setting in which events happen. Now we want
to extend the idea so that we can visualize several Lorentz frames at once, and let the
space and time coordinates mix accordingly.
Let us first introduce some new notation by setting
(S = - and y (1-25)
c 1
y' = y,
Z' = 2,
The resulting set of formulas employs coordinates (.v, y, z,ct) that have the same
dimensions of length and display an obvious interchangeability in the variables .v and
ct.
2
s
2
= x +y + 2
z
2
-c 2 t
2
. (1-27)
A primed version of this is also defined by the coordinates (*', y', z', ct') for the same
event in another frame. We can use Equations (1-26) to relate s' and s:
^=x ' 2
+y' 2 + Z
' 2
- c
2
t'
2
= y (x
2
- Pctf+y 2 + z
2
- y (ct
2
- (3xf
= y
2
(l-p 2
)x
2
+y 2 + z
2
-y 2
(l-(] 2 )c 2 t
2
= x2 + y2 + z
2
- c
2
t
2
The remarkable result is that s is not changed in the transformation to the new
Lorentz frame. Therefore, s has the same value in all Lorentz frames and is said to be
a Lorentz-invanant quantity.
r
2
= x
2
+ y2 + z
2
.
This distance does not change when we perform rotations of the three-dimensional
coordinate system. Similarly, we can regard j as a "distance" in four-dimensional
space-time between the origin and an event (x, y, z, ct). Lorentz invariance then says
that this "distance" remains unchanged when we carry out Lorentz transformations
on the space-time frame. The negative sign is crucial for the time contribution in
Equation (1-27) since Lorentz invariance would not follow without it. In a given
frame space and time have distinct meanings, and the minus sign for the time
highlights that distinction. The sign tells us that Euclidean geometry does not hold in
our four-dimensional space-time. The interpretation of s as a distance must therefore
be made advisedly since s is not a positive quantity in all cases. We may extend the
meaning of Equation (1-27) by considering two events and defining the space-time
interval As between the corresponding points in space-time:
2 2 2 2 2
(A*) = (A*) + (Ay) + (Az) - c
2
(A0 - (1-28)
This expression is Lorentz invariant and may be either positive or negative. Space-time
can present a complicated picture owing to the mixing of coordinates in the passage
from frame to frame. It is very useful to know that such features of the general picture
as the space time interval remain unchanged during the mixing process.
A full visualization of space-time requires a four-dimensional picture. Let us
forego this complication by concentrating on problems for which there is only one
spatial dimension. Consider the straight-line motion of a particle whose path, x versus
t, lies in the plane of the variables (x, ct), as in Figure 1-18. The locus of points on the
40 Relativity
Figure 1-18
t=
rt,
path is called the world-line of the particle. The figure shows the motion beginning at
x = when = and proceeding at
t constant speed v until t = t
l
. Thereafter, the
world-line may continue as a straight line, if the particle does not accelerate, or may
curve downward or upward if the particle speeds up (a > 0) or slows down (a < 0).
Note that the slope at any point along the world-line is
del c
'
dx dx/dt
where dx/dt is the instantaneous velocity of the particle. This slope must always
exceed unity for a moving particle and may approach unity from above as the
particle motion approaches light speed.
We can use such diagrams in the (x, ct) plane to visualize various kinds of events in
their space-time setting. It is especially useful to see how the interpretation of the
diagrams can be extended by defining different pairs of axes corresponding to
different Lorentz frames. We let the axes denoted by (x, ct) pertain to the particular
choice of frame S, and we consider the transformation to another frame S' by means
of Equations (1-26) so that we can introduce new axes in space-time labeled by
(x\ ct'). The orientation of the new primed axes in the plane of x and ct is established
as follows. Equations (1-26) tell us that along the ct' axis we have
so that ct = — ,
1-7 Space- Time II
The lines ct =and ct = /8* are drawn in Figure 1-19 and labeled as rf' and x'
x/fi
Note that these axes are not perpendicular but skewed. Each axis is
axes, respectively.
rotated inward toward the 45° line by the same angle a, where tan a = u/c = ft. The
axes approach the 45° line from either side in the limit u —* c and (5 -> 1, as the speed
of the frame S' approaches light speed in its uniform motion through S. A given event
at (x, ct) in S and at (*', ct') in 5" can be plotted with respect to either set of axes.
Because the primed axes are skewed, the projection of primed coordinates parallel to
these axes differs from the usual method of dropping perpendiculars onto the axes. We
should not be surprised to encounter such non-Cartesian properties since we have
already noted the non-Euclidean nature of space-time. The space-time interval
between the origin and the event indicated in the figure is given by
We may also consider a third frame S", which, like S', moves through S at the same
speed u but does so in the opposite direction. The coordinates (x", ct") in S" are
found by formulas identical to Equations (1-26) except for the replacement u — — u.
>
Space- time diagram showing three pairs of Events in a space-time diagram with axes for
axes for three different Lorentz frames. The two Lorentz frames S and S'. Events 1 and
transformed axes are skewed, inward for S' occur at the same spatial location in S'.
moving through S along the positive x axis, Events 2 and are separated by the
and outward for S" moving through S along propagation of a light signal. Events 3 and
the negative x axis. In the space-time are simultaneous in S'.
diagram, the x' axis is along the line ct = fix,
ct" ct ct
Event
u -^ *i
42 Relativity
Accordingly, the axes in the space-time diagram are rotated by the same angle a in
the outward direction, as shown in the figure.
Example
2 2 2
(uAt) - (cA<) = -(cht')
and solve for A/ to recover Equation (1-3), the time-dilation formula. Note how
the non-Euclidean geometry may tend to deceive us here; despite appearances
in the figure, A< is larger than A/'. Events 2 and are separated by the
propagation of light. Note that event 2 satisfies x = ct and x' = ct', and so a
light flash emitted at can propagate outward and be detected later at a
location on the x and x' axes as event 2. Events 3 and are observed in S' to be
simultaneous since both occur at t' = 0. Note that event 3 follows event in S
by the indicated time interval /,. The relation between these events is the same
as that in the example of the hunters and the game warden in Section 1-5.
Example
As a finale, let us return to the twin paradox and reconsider the motion of the
space twins Chester and Esther at the end of Section 1-4. Recall that Chester has
gone on a rocket trip away from Esther on at speed u = jc and, after 5 years
Esther's clock, abruptly turns same speed. We
around and comes back at the
know that the problem involves three Lorentz frames, Esther's frame 5 and
Chester's frames S' on the way out and S" on the way back. Figure 1-21 shows
the axes for these three frames in a space-time diagram, where 5 is chosen as
the frame at rest. We use years and light-years for the unwritten units of time
and distance in the diagram and in the following discussion. (Recall that a
light-year is the distance traveled by light in 1 year.) When / = /' = /" = 0, the
World-lines for Chester and Esther are also shown in the figure. Esther remains
at x = for 10 years in S, while Chester stays at x' = for 4 years out in S'
and then changes his address to x" = for 4 years back in S". Chester's clock
reads /" = 8 upon reunion with Esther. We have reached this conclusion from
Esther's point of view by staying in the single Lorentz frame S and observing
that Chester's time runs slower. Thus, we have recognized that her time is
dilated by the factor \ relative to the proper time in his frame, whether in S'
going out or in S" coming back. The puzzling aspect of the twin paradox arises
1-7 Space -Time 43
Figure 1-21
fi'i
£l
t"=
t' = x' x"
t= 6 t-v
44 Relativity
Chester's world-line where he reverses his rocket and changes over from S' to
S". Two drawn through the space-time location of this event, one
lines are
parallel to the x' axis and another parallel to the x" axis. Along these two lines
events occur in S' and in S" that are simultaneous with Chester's turn-around
event. On Esther's clock, the times for these events occur 3.2 units from the
beginning and 3.2 units from the end of the trip, as indicated in the figure. Thus,
the diagram reveals a simultaneity gap of duration At = 3.6, which Chester must
include in his bookkeeping to account for all of Esther's time. Of course, the
calculation is much easier in S where Chester's turning point is simultaneous
with the midpoint of Esther's world-line. We have appealed to the relativity of
simultaneity to resolve the paradox, and we have found the space-time diagram
to be an indispensable aid in visualizing the argument.
well. The problem is to deduce the proper relativistic expressions for momentum and
energy so that the corresponding conserved quantities are respected by the Lorentz
transformation. We know that these expressions must also reduce in the nonrelativistic
limit to the familiar forms in Equations (1-29).
The most direct way of presenting this problem is simply to divulge the answer
immediately and then demonstrate the properties of the solution. In this spirit we
assert without further ado that the relativistic momentum p and the relativistic energy
E are given for a particle of mass m by the formulas
OTV
P= ,
- 2
(1-30)
/l v /c 2
1-8 Relatmstic Momentum and Energy 45
and
mc"
E= 9 ,
(1-31)
,
/l - v
2
/c 2
The momentum obviously goes over into its nonrelativistic form for v <s: c. The
energy E is a new kinematical quantity whose meaning is to be examined in due
course.
The underlying strategy should be spelled out clearly so that these assertions can be
grasped. We expect observations of momentum and energy to yield altered values
when the observations are made in a different Lorentz frame. Our strategy focuses on
the proper behavior of these kinematical quantities under transformation to any such
new frame. We know that Equations (1-26) describe the passage from one space-time
frame S, where the coordinates are (x, y, z, ct), to another space-time frame S',
where the coordinates are (x', y', z', ct'). Let us set up a correspondence of properties
between the momentum and energy variables and the space and time coordinates by
first assembling the kinematical quantities into the fourfold set (p x , p , p., E/c). (We
use E/c instead of E for the fourth variable so that all four quantities in the set have
the same units of momentum. This convention parallels our use of ct instead of / in
Equations (1-26).) We then assume that these four variables in the frame S transform
into a new set (p'x p', p': E'/c)
,
, when we pass to the new frame S' , and require that
the transformation rules have exactly the same form as those used to relate (x, y, z,ct)
and (x', y', z', ct'). The proposed relations are just like Equations (1-26):
p: = p„
E' IE
— = y I--J8/.
\
I, (1-32)
where and y are again given by Equations (1-25). These relations specify the
/?
particular that p and E are mixed in the passage to the new frame just as x and /
t
are.
The argument behind Equations (1-32) should be restated for emphasis. We want
to guarantee the transfer of conserved quantities from one Lorentz frame to another
and ensure specifically that, if p and E obey conservation laws in S, then p' and E'
automatically obey the same conservation laws in by arguing
-S". We accomplish this
Y,.=
'
(1-33)
i 2
as a shorthand device and note that y depends on the relative speed u between frames
5 and S' while yv depends on the velocity v of the particle in the frame S. Thus, y is a
constant parameter since u constant, while yB is a dynamical variable that varies
is
with time in situations where the speed of the particle is changing. Equations (1-30)
and (1-31) take the form
p = yv m\ E= 2
and y„mc
are related in Equations (1-22). We can use these formulas to establish a remarkable
algebraic relation among the y's:
y
/
\
a - P-]
c
E \
= yyM v -
I
x ") = y,,-™—
1
vx
—
: ——
—
uvx /c
u
>
= y^ mv ' = P'
x -
We have used, in turn, Equations (1-30) and (1-31), (1-33), (1-34), (1-22), and finally
(1-30) again in its primed form. The last of Equations (1-32) follows in similar
fashion:
y(E - ficpx )
= yy„m(c 2 — uvx )
= y^mc 1 = £".
Here, we use the same series of formulas concluding with the primed version of
Equation (1-31). These two derivations along with analogous ones for p' and p'.
furnish the desired proof that p and E in Equations (1-30) and (1-31) indeed
transform under Lorentz transformation according to Equations (1-32).
Our discussion has focused on transformation properties, and so the presentation
may seem somewhat abstract. We should realize that relativity is a treatment of
frames of reference where rules for the transformation between frames are essential
considerations. We have acknowledged this by introducing relativistic momentum and
energy in context with the Lorentz transformation. Now that these abstract conceptual
features have been presented, we can proceed to the physical interpretation of the new
dynamical quantities.
It is obvious that p and E should depend on the velocity of the particle. We can
eliminate the explicit dependence on v and relate p and E directly to each other by
squaring and comparing expressions. The result is a very important equality known as
the relativistic relation between energy and momentum:
E2 = c
22
p + m 2c\ (1-35)
1-8 Relativistic Momentum and Energy 47
We have yet to identify the meaning of the relativistic energy E. Let us note first that
E has a definite nonvanishing value, even when the particle is at rest. If we set p =
in Equation (1-35), we obtain
E rest
= mc\ (1-36)
the minimum energy possible for a free particle. This quantity is determined by the
mass of the particle and is called the rest energy. If we let v be nonzero but very small
2
compared to c, we can approximate yv by expanding in powers of (v/c) :
2 \ —1/2 2
V \ ' V
'
+ '"-
2^
E= mc + -,mv
2
(nonrelativistic)
to order (v
2
/c 2 ). We recognize the two contributions to be the rest energy and the
Newtonian kinetic energy. This calculation is useful only for small v/c. The result can
be generalized for any value of v by defining the relativistic kinetic energy as
-1 . (1-37)
A graph is shown in Figure 1-22. The effect of the speed limit is seen
of this expression
clearly in the definition and in the figure. The kinetic energy of the particle is not a
real-valued quantity and is therefore undefined for v > c. The nonrelativistic version
of K is an adequate approximation to Equation (1-37) for !»<c but departs from the
exact expression and eventually violates the speed limit as v grows without bound. The
interpretation of the relativistic energy may be summarized by writing
E= mc 2 + K. (1-38)
This formula is valid for any v < c and consists of two separate pieces, the one
associated with the mass of the particle and the other associated with its motion.
Equations (1-30) and (1-31) suggest the identification of a mass" in the
"relativistic
combination of factors yu m. This quantity grows as v increases and becomes infinite as
v -» c. The factor m by itself is sometimes called the "rest mass" to distinguish it from
the ^-dependent relativistic mass. Figure 1-22 indicates how a practical application
may be made of this nomenclature. We know that the kinetic energy of a particle can
be increased by applying a force and doing work. The work done on the particle
causes a gain in the particle's relativistic mass along with the expected increase in
velocity. The figure shows that we cannot accelerate the particle to indefinitely large
velocity since there is a speed limit at v = c. Close to the limit, the increase in
relativisticmass becomes the dominant effect as further increase in velocity becomes
more and more restricted. It is obvious from the figure that an infinite amount of work
is required to accelerate the particle all the way to light speed. We have already noted
that the relativistic mass approaches infinity in this limit, while the rest mass remains
fixed throughout the acceleration process. Now that we have acknowledged the
48 Relativity
distinctive behavior of "relativistic mass," let us forego the terminology hereafter and
always refer implicitly to "rest mass" whenever we speak of mass from now on.
The formula E = mc 2 appears frequently in popular science writing. In context,
the usage refers generally to a nuclear process in which a small portion of the mass of
an atomic nucleus is converted into a large amount of energy. Prospects for obtaining
energy from mass have their origin in one of Einstein's thought experiments. His
hypothetical process involves a mass at rest spontaneously changing into a lighter
mass, also at rest, with the simultaneous emission in opposite directions of two
high-energy electromagnetic waves, called y rays. The described process
A B + y + y
is shown schematically in Figure 1-23, where A and B denote two species of atomic
nuclei. The sum of the energies carried away by the two y rays is evidently given by
A£ = kMc 2
,
77"° —* y + y,
in which one of the elementary particles, the it meson, is observed to change into y
rays and convert all its mass into y-ray energy.
in this case. Equations (1-30) and (1-31) make sense as indeterminate forms in the
limit m —* 0, provided the simultaneous limit v —> c is also taken. Thus, massless
particles may exist provided they travel at the speed of light. Such particles cannot be
found any frame and must have speed v = c in all Lorentz frames. The
at rest in
limiting behavior of energy and momentum E — and p —» may be observed for a >
massless particle provided the small values occur together in the appropriate frame
obeying Equations (1-39).
The first of these two equations may be somewhat familiar. We know from
Maxwell's theory that electromagnetic waves carry energy and momentum according
to the properties of the Poynting vector. The energy and momentum of the radiation
satisfy a relation identical to the equation E= cp. The possible massless-particle
aspects of the radiation remain to be explored in Chapter 2.
Example
formulas, and not by Equations (1-29), whenever the kinetic energy of the
particle is comparable with its rest energy. To illustrate, let us consider beams of
accelerated electrons and protons and take the approximate values 0.5 MeV and
1 GeV for the respective rest energies. (Recall that the electron volt eV is a unit
of energy equal to that acquired by a single electronic charge accelerated in a
potential difference of 1 volt.) It is clear that a 1 MeV beam kinetic energy is
relativistic for the electron but not for the proton and that a 10 GeV beam is
v
- =
/(c(k + 2)
, where k = — A'
- .
c k + 1 mc
v
- = v2k when k •« 1
v
k = 2 and - = \l - for electrons,
c V 9
/hile
v / 1
v / 100020000
k = 10000 and - = \ = 0.999999995,
c V 100020001
.
50 Relativity
V / 120
k = 10 and \\
= 0.996.
c V 121
V 1 1
- = 1
- to order —7
c 2^ K
v 1
H
" 2k 2 ' = 0.5 x 10 forx = 10
4
,
Classical dynamics begins with concepts of force and mass in the context of Newton's
laws of motion and then proceeds to the conservation laws of energy and momentum.
The empirical approach is taken to establish the equations
F = ma and F = —
dp
dt
(classical)
as principles that govern the behavior of particles. This scheme is internally consistent
and amply confirmed by experiment, as long as the particles have velocity much
smaller than c.
Our treatment of relativistic dynamics begins with a different premise based on the
conservation of momentum as a first principle. The conservation of relativistic energy
follows logically from the same starting point. We have introduced these ideas in
Section 1-8 and we are now prepared to take up the concept of force, again pursuing
the empirical approach to the dynamical equations. We turn to this question next,
after we have made the following observation about the two conservation laws.
Let us consider a process in which an initial system of colliding particles reacts to
form a final system of emerging particles. The particles in the final state of the
reaction do not have to be the same We suppose that an
as those in the initial state.
observer S determines the total momentum and total energy, before and after the
in
reaction, and records the results as (P bl fort., ^before) anc (Pa fter> ^after)- Differences in
.
^
.Hid
The transformation rules in Equations (1-32) can then be used to deduce the
corresponding differences determined by an observer in another Lorentz frame S' in
motion relative to S. The relations are
A£
ap; = y
/
ap, - p— \
,
ap/ = ±py ,
ap; = ap,
AP' / AP \
=y /?AP I
(1-40)
m\
¥= —
dp
= -
d
.
-
(1-41)
dt dt /l D 2/ f 2
governed by
[dp dp
W= J
r*2
Fdx = I —vdt = jvdp =
r /•'..
J
v — dv,
.
52 Relativity
where y, and v 2 are the velocities of the particle at x, and x 2 - We then use
2
dp d
(
dy\ I v
:
dyv d 1 v/c
~& -
' ~dv /i v */ c 2 '
(1 -v 2 /c 2 Y
II
r mo
(i-, 2 A 22\V
>
)
/
TTjdv
v
2 /
/c
:
= £,-£,.
W= K 2
- A', (1-43)
Thus, the work done by the force on the particle is equal to the change in the
F = e\ X B. (1-44)
Figure 1-24
1
.
As usual, this force does no work since F and v are perpendicular vectors:
2
W= '
( F • dr = (~F •\dt = 0.
It then follows from Equation (1-43) that the kinetic energy is constant, and so the
speed of the particle is also constant. Therefore, yv is a constant parameter in
Equation (1-41), and the equation of motion takes the simpler form
dv
fvXB = yv m — dt
(1-45)
v is initially perpendicular to B when the particle is injected into the field, the three
vectors a, v, and B remain mutually perpendicular throughout the motion. We
recognize that a particle orbit with these properties must be planar and circular, as
indicated in the figure. The familiar centripetal acceleration results, with magnitude
a = v~/R, where R is the radius of the circular orbit. We insert all this information
into Equation (1-45) to get
2
v
evB = y, m — ,
eBR = yv mv = p.
This expression for the relativistic momentum is identical in form to the nonrelativistic
result. The measurement of p then follows directly from measurements of B and R.
Modern physics is concerned with the structure of matter and the interactions of the
constituents of matter. These basic properties of nature can be investigated by probing
systems of particles in collision processes. Thus, we are able to "see" the structure of
the atom and by exposing the systems to beams of suitable particles with
the nucleus
appropriate incident energies. Studies of matter on an even smaller scale are accom-
plished in collisions at relativistic momenta. The larger collision energies make
possible the disintegration of the nucleus and the production of new elementary
particles. These investigations at high energy are analyzed according to the principles
of relativistic kinematics. The analytical procedures are in essence exercises in the
application of momentum conservation and energy conservation. It is important to
realize that we do not have to understand the specific nature of the forces between
particles in order to implement these conservation laws.
To begin our survey of relativistic kinematics, let us reconsider the conversion
between mass and energy and emphasize that mass does not have to be conserved. We
are guided by the principle of conservation of the total energy, where we construe the
energy to include all possible manifestations. Consider the collision shown in Figure
1-25 in which the two colliding particles are assumed to have the same mass m and
speed v. The particles come together from opposite directions and interact in a
completely inelastic collision to form a single mass M. We refer to the process in these
54 Relativity
Inelastic collision in m + m — M.
which > Inelastic collision in a conservative classical
Conservation of momentum and energy imply model. The initial kinetic energy is converted
that mass is not conserved. into potential energy stored in the compressed
spring.
.0_Q_Q_Q_Q_Q.
m jOJJflfl(
m
terms since all the kinetic energy of the initial system disappears in the collision. The
finalmass M is produced at rest because the conserved total momentum is zero. We
determine this mass by invoking conservation of the total energy in either of the two
forms
mc 2
Mc 2 = 2
l/l - v
^/c=
2 2
(1-46)
or
Mc 2 = 2mc 2 + 2K,
where K is the kinetic energy of each initial particle. It is obvious that the final mass
M exceeds the total initial mass 2m as the initial kinetic energy is entirely converted
into the increase in mass.
Let us imagine a classical model for such an inelastic collision in which the masses
stick together and yet only conservative forces are involved in the balancing of all the
energy. We may use the massless ratchets-and-spring device in Figure 1-26 for this
purpose, and assume that the colliding particles lock onto each other with negligible
friction and compress the spring between them. The spring absorbs, and stores as
potential energy, the total kinetic energy brought into the collision by the initial
particles. Our relativistic treatment simply identifies the final aggregate as a new
system having its own mass M, whose rest energy includes the effect of the potential
energy modeled by the compressed spring. If we remove the spring from the classical
model, we conclude that all the initial kinetic energy is dissipated in friction and shows
up as heat in the final system. Again, the relativistic treatment regards this quantity of
heat as part of the final rest energy Mc'. Thus, even thermal energy can be associated
with an identifiable mass from the viewpoint of relativity.
Next, let us reverse the completely inelastic process and consider the breakup of M
into two unequal masses, as shown in Figure This phenomenon is realized
1-27.
physically in the spontaneous fission of a nucleus into two fragments and in the
two-body decay of an unstable nucleus or elementary particle. Since M is at rest,
momentum conservation demands that m and m 2 have
x
opposite momenta of the
same magnitude p, as indicated. Conservation of energy then requires that
£, + E2 = Mc 2
1-10 Collisions and Reactions 55
-2*,2
c
c c „2 4
+ m\r + \ic
i i
+ 2,4
m\c
w 2
= Mc
p p
2
with the use of Equation (1-35). It is not difficult to solve this relation for p , the only
unknown quantity, and obtain
2
[M 2 -{m +m f\[M -{m -m 2 f\
x 2
2
x
P = (1-47)
\M'-
Thus, the momentum of each particle in the final state of the two-body breakup
process is uniquely determined by the values of the participating masses M, m,, and
m2 . Conservation of energy may also be written in terms of the kinetic energies of m,
and m 2 :
m x
c
2
+ K + m 2c 2 +
x
K 2
= Mc 2 .
The difference in rest energy between initial and final states is called the Qvalue for
the given process. In this instance, the ()-value has the form
Q = (M - m, - m 2 )c 2 = K + K2 x
. (1-48)
Note that the kinetic energy in the final state is obtained from the excess mass in the
initial state.
Let us now introduce Lorentz frames in our discussion of the breakup process. For
simplicity, we take the masses of the two final particles to be equal and consider the
two-body decay M— > m + m described in Figure 1-28. The upper part of the figure
shows the decay of M at rest in the Lorentz frame 5". Note that this view of the process
is the reverse of the collision in Figure 1-25, and so Equation (1-46) gives the correct
is
left at rest.
®
S'
^K5 &^
56 Relativity
relation between the indicated speed vand the masses of the particles. The Lorentz
transformation comes into the picturewhen we also consider the decay of in flight M
and introduce another Lorentz frame S in which is a moving particle. This version M
of the process is illustrated in the lower part of Figure 1-28 for the special situation
where one of the final particles happens to be produced at rest. Our problem is to
determine the velocity v of the other final particle in the frame known
S, given the
velocity v in the frame S'. Special circumstances characterize this problem; the initial
mass is at rest in S', and one of the final masses has speed v in S' and is at rest in S.
These conditions tell us that S' must be moving through S wifh speed u = v and that
M must be moving in S with the same speed, as shown in the figure.
We analyze the problem by using the Lorentz transformation to relate momenta
and energies between the frames S' and S. The transformation from S' to S is
performed with formulas inverse to those in Equations (1-32):
/ E'
E E'
- =
y[
I
— +£/,;• (1-49)
Let i's apply the first of these relations to the final particle moving to the right in the
figure. In S' we have
P'x
= yv mv an d £' = Y, W(:
">
and so
~ Yv mv ~"Vv mc
=
Px M ~*~
2y,r'»»'.
px = y-M>-
These two expressions for p imply an equality between the factors 2y,r^ and
x
y-y.
t
The
desired solution for v emerges from this equality after a few algebraic steps:
2v
r,= (1 -
50)
rr^7^-
+ v/c
1
We should recognize the result. Equation (1-50) expresses the relativistic addition of
velocities according to the first formula in Equations (1-23).
Our point in this discussion of Figure 1-28 is to emphasize the fact that the two
indicated decays, at rest and in flight, are one and the same phenomenon viewed from
two different Lorentz frames. It is evidently sufficient to understand the decay at rest
since the decay in flight can then be analyzed by applying the Lorentz transformation.
Many interesting phenomena are observed in relativistic collisions as a consequence
of the conversion between mass and energy. The conservation laws allow processes in
which particles collide and produce altogether different systems of particles in the final
state. These reactions must conserve the total momentum and the total energy, so that
1-10 Collisions and Reactions 57
the equations
govern the production of new particles. There also exist other conservation laws that
pertain to other attributes of the interacting particles. An assessment of all possible
conservation laws in a particular reaction constitutes a certain view of the interactions
responsible for the process, where the observations are made at a great distance from
the scene of the reaction. The subscripts before and after in Equations (1-51) are meant
to convey this sense of the observation procedure.
Figure 1-29
the CM frame.
CM frame S'
Lab frame S
58 Relativity
The first equation is a statement of velocity addition in Galilean relativity, and the
second is a statement of nonrelativistic momentum conservation in the CM frame. We
can eliminate v' between these two equations and obtain the desired relation between
u and v.
m
u = v (nonrelativistic). 1-52)
m + M (
To conclude the digression, let us recall that the center of mass of the Newtonian
system is a well-defined point, located between m and M, whose speed remains
unchanged before, during, and after the collision. The relativistic transformation
between S and S' calls for more complicated equations. We express velocity addition
and momentum conservation by writing
v — u mv' Mu
v' = and
It is obvious that the elimination of v' is more laborious in this case. The result is left
mv
m + M\j\ - v
2
/c 2
Of course, the final formula goes over into Equation (1-52) for v «c c.
We have only sketched this route to Equation (1-53) because we now want to show
the simpler recommended way to obtain the speed u. Let us return to Figure 1-29 and
identify the total momentum and total energy for the two-body system in the lab
frame S:
mv mc
Ptcai - "7= =T7T and £«- = = +Mc2 (1 - 54)
\jl - d/c yi
T7T
- »7c
-
^ total
2
u mv /l/l - i' /c-
c mc'/p - v
l
/c l + Mc'
I 10 Collisions and Reactions 59
when Equations (1-54) are used, and the desired Equation (1-53) follows directly from
this result. We note in passing that the relativistic relation between S and S' does not
refer to any point called the "center of mass." Instead, S' is characterized by the
momentum condition P ^ = 0. It would t tal
be more fitting to call S' the "center of
momentum" frame for this reason.
Example
l + yi - v
2
/c 2
This relation gives u, the speed of the CM frame through the lab. We can turn
the equation inside out and solve for v, the speed of the beam particle in the lab:
2a
+ ir/r
The result is easily understood by noting that the two protons must have the
same speed given by u in S', the CM frame. The expression for the speed v of
the beam proton then follows directly from velocity addition. We have found it
partner
'
i? t jta ,
, the total energy in the CM frame. The second of Equations (1-49)
gives the relation between the total energies in the two frames:
£ o«a. = y(Ku*
t
+ ^p ; t tal )
= y£ ; t tal ,
where
Mc 2
EL, = 2 = 2yMc 2
yl - Myc-2
F'total
L-
2Mc 2
E; 2 a] = 2Mc% otal .
£,'otai
= 2(10 GeV + 1 GeV) = 22 GeV.
60 Relativity
The related value of the total lab energy is quite large because of the quadratic
relation between the two quantities:
2
£/!, (22 GeV)
£«.,
'total
= t-^7 = = 242 GeV.
2 Mr- 2 GeV
The beam proton must therefore have a correspondingly large kinetic energy in
the lab frame, where the target proton is at rest:
K=£ totaI
- 2Mc 2 = 240 GeV.
1-11 Four-Vectors:}:
Relativity invites us to think in terms of four dimensions where the fourth dimension
refers to the time. We already know and time is
that a geometrical conception of space
useful for the presentation of events and world-lines. We now want to incorporate
relativistic momentum and energy and expand the presentation into a more fully
y
(1-56)
ict
$The methods and results of this section are needed again only in isolated parts of Chapters 2
and 16.
/-// Four- Vectors 61
A different Lorentz frame S' corresponds to another set of space and time axes in
Minkowski space. The given event is located in S' by means of a primed four-vector,
with coordinates (x\ y', z', id') and column matrix
(1-57)
id'
Thus, we use o to specify some event in one frame S, and we use a' to specify the same
event in another frame S'.
The Lorentz transformation relates the components of o and o' according to
Equations (1-26). We can rewrite these relations in more compact notation with the
aid of the matrix equality
x Y iyfi x
/ 10 V
10 (1-58)
id' iyfi y id
(It is important to appreciate the equivalence between Equations (1-26) and Equation
(1-58). Similar techniques of matrix multiplication are put to use freely throughout
this section.) We take advantage of the simplifying matrix notation to cast Equation
(1-58) into its final concise form,
= 2>o (1-59)
in which =£? denotes the indicated four-by-four matrix parametrized by the constants
ftand y. This square matrix depends only on the speed u of S' moving through S. It
therefore contains all the information needed to relate position four-vectors for any
given event as observed in the two Lorentz frames 5 and S' The highly streamlined .
formula replaces a cumbersome set of relations and thereby facilitates our interpreta-
tion of Minkowski-space geometry.
We are immediately brought back to familiar ground when we evaluate the
T 1
product of matrices o o (where o denotes the transpose of the matrix o):
X
y
i
nl = ^+y + z- 1-60)
id
x'
2
+/ 2
+ z' :V 2 . 1-61
(1-62)
62 Relativity
The main reason for the imaginary number i in the time coordinate of o is now
apparent. We need the i to secure the vital minus sign in the c
2
t
2
contribution to
T
o o.
This sign then ensures Lorentz invariance in the form expressed by Equation (1-62).
Our strategy may be summarized and evaluated as follows. We have two space-time
frames S and Minkowski space, and we are given a specific event for
S' in
that the "length" of the original four-vector is preserved in this operation. It follows
that the Lorentz transformation can be interpreted as a rotation of the axes in
Minkowski space, since rotation preserves length. The Lorentz transformation of
interest refers to motion of S' through S along the x direction, and so the correspond-
ing rotation involves the x and ict axes in the four-dimensional space. This remark-
able correspondence between frames in relative motion and rotations in space-time
can now be used to good advantage.
Thegeometrical interpretation of Lorentz invariance translates directly into a
characterizing property of the transformation matrix ££ Let us return to Equation '.
.'T = o
T££'>
when Equation (1-59) is also used. The product of matrices y T££ evidently has the
same effect as the four-by-four unit matrix on any four-vector o. The matrix equation
<e
Tse= i (i-63)
follows from this observation. We can verify the equality by explicit matrix multipli-
cation:
Y ) -iyfi Y
1 1
') (i
1 1
lyfi 3 y _-iyfi Y
Vo-zn
1
J - 1
Y (1 a
Since y
2
(\
)
—
= 1, the unit matrix is obtained and Equation (1-63) is confirmed.
fi
2
b
y
bz
ib.
/-// Four- Vectors 63
6'=<e&, (1-64)
where ¥
is the same transformation matrix used in Equation (1-59) to pass from S to
A
Py
/ (1-65)
Pz
iE/c
we then know from our discussion in Section 1-8 that ft meets the necessary
transformation criterion. The components of ft give the momentum and energy of a
particle as observed in the frame S. These quantities transform to the frame S'
according to Equations (1-32). The transformation rule has the form of the matrix
equation
ftt=Seft, (1-66)
as required for a four-vector. In fact, we have anticipated this desired behavior in our
discussion of the properties of relativistic momentum and energy. Note that the
imaginary factor i is needed in the fourth component of
ft
to make the transformation
of the momentum four-vector come out correctly.
We can now begin to appreciate the power and elegance of four-vectors. Let us
start by showing that the Lorentz-invariance condition in Equation (1-62) holds ior ft
as it does for a:
T T
ft' ft'=ft ft. (1-67)
7
This statement is easily proved by multiplying through Equation (1-63) with ft
on
the left and ft
on the right, and by using Equation (1-66). The resulting equality
becomes
p? + p:
2
+ p:
2
— E' 2
r = p* + p; + p- -
E2
—•
c
or
2
E'
1
„
i
E 1
P' = P 2 ,
when the components of the momentum four-vectors are inserted in the calculation.
Observe again how the presence of the factor i in the fourth component of
ft
affects
the identification of a Lorentz-invariant property. Equation (1-35) tells us that the
64 Relativity
T
invariant quantity ft ft
is determined by the mass of the particle:
Thus, Lorentz invariance holds for masses as it does for space-time intervals,
requiring in this instance that all observers agree on the mass of a given particle.
Equation (1-63) can be regarded as the source of all possible Lorentz-invariance
properties. If we
choose 6 to represent any four-vector, and we multiply this basic
equation fore and aft by & T and & and use Equation (1-64), we obtain
before after
cs>>
,y
— opt
before ^after
makes a similar statement in 6". Of course, we cannot say that 0' and are equal.
These four-vectors refer to different frames and must be related, component by
component, via the Lorentz transformation
0" =<£0.
Figure 1-30
Before After \
0—^ -«— ® ,
©—
•
•
CM fra me S'
Lab frame S
©- > (M ©-*
Before After
/-// Four-Vectors 65
(1-70)
can be made as a valid equality. This simple formula opens the way for an endless
variety of applications.
Example
Every elementary particle has its own antiparticle, where the two species have
the same mass but opposite charge. The proton and antiproton are related to
each other in this way. Antiprotons can be produced in reactions initiated by the
collisions of a beam of protons incident on protons in a stationary target. The
specific process that involves the least number of final particles is
p + p^p + p+p + p.
Note that the creation of the antiproton p requires the simultaneous production
ofan extra proton in the final state in order to conserve the total charge and the
total number of nuclear particles. This reaction cannot proceed unless the beam
proton has enough kinetic energy to produce the excess mass in the final system.
Our problem is to determine the minimum beam kinetic energy, or threshold, for
the process. The situation at threshold is such that, in the CM frame, the four
final particles are produced at rest, and so just enough energy is available to
account for the four final rest energies. In the lab frame, the four equal-mass
particles travel together, in the same direction as the beam, and carry equal
shares of the total momentum. If the beam energy is increased above threshold,
it becomes possible for the final masses to separate from each other, as in the
&' = (I
i\Mc
where M is the proton mass. In the lab frame, the total momentum four-vector
before the collision is
i{E + Mc 2 )/c
where p and E pertain to the beam proton. When we use the Lorentz-invari-
ance relation in Equation (1-70), we obtain the equality
(E + Mc 2 Y
2,2
= -16A/-V
66 Relativity
E2 = c
22
p + M c\ 2
= K + Mc 2
and get
j K = 6Mc 2
for the final answer. If we call the proton rest energy 1 GeV, wc : see that the
beam kinetic energy for protons in the lab must be at least 6 GeV for the
production of antiprotons.
Problems
1. Consider the Micholson-Morley experiment in Figure 1-1 for the realistic situation in
light speeds are aether drifted and obtain a formula for the difference in transit time for
the light to travel to and from each mirror. Repeat the derivation with the apparatus
rotated by 90°. Obtain an expression for the number of fringes by which the interference
pattern shifts owing to this rotation.
3. A meter stick approaches an observe! horizontally with speed f^c. What is the length of
the stick in the observer's frame? Suppose instead that the meter stick is oriented at an
angle 6' = cos \D in the frame moving with the stick. Calculate the length of the stick
Barney wishes to drive his racing car the length of a straight track at high constant speed
u. His serviceman Clyde has observed from the pit stop that the car consumes fuel at the
rate dn/dt (in droplets of fuel injected in the carburetor per second). Clyde has filled the
tank with exactly enough fuel for Barney to drive the course. What rate does Barney
measure for his fuel consumption? Does he observe his car to reach exactly the end of the
track, to run short, or to have fuel left over at the end?
Problems H7
Electrons from the main beam at the Stanford Linear Accelerator Center can reach speeds
as large as 0.9999999997c Let these electrons enter a detector 1 m long, and calculate the
length of the detector in the rest frame of one of the particles.
A rocket ship has proper length 30 m and travels at ~c past an installation equipped to
emit and receive radar signals. The emitter sends out a signal at the instant the rear of the
rocket goes by, and the signal is reflected from the nose of the rocket back to the receiver.
Determine the transit time of the signal out and back, as measured at the radar station.
The Doppler shifts for sound and for light may be compared in terms of their power series
expansions. For sound, distinguish as two separate formulas the one for source motion
alone by taking u n = 0, and the one for observer motion alone by taking us = 0. Expand
1
the expressions to second order, that is, to order (u/w) for sound and to order (u/c) for
light, to show that the differences between the two cases do not show up until the second
order.
Unlike the case for sound waves, there is a Doppler shift for light when the source moves
at speed u transverse to the orientation of the observer. Prove that the formula for the
= "ov 1 ~ "
2
A 2
9. An observer is located between two stationary light sources emitting at the same frequency
i> . How fast must the observer move toward one of the sources so that the observed
10. The radiation received from quasars contains wavelengths characteristic of the common
elements, except that the wavelengths are obseived to be red shifted. If the shift is entirely
attributed to the Doppler effect, the amount of the shift AA relative to the emitted
wavelength X (1
can be used to determine the recessional speed v of the quasar. Derive a
formula for the ratio v/c in terms of AA/A . Since 1986, several quasars have been found
with red shifts greater than 4. Calculate the value of v/c corresponding to AA/A = 4.
11. The indicated frames S and S' coincide when t = t' = 0. In S, event A occurs at the
origin when = / 0, and event B occurs later at jr ,
= 9 m when I = t
]
= 10 s. In S',
events A and B are simultaneous, occurring at the origin and at x[, respectively, when
t' = 0. Determine the speed of the frame S' moving through the frame S. How far apart
/ = f'=
®- ®-
-V'
t= r.
-©-
68 Relativity
12. Two hunters are located 4 m apart. Each of them fires his shotgun, the one preceding the
other by 20 ns. A passing game warden claims that the shots are fired simultaneously. Is
this possible? If it is, how fast must the game warden be moving?
13. A laboratory has proper length L' = 10 m and moves through S with speed u = § c.
Another Lorentz frame S' moves with the device. Three events are observed as indicated.
Event A is the emission of a light signal at (x = 0, I = 0) in S and (V = 0, /' = 0) in S'.
Event B occurs when the signal reaches a mirror at the right end of the laboratory. Event
C occurs when the signal returns to a detector at the left end of the laboratory. Determine
the space and time coordinates (x, 1) in S and (x', l') in S' for events B and C.
s
H u
S'
s
u
<L
S'
14. The runner in the figure is being filmed by a camera operator who moves with speed v
= j
',;<
,
parallel to a 100 m track. The figure shows the situation at / = in S, the Earth
frame, when runner and camera operator are 10 m apart. The runner's watch in S' reads
t' = at this instant, where 5' is the runner's frame. Calculate the relative speed with
which the runner overtakes the camera operator. Determine the time on the runner's
watch required for overtaking. Does this occur before the runner has completed the
course?
10
15. A skateboard has speed u = \c in S, the Earth frame; its proper length is 1 m. A bug
crawls in the same direction along the board with velocity v' = fc, relative to the board.
In .S", the rest frame of the board, find the time for the bug to crawl from one end of the
board to the other. In S, find the speed v of the bug and the time for the bug to crawl the
length of the board. The bug carries a watch; find the time on the watch for the bug to
16. When hockey star Guy L'Einstein takes his relativistic slap shot, his body is moving along
the ice at speed u u while his wrist is thrust forward at speed u[ relative to his body and his
hockey puck is pushed ahead at speed v" relative to his wrist. Each of these speeds has the
same value, «„ = u\ = v" = \c. The speeds in S, the ice frame, are shown as u , u t
, and
v in the figure. Determine the speed v of the puck in S. Let his slapshot have stroke length
d = \ m in S, as illustrated, and calculate the time required to get the shot off in S.
^v >- v
d= 1 m
17. Refer to Problem 12 and identify the gunshots as events in a space- time diagram. Use the
diagram to prove that the game warden's claim is impossible.
18. Refer to Figure 1-9 for the details of the pole-in-the barn puzzle, and plot all the various
distances and times on a space-time diagram, using (x, ct) and (*', ct') axes. Indicate the
events A, B, and C on the diagram. Calculate the Lorentz- invariant space-time intervals
19. Refer to Problem 13 and plot the events A, B, and C on a space-time diagram. Draw the
20. The elastic scattering of two equal masses is shown in two Lorentz frames S and S\ before
and after a collision.The total momentum is zero before and after in S', where all speeds
have the same value v'. The frame S' moves through the frame S with speed u in such a
way that, in S, one of the particles has no x component of velocity before and after the
collision. Determine the indicated velocities u, a,, v2x , and v
2y
in terms of v' under these
conditions.
Before After
21. Continue Problem 20 and demonstrate as follows that the total relativistic momentum is
mu, x
total i
before the collision. Then, argue from the symmetry of the figure that PUttal y
is conserved.
-1 ^ 2 =
for velocities v and v' related by velocity addition, where y = (1 — u
2
/c ) , y,,
(1 - v
2
/c 2 y^ 2
, and Y„< = (1 - v'
2
/c 2 y^ 2
.
23. The Lorentz transformation equations for relativistic momentum and energy are ex-
amined in the text. The demonstration of their validity is not giyen for the components of
the momentum transverse to the x direction. Prove that p' =p holds, and argue that
p' = p. follows by an identical derivation.
24. The angle </> in the figure is defined by sin <£ = v/c, where v is the speed of a particle of
mass m. Identify all the straight lengths in the figure in terms of the appropriate
kinematical quantities.
25. A particle has speed v and relativistic kinetic energy K. Prove that
v /k(k + 2)
C K + 1
where k = K/mc 1 . For k 3> 1, expand the result in powers of 1/k and show that
v
-
c
= 1 - —
2k-
1
7 +
to order 1/k .
mv mc~
p — .
=- and E=
- 2
_
2
'l - v
2
/c
2
^1 v /c
as functions of the speed v. Make the substitution m —* ijx and let v exceed c to obtain a
modified version of p and E. How does the relativistic relation between these quantities
differ from the unmodified case? Draw graphs of p and E, in their modified form, and
observe their behavior as v —» oo. (These manipulations describe faster-than-light par-
ticles, called lachyons. Such a particle cannot be found at rest in any Lorentz frame;
therefore, the mass parameter m is not interpretable in terms of a rest energy and is not
required to be real valued.) Where are the world-lines of tachyons supposed to lie in a
space- time diagram?
Problems 71
28. Prove the work-energy theorem for particle motion in three dimensions. To be specific,
use the relativistic force law F = dp/dl, consider the work done by the force between
locations 1 and 2 along the path of the particle
2
W= /
J
F •
*,
\
29. A particle of mass M at rest disintegrates into two fragments of equal mass m. Determine
the velocity of each fragment in terms of M and m.
30. A particle of mass M at rest decays into two unequal masses m and m 2 Show
x
. that the
square of the momentum of each of the final particles is given by
2
[M 2 - (m l
+ m2 ) [m \
2
- {m - m 2 f\ l
P =
AM 1
31. Continue Problem 30 and derive formulas for the relativistic kinetic energies of the final
M- m, + m., M—m , + m.
A' = -0
^ and K = 2
- n
K
2M 2M
,
' '
32. Consider an elastic collision of equal-mass particles with one particle initially at rest. The
conservation laws imply that the final momenta are perpendicular in the nonrelativistic
case. Is this true for a relativistic collision? Answer the question by deriving the relativistic
formula
c p, •
p, = A", K,
34. The lower part of Figure 1-28 shows a particular example of the decay M —* m + m in
flight. Use the conservation laws to obtain v in terms of v, where the speeds are identified
in the figure.
iE/c
72 Relativity
Let S' be the rest frame of the particle so that the corresponding momentum four-vector
38. A proton-proton collision can create a 77" meson as an additional particle in the final
p+p->p+p + 77°.
Derive an expression, in terms of the masses, for the minimum kinetic energy -K thrcshold for
beam protons incident on protons at rest in this reaction. Calculate the numerical value of
^threshold' usm g 938.3 MeV and 135.0 MeV for the rest energies of the proton and the
meson.
TWO
PHOTONS
73
74 Photons
Max Planck
could exist only in discrete states, it was clear that a serious break with classical
physics was at hand.
Quantum physics came to life in the year 1900. The occasion was marked in
history by a famous pronouncement put forward by M. K. E. L. Planck to explain the
observed properties of the radiation emitted by incandescent objects. This common-
place phenomenon posed an unsolved problem that had lodged at the forefront of
theoretical physics for several decades. Principles of thermodynamics and electromag-
netism had been applied to the problem, but classical methods had failed to give a
sensible explanation of the experimental results. Finally, Planck grasped in despera-
tion for a solution based on the new idea of quantization.
The quantum hypothesis of Planck and the subsequent interpretation of the idea
by Einstein gave electromagnetic radiation discrete properties somewhat similar to
those of a particle. These quantized components of light became known as photons. It
was surprising that such discrete characteristics should be in evidence since light was
firmly established as a wave phenomenon. The quantum theory made provision for
radiation to have both wave and particle aspects in a complementary form of
coexistence. The theory was extended when matter was also found to have wave
characteristics as well as particle properties. These formative notions continued to
evolve, completely at variance with classical ideas, until 1925 when the formal
apparatus of the quantum theory finally came into being.
The evolutionary phase of quantum mechanics is called the period of the "old
quantum theory." We devote the following two chapters to the main feature of this
theory, the quantization of energy in radiation and in atoms. The last of our three
introductory chapters then concludes these developments with a preliminary picture of
the wave description of matter.
21 Blackbody Radiation 75
An ordinary property of every object is its ability to emit and absorb electromagnetic
radiation. The phenomenon is called it involves an inter-
thermal radiation because
change between radiation energy in the electromagnetic fields around the object and
thermal energy owing to the motion of particles within the object. The interchange is
assumed to be an equilibrium process occurring at a certain temperature. Some of the
features of this complex problem appeal to common sense. The familiar observation
that an incandescent solid glows "red-hot" when heated, and "white-hot" when
heated more, suggests a correlation between the temperature of the solid and the
frequency of the emitted radiation. In fact, the object emits and absorbs radiation of
all frequencies, and so a particular range of emitted frequencies tends to prevail for a
/•CO
M(T)= [ M {T)dv.
v
(2-i;
•'o
The integrand M^T) identifies the spectral radiant emittance, or total energy radiated
per unit time per unit area per unit frequency interval. Our notation stresses the
dependence of this spectral quantity on the two variables, v as well as T.
We are concerned with the equilibrium situation where the rates of emission and
absorption are equal by definition. A perfect absorber reflects no incident radiation
and therefore satisfies a (T) =
lt
1. This ideal radiator is a perfect emitter and is called
a blackbody. Figure 2-1 shows a model of a blackbody constructed in the form of an
evacuated cavity with walls at temperature T and with a hole in one of the walls. The
hole is very small so that rays entering the cavity have essentially no chance to be
reflected back out. The spherical shape shown in the figure is not a necessary feature
of the model.A blackbody is a useful idealization, which we can employ as a standard
by introducing the associated spectral emittance M*(T) as a standardizing function.
We can use this fundamental fictitious quantity to define the spectral emissivity
M„(T)
«,(r) = Tsbv; (2-3)
M {T) f
b
76 Photons
Figure 2-1
this ratio then provides a measure of the radiating efficiency for a real object whose
spectral emittance is M (T).V
The following facts should be added as background for the blackbody problem. In
1859 G. R. Kirchhoff proved that the ratio M (T)/a ,(T)
p l
should be a universal
function of v and T, the same for all radiators. He used thermodynamic arguments to
show that a failure of this universal property was equivalent to the existence of a
perpetual motion machine that violated the second law of thermodynamics. The
universal function in Kirchhoff's theorem was identified to be the spectral emittance
of a blackbody M„(T). These arguments presented a challenge for the physics
community to deduce the form unknown
function. In 1879 J. Stefan conjec-
of the
tured, on empirical grounds, that the emittance of an object should be proportional to
the fourth power of its temperature. In 1884 Boltzmann proved the conjecture
theoretically, but only in the case of a blackbody. Their conclusion, the Stefan-Boltz-
mann law, was expressed as
/•OO
with a = 5.67 X 10
8
W/m 2
K 4
Stefan-Boltzmann constant.
for the value of the
The burden of Kirchhoff's challenge on experimenters to develop a
fell initially
suitable laboratory blackbody and then measure the radiation over a broad range of
frequencies at fixed temperature. In time the shape of the blackbody spectrum became
established. Curves like those in Figure 2-2 were obtained for various temperatures,
where each curve displayed a single maximum occurring at a particular frequency vm .
The observed variation of the spectrum with T was such that the peak of the
distribution shifted to higher frequency as the temperature was increased. In 1893 W.
Wien deduced from thermodynamics that vm and T should obey a linear relation. He
also proposed a form for the frequency distribution M^(T) in agreement with the
limited available data. Meanwhile, pioneering improvements were being made on the
experimental front as O. Lummer and E. Pringsheim proceeded to broaden the range
of frequencies in the measured distributions. By 1900 it was clear from these data that
Wien's proposed formula was not adequate as a fit to the whole known spectrum.
Thus, Kirchhoff's challenge continued to stand through the end of the century.
This brief summary sets the stage for Planck's contribution. The great importance
of his idea warrants a detailed analysis of the entire blackbody problem. We
21 Blackbody Radiation 77
T, < T,. The distribution of emitted temperatures 7", < T,. The maximum occurs
frequencies has a maximum at vm for at \m for temperature T, where the product
temperature Tx
and at Pm for temperature X„,T is given by Wien's constant.
T2 . According to Wien's law, vm is MAT)
proportional to T.
M V
(T)
concentrate on the ideal blackbody radiator and suppress the superscript on A/*( T ),
as we have already done in Figure 2-2. Our system is the cavity model in Figure 2-1
whose shape, we have noted, can be arbitrary. The derivation of the spectrum requires
several steps, the most important of which is the use of Planck's quantum hypothesis.
Any desired property of the resulting blackbody solution may also be deduced,
including particularly the Stefan-Boltzmann T A
law in Equation (2-4).
One of the deductions that we can draw from Planck's result is Wien's law for the
position of the peak in the blackbody spectrum. This result is more often expressed in
terms of the wavelength as the variable rather than the frequency, so let us show how
these two types of distribution are related. Equation (2-1) may be consulted for this
purpose and written in two ways:
/•OO ,-00
M(T) =
J
/ M„(T)di> = /
•'0
M {T)d\.
x (2-5)
/•oo
ro UV
dv
Jo
I M {T)dv=
v
M (T)—dX.
d\
v
•'oo
Note how the range of integration is reversed to accommodate the inverse relation
between v and X. When we compare with the A integration in Equation (2-5) we get
dv c c
M (T)= -M (T) -^
X r
= -^MXT) where v = — (2-6)
\m T= 2.898 X 1(T 3 K m
•
(Wien's constant). (2-7)
It should be emphasized that there is no reason for the peak positions vm and X m in
the respective distributions to be connected by the relation c = v\.
Blackbody radiation is realized physically in many practical applications. The
radiation from the Sun is a particularly conspicuous candidate for treatment by
the blackbody model. Measurements of the solar spectrum reveal that 99% of the
radiation falls in the wavelength range 270-4960 nm. The quantity of interest is the
solar constant S, defined as the total solar energy received at the Earth per unit time
per unit area at normal incidence, corrected for the effects of the Earth's atmosphere.
A recent determination of this quantity quotes the value 5 = 1351 W/m 2
. The
wavelength distribution of the incident radiation is called the solar spectral irradiance.
This function of A peaks around X m = 470 nm, and so Wien's law gives T = 6166 K
for the Sun's surface temperature. The quoted solar constant may also be used in
conjunction with the Stefan-Boltzmann law to produce the slightly different result
T= 5762 K. A blackbody distribution at this temperature can be made to fit the solar
spectral irradiance and reproduce the quoted value of 5 for the integrated area under
the curve.
Blackbody radiation also leaves its traces in more subtle areas. Experiments
conducted in the 1960s by A. A. Penzias and R. W. Wilson have demonstrated the
existence of isotropic background radiation that presumably permeates the universe.
This electromagnetic background can be fit by a blackbody distribution with a
Solar spectral irradiance versus wavelength. The area under the curve gives the value of the
solar constant S = 1351 W/m'. Data are taken from the survey of M. P. Thekaekara et al.,
2000
1500 -
1000-
500
21 Blackbody Radiation 79
density contained inside the cavity. The relation between the corresponding spectral
quantities is
C
M {T)=
V
~u v (T), (2-8)
where u v (T) denotes the electromagnetic energy in the cavity per unit volume per
unit frequency interval. We then recognize that the radiation in the cavity takes the
form of standing electromagnetic waves. These occur in every interval dp as discrete modes
whose number per unit frequency interval depends on the frequency. We can
therefore determine the spectral energy density at frequency v if we count the number
of modes and multiply by the average energy for each mode. This construction is
expressed as
and (e), as we find that the quantum hypothesis makes its appearance through the
Independent factor (e).
Example
Consider the application of Wien's law to the following rather different blackbody
situations. If we take the surface temperature of the Sun to be 5800 K and
consult Equation (2-7), we find that the peak of the solar spectrum should occur
at
2.898 X 10~ 3 K m
Xm = = 500 nm,
5800 K
a wavelength near the center of the visible range. On the other hand, the
universal 3 K background radiation gives
2.898 X 10
3
K m •
Xm = = 0.97 mm,
3 K
a wavelength in the microwave region.
80 Photons
Figure 24
Averaging over rays with velocity components
to the right.
^z
Area
2wr 2 sin ede
Example
The energy flux rate M(T) and the energy density u(T) are related for cavity
radiation by a proportionality factor c/4, as in Equation (2-8). We can deduce
this result as the product of two contributions with the aid of Figure 2-4. The
relation calls for a factor of \ because, for all the electromagnetic radiation in
the cavity, only half of the standing-wave energy corresponds to rays with
velocitycomponents to the right where the hole is located in the figure. (Recall
that a standing wave is an equal-parts admixture of traveling waves having
opposite directions of propagation.) These rays carry energy through the hole
with an average velocity to the right given by the z component of the velocity of
light, c z = c cos 6, averaged over the right hemisphere. If we let the sphere in the
figure have arbitrary radius r and average over the hemispherical area we
obtain
2
/ (c cos #)277T sin Odd cj xdx
<o =
(" /2 2mr 2 sm0 dO 2'
C dx '
in which we have used the substitution x = cos 6. The desired relation is then
found by assembling factors:
C
M=\(cz )u= -u.
walls for the enclosure so that the fields are completely enclosed and the energy is
stored in the form of standing electromagnetic waves. We realize at once that the
geometrical and physical properties of the cavity select only those particular con-
figurations of wave fields that "fit" the enclosure. Each of these allowed configura-
tions, or modes, has own characteristic frequency of oscillation. Our first objective
its is
to determine the number of modes Nv dv that occur in the frequency range between v
the form
2ttx
y{x,t) =jv sin—- — sin 2Trft (2-10)
A
with amplitude y , wavelength A, and frequency /. The function satisfies the wave
Figure 2-6
Figure 2-5
n - 3
. -2
82 Photons
equation
dy 1 dy
Y~2 = -l^' (2- 11 )
ox v at
where v is the speed of wave propagation in the string medium. It is easy to verify that
the parameters obey v =/X by inserting Equation (2-10) into Equation (2-11). Recall
that the standing-wave character of y(x, t) is attributable to the multiplicative
construction of the expression. This product of oscillating functions of x and /
describes oscillations that remain in place and do not travel along the length of the
string. The explicit form of Equation (2-10) exhibits the required node at x = and
also incorporates the other node at x = L if A satisfies the condition
lirL
sin— — = 0.
2L
A„=— n
,
(2-12)
= (2-13)
l-=i il"-
Hence, the modes of the string are indexed by an integer and assume the form
mrx nirvt
y*(*>t) =J>osin— sin-^— (2-14)
when Equations (2-12) and (2-13) are inserted into Equation (2-10).
We describe standing electromagnetic waves in a cavity by similar methods, with
allowances made for the three-dimensional nature of the wave configurations and for
the vector character of the electromagnetic field. The oscillating quantity of interest is
the electric field E(x, y, z, t ). (The accompanying magnetic field is known in terms of
E and does not have to be discussed explicitly.) Each component of E is a wave that
satisfies the wave equation in three dimensions with speed of propagation c:
d 2Ex 2
Ex E
2 2
d E d i d x
— -I
—x A — = -= t- and similarly for E v and E.. (2-15)
2 2 2 2 2
dx dy dz c dt '
"
A boundary condition holds at each wall of the enclosure since the components of E
tangential to the wall are required to be continuous as a consequence of Maxwell's
equations. Our cubical enclosure is bounded by a perfectly conducting medium in
which the E field vanishes. Therefore, the tangential components of E inside the cavity
must vanish at each wall to meet the requirements of continuity. If we refer to Figure
2-2 Standing Electromagnetic Waves 83
E =
x
at y = and L and , at z = and L ,
E =
y
at at = and L, and at z = and L,
For standing waves we again want multiplicative expressions like Equation (2-10), but
adapted for three-dimensional configurations and three-component fields. The waves
describe oscillations in place in all three spatial variables and obey boundary
conditions as specified if the components of the field are
n 7Tx n.jry
E = Ex
fos
x
sin — — sin n 3
TTZ
sin 2mvt,
E' = v
.£ n „sin
'
n
n^x
ILL
x
mx
cos
n
— — sin n^irz sin 2ttv(.
2
iry
Note that three separate integers are needed here in place of the one employed in
Equation (2-14). It is not difficult to demonstrate that these functions behave correctly
at the walls and satisfy the wave equation in all three components. The latter point
should be examined closely since it leads to a restrictive condition on the frequency v.
If the first of Equations (2-17) is inserted in the wave equation for E x the result is ,
n,7T - 77 d rinir
\ / tf \ (
T) +
T
2
+
T"
e- -
-(2™r'£,
This equality tells us that the allowed frequencies are given by a formula like
Equation (2-13),
v = + nl + n\ ,
(2-18)
21
except that the role of the single integer in that result is now played by the three
integers «,, n 2 , and n 3
.
The modes that " fit" the cubical enclosure are not as easy to picture as the modes
on a string. Equations (2-17) and (2-18) prescribe the allowed fields and frequencies
according to the values assigned for the set of three independent integers («,, n .,, «,).
Figure 2-7
Octant of a
spherical shell
The figure can be used to count the number of modes A",, dv in a given frequency
interval dv. Let us choose a frequency v and interval dv and visualize the calculation
with the aid of the indicated spherical shell in the three-dimensional (n,, n 2 « 3 ) space.
,
- 21
2
In + n\ + «3 =
for the radius of the shell, and {2L/c)dv for its thickness. A unit cube in this space
contains one lattice and all the sites lie in the first octant, where each of the n 's is a
site,
positive integer. Therefore, we can count the number of oscillating field configurations
by calculating the volume of the octant shell that contains the corresponding lattice
points. We take V= I? for the volume of the cavity and get
2 2
2L 2L
-
47r| v\
\
dv = —^~ Vdv
4-ttv
for the volume of the shell. The desired number of modes Ar„ dv is actually twice this
result, because every spatial configuration of fields has two independent polarization
states. We therefore obtain
.Y (2-19)
as the final expression for the number of modes at frequency v per unit frequency
interval.
2-2 Standing Electromagnetic Waves 85
This construction gives the correct formula for one of the factors that make up the
spectral energy density u p {T) in Equation (2-9). Classical methods can also be used to
deduce a result for the other unknown factor (e), the average energy per mode in the
cavity. We are concerned with this second result only in passing since it leads us to a
prediction for u v (T) that cannot be correct. Our motive in presenting the classical
form of (e) is to prepare the way for Planck's quantum hypothesis.
We want to know the average energy for a population of many radiation modes,
each with its own energy. The enclosed fields are in equilibrium with the walls of the
cavity at temperature T, and so the radiation exchanges energy with the many
material particles bound to the walls. Statistical physics teaches us how to average
over such large numbers of particles in order to deduce bulk properties for the system
and average values for certain particle variables. The kinetic theory of gases is an
example of this averaging procedure. Each of the three dimensions of linear particle
motion in a gas at temperature T is assigned an average kinetic energy ^k B T, where
kB is Boltzmann's constant. We can apply this conclusion to an oscillating particle
bound to the walls of the radiating cavity if we allow the oscillator to have potential
energy as well as kinetic energy. We know that the averages of these two contributions
to the energy are equal for any oscillating particle in simple harmonic motion.
Therefore, T
we obtain k B as the average total energy for each degree of freedom of a
linear oscillator in a collection of such particles at temperature T. This result is an
application of the classical principle of equipartitwn of energy. We then return to our
system of enclosed radiation in equilibrium with bound oscillators and simply transfer
the result for the average energy from one part of the system to the other. If we equate
the average energy per degree of freedom of a bound particle to the average energy
per mode of radiation, we obtain
<e> = kB T (2-20)
u v {T) = —r 8 77 v 2
c
kB T (2-21)
as the classical prediction for the spectral energy density of a cavity radiator. Equation
(2-8) then gives
2
lirp
M (T)=—^k B T
v
c
(2-22)
for the corresponding spectral emittance. The failure of these predictions can be
ascertained from the graph of M (T)
lf
in Figure 2-8. We see that the result resembles
the experimental spectrum in Figure 2-2 only at low frequency. We also see that the
increase at high frequency causes a catastrophic divergence of the integrated emit-
tance as defined in Equation (2-1).
The principle of equipartition of energy was applied to radiation by J. W. Strutt
Rayleigh around the turn of the century. Rayleigh's name has been attached to the
unsatisfactory result in Equation (2-21) for this reason. The high-frequency catastrophe
was a disastrous conclusion for the blackbody problem. This issue proved to be a
critical breaking point for classical physics.
86 Photons
Figure 2-8
MAT)
Planck's solution to the blackbody problem is based on an average energy (e) that
differs radically from k B T. We can appreciate this feature of the problem more fully if
we first understand how such average values are determined for systems containing a
large number of elements in thermal equilibrium. It is instructive to digress from
blackbody radiation for a while and devote an entire section to this question. The
digression follows a straightforward line of statistical arguments to a rather general
conclusion. Our purpose in the end is to apply the resulting statistical formalism to a
collection of electromagnetic modes, even though the arguments are developed for a
collection of particles.
Statistical physics deals with systems whose variables are far too numerous to treat
individually and whose properties are therefore defined in terms of averages. This
approach takes advantage of the large number of variables to analyze the unobserved
behavior of the constituents of a system and extract the observed thermodynamic
features of the system as a whole. The average energy of a particle in a many-body
system one such dynamical quantity of particular interest.
is
Figure 2-9
a a
and
be cb
abc n2 = 3
1 way
nx =
be ac ab n2 - 2
, 3 ways
a b c n.\ - l
a b c n2 = 1
3 ways
be ac ab n, = 2
n2 =
-, 1 way
abc ni = 6
assigning n x
particles with energy £,, n 2 particles with energy e 2 , and so on. This
revised method employs the original variables n :
and c, to generate a list of
occupation numbers (n {
, n 2 ,. . . , n r ) for the distribution of N particles into r cells.
three coins.) We can assume, a priori, that each microstate has an equal chance to
occur (just as head-head-head has the same likelihood as head-head-tail); however, we
can see that the various macrostates have different probabilities (as three heads are
less probable than two heads/one tail).
A system of particles can occupy a certain microstate at a given time and then
change its microstate repeatedly thereafter because of particle collisions. We assume
that all such microstates of the system are equally probable as a basic hypothesis. This
assumption means that over a long time scale any one complete specification of every
particle's state is expected to occur as often as any other. Of course, every system has
rare configurations of particles; these are rare because there are very few ways of
achieving these configurations as macrostates. Hence, macrostates are not equally
probable because they are usually realized by differing numbers of microstates. As
time passes and microstates change, the most frequently occurring macrostate, repre-
senting the most probable configuration of the system, is the one that corresponds to
the greatest number of microstates. We can accomplish our objective and learn how
the cell variables n t
and e, are related by implementing this simple idea.
The statistical problem is first of all a question of counting. We want an expression
for the number of microstates corresponding to a given macrostate whose cell
tion. We deduce W
N by first recalling that the number of ways to assign A particles
r
N(N - l)(N - 2)
••• = N\.
AH
n x
\(N-n y/ x
(N-n x
)\
\(N — — '
n 2
nx n 2 )\
and so on, until all N particles are distributed over all r cells. The grand total number
of such formations is given by the product of the corresponding r factors:
AH (N- n )\x
(N- n, - ••• -n r _,)!
n x
\(N - n )\ n
x 2
\{N - n x
- n 2 )\
n r
l{N - n x
- -n r
)l'
The result can be simplified in an obvious way to give the final number of microstates
WN {n„. ..,*,) = — —
n,\ ... n.
-. (2-23)
2-3 The Maxwell- Boltzmam Distribution 89
n, + •• +n = r
N (2-24)
as a condition that constrains the occupation numbers and accounts for the total
number Note that the definition 0! = 1 is also used.
of particles.
Now that we know the number of ways to form the macrostate («,, n ), we . .
.
,
r
next want to determine the most probable of all such distributions. The desired
macrostate is evidently the one that maximizes N We observe that the same W .
distribution is determined by the solution that maximizes In N and that the function W
lnW^ is more convenient to consider than N itself. Equation (2-23) is converted for W
this purpose to read
\nWN {n x
,...,n r ) = In AH- ln^!- ••• -ln« r !
large enough to justify a few approximations. Quantities like \nn\ can be very
accurately approximated with the aid of Stirling's formula:
\nWN (n x
,...,n r )
= N\nN - N - XX^lnn, - ».)
= N\nN - 2>,ln« ( ,
(2-27)
i
where we express the approximation as an equality and again use Equation (2-24).
W
We maximize In N by taking integer increments in each n and seeking the largest t
result. Since the increments are so much smaller than the n's themselves, it is
— lnW^=0
for every n t
, if the variables (n 1 ,...,n r ) could all be regarded as independent. In
fact, we know that two restrictive conditions exist among the variables so that only
r — 2 of them are actually independent. One condition fixes the total number of
particles to equal N, as in Equation (2-24). Let us call this the jV constraint and
rewrite the condition as
X>, = N. (2-28)
i
The other restriction fixes the total energy of the system to equal E. Let us call this the
E constraint and express the condition as
It would appear that at the last moment our problem has taken on a complication
that prevents us from treating all the variables symmetrically and forces us to select
two arbitrarily as the ones that are not independent.
The constraints can be handled by means of an elegant symmetrical procedure,
devised by J. L. Lagrange in the 18th century. We do not directly use lnWy a ,
F(« l5 ... ,n r a, P)
,
= \nWN ( ni ,...,n r )
- «( £«, - N )
-
#( X>,e,- - E^
i i
(2-30)
—
dF
da
= and
8F
— =
dp
dF
-— = for all i = 1 to r, (2-31)
3n,
so that the net effect is to maximize \nWN , as required, and simultaneously satisfy the
two constraints. The advantages of Lagrange's procedure are twofold; the entire set of
r occupation variables appears on the same footing, and the extra variables a and ft
i i i
—=
3F
dn.
-ln«,. - 1 - a- j3e. = Q. (2-32)
We can solve this equality for the /th occupation number to obtain
e-i-oe-to (2-33)
This is essentially the desired result except that the quantities a and fi remain to be
identified. If we use the N constraint in Equation (2-28) to eliminate a we find
2-3 The Maxwell -Boltzmann Distribution 91
/here
A'
-J-* (2-35]
Z
the final expression for the Maxwell -Boltzmann distribution function.
Our concluding formula specifies the number of particles in the zth energy cell for
the macrostate of maximum thermodynamic probability. It can be argued that the
system tends to this configuration in the approach to thermal equilibrium. We know
that the maximizing state cannot represent a static final situation because of the
continual rearrangement of the particles among the cells. The conclusions suggest
instead that large departures from the most probable macrostate are expected to be
highly improbable.
Equation (2-34) defines a new quantity Z called the partition function This definition .
plays a recurring role in many of the applications of statistical physics. The expression
for Zdepends on the manner in which e, varies from one cell to another, and so the
explicit ft dependence of Z can only be expressed by examining separate cases. The
variable (3 itself remains to be identified. A variety of arguments can be used to obtain
l> (2-36)
One such demonstration of this result is given in the second illustration below.
Example
Let us gain some feeling for the approximation by inspecting the graph in
Figure 2-10. We can invoke the properties of the logarithm to write
In n ! = In n + \n(n - 1 ) + • • •
+ In 1
and then identify the sum of terms with the area of the rectangles shown in the
figure. The area under the curve In x is a good approximation to the area of
the rectangles if n is large enough. The rest of the construction is summarized in
the following steps:
In a dx = ( x In x - x ) |
,
= n In n — n + 1 -* n In n - n for 1 arge n.
Figure 2-10
In x
Example
The energy cells with a discrete index in Equation (2-35) can readily be adapted
for application to the case of free particles with a continuous energy. We
accomplish this by letting a very refined incremental cell of definite energy e be
identified withan infinitesimally thin spherical shell in the space of the three
velocity components (v x v vz ). These variables are connected by the formula
, ,
y
$ + v] + v1 =
2vdv = — de.
in m
2e de
dr = 4irv
2
dv = 4w
The number dn of particles in this energy cell is proportional to the cell volume
drv and is also proportional to the Maxwell -Boltzmann factor e~" e according to
dn = Ae~ Pe dTv ,
1//2
to e de, and so we can reexpress the number of particles in the cell as
x/2 pi
dn = A'£ e- de,
where A' is another proportionality constant. Two definite integrals are needed
at this point. We get the first from a table of integrals,
r x
l/2
e-
px
dx= iv^/r 3/2 (Dwight 860.04),
r>v*<&= -4
Jn
-'(i dB
r x ^ e-^dx= Jn
2
i^p-
5/2
.
These formulas enable us to calculate the average kinetic energy per particle as
follows:
3'2
£dn i '-**'
J J ifrB-^ 3_
(C>
x 3/2 ""
hdn f e
l
^e-^de "
^fi~ 2jB
•'o '0
We know from kinetic theory that the average kinetic energy per particle is
Rayleigh's result for the spectral energy density u v {T) was certainly not the correct
answer to KirchhofTs problem. The fault was found to be the lack of a frequency-
dependence in the classical expression for the average energy (e). It was apparent
that (e) should decrease sharply with increasing v so that the predicted blackbody
distribution could rise and then fall with frequency as observed in Figure 2-2. A
drastic revision of the high-i' behavior was obviously needed to make the integral over
4
the spectrum converge and give the desired T" prediction for the total emittance.
Planck discovered the proper v dependence in his empirical studies of the blackbody
problem and, on these grounds, proposed the following formula for the spectral
energy density:
"' (r) = —
877J>~
,*/*"•-!
hv
• (2 " 37)
The second group of factors in the formula was deduced to represent the average
~
94 Photons
The values obtained from Planck's investigations of the blackbody data were not very
different from the currently quoted figures
h = 6.6260755 X 10
34
J •
s and kB = 1.380658 X 10" 23 J/K.
Planck's constant made its first appearance as the basic new parameter of quantum
physics on this occasion.
These findings were recognized as an inspired contribution. The proposed blackbody
formula was an accurate representation of experiment and, as such, was a satisfactory
response to KirchhofFs long-standing challenge. Planck did not let the issue rest at the
empirical level, however. He turned his attention to a theoretical derivation of his
formula and in the process gave the quantity of energy hv an interpretation of
fundamental significance to the quantum theory.
Planck recognized that cavity radiation should be treated as an equilibrium
problem involving an exchange of energy in the cavity between the radiation fields
and the particles bound to the walls. He accepted the view that the bound particles
could be modeled as oscillators, like masses connected to springs, with arbitrary
frequencies of oscillation. The familiar classical outlook permitted these oscillating
particles to have a continuous range of energies corresponding to a continuous variety
of possible amplitudes of oscillation. This meant that the particles could exchange any
amount of energy with the radiation fields in the cavity. Unfortunately, the classical
assumptions led to the undesirable Rayleigh result, as we have seen in Equation
(2-21). In desperation Planck proposed that the energy exchanged between the
oscillators in the walls and the radiation in the cavity must vary in discrete rather than
continuous amounts. The proposition implied that the radiation in the cavity at each
frequency v could be represented as a large collection of quantized elements of energy
assertion the quantization of a dynamical quantity, the energy, appeared in physics for
the first time.
2-4 The Quantum Hypothesis 95
Our analysis of the blackbody problem implements Equation (2-39) by treating the
collection of quantized energies as a statistical system. The procedure follows an
interpretation of Planck's reasoning along lines developed by P. J. W. Debye in 1910.
We adopt this picture because it is easier to present than Planck's thermodynamic
This expression gives the likelihood, in Debye's interpretation, for the occurrence of a
given energy e in the distribution of energies in the cavity. Figure 2-1 1 shows how the
larger energies are exponentially suppressed by the behavior of /(e). The modes at
large v are similarly suppressed because of the Planck relation between energy and
Figure 2-11
= *" r
Maxwell-Boltzmann distribution factor /(e) e~
f
giving the relative likelihood for an
energy e in a distribution of energies at temperature T . Quantized energies nhv are sampled
and weighted by the distribution factor for three different values of hv.
/(G)
hv « kR T hv » kR T
96 Photons
frequency. The figure indicates the details of this suppression for three different
number of multiples of hv are sampled
choices of frequency. For hv <£ k B T, a large
by /(e) before the value of the function drops to l/e. For hv k B T, the reverse is »
true and even the first allowed quantum of energy is greatly suppressed. The case
hv = kB T is also included, showing the weight of the first quantum to be /(e) = l/e.
Thus, the direct connection between energy and frequency in Planck's hypothesis
accomplishes the suppression of high frequencies in the blackbody spectrum through
the influence of the thermal distribution of energies.
Let us now proceed with the derivation of Planck's formula for (e). At frequency
v, the average energy per mode is computed as the sum of the various weighted
energies, with weights given by the likelihood factors, divided by the sum of the
weights:
IX/(0
L/(0
Since we have discrete energies to average, the averaging process involves discrete
summations in which the sums range from n = to n = oo. When Equation (2-39) is
incorporated, the expression for the average energy becomes
nhv/k » r ~" x
Y,nhve Y. nxe
/here
hv
(2-43)
kB T
We establish the functional form of (e) by starting with the denominator and writing
00
~" x x 2x
Z{x) = E= e = I + e~ + e~ + .
Z(x)
V
= .
'
1 - e~
x
Next, we note that the numerator in Equation (2-42) is found from Z(x) by
performing the following derivative operation:
-x —d Z{x) d _
= -x—Y,e- nx = xY,ne- nx
^ .
dx dx n
Finally, we assemble the results of these observations and carry out one last series of
24 The Quantum Hypothesis 97
maneuvers:
k R Tx d d
(e) = ——Z{x) = -k B Tx— InZ(x)
Z\x) dx dx
x
k B Tx
k B Tx —d \n{\ - e~
x
) = k B T)x-
e~
— —
dx 1 e " e* 1
Planck's formula for (e) is then obtained when Equation (2-43) is reinstated in this
result.
The remarkable formula for the spectral energy density in Equation (2-37)
encompassesall that we might wish to know about blackbody radiation. The following
examples illustrate this with derivations of Wien's law and the Stefan-Boltzmann law.
Example
2
hv 2irhc
M,(T)= -- hv / k B T
I
2
A 4 c
3
e - l A
5
e
hc/XkBT - 1
'
using v = c/X. This expression can be made more compact by recalling the
dimensionless variable x from Equation (2-43) and rewriting x in terms of A:
he
x =
xk B r
The wavelength distribution then becomes
2ir{k R Tf
M {T)=
X
he\
g( x ),
where
*(*) =
e
x
- 1
The function g{ x ) describes the universal shape of the wavelength spectrum for a
blackbody at any temperature. A spectral curve is defined with a single peak
occurring at the special value of x given by
x = 4.965.
To prove this assertion, let the derivative of g(x) vanish and solve for x:
dg \
dx e
x
— 1 \ 1 — e * J 5
obtain x = = 4.965 from there by trial and error Wien's law pertains to the
. peak
of M (T)
X at X = X m Since
. dMx(T)/dX = when dg/dx = 0, the parameters
X m and x are immediately related by the equs ility
he
Kk B T'
A final ca culation gives
" 34 8
he (6.626 X 10 T •
s)(2.998 X 10 m/s)
\ T— ~ 23
xk B 4.965(1.381 X 10" J/K)
= 2.897 X 10" 3
K • m,
Example
The derivation of the Stefan -Boltzmann law begins with the formula for the
frequency spectrum of the emittance. Equations (2-8) and (2-37) are consulted
to get
2 77 h
M (T)
V
c
>
e
hv/k B T _ j
'
3 4 3
• oo 27r/z v dv 2-nh I k B T\ /.oo x dx
M(T)= /
- „hv/k B T
2 x
-
1
1 '
1
i. * 1
in which Equation (2-43) is again used, this time to change the variable of
integration. The dimensionless integral in the computation has a known value,
tabulated as
x dx
(Dwight 860.33).
f 1")
M(T) = oT\
Example
2.90 X 10" 3 K m •
= 580 nm,
5000 K
and noting that the filter passes wavelengths in the interval 579-581 nm. The
transmitted portion of the blackbody emittance would usually be found by
integrating the distribution M
X {T) over the wavelength range
of the filter. In
this case, the range narrow enough to permit an approximation in which
is
M X (T) is evaluated at
X = X m and multiplied by the wavelength interval
AX = 2 nm. We then obtain the transmitted power when we multiply this result
by the area of the filter aperture. Thus, the final formula for the power reads
2
2-nhc 1
P= irr
2
M x (T)AX = mr
1
I
AX.
Note that we have used the relation between X m and x from our first example.
The final calculation gives
-2 2 8 2
(10 m) (6.63 X 10" 34 J •
s)(3 X 10 m/s) (2 X 10" 9 m)
P= 2m
(5.80 X 10- 7
m)V 96
- 1)
= 25.3 W.
We have already remarked that the use of a broad-band filter would necessitate
an integration over the relevant wavelength interval. This task would probably
have to be done numerically.
Planck's quantum hypothesis was a radical concept whose effects were discovered in a
very complicated setting. Real inspiration was necessary to recognize the influence of
quantization beneath the surface of the blackbody problem, and further inspiration
was needed to ascertain the meaning of quantization as a new physical principle.
Planck's contribution was not immediately acknowledged as a major turning point in
theoretical physics. Planck himself was among those who hesitated over the question
of interpretation, while Einstein was the one who came forward with the next
definitive idea.
Planck's assumption allowed oscillating particles and radiation fields at frequency v
to exchange energy only in integral multiples of the quantum of energy hv. Einstein's
proposal interpreted the radiation directly as an intrinsically discrete system composed
of these quanta of energy. (Our analysis of the blackbody problem has already been
100 Photons
^y-^f- Current
Voltage
Applied voltage
discoveries, the effect was noticed by accident. The light-ejected negative charges were
proved to be electrons in experiments conducted by J. J. Thomson in 1899. The
puzzling aspect of the phenomenon was the fact that the energy of the emitted
was quite surprising
electrons did not vary with the intensity of the incident light. This
since was thought that electrons with greater energy should be seen if the metal
it
when a single incident quantum of light was absorbed in the metal and all its energy
was given up to a single electron. The electron could then be ejected if it acquired
enough energy from the absorbed photon to separate the particle from the metal and
release the excess energy as free-electron kinetic energy. An increase in the intensity of
the light would flood the metal surface with a greaterof quanta and cause number
more would produce a greater photoelec-
electrons to be ejected. These photoelectrons
tric current but would carry no more energy individually unless more was delivered to
them by the incident photons. An increase in the frequency of the incident light would
result in a greater photon energy. The photons would then have more energy to give
up to the electrons, and so each ejected electron could leave the surface with greater
kinetic energy. Thus, Einstein was able to explain every aspect of the puzzle by
assuming an interaction between light and matter, represented by the absorption of
discrete quanta of light. The photon concept eventually gained credibility in the
decade that followed, as Einstein's predictions were confirmed in a series of photoelec-
tric experiments undertaken by R. A. Millikan.
Let us examine Einstein's predictions in the context of a typical photoelectric
experiment, following the procedure illustrated in Figure 2-13. We take light of fixed
frequency and variable intensity to be incident on a photocathode, and we let the
liberated electrons be accelerated to a collecting anode by applying a difference in
potential between the two elements. An ammeter records the observation that the
photoelectric current increases, and eventually levels off, as the applied voltage is
increased for a given light intensity. This saturation of the current indicates a limiting
regime where the ejected photoelectrons are being collected at their maximal rate. An
increase in the intensity of the light causes the incidence of photons and the liberation
of electrons to proceed at a greater rate so that the photoelectric current rises to a
higher saturation level. If we reduce the applied voltage to negative values we observe
that the negative potential difference retards the collection of photoelectrons at the
anode until the current finally vanishes at a particular applied voltage — <f>
. The
quantity <$> is called the stopping potential. Its value provides a direct measure of the
maximum kinetic energy for an ejected electron,
*w = «*b. (2-44)
current but the same values for the stopping potential. We therefore conclude that <J>
does not vary with the flux of the incident radiation as long as the frequency of the
light remains fixed. This striking feature of the experiment is at odds with classical
expectations and bears out Einstein's proposition.
The light-quantum hypothesis describes the incident light at frequency v as a
stream of photons, each with energy
e = hv. (2-45)
A photoelectron is ejected from the metal surface of the cathode whenever it absorbs a
photon, provided enough energy is acquired to release the electron from confinement
inside the metal. The basic mechanism is expressed as
y + e
bound free
102 Photons
Figure 2-14
where the symbol y refers to the absorbed photon. A portion of the absorbed energy e
is spent to remove the electron from the metal, and the remainder appears as
free-electron kinetic energy. The minimum energy required to free an electron by this
process is called the photoelectric work function W This quantity. is characteristic of
the specific material in the surface so that W varies from one type of metal to
another. Every electron leaving a given metal with maximum kinetic energy must pay a
cost in energy equal to W in order to obtain its freedom. We can account for the
energy absorbed in this case by writing
h W
4>
=-p Q
.
(2-47)
e e
This result tells us that the experimental value observed for <f>
should increase linearly
with an increasing light frequency v. Figure 2-14 shows a representation of the
straight-line relation between <p() and v. The graph is typical of those found for any
given metal and is representative of the results obtained by Millikan in confirmation
of Einstein's hypothesis. Note that the slope of the straight line is predicted to be h/e
in Equation (2-47). A photoelectric experiment can therefore provide another de-
termination of Planck's constant, by a method completely different from Planck's
blackbody approach. Note also that the figure indicates a minimum light frequency
below which the photoelectric effect cannot occur. This threshold frequency vxh is
obtained by setting = in Equation (2-47) to get
<J>
hvth = W . (2-48)
Light with this frequency contains photons with just enough energy to release
electrons from the metal with zero speed of ejection. Light of lesser frequency cannot
produce photoelectrons no matter how intense the illumination might be.
2-6 XRays 103
Example
=
W (2.42eV)(l.60 X 10" 19
J/eV)
V,u = 5.84 X 10
14
Hz,
~h 6.63 X 10" 34 s
J -
8
X 10 m/s
A,.
th
= —c
p tb
=
5.84
3
X 10
-
14
Hz
=514 nm,
U/s
= 2.58 X 10
18
photons/s.
(2.42 eV/photon)( 1.60 X 10~ 19 J/eV)
The corresponding number striking unit area of the Li surface per second is
18
X
2.58 10 photons/s
-
2
— - = 2.06 X 10
17
photons/s •
m 2
,
47r(l m)
2-6 X Rays
The quantum hypotheses of Planck and Einstein pertain to frequencies and wave-
lengths of radiation across the entire electromagnetic spectrum. Quanta of energy in
the electron-volt regime are associated with wavelengths of light in the visible range,
where the photoelectric effect is observed as an important process. Longer wavelengths
in the infrared and microwave regions of the spectrum correspond to quanta whose
energies fall below the photoelectric thresholds of matter. These forms of radiation
make their appearance in other kinds of photon interactions. An altogether different
domain of interesting radiative processes is found at the shorter wavelengths where the
corresponding photon energies are of kilo-electron-volt order. The radiation known as
x rays occurs in this part of the electromagnetic spectrum.
X rays were detected for the first time in 1895 by W. K. Roentgen. The discovery
was a by-product of experiments on the behavior of electron currents, or cathode rays,
in the space between the terminals of a rarefied gas tube. The radiation was observed
to come from the high-potential end of the tube and was found to have a remarkable
ability to penetrate material and cause ionization in matter. The nature of the
radiation remained obscure until 1912, when x rays were conclusively demonstrated to
104 Photons
Figure 2-14
Frequency
v= 10' Hz
n J\ -
;= -16
i= 24 f_
-14 -
22 7 ray
-12
20 -
-10 Xray
L8
8
Lb
- Vis Ultraviolet
-6
14
Infrared
-4
12
2
Id
Microwave
8 and
radio
2
6
4
4
;= 6
i= 2 >
1
Wavelength
A= 1
;
m
Figure 2-15
'rfsin 8
2-6 XHays 1(15
arguments with the aid of Figure 2-15. The diffraction of x rays by a crystal lattice is
described in terms of the interference of waves reflected from parallel crystal planes. A
crystal plane is defined by a particular two-dimensional array of atoms at their lattice
sites inside the crystal. Incident waves are scattered by all the atoms in the crystal, and
of course each atom scatters radiation in all directions. The role of the crystal plane is
to single out a certain direction for the scattered waves such that the angle of
scattering equals the angle of incidence, as in the law of reflection familiar from
geometrical optics. Atoms in the crystal produce this collective effect because waves
scattered from individual atoms are in phase and interfere constructively if their
direction corresponds to reflection from a crystal plane. The resulting wave geometry
is illustrated in the figure.
The figure goes on to show two equal-angle reflections from neighboring parallel
planes. Waves reflected from planes with lattice spacing d undergo a second stage of
constructive interference at reflection angle 6 if the optical path difference for the two
reflected waves equals a whole number of wavelengths. The figure tells us that the
difference between paths is given by 2d sin 6, and so we expect to observe constructive
interference whenever the angle obeys the relation
2d
sin 6 = an integer. (2-49)
the reflection angle in the formula is called the Bragg angle. These methods of x-ray
analysis are named after W. H. and W. L. Bragg, pioneers (as father and son) of the
science of x-ray crystallography.
Figure 2-16 shows a sketch of a certain type of x-ray tube in which the x rays are
produced in the collisions of energetic electrons with a metal target. The electrons boil
away from a hot filament and are accelerated through a large potential difference to
the target, where they collide with the heavy atoms in the metal and undergo an
Figure 2-16
Figure 2-17
Target _
nucleus()
.
106 Photons
abrupt deceleration. Much of the electrons' loss of energy goes into heating the target,
while the rest is emitted directly in the form of radiation. The observed x rays exhibit
all wavelengths above a certain minimum value, as illustrated by the distribution of
x-ray intensities in the figure. This spectrum shows a superposition of two distinct
kinds of x-ray distributions. The radiation has a continuous component containing all
wavelengths A above the minimum value A mjn the shape of this distribution is
;
essentially the same for any kind of metal target in the x-ray tube. A discrete
collection of sharp wavelengths is also observed in the spectrum; these x-ray lines are
characteristic of the target and differ from one metal to another.
The continuous distribution of x rays is called a bremsstrahlung spectrum, from the
German word for "braking radiation." This component is associated with the classical
radiation that results from the acceleration of charged particles, as the electrons are
brought to a sudden stop in the metal target. The minimum wavelength is explained
by including the quantum hypothesis in the description. Let us refer to Figure 2-17
and visualize the process in terms of a single electron entering the target, encountering
an atom, and emitting a single x-ray photon. (Actually, the nucleus of the atom is
responsible for the indicated Coulomb attraction and acceleration of the radiating
electron.)
The basic process of energy loss may be described as
e
—* e + y near a nucleus,
a sort of inverse to the photoelectric effect. We identify the energy of the emitted
photon as the difference of kinetic energies shown in the figure:
hv = K- K'.
Note that the recoil energy of the atom can be neglected because the nucleus of the
atom is very massive. The energy of the photon is a maximum for those collisions in
K' = =* hv mA% = K.
The kinetic energy of the incoming electron is determined by the voltage <f>
across the
terminals of the x-ray tube:
K= e<j>.
In
= hv m *x = <*
and so
he
A mm = — ,
(2-50)
v '
It is obvious that K' is free to assume any value from zero on up to the value of K.
Therefore, v must vary continuously between zero and vmax while A ranges continu- ,
Example
(6.626 X 10" 34 J •
s)(2.998 X 10
8
m/s)(l0 9 nm/m)
19
1.602 X l(T J/eV
= 1240 eV •
nm = 1 .240 keV •
nm.
We can apply the result immediately to Equation (2-50) and express the
short-wavelength cutoff for the continuous x-ray spectrum as
1 .240 kV •
nm
rnin ,
1.24
nm = 0.031 nm
Figure 2-18
Compton scattering of radiation. The scattered wavelength A' is longer than the incident
wavelength A. In the classical picture, the incident plane wave and the scattered spherical wave
have the same frequency and wavelength.
relativistic properties, but such was Einstein's provisional attitude toward the develop-
ing quantum theory. He looked for support from experiment to establish momentum
and energy same hypothesis and was not able to find immediate
as joint aspects of the
evidence. A test of his revised photon concept was finally proposed by Debye and,
independently, by A. H. Compton. The decisive experiment was then performed by
Compton in 1923.
The Compton effect pertains to the scattering of monochromatic x rays by atomic
targets and refers to the observation that the wavelength of the scattered x rays is
greater than that of the incident radiation. Figure 2-18 illustrates the process and
identifies the Compton wavelength shift in terms of the wavelength difference A' — A.
This quantity is observed to vary as a function of the scattering angle 6 shown in the
figure. The experiment is performed with x rays because the short wavelengths are
needed to have an observable effect. A pronounced x-ray wavelength shift is associ-
ated with a scattering of the x ray by an electron in an atom rather than by atom as
the
a whole. We demonstrate this experimentally by finding that the shift does not depend
on the identity of the atomic scatterer, and so we attribute the effect to the electron as
the common constituent of all target atoms. Our discussion of the Compton effect is
given in terms of the scattering of an x ray by a free electron, since an x-ray quantum
carries enough energy to make the distinction between a bound electron and a free
The quantum theory of radiation treats the x-ray beam as a stream of photons. For
x rays of wavelength A the photon energy is given by
he
£ = hv= — . (2-51)
A
Figure 2-19
e .p
>
E , P
photon concept:
hv h
(2-52)
Note that this assignment of energy and momentum reproduces the relativistic
relation given in Equation (1-39). Recall that the equation holds for a particle of zero
mass whose speed is equal to c in all Lorentz frames. Thus, the revised light-quantum
prescription begins to attach some of the properties of a massless particle to the
behavior of a photon.
The Compton process is then described in terms of a relativistic collision involving
the elastic scattering of a photon by an electron,
y + e — > y + e
We proceed by imposing the familiar conservation laws for the total relativistic
momentum and energy, where the various kinematic quantities are identified in
Figure 2-19. We take the electron to be at rest initially and to have energy and
momentum E and P after the collision. These variables are related by the relativistic
formula given in Equation (1-35):
(2-53)
p - p' = P
2
p - 2c p
2 2
c • p' + 2
c p'
2
= c
2
P2 .
e — Zee cos a + e = E~ — my ,
e + m e
e
2
= e' + E.
e
2
- 2ee' + e'
2
= E 2 - 2Em e
e
2
+ m 2e 4 .
The two quadratic equations represent information obtained from independent con-
servation laws. We subtract equalities to get
2ee'(l — cos 8) = 2m e
c
2
\E — m e
c
2
)
= 2m e
c
2
(e — e'),
1 1
1 — cos 6 = mx "'
A' A
I
— — —
he he
using Equation (2-51) at the last step. Finally, we solve for the wavelength difference
and find
h
AA = A'- \= - -(1 - cos0) (2-54)
m (
c
It is interesting that the shift in wavelength depends only on 6 and does not
vary with the wavelength of the incident radiation. This feature of the result is seen
clearly if we rewrite Equation (2-54) in the form
AA = A c (l - cos0).
h
A c.= --. (2-55)
m c e
This quantity is numerically equal to 0.00243 nm, a length that sets the scale for the
wavelength shift in the Compton effect. It is clear that the incident wavelength must
be comparable to A f if A' is to be noticeably different from A. For this reason the
.
effect is not detectable for visible light and only begins to be measurable for x rays.
Compton's experiment was performed with incident radiation at one of the
characteristic wavelengths of molybdenum, taken from a Mo target in an x-ray tube.
These x rays were scattered from graphite, and the scattered radiation was observed in
a detector set at 90° with respect to the direction of incidence, as shown in Figure
2-20. Compton's data confirmed his prediction of a wavelength shift and thereby
verified the formula in Equation (2-54) as a valid consequence of the premises
2-7 The Compton Effect 111
Figure 2 20
Detector |
X-ray tube
treatment of the scattering of radiation. We have alluded to the latter viewpoint, and
to Thomson's classical theory, in our discussion of Figure 2-18. The classical picture
with the Compton prediction in Equation (2-54) for = 90°. The first peak represents
no shift and evidently corresponds to photon scattering from the atom as a whole. In
this circumstance, the factor h/m e c in Equation (2-54) is replaced by h/Mc, where M
is the mass of the atom. The Compton wavelength of the atom is much smaller than
A c since M
is much larger than m
e
Hence, even x-ray wavelengths experience no
.
observable shift in scattering from the whole atom, and the corresponding feature of
the data exhibits the classical Thomson prediction.
The photon is often called a particle because it occurs in radiation discretely and
because it has energy and momentum properties appropriate for a relativistic particle
of zero mass. We refrain from adopting this usage in all its implications, however. We
especially avoid contemplating any sort of localization of the photon, as we might
imagine in the case of ordinary types of particles. None of our applications of the
light-quantum concept includes any notion of electromagnetic energy and momentum
at some localized position in space. We learn that the photon is absorbed by an
~
112 Photons
Example
Let us begin with Equation (2-55) and compute the value for the Compton
wavelength of the electron. We use the electron rest energy and the convenient
constant he to get
he 1240eV-nm
Ac = = 0.002427 nm,
me 2 " 0.5110 X 10
6
eV
as quoted above. If we then let the incident x ray have wavelength A = 0.0711
nm as in Compton's 1923 experiment, we find from Equation (2-54) that the
wavelength of the x ray scattered at 8 = 90° is
Figure 2-20 shows how this feature would appear alongside the classical
Thomson component at A' = 0.07 nm. (The broadening of the observed
1 1
wavelengths may be attributed to the fact that the target particle is not
necessarily at rest, as assumed in the analysis.) For 90° x-ray scattering, the
recoil of the electron has momentum components, parallel to the incident
photon and antiparallel to the scattered photon, given by
h 6.63 X 10" 34 J •
s
P.-P- = 9.32 X 10" 24 kg •
m/s
A
"
0.711 X 10~
10
m
and
4
6.63 X 1 *
J •
s
4
P,-P'- = 9.02 X 10 ~ kg •
m/s.
A' 0.735 X 10 lu
m
9.02
tan<£ = 0.968 <t>
= 44
c
9.32
Figure 2-21
6', p'
Light
source
Observer
in S
Example
Let us take the photon's energy and momentum from Equations (2-51) and
(2-52) and introduce a momentum four-vector for the photon as
/*
=
iz/c
P
where p = - =
f
—
hv
c c
hv.
P
IE /
= y -iyfi P'
fi
= se~ y tyfi y ie'/c
where
Y
= and fi
'1 - ft''
We let the observed photon frequency be called i> m S and perform the
transformation as follows:
hv 1 hv 1 -ifi 1
c i c ifi 1
hv 1+/? hvn
= Y
;
y(i +fi)
c i(l+fi)
114 Photons
1 + j8 1+
Photons with extremely large energy are found in the y-ray regime, beyond the x-ray
region of frequencies in the electromagnetic spectrum. In practice, any wavelength of
nanometer order or less refers to y radiation, so that the x-ray and y-ray regions of the
spectrum actually overlap. High-energy y rays appear abundantly in nature in
certain very energetic physical processes. Some of the nuclear reactions that take place
at very high temperatures in stars produce y radiation. An appreciable y-ray
component is also present in the secondary cosmic radiation that results from
high-energy particle interactions in the Earth's atmosphere. Nuclear y rays are also
observed in the laboratory when accelerated particles are used to excite nuclei and
stimulate radiation.
Antimatter can be formed in the interactions of y rays if the radiation has
sufficiently large energy. The production process occurs when an incident y-ray
photon is absorbed in the vicinity of an atomic nucleus, and a matter-antimatter pair
of particles is created from the absorbed photon's energy. The least massive of these
pair systems consists of the electron and its antiparticle, the positron. Typical
particle antiparticle properties are observed for the electron and positron; the two
species have equal mass m and
e
opposite charge — e and +e. The electron and
positron are denoted as T and e
+
, and the pair-production process is expressed as
—» e~+ +
y e near a nucleus.
Figure 2-22 shows a sketch of the reaction in which a high-energy incoming photon
disappears and materializes into massive particles of opposite charge. Pair production
takes place in the Coulomb field of the nucleus, where the photon can be absorbed
Figure 2-22
and where the nucleus can act as a massive body to ensure the conservation of
momentum and energy. The nucleus is an essential participant as a spectator to the
process. It is clear that a spectator is needed because, if the photon could sponta-
+
neously convert into an e~e pair in empty space, a Lorentz frame could then be
found which e~ and e + would have equal and opposite momenta, and y would be
in
at rest. Such a conclusion is untenable since y rays must have speed c in all frames.
To analyze pair production we consider a large photon energy that is still small
compared to the very large rest energy of the nucleus. We are then allowed to neglect
the recoil kinetic energy of the spectator and account for the energies shown in the
figure by writing
hv = E_+ E +
= K^+ K + +2m e
c
2
,
(2-56)
+
where K_ and K + are the kinetic energies of e~ and e . The photon energy must
exceed a minimum value in order for the y ray to initiate pair production. We obtain
hv mm = 2m/ (2-57)
from Equation (2-56) when we set K_= and K+ = 0. This formula determines the
threshold for pair production in the limit of infinite mass for the spectator nucleus.
The numerical value of this threshold y-ray energy is 1.02 MeV. The corresponding
maximum y-ray wavelength is given by
c h
A_
max
'
=
vm 2m c
,„
nun
a result equal to half the Compton wavelength of the electron. Radiation of shorter
wavelength contains photons whose energies exceed the threshold and produce elec-
trons and positrons with nonzero kinetic energy.
Pair creation has a reverse process known as pair annihilation. This reaction occurs
at any energy when a positron encounters an electron and the two particles disappear,
converting their total energy to y radiation. Positrons suffer this fate
relativistic
+
inevitably whenever they come into proximity with matter. The e~e system cannot
annihilate into a single y ray because the photon would then be found at rest in the
CM frame of the e~e
+
pair. Pair annihilation occurs most rapidly in the two-photon
mode
+ ->
e'+ e 2y,
although annihilation into three y rays is also possible for certain configurations of the
electron-positron system.
+
An interesting e~e system actually exists in the form of positromum, in which e~
+
and e are bound together by the force of Coulomb attraction. This quasiatomic
structure has been synthesized in the laboratory and has been analyzed thoroughly in
experiment and in theory. Positronium has a very short lifetime because of the
10
pair-annihilation mechanism; the system lives only 10 s on average before decay-
ing by its 2y annihilation mode. Figure 2-23 shows an illustration of the decay in the
positronium rest frame, where the indicated photon energies are determined by the
relation
2hv = 2m/.
116 Photons
Figure 2-23
CM at rest
2m/ 2 , hv
Before After
It follows that each y-ray wavelength is equal in value to the electron Compton
wavelength h/m/ in this particular frame.
hv exceeds 10 MeV. Of course, the other two processes are the only ones possible when
h v is less than MeV. The photoelectric effect dominates below 100 keV, while
1
scattering takes over in the intermediate range above and below MeV. We can make 1
these qualitative assertions more specific after we have learned how to measure and
compare the various processes.
Figure 2-24
Example
The threshold formula for pair production in Equation (2-57) holds in the
limiting case of an infinite-mass spectator. The nucleus of an atom approximates
this limiting condition quite satisfactorily. Let us instead consider pair produc-
tion in the neighborhood of an electron, so that we are compelled to account for
the kinematics of all participants in the reaction
The threshold formula turns out to be quite different in this case. We express the
situation at threshold by assigning the incident photon an energy e = hv min and
by letting the three final particles have the same speed u in the same direction.
Momentum conservation requires
3m u
(*)
c ]/l - u
2
/c 2 '
o 2
3m c
e + m e
c
2
= - e
o ^
. (**)
yl — «/r
e u
e + m.c c
and so
u 2emc 2 + m 2,c*
2 2
c (e + m e
c )" (e + m p
c
\
e = ,
'l
3m
- u
uc
2
=
/c 2
= 3m c
e + m/
e
2
\jm f c (2e
e
2
+ mc
+ m e
c
2
)
or
J2e + m (
c
2
= 3{m/~ .
9m t
c
2
— m e
c
2
e = hv mtn = = 4m c
The methods of Section 1-11 can also be used to obtain the same conclusion.
118 Photons
Problems
1. Show that the universal function in Kirchhoff's theorem is identical with the spectral
emittance of a blackbody. Prove that, for an arbitrary radiating object, the theorem can
be stated as
E „(r) = «,(r),
2. Calculate the power received from the Sun at the Earth, the total power radiated by the
Sun, and the temperature of the Sun. The following data are provided:
5 = 1350 W/m 2
solar constant,
r
E
= 6.37 X 10
6
m radius of the Earth,
r
s
= 6.96 X 10" m radius of the Sun.
the plane of the enclosure. Obtain expressions for the components of the electric field,
subject to the condition £ tan = at the boundaries. Determine the allowed frequencies of
Continue with the hypotheses of Problem 3, and show that the number of modes at
27:1'
K = —a, c
Still continuing, let "radiation" be emitted from a small opening in the enclosure, noting
that the opening is on^-dimensional so that the radiation streams across an arclength. Let
M v
be the energy emitted per unit time per unit arclength per unit frequency interval,
and let «,, be the energy stored in the enclosure per unit area per unit frequency interval.
Show that these quantities are related by
M„=-u„
Itemize all the microstates for the assignment of four particles into three cells. Organize
these according to macrostates of the system and verify that the number of microstates
Problems 119
corresponding to each macrostate is in agreement with the general formula for the
thermodynamic probability.
<«> =
^ = kB T
8. Show that the Planck result for (e), the average energy per mode, agrees with the
classical equipartition result k B T in the limit of small frequency.
9. A 50 g mass hangs from a spring whose spring constant is 80 N/m. The oscillations of the
mass have amplitude 10 cm. Can it be argued that this macroscopic classical oscillator is a
Planck oscillator by assigning the system to a quantized energy state whose energy is a
multiple of hv? How accurately would the energy of the oscillator need to be known in
10. The wavelength distribution of a 5000 K blackbody radiator is studied for a range of
correspond to the values of X where the spectral emittance is half the peak value.
11. For a blackbody, the frequency distribution peaks at vm and the wavelength distribution
peaks at \ m Consider the derivations of the dependence of vm and \ m on the temperature
.
12. The peak value M x at the wavelength \ m in the distribution of blackbody radiation
increases with temperature as indicated in Figure 2-3. Show that M^ depends on T as
MXm = CT",
13. Continue with " flatland radiation" following Problem 5, and deduce a formula for the
frequency dependence of M v
. (Should the Planck formula for (e) be different from the
case for a three-dimensional cavity?) Integrate M v
over all v to obtain the total emittance
M. Cast the integration in a form that contains the dimensionless integral
/•oo x dx
Jq e* - 1
as an explicit factor. Establish the form of the Stefan- Boltzmann law for "flatland," and
determine the analogue of the Stefan-Boltzmann constant in terms of the dimensionless
14. Determine the range of photon energies that corresponds to visible light, with wavelengths
lying between 400 and 700 nm. Repeat the calculation for radiation in the ultraviolet
range from 5 to 400 nm, and in the infrared range from 700 to 500,000 nm.
15. Radio station Will (Amherst) broadcasts at 1430 kHz with 5 kW power. Calculate the
energy of a WTTT photon and the number of photons broadcast per second.
120 Photons
16. Estimate the number of photons emitted per second by a 100 W light bulb, assuming the
average wavelength for the light to be 500 nm. At what distance from the source is the
photon flux equal to 100 photons per second per square centimeter?
y + ffree ~~
*
f
free '
19. The radiation from a 500 K blackbody strikes a metal surface whose work function is
0.214 eV. Determine the wavelength for which the peak of the blackbody spectrum
occurs, and determine the longest wavelength in the spectrum capable of ejecting
photoelectrons from the surface. What portion of the blackbody 's total emittance M(T) is
effective in producing photoelectrons from the metal surface? Express the result in terms
20. A y ray with wavelength 0.005 nm is incident on an electron at rest and is scattered
straight backward. Calculate the wavelength of the scattered y ray and the kinetic energy
21. Ay ray of wavelength 0.0062 nm is incident on an electron initially at rest. The electron
is observed to recoil with kinetic energy 60 keV. Calculate the energy of the scattered y
ray (in keV), and determine the direction in which it is scattered.
22. Obtain a formula for the fractional loss of energy (e — e')/e for a Compton-scattered
photon in terms of the incident wavelength A and the Compton shift AX. Calculate
values of this quantity for 90° photon scattering as A varies from 0.1 nm down to 0.001
nm.
23. Consider Compton scattering of photons with wavelength A, and show that the scattered
photon direction 6 and the recoil electron direction </> are related by the expression
cot-
/
=11 +
M
y tamj.,
have just enough energy to create an electron- positron pair in a dense medium. Deduce
the value of the voltage on the terminals of the x-ray tube.
process is known to be
kvmm
m -
= 2m,c
f
2
for M— * oo
and
figure. Let a photon and an electron collide head-on with relativistic energies e and E,
respectively, and derive an expression for the energy e' of the final photon, in the special
1 + my/ieE
Calculate the value of e' for the collision of a 700 nm photon with a 20 GeV electron.
-0
Before After
THREE
INTRODUCTION
TO
THE
ATOM
and skeptics alike that the quantum theory was in need of a proper formulation and
that the theory was likely to have applications to matter as well as radiation. It was
generally recognized that the structure of the atom presented the next urgent problem
to be solved.
The modern view of the atom began to develop during the same year. E.
Rutherford was the one responsible for devising the basic model and for conducting
the decisive experiment. Rutherford's conception of the atom drew upon classical
principles aloneand contained a serious flaw that posed another insoluble problem for
classical The remedy was found by appealing to quantum concepts to
physics.
complement the classical picture. This next major contribution to the quantum theory
was introduced in 1913 by N. H. D. Bohr. The proposal had the effect of broadening
the new theory into a combined quantum treatment of matter and radiation. The
Rutherford atom and the Bohr atom were the most important developments in physics
during this period.
The molecular and atomic nature of matter attracted speculative interest long before
the real existence of molecules and atoms could be proved. These constituent particles
were believed to exist as identical units in a given substance and were supposed to
make up the composition of any sample of that substance. This belief began to gain
ground on two fronts during the 19th century. The concept of submicroscopic units
was given consideration in chemistry in a scheme to organize the regularities of the
elements and was directly employed in the kinetic theory in a model to describe the
behavior of gases. Skeptics could still argue, however, that even the most successful
122
3- I The Reality of Molecules and Atoms 123
scheme or model could not establish the actual existence of the particles. This
skepticism had to be overcome by building up the internal consistency of the whole
molecular picture.
The periodic table of the elements was conceived during this period. The idea was
originally put forward in 1869 by D. I. Mendeleev as a device for arranging the
known chemical elements in order of increasing mass. In time the atomic number
supplanted the atomic mass as the more relevant ordering index, although the
ultimate interpretation of the ordering scheme was not made clear until much later.
Mendeleev's table listed the atomic numbers in rows and columns, where each row
contained a series of elements whose chemical valence varied in a pattern from left to
right, and where each column contained elements whose valence was the same from
top to bottom. This organization of the chemical species achieved order out of
disarray, as elements were arranged according to common chemical properties and
vacancies were left available for undiscovered elements. The demonstration of these
regularities supported the belief that different varieties of atomic particles were to be
associated with the various species.
Avogadro's hypothesis was among the first assertions of the reality of molecules and
atoms. The principle stated that equal volumes of gases at fixed pressure and
temperature must contain equal numbers of molecules. Avogadro's number was not
immediately determined by either theory or experiment. The eventual determination
of this quantity proved to be a decisive influence on the credibility of the molecular
theory.
A discovery early in the 19th century by the botanist R. Brown played an
important part in these developments. Brown used a microscope to examine grains of
pollen suspended in a fluid and observed that the microscopic particles executed a
distinctive random motion. The explanation for this effect came decades later, after
the formulation of the kinetic theory. The movement of pollen grains was seen as
evidence for the thermal motion of particles in the suspending fluid, as the collisions of
124 Introduction to the Atom
fluid particles with suspended particles resulted in the observed Brownian motion.
Einstein analyzed this behavior in 1905 (again, his phenomenal year) and showed that
the conclusions could be used experimentally to deduce a value for Avogadro's
number.
We can reproduce Einstein's treatment of Brownian motion in a few steps by
following P. Langevin's more elementary analysis of the derivation. Let us consider a
large number and mass m, suspended in a
of identical spherical particles with radius a
fluid of viscosity and examine the forces that act in one direction on one of the
tj,
particles. The force of viscous damping depends on the velocity of the particle
according to the law of G. G. Stokes:
dx
F™=-Pj
K,= -fi
t
>
(3" 1 )
where
= 6irt)a. (3-2)
The suspended particle also experiences a random force Fcol whenever it collides with
particles in the suspending fluid. Newton's law for the mass m takes the form
d 'x
a^ /o n^
since there are no other forces acting on m. (We neglect gravity and buoyancy because
these effects tend to cancel each other.)
We are not concerned with a detailed solution for x( t ), the instantaneous position
of the suspended Such a function would be rather difficult to predict because
particle.
of the complex t dependence of the random external force Fcoi Instead, we are .
2
d 2x
d dx
—d dx *
x
2 _ 2x—
dt
and
dt
-x
2
2
= 2x—
dt
2
+ 2
I
—
dt
dt \
2
1 d Idx^ P d
-tf-- («)
lit 2 * ' dt ? y*.
The next step is to average this result over all the suspended particles. We note first
that the term xFco[ must average to zero because of the random nature of the collisions
3- I The Reality ol Molecules and Atoms 125
however, the mechanism for Brownian motion is still present in one of the other
remaining terms. Note that the second contribution on the left is twice the kinetic
energy of the particle for one degree of freedom. We recall that the average of this
energy is given by \k B T, where T is the temperature of the suspending fluid. Thus,
when we introduce
dx
m\—\ ) =k HD T J
dt
for the average of the term in question, we find that thermal-equilibrium aspects of
the problem are still securely in place. The averaging procedure is finally performed
as follows:
m d2 (i d \ I I dx \
2
\ m dg fi
X
-
+ " + = T
2^ 2 7<*T\"U)I -27, -2 S k"
'
(3 - 6)
d
t<0 -«<*•>
Note that m and /? are common to all particles and therefore act as constants when
the averages are taken.
Wedraw our main conclusion from this differential equation for g(t) by passing
immediately to a certain limiting case. The phenomenon of Brownian motion becomes
more and more pronounced for diminishing values of the mass of the suspended
particle, so that Equation (3-6) describes the situation of interest in the m — > limit.
The dg/dt term then drops out of the equation, and so the solution for g( t ) assumes
the simple limiting form
d 2k R T
We take (x
2
) = 0at< = and obtain the t dependence of the Brownian fluctuation
by integrating Equation (3-7):
T T
<*
2
>=^=T^ 2k B kR
(3-8)
using Equation (3-2) at the last step. We obtain the final formula
RT
<*
2 = 1 (3- 9
> a, IT
S m]aN
>
R
k B=
This remarkable deduction was the basis for a series of experiments undertaken by
J. B. Perrin in 1908. Perrin's measurements of NA were reasonably close to the current
value
NA = 6.0221367 X 10
23
molecules/mole.
Example
X 10-'23
k B Tl (1.38 J/K)(300K)(60s)
<*
2
>
= 26.4 X 10" 12 m
3vTT)a 3t7(10"
3
N s/m
•
2
)(l0"
6
m)
The atom was believed to have an internal charged structure even before the real
existence of atoms could be firmly ascertained. Several pieces of evidence supported
this belief. Faraday's electrolysis experiments detected the presence of charged atomic
particles, or ions, in solutions. Radiation from atoms suggested the influence of some
kind of oscillating charge inside the atomic system. Radioactivity demonstrated the
ability of some atoms to change certain aspects of their internal composition. These
indications of structure were already in evidence before 1900. At the end of the 19th
century the electron was finally identified as a universal charged constituent that
appeared in the construction of all atoms.
Electrons were discovered in electrical discharges in gases at low pressure. These
phenomena were observed in the application of a potential difference to gaseous
systems containing electrons, ions, and neutral atoms. The electrons in a discharge
tube could be accelerated by an applied voltage to produce a beam of cathode rays, so
called because the negative charges moved in the applied field toward the anode and
appeared to originate at the cathode. The beam particles were known to have
negative charge because the deflection of the beam by transverse magnetic and
electric fields could be detected and correlated with the sign of the charge. The
identity of the particles was established by Thomson in 1897, in an experiment
designed to measure the charge-to-mass ratio e/m of the cathode rays.
3-2 The Electron 127
Figure 3-1
Thomson's cathode ray tube. Electrons are accelerated from the cathode to the anode and are
collimated by the slit to produce a beam. The electrons are deflected by the transverse field
applied to the beam on its way toward the phosphorescent screen.
Cathode
Screen
A sketch of Thomson's cathode ray tube is shown in Figure 3-1. Let us recall the
familiar details of the classic e/m experiment by referring to Figure 3-2, where we
describe the path of a negative charge —e in a transverse electric field. We regard the
strength E of the deflecting field as known, given a plate separation d and an applied
voltage <£>:
E=
The velocity in the x direction is equal to the speed v of the particle as it enters the
field, while the acceleration in the y direction is equal to eE/m since eE is the
magnitude of the force on the mass m. If we let the particle traverse the field in time t
we get
x = vt
1 eE 5
E e
(3-10)
2 m 2 m \
for the vertical deflection of the beam as it leaves the field. The speed v is determined
by a separate procedure at another stage of the experiment, where a magnetic field of
strength B is also applied in a direction perpendicular to both v and E. The desired
configuration of v, E, and B is such that the electric and magnetic forces cancel, so
Figure 3-2
O^ E
~
128 Introduction to the Atom
that the particle moves through the fields undeflected. We tune the B field to ensure
the equality of forces,
eE = evB,
(3-1D
.-f
Equations (3-10) and (3-11) are then combined to produce a formula for the ratio e/m
in terms of measurable quantities.
Thomson's determination of e/m for the electron gave a value that was rather
different from the current figure
—
m
= 1.7588196 X 10" C/kg.
r
His result was significant nevertheless, because the order of magnitude was too large to
be interpretable in terms of ions (given what was known at the time about ionic
charges and masses) and because the value did not vary appreciably when different
gases and cathodes were used in the tube. It was clear that he had discovered a unique
particle of smallmass whose occurrence was common to atoms of all species.
Since the cathode rays in Thomson's discharge tube were electrons with negative
charge, it would follow that ions with positive charge should also be detected drifting
through the gas in the opposite direction. These ions were in fact observed in a similar
kind of tube in which the cathode was perforated to allow the reverse flow of particles
with positive charge. Thomson called the particles positive rays. Rutherford proposed
that the most elementary ions should be those obtained from hydrogen atoms, and he
gave these the name protons. An e/m experiment of the Thomson type would find all
such atomic ions to have charge-to-mass ratios thousands of times smaller than the
value of e/m e .
The famous Millikan oil-drop experiment is like its partner, the Thomson e/m
experiment, to the extent that both employ concepts from classical physics and both
produce information for the modern era. A sketch of Millikan's device in Figure 3-3
shows an oil drop of mass M
suspended in air in an electric field E between a pair of
charged metal plates. We let the selected drop have a net positive charge q, and we
express the charge as n multiples of the electron charge unit e. A microscope is used to
observe the and fall of the drop through a fixed fiducial region as indicated in the
rise
figure. The forces on the drop are the weight Mg, the Stokes-law force Bv due to the
viscosity of air, and the electric force neE. We assume that the motion in the fiducial
region occurs at terminal velocity for all observations of rise and fall. The forces acting
on the drop are therefore in equilibrium so that the drop moves with constant speed v.
The procedural aspects of the experiment are described in the figure. First, we
examine the fall of the drop with the electric field turned off. The figure tells us that
32 The Electron 129
Figure 3-3
1 i
M k
E y
q = ne
Fall Rise
neE
P»o = PTn
pv =
j8f
'Mg Mg^
y
(3-12)
where t is the measured fall time. Next, we measure the rise time t with the electric
field turned on. The figure shows the equilibrium condition to be
y
—
neE = Mg + ft
i i
= h — + -
where Equation (3-12) is used to get the final expression. We then alter the charge on
the drop by exposing the air gap to a burst of x rays. The ionizing radiation changes
the charge multiple from n to ri and introduces a new measured rise time t'. The
previous equation changes accordingly into the new form
( 1 1
n'eE = (3y
The desired formula for the analysis of the experiment is obtained by subtracting the
130 Introduction to the Atom
(3-13)
We insert data into this formula for a series of repeated trials, corresponding to a
variety of integer differences n' — n, and thus deduce the value of e from the resulting
survey.
The results of a Millikan oil-drop experiment and a Thomson e/m experiment can
be combined to determine the mass of the electron. More current experimental
methods are also available in which greater accuracy is achieved in the measurement
of e and m r The values quoted for these basic parameters
. of the first known
elementary particle are
e = 1.60217733 X 1(T
19
C and m =
t
9.1093897 X 1(T 3 kg. '
Example
Let us get a feeling for the numbers involved in a Thomson e/m experiment by
choosing the following reasonable values for the measurements. Take the
deflection plates in Figure 3-1 to be at 200 volts with a 2 cm separation so that
the deflecting electric field is
200 V
= 10
4
V/m.
d 0.02 m
Let the length of the plates be 4 cm and let the vertical deflection of the electron
beam be 0.5 cm so that the speed of the electrons is found from Equation (3-10)
to be
4 1/2
Eex
2
10 V/m)(l.60 X 10- ,9 C)(0.04mV
2 my 2(9.11 X 10" 31 kg)(0.005m)
7
1.68 X 10 m/s.
The strength required for the B field may then be computed from Equation
(3-11):
E 4
V/m
B=-
v 1.68
10
X
— m/s
10
=
7
= 5.95x10 4
T,
Example
measurable quantities, since the radius of the drop is not measured directly. We
assume a spherical drop and express the mass as
M= 3
f77a p,
where p is the density of oil. We then recall the formula for /? from Equation
(3-2) and rewrite Equation (3-12) in the form
y 4
677170 — = —irapg.
t 3
was known that the electron was thousands of times less massive than any atom, and
so it was obvious that almost all the mass of the atom had to reside in its positive
component.
Thomson visualized the distribution of charge and mass in the atom by means of a
spherical model, shown in Figure 3-4, in which the whole atomic volume was supposed
to filled by the massive positive component, except for isolated locations throughout
be
the volume where the Z electrons were to be found. He assumed that the electrons
were subject to the Coulomb force and were able to move about positions of
Figure 3-6
_^£&Y*^
€) Foil
Collimator
3-3 The Nuclear Mode! of the Atom 133
several MeV, and a collimated beam of these particles was directed at a very thin foil
target. Scattered particles were detected on a scintillating screen that responded to the
arrival of a charged particle by emitting a burst of visible light. The distribution of the
scattered a particles was studied at fixed energy as a function of the scattering angle 6
shown in the figure. Different beam energies were obtained by using various a sources,
and different atoms were probed by substituting foils of various metals. The purposes
of the experiment were to determine the angular behavior of the scattering and to
compare observations with predictions in order to assess the validity of the atomic
model. Two qualitative conclusions were immediately drawn from the first experi-
ments. Almost all the incident particles were transmitted through the foil with very
little deflection, and a very few incident particles (approximately 1 were
in 10,000)
scattered backward into detectors located on the beam side of the foil. These amazing
observations were explainable only in terms of a nuclear model of the atom.
Let us see how this evidence bears on the choice of atomic models. We note first
that it is safe to ignore atomic electrons when a particles are incident on atoms
because the electrons are too light to cause appreciable scattering of the much heavier
a particles. The observed scattering is therefore attributable to the interaction between
the a particle and the massive positive component of the atom. This interaction is
expected to have very different effects in the two models of Thomson and Rutherford.
The Thomson atom has a very diffuse distribution of positive charge, and so the
encounters between beam particles and target particles are likely to result in only
small random deflections as the a particles pass through the target atom. Rutherford's
model of the atom adopts a very concentrated distribution of positive charge so that
most of the atom presents itself as empty space to an incident particle. Therefore, most
encounters with the Rutherford atom also result in small deflections, except for the rare
occasion when the encounter between the a particle and the target nucleus is nearly
head-on. These collisions produce the rare backward scattering events that distinguish
the Rutherford atom from the Thomson atom. Thus, the qualitative predictions of the
nuclear model are borne out in the experimental results, while the diffuse model is
decisively refuted. The details of the Rutherford scattering problem are analyzed
quantitatively in Section 3-4.
Example
e = 3
\<irR p.
The force on the embedded electron depends on the charge ^mr zp, the portion of
the total positive charge that lies within the electron's instantaneous location at
radius r. Gauss' law determines this force to be
F= --—fa r
>
9
)L 1 t
477e r
where the sign denotes an attractive force directed toward the center of the
134 Introduction to the Atom
F= -kr
with effective "spring constant"
1 1
k = npe
477£ 3 47re R3
We therefore expect the electron to execute simple harmonic motion with
frequency
1 Fk
f
=
and we also expect the oscillating electron to emit radiation of the same
frequency. These considerations have nothing to do with reality, as the spectrum
of hydrogen has infinitely many frequencies and indicates a very different
physical picture.
scattering of a single beam particle by a single target nucleus, with charges denoted by
ze and Ze, respectively. (Since a particles are ionized He atoms, the corresponding
charge index is =
Both charges are positive and so the
actually given by z 2.)
1 Zze'
K=
47re D
3-4 Rutherford Scattering 135
Figure 3-7
,, 1 ,, -j Infinite mass
Te ©—
r = oo r =
^D (l)ze
2
1 Zze 2 1 2 Zze
D= = -. (3-14)
47T£ K 477£ My
The speed of the particle at r = oo appears in the final step through the nonrelativis-
tic expression for the beam kinetic energy, K= t,Mv".
We turn next to the general case of a non-head-on collision and consider the orbit
of a scattered particle. Figure 3-8 shows how the incident particle is initially directed
along an incoming asymptote that passes the target at a distance b, known as the
impact parameter. This distance is an important property of the orbit in any scattering
problem. The conservation laws of energy and angular momentum require the
outgoing asymptote to have the same impact parameter. It then follows that the orbit
is symmetrical about a bisector that divides the motion into incoming and outgoing
parts. These features of the orbit are indicated in the figure. We note that the orbit in
Figure 3-8 can be identified with the path of the scattered particle in Figure 3-6 and
that the same scattering angle 6 appears in both figures. Our main objective in this part
of the analysis is to establish the relation between the impact parameter and the
scattering angle.
Angular momentum plays a major role in the solution for the orbit. A conservation
law holds for the angular momentum vector L = r X p because the Coulomb force is
a central force that imposes no torque on the particle. Therefore, the direction of L
is fixed (so that the orbit lies in a plane containing the origin), and the magnitude of L
Figure 3-8
Orbit of a charged particle in a non-head-on collision. The orbit has incoming and outgoing
asymptotes parametrized by the impact parameter b and the scattering angle 6. The polar
coordinates r and cp denote the instantaneous orbit variables for the scattered particle.
, .
L = Mvb. (3-15)
L = Mr 2 —
dcp
dt
(3-16)
We can use this formula immediately to eliminate the unknown variable (p from the
dynamics.
The desired solution for the orbit is supposed to be an expression for r as a function
of <p. Newton's law leads us instead to a description in terms of the joint variables r(t)
and <p(t), where the time serves as a variable parameter. We recall that the radial
d 2r I d<p\ 2
a '~ ~
2
It '
' \~dij
We then introduce the Coulomb force on the mass M and write the radial equation of
motion as
2
d2r I d<p\ 2
1 Zze
M 2
(3-17)
dt '
\ dt 47re r
(3-18)
u
dtp L L
It ' 'Ah
2 " ~M
from Equation (3-16). The following maneuvers are executed to change the variables
from r(t) and <p(0 to the single unknown function u(cp):
du d<p du L L du
dr dr 1
2
dtp
—
M u
)
d.
~M dtp
'
It tin d<p dt U
and
d2r d L d2u L
V —M
I dip
-1 u
2
Hi2 '
d<p{ dt -J ~~M
34 Rutherford Scattering 137
L2 d2u 1 L2 Zze
2
M 2
u
2
.
We then divide by the factor —L"u'/M and obtain a differential equation for u((p):
—+„=--
d^u Zze 2 M
— . (3-19)
d<p~ 477-f L"
The constant on the right can be rewritten in a more compact form by consulting
Equations (3-14) and (3-15) and by using K = 2
-.Mv :
Zze
2
M M D
477e L 2 ~ M 2 2
v b
2 " 2b
2
'
These steps lead us to the final version of the equation for w(<p):
d2u D
x
2
+„=-_.
+ = u (3-20)
d<p 2b
a particular solution of the given equation and by observing that sin qp and cos cp are
independent solutions of the corresponding homogeneous equation. We obtain the
general solution of Equation (3-20) by combining these special solutions as
This result is subject to two conditions, both imposed at r = oo on the incoming portion
of the orbit in Figure 3-8. We specify the first requirement,
to designate the initial position on the orbit. We also observe that the incoming
velocity is
dr L du
dt M dq)
—=—
duMv
d(pL
= -
1
b
when <p = 0, (3-23)
to designate the initial derivative on the orbit. The resulting orbit equation for u(<p)
. .
- = -sin? -
1
—Dj(l - cos<p), (3-24)
r lb
because this expression has the structure of Equation (3-21) and obeys the conditions
in Equations (3-22) and (3-23).
Our final result contains more information than we need, since the orbit equation
describes every point on the orbit of the particle. In a typical experiment, a beam of
particles is directed onto a target and the scattered particles are counted in a detector
far from the target, so that only the asymptotes of the orbit are under actual
observation. We recall from Figure 3-8 that these asymptotes are described by the
orbit parameters b and 8, and we note that the orbit equation tells us how b and 8 are
related. When the orbit reaches the outgoing asymptote we have
r = 00 and <p = ir — 8
1 / D
= -sinC (1
y
+ cos0)
b\ 2b
8 8 D J
= —
1
b\
/
2 sin — cos— —
2 2
—
2b
2cos —
2
for these values of the orbit variables. The desired relation between the scattering
angle and the impact parameter follows as a direct result:
sin—
8
2
— —D cos— =0
2b
8
2
=> cot—
82b
—
2D = (3-25)
We need only this much information from the orbit equation to continue our analysis
of Rutherford scattering.
The first stage of the problem has been devoted to the collision of a single beam
particle with a single target nucleus. Let us now turn to the second stage and consider
the observational aspects of the scattering process. An experiment usually involves a
beam many incident
carrying particles with the same kinetic energy and a target
sample containing many nuclei of the same species. We identify two factors pertaining
to these considerations — the number of nuclear scatterers exposed to the beam and the
rate of incidence of particles onto the target. The factors are deduced from properties
of the beam and the target as follows. First, introduce p and m as the density and
atomic mass per mole of target material, and assemble these quantities along with
Avogadro's number to form the number of nuclei per unit volume
Then, let 8 be the thickness of the sample and suppose that a surface area A of target
is exposed to the beam. The number n of nuclear scatterers is evidently equal to the
product of these accumulated factors:
pN,
A 8. (3-26)
m
3-4 Rutherford Scattering 139
The beam intensity I is defined as the number of incident particles crossing unit area
per unit time, and so the number N ofbeam particles incident on the target per unit
time is written as
NQ = I A . (3-27)
The parameters introduced in the last two equations are controlled in the design of the
experiment.
The scattered particles are counted by a detector system that has an axis of
symmetry along the direction of the incident beam. Figure 3-6 shows a small detector
at the scattering angle 0, where scattered particles are collected at a rate that does not
vary with the placement of the detector around the beam axis. All particles scattered
at angle 6 are therefore detected uniformly over a thin ring of detectors around this
axis. Let the ring lie on a sphere of radius R centered at the target, and let the small
The distance R from the target to the detector has no significance in the interpretation
of the scattering distribution. Only the direction of the scattered particles is of interest,
and so the factor R~ is removed from the element of area to define
dtt = —
dS
R
= 2
2ir sin0 d6 (3-28)
as the more effective measure of the scope of the detector. This definition introduces
the element of solid angle as a mathematical quantity whose span takes in all directions
that emanate from the and pass through the detector at the angle 6. The solid
target
angle dQ, is dimensionless and is conventionally given in steradians, by analogy with
the angle d6 given in radians.
We let dN denote the number of particles scattered per unit time into a given solid
angle dSl. This measurable quantity is proportional to dQ, so that the ratio dN/dQ, is
independent of the size of the subtended solid angle. The resulting number of counts
per unit time per unit solid angle varies with the scattering angle 6 and the beam
energy A'. Hence, the ratio dN/d& gives the angular distribution of scattered particles,
the main object of immediate experimental interest.
Particles are scattered into dQ, by every target nucleus exposed to the beam. Figure
3-9 shows how a single orbit is influenced by one nucleus and how the behavior of all
such orbits is incorporated in the treatment of a beam containing many particles.
According whose impact parameter lies between b
to the figure, every incident particle
and b + db follows an orbit whose scattering angle lies between and 6 + dO. Axial
symmetry applies to this statement, so that all particles entering the axially symmetric
element of beam area 2mbdb are scattered by the one nucleus into the indicated
element of solid angle dQ,. We implement this essential observation with the aid of
quantities introduced in the preceding paragraphs. The number of particles incident
per unit time is given by the product of factors
2-rrbdbI ,
and the number scattered into dSl per unit time by a single nucleus is given by the
140 Introduction to the Atom
Figure 3-9
Ingredients in the definition of the scattering cross section. All particles passing through the
beam area 2mbdb are scattered into the solid angle dSl. The relation between the scattering
angle 9 and the impact parameter b is such that dO and db have opposite sign.
db
Nucleus
ratio
dN
n
These two expressions are set equal, and the beam intensity I is divided out, in order
to define the scattering cross section
dN
da = — =
riL,
2irbdb. (3-29)
The identification of this new quantity is the final goal in our analysis of the scattering
problem.
Let us examine the significance of the cross section by recalling Equation (3-27)
and rewriting Equation (3-29) as
dN I n
K T
da
()!
# scattered into d£l by one nucleus/* incident per unit area of beam.
The interpretation tells us that the cross section has units of area; this simple
observation leads us to the basic meaning of the new concept. The cross section da
represents the effective target area presented to a single beam particle by a single
target nucleus for scattering into the solid angle d£l. Cross sections are given in barns,
3-4 Rutherford Scattering 141
" 28
1 barn = 10 nr.
A direct proportionality always holds between da and dQ so that the ratio da/dQ, is a
more basic quantity. This ratio is called the differential scattering cross section, a function
of scattering angle and beam energy, given in units of barns/steradian (b/st). The
definition of da/dQ, removes the last experimentally controlled ingredient from the
measured counting rate dN. Removal of the factors n, I and dil begins in Equation ,
(3-29) and serves to divide out those elements of dN that are peculiar to a particular
experiment. The result da/dSl is left behind as a measure of the basic two-body
process that is common to all such experiments.
The definition of the cross section holds generally for any kind of elastic scattering.
We apply the concept directly to the Rutherford scattering problem by recalling the
relation between b and 6 for Coulomb scattering and by implementing Equation
(3-29). We relate db and dO by differentiating Equation (3-25) to find
6 dO
-esc 2 = —2
2 2/) db. (3-30)
v '
We note that db and dO are of opposite sign. This property of the orbits can be seen in
Figure 3-9, where the repulsive nature of the Coulomb force causes particles incident
with impact parameters larger than b to be scattered through angles smaller than 6.
We insert Equations (3-25) and (3-30) into Equation (3-29) and obtain the formula
for the cross section in the following series of steps:
D I D ,0 dd\
da = 2tt — cot— esc
2
2 2\ 2 2 2/
77 sin(0/2)cos(0/2)
4 V ;
4 sin (0/2)
it smd\d6\
2
4
8 sin (0/2)
D 2
d&
4
' (3-31)
16 sin (0/2)
The solid angle expression in Equation (3-28) is used at the final step. We can then
recall Equation (3-14) and immediately write the differential cross section as
da D 2
4
(3-32)
dQ 16 sin (0/2)
These two famous formulas describe the angle and energy dependence of the
Rutherford cross section.
Geiger and Marsden completed a very painstaking series of a-par-
his student E.
ticle scattering experiments in 1913. They made observations over an angular range
from 5° to 150° and counted more than 10 5 scattered particles on a zinc sulfide
scintillating screen. Their results verified the predictions of the Rutherford formula as
142 Introduction to the Atom
4
to the csc ( 0/2) behavior in the scattering angle and the \/K l dependence in the
a-particle energy. They also established that the scattering was proportional to the foil
thickness 8 and the square of the atomic mass of the foil material. This last
while the analysis of Rutherford scattering assumes a point-like nucleus. The deriva-
tion is valid nevertheless, provided the a particle does not have enough energy to
penetrate the volume of the nucleus. The two spherical particles interact through the
Coulomb force between point charges until the beam energy becomes sufficiently large
to cause penetration, whereupon the strong nuclear force takes over and becomes the
dominant effect. These considerations suggest a way of using Rutherford scattering to
deduce the radius R of a particular nucleus. We employ the energy-dependent
distance D for reference and compare results from experiment for the differential cross
section with predictions from Equation (3-32) for da/d^l. Agreement implies the
validity of the Coulomb scattering assumption, and this in turn implies that D exceeds
R. We then reduce D by increasing K until we observe the onset of a departure from
the Coulomb prediction. The corresponding critical value of D provides a determina-
tion of the nuclear radius R.
Let us finally be reminded that the derivation of the Rutherford formula is based
entirely on classical principles. It is reasonable to wonder whether Newtonian mechan-
ics should apply on a scale as small as the atomic nucleus, particularly because we
anticipate the influence of quantum principles in atomic and subatomic systems. It
happens that a proper quantum mechanical treatment of nonrelativistic Coulomb
scattering leads to exactly the same conclusion as the classical Rutherford derivation.
Example
unlike the ring of surface area used to derive Equation (3-28). The figure gives
the area as
dS = R 2 sm6d9d<t>,
Figure 3-10
R sin $d<t>
3-4 Rutherford Scattering 143
dS
d&= —j = smO d6d<j>.
R2
We obtain the total solid angle by integrating dSl over the range of the angular
coordinates, < < it and < <p < 2ir. Let us first integrate over <f>
to get
2
dtt= CsinOde f "d<t> = flirsinedd,
f J J J
•'all Q,
a result to be compared with the expression for the ring in Equation (3-28). The
final integration over 6 gives
77
Example
e
2
(8.988 X 10
9
N m2/C 2 )(l.602
•
x 10 19
C)~
l9
477e 1.602 X 10" J/eV
= 1.440 X 10" 9 eV •
m.
e
2
Zz (1.44 X 10" 9 eV- m)(79)(2)
D= = ,
6
' =4.29X 10
14
m.
4t7e A 5.30 x 10 eV
p = 19.3 X 10
3
kg/m 3 and m= 197 g/mole;
pNA (19.3 X 10
3
kg/m 3 )(6.02 X 10
23
nuclei/mole)
m 0.197 kg/mole
= 5.90 X 10
28
nuclei/m 3 .
144 introduction to the Atom
Let the foil thickness be 2.10 X 10" 7 m and suppose that 10 4 a particles strike
the foil per second. Equation (3-26) determines the number of target nuclei
encountered per unit beam area as
—
PNA
m
8 = (5.90 X 10
28
nuclei/m 3 )(2.10 X 10~ m)
7
22
1.24 X 10 nuclei/m 2 .
Our main objective is to predict the total number AA' of a particles scattered
backward by the foil per second. The number dN scattered per second into solid
angle dO, is found by using the known Rutherford cross section in conjunction
with Equations (3-27) and (3-29):
dN = nln da = -N da.
AA n do r ir n D 2
2k sin 8 40
dtt =
-ibackward ^o "" K/'iA
/)
J-n 16 sin
4
(0/2)
'
il
j it/2 sin
sin0 dd
An
r
I
cos{8/2)d6
t— —— = 477
f
/
J j 1/2
2du
3~
8 77
477.
(0/2) K/2 sin-*(^/2) U 2u'
using the change of variable ;/ = sin(^/2). Our final result for the fraction of
AA n D<
477" -(1.24 X 10 22 m" 2 )(4.29 X 10" 14
m) = 1.79 X 10" 5 .
A ()
A ()
16
We then find
AA = (10
4
a particles/s)(l.79 XlO~ 5 = 0.179a )
particles/s
for the predicted number of counts per second. This value is comparable with
the observations of Geiger and Marsden.
Rutherford's planetary model assigned the nucleus and the electrons to separate inner
and outer regions of the atom. The success of the model inspired Bohr to imagine a
corresponding separation of physical domains of influence, in which the electrons in
3-5 The Quantum Picture of the Atom 145
the atom accounted for the chemical properties of the element while the nucleus was
responsible for any radioactive behavior. This picture began to reveal the correlation
between the number of electrons in the atom and the location of the element in the
periodic table. The meaning of the atomic number finally emerged after the construc-
tion of Bohr's model of the atom.
The Rutherford atom evolved into the Bohr atom in 1913. The evolution of models
resulted from the introduction by Bohr of certain quantum concepts in a new
treatment of matter. This revolutionary contribution to the understanding of the atom
initiated the next phase in the development of the quantum theory.
Indications of quantum behavior in matter had been accumulating long before the
time of Rutherford and Bohr. The best source of evidence was to be found in the
spectra of light emitted by atoms. Investigations of atomic spectra began early in
the 19th century, after J. von Fraunhofer discovered the existence of dark lines in the
spectrum of light from the Sun. These studies gathered impetus when it was learned
that heated samples of the elements emitted light whose wavelengths appeared in
discrete patterns. Kirchhoff showed was characteristic of
that each observed spectrum
the chemical species contained in the source of the light. Atomic spectroscopy came
into being, as investigators proceeded to measure and tabulate the wavelengths of
light in the spectra of the various elements. Cesium and rubidium were among the
new elements identified with the aid of these spectroscopic methods. Atomic spectra
gave precise experimental information in the form of a unique signature for each of
the atoms. These measurements offered access to the internal structure of the atoms,
once it was learned how the different spectra were to be decoded. The appropriate
decoding principles were eventually established in the quantum theory of atoms.
Most of our knowledge of the properties of matter has come to us through some
branch of spectroscopy. The oldest spectroscopic techniques employ optical types of
apparatus. We can describe how these instruments are used to observe emission and
absorption spectra by referring to the schematic illustrations in Figure 3-11. An emission
spectrum is the result of the spectral analysis of light emitted by atoms in a gas
discharge tube. The atoms in the tube gain energy in collisions with electrons and lose
energy by emitting radiation. The figure shows how the light from the tube is directed
through a slit to a prism spectrometer and is dispersed by the prism into different
component wavelengths. We see the resulting emission spectrum as a series of discrete
spectral lines, corresponding to different colored images of the slit. The positions of the
lines in the image plane of the spectrometer are unique to the atoms in the discharge
tube. An absorption spectrum is formed when atoms are used to remove certain
wavelengths from the familiar continuous spectrum of white light. The figure shows
how the light from an incandescent source is directed through a slit and is then
allowed to traverse a gas-filled cell before entering the prism spectrometer. The
dispersion of the prism produces a continuous spectrum of colors due to the thermal
radiation from the source, accompanied by a superimposed set of discrete dark lines
associated with the intervening gas. These dark images of the slit are just like the dark
lines seen by Fraunhofer in his observations of the solar spectrum. The absence of lines
is attributed to the absorption of specific wavelengths of the incident white light by the
atoms in the gas. Thus, the absent wavelengths are unique characteristics of the atoms
in the gas-filled cell. We find that the bright lines in the emission spectrum occur in
the same positions as the dark lines in the absorption spectrum if the gas in the
discharge tube is the same as the gas in the absorption cell.
Optical line spectra were collected, without further analysis or interpretation,
through much of the 19th century. The first successful empirical understanding of the
1 —
146 Introduction to the Atom
Figure 3-1
Spectrometers for the study of emission spectra and absorption spectra. Focusing lenses are also
needed as additional elements in the two optical systems.
Bright spectral lines
Continuous spectrum
observed lines was achieved in 1885 in the case of hydrogen, the simplest atom. J. J.
Balmer showed that the visible spectrum of wavelengths in hydrogen could be fitted
by the formula
X = (3645.6 X 10" 10
m)— w = 3,4,.. (3-33)
His fit applied to the so-called Balmer series of lines in which the integers n = 3,4,...
corresponded to the spectral labels a,/?,... shown in Figure 3-12. The measured
3-5 The Quantum Picture of the Atom 147
Figure 3-1
Balmer series of spectral lines in hydrogen. The observed wavelengths are in good agreement
with the indicated predictions from Bohr's model for the hydrogen atom.
— o <T> O CTi
id id oo
ro
o
-:
jo
ir> 00 g>
Nanometers
Series limit
Wavelength
1 / 1 1
- =R,
and then interpreted his expression as a special case of the following more general
formula for hydrogen:
1 1 1
= Rr (3-34)
Rydberg's analysis reproduced the Balmer series for n' — 2, with a value for the
constant given by RH = 10972160 m '. This parameter has since been determined
with very great accuracy and has been named the Rydberg constant. Rydberg's
formula defined a double sequence in the two integers n and n', and so the
introduction of the new index ri raised the possibility of other as-yet unknown series of
found in the ultraviolet region by T. Lyman in 1914. Other series were also revealed
later on, in the infrared spectrum and beyond. Thus, the empirical results of Balmer
and Rydberg successfully broke the spectral code for hydrogen and remained to be
properly explained in some more comprehensive theoretical framework.
It had become apparent by 1913 that the planetary model of the atom was in
Bohr solved the problem of radiative instability by making a break with the
principles of classical physics. His proposed solution contributed another new idea to
the developing quantum theory, in the same spirit as the radical proposals put
forward by Planck and Einstein in the previous decade. Bohr believed in the nuclear
model of the atom but questioned whether classical electrodynamics should always
apply to the behavior of electrons. He began to entertain the possibility of a quantum
approach, thinking that Planck's constant h should have a natural place in the
description of the atom. He noted that the classical picture could not explain why all
constructed from the physical constants of the problem unless h was included along
with the classical parameters. It was observed that the product of factors
2 2
(47TE /e )(h /m e ) gave an appropriate unit of length and an approximate order of
magnitude to use as a practical scale of length for the atom.
Bohr speculated that electron motion in the atom should have quantized properties
in which only certain states of orbital motion were allowed. He proposed a new
quantum condition by requiring discrete values of the energy for these allowed states.
He could then show that his requirement was consistent with the observation of
discrete wavelengths in the spectra of atoms. His idea was partially inspired by the
form of the Rydberg formula, Equation (3-34), in which the wavelengths in hydrogen
were expressed as differences of pairs of discrete-valued quantities.
The Bohr theory of the atom is based on the following two postulates. First, the
electrons in the atom may exist in a discrete set of stationary states of definite energy,
defined so that radiation is not emitted by an atom in such a state. Second, the atom
may undergo a nonclassical transition from one of these allowed states to another and
thereby emit or absorb a single quantum of electromagnetic radiation. The new
concept of quantization of energy in matter originates in these two propositions.
We can interpret Bohr's postulates with the aid of the illustrations in Figure 3-13.
The proposed set of stationary states is represented by a sequence of discrete levels of
increasing energy in an energylevel diagram. We can then exhibit upward or downward
Figure 3-13
Energy level diagram for the stationary states of an atom. A transition to a state of higher or
lower energy is accompanied by the absorption or emission of a photon.
En'
3-6 The Bohr Model of the One-Electron Atom 149
photon, and a downward transition occurs with the emission of a photon. In either
casewe obtain Bohr's formula for the energy of the photon by applying conservation
of energy in the overall system of atom plus radiation to get
hv = j = En - E„,. (3-35)
Only quantized energies appear in this expression, and so only certain allowed
wavelengths are predicted for the emission and absorption of radiation.
Bohr's postulated energy levels gave a natural explanation for the discrete char-
acteristics of atomic line spectra. His theory also offered a solution-by-decree for the
problem of atomic instability. The existence of stationary states and quantized energy
levels was simply declared, while the applicability of classical radiation theory was
simply denied, and no theoretical arguments were put forward to support these new
ideas. This treatment of the atom associated the phenomenon of atomic radiation with
the possibility of transitions between pairs of states, and not with the acceleration of
electrons in orbits. Consequently, radiation frequencies were not to be identified with
frequencies of orbital electron motion in the atom. We should note that the theory did
not say what the atom was doing while the radiative transition was in progress. This
classical question was at variance with the new quantum point of view and was not
supposed to have an immediate answer.
Example
We have suggested the following natural scale of length in the context of Bohr's
quantum treatment of the atom:
2 2 2
—
477e
t,
2
h
= —
4ire
7,
2
(he)
7T
2
= -,
(l240eV-nm)
r, b z r
= 2.090 nm.
e m r
e m t
c ( 1 .440 eV •
nm)(0.51 10 X 10 eV)
(Note that the electron mass is given in MeV/t 2 units in this calculation.) We
learn in Section 3-6 how a scale of length proportional to the suggested quantity
arises naturally in Bohr's model of the hydrogen atom.
Bohr's theory of the atom proposed the existence of stationary states; however, the
procedures needed to determine these states were not discovered until 1925. In the
meantime it was possible to make progress in quantum physics by judiciously blending
new quantum ideas with certain concepts carried over from classical mechanics.
Bohr
took such an approach in hydrogen atom.
1913 to develop his model of the
Our treatment of Bohr's model generalizes the case of hydrogen to cover any
system of two bodies bound together by Coulomb attraction. We let the charges and
masses be denoted as — e and m for the one particle, and +Ze and for the other. M
Our intention is to construct a model ofan arbitrary bound system containing one
electron, and so we intend to set m = m, and let M
range over a variety of choices.
The special case of the hydrogen atom is obtained if we take Z= 1 and choose M to
be the proton mass M p
.Arbitrary one-electron ions can also be described by choosing
150 Introduction to the Atom
Figure 3-14
other integer values of and other larger values of M. We restrict the discussion to
Z
and we take account of the motion of both particles. Of
particles in circular orbits,
course, we expect that the mass M
should remain at rest in the limit M/m -» oo, and
we know that the hydrogen atom is close to this limiting situation since the proton-to-
electron mass ratio is quite large:
M =
— P
1836.
m.
The small effect of a finite nuclear mass M is detectable in atoms and should
therefore be retained as a small correction. Allowance should be made for finite M
also as a matter of principle, because the properties of two masses in motion are of
some interest and because the effects can be incorporated without difficulty.
We take the center of mass of the two-body system to be at rest at the origin and
define position vectors for the two particles as in Figure 3-14. Our first objective is to
reduce the two-body problem to an equivalent one-body description in terms of a
single vector variable given by the relative coordinate
r = r, — r2 . (3-36)
The origin is located at the center of mass so that r, and r9 must satisfy the relation
wr, + Mx 2 = 0. (3-37)
We can solve this pair of equations to obtain the particle position vectors
=
M =
m
r, r and r2 r (3-38)
M+m M+m
3-6 The Bohr Model of the One-Electron Atom 151
=
M =
m
v and v,2 v. (3-39)
v,1
M+m M+m
The relative velocity is defined as
v = -
dx
(3.40)
m M
— m I M \
2
M m
— /mM \
2 1
K= ,2
-vi + i _,.2
v
2
= - I \
v
d
?
+
.
M + m] v- i
M+m
\ 2
»
2 2 2\M+ mj 2 \ 2
K= l
2
fiir (3-41)
mM
M=T7—
M+m (3-42)
This mass parameter also appears along with the relative dynamical variables in the
formula for the angular momentum vector L. The special problem at hand involves
two particles in circular orbits. We recall that the total angular momentum is a
constant of the motion because the force of Coulomb attraction is a central force. The
magnitude of the conserved total angular momentum is found by using Equations
(3-38)and (3-39) to get
L = mv.r, + Mv2 r = m\
(MY 2
vr + M\
i m \ 2
vr
11 "
\ M+m J \ M + mj
or
L = fivr (3-43)
as the final result. We can put the equations to further use and express the centripetal
force on each particle as
mv
F=— r,
2
L = ^
Mv 2
r
2
=
txv
^—
r
2
(3-44)
The potential energy for the system of two charges is given by the Coulomb formula
1 Ze 2
V=-~477£ r
.
(3-45)
152 Introduction to the Atom
We note that this final dynamical quantity is expressed directly in terms of the relative
variable r.
E= K+ V
and consult Equations (3-41) and (3-45) to find
1 1 Ze 2
E= —jxv
2
— .
2 47re r
The centripetal force in Equation (3-44) is identified with the attractive Coulomb
force as
v 2
—
ixv
r
2
= 7
4we
1
1 Ze
r
f. (3-46)
1 \ Ze 2 1 Ze 2
E= --1
/
= . (3-47)
\ 2 / 477-e r 2 497C r
Note that E is a negative quantity because V is negative and because the potential
energy is larger in magnitude than the kinetic energy whenever the energies pertain to
a bound system.
The construction of the model has adhered to classical principles up to this point.
Bohr followed these steps to the same point and then introduced a new quantum
hypothesis in order to produce the desired stationary states. He argued that the orbits
in the atom should be discrete and that an allowed classical orbit was one whose
angular momentum L was an integral multiple of the quantum unit h/ltr. This basic
quantity was recognized to have dimensions of angular momentum and was found to
recur sufficiently often in quantum physics to warrant a special symbol, given by the
current value
h= —
h
2m
= 1
34
05457266 X 1(T J J •
s.
3-6 The Bohr Model ol the One-Electron Atom 153
Bohr's quantum condition restricted the angular momentum of the atomic system to
quantized units of h according to the formula
Thus, an integer n was introduced as a quantum number through the new concept of
quantization of angular momentum. Justification for this ad hoc rule came later in another
of the early developments in the old quantum theory.
We adopt Bohr's hypothesis and combine the classical expression in Equation
(3-46) with the quantum statement in Equation (3-48) to find
2*2
l
n h
v r
2 2 =
2
477€ () p /i
This remarkable result implies that the orbit radius r is a discrete quantity whose
allowed values are given in terms of fundamental constants as
n Arre^h'
r. = (3-49)
Z e~n
n m
a (3-50)
Z JU.
by defining
477£ /r
3-51
The important parameter a is approximately equal to 0.05 nm and is called the Bohr
radius. This quantity appears in Equation (3-50) as a unit of length and serves as a
useful scale of length for all problems in atomic physics.
The energies in the various orbits must be quantized since the orbit radii are
discrete. The prediction of these energies is the main conclusion of the Bohr model.
We obtain discrete values for E by inserting Equation (3-49) into Equation (3-47):
1
Ze Ze Z e
2
u
E=-~
1 1
2 4i7e r
n
2 477e n 47re /r
En = ~ — ~E
m
n~
(3-52)
e
The parameter E is significant because it sets the scale for the energy levels in the
hydrogen atom. We note that E can also be expressed as
2 2
a2
E° =
9
I
1 /
\
i
47TE —r e
nc
\
J
m '
c
'
2 =
v^
2
f2 ' (3 " 54)
where the final form involves the electron rest energy along with a new constant given
by
2
e
a=-47re -. (3-55)
«c
E = 13.6056981 eV
and
1
- = 137.0359895.
a
4ire () hc h 1 h
an = (3-56)
This observation tells us that the Bohr radius and the electron's Compton wavelength
h/m c are related by
t
a rather interesting numerical quantity.
The reduced mass jx appears explicitly in the formulas for r
n
and E n
via the ratio
\i./m t . This reduced-mass correction factor is very close to unity in almost all
H M
m t
M+m e
factor [J./m r is called the isotope effect. The ratio n/m e becomes rather different from
unity in the special case of positronium. We describe this electron-positron bound
system in the context of the Bohr model by taking Z = 1 and setting = m e to get M
jj./m r = £. Obviously, the reduced-mass correction has an appreciable effect on the
energy levels in this instance.
Bohr's quantization condition for angular momentum has led us to discrete orbits
and quantized The corresponding allowed states of motion in
energies. the Bohr atom
are labeled by means of the single quantum number n, whose origin is traced directly
3-6 The Bohr Model of the One-Electron Atom 155
to Equation (3-48). Quantization of energy presents the main results of the model in
terms of a system of energy levels ordered according to the values of n. Let us choose
Z= 1 Equation (3-52) so that we can visualize these results for the special case of
in
the hydrogen atom. The energy level diagram is shown in Figure 3-15 to consist of an
infinite sequence of values of E n as n ranges over all positive integers. The system has
n = l and E =
x
E = - 13.60 eV.
Equation (3-50) gives the radius for the first (and smallest) Bohr orbit as
r, = — a .
Larger values of n refer to excited states with higher energies and greater orbit radii.
The limit n —» oo defines a special limiting state of the system in which
The two bound particles have infinite separation and no motion in this state.
Therefore, the limiting value of n corresponds to the configuration of the atom at the
threshold for ionization. The system is allowed to have all values of the energy above
this threshold; however, the associated states pertain to unbound particles in relative
motion and are not interpreted as states of the atom. The ionization energy is de-
termined by the difference in energy between the ground state and the ionization
threshold:
£, = —
m.
E = 13.60 eV.
This quantity represents the minimum energy that the atom must absorb in its natural
state in order to release the bound electron. The absorbed energy can be supplied in
the form of an incident photon, and so the ionization energy is equal to the work
function for the photoelectric effect in hydrogen.
Bohr's model is not based on a consistent set of quantum principles and does not
qualify as a genuine quantum theory. The model is remarkable nevertheless, because
the results provide correct predictions for the energy levels of the one-electron atom.
We can verify these predictions by examining the wavelengths in the emission
spectrum of the atom. Let us consider a radiative transition from an excited initial
state i to a less energetic final state /, and let us label the states by the quantum
numbers n t
and n
f
as in Figure 3-16. We adapt Bohr's basic formula in Equation
(3-35) to the notation in the figure by writing
he
hv = — = E,- E
f
. (3-57)
The energies are obtained from Equation (3-52), and so the wavelengths are given by
>56 Introduction to the Atom
Energy levels in Bohr's model of the hydrogen Emission of a photon in the transition ;
hi
-(EQ /n 2 )(fi/m t ),
where E is the Rydberg
energy unit and fi/m the reduced-mass
correction factor.
r
is
V
-0.85
-1.51
-340
13.60-
the expression
1 E-E f
1
H E 1 1
Z
X he \"i n
l
-=z>-r\-- -
I fx / 1 1
(3-58)
2
lit
if we define
a" mc
/?- = (3-59)
he 2 h
using Equation (3-54). The result in Equation (3-58) is essentially the same as
Rydberg's empirical formula. We car. make the identification with Equation (3-34) if
3-6 The Bohr Model of the One-Electron Atom 157
*H = R,
Thus, it is evident that the energy levels are correctly given by Bohr's model since the
predicted transitions are in agreement with the observed spectral lines. Both RH and
R are known with very great accuracy from experiment; the latter constant has the
current quoted value
R^ = 10973731.534 m-'.
(The notation can be understood with the aid of Equation (3-42); R^ denotes the
Rydberg constant for an atom with a nucleus of infinite mass M.)
Equation (3-58) describes families of spectral lines according to the various series
Figure 3-17
Transitions in the emission spectrum of hydrogen. The spectral lines are grouped into series
according to values of the quantum number. The final state of the atom has n = 1 in the
final
Lyman series, n = 2 in the Balmer series, n = 3 in the Paschen series, and so on.
• • •
-0.85 -4
-1.51
r 3
f Paschen
-3.40
Balmer
En (eV)
A, n
-1360
Lyman
158 Introduction to the Atom
Figure 3-18
that Bohr's theoretical model predates the discovery of all the known series in
hydrogen except for the Balmer lines in the visible spectrum and the Paschen lines in
the infrared.
Let us now go back and look more critically at Figure 3-16 and Equation (3-57). If
the figure, then conservation of momentum requires the photon and the final atom to
have opposite momenta of the same magnitude p. We denote the energy difference
between the two states of the atom as
A£ = E, - E .
This quantity is the same as the difference in rest energies for the initial and final
atom. The kinetic energy of the recoiling atom given by p 2 /2M, where is the
is M
mass of the atom. Conservation of energy provides the determining relation among all
these quantities:
E =
i
hv + Ef +—.
P
(3-60)
hv
P= — c
,
(hv)
hv + E,+
J/
2Mc 2
or
The desired solution for the photon energy is given by the root
2A£\'/ 2
hv = -Mc 2 + i/M'-V + 2Mc 2 lE = Mc 2 1 +
Mc'
36 The Bohr Model of the One-Electron Atom 159
The ratio 2 b.E/Mc 2 is certainly very small if the energy difference A£ is in the
eV-to-keV range. Therefore, an expansion can be used to get a very good approxima-
tion:
1 2A£ 1 / 2A£ HE
hv = Mc' 1 + A£j 1
5 |. (3-61
2 Mc' 8 Mc 2Mc 2 '
This result tells us that Equation (3-57) should be modified by the correction factor
(1 — AE/2Mc 2 ). We conclude that the effect of recoil is negligible whenever the
transition energy A£ much smaller
is than the rest energy of the radiating system.
The Bohr model was a success for its time even though its limitations were
immediately apparent. The treatment could be generalized to noncircular orbits, but
the model could not be extended successfully to atoms with more than one electron.
The wavelengths in the spectrum of the hydrogen-like atoms could be calculated, but
the intensities of the spectral lines could not be predicted. All these deficiencies were to
be remedied in a more comprehensive quantum theory of matter and radiation.
Bohr regarded his model as a tentative blend of classical and quantum ideas. He
recognized that classical physics must have its domain of applicability, and he
believed that results from the quantum theory should merge into a correspondence
with classical predictions. Bohr formulated a correspondence principle in which he argued
that nature should be described by a quantum theory and that the theory should have
a limiting regime in which classical physics would become valid. He treated Planck's
constant as a small unique unit of quantum behavior and proposed that quantum
mechanics should somehow reduce to classical mechanics in the limit h —» 0. Oper-
ational evidence for such a limit was supposed to appear as a convergence of energy
levels for large values of the relevant quantum number.
Bohr's model of the one-electron atom provides an excellent illustration of the
correspondence principle in operation. Let us return to Equation (3-57) and consider
the frequency of the radiation emitted in the transition i -» /. Equations (3-52) and
(3-54) are used to obtain
E, — E (
a En j 1 1 \ a a 2 m,c 2 I 1 1
'
J_ >y 2 I — 72 '
2 2 2
h m e
h I n , n
J
m e
2h I nj n
Z
— (XC Z72 m Z7 2 am/
2 2
r _ _
v„
= n
= _ _ J
jit ac e
c ju
2 3
m m
2irrn
2tt—
n
Z
— ju
e
a
n e
2tt h
! 66 Introduction to the Atom
To obtain this expression we recall Equations (3-50) and (3-56) for the quantized
radius r
n , and we employ a result for the quantized orbit speed vn as quoted in
also
Problem 14 at the end of the chapter. We know that the frequency of the emitted
radiation is not to be identified directly with the frequency of orbital motion.
However, these last two calculations demonstrate the equality of vn and fn for large
values of n, as required by Bohr's correspondence principle.
Example
Let us begin by computing all the basic parameters that arise in the Bohr model.
Equation (3-55) gives the fine structure constant:
2
e 2t7 1 .440 eV •
nm 1
-277 =
47T£ he 1240 eV •
nm 137.0
Equation (3-56) gives the Bohr radius in terms of the electron's Compton
wavelength:
1 h 137.0
a = = (0.002427 nm) = 0.05292 nm.
2770! m t
C 277
6
0.5110 X 10 eV
£,, = — mc 2 = 13.61 eV.
2(137.0)'
D _
Ea _ 13.61 eV
00
= 0.01098 nm"
he 1240 eV •
nm
Next, let us choose Z= 1 and calculate the energy levels of hydrogen using
Equation (3-52). The reduced-mass correction for hydrogen can almost be
ignored since
/' M. 1836
/// M p
+ m t
1837
"
1 13.60"
1836 1/4 3.40
(13.61 eV) eV,
1837 1/9 1.51
1/16 0.85
as indicated in the energy level diagrams in Figures 3-15 and 3-17. Finally, let
us look at some of the hydrogen wavelengths that follow from Equation (3-58).
3-7 Characteristic X Rays 161
HW 1837
I 1
il"
( 1 91.12 nm
1836(0.01098 nm ')
1/»J
- l/»?
I"/
In the Lyman series we have w, = 1, and so we find the longest wavelength for
n, = 2:
'
91.12 nm 4
A = = -(91.12 nm) = 121.5 nm,
1-1/4 3
36/5 656.1 3
16/3 486.0 1
9/2 410.0 6
196/45 396.9 7
These wavelengths are included in Figure 3-12; all the values are within 0.1 nm
of Balmer's original results. Each series has a shortest wavelength that occurs in
the series limit as n —* oo. The limiting wavelengths are found from the formula
l
tn.
All higher series have their limits in the infrared and beyond.
Figure 3-19
Orbital model of atomic excitation and x-ray emission. The collision of a beam electron with a
target atom causes an atomic electron to be ejected from an inner orbit. Another atomic
electron fills the vacancy and emits an x-ray photon. The deexcitation of the atom occurs
between Bohr orbits labeled by the quantum numbers n, and n,.
The discrete lines are attributed in either case to transitions between the quantized
energy levels of the corresponding atom. X-ray spectral lines have short wavelengths
of nanometer order, and so x-ray transition energies of the atom are in the keV range.
The large energy required to excite the atom is supplied by the electron beam in the
high-voltage tube.
X-ray spectra are associated with complex atoms containing many electrons. This
apparent complication is not a serious analytical problem in the x-ray regime because
the excitation energies are large enough to remove a tightly bound electron from an
inner orbit near the nucleus of the atom. In this circumstance the emitted x rays are
assumed to result from the transitions of a single electron while the other electrons are
regarded as spectators. Bohr's simple one-electron model is quite useful in such a
situation.
Let us visualize the x-ray excitation and emission processes in the many-electron
atom in terms of a system of independent electrons occupying a collection of
quantized Bohr orbits. We label the orbits allowed for an individual electron by the
familiar quantum number n, in the manner of Equations (3-50) and (3-52). The
electrons of interest for the excitation process are those in the innermost orbits of the
atom, the so-called K and L orbits, where the quantum numbers are given by n = 1
and n — 2. Figure 3-19 shows a model of such a system in which the atomic electrons
are distributed over several different Bohr orbits.The figure also shows how the atom
becomes excited when the system an energetic incident electron, as in the
is struck by
bombardment of the target in an x-ray tube. The collision causes the ejection of an
atomic electron from an inner orbit, so that the resulting singly ionized atom is left in
a highly excited state. The ion deexcites when one of the remaining electrons makes a
quantum jump from an outer orbit and fills the vacancy left by the ejected electron.
This transition to a state of lower energy is accompanied by the emission of an x-ray
photon. The so-called K and L series of x rays result from all such electron transitions
to the n = 1 and n = 2 inner orbits of the ion.
The first comprehensive study of characteristic x rays was conducted in 1913 by
H. G. J. Moseley. He investigated the K and L spectra of many of the elements in the
3-7 Characteristic X Rays 163
periodic table and based his analysis on the notion of inner-electron behavior in the
context of Bohr's model. His survey of the elements revealed regularities that finally
established the identity between the nuclear charge index Z and the atomic number
in the periodic table.
Let us show how the Bohr model is applied in Moseley's analysis. We refer to
Figure 3-19 and consider the transition of orbits for a single electron from n t
to n,,
ignoring all the other electrons in the atom. The wavelength of the emitted x ray is
given in terms of these quantum numbers by Equation (3-58). Let us rewrite the
formula as an expression for the x-ray frequency:
v = Zl {
/
cR\- -
i
—M . (3-62)
Z =
eli
Z-zf . (3-63)
It is expected that z, should increase with the final quantum number n,, because the
final orbit radius increases with n, and because the screening is greater for transitions
to larger final orbits. Equation (3-63) may be inserted into Equation (3-62), and the
result can be rearranged to give the following expression for Z eff
:
fv
z =
!
cR(\/n)- \/n])
fv
with n { = 2,3,..., (3-64)
/<tf(l - 1/n?)
fv
Z= zL +^= with n,.= 3,4,5,..., 3-65)
when we choose «y = 2 and consider the L series. The equations predict linear
relations between Z and f ,
with different intercepts and different sets of slopes in
each series. The K series includes Ka and K^ spectral lines for n t
= 2 and 3, with
corresponding slopes ^4/3cR and ^9/8cR . The L series includes La , Lp, and L y
lines for n t
= 3, 4, and 5, with slopes ^36/5 cR ^16/3^, and fQQ/2UR. Figure ,
3-20 shows sets of Z versus f graphs whose linear behavior reflects these predictions
of the one-electron model.
164 Introduction to the Atom
Figure 3-20
Graphs of the atomic number versus the square root of the x-ray frequency in Moseley's
analysis.
Aii
Moscley was able to change targets in his x-ray tube and observe the frequencies of
x rays for more than 40 of the elements between aluminum and gold in the periodic
table. He treated Z as the atomic number for each of the elements in the table and
plotted his observations in the manner of Figure 3-20. The linear relations between Z
and yp were confirmed, and the slopes were found to agree with expectations from the
Bohr model for the K a and L a lines.Approximate values were also deduced for the
intercepts of the straight lines in the two series:
= 1 and = 7.4.
that the atomic number, the nuclear charge index, and the number of electrons in the
neutral atom were all given by the same quantity Z.
Figure 3-19 can be used to describe two other related phenomena. Both of these
possible processes involve the ejection of two atomic electrons instead of the one shown
in the figure. The primary types of x-ray lines are produced in the transitions of singly
ionized atoms, as discussed above. Doubly ionized atoms can also be formed when
electrons collide with atoms in the target of an x-ray tube. The orbit structure in the
resulting ion is somewhat different from the singly ionized case, and so the radiative
transitions in this system produce x rays with correspondingly different frequencies.
These x rays appear in the spectrum as secondary satellites to the primary spectral
lines. The ejection of a second electron can also occur in an entirely different
radiationless process, first observed by P. Auger in 1925. The phenomenon, known as
the Auger effect, is initiated by the excitation of the atom to a singly ionized
configuration containing the usual sort of inner-electron vacancy. The ion does not
deexcite by photon emission, however; instead, the system spontaneously ejects
another electron and becomes a doubly ionized atom. In such a process the second
3-8 Atomic Processes and the Excitation ot Atoms 165
K= £<" - Ef\
where the notation for the initial and final energies refers to the singly and doubly
ionized configurations. Thus, the system in the figure releases energy in the transition
n t
—*and the transition energy is immediately absorbed through an internal
n,,
conversionmechanism in which the second electron is detached and no radiation is
produced. The emission of an Auger electron is also called autoiomzation; this effect is
generally observed in competition with the emission of an x ray.
Example
Z-z K =
(3/4)cA> V 3AA>
= 29.00.
3(1.445 X 10" 10 m)(l.097 X 10
7
m" 1
The intercept for the A' series is given empirically as zK = 1. Hence, our
calculation tells us that Z should be equal to 30, in excellent agreement with the
known atomic number of Zn.
Figure 3-21
(d) Fluorescence
Ik
'
The incident photonmay have an energy large enough to cause a transition of the
atom an excited state. This possibility is illustrated in part (b) of the figure. The
to
process is an example of inelastic scattering since the radiation loses energy to excite
the atom. The energy of the scattered photon is given by
where AE is the indicated excitation energy between the excited state and the ground
state. Each excited state of the system corresponds uniquely to an observable shift in
the frequency of the scattered radiation. Hence, a measurement of the shifts in
frequency constitutes a determination of the energy levels of the given scatterer. This
procedure is used extensively in molecular spectroscopy. The process is called Raman
scattering after C. V. Raman, discoverer of the effect in 1928.
The energy of the incident photon may happen to be equal to one of the excitation
energies of the atom. The fundamental absorption and emission processes of Figure
3-13 can occur in tandem at this energy, as indicated in part (c) of the figure. Elastic
scattering is the result since the initial and final photons have the same energy hv. The
phenomenon is called resonance radiation because the effect is similar to the resonant
behavior of a driven harmonic oscillator. The photon energy matches the energy
required to excite the atom, and so the cross section for elastic photon scattering is
The figure shows a situation in part ( d ) where enough energy is absorbed to excite
the atom into one of the higher-energy states. Deexcitation may then proceed in a
sequence of downward transitions, accompanied by a cascade of emitted photons. This
phenomenon is known as fluorescence. A common example occurs when an atom
absorbs ultraviolet light and deexcites by emitting several wavelengths of visible light.
The figure illustrates the effect by showing two fluorescent photons hv[ and hv. ,,
1
along
with the inelastically scattered photon hv'
Part (e) presents the photoelectric effect as the absorption of an incident photon
accompanied by the ejection of a bound electron. Note that the excitation produces an
ionized system with energy A' above the ionization level. Note also that the work
function of the atom is equal to the difference in energy between the ionization state
and the ground state.
Part shows the Compton effect for the atom in its ground state. The energy of
( / )
the incident photon is large enough to eject a bound electron and produce outgoing
radiation of longer wavelength. The energies in the figure obey the familiar Compton
relation hv — hv' = K.
The excitation of an atom does not always have to be a radiation-induced process.
Figure 3-22 shows how an atom may gain energy and become excited in a collision
with another We
have already spoken of excitations by collision in our
particle.
remarks about the gas atoms in a discharge tube and the target atoms in an x-ray
tube. Thecolliding particle is an electron in these applications and in the two
situations shown in the figure. We let the incident electron have kinetic energy K as
indicated, and we consider two possibilities for the behavior of the atom. The recoil of
the massive atom is again assumed to have a negligible effect in each case. The upper
part of the figure shows an elastic collision, where the atom remains in its ground state
and the electron scatters without change in kinetic energy. Inelastic scattering can also
take place if the initial kinetic energy is large enough to raise the atom into an excited
state, as in the lower part of the figure. In this case the scattered electron has less
168 Introduction to the Atom
Figure 3-22
Elastic scattering
\ <&"
nelastic scattering
We note that Equations (3-66) and (3-67) refer to parallel processes of excitation
whose net effectson the atom are exactly the same. The excited atom may give up
energy in either case by the emission of radiation.
The first measurements of collisional excitation in atoms were made in 1914 by J.
Franck and G. L. Hertz. Their experiment studied a current of electrons in a tube
containing mercury vapor and revealed an abrupt change in the current at a certain
critical value of the applied voltage. They were able to interpret this observation as
Figure 3-23
Cathode Anode
Current
4.9 V
Accelerating
voltage
anode current is observed to grow with the accelerating voltage until the correspond-
ing value of K becomes equal to the mercury excitation energy A£. An abrupt
reduction in current occurs, signaling the onset of inelastic scattering, when the
voltage reaches this critical value. The sudden drop in the observed current is
Example
The critical voltage for the Franck-Hertz experiment in mercury is given as 4.9
V. The wavelength of ultraviolet light from mercury atoms in the Franck-Hertz
tube is quoted as A = 253.67 ran. Let us verify the consistency of these two
measurements. The energy of the emitted ultraviolet photon is the same as the
excitation energy £±E, so that A and A£ are related by
//,
A£ =
A' = kE.
he 1240eV-nm
K = 4.888 eV.
253.7 nm
Cu „ n i
-
(A + E
Parameter A n, "
E,
Absorption
Parameter C
Stimulated emission
Parameter B
Let us reconsider radiative transitions of the kind described in Figure 3-21 so that we
can bring together three basic types of photon-atom interaction. The processes of
interest are illustrated diagrammatically in Figure 3-24. In each case we are concerned
with a photon energy hv that exactly matches the difference in energy AE between
two particular atomic energy levels. We continue our practice of the previous section
and assume that the photon momentum is not large enough to cause appreciable
recoil of the atom.
The first two diagrams in the figure show the related phenomena of spontaneous
emission and absorption. We let A* denote an excited state of atom A and express the
two processes as
A* -> A + y and y + A -» A*.
y + A*^>A + y + y,
in which an incident photon induces the deexcitation of the atom from a higher to a
lower energy state. The transition mechanism is such that the triggering photon and
the emitted photon occur together, necessarily with the same frequency v, in the final
3-9 The Laser 171
state of the system. This simultaneous emergence of two photons from the deexcited
atom corresponds to the emission of coherent radiation in which two electromagnetic
waves are generated in phase. The emitted photons can proceed to interact in like
manner throughout a medium containing other excited atoms of the same species A.
The resulting cascade of photon-doubling transitions over the whole medium produces
monochromatic waves with the property of coherence on a large scale. This amplifica-
tion effect is the basis for the operation of the laser. (The word laser is an acronym for
light amplification by the stimulated emission of radiation.)
and n 2 The population of the upper level is fed by the absorption process, so that the
.
indicated growth rate Cu v n is proportional to the number of atoms in the lower level
x
the lower level grows by spontaneous emission at a rate An 2 and also grows by
stimulated emission at a rate Bu^n.,. Both of these growth rates are proportional to the
number of atoms in the upper level. Notice, however, that the rate of spontaneous
emission does not contain the spectral energy density as a factor because this effect
does not involve the presence of the stimulating radiation field. The three constants A,
B, and C are called Einstein coefficients. These quantities have been assigned in Figure
3-24 to parameterize the quantum behavior of the three basic processes.
The growth and decay of the upper and lower populations are evidently described
by the rate equations
dn 2
= Cu„n - (A + Bu v )n 2
l
(3-68a)
~d~t
and
'hi
]
= {A + Bu v )n 2 - Cu v n v (3-68b)
~dt
The numbers «, and n 2 must approach constant values after a sufficient length of
time. Hence, the equations imply a relation between the final populations:
(A + Bu v )n 2 = Cu v n v (3-69)
E /kB T
x
(3-70)
.
where hv = E 2 — E v An expression for the spectral energy density results from these
last two observations:
hv/h » T
A + Bu= Cu„e
hv/k » T - B
Ce
Let us now assume that the atoms are in the presence of radiation characteristic of a
blackbody field and recall the Planck formula from Equation (2-37):
8w v 2 hv
u.. =
3 hv/k B T _ j
•
: g
We can then deduce the following connections between the parameters A, B, and C
by comparing the two expressions for «,,:
C=B and
A
-
B
=
877Af
c
— (3-71)
excited atoms down to the level E2 . This state is described as metastable since its
properties are such as to inhibit spontaneous decay back to the ground state. An
incident photon of energy hv = E2 — E can {
then stimulate the desired laser transi-
tion to the level E x
.
The ruby laser is an example of such a three-level system. Green light from a flash
lamp pumps chromium ions in the ruby crystal to an excited level, and nonradiative
deexcitation takes the ions promptly to a long-lived state at slightly lower energy.
Stimulated emission then follows, generating a coherent beam of red (694 nm) output
light.
Laser start-up cannot proceed in the three-level system until the ground-state
population has been reduced by more than half. Large input pumping power is
needed to achieve this condition. Further inefficiency results from the fact that
ground-state atoms in the medium are likely to absorb the radiation emitted by other
atoms. These problems do not arise in the four-level laser because the device is
3-9 The Laser 173
Three-level laser with a pumping mechanism Four-level laser with a population inversion
for populating level E 2
at the expense of level between levels E3 and E2 .
E 3
—
Metastable level
Pumping
transition hv
designed so that the ground state cannot participate in the laser transition. Figure 3-27
shows a possible level scheme with such a property. Pumping raises atoms from the
ground state to level £4
and the rapid population
, of level £3 follows by spontaneous
emission. Laser transitions are stimulated from E 3
to E2 by photons of energy
hv = E 3 — E2 The . population inversion between these two levels is easily maintained
as atoms at E2 undergo rapid spontaneous decay to the ground state. It should also be
clear from the figure that atoms in the ground state are not able to absorb laser light
at the frequency v.
The He-Ne gas laser is a familiar laboratory instrument with the performance
properties of the four-level scheme shown in Figure 3-28. The active medium for this
device is typically a 10 : 1 mixture of helium and neon gases at low pressure (of order
1 mm
Hg). An electrical discharge pumps helium atoms to an excited state, which
happens to have approximately the same energy above the ground state as one of the
upper levels in neon. A resonant transfer of excitation energy occurs between helium
Figure 3-28
Pumping
transition
174 Introduction to the Atom
Figure 3-29
T^
S
Gaseous
medium Laser beam
Resonating Enclosure
Mirror Partially
transparent
mirror
He* + Ne -» He + Ne*.
The Ne* level then deexcites through a photon-induced laser transition, as indicated
qualitatively in the figure. Helium is used for the pumping phase because the lowest
excited states of the helium atom are uniquely situated for the collisional excitation of
specific neon levels. A discharge in pure neon would pump atoms less selectively and
populate too many different neon states. The He-Ne laser generates a red (633 nm)
output beam as one of several monochromatic possibilities. The beam can deliver
quite large amounts of power per unit area in a very narrow wavelength interval.
A resonating cavity of suitable length is an essential feature of the typical laser.
Figure 3-29 shows how the resonant characteristics of a gas laser are achieved by-
means of parallel plane mirrors at the ends of the gas enclosure. The separation of the
mirrors is designed to store radiation whose frequency corresponds to the desired laser
transition. The reflection system can accomplish a well-defined separation of frequen-
cies even though the length of the resonating cavity accommodates many closely
spaced optical modes. One of the two mirrors is partially transparent and transmits a
portion of the repeatedly reflected light to give the output of the laser. The storage
system also tends to eliminate photons whose directions are not parallel to the axis of
the resonator. As a result the long dimension of the cavity determines a precise
directionality for the output beam.
All the properties of monochromaticity, coherence, intensity, and directionality are
attainable at extraordinary levels in the laser. Many unique applications have been
discovered for the use of laser beams. These include the modulation and signal-carry-
ing capability of noise-free light in laser communication systems, the accurate de-
termination of very large distances (e.g., from Earth to Moon) by the reflection of
laser light, the investigation of multiphoton interactions with atoms using the methods
of laser spectroscopy, and the proposed fusion of nuclei in samples compressed by laser
radiation.
We have already noted that the theory of the laser began with Einstein's analysis of
stimulated emission. These basic ideas were eventually put into use almost four
decades later. The first laser was actually a maser, built in 1954 by C. H. Townes to
generate coherent microwave radiation from laser transitions in ammonia. The first
instrument to operate at optical frequencies was the ruby laser, constructed in 1960 by
T. H. Maiman. The invention of the optical gas laser followed soon thereafter.
310 The Quantum of Action 175
Example
depopulate the higher energy level in Figure 3-25. Let us examine this competi-
tion by constructing the ratio of the two rates from Equations (2-37) and (3-71):
Bu.
hv/k » T
'
A e - 1
The result may be recognized as the average number (ri) of photons with
frequency v in blackbody radiation. To see this recall Equation (2-38) for the
average energy (e) per mode at frequency v, and write
= =
<") T-
hv e
hv/k » T
- 1
'
hv Bu„ 1
= 80
kB T A e™ - 1
hv
-3
Bu v 1
Bohr's quantum picture of the atom did not develop at once into a fundamental
quantum theory. Quantization of energy was soon accepted as a demonstrable
property of the states of matter, and yet the underlying quantum principles remained
unknown for more than a decade. Some tentative insights were provided by Bohr's
quasiclassical model of the one-electron atom. Similar methods were also adopted by
A. J. W. Sommerfeld in his attempt to formulate the basic quantum principles.
Sommerfeld's approach to the mystery of quantization was based on a generalizing
concept known as the quantum of action.
The new ideas were focused on periodic behavior since periodicity was a common
feature of the earliest quantum systems. The frequencies of Planck's oscillators and the
orbits in Bohr's atom were instances where aspects of quantization were linked directly
to periodic properties. This connection was the key to Sommerfeld's proposal of a
generalized quantization principle. His procedure employed phase space as a scheme
for unifying all forms of periodic motion in quantum systems.
176 Introduction to the Mom
Let us introduce the idea of phase space by considering the evolution in time of the
classical motion of a particle. The dynamical equations of motion determine a
complete specification and velocity at any instant, if the particle's position
of position
and velocity are prescribed at some initial value of the time. We may use position and
momentum as an alternative pair of variables for an equivalent description of particle
motion.
Phase space is spanned by a system of axes whose definition is given in terms of
these position and momentum variables. The axes are constructed in pairs by taking a
position axis and a momentum axis for each of the particle's degrees of freedom.
Hence, a point in phase space denotes all the physical conditions of the system at a
specific instant since the point is located by the instantaneous values of all the
components of position and momentum. The point moves with time so that the
evolution of the system is described by a directed path in phase space. This locus of
connected points forms a closed path that repeats itself in cycles for the special case of
continuous periodic motion.
Sommerfeld's proposal expresses a general quantization principle for any kind of
periodic motion. The principle constrains the motion for each pair of conjugate
phase-space variables (q, p ), where the notation refers to the coordinate and momen-
tum for a particular degree of freedom of the particle. Every (q, p ) plane in phase
space exhibits periodic motion as a cyclic path surrounding a fixed area. Sommerfeld's
rule employs Planck's constant as a unit of area and defines an allowed quantum state
of the system to be a locus of points in phase space whose enclosed area satisfies the
formula
/cycle Pq dq = n h.
q
(3-72)
This area integral over a complete cycle in phase space is called the action integral.
The units of action are evidently the same as the units of h, and so Planck's constant
appears in the formula as a primitive quantum unit of action. Thus, the rule associates
a quantum number with the coordinate q and specifies an allowed path in phase
n
space to be one of a discrete set of paths whose separation in area is given by the small
quantum of action h.
Let us illustrate Sommerfeld's idea by examining the one-dimensional motion of a
mass on a spring. Phase space is the (x, p) plane, and the oscillating time dependence
of the coordinate is given for amplitude A and phase angle 8 as
The angular frequency is related to the mass and spring constant by the familiar
expression
'
k
dx
p = m — = —mijiA sin( ut — 8).
dt
These oscillating variables are connected through the definition of the conserved total
310 The Quantum of Action 177
Figure 3-30
Area h
energy
he 2
/•: = (3-73)
~2
2m
mw K
k k
E --
A 2 sin2 (ut - S) + -A 2cos 2 (ut - 8) = -A 2 .
+ (3-74)
2mE 2E/k
and thus identify the closed path in phase space to be an ellipse parameterized by the
constant energy E. Figure 3-30 shows how two such ellipses are constrained by the
quantization condition in Equation (3-72). We avoid computation of the action
integral by recalling the formula for the area of an ellipse, and we write the condition
in terms of the indicated semimajor and semiminor axes as
.IE
ny2mE \j = nh.
Hence, the allowed values of the energy are quantized according to the formula
h k
E= n—\-m
277 V
/
a hi (3-75)
f
Ld8 = n l
cycle
and immediately use the fact that L is a constant of the motion to obtain
LJ dO =L 2tt = n B h.
cycle
We note in passing that this simple calculation is valid even when the orbit is not
circular. Our result takes the form
L = neh (3-76)
and thus reproduces Bohr's expression for the quantization of angular momentum as
written in Equation (3-48).
Sommerfeld's quantization conditions were of some interest in the days of the old
quantum theory. The principle found extensive use in the generalization of Bohr's
model to the case of noncircular orbits. Degrees of freedom were associated with both
and r in this case, so that phase space was enlarged to include the two pairs of
conjugate variables (9, pe ) and (r, pr ). The angular momentum quantum number n e
retained its validity, while a second quantum number n r was also introduced by
applying the quantization principle to the radial variables as well:
/
p r
dr = n r h,
cycle
The two quantum numbers were employed by Sommerfeld and others to account for
a broader array of quantized energy states in the analysis of the hydrogen atom. Of
course, all these quasiclassical considerations passed rapidly into history with the
coming of quantum mechanics.
Problems
2. Obtain the general solution to the differential equation for Brownian motion,
dg 2k H T
—
dt
+
m
fi
—g =
m
,
2k B T
Consider Thomson's e/m experiment, and let 6 denote the angular deflection of the
electron beam as shown in the drawing. Deduce a formula to show how 6 depends on the
ratio e/m and on other parameters of the experiment such as the field strengths E and B.
The figure shows what appears to be an e/m apparatus similar to Figure 3-1. This
conventional cathode ray tube generates electrons from a hot filament and accelerates the
particles through a potential difference <j>
{)
. The electron beam passes between plates of
—fcHz>*
Determine the radius and the mass of the oil drop, and use the known value of e to
electron. Assume that radiation of the same frequency is emitted by the atom, and
compute the wavelength of the radiation. Take the radius of the atom to be R= 0.05 nm.
7. Prove that the a-particle orbit in Figure 3-8 has the same incoming and outgoing impact
parameters.
8. Refer to Figure 3-8 and show that the distance of closest approach min for
r a non-head-on
collision in Rutherford scattering is related to the scattering angle 6 by
d i e
'mm = ^1 1 + CSC-
where D is the head-on distance of closest approach. Check the validity of the formula by
considering the special case of a head-on collision.
180 Introduction to the Atom
9. Grains of sand are elastically scattered by a bowling ball as shown. Obtain a formula for
the differential scattering cross section. Let the total cross section be defined as
r da
a = / —dSt,
•'all a du
10. A 3 MeV beam of a particles strikes an aluminum target. Determine the distance of
closest approach between a particles and aluminum nuclei at this energy, and calculate
thenumber of aluminum nuclei per unit volume in the target. (Aluminum has atomic
number 13, atomic mass 27, and density 2.70 g/cm3 .)
11. Suppose that the a-particle beam in Problem 10 carries 10' a particles/s to the
-4
aluminum target, and let the target thickness be lO cm. Calculate the number of a
particles scattered per second into the backward hemisphere.
12. A beam of 6 MeV protons is scattered by a gold foil of 10 cm thickness. The proton
detector is ring shaped as in Figure 3-6. The ring collects protons scattered at 60° and
subtends a small element of angle equal to 1°. Calculate the distance of closest approach
between protons and gold nuclei at this energy, the number of gold nuclei per unit volume
in the target, the size of the solid angle subtended by the detector, and the fraction of
incident protons scattered into the detector. (Gold has atomic number 79, atomic mass
197, and density 19.3 g/cm'.)
13. Consider Rutherford scattering at beam energy A', and derive a formula for the fraction of
14. Show that the Bohr orbits have quantized speeds given by
Z
vn = — ac,
n
in which a is the fine structure constant. Use this result to assess the validity of the
16. The photon of largest energy in the hydrogen spectrum occurs at the Lyman series limit.
Calculate the momentum of the photon, and determine the velocity of the recoiling atom.
27
Take the mass of the atom to be 1.67 X 10 kg.
17. Obtain values (in eV) for the energies in the energy level diagram for singly ionized
helium He ' . Identify all transitions in He + for which the emitted wavelengths are in the
18. The Z = 1 neutral atom has three isotopic species: hydrogen, deuterium, and tritium. The
nuclear masses for these atoms are (approximately) M, 2M, and 3M, with M= 1.67 X
~ 27
10 kg. Obtain a formula and calculate a value for the difference of the Lyman a
wavelengths for hydrogen and tritium.
19. What are the Bohr- model formulas for the energy levels and the orbit radii in positronium?
What are the orbit speeds for the electron and positron? Obtain a formula for the
wavelengths in the positronium Balmer series, and calculate the corresponding maximum
and minimum wavelengths.
++
20. The doubly ionized lithium ion Li is a one-electron atom having Z= 3. Identify all the
++
transitions in the Li emission spectrum that lie in the far-ultraviolet region of the
spectrum and beyond, where the wavelengths are less than 50 nm.
21. A muonic atom is formed when a negative muon replaces an electron in a normal atom.
Determine the radius of the first Bohr orbit and the ground-state energy in a muonic
hydrogen atom. The muon mass is 105.7 MeV/c 2 .
23. The following La , Lp, and L y wavelengths are measured for one of the elements in
Moseley's survey:
Use the L a line to identify the element, and compare the Lp and L y wavelengths with the
values predicted from the Bohr model.
24. A Franck-Hertz experiment in hydrogen shows dips in the anode current at 10.20 and
12.09 V. What wavelengths should be observed in the radiation emitted from the
Franck-Hertz tube?
25. Consider the motion of a bead of mass m on a. wire of length a. The bead is free to slide
back and forth with constant speed between x = — a/2 and x = + a/2. What is the locus
of the bead's motion in phase space? Impose Sommerfeld's quantization condition to
determine the energies for the allowed states of motion.
FOUR
MATTER
WAVES
The matter-wave conjecture was motivated in part by the similarity between quan-
tized states and standing waves. The quantization of energies was associated with the
182
4-1 Be Broglie's Hypothesis 183
h
A= 7 ,
(4-1)
P
in obvious analogy to the relation for a photon given in Equation (2-52). Thus, a free
particle with mass m and speed v has a de Broglie wavelength
X = —
h
(4-2)
mv
for the special case of nonrelativistic motion. Equation (4-1) is understood to hold
more generally in situations where p denotes the relativistic momentum.
We can appreciate Bohr's condition for the quantization of angular momentum by
appealing immediately to de Broglie's relation. Let us suppose that an allowed orbit in
Bohr's model is such that the de Broglie wave for the bound particle is a standing wave
that just fits the circumference of the orbit. We illustrate this intuitive notion in Figure
4-1 and write the condition for the standing wave in terms of Equation (4-1) as
h
2ttt = nX = n— .
Figure 4-1
The result can be rearranged to produce the desired expression for the angular
momentum:
L = pr = n —
2v
= nh
Example
= 5.40 X 10~ 24 kg •
m/s,
34
6.63 X 10 J •
s
A = - = -24
= 1.23 X 10- 10
m= 0.123 nm.
5.40 X 10 kg • m/s
a particle. In contrast, a 1
g pellet with speed 100 m/s has de Broglie
wavelength
h 6.63 X 10" 34 J •
s
A = = 6.63 X 10" JJ m.
Mv " (l0~
3
kg)(l00m/s)
The value of A for this macroscopic object is many orders of magnitude smaller
than the actual size of the object, and so no experiment can be devised to
demonstrate its alleged wave properties.
4-2 Electron Diffraction 185
Example
Itmight appear from Equation (4-2) that the de Broglie wavelength is always
larger than the Compton wavelength h/mc. This conclusion is false because
Equation (4-2) is applicable only for nonrelativistic motion. Let us adopt the
more general form in Equation (4-1) and observe that the de Broglie and
Compton wavelengths can actually be equated as
h
when p = mc
mt
\E 2 — m 2c 4 = mc 2 => E- i/2mc
2
.
K= (i/2 - \)mc
2
.
Evidence for matter waves was found in 1927 in two separate laboratories. The
investigators were C. J. Davisson and L. H. Germer in one of the experiments, and
G. P. Thomson in the other. Both experiments demonstrated the effects of diffraction
angles by varying the angular position of a detector with respect to the direction of the
incident beam. The distribution was plotted as a function of the indicated scattering
Figure 4-2
Design of the Davisson-Germer electron diffraction experiment. The electron beam experiences
wave-like Bragg reflection from parallel planes in the scattering crystal.
Electron beam
Detector
Ni crystal
186 Matter Waves
Figure 4-3
angle <p for each beam energy in the manner shown in Figure 4-3. These observations
indicated that the scattering of electrons from the crystal target was just like the
diffraction of x rays by the same crystal. The intensity of scattered electrons showed a
large background of backscattered electrons and revealed a pronounced diffraction
peak in a certain direction for a particular value of the beam energy. This peak was
detected in the original experiment at an angle <p = 50° for a beam energy of 54 eV,
'/
= sin i
(4-3)
•1,1
77 OP
6= - - -
2 2
as the relation between the angle <p and the usual Bragg angle 6. Note also that the
spacing d between crystal planes can be regarded as known, since this quantity can be
measured separately by diffracting x rays of known wavelength from the given crystal.
Finally, observe that the Bragg angle is fixed in Figure 4-2 and that the diffraction
peak appears at this angle for a unique value of the beam energy.
4-2 Electron Diffraction 187
Figure 4-4
Example
p = pm K f
= /2(9.11 X 1(T 31 kg) (54 eV) (1.60 X 10"'°J/eV)
= 3.97 X 10" 24 kg •
m/s,
6.63 X 10" J4 J •
s
X= - = = 1.67 X 10~ 10 m = 0.167 nm.
p 3.97 X 10~- kg 4
•
m/s
:
* = V
V
—
40
(° 167 nm = )
°- 194 nm.
The Bragg condition compares these wavelengths with the quantity 2d. The
nickel crystal in the experiment known (from an x-ray determination) to have
is
interplanar spacing d = 0.091 nm, and so 2d = 0.182 nm. Equation (4-3) has no
solution when K = 40 eV, since A exceeds 2d in this case; therefore, no
diffraction peak is expected at this energy. The calculation at K= 54 eV gives
A 0.167 nm
cos- =
<p
2
—
2d
=
0.182 nm
= 0.918 => T =
o> 46.9°,
It is clear that the discrete nature of matter and the wave nature of radiation do
not provide a complete description, as classical physics has led us to believe. We have
to allow for complementary behavior that is wave-like in matter and particle-like in
radiation, because we are confronted with these properties when we perform the
appropriate experiments. Quantum physics thus requires the dual aspects of particle
and wave to be present simultaneously in matter and in radiation. This concept of
particle-wave duality also embraces a principle of complementarity . The dual characteris-
tics are complementary since matter and radiation are completely described by
adopting both particle and wave points of view. These two properties are not found to
be in contradiction because it is not possible to devise a single experiment that tests
both particle and wave aspects at once.
Particle-wave duality introduces the notion of a quantum particle. The extraordinary
behavior of this new entity is quite unlike any of the usual phenomena encountered in
our previous experience with classical systems. To illustrate, let us consider the
electron as a prime example of such a particle and explore some of the further
consequences of its dual qualities.
We expect the wave nature of the electron to carry over from electron diffraction to
any wave-type experiment. Hence, the remarkable properties of the matter wave may
also be seen if a double-slit experiment is performed with a beam of electrons,
provided the de Broglie wavelength is of the same orderas the spacing between the
4-3 Particle- Wave Duality 189
a screen, after the particles have passed through the pair of slits. We confirm our
expectation of wave behavior by obtaining a distribution of detected electrons that
conforms exactly to the familiar interference pattern produced by light waves in a
comparable Young's double-slit apparatus. If we reduce the incident intensity and
collect only a few electrons on the screen, we find that the sparse distribution of
collected electrons still resembles the original pattern to the extent that the inter-
ference minima are observed at the same locations as before.
The observations are totally different in a macroscopic double-slit experiment where,
for instance, a much larger pair of slits is used to transmit a beam of pellets. We recall
from a previous example that the de Broglie wavelength of such a particle is too small
to be detectable, and so we expect to find no interference effects. If we let the pellets
be embedded in a screen after passing through the slits, we see that the embedding
occurs in two regions of the screen directly downstream from the two slits. This
distribution tells us that each of the pellets has passed through one slit or the other, as
expected for a beam of classical particles.
The double-slit experiment for electrons cannot be interpreted to say whether a
given electron goes through a given slit. Instead, the two slits transmit electrons in the
manner of a wave passing through both slits at once. Let us suppose that the
experiment is altered by mounting a separate current loop around each slit. An
induced current can then be observed in one loop or the other to detect the passage of
an electron through the corresponding slit. This induced current affects the passing
electron, however, so that the particle is likely to be found at a different location on
the screen owing to the modification of the slits. Thus, we destroy the wave behavior
of the original two-slit interference experiment if we attempt to get classical informa-
tion about the particles by tracking the electrons through the separate slits. In fact, we
succeed in changing one wave-type experiment into another whenever we monitor the
two slits. The two-slit interference pattern appears when the monitoring device is off,
while two overlapping single-slit diffraction patterns appear when the device is on.
The real meaning of de Broglie's matter wave remains to be explained. We know,
of course, that the wave is not to be associated with the oscillations of a material
medium, but we are not yet in position to identify the actual oscillating entity. Let us
approach this ultimate question of interpretation in the quantum treatment of matter
by looking first at the implications of particle-wave duality in the more familiar case
of radiation.
The radiation field is and magnetic
a continuous system of propagating electric
oscillations. We and choose the electric field E(r, t to specify the
follow convention )
form of the propagating wave. We also assume a monochromatic wave so that the
oscillations have a single frequency and wavelength given by v and A. The flow of
radiated energy is expressed by constructing the electromagnetic Poynting vector in
terms of the total field E at a given point P in space. Thus, we find that the flow of
energy per unit time across unit area at P is given by the expression
ce E2 .
Figure 4-5 provides a specific example in which the monochromatic radiation field is
produced by the illumination of a double slit and the point P is identified by the
position of a small photoelectric cell. The total field E at P is the sum of coherent
waves E, and E2 arriving at the given cell from the two slits. An interference pattern
is observed in the flow of energy as P varies over an array of such cells. Constructive
interference occurs at a given cell if the two waves arrive in phase, as in the situation
shown in the figure. The indicated slit separation d, wavelength A, and direction 6
190 Matter Waves
Figure 4-5
Double slit
ptj Photocell
Monochromatic Interfering
plane wave cylindrical waves
The general expression ce E oscillates with time, and so a more useful observable
quantity is found by averaging with respect to time over a complete cycle. We
therefore define E^ as the time average of E~ at the point P and introduce the
radiation intensity
ce E^
to obtain a quantity whose value depends only on the position P. The figure shows
how this intensity varies with the location of the photocell in the case of double-slit
interference.
The implications of particle-wave duality emerge when these familiar results are
cast in terms of photons. We know that the photon carries radiation energy hv, and so
we may express the radiation intensity at P by means of the alternative expression
hvlp ,
where IP denotes the average number of photons entering unit area at P per unit
time. The conclusion is a simple equality between the two ways of writing the
radiation intensity:
This formula is a statement of particle-wave duality since the left side is written in the
language of discrete quanta while the right side is written in the language of wave
fields. Note that our example in the figure adapts at once to the introduction of the
quantity IP because the indicated photoelectric cell is sensitive to the arrival of
individual photons.
4-3 Particle- Wave Duality 191
numbers of photons
Double-slit diffraction for increasing levels of incident intensity. Increasing
random locations on a distant screen. The probability distribution of
are detected per second at
detected photons conforms to the wave diffraction pattern. The intensity of this pattern is given
at large distance by the angle-dependent function
,»n (<k/2)^—
2
4 /r ,cos" 2
2 (*./2)
i
sin 8
i;3;>S:&
The photon description on the left side of Equation (4-5) makes no reference to any
concept of path for the arrival of photons at position P. We entertain no such notion,
even when we imagine a very low level of intensity and consider only a single photon
in transit. Instead, the detection of a photon at a particular location is understood to
be a random event that represents the one and only operational specification of the
position of the photon. Our double-slit example illustrates this act of locating a photon
as a single random occurrence in which the photon is absorbed once and for all by an
atom in a photoelectric cell.
The wave description on the right side of the equation tells us where the random
events of photon detection are likely to occur. To draw this conclusion we let the
relation and IP establish a correspondence between an intensity distribution
between Ej>
for the behavior of waves and a probability distribution for the detection of photons. The
quantity IP is interpreted as a measure of the probability of observing a photon in a
small region located at P. The equation connects this probability to the square of the
field at that location. The familiar wave field E(r, is thus / ) interpreted as a
mathematical wave function, or guiding field in Einstein's words, that furnishes statisti-
cal information regarding the likelihood for the random arrival of photons. Our
double-slit experiment illustrates this interpretation even at very low levels of inten-
sity. The resulting photon distributions are very sparsely populated and show no
isolated events in anyregion where destructive interference occurs for the correspond-
ing wave. A more marked resemblance then develops between the photon population
and the interference pattern as the incident intensity increases.
Let us add two final remarks about the wave function E(r, t ). We recall that the
electric field for waves in free space satisfies the familiar wave equation in the form
192 Mailer Waves
^(r, t), as E,j, is derivable from E(r, /). We then interpret ^J as a measure of the
probability of finding a quantum particle in a small region located at position P. The
notation 4^ is introduced temporarily to convey the analogy with Equation (4-5), so
that the probability of finding the particle at P has the form of an intensity
determined by the square of the wave. The proposed wave function ^(r, / ) is the basic
element in a formal theory of a quantum particle. Since the formalism is based on a
treatment of waves, it is understood that the addition, or superposition, of waves is
supposed to be incorporated as a main feature. Interference and diffraction effects for
quantum particles are then immediately explainable in terms of a probability
interpretation of the observed wave behavior.
These anticipatory remarks are offered as a statement of our objectives. We know
that evidence for de Broglie's matter wave is to be found in electron diffraction or any
other wave-type electron experiment. Our arguments imply that such observations of
wave behavior are to be interpreted probabilistically, in the manner of the probability
interpretation of radiation. To illustrate, let us return once more to our double-slit
experiment for light in Figure 4-5 and recall our discussion earlier in the section
regarding an analogous experiment for a beam of electrons. Particle-wave duality
predicts a complete parallel between radiation and matter, as an interference pattern
is mapped out by an array of photoelectric cells in the one case and by a similar array
of electron detectors in the other. The common probabilistic interpretation tells us how
the observed pattern describes a distribution of discrete observations at each point P
in the array. A photoelectric cell at P measures the probability of detecting a photon
while an electron detector at P measures the probability of detecting an electron, as
random detection events in both instances.
Example
Both Thomsons, father and son, performed experiments with beams of electrons.
We should be able to understand why the father saw no evidence for matter
waves by computing the relevant de Broglie wavelength. A typical cathode-ray
experiment has been discussed in the first example of Section 3-2. The electrons
were found to have a speed of 1.68 X 10 m/s and were allowed to pass
between a pair of plates with 2 cm separation. The corresponding de Broglie
wavelength would have the value
34
h 6.63 X 10 J •
s
A = = -. -r. t-.
: = 0.0433 nm.
m t
v (9.11 X 10" 31
kg)(l.68 X 10
7
m/s)
4-4 Determinism and Randomness 193
This wavelength would be far too small for an observation of diffraction effects
to predict the time dependence of r( t) for a given applied force through the use of
Newton's law
d't
m —
dt
2
F.
Two pieces of initial data must be given if this second-order differential equation is to
have a unique solution. Such initial information may be expressed in terms of position
r and velocity v, or position r and momentum p, at time t = 0. The resulting
deterministic principles aside and turn instead to the question of predicting probabili-
ties. There is a clear necessity for the latter kind of particle theory whenever randomness
characterizes the observable behavior of the particle.
We have anticipated the formulation of this theory in the previous section by
referring to the expected properties of the wave function ^(r, in our interpretation t )
of the matter wave. The determination of ^(r, /) for the quantum particle becomes
the objective in quantum physics, replacing the problem of solving for r( /) in the case
of the classical particle. We incorporate the random detectability of the quantum
particle in the probability interpretation of ^ and make no reference at all to the
deterministic classical trajectory defined by r(/). Note that, while the detection of a
particle at a given location is an entirely random event, the probability of random
194 Matter Warn
Figure 4-6
raj-
rr
Figure 4-7
X ray illuminating an
atomic electron
exposure to the theory. Since the uncertainty principle can be established in both
matrix and wave pictures, we can regard the resulting idea as a deduction fundamen-
tal to any version of the quantum theory.
The principle delimits our ability to make accurate measurements of the observable
properties of a quantum particle. It is proved in the theory that corresponding
components of the conjugate variables r and p cannot both be known with absolute
precision at the same time and must exist in their simultaneous
that uncertainties
determination. It is specifically shown that the conjugate quantities ( .v, p ) have x
h
AxA A >-, (4-6)
196 Mailer Waves
Werner Heisenberg
and that identical statements hold for ( y, p y ) and (z, p,). The conclusion identifies a
Figure 4-8
Allowed region
6p^ to
8»
Sx 5p =
lack of a proof of Equation (4-6) should not concern us either, provided we recognize
Planck's constant as the main factor in the minimum product of uncertainties.
Thought experiments can be performed to illustrate the logic of the uncertainty
principle and explain the appearance of Planck's constant in the result. A simple
device for the purpose is the elementary thin-lens apparatus illustrated in Figure 4-9.
Let us use the device to form a real image of a particle and consider the uncertainties
involved in the determination of the particle's position. We orient the axis along the
vertical y direction and let the particle be illuminated vertically from below, as shown
in the figure. Formation of an image means that the particle must scatter at least one
photon into the aperture of the lens. The transverse x component of momentum of a
scattered photon may then have any value in the range
( —p sin#, p sin#),
where 6 is half the angle subtended by the aperture and p is the scattered photon
momentum. Planck's constant makes its appearance when we use p = h/X to express
the momentum in terms of the indicated wavelength. The act of measurement causes
the particle to recoil with an opposite transverse momentum. We therefore obtain
Figure 4-9
Real image formation to determine the location of a particle. The position uncertainty Ax can
be reduced by enlarging the lens aperture, but the momentum uncertainty &px is made larger
as a result.
t Image
Lens
\ )/ Scattered
\ (I photon
Particle
llumination
minimum of the other. This condition is illustrated in the figure as the means of
defining the uncertainty Ax in the measurement of a particle's transverse coordinate.
We can use the geometry in the figure to construct an expression for Ax once we know
the indicated diffraction angle qp. The angle for the first diffraction minimum is found
from the wavelength and the lens diameter according to the formula
A
sintp = 1.22-
d
(We recognize the analogous formula d sin <p = A as the condition for the first
diffraction minimum due to a slit. The additional numerical factor 1.22 is needed in
the case of a circular aperture.) Two geometrical relations connect the various
distances and angles:
Ax H
— = tan<p
y
and
= tan 6.
y
The related quantities are the object and image distances y and Y and the image
separation H. We eliminate y and d as follows to get a result for Ax in terms of the
angles:
d 1.22A tan<p
Ax = y tan q>
= tan op = .
The angle cp is intended to be quite small, so that the position uncertainty becomes
0.61A
A* = (4-8)
tan#
The other angle is constrained only by the condition 6 < 90° for a finite aperture.
Let us write our results in Equations (4-7) and (4-8) side by side as
0.61A 2/*sin0
Ax = and Aft
r" = (4-9)
tan X
kxkpx = (1.22cos0)A.
also be decreased to gain better resolution, with exactly the same result.
position for any particle in the incident beam. The wave is diffracted by the slit and
produces a distant intensity distribution just like the pattern obtained for light. The
figure describes a distribution with a large central diffraction peak whose first
minimum is given in terms of the indicated angle and slit width by the formula
sin i
Figure 4-10
We cannot predict the point of detection for any particle in the region to the right of
the slit. Instead, we use the wave intensity distribution to determine the probability for
detection of a particle at any given location. The slit localizes the probability of
finding the particle in the transverse y direction, as demonstrated by the intensity in
the central peak. We
may estimate the uncertainty in the transverse coordinate for
any such particle by setting
Ay = a.
The diffraction peak is actually broader than the slit, and so there must be a
transverse momentum uncertainty Ap to account for the divergence of the beam. We
may estimate this quantity for a particle in the central peak by referring to the sketch
shown in the figure and writing
Ap = psm6.
A y Ap = ap sin 8 =
h
a-
A
—=
X
a
h ,
again in agreement with the uncertainty principle. We note that a narrower slit
Example
X 10~ 34
A* =
h
= —
1.05
;
r
I • s
= 0.53 X 10" 31 m.
2Ap 2(10" 9
kg m/s)
4-5 The Uncertainly Principle 201
This result of the uncertainty principle is far below any realistic measuring error
and has no practical significance for such a measurement. On the other hand,
imagine that an electron is to be observed in its orbit in a hydrogen atom. We
assume that we can make a position measurement with sufficient refinement to
distinguish between two adjacent orbits. Let us take Equations (3-50) and (3-56)
from the Bohr model, ignore the reduced-mass effect, and express the nth Bohr
radius as
n a,
2
Ar = r — r ,
:
-[n
2
-(n- l) ]
= - -(2«
amc
2Ar 2(2 n
am ,c
from Problem 14 at the end of Chapter 3. We note that Ap and m e vn are of the r
same order, and we conclude that the act of measurement is disruptive enough
to affect the orbit of the observed particle.
Example
h 1.05 X 10" 34 J •
s
LP = = 1.05 X 10 -
4
kg m/s.
A7 10- 10
m
The actual electron momentum may be at least this large; therefore, the kinetic
202 Matter Waves
_ '-'
4 2
(1.05 X 10 kg m/s)
K= -31
= 3.78 eV.
2m e
2(9.11 X 10 kg)(l.60 X 10"' 9 J/eV)
This result is quite reasonable since the order of magnitude compares favorably
with values of the total energy in the Bohr model of the atom. If we attempt to
localize an electron in a region of nuclear size, we find a considerable increase in
our estimate of the energy. A position uncertainty of order A.v = 10~ 14
m
implies a much larger momentum uncertainty A/?. The electron is now likely to
have a large relativistic kinetic energy, of a scale appreciably larger than that
found for nuclear particles. We are then able to argue that electrons should not
occur as constituents of the nucleus. The numerical details of this argument are
left to Problems 7 and 8 at the end of the chapter.
Example
The uncertainty principle can be used to estimate a lower bound for the energy
of a particle. Consider an oscillating mass m on a spring with force constant k.
The energy is a constant of the motion, given in terms of the variables x and p
by the formula
2 2
E= —p
2m
+
kx
2
.
Classical physics allows a minimum energy equal to zero for the trivial case of
X 7
2m 2
The average values of x and p should vanish for an oscillating particle, and so
2
the average values of x and p can be identified with the squares of the
corresponding uncertainties:
<*
2
> = (A*) 2 and (p
2
) = (Apf =
2 A;
E= —-
2
h k
2
+ -8 2 .
8mS 2
Figure 4-1 1 shows that this expression has a minimum for a certain value of 8.
4-5 The Uncertainty Principle 203
Figure 4-1
«o
dE fr
= = + kS
4m8 :
to get
2
h
2
So =
Amk
2
4mk k h
= -co,0)
8m \mk
where co appears in the final answer as the familiar angular frequency of the
classical oscillator. Our estimation of the minimum energy from the uncertainty
principle happens to agree with the exact formula obtained from quantum
mechanics. The result is called the quantum zero-point energy for the one-dimen-
sional oscillator.
Example
Figure 4-12
Slit
Screen
h
Po = ir~
2 Jo
y = ya + — t= yo+ ~
n
— '<
where t is the time when the particle reaches the screen. The distance to the
screen is determined by the beam momentum P as
/'
x= -t,
m
h mx fix
y =y + ~7T = yo +
2my P 2Py
This expression has a minimum for a certain choice of y , as obtained from the
condition
— = 0=1
dy
dy
hx
2P7Z
2
hx j>
-fo
= /
VV tb
2P
and y
= yo + —
y
=
Q
2 yo-
4-6 Waves and Wave Packets 205
We conclude that the image cast by the pa rticle be am has its smallest width
when the half-width of the slit is taken to be y/hx/2P Note that the result for y .
can be rewritten as
hx
J
Xx
2P " V 477 '
We begin our formal treatment of the quantum theory in Chapter 5. Our purpose in
this concluding section is to prepare a foundation for the wave picture of quantum
mechanics by reviewing the properties of waves. We are especially concerned with the
use of a wave to convey the idea of localization, since we know that a wave distribution
is supposed to represent a distribution of probability for locating a quantum particle.
This review deals only with waves described by a single spatial variable. We begin
with the properties of a monochromatic traveling wave and illustrate by means of the
wave function
Note that factors of 277 are eliminated by introducing the wave number k and the
angular frequency w as
k = —
277
A
and to = 2mv. (4-11)
The argument of ty is called the phase of the simple harmonic wave. This quantity
determines the cyclic behavior of the wave as the function ^ varies with the
independent variables x and t. Figure 4-13 shows how two configurations of a
traveling wave on a string are seen as snapshots of ^ versus x, taken at two different
times during a cycle.
Equation (4-10) is not necessarily restricted to waves in a one-dimensional medium,
even though the expression has only one spatial variable. We can also let ^ represent
Figure 4-13
a plane wave in three dimensions and visualize the wave as the propagation of a plane
surface of constant phase. This wave front is defined by setting the argument of ^
equal to a constant:
kx — co/ = <j) .
The resulting plane surface has a varying position along the x axis, given by
<J>
+ co/
x = at time t.
k
Hence, the surface of constant phase propagates in the positive x direction with phase
velocity
v, = — = v\. (4-12)
* k
A similar conclusion is drawn for the traveling wave in Figure 4-13. The alternative
wave function
^ = Asm(kx - co/)
Figure 4-14
Analogue model of phase propagation. Clocks are arrayed at 3 m intervals along an axis, and
each clock is set in sequence to run 3 h out of synchronism with its predecessor. The hour hands
simulate wave motion, with wavelength X = 12 m and frequency v = (12 h)~ '. The system of
clocks is read at times / = 0, / = 3 h, / = 6 h, . . . .In each instance special notice is taken of
the indicated 1 2-o'clock reading as a particular choice of phase. The selected orientation of the
hour hand is observed to propagate with phase velocity v^ = 1 m/h. Thus, the phase travels
along the axis while the clocks remain in place.
x =
t=
(= 6
4-6 Wares and Wave Packets 207
describes a wave with the same properties as the original ^ in Equation (4-10), apart
from the obvious quarter-cycle difference in phase. Another entirely different example
of phase propagation is described in Figure 4-14.
A complex-number representation is often employed as a convenient mathematical
device for the handling of phase in an oscillating system. A brief summary of the
properties of complex numbers is given in Table 4- 1 The main result for our purposes .
Imaginary number i = /— 1
Imaginary part y = Im z
Complex conjugate z* = x — iy
2
Modulus \z\ where \z\ = zz* = x
2
+ y2 = r
2
Imaginary axis
Real axis
Complex plane
y = r sin0 = r(8 1- • • •
)
3!
z = r(cos + i sin 6)
3
{idf (id)
= r[\ + id + +
2! 3!
6
Phase factors e' = cos 6 + i sin 6 and e
,e
= cos 9 - i sin 6
Real part Ree' = cos0 = {e'
e
+ e~ ,e )/2
Imaginary part Imf' 8 = sin0 = (e
iB
- e~ ,e )/2i
208 Mailer Waves
in which the cosine and sine functions appear as the real and imaginary parts. This
formula can be applied advantageously to represent a wave in the form
- at)
* = Ae* kx = A[co&(kx - at) + ism{kx -at)}. (4-13)
Notice that the expression describes both cosine and sine monochromatic traveling
waves at once. The complex form can be used to make calculations of wave behavior
in a physical medium, and then the real or imaginary part can be taken, as
appropriate, at the end of the analysis.
The foregoing illustrations and formulas pertain to waves traveling in the positive x
direction. The mathematical expressions are immediately adapted for propagation in
the opposite direction by changing the phase variable from (kx — at)to( — kx — at).
In either case the various forms of the wave function are easily seen to satisfy the
one-dimensional wave equation, written (with speed of propagation z;) in Equation
(2-1 1). Of course, we are not yet able to write the wave equation for use in the case of
matter waves.
Let us consider the possible adoption of Equation (4-13) as a matter wave for a
quantum particle. We see at once that the parametrization defines a unique de Broglie
wavelength A = 27i/k and a correspondingly unique momentum for the particle:
h
p = — = hk.
A
differentwavelength and frequency. Let us illustrate this point by adding two such
wave functions, using the real-valued form in Equation (4-10):
Figure 4-15
Composition of two monochromatic waves with different wave numbers and frequencies
(k + 8k, w + Su) and (k - 8k, u - Sto). The composite system propagates with group velocity
v
g
= Su/Sk.
^v e
This superposition of waves exhibits the familiar phenomenon of beats. Figure 4-15
shows a snapshot of ^ versus x in which the oscillations of the wave are contained in
an envelope defined by the first of the two wave factors in Equation (4-14). This factor
by itself describes a monochromatic wave with wavelength 2^ /8k and frequency
540/277-. The whole wave pattern in the figure travels in the x direction with a speed
determined by the propagation of the envelope. The speed of such a composite wave
system is known as the group velocity, given in this instance by the formula
Note that the group velocity is quite different from the phase velocities
and
k + 8k k - 8k
Figure 4-16
Schematic wave packet representing the localization of a particle. The packet travels with
group velocity v .
/oo
'- a,)
* A(k)e iik dk. (4-16)
do)
Figure 4-17
A(k)
4-6 Waves and Wave Packets 21 7
where the infinitesimal increments are taken at the value of the heavily weighted wave
number k = k in the figure.
Equation (4-17) and the phase velocity v^ in Equation (4-12). A value of v^ is defined
for every wave number k in the integration of Equation (4-16), while a single value of
v is associated with the entire wave packet and the heavily weighted wave number
k = k. The two quantities can be connected by the relation
l'
s
=
du
lk
=
,1
-(K)
7k
»*
*
+ *
—
dvj,
dk
(4-18)
Thus, the group velocity and the phase velocity at k differ whenever v^ depends on k.
Phase velocity is independent of wave number for the special case of light waves in
vacuum. We see this from the simple formula
v+ = p\ = c,
and so we obtain the expected result v = c for the group velocity of any wave packet
(or pulse) of light in vacuum. A different result is found for the propagation of light in
a medium because of the effect of the index of refraction as a factor in the phase
velocity
The refractive index n has a dependence on k, and so the determination of the group
velocity for a light pulse in a medium calls for an identification of the wave number k
and a proper evaluation of Equation (4-17). This k dependence (or X dependence) of
the index of refraction causes the phenomenon of dispersion.
The dispersive property of a wave can also be described as the result of a nonlinear
relation between the wave parameters to and k. We see such an effect when we
compute the group velocity of a matter-wave packet for a localized free particle. Let
us introduce energy and momentum for this purposeand adapt the parameters in
Equations (4-11) accordingly, using the Planck-Einstein and de Broglie relations
E E p p
w = 2?:- = - and k = 2v- = -. (4-19)
h h h h
We can then regard the matter-wave packet as a superposition of matter waves with
definite momentum, summed over a continuous range of momenta and energies. In
this parametrization the formula for the group velocity of the wave packet becomes
dE
i--, (4-20)
p = hl. (4-21)
The effect of dispersion arises because the parameters satisfy the nonlinear relation
2m
212 Mailer Warn
for a free particle. The group velocity of the corresponding wave packet is then found
to be
v
g
=~ (4-22)
m
from Equations (4-20) and (4-21). We identify this result immediately as the velocity
of the localized free particle.
The mathematical properties of the wave packet in Equation (4-16) are not
difficult to analyze. Let us examine these details in the context of Figure 4-17
specifically so that we can visualize the idea of localization. We note that the equation
defines the wave packet as a sum over all wave numbers and that the range of
summation becomes finite when the amplitude A(k) has the shape indicated in the
figure. The formula for ^ therefore assumes the simpler form
~ ~
k+ K - ut k+ K
* = [ A{k)e* kx Uk - A(k) [ e
ikx
e-
iu
'dk.
The last step approximates the integral by taking the amplitude function to be
evaluated at k = k, where the distribution of wave numbers has its peak.
The remaining integrationis then performed via the following series of maneuvers.
k = k + k,
co = -
E
h 2mh
p
2
- = —h
2m
k
2 =
h
2m
_
(k
2
+ 2kK + k
2
).
^"v-' (A /2m,(p 2 * K K
~'
+ +
* = A(k)e ,ix f ' ,
^c
2
hi
co = .
2m
We then make a further approximation and replace the third exponential factor in the
2
integrand by unity. Our treatment of this function of k is justified since k remains
small when the integration is restricted to such a small k interval as that shown in
Figure 4-17. These steps lead us to the result
- _
- Ul) rK hk
*= A{k)e ,(kx \ e'
{x -"*' )K
dK, where v=—,
J-k rn
in which Equations (4-21) and (4-22) are recalled to identify the group velocity v .
4-6 Waves and Wave Packets 213
The elementary integral yields the desired formula for the wave packet:
*- S,)
* = A(k)e i(i r,
~ c
Ax v l
g )
= 2A{k)e* k *- z" ) —
sin( * - vj )k
^— (4-24)
The significant part of the final expression is the rightmost ratio of factors containing
the quantity (x — v t). The shape of this function of x at a given time / is just like the
behavior of the wave packet in Figure 4-16. We note that the oscillations are localized
within a region centered at x = v t and conclude that the wave system must be
traveling at group velocity v in the positive x direction.
AxAk ~ 1 (4-25)
for any consistent definition of the two quantities. This result turns into a statement of
when the second of Equations (4-19) is employed to relate
the uncertainty principle
wave number k and momentum p. The range of k values in the construction of the
wave packet corresponds to an interval Aco in the angular frequency. This localized
system of waves travels past a given location in a time interval A/. It can be proved
that the intervals Aco and At satisfy the additional relation
Ace At ~ 1. (4-26)
If we again consult Equations (4-19) to relate angular frequency and energy, we find
that this second conclusion becomes an uncertainty principle involving energy and
time:
AEAt~h. (4-27)
The generality of Equations (4-25) and (4-26) should be emphasized as both are
applicable to any type of localized wave. We should also acknowledge the primitive
status of the uncertainty principle
h
Ax Ap
H >
-
2
since the inequality can be proved from the basic quantum mechanical properties of
the observables x and p. All the proofs cited in this paragraph fall outside the scope of
our mathematical presentation.
214 Mailer Waves
Figure 4-18
Diffraction of waves by a single slit. Each element of slit width dy emits a wave element d^.
The emitted waves are in phase at the plane of the slit and travel parallel paths to a distant
point of observation.
Example
dy
= Ae k R -y 3ine )- ul ]—
dty >l <.
ffl/2
d '(*«-w')/ ,-'*>sintf_Z
Ae'/ />
•'slit
J
ka
|
— sin
Ae i(k*--)
ka
Oddness and evenness properties are employed in the integrand along the way
to the final result. The rightmost ratio of factors is the main part of the
expression since the square of this ratio describes the 8 dependence of the
distribution of intensity for single-slit diffraction.
4-6 Waves and Wave Packets 215
Figure 4-19
A-
Example
and let the distribution of wave numbers be specified by the amplitude function
2
A (k) =*-<*/*> .
The shape of this gaussian distribution is shown in Figure 4-19. We note that the
peak of the function occurs at k = 0, and sowe can assume that we are
describing a wave packet ^ for a particle at rest. The integration becomes
since the sine portion of the integrand is odd and integrates to zero. A table of
definite integrals yields the result for the wave packet at t = 0:
This function is also shown in the figure to have the form of another gaussian
distribution. Each of the two shapes has a readily identifiable width, defined as
the indicated spread in the distribution evaluated at half the maximum value of
the function. These definitions of Ax and A£ can be shown to satisfy Equation
(4-25) for any choice of ic; the demonstration of this property is left as Problem
1 7 at the end of the chapter. Note the reciprocal sense in which the parameter ic
.
controls the widths of the two distributions. We see that A(k) is sharp and ^ is
broad if it is small, and we find that the reverse is true if k is large. This striking
feature makes the gaussian wave packet an especially appealing prototype to
illustrate the mathematical concept of localization.
Problems
1. Let the kinetic energy of a particle at room temperature be expressed as -,k B -^ eV. T=
Calculate the de Broglie wavelength for electrons and for neutrons at room temperature.
21
(Take the neutron mass to be 1.67 X 10" kg.)
2. Consider the de Broglie relation for a relativistic particle and show that wavelength and
kinetic energy are related by
he
X =
2
]JK(K+ 2mc )
he
X ~ - for K » me
1
,
K
and
h
X ~ , for K <s: mc
flmK
A beam of 1 eV neutrons strikes a crystal whose crystal planes are spaced by 0.025 nm.
Determine the angle <p for which the first diffraction maximum is observed. Are there
higher orders in the diffraction pattern?
Plane waves of 500 nm light are incident on a photocell whose receptor is 1 cm square.
Suppose that the cell records a counting rate of one photon per second with 100%
efficiency. Calculate the intensity of the light (in W/m 2
), and determine the amplitude of
An object is dropped from rest so that its subsequent vertical position is given by
This deterministic result becomes blurred if the initial position and velocity are not so
accurately known. Let ^(0) and v(0) lie somewhere in the intervals — Sy, Sy) and
(
( — Sv, Sv), respectively, and deduce the resulting behavior of the variable y(t). Draw a
graph of the deterministic result and show how the graph is modified by these considera-
tions.
A proton bound in a nucleus experiences the strong force of nuclear attraction when the
-15
particle is within about 10 m range of another nuclear particle. Estimate the kinetic
Problems 217
energy of the proton for this sort of localization. What can then be said regarding the
potential energy?
8. Reevaluate the calculations of the previous problem for an electron localized in a region
of nuclear size.
E^ 2m
».2
e
477e
2
'—,
r
neglecting the reduced-mass effect. Let r denote the radial distance within which the
bound electron is localized, and estimate the corresponding momentum p. Continue
the estimation procedure to obtain expressions for the minimum value of E and the
minimizing value of r. Compare the results with those found in the Bohr model.
10. A particle of mass m is confined to a one-dimensional region of length a. Use the
uncertainty principle to obtain an expression for the minimum energy of the particle.
Calculate the value of this energy for a 1 g bead on a 10 cm wire and for an electron in a
region of 0.1 nm length.
11. Let pellets be dropped onto a small spot on the floor. The uncertainty principle implies
that the pellets cannot be assumed to fall straight down from rest. Allow for uncertain
initial conditions, especially those transverse to the vertical, and deduce an expression for
the minimum range R on the floor at a distance H below the location of release.
12. Show that any function of the form f(kx + ut) is a solution of the one-dimensional wave
equation. Identify the speed of wave propagation in terms of the parameters in /.
14. Consider the interference of two waves ^, and *$?.,, emitted in phase from two very narrow
parallel slits. The waves have the same amplitude, wavelength, and frequency and are
observed as shown, at an angle 6 and at a large distance R. Construct the superposition
4', + ^2 using the complex wave format, and deduce the 6 dependence of the resulting
interference pattern.
15. Consider the addition of waves emitted in phase from two parallel slits with width a and
separation d. The resultant wave is observed as shown, at an angle 6 and at a large
distance R. Let the wave element emitted at the indicated location y be written as
218 Mailer Waves
Determine the resultant wave and deduce the 6 dependence of the observed diffraction
pattern. Evaluate the limit of these results as a —* 0.
16. The width of a wave packet and the width of the associated wave-number distribution are
connected by the "equality" Ax AA ~ 1. Use this result to construct a simple proof of the
analogous "equality" AwA< — 1 involving the related intervals of angular frequency and
time.
17. Refer to Figure 4-19 and consider the gaussian wave packet discussed in the second
example of Section 4-6. Express the widths Ax and Ak in terms of ic, the parameter
hy
A(k) =e~ (h \
18. Determine the form of the complex wave packet ty at / = 0, given the amplitude
distribution
\ otherwise
2
Obtain the squared modulus |^| and sketch a graph of this function.
19. Let the amplitude function for a complex wave packet ^ be given as
sin ka
A(k)=A ()
-
ka
219
220 Quantum Mechanics
Erwin Schrodinger
Most of our discussion in this chapter is devoted to the special case of a particle in
one -dimensional motion. Our intention is to introduce the new mathematical procedures
in certain familiar situations where the behavior of the particle is described by a single
linear degree of freedom. Figure 5-1 shows an oscillating mass on a spring and a freely
sliding bead on a wire as two such examples. Each situation involves constrained motion
in which the quantum particle is highly localized with respect to the two coordinates
transverse to the particle's one degree of freedom. This constraint allows us to ignore
two out of three coordinates in all stages of the treatment. We also assume that the
motion of the particle is governed by a conservative force so that we can define a
potential energy as a function of the remaining spatial variable.
Let us assume that the particle is able to move in the x direction and that the
motion is nonrelativistic. We let the potential energy be expressed by means of a
function V(x) so that the force on the particle is given as F = —dV/dx. Classical
mechanics would have us determine x(t), the position of the particle at time /. This
sort of result is not the objective of interest in quantum mechanics. Instead, we want to
find the wave function ^(x, t) for the matter wave and then use ^ to make
predictions about the probability of random detection of the particle. These goals
5-1 The Schrodinger Equation 221
Figure 5-1
Constraining
Constraining tube
cannot be addressed until the wave equation for ^ is specified. It is obvious that we
are looking for a partial differential equation in the independent variables x and t.
The equation is evidently supposed to contain the given potential energy V(x) in one
of its terms.
Let us turn for guidance to the problem of wave propagation in a one-dimensional
medium. We recall the appropriate wave equation as
2 2
d u 1 d u
(5-1
dr 2 ^Yi 2
where v is the speed of propagation and u(x, t) is the displacement of the vibrating
medium. Equation (5-1) is linear in u; therefore, the existence of independent solutions
u (x, t) and u 2 (x, t) implies the existence of another solution given by the sum of
{
waves
In general, the linearity of the equation allows us to take any set of solutions u k (x, t)
and construct arbitrary linear combinations in the form
to obtain other solutions. This property is known as the principle of superposition. The
concept is fundamental and the construction of wave
to the description of interference
packets, since both of these procedures involve the summing of waves. We want the
desired equation for the matter wave to be linear in the wave function ty(x, t) so that
matter waves can also enjoy these important consequences of superposition.
We observe that Equation (5-1) is a differential equation of second order in the time.
It follows that two pieces of initial data must be given to determine the subsequent
time dependence of the solution. Thus, we may take an initial specification of
displacement and velocity for all points in the medium as
du
.(x.O) and (-1)
and then predict a unique solution u(x, t) for any later value of /. The matter wave
222 Quantum Mechanics
uniquely determined from a single initial condition given by the behavior of ^{x, t)
at t = 0.
Let us examine this feature of the quantum theory by considering the case of a free
particle. We choose the arbitrary reference level in the definition of the potential
energy V(x) so that the constant value of V is equal to zero for a zero force. A
localized free particle is described by means of a wave packet of the form
where ^ has properties as discussed in Section 4-6. The wave parameters satisfy the
relation
<o = —
M
2m
2
(5-2)
for a particle of mass m, and so the formula for the free-particle wave packet becomes
/oo
kx - (h/2m)k
.,
<9
2
* ,oo
- (h / 2m
T =( A{k)(ik)
2
e'\
kx
»'Uk
dx 2 J-oo
and
d^ ,oo / in \
= A(k)\
V
k
2
\e'^-' h/2m)k ~'Uk.
dt J-*, '\ 2m J
2 2
These two results tell us that the quantities (ih/2m)d ^ /dx and d^/dt are
identical. We express this equality as
2 2
h d d
-V =
2
ih —* (5-4)
2m dx dt
and thus obtain the Schrodinger equation for the wave function of a free particle.
Note that the Equation (5-4) are of different order because
partial derivatives in
the wave parameters co and k appear with different powers in Equation (5-2). We are
again reminded of the Planck-Einstein and de Broglie formulas
and we recall that the relation between w and k expresses the nonrelativistic energy*
5-t The Schrodinger Equation 223
2
P
i~=E. (5-6)
This connection holds for each component wave in the matter-wave packet <f
i . Our
observations suggest a simple procedure that produces the Schrodinger equation
directly from the energy relation. We find that Equation (5-4) is obtained immediately
if the two sides of Equation (5-6) are interpreted as differential operators according to the
substitutions
r) -
p
2 -* -h 2 —-, and E^ifi —r)
,
(5-7)
ox at
where the operations are allowed to act on the wave function ^(.v, / ). This proposition
introduces a pair of representation rules for the new quantum treatment of the physical
2
quantities p and E.
It has already been noted in Section 4-6 that the monochromatic wave functions
— — £'(** _w wave
cos(kx u>t), s'm(kx co<), and are possible solutions of the ordinary
equation. We now observe that the cosine and sine wave functions cannot possibly
occur as solutions of Equation (5-4) because of the appearance of the first-order time
derivative. In fact, the presence of the imaginary number / in the time-derivative term
leads us to expect a complex-valued time dependence in ^( x, t ).
may seem since the results are going to lead us to something new. A constant potential
energy V is incorporated in the energy relation for a particle of mass m by writing
—
p
Im
+ V=E, (5-8)
where both E and p are constant quantities for a free particle. We may use the simple
monochromatic wave function
- Ul)
*{x,t) = Ae ilkx (5-9)
and thereby assign a precise momentum hk, since we are not concerned with the
question of localization in this part of the discussion. The wave number k and the
224 Quantum Mechanics
2 2
h k
+ V= ho> (5-10)
1m
by virtue of Equations (5-5) and (5-8). This relation between w and k tells us that the
wave function in Equation (5-9) must obey the differential equation
2 2
h
2m dx
d
^ + V* = —
2
ih
d
dt
*. (5-11)
V
Thus, we obtain another version of the Schrodinger equation for a free particle, and
we conclude that no real physical distinction exists between Equation (5-4) and
Equation (5-1 1).
The same comments hold for the alternative wave function
- kx ~ at
*(x,0 = Ae i( \ (5-12)
We observe that Equations (5-9) and (5-12) refer to waves traveling in opposite
directions, since the expressions differ only in the sign of A', and that Equation (5-10)
controls the wave parameters without regard for this sign. It is therefore obvious that
both wave functions are equally valid solutions of the free-particle Schrodinger
equation.
We see the purpose of using a nonzero constant potential energy for a free particle
when we turn to the situation of interest, where the particle is not free and the
corresponding potential energy is not constant. A typical varying potential energy
might resemble the function shown in Figure 5-2. The energy relation maintains the
form of Equation (5-8), although the momentum p can no longer be treated as a
constant. It wave number k cannot be employed to parame-
follows that a constant
trize the wave function as in Equations (5-9) and (5-12). We can still regard £ as a
constant in Equation (5-8) and retain a constant angular frequency co. (We should
also say that our discussion assumes a choice of constant E that exceeds V(x) for all
values of x. This stipulation is already built into Figure 5-2.) It would seem that
Equation (5-10) has no further use now that V enters as a function of x. Let us
suppose, however, that we approximate the potential energy function V{x) in the
manner illustrated in the figure, so that the x axis is divided into pieces and average
values of the potential energy are substituted for V(x) in all the intervals. The figure
shows a new potential energy function that approximates V(x) by a stepwise construc-
tion of constant potential energy segments. The particle acts like a free particle in
every interval, and so the conclusions of the preceding paragraphs are still applicable
on this basis. We therefore identify a different constant wave number k interval-by-in-
terval and again rely on Equation (5-10) with a correspondingly different constant V
in each of the intervals. Hence, Equations (5-9) and (5-12) become valid forms for
ty(x, t), as Equation (5-10) holds in every interval for either of these wave functions
or for any combination. This piecewise construction of ^ satisfies the Schrodinger
equation as given Equation (5-11) for the case of a stepwise potential energy. We
in
can approximate the original potential energy function V{ x ) to any desired accuracy
by this procedure. Therefore, we argue that the corresponding exact wave function
ty( x, t ) can be found as an exact solution of Equation (5-11) by allowing the equation
to contain a potential energy that varies with x.
5-1 The Schrodinger Equation 225
Figure 5-2
E
1 1 1
1 1 1 1 1 1
Vj
\ —'V
1
We should look for plausibility instead of rigor in these arguments. The result
Example
kx ~ ut -««)
The functions e
'(-
)
ancJ ^'<~ are solutions of the Schrodinger equation
for a free particle, so any combination of the two functions is also a solution. Let
us gain some practice with the differential equation by considering the particular
combination
^( x, t) = A cos kxe
'<*>' _ \ gi(kx-ui) _|_
^i(-h-wl)]
1 2
h d h-
2
-\
k + V A \ cos kxe
2m ax- 2m
and
ih — ^ = huA
8t
cos kx
e~' ul
.
226 Quantum Mechanics
Schrodinger's wave equation is only one of the ingredients in the Schrodinger theory.
Attention must also be given immediately to the proper meaning of the solutions of
the new equation. Our remarks in Chapter 4 suggest that we adopt a probability
interpretation for these matter-wave solutions. We now wish to pursue this suggestion
so that we can learn how the matter wave is used to determine probabilities for the
random detection of a quantum particle. We continue with the one-dimensional
approach in order to introduce these important considerations while the theory is in its
observed properties of the system are obtainable from the solution, once the wave
function ty has been found. A given physical system is defined for a particle in one
dimension by a particular specification of the potential energy V(x). The resulting
differential equation for ^ is of first order in the time, and so a unique wave function
is determined if the initial behavior of the state is prescribed. This aspect of the wave
function is associated with the differential operator ih d/dt in the equation. The
Max Born
5-2 Probability Interpretation 227
presence of the imaginary factor implies that the time dependence of ^ must be given
by a complex-valued solution. This general property of ^ suggests that the wave
function itself cannot be a measurable physical quantity, as in the case of solutions of
the ordinary wave equation. Instead, the wave function must be a purely mathemati-
cal device to furnish probabilistic information about the state of the system. The
Schrodinger theory is then supposed to include further rules that determine the
various observable properties of the particle when the system is in such a state.
because this (and only this) operation on the wave function conveys the meaning of an
intensity for a complex-valued wave.
Let us postpone the actual definition of the probability for a moment and call
immediate attention to an especially desirable feature of the quantity ^*^. We
suppose that we have a system in which particles are transmitted by a double slit, and
we assume that the wave function for a single particle is written as the sum of two
complex waves
* = ¥, + %.
The separate parts of ^ are identified as single-slit wave functions with modulus and
phase such that
2|* 1
||* 2 |cos(4> 1
- 4> 2 ).
Figure 5-3
Ax-
incident on the double slit. We now clarify these observations by expressing the
probabilistic properties of ^ in terms of a new defined quantity.
Born's interpretation identifies ty*^ as the probability density for a system in a state
with wave function ty. This quantity depends on the variables that specify the degrees
of freedom of the system. Thus, a single particle in one-dimensional motion has
probability density
P{x,t)=\*{x,t)\\ (5-13)
in which Sk * ^ may exhibit a variation with time. Figure 5-3 shows a sketch of a
possible shape for the x dependence of P at a particular time t. If a certain value of
The probability of finding the particle in the finite interval Ax between the indicated
points x x
and x 2 is determined by computing the integral
f \V(x, t)fdx.
The total probability of finding the particle somewhere in the entire one-dimensional
space is then found by extending the range of integration to get
2
\^{x, t)\ dx.
/
Note that ^(x, t) must be a suitably localized wave function to ensure the conver-
gence of this integral. Note also that the numerical magnitude of ^ is not fixed by
solving the Schrodinger equation, since any solution ^ may always be multiplied by
any constant and still serve as a solution. The probability integral is employed to
remove this arbitrariness and set the overall scale of ty. The wave function is said to
be normalized if the solution ^ satisfies the additional restriction
/oo n
\*(x,t)\~dx = 1. (5-14)
5-2 Probability Interpretation 229
This normalization condition means that the particle has unit probability to be found
somewhere in all of one-dimensional space.
The probability density makes allowance for a possible time dependence in the
local behavior of the probability. This feature of the interpretation of ^ suggests the
introduction of another local quantity to represent the flow of probability. An
appropriate definition emerges from consideration of the free-particle equations
obeyed by ^(.v, t) and by <fr*(x, t):
2 2 2 2
h d d d 8
2m 3x
-V =
2
ih —*
dt
and
fi
2m dx
r^*
2
= -ih —
dt
¥*.
Note that the two equations are related by complex conjugation. These formulas may
be used to analyze the time dependence of the probability density for a free particle:
d 3* d**
dr ' dt dt
( h d
2
*\ l h d
2
**
\ lim ox J \
2im ox
h i d
2
* d
2 **
= \J/
* _ \W
2
2im 8x 3x 2
3
*
8* d**
\j/ >i/
~Yx 2im \ dx dx
h ( 8* d-** \
j(x,t)-— \* m -
lim \
r T
ox
-
~
ox
-*
)
. (5-16)
This new quantity appears in the defining equation as a flux of probability. Thus,
Equation (5-15) has the form of a conservation law in which a time variation of the local
probability is compensated by a space variation of the flux across the local region.
It is possible to cast the probability conservation law in integral rather than
differential form by considering the probability of finding the particle in some finite
interval Ax = x 2 - x v The rate of change of this quantity is evaluated as follows:
-d fx 2
P{x,t)dx=
fx 2 d
P— P{x,t)dx
-
dt J X] f JXi at
2
= ~ ^-j(x,t)dx=j(x ,t)-j(x 2 ,t).
f
Jx, ax
i
(5-17)
The result represents the net flow of probability into the interval Ax. Equation (5-15)
carries over in the case of a nonfree particle, with the same definition for the
230 Quantum Mechanics
2
\*(r,t)\ dT,
/ |*(r,0| ^.
•'At
The overall scale of the wave function is fixed by requiring ^ to obey the restriction
2
|^(r,/)| ^r = 1. (5-18)
fall space
This normalization condition generalizes the formula in Equation (5-14) for applica-
tion in three dimensions.
Section 4-6 has provided us with an intuitive picture of a matter wave for a
localized quantum particle. We have just learned that the property of localization is
described by the distribution in space of the real positive probability density ^*^.
Let us hasten to clarify our understanding of this quantity, since the notion certainly
does not mean that the particle itself is distributed over all space. Instead, the
probability of locating the particle is understood to have this interpretation. To
illustrate, let us appeal to the measurement process in a single-slit experiment and
recall that the location of the particle is not determined with certainty after passing
through the slit. We let the infinitesimal element dr represent the volume of a certain
cell in a particle detector, and we take the value of ^ * ^ dr to be the probability of
detecting the particle in the given cell. Note that the entire particle is either found or
not found in the cell at a particular instant. Hence, the process of measurement for a
random count somewhere in the detector system. If we make
single particle results in a
the same observations on a large number of similar particles in separate single-slit
experiments, we obtain a distribution of random counts that conforms to the distribu-
tion in space predicted by the probability density ^P*^. Thus, we find that the wave
function ^ for a single particle determines the single-slit diffraction pattern for a beam
of particles according to this probabilistic interpretation.
Example
We have glossed over a subtle point in the normalization of the wave function in
Equation (5-14). It is obvious that the element of probability <&* <}'dx may
<r
depend on t because of the t dependence of i . This dependence on / remains
after the integration over x, and yet the total probability is set equal to unity
5-3 Stationary States 231
with no account taken of the apparent / dependence on the left side of the
equation. We can justify the result, specifically for the case of a free particle, if
we consult Equation (5-17) and rewrite the formula for the time derivative of
the probability as
d
- fx 2
dt J Xl
^**dx =
2im
h I
\
**-3*
dx
d**
—
dx
It is clear that the probability of finding the particle in the finite interval
Ax = x2 — x x
may vary with t, since the right side of this equality is not
necessarily required to vanish. Equation (5-14) pertains to the infinite interval,
however, and so the question concerns the behavior of the equality in the limits
at, —> — oo and x 2 -» +oo. A properly localized wave function must vanish
rapidly at infinity so that
8* 8**
<$r * _ \J/- as x, — —
> oo and x, -» oo.
dx dx
co 2
Itfollows that the normalization integral / 00 vI'(^, t)\ dx has no t dependence, |
and so Equation (5-14) can be introduced to set the resulting constant value of
the integral equal to unity.
This expression is supposed to satisfy the Schrodinger equation, and so the product xpf
must obey a modified form of Equation (5-11):
2
h d 2xp df
-^—77/+
2m dx~
V4,f=ih)-
dt
Note that the partial derivatives acting on the wave function ^ turn into ordinary
derivatives acting on the factors \p and /. If we divide both sides of the equation by
.
*> j2
—
j
This result has an interesting structure, inasmuch as the left side of the equation
depends only on x while the right side depends only on t. It is possible to have an
equality between a function of x and a function of t only if each function is a
constant. We therefore set each side of Equation (5-20) equal to a separation constant X
and obtain separate equations for \p and / in which X appears as a common
ingredient:
2
h d 2^
2
+ Vrp = Xxp (5-21)
2m dx
and
df
ih-=\f. (5-22)
dt
Note that V must occur as a function of x alone; the method of separation of variables
would not apply otherwise. We conclude from this procedure that the Schrodinger
equation has solutions in the form expressed in Equation (5-19), provided the two
ordinary differential equations have solutions for the factors \p and /.
Let us look first at the solution of Equation (5-22). The first-order differential
equation can be solved at once to yield
-i\i/h
f( t )= e t (
5 . 23 )
A X
f(t) = cos — — / i sin — t.
h h
X
(0 = —
h
A unique energy may also be associated with this type of wave function by virtue of
the Planck- Einstein formula:
E= hco = \. (5-24)
Thus, Equation (5-19) describes a system in which the quantum particle is found in a
Equation (5-21), the other of the two ordinary differential equations in the construc-
tion of the wave function. We substitute E for A and rewrite the equation as
2
h d2
~1TT1^ X ) + V(x)4,(x) = E^(x). (5-25)
Im dx~
The resulting differential equation for \p(x) with energy parameter E is called the
time-independent Schrodinger equation. It is obvious that we must be given the potential
energy V(x) before we can proceed to solve for the unknown function yp(x). We note
that we are able to establish the form of the t dependence of ^ in Equation (5-19)
without any knowledge of the form of V(x). Equation (5-25) then plays the main role
in the remaining construction of ^. The product expression in Equation (5-19) is
indeed a valid solution of the Schrodinger equation, provided the value of E corre-
sponds to a physically allowable solution 4 ( x ) from the time-independent equation.
/
There is certainly no reason to suppose that every wave function is necessarily of this
sort. Indeed, we know that any number of these allowed solutions can be superposed
to generate a valid wave function. We also know that the determination of X
P( x, t ) in
state:
2 2
\*(x,t)\ =\^(x)\ . (5-27)
Hence, the state is said to be stationary because the probabilistic aspects of the
corresponding wave function do not vary with time. We have already attached
additional significance to this wave function by noting above that the energy of the
state is a precisely defined quantity.
We expect to find a set of allowed £"s and i^'s as solutions of Equation (5-25) for
any given potential energy V(x). These solutions represent the energy levels and
stationary state eigenfunctions for the particular dynamical system. We elaborate on
this observation by turning to special cases of quantum systems in the next two
sections.
Example
Let us accept the fact that the Schrodinger theory always yields a set of energy
eigenvalues E ,E2 ,...,
{
corresponding to the set of stationary state wave
234 Quantum Mechanics
functions
*,(*)«-*•"*, M*)*-*"*,
A particle in one of these states has a stationary probability density, as noted in
Equation (5-27). Suppose, however, that the state is a superposition of two
stationary states described by the wave function
*(x,t) = a^ x
{x)e-' E ^ h
+ a,t 2 {x)e~'^ l/h .
*(x,0) =a l
xp
i
(x) + a.^ 2 (x)
as the initial condition for ^(x, t). We then find that ty*^ is not t independent:
2 E >' /h
|*(x,0| = (afWeW + a*^*e'^' /h )(a^^ lE ^ /h + a 2 ^ 2 e-' )
2 - Ei)l/h
= \a l \^ i
(x)\ + a*a 2 iP*(x)UxyiE>
+ a*a iP*(x)^
l l ( X )e-*
E >-W' + \a 2 \
2
\$ 2 (x)\
2
.
|£, - E2 \
The state represented by the wave function ^(x, t) is not stationary, and so all
its probabilistic features can be expected to oscillate with the same angular
frequency w.
Several first principles of quantum mechanics have already made their appearance in
this chapter. It is appropriate that we pause and reflect on the new ideas by
considering models like the ones shown in Figure 5-1. We devote this section to the
simpler of the two indicated systems.
The illustrated example of a sliding bead on a wire is known in quantum
mechanics as the problem of a particle in a one-dimensional box. The classical
particle travels freely between the two ends of its course and abruptly reverses
direction at either end with no loss of energy. We describe this model by means of the
potential energy shown in Figure 5-4. The classical motion is confined to an interval of
length a, and the origin is taken to be at the center of the interval. The potential
energy function is then defined as
,w _/°
v
'
\ oo for
;°7*/2
\x\ > a/2,:r'
/2
- (5-28)
Figure 5-4
system as in the figure; hence, the kinetic energy of the particle is equal to E in the
region of vanishing potential energy. Classical motion is forbidden outside this region
because the infinite value of V exceeds any possible choice of E. Thus, the given
potential energy function provides barriers to confine the particle in the interval
[
— a/2, a/2] for any assigned energy.
Let us now insert the function V{x) in the Schrodinger equation and find the
stationary-state wave functions for the quantum particle. We learn as follows that
only certain discrete values of the energy E are obtained from Equation (5-25), the
time-independent equation for the eigenfunction \p(x). The equation takes the form
h
2
d 2^
2m dx
d 2^
~ + k^f = 0,
dx
where
2rnE
k = ' (5-29)
2
h
solution
~
\
for |*| > a/2.
ka ka
A cos V B sin — =
2 2
and
ka ka
A cos — — B sin — = 0.
2 2
(We use the evenness and oddness of the cosine and sine to write the second result.)
The two equalities can be combined to read
ka ka
A cos — =0 and B sin — =0.
2 2
We cannot allow both A and B to be zero, and we know that cos(ka/2) and sm(ka/2)
cannot both vanish for the same value of k. Only two other possibilities remain:
B = and cos
ka
—
2
= so that
ka
—
2
= —
tt
2
,
—
3v
2
. . .
.
or
ka ka
A = and sin — = so that — = 77 , 2 77 ... . .
2 2
Both options are permitted, and both make equivalent predictions for the parameter
k. The allowed values of this quantity are evidently restricted to the discrete set
kn =-n, (5-30)
a
rnrx
\p n { x) = A cos for x in the interval [ —a/2, a/2].
a
nirx
xp
n
( x) = B sin for x in the interval [ —a/2, a/2].
a
h h~ tt~
E„ = — k\ = rn" (5-31;
2m 2m a~
We note that the possibility of an n = state is ruled out because the corresponding
eigenfunction vanishes. The lowest allowed value of En occurs for n = 1 and gives the
i.2 2
n it
E, = (5-32)
1
2 ma
The nonzero result is interpreted as quantum zero-point energy in the ground state of
the system. We have encountered this notion before in our comments on localization
-iEj/h
cos e (odd n
a a a a
*„(*,') = for - - < x < - (5-33a)
2 mrx -' E r,'/ f
2 2
sin e
'
(even n)
and
The factor \2/a ensures that each tyn satisfies the normalization condition
/°° , ,2 ra/2 , ,2
\%{x,t)\ dx= I \%{x,t)\ dx.
_M J- n /9
We verify this property of the wave functions in the first example below. The energy
eigenvalues E n
appear in the / dependence of Equations (5-33) as multiples of the
ground-state energy:
E„ = n E,
2
(5-34)
Figure 5-5
a
/\ 7
VJ Vy
V=
and
a /2
cos
n 7TX
cos
n TTX
dx .
fa/2
/ sin —
"V
cos
n 7T.X
dx ,
/ a/2 a a J -a/2 a
5-4 The One-Dimensional Box 239
and
a /2 rnrx nrnx
sin sin dx.
/ a/2 a a
These expressions occur frequently in applications of the model. The integrals are
found to vanish, and the functions are said to be orthogonal, whenever the quantum
numbers n and n' have different values. Orthogonality is a general result for families
of eigenfunctions appearing in all kinds of problems. It is obvious in the case at hand
that the second of the three integrations is equal to zero, because the integrand is odd
and the range of integration is symmetric about the origin. The other two integrals are
left to be examined in Problem 20 at the end of the chapter.
Example
The problem of the particle in a box has many illustrations and applications.
Let us look first at the normalization of the wave functions in Equations (5-33).
We note immediately that the range of the normalization integral shrinks from
/f oo to j- a2/2 since the wave functions vanish outside the interval —a/2, [
a/2].
The cosine eigenfunctions produce the following integration:
fa/22 2 nwrx
^TITTX 2
*
fa/2 ra /2 l
1 / 2ri7TX
f°° f"/
I
T
I^J" dx
2
= —cos"
2
dx = — f — 1 + cos dx
•'-oo
f
J -a/2a a aJ- a /22\
a znirx
x + sin-
a \ 2niT a
-
a/2
The sine eigenfunctions produce the same integration except for the sign of the
second term in the end result. This contribution vanishes at the two limits, and
so both calculations reduce to the same first term and give the value unity, as
announced.
Example
Let us turn next to the formula for the energy of the ground state in Equation
(5-32) and apply the expression to a neutron confined in a linear region of
~
nuclear size. We take a = 10
14
m for the length of the box and get
2 2 2
h 7T [(1.05 X 10" 34 J •
s)t7/10~
14
m]
-27
2ma 2
1
This amount of zero-point energy is comparable with the scale of energies for
the constituents of nuclei.
Example
The quantization of states is the main nonclassical result of this model. Figure
5-5 reveals a separation between the discrete energy levels which increases with
the quantum number n. Let us interpret this observation in the light of Bohr's
240 Quantum Mechanics
correspondence principle and examine the limit of large quantum numbers. The
separation of adjacent levels is obtained from Equation (5-34):
A£„ = E n + l
-E n
= [(« + l)
2
- n-\E = x
(2n + 1)£,.
£„ " 2£ i
«
We recognize this as a sort of classical limit in which the discreteness of the levels
becomes increasingly difficult to establish, even as the levels grow farther apart.
Example
Let us investigate the classical limit from another angle by applying the
quantum formulas a macroscopic particle, such as a 5 g bead on a 40 cm wire.
to
= 6.80 X 65
X 10"-r-{
£, ;
2
-.
10 J
J.
1
3
2ma 2(5 kg)
It would be difficult to distinguish this quantum ground state from the classical
state of rest. Let us suppose instead that the bead is moving with speed 2 m/s.
We can try to associate a quantum number with this state of motion by
consulting Equation (5-31) and identifying the kinetic energy
2 2
m h ir
—v 2;
= n
y
'
2ma 2
~~h^
'
h 6.63 X 10 J •
s
The other of the two illustrations in Figure 5-1 shows the example of an oscillating
mass on a spring. The classical oscillator is an important topic in physics, and so the
5-5 The Harmonic Oscillator 241
investigation.
The classical particle is subject to a restoring force proportional to the displacement
from equilibrium. The oscillator has potential energy
dV
F= = — kx
dx
where k is the force constant of the spring. Newton's law then provides the differential
equation for the displacement of the particle:
m —— =
d x
2
— kx.
dt
and
dx
v( t ) = — = —u A sin( co / + <£)
dt
and thus parametrizes the motion in terms of amplitude A, phase angle <£>, and
angular frequency
wo =
V m
These expressions for x and v appear in the formula for the total energy and produce
a constant of the motion:
m k
E= K+ V = -v 2 + -x 2
2 2
m k
= (0
o^ 2sin2 ( wo' + <M + -^ 2 cos 2 (« + Z 4>)
Y
k
= -A 2
.
-(e-^
m\ 2
2
)
/
=J-(A
m
-x
V
2 2
)
242 Quantum Mechanics
Potential energy V(x) for a harmonic Classical probability density for a harmonic
oscillator. Classical motion with total energy oscillator.
E is not allowed in the region |*| > A, where P cl (x)
V exceeds E.
V(x)
Y////////A V/////////<
Forbidden Allowed Forbidden
It is obvious that x~ must not exceed A" if v is to be a physical velocity for a classical
particle.
The classical picture can be visualized with the aid of Figure 5-6, which shows the
potential energy V(x) along with a chosen value for the constant total energy E. We
note that any choice of energy is permitted in the classical treatment of the problem.
The figure indicates that
2E
V(x) = E = +A
at x
T
and that classical motion is forbidden if \x\ > A. If x is in the allowed region
— A, A], a classical probability density can be defined to express the likelihood of
finding the classical particle in a specified interval dx. The corresponding probability
Pd ( x) dx must be proportional to co dt, the portion of the cycle of oscillation that the
particle spends in the given interval. We find that the probability density is given by
the formula
/'.>(*) = (5-37)
n{£~-
The proof of this result is left to Problem 1 1 at the end of the chapter. Figure 5-7
shows how the function P c] (
x ) describes an increasing probability for intervals near
the turning points x = +A, where the oscillating particle reverses its motion.
We can use the harmonic-oscillator potential energy in a variety of physical
applications that do not actually contain a mass-arid-spring system. The interaction of
the two atoms in a diatomic molecule is a case in point. Figure 5-8 shows a graph in
which the potential energy V varies with the separation r between the two atoms to
simulate the main dynamical properties of the molecule. The function V has an
equilibrium position at separation rQ where dV/dr is equal to zero. We can place the
,
Figure 5-8
V(x)
1 / d 2V
V= V(r ) + 2
x
i
+
2 dr
Note that no term linear in .v is present since the coefficient (dV/dr) r vanishes. We
may neglect the higher powers of x if we confine our attention to bounded motion in
the neighborhood of the point r = r . The choice of reference level is immaterial and
can be shifted to eliminate the constant V(r ) and leave only the term quadratic in x.
The resulting potential energy is just like the harmonic-oscillator function in Equation
(5-36), as suggested in the figure. Any diatomic system is adequately described in
terms of harmonic-oscillator behavior as long as the energy stays near the minimum
value of V. The same conclusion holds whenever the physical problem of interest
involves a similar stable-equilibrium configuration.
The quantum description of the harmonic oscillator is based on the eigenfunction
solutions of the time-independent Schrodinger equation. These solutions determine the
quantum particle. We define this problem in terms
stationary states of the oscillating
ofEquation (5-25) by taking V(x) to be the potential energy given in Equation (5-36).
We then proceed to find a solution in the form of an eigenfunction 4'( x ) with energy
eigenvalue E. The differential equation for i// contains several bothersome constant
factors. Let us rearrange these constants so that the equation reads
d% 2m ink 2E\
Hx1
-£U = t*
and then manipulate the result to obtain the awkward-looking equality
ft d 2^ IE [W
mk dx
2
tVt
244 Quantum Mechanics
vmk
e = —-x 2
(5-38)
and
2E [W 2E
x=
TV7 =
w (5 " 39)
d'xb
—1 = (£2 -\H (5-40)
a?
ymk
-.2
max
= 1 A2 •
' ,
This expression simplifies when we use the classical connection between the amplitude
A and the energy E:
)/mk 2E
=
£Lx=-r-T
n k
X (classical). (5-41)
Our final result tells us how the dimensionless quantities £ and X are related in
classical physics. The relation has a special significance in the quantum problem
because of the presence of the factor (£ — X) in Equation (5-40). We observe from
2
2
the differential equation that the factors d \}//dt-' and \p have unlike signs for £-' < X
and like signs for £ 2 > X. We also realize <//(£) must have smooth behavior at the
that
matching points £ = ± vX (Recall that •must always be continuous, and note that
\p
dip/di; must also be continuous since d \p/di;~ exists at every point.) The two domains
2
£ < X and > X correspond to the allowed and forbidden regions of Figure 5-6.
!""'
Thus, it becomes apparent that the quantum problem has a nonvanishing solution in
the forbidden region, in sharp contrast to the classical situation. It follows that there
are nonzero probabilities of finding the quantum particle in locations where the
classical particle is never found.
We expect small probabilities for intervals in the region £" > X. This expectation
suggests the necessity for a suppression of the eigenfunction \p at large values of £". It
5-5 The Harmonic Oscillator 245
and
d 2yb 2
= t2 -f-'/2 _ ,-| 2 /2 = (* 2 _ 1^-S
f
/2
Thus, we see that the given function satisfies the differential equation provided A has
the special value
A = 1.
hco (l fico,,
The eigenfunction e
~^ /2 corresponds to a normalized stationary -state wave function
of the form
•/»
/ mk \
2
%(x,t)= -j-2 ^-v^* /2A,-<£o'/*. (5.43)
This simplest solution of Equation (5-40) produces the smallest possible result for the
parameter A and the energy eigenvalue E. Hence, the wave function ^„ and the
energy EQ refer to the ground state of the oscillator, and the quantity ^w /2 represents
the quantum zero-point energy of the system. The normalization and other features of
the ground-state wave function are discussed in the example below.
If we continue the analysis we learn that all other possible solutions of Equation
(5-40) are suppressed in the forbidden region by the same gaussian factor e~* ^ 2 The .
where the factor H( £ ) denotes a polynomial. This function is obtained first by deducing
\„ = 2n+ 1. (5-45)
Figure 5-9
Energy levels and eigenfunctions for the first four stationary states of the harmonic oscillator.
Shaded areas represent the penetration of the wave function into regions where classical motion
is forbidden.
7fitup
eigenvalue:
nu
En =
~Y
X» = W" + (5-46)
/?co„
The spacing is equal to the quantum oscillator energy hypothesized by Planck in his
treatment of the radiating blackbody cavity. The figure also shows how the eigenfunc-
tions "leak" into the classically forbidden region and how the nodes of \p n increase in
number with the energy E n
. This behavior is a general feature of any solution of
Schrodinger's wave equation.
Detail
and
d~H dH
-2£— +
,
(|
,
2
- 1)//
rfl
2
^
Therefore, our construction of >// produces a valid solution of Equation (5-40) if
—H
d2
.-
n - + (X-l)H-
dH
. (5-47;
The analysis of the quantum oscillator thus reduces to the problem of solving
this differential equation for every possible function H and parameter X. We see
that there is an immediate solution of the form
k
2^ a k i- for even n
even £=
"M) = (5-48)
£ a £
k
f° r °dd n.
odd k = 1
The differential equation is then solved by finding the coefficients a k . Our main
goal is to secure the important result in Equation (5-45) for the allowed values of
—
248 Quantum Mechanics
H n
= a + a2i
2
+ a4£
4
+ • • •
+aj" where an * 0.
The derivatives of H n
are
and
H" = n
2a 2 + 12a 4 £
2
+ • • •
+n(n - l)^!"" 2 -
[2a, + 12a 4 £
2
+•••+«(«- l)a„r~ 2 ]
-2€[2a 2 €+ •••
+™„r-']
2
+ (A- l)[a + fl
2|
+ ••• + „r] =0.
fl
The sum of the coefficients of each power of £ must vanish, power by power, if
this equality is to hold for all values of £. The coefficient of £" produces the
condition
X = In + 1.
a2 = —-X—
1
a = -na ,
5 - X
-a,
-'9 =
— —-
2
:
6
n
u
a.9
2 !
2(k-n)
'A + 2
a,.
(k + l)(k+ 2)
Equation (5-45) can be combined with Equation (5-47) to giv< : the Hermite
differential equation:
d2 H n
dH n
ff (O-l,
#,(£) = 2£,
H {i) 2
= ±e-2,
H,u) = %e- m,
Both the equation and its polynomial solutions are named after C. Hermite, a
French mathematician of the 19th century.
Example
Much can be said about the ground-state wave function ty () in Equation (5-43).
Let us begin by verifying the normalization. The probability density is given by
the time-independent quantity
1/4 1/4
mk mk
P( x ) = %*%=\—\ / \
,-«,
.
where £ =
/
I—
\
)
x,
1/4
/oo mk
/
-— \
e-tdx=-
1
rj
,oo 2
,-*#=-— j
., /-0C
e-tdl
..
The last step can be taken because the integrand is an even function. The final
result is equal to the value at oo of a tabulated function known as the error
integral
erf(a) = -j^ /
e~f dl-.
x
J= /
e~ dx
e~ r
(x ~
J =/J
2 x +r)
e~ dx- dy= dxdye~ .
J () JJ \ st quadrant
.
2
' dd
J =
2
[ ( rdre-' =
2 -2
The result is
J= \ir /2, and so the calculation finally gives the desired value
erf(oo) = ^J=
V7T
1.
£ in Figure 5-10. The indicated points £ = + 1 are significant for this state
because these values of the variable correspond to the extreme displacements of
the classical particle at energy E = ho) /2, as implied by Equation (5-41). It is
apparent from the figure that there is a nonnegligible probability of finding the
particle in the classically foi bidden region. The probability in the allowed region
is found in two steps, by computing
A
f P(x)dx = -j= Ce-P d£ = ' erf(l)
J -A vV ->q
and by consulting a table of the error integral to get erf(l) = 0.84. Hence, we
see that in the ground state the particle has 16% likelihood to be found outside
the range of classical motion. The n = state (or any other low-lying quantum
state) is not expected to exhibit classical behavior. We recall that the classical
probability density is given by the quantity P c]
(x) in Figure 5-7, and we note
that this distribution looks nothing like the behavior of ^^^ . Figure 5-11
shows a graph of ^o^io to illustrate the fact that the distributions begin to
compare with Pd when the quantum numbers become sufficiently large. Ob-
Figure 5-11
*o**o
-Classical domain-
5-6 Eigen functions and Eigenvalues 251
serve that all the wiggles in this n = 10 probability density occur in the
classically allowed region, and note that the average value of the distribution
approximates the shape of the classical function Pd .
The models in the last two sections provide insight into the workings of the
Schrodinger equation. These studies illustrate the properties of quantization, since the
stationary states of the two systems occur in discrete sets where only certain values of
the energy are allowed. We are now in position to examine the generality of this
E h
*(*,/) =^{x)e-' >/ .
The eigenfunction \p(x) and the energy eigenvalue E are obtained together by solving
the time-independent Schrodinger equation
- —h
2m
— + V(x)t =
2
d 2>L
dx~
E*.
Our goal is to explain the general circumstances behind the discrete occurrence of i/>
and E. We take the potential energy to be some given function V(x) and base our
claim of generality on the arbitrariness of this function. A representative potential
energy is provided in Figure 5-12 for use throughout the following discussion. The
graph also includes a chosen value for the total energy E, to be considered as a
candidate for an allowed energy eigenvalue in the Schrodinger equation. Note that
the selected energy satisfies the equality
These values of the coordinate play a significant role in the dynamics of the classical
and the quantum particle.
We begin with some classical observations regarding the kinetic energy
m
'
2
2
v
2
= -(£- V(x)). (5-50)
m
A real value of v is obtained from this formula only if the value of x is such that
E > V(x). We visualize this condition in Figure 5-12 by noting that one-dimensional
classical motion is bounded between the indicated points ,v, and .v., for the given energy
E. These special values of x are called the classical turning points. The figure shows that
the two points approach or recede from each other as E decreases or increases. We
observe that no turning points exist, and so no configuration of the system is possible.
252 Quantum Mechanics
Figure 5-12
Potential energy function V(x) and total energy E for classical motion with two turning points.
The classical particle is not allowed in the regions to the left of x, and to the right of x 2 The
.
if E falls below the minimum of V(x). We also note that the left turning point *,
remains finite recedes to infinity when E reaches a
while the right turning point x,,
certain critical value. The motion is unbounded in the positive x direction for any
larger choice of the energy. Our two models in the preceding sections can be seen from
Figures 5-4 and 5-6 to have the property that bounded motion persists, no matter how
large E becomes. The main thrust of our discussion in this section pertains to bounded
motion in the system described by Figure 5-12.
The conclusions drawn from Equation (5-50) are classical and do not apply to the
quantum particle. It is clear that the equation itself is in conflict with the uncertainty
principle since the formula defines an exact value for the momentum of a particle,
given an exact value for the coordinate. The classical turning points a:, and x 2 are
nevertheless important in the determination of quantum behavior. We can appreciate
their importance immediately if we rewrite the time-independent Schrodinger equa-
tion as
d 2^ 2m
~^(V(x) £)*, (5-51)
~~dx<
V'{x) 2m
= -T (V(x)-E). (5-52)
*(
The right side of the second equation represents given information with regard to the
variable x and indicates a change of sign at the turning points like that observed for
2
the quantity v in Equation (5-50). The quantum problem deviates from its classical
5-6 Eigenfunctions and Eigenvalues 253
Figure 5-13
counterpart, however, and offers a solution for all .v, in the form of the eigenfunction
ipi*), with nonzero probability density in the forbidden as well as the allowed regions of
classical motion.
Equation (5-52) prescribes a change of sign at a turning point as a property of the
ratio of unknowns \p"/ip. The equation expresses the ratio of the curvature of \p to the
value of \p at each point x and hence determines the shape of the eigenfunction on a
point-by-point basis. It is evident that the curvature of \p vanishes, so that \p has a
point of inflection, at each of the turning points. Thus, there is a one-to-one
correspondence between regions in which the curvature-to-value ratio is positive or
negative, and regions in which the classical motion is forbidden or allowed. This
observation is summarized in the lower part of Figure 5-12. Figure 5-13 goes on to
show the qualitative shapes of possible eigenfunctions deduced from Equation (5-52).
Note that \p may change its sign away from a turning point as long as \p" has a
coincident change of sign. The eigenfunction then has a node and a point of inflection
at the same point. The eigenfunction may even oscillate and produce a succession of
nodes, provided the region is one in which ip"'/\p is negative.
Equation (5-52) is not the only ingredient in the determination of i//. Appeal must
also be made to additional physical considerations beyond those expressed by the
differential equation. We recall that the probability density is a measurable quantity,
and so we be a continuous function of x with a unique and finite
require that ty * ty
value at every point, including the limits x -» + oo. The behavior at + oo is especially
important, since ^ *^ is supposed to be integrable over — oo < x < oo if ^ is to be a
normalizable wave function. The properties of continuity and finiteness must hold for
the eigenfunction \p(x) if they hold for '&*'&. These requirements on the behavior of
\p usually include one more condition. Equation (5-51) implies that d^/dx 1 is unique
and finite wherever the given potential energy V{x) is unique and finite. It then
follows that dip/dx must also be a continuous function of x. (An exceptional case is
not imposed in that problem because the value of V becomes infinite at the ends of
the box.) The additional conditions on ^(x) are to be incorporated in the procedure
for selecting a physically allowable solution. Some of the shapes drawn in Figure 5-13
can be eliminated from consideration on these grounds.
We can now assemble our new mathematical ideas and draw our main conclusions
about quantization. First, let us realize that we are concerned with an ordinary
254 Quantum Mechanics
Figure 5-14
0(X)
to be in the classically allowed region between the points x and x 2 in Figure 5-12.
l
We assign ^(x ) an arbitrary positive value and then examine different choices for
\p'(x () ) to see how the shape of \p develops for x > x . The upper part of Figure 5-14
illustrates some of the possibilities. Note that each possible \p turns downward to the
left of x., and fits smoothly onto another piece of \p that turns upward to the right of
x,y. Only one of the three indicated curves remains finite everywhere on the right; the
prospective solution has the property that \p
— > as x —> oo. We can tune \p'(x ) so
that i^( at ) assumes this particular shape for the given value of E. Ifwe then follow the
selected *p(x) to the left of x , as in the lower part of the figure, we are likely to find
that \p has unsatisfactory behavior as x —* — oo. We cannot expect i// to tend
asymptotically to zero on the left as well as the right unless we also tune a second
parameter in addition to i//(x ). The only remaining adjustable quantity is the energy
E. A properly behaved eigenfunction results when a suitable choice of E is made. If
we let E depart even slightly from its determined value we damage the asymptotic
behavior of the solution by forcing \p to diverge either above or below the x axis. This
argument rules out the occurrence of nodes in any classically forbidden region
extending to infinity, since \p must diverge once it crosses the axis in such a region.
Let us recall that the entire discussion pertains to values of E for which the
corresponding classical motion is bounded between turning points, as in Figure 5-12.
Our arguments imply that the energy E is quantized because only certain discrete
choices of E are found to have allowable solutions for ^(x). These discretely
determined stationary states are called bound states since the probability of finding the
quantum particle vanishes at asymptotically large distance.
It is also possible to have an eigenfunction for a value of E such that only a single
turning point exists. For instance, Figure 5-12 shows that x, becomes a lone turning
point when E is chosen large enough. The classically allowed region then has infinite
allowed above a certain threshold, once this constraint on \p is removed. The resulting
5-6 Eigenfunclions and Eigenvalues 255
Figure 5-15
Two eigenfunctions with different numbers of nodes. At x \p and ip\ have equal values, but
,
\p l has more negative curvature than ^ . Equation (5-51) then implies that £, must be larger
than En
2m
4/'
q
(xq) = -T2(V(Xq) -E ) O (*O>
<l>o(x)
<M*
stationary states are called continuum states. The associated wave functions have the
property that the probability of finding the quantum particle at large distance does
not approach zero.
We introduce a quantum number n to enumerate the discrete energy eigenvalues
En and eigenfunctions 4 „( x
/ The wave functions have the form
)-
%(x,t) = ^ n (x)e-' E ^ h
for the corresponding stationary states. We have devoted Sections 5-4 and 5-5 to
circumstances in which only this discrete category of stationary states can arise. These
models exemplify the general situation, where an eigenfunction $ n (x) has certain
nodes, all occurring at points in the classically allowed region, and where the quantum
number n denotes an ordering of the number of nodes with increasing energy E n The .
fact that the nodes of \p n increase in number as the energy E n increases can be
established by means of a rather technical proof. Let us summarize the content of the
proof qualitatively by noting that the chain of argument takes the following steps:
The link between the curvature and the energy eigenvalue is supplied by Equation
(5-51). Figure 5-15 illustrates this part of the reasoning with the aid of a simple
construction.
The potential energies in Figures 5-4 and 5-6 have the special property of
symmetry under the parity replacement x -» — x. The resulting stationary states are
therefore found to have definite even or odd parity. Our arbitrarily chosen function
V(x) in Figure 5-12 lacks this special symmetry feature. Hence, the eigenfunctions in
the general discussion do not necessarily have any predictable behavior relating
positive and negative values of x.
It is clear that the time-independent Schrodinger equation can have a collection of
distinct eigenfunctions 4' n ( x ) with energy eigenvalues En The. members of this set of
256 Quantum Mechanics
Figure 5-16
for any two different quantum numbers n and n' . We have already seen orthogonality
in the context of the particle-in-a-box problem at the end of Section 5-4. The proof of
the general formula in Equation (5-53) is demonstrated example below.
in the last
Our description of the eigenvalue problem is taken from an area of mathematics
known as Sturm-Liouville theory. This general analysis of the solutions of second-order
ordinary differential equations is named after the 19th century mathematicians C.-F.
Sturm and J. Liouville.
Example
Consider the V-shaped potential energy shown in Figure 5-16. The associated
differential equation has unfamiliar analytical solutions; however, there is no
mathematical obstacle to deter us from making the following qualitative re-
marks. Each choice of energy corresponds to bound classical motion with two
turning points, and so the sign of the curvature of \p varies according to
Equation (5-51) in three regions of the x axis. The shape of V(x) tells us at once
that all the stationary states are discrete and that the eigenfunctions \p n have
definite parity. We expect the nodes of \p n to increase in number with E n , so
that a given eigenfunction must have one more node than its predecessor. It
follows that the parities alternate with increasing n and ascending energy. These
observations on the curvature, nodes, and parity of the eigenfunctions are
displayed qualitatively in the figure.
5-6 Eigenlunclions and Eigenvalues 257
Figure 5-17
vn -
Mx)
Example
Next, we examine an eigenvalue problem that we can solve in more detail. Let
the potential energy be given by the square-well function in Figure 5-17, where
and consider the determination of the energy and the eigenfunction for the
ground state. We note that V is a symmetric function of x and conclude that
4>(x) must be an even function with no finite nodes. (The eigenfunction has to
be either even or odd. An odd function would have at least the one node at
-v = 0; however, the even function with no nodes would have the least energy.)
The definite parity of tp( x implies that the solution can be found by con-
)
2mE
= -kfy with k x
=
4>(x) = A cos k x ,
since the other possible solution sin£,x is an odd function. In the region
x > a/2 where V = V , the differential equation becomes
—
d'^ , „
2m(VQ -E)
-
2
= k;\b
2Y with kl2 = 7,
2
.
dx h
$(x) = Be~ k * x
258 Quantum Mechanics
k a l
-k,a/2
A cos
k.a
-k.Asm
.
= -k„Be'^ a/2 .
We divide the second of these equations by the first and make several cancella-
tions to find
k a
{
k, tan = k .
ma 2 V -E
~^E =
The formula has an interesting limit as V — > oo. We see that the argument of
Figure 5-18
I'
tan
Solution
5-6 Eigenfunclions and Eigenvalues 259
the tangent approaches 77/2, and we find that the ground-state energy becomes
equal to E x
, the energy obtained in Equation (5-32) for the problem of the
one-dimensional box:
t2 1
ma m n 77
E = E, =
2
~V? 2 2 ma
Note also that k — > oo as V — » oo, and so ^(x) no longer penetrates the
classically forbidden region in this limit. The original formula for E assumes a
tidier form when we introduce E x
as a parameter:
77
tan —
2 /•:,
The most useful way to solve this transcendental equation for the ground-state
energy is to plot both sides of the relation versus E and look for an intersection
of the two graphs. The procedure is employed in Figure 5-18 to provide a
solution below the limiting energy £",.
Example
m ax
—
h
Im
d*M
-— + V
ax
( X )W E n
.xb*.
T n
(We assume that V(x) is real, and we use the fact that En , is also real.) If we
multiply the equation by \p* and the second by ^„, and then subtract the
first
d'w
im dx
2 r ^i =(E
2 n
-E ,u:4 n n
.
dx
The left side of this equality can be rewritten and then integrated as follows:
h~ re
dJZ
2m 4>n\ ^ Yn *,
J dx\ dx dx 2m dx dx
The result of the integration is zero, since properly behaved discrete eigenfunc-
tions tend to zero at + oo. The right side of the equality must therefore vanish
upon integration:
/oo
4>:4„dx = o.
-oo
The wave function and the probability density are fundamental to the analysis of any
quantum system. We use the wave function ^ to specify a physical state, and we then
suppose that the probability interpretation of ^ determines all observable aspects of
the system in a manner consistent with the constraints of the uncertainty principle.
The probability density is only the first of a list of measurable quantities to be
associated with the given state. The list goes on to include position, momentum,
energy, and all the other observables of a quantum particle. We may regard the
values of these physical variables as information encoded in the wave function. Our
next objective is to learn how this information is extracted for any specified state of the
system.
Classical mechanics treats the position of the particle at time t as the primitive
observable quantity. The momentum p(t) is then found from the coordinate x{ t ) by
computing
dx
K,)-m-.
Quantum mechanics takes a different approach beginning with the description of the
state at time /. The normalized wave function ^(x, t) is then employed to construct
the probability density
We know that the concept of localization is embodied in P(x, /); hence, we expect the
probabilistic definition for the position of a quantum particle to reside in the
interpretation of this quantity. Let us visualize our localized particle in terms of Figure
5-3 and recall that such a distribution at time / assigns a likelihood P(x,t)dx for
of objects. We assume that there are N objects of several different types and that the
ah type of object occurs «, times in the sample. We wish to evaluate a quantity A
whose value is equal to A }
for every object of type i. The fraction n t
/N denotes the
likelihood that a given object is of this type, and so the average value of A is
W = E^--W-. (5-54)
J
5-7 Expectation Values 261
The wave function ^(.v, /) describes an analogous sample of probable positions for
a particle at time t. Figure 5-3 shows an example in which the coordinate x serves as a
continuous index and the probability P(x, t ) dx acts as a likelihood factor for each value
of x. Any x-dependent quantity A has an average value defined as in Equation (5-54),
except that the summation over the discrete index is replaced by an integration over
the continuous variable:
/oo
A{x)P(x,t)dx. (5-55)
-oo
This formula may be recast to exhibit the explicit wave function in the form
(A) = C **{x,t)A{x)-*{x,t)dx.
J- 00
(5-56)
Note that the quantity of interest A{x) enters multiplicatively and may therefore
appear anywhere in the integrand. We discover the motive for inserting A(x) between
ty* and ^ as we proceed with further related developments.
The average value of the coordinate has a special significance in this formalism.
We write
=
<*) f* **(x,t)x*(x,t)dx, (5-57)
and refer to (*) as the expectation value of the position of the particle at time t. The
terminology is linked to our earlier remarks regarding the value expected for the
measured quantity. It is clear from Equation (5-57) that (x) may depend on /. Thus,
the role of x(t) in classical physics is transferred to the expectation value ( x ) in
quantum mechanics. Consequently, we anticipate a definition for (p), the expectation
value of the momentum at time t, which satisfies the relation
d
(P)=m-(x) (5-58)
for a nonrelativistic particle. In fact, this formula tells us how to define (p) when we
return to that question below.
A good example of an .v-dependent observable is provided by the potential energy
function V(x). Equation (5-56) expresses the corresponding expectation value as
/oo
**V(x)*dx. (5-59)
oc
/oo
2
-**x *dx.
-oo
We note that (jt) and (x) 2 are determined by evaluating integrals, and we
different
therefore realize that there is no necessary reason to expect an equality between the
two quantities. It is apparent that a special type of state ^ must be involved if (x 2 )
and (x)- are to have equal values.
262 Quantum Mechanics
The definition of the uncertainty Ax assumes that the average value (x) has
already been computed, as in Equation (5-57), for a particle in the state ^. We then
imagine that we conduct a large number of experiments on particles in the same state
and observe the departure of the position of each particle from the expected value, as
measured by the behavior of the variable x — (*). The root-mean-square of this
deviation is defined according to Equation (3-4) as
to give the desired formula for the uncertainty in the position of the particle. We can
simplify the resulting expression for Ax by the following manipulations:
2 2
(A*) = ((, - (x)) ) = (x 2 - 2x(x) + (x) 2 )
= (x 2 ) - 2(x)(x) + (X y
2 2
= (x ) - <*> - (5-60)
We know that the integration over x, denoted by the symbol ( ), always results in a
quantity independent of x. We
use this observation above to factor (x) out of the
integral expression (*(*)) in the second of the three terms obtained from ((x —
We (x — (x))
2
(x)) ). also note that is always equal to zero and therefore cannot be
used to represent the uncertainty in x. The final formula in Equation (5-60) tells us
2
that (x ) and (x) 2 are equal only for special states in which the variable x has no
uncertainty.
We turn next to the momentum of a quantum particle and consider our first
d d rC
™-(x) m— I ty*x'i'dx
dtJ-
/SO O
— (y*xy) dx = m
/-00
/
/
xy + y* x .
dx.
~d~7
The potential energy V(x) is taken to be real valued, and the Schrodinger equations
for ty and for ty* are consulted, to obtain the relations
ih d
2
* V ih d
2 ** V
+ —^ and \p1
17 2m d; ih ~d7 2m dx 2
ih
We insert these formulas in the expression for md(x)/dt and continue to manipulate
the resulting integral:
ih d
2
** V ih d
2
* v
in
I
+ * dx
2m dx' ih 2m dx' ih
ih d
2
* d
2
^*
-x^ dx
I dx 2
ih re d d* 3** d*
~2
I
** x^ + ^*^ 24" dx
J- dx dx dx
The endpoint contributions vanish because ^ approaches zero rapidly at +00, and
the integral term remains as the desired result for md(x)/dt. We return to Equation
(5-58) and write this important conclusion in the form
/oo ho
**(x,t)-—*{x,t)dx. (5-61)
-00 1 ox
Note that the differential operation {h/i)d/dx must appear between ty* and ^ in this
formula.
An important procedural result emerges when the expression for (p) is compared
with our previous formulas for (a) and (A). Whenever a wave function ^(x, t) is
used to describe the state of the system, it is correct to represent the momentum
variable by means of the differential operator (h/i)d/dx, where the operator acts on
the x dependence of the wave function. We write this representation rule symbolically
as
h d
l ox
2 2
h 3 \
d
= -h (5 " 63)
~Y~
1ox \
/
*T*
ox
as another operator acting on ^(.v, t). These rules can be invoked to compute the
momentum uncertainty A/? according to the formula
2
{^pf=(p )-(p)\ (5-64)
since the development of Equation (5-60) for Ax is equally valid for A/? or any other
uncertainty. With these definitions of the uncertainties in v and p it is possible to
prove the uncertainty principle as stated in Equation (4-6).
The energy of a quantum particle is another primary observable to be identified by
these means. Let us proceed by regarding the classical nonrelativistic energy relation
as an equality of expectation values:
<*> = + <">.
(£)
We use Equations (5-59) and (5-63) to write the equality in terms of an integral:
2 2
/c h d
iH <f+ V(x)* <h
Im ox'1
/oo u
,
^*ih — ^dx.
,)
ot
dt
(5-65)
The final formula holds because the wave function ^ obeys the Schrodinger equation.
The evaluation of (E) employs a time derivative inside an integral, just as the
evaluation of (p) employs a space derivative. The above result for (E) is the basis
for a second representation rule
d
E -* ih—, (5-66)
ot
264 Quantum Mechanics
whereby the energy variable is represented by letting the differential operator ih d/dt
act on the wave function ^(x, t). We have entertained the possibility that p and E
are expressed by means of derivatives in Section 5-1, and now we see the underlying
reasons for the proposition. The operator assignments for p and E are general and
apply beyond the context of our one-dimensional treatment. We should be prepared
to use these representation rules, and others yet to come, whenever we wish to pass
from classical physics to Schrodinger's quantum theory.
It is obvious from Equation (5-57) that (*) is a real-valued quantity, as must be
the case since the value of (*) is measurable. The same must be true of (p) and (E),
despite appearances in Equations (5-61) and (5-65). The real-valued property of these
quantities is left to be examined in Problem 26 at the end of the chapter.
We can give a further interpretation to the stationary states of a system if we look
at the expectation value of the energy in such a state. We let the system be described
by a wave function with energy eigenvalue £„,
and obtain
ih — % = ^ {x)E e" E^
3
n n
h
= En % (5-67)
under application of the energy operator. The expectation value of the energy then
becomes
/oo O /-oo
%*ih—%dx = En \ %*%dx = E n
-oo ut •'-oo
2
d \
ih—\ *., = E 2 ^„
and
< £
W>*K)'*- a = £ --
The fact that (E 2 ) and (E) 2 have the same value tells us that the energy of any
stationary state is equal to the energy eigenvalue for that state with zero uncertainty.
We recall from Equation (5-27) that the probability density is independent of time
in a stationary state:
2 2
!*«(*» 1
= I
^.(*) I
-
Time independence implies a sort of nonlocalization of the particle with respect to the
time, since the probability of finding the particle has no time variation. We know that
localization in time and uncertainty in energy obey an uncertainty relation, and we
associate this observation directly with the vanishing of the energy uncertainty A.E in
the given state. Equation (5-67) formalizes the observation by stating that the energy
operator ih d/dt acts on a stationary state to produce the same state multiplied by its
energy eigenvalue. This kind of eigenvalue equation does not hold necessarily for a
5-7 Expectation Values 265
more general type of wave function. We stress the importance of the special property
conveyed by Equation (5-67) by letting the stationary states be known as eigenf unc-
tions of the energy operator, or energy eigenfunctions
Example
We learn more about the special nature of the stationary states when we
consider a superposition of two such wave functions:
^ = c% + c'%,, where E„ ¥= E n
,.
3
ih — * = cE % +
ot
n
c'E„,%,
/OO /-00
We and
recall that ^ ^
are normalized and orthogonal, as in Equations
n
/OO d
/OO
(
c **r* + c >*y* )(cE ^ n
+ c 'E n ,%,)dx = \c\
2
E„ + \cfEn ,.
- oo
These two results imply that the energy (E) lies between the eigenvalues E n
and
E n
,. We interpret the quantities |c|"' and \c'\
2
as probabilities of finding the
system in the stationary states with energies E n
and E n
,. It is obvious that the
state ^ is not stationary and that the corresponding energy uncertainty is not
zero.
Example
There is no simpler illustration of the ideas in this section than the problem of a
particle in a box. Let us take the particle to be in its ground state so that, for x
in the interval [
— a/2, a/2], the wave function is
h 27T~
/2-cos — "7TX
e
-' E ^' /h
with£,
2ma~
266 Quantum Mechanics
This integral vanishes on inspection, because the integrand is odd and the range
of integration is symmetric. The expectation value of p has the form
/oo
**-—*dx
h
l (IX
d 2 h
f /2 _ _ _ _
a
J- a/2d I
I
I
\
TT
a
\
cos — —
TTX
a
sin
77
a
X
dx
and vanishes for the same reason. The evaluation of (x 2 ) produces a nonvanish-
ing result, with the aid of a table of integrals:
3
2 a
<*
2
> = r - oo
^*x 2 ^dx= ["' -x 2cos 2
J -a/2 a
TTX
dx
a 77
J
I 77
24
2
1277^ T (* -6).
2
The calculation of (p ) is easier to perform because the Schrodinger equation
can be used:
(p
2
) =
f H-A^ *A
2
=
J
r A> ¥*[ 2mi'A— Wd* = 2m£, f'
J. l
***dx = 2mE,.
a/2 \
dt) a/2
(Axf=(x')-(xf 2
77 12
and
/t7
2
-6
Example
We now combine the main features of the previous two examples and assume
that our particle in a box is described by the wave function
*(*,«) cos — e
- ,E
>'/* + sin
277*
-4i£,//A
fa
5-7 Expectation Values 267
(,v> = **x<lrdx
f°°
ra/2
/•a/2
J - a/2
n /'» a
-
x
cos
2
77 .V
a
h cos —
77 .V
a
sin
2tt.v
a
(e
3,E,l/
!/n
<
+ e
3iE,t/h
-f sin
o
2
27TX
a
\
dx.
The first and last terms in the integrand are odd and do not survive the
integration. Only the middle contribution remains to be computed with the help
of a table of integrals:
<*>
C
1
,3<£,</A
= -(^'
a
''
f e
-3i£,f/«) f
'
a/2
-n/:>
:os —
77.V
a
sin
2tTX
a
dx
2
— cos
a
3E
h
i
t
•
— =—
8a 2
9tt~
t
16a
rcos
977"
?>E
h
x
t
This result describes an interesting time dependence for the average position of
the particle whereby (x) oscillates with amplitude I6a/9v 2 and angular
frequency ?>E /h. The behavior of (*)
x
is sketched in Figure 5-19, along with the
shape of the initial wave function ty( x, 0). The expectation value of p calls for a
Figure 5-19
2nx
ty(x,0) =
1
-j=\ cos
/
—
77 x
+ sin
—
V(x, 0)
—
/oo h d
**-—*dx
-oo I OX
fa/2
'
-a/2ia
h_l
/2 ia \
\
—
77*
a
e'
E ^ h
+ sin
2tTX
a
e
4i W
77
-sin
a
—
TTX
a
e-'
E ^ h
277
+ —cos
a
277AT
a
,-«*.'/*
Nonvanishing contributions arise solely from the even terms in the integrand:
A'
hTT I 277 77*
<P)=--\e hE l/h f /
flit
ia \
_
'
..
J
ra/2
ra /2
a/2
-a/2
sin
s
a
sin —a
dx
, „ ,,.
_ le -i,E,i/h fa/2
/
cos _
VX
cos
2-TTX
dx
\
\
J-a/2 a a
J
tin ( \a 2a \
= e
3iE t/k t .
2e~ 3!Et
' /fl
2
ia \ 3t7 3t7 /
= — — /?77
ia
-
\a
3tt
2i sin
3E
h
x
t
=
8h
3a
sin
3E
h
l
t
Let us finally make a test of the correctness of the / dependence in (*) and (/>):
d 16a / 3£, \ 3E t
m — (x)/ = m -\
x
sin
dr 9t7
2
\ h ) h
\bma h\ 2
3E,t 8k 3E,t
sin = sin
2 2 "
3t7 £ 2ma h 3a'
The framework of quantum physics supports the theory of quantized energy levels as
originally proposed by Bohr. Features of Bohr's picture can already be seen in the
assembled formalism, even though the presentation has been limited to one dimension.
It is actually quite instructive to look at the picture in this context because the
one-dimensional models are useful sources of insight for the more realistic situations
able to adopt an approximation procedure for the treatment of this complex problem.
We continue to use the familiar quantized states that arise from the internal dynamics
of the quantum particle system. We then let the resulting levels experience the small
effect of a perturbing interactionbetween the system and the electromagnetic fields.
The most natural way to admit the added influence of electromagnetism is to assign a
charge to the quantum particle and let the accelerated motion of the particle produce
classical radiation. This hybrid semiclassical approximation takes the energy levels
from the Schrodinger theory and includes the excitation and deexcitation of the levels
by appealing to classical radiation theory.
The procedure immediately accommodates Bohr's hypotheses. In fact, the desired
stationary states and radiative transitions are easily recognized, just by inspecting the
time dependence of the probability density \^(x, t)\
2
We let e denote the assumed .
charge of the particle so that we can use the expression for the expectation value of the
coordinate x to introduce the electric dipole moment
e(x) = e r
J- 00
x\*(x,t)\
2
dx. (5-68)
We are especially interested in circumstances where this quantity oscillates with time,
because we know that an oscillating dipole emits classical electromagnetic radiation.
No time dependence can appear in the dipole moment, and so no radiation is
/oo 2
*| \p{ X) |
dx.
iE iE„l/h
*(*,/) = c+ n ( X )e- -'/ k + c'yf, H .(x)e-
/oo
i{E "- E » )t/h
x{\c^f + c*c'$$y\> n >e
-oo
The integrals containing the two stationary terms are equal to zero for eigenfunctions
of definite parityand are not associated with radiation in any case. The other two
contributions have an oscillating time dependence, as desired, with frequency
E ~ E.
n n
v= 1 •
.
2 70 Quantum Mechanics
able to emit electromagnetic radiation of the same frequency. The associated quantum
system has a determinable probability of transition from a higher to a lower energy
level, resulting in the emission of a discrete spectral line. Thus, the transition states
automatically offer the means by which Bohr's hypothesized radiative transitions are
to be understood.
The wave function is supposed to contain the answer to every possible question
pertaining to a given quantum system. Transition states are equipped especially with
the information needed to predict intensities for the observed spectral lines. Informa-
tion of this sort appears specifically in the complex-valued coefficients, or amplitudes,
which accompany the oscillating time-dependent terms in Equation (5-69). We know
from classical physics that the radiation intensity is proportional to the square of the
amplitude of oscillation of the electric dipole moment, because the electric and
magnetic fields are directly proportional to that quantity. The complex amplitudes in
Equation (5-69) are called quantum mechanical transition amplitudes since the
corresponding squared moduli determine the probabilities for the transitions n —* n'
and ri —* n. We introduce the dipole transition amplitude for the transition n —* n' by
choosing the third term in the equation and selecting the particular integral factor
[%J(xK(x)A. (5-70)
This integral specifies uniquely the mutual contribution of the two states involved in
the transition. Indeed, the strength of the transition by the is largely determined
magnitude of &„„, as the predicted intensity of the spectral line depends on the square
of the modulus of this amplitude. Note that Equation (5-69) also contains a similar
quantity, with n and n' interchanged, pertaining to the other transition n —* n. This
second integral is not independent of the first, since the two amplitudes are clearly
connected by the relation x*n = &„'„. .
It may happen that the integral in Equation (5-70) vanishes for some pair of
quantum numbers n and n' The transition n —* n' is then not allowed to occur with
'.
Example
Let the quantum system be the harmonic oscillator of Section 5-5, and consider
the possibility of electric dipole transitions between the energy levels. We wish to
show that the amplitude x n n is equal to zero unless the quantum numbers n
.
and n' are linked in a very selective way. Recall that the eigenfunctions ^„(x)
for this model are written in terms of Hermite polynomials as
rnk >/ 4
\
H {$)e~ e/\
n
where £ = | -^ \
5-9 Barrier Penetration 271
We consult Problem 16 at the end of the chapter to find the recursion relation
Hn+l -2£H n
+ 2nHn _ ]
0,
\
(" H (i)[H M)
n n + 2nHn _ (0]*- e H.
l
•'-oo
/OO /-OO
-oo '-oo
applies to the second term when ri =£ n — 1. Thus, we find that dipole transi-
tions are allowed between states n and n' if and only if the quantum numbers
are related by the selection rule
n' = n ± 1.
We have had little opportunity to discuss continuum states since almost all our attention
has been devoted to systems with quantized energy. Recall that stationary states may
exist in the continuum, where the energy eigenvalues vary continuously in excess of
some threshold energy. Figure 5-20 indicates how such a state might occur and shows
the general behavior of a typical eigenfunction. We observe that any energy in the
continuum range corresponds to classical motion that is not bounded by a pair of
turning points. Consequently, the associated eigenfunction is able to oscillate inde-
finitely at large distance, like the example shown in the figure. We wish to study this
Figure 5-20
Eigenfunction for an energy level in the continuum. There is only one turning point on the left
forany energy E above the threshold. The eigenfunction oscillates in the region of classical
motion, which extends to infinity on the right.
V(x)
Threshold
,i(kx-wt) ,i(-b-ul)
and
are introduced as elementary wave functions. We can obtain standing waves instead
by constructing the combinations
and
e
i(kx- u i) _ e
n-kx-ut) = 2ismkxe-' ul
¥k
2 1.2
E= hu> =
2m
We note that the two traveling forms are distinguished by their respective behavior as
eigenfunctions of the momentum operator:
h d
i dx
Figure 5-21
V(x) V(x)
momentum eigenfunctions. We note instead that the two standing waves have their
own distinctive attributes of definite positive and negative parity. We are going to use the
relations among these free-particle eigenfunctions as we proceed with our main topic.
Barrier penetration is a process that enables a quantum particle to leak through a
classically forbidden region where V(x) is greater than E. We simplify the mathe-
matics of this problem by assuming that V( x ) consists of piecewise-constant segments
in the shape of a rectangular obstacle to the motion of the particle. Figure 5-21
illustrates the encounter of a classical particle with a potential energy barrier of height
V Both classical and quantum versions of the problem require that we specify
.
whether or not the energy of the particle exceeds the height of the barrier. The figure
shows that the classical particle experiences either reflection or transmission in the two
instances. These cleanly separated alternatives do not apply to the quantum particle,
as some transmission occurs for E < V and some reflection occurs for E > V . We are
especially concerned with the first situation since the case E < V pertains to barrier
penetration.
As preparation, letus solve for the wave function in the presence of a single-step
potential energy. We employ the function
x <
V{x)
V x>0
and consider a stationary state with E < VQ , as indicated in Figure 5-22. It is clear
that the point x = divides the axis into two regions such that classical motion is
Figure 5-22
V(x)
V —
</.(*)
274 Quantum Mechanics
allowed on the left and forbidden on the right. We wish to find the eigenfunction
solutions in each region and then match the functions smoothly at the turning point.
For x < 0, the eigenfunction satisfies
2
d 2m
- T=
xp
\p = A cos k x x
+ B sin k x x
in the region x < 0.
(The complex exponential functions e' k{X and e~' k <" are alternative solutions to be
employed below.) For x > 0, the differential equation becomes
/-'»// 2m
-rj=k& with k\=--(V -E). (5-72)
dx h'
Both e 2*
and e~ k2 * are possible solutions, but only the latter choice remains finite as
x —» oo. We therefore write the eigenfunction as
\p = Ce~ 2"
in the region x > 0.
*o \
x <
x > 0.
A sketch of this result in Figure 5-22 shows a standing-wave configuration, with nodes
at fixed locations along the negative x axis.
It is instructive to recast our solution in terms of the complex exponential
representation. The eigenfunction for negative x takes the form
i// = C
1
-('''*'* + <?~' v )
- —
k9
(e'''
,x
~ e~'
k'x
)
c
'
+ i— \e'
k
<* +1-1— \e- ,k >
2 *. *i
CI k,\ CI k.A
^ 1 + i-t ^i(*,*-«0 +_!_,_ ^i(-*,x-«0
in the region * < 0.
2 \
a:. / 2 \ A:, /
5-9 Barrier Penetration 275
2
k
1 + i
— 2
= 1 - -
1
k
—2
*i *l
Consequently, the incident and reflected traveling waves are able to combine and
produce the standing wave shown in the figure. The wave function becomes
This expression exhibits penetration and attenuation into the forbidden region in the
manner discussed in Section 5-6.
Our stationary-state wave function describes the reflection of a quantum particle
with definite wave number kv The momentum uncertainty is zero and so the
description is not that of a localized particle. To achieve localization we should make
a wave packet by superposing a continuous range of k x
values as in the construction
of Equation (5-3).
The comparison between real- and complex-valued solutions completes our pre-
paration for the problem of barrier penetration. We parametrize the barrier prob-
lem according to the model on the left in Figure 5-21. The barrier height V exceeds E
and is defined by the potential energy
x <
f
V(x) = lv < x < a
'
x > a.
The solution is more involved in this case because there are three regions and hence
three pieces to the eigenf unction. For x < and for x > a, the differential equation
for 4*( x ) is
d 2^
TT
lx-
= -W
k,x k,x
as in Equation (5-71). The complex exponential solutions e' and e~' are chosen
this time, for reasons that become clear below. If x is in the interval (0, a), the
eigenfunction satisfies
d 2xb
dx~
klX
as in Equation and the real exponential
(5-72), solutions e and e~ k -
x
are again
obtained. The variable is bounded in this case, so can that neither of the exponentials
be discarded because of unacceptable behavior at large distance. It would therefore
appear that our eigenfunction should contain all possible terms in the general form
In fact, we can remove a term from one of the pieces of ip if we appeal as follows to
the physical conditions of the problem.
Let us refer again to the classical picture, and let us choose to consider a quantum
particle incident on the barrier from the left as in Figure 5-21. This choice eliminates
lk x
the term Be~ '
A + B = C + D from ^(0),
ik {
(A - B) = -k 2 (C - D) from i//'(0),
Ce'
k'a
+ De k* a
= Ae ,k a '
from xP(a),
- k {Ce~ k '"
2
- De k * a
)
= ik^Ae ^"
1
from xp'(a). (5-74)
A solution can be extracted from these equations by finding four of the unknowns in
terms of a fifth arbitrarily chosen coefficient. The algebra is rather lengthy and is
relegated to the second example at the end of the section. We quote the main results
as ratios of squared moduli for two of the coefficients:
k, k,
,1
= 1 + -
1 /
—+— I sinh A,a
2
(5-75)
A 4 I k k ] 2
and
B k k
— + — |
sinh
2
yt
2
a. (5-76)
A 4 \ k l
k2
= {Ae>«>-°' +
*'< *.'-0 x<0
n }
\ Ae ,(k,*-uo
>
x>a
(5 _ 7?)
We observe that ^ consists of incident, reflected, and transmitted waves by noting the
directions of propagation in the two regions:
2 2
Equations (5-75) and (5-76) imply that \A\ exceeds \B\ ; hence, the incident and
reflected waves cannot combine to produce nodes as in the case of the single-step
Figure 5-23
nonzero probability for the particle to tunnel through the barrier. This remarkable
quantum result is attributable to the wave nature of the particle. Such wave behavior
is readily demonstrated in the case of light, using optical devices like the ones
illustrated in Figure 5-24. We should expect the quantum tunneling to be
effect of
expressed in terms of A for each value of h as in Equations (5-75) and (5-76). Such a
l
,
h ( 3* 8**
*
lim \ ox ox
Recall that we have introduced this quantity in Equation (5-16) to define a flow of
probability in the x direction. A current of probability provides a natural way to
compare the incident, reflected, and transmitted components of the wave function for
barrier penetration.
Figure 5-24
Transmission of light through an optical barrier. Total internal reflection takes place at a
prism-to-air interface ifthe angle of incidence exceeds the critical angle. Partial transmission
through a two-prism combination occurs when the intervening air gap is sufficiently small.
>-
—
W V
278 Quantum Mechanics
The piecewise form of the wave function ty( x, t ) generates a similar structure in the
current density j(x, t). The two regions on either
pieces of interest pertain to the side
of the barrier, where ^ has terms as given in Equations (5-77). The calculation of j
appears to have several contributions in the region x < 0:
j = —
2im
{(A*e-
,k x *
+ B*e'
k x '
)(ik l
)(Ae
!k ' x
- Be' ,k ^
x
)
-(-ik l
)(A*e~'
k x
'
- B*e ik x )(Ae ik * *
x '+
Be- ,k >")}.
All the cross-terms cancel, however, so that only the incident and reflected pieces
remain:
h
j= —ik x
{2\A\
2
- 2\B\
2
)
2im
Ilk ,
hk,
\A\< \B\'=jinc +j (5-78)
in in
Note that a one-to-one correspondence holds between j mc and ty-mc and between j refl
and ^.efl, because of the disappearance of the cross-terms. The calculation of j yields
a transmitted contribution alone in the region x > a:
hk,
2
J \A\ =ju (5-79)
Each of the current densities jmc j ref{ and j trans represents a flux of probability in the
, ,
direction of the associated traveling wave. recognize at once that the factor hk x /m We
denotes the speed of the particle. Let us take special note of the fact that this speed is
common to the results in Equations (5-78) and (5-79) because the height of the barrier
is the same from either side. It should be apparent that a different step height on the
right side of the barrier in Figure 5-21 would imply a different speed for x > a and a
correspondingly different factor in the transmitted flux.
The main conclusions of the barrier problem are found in the form of ratios
representing the transmission and reflection of probability relative to the incident flux.
The relevant defined quantities are the transmission and reflection coefficients
Jx Jref]
T= and R = (5-80)
J\
7 2
A 1
T= (5-81)
A 1
1 + - sinh
2
A:
2
a
4 [t, *.J
and
2 sinh"£,fl
B />'
i 4 \
k \
k1
R (5-82)
A \lk 2 A,
1 + +
_
— sinh
2
A: 9a
\ A, k2
5-9 Barrier Penetration 279
Note that the speeds always cancel in the calculation of R and that the speeds also
cancel in the calculation of T because of the symmetrical height of the barrier. We
see immediately that Equations (5-81) and (5-82) obey a sort of conservation law:
T+ R = I.
Thus, the coefficients T and R are able to give well-defined physical information
about probabilities and reflection, even though the
for particle transmission results are
deduced from an unnormalizable wave function.
The transmission coefficient is especially interesting because it measures the prob-
phenomenon. We may recall the defining relations for
ability for a truly nonclassical
kx
and k 2 in Equations (5-71) and (5-72) and derive an approximation to this
quantity for use whenever a is substantially larger than \/k 2 :
The proof of this formula is offered as Problem 37 at the end of the chapter. It should
2k2 "
be clear that the exponential in Equation (5-83) is the dominating factor since e~
becomes a very small number for k 2 a » 1.
Example
= 7.27 X 10
9
m-' = 7.27 nm" 1
.
k2 a
= -k 2 a = ^ = k2a _ -k 2 aj =
e 4 26j g q 235^ and sinh k ^ £ g g qj
1 1
Texact
1
- /
2 2 2
= 0.137,
k2 kx \ 1 + i(2 5) (2.01)
1 + - smh 2
k2a
4 k k2
\ \
)
Tapprox = 16 —Ey (
1 .-^-16(^)|
8
(0.235)
2
= 0.141.
To
The agreement indicates that k 2 a is large enough after all for the approximate
formula to be reliable with this choice of parameters. The answer itself tells us
that the electron has ; i remarkable 14% chance to tunnel through the potential
energy barrier.
Example
k,
a = — and T = Ae'
k a
'
+ De k >" = T ia(Ce~^' - De k a
=
k >"
Ce- and >
) T.
1 + la 1 — ia
Ce-
k* a
= T—. and De k *"=-T-
2ia 2ia
A+B = C + D= —
2ia
[** 2fl (l + ia) - f
_ * 2fl
(l - ia)]
and
A - B = ia(C - D) = - [e k *a
(\ + ia) + ^"(l - ia)]
A = T cosh k 2 a + — a — — I
sinh k a
2 \ a
and
i
( 1 \
+ —
.
B = \ a sinh k 2 a.
cosh
2
k. y a
/
1
+ —\ a — —\
M 2
.
sinh
2
/r, 7 a = 1
1
+ —1a + —
\
2
sinh A,a
'
4 a 4 a '
and
2
1 l
= —\/ a + — \
\
.
sinh~X>a.
4 \ a
Example
Consider the double-well potential energy for a particle in Figure 5-25 as a case
in point. The symmetry of V(x) implies the existence of eigenfunctions with
We can use these symmetry properties along with the curvature
definite parity.
arguments of Section 5-6 to deduce shapes for some of the solutions. The figure
shows the bound-state eigenfunctions \p and \p.> for the two lowest energy levels {
is, and E Let us suppose that the wave function of the particle is given by
.
2
some combination of these states, such as
E2 - E x
Figure 5-25
V(x)
<l>i(x)
*(x,
5-/0 Two-Particle Systems in One Dimension 283
Many of the important problems in quantum physics are concerned with assemblies of
several particles. The two-body system is already rich enough to generate a number of
new ideas for consideration in the Schrodinger theory. Our attention turns now to the
description of such systems, while the treatment continues to focus on behavior in one
dimension. This investigation leads us directly to a profound new concept when we
consider the symmetry of two identical particles in quantum mechanics.
Let us begin with some general remarks about the analysis of two quantum
particles in one dimension. We assume that the particles can be distinguished, at least
by their masses, for the first at, and
part of our discussion. There are two coordinates,
x 2 so that the state of the system
, wave function ^(x u x 2 t), which
is described by a ,
depends on these two independent variables. Note that we do not introduce separate
wave functions for each particle because, in general, we do not want the constituents
of the system to be isolated from each other.
The classical energy relation contains the kinetic energy of each particle and the
potential energy of the system:
P~ + £- + V{x ,x 2 )=E. x
We return to the representation rules in Equations (5-62) and (5-66) and write
h d h d d
Pi -* - t— , A> -» - T~ '
and E ~* lh
T~
i ox x
i ox 2 at t
as differential operators acting on ty( *,, x2 , t ). We then use these operators to convert
the energy relation into the two-particle Schrodinger equation:
2 2 2 2
h d h d d
2 2 V 2 '
2m dx l
2m 2 dx '
8t
The probability interpretation of the wave function makes a joint statement regarding
the detection of the two particles. We define the probability density in terms of the
normalized wave function as
2
P(x l
,
x2 , t) dx l dx 2 = \^(x l ,
x2 , t) |
dx l dx 2 (5-85)
/OO /-00
dxA dx 2 \^ = 1
We assume that V has no dependence on the time so that we can find stationary-state
Each of these states has an energy eigenvalue E
solutions of the Schrodinger equation.
appearing as a parameter in the t dependence of the wave function
2 2 2 2
h d h d
2*~ T~ + V(x Xl x 2 )) = E*. (5-86)
2w, dx 2 2m T^^
dx 2 2
Our general discussion concludes at this point. We can proceed further if we are told
how V depends on its two variables. Only one special type of dependence on x and x 2 x
V= V (x )+ V2 {x 2 ).
l l
(5-87)
Hence, the two particles may be subject to some external force but are not influenced
by each other since V includes no contribution linking the coordinates x, and x 2 The .
two particles are dynamically isolated from one another in this special situation, and
the solution for \p(x u x 2 ) simplifies as a consequence.
We construct \p by identifying separate eigenfunctions for each particle. The
eigenfunctions are defined as v//(x,) and \p(x 2 ), with energy eigenvalues E and £,
according to the one-particle equations
2
h d 24*
2
+ V (x )+ = E+ 1 i
2m l
dx
and
2 2
h d ^
,
+ V2 {x 2 )) = E).
lm 2 dx 2
We then find that, if we multiply the first of these equations by «K* 2 ) an<^ tne secon<^
5- 10 Two-Particle Systems in One Dimension 285
by i//( at, ), and add the two results, we produce the equality
2
h
2
dS> ~ h ~d 2$
2m x
dx\ lm 2 dx 2
using Equation (5-87) to write the third term on the left side. The result may be
compared with Equation (5-86) to reveal that a solution for \^(x x
,
x2 ) has been found.
The two-particle eigenfunction is evidently of product form,
E= E+ E. (5-89)
The particles share the total energy E as independent entities because the dynamics of
the system is based on a potential energy of additive form in the two coordinates.
Thus, the stationary state of the two independent particles is represented by a multiplica-
tive energy eigenfunction with an additive energy eigenvalue. It is apparent that this
construction would not hold in the absence of the hypothesis in Equation (5-87).
Our problem takes an entirely new and surprising turn when we consider particles
that cannot be distinguished. The constituents of such a system must have the same
mass and also the same value for every other identifying characteristic. We find that
the probability interpretation forces the probability density to have a certain unique
property in any state of such a system. The issue becomes clear if we return to
Equation (5-85) and realize that we cannot always say which particle is in dx and ]
Figure 5-26
O—--ZD C
< O
286 Quantum Mechanics
nates in the wave function. Thus, the probability density in Equation (5-85) is
2 2
|¥(*„* 2 ,0I =|*(*2»*i»0l ( 5 " 9 °)
in a system of two identical particles. The wave function can realize this symmetry in
one of two ways. It must be the case that ^ is a symmetric wave function satisfying
1 . - „ -
or antisymmetnzed by defining
A =
\p when ip and \p are identical functions.
This interesting observation means that no antisymmetric state can exist for particles
whose single-particle quantum states are labeled by the same quantum number.
5- 10 Two-Particle Systems in One Dimension 287
Example
nv _ J
\ oo
when both
otherwise.
.v, and x 2 are in (
— a/2, a/2)
2 2 2 2
h d h d
- ~ —
2m yr^
ax\
~ o
2m 2 T~2
{
ox 2
^
= E^ —
if both x {
and x 2 are in the interval (
— a/2, a/2) and that the eigenfunction
vanishes if either coordinate is outside this region. We appeal to our indepen-
dent-particle result in Equation (5-88) and write the solutions for «//(*,, x 2 ) as
products of particle-in-a-box eigenfunctions taken from Section 5-4. Each of
these single-particle states has a single quantum number, and so every two-par-
ticle eigenfunction has two such quantum numbers, one for each factor in the
product
hV h
2
m2 1 I hit \
2
1 »? n\\
E "^ =
W" 2
1
+
2^a~ 2
"2
2
" 2 l T) UT +
Z "
|2 i2
P(x ,x 2 ,t)dx
l l
dx 2 =\\P ni (x 1 )\
dx r \^ n2
(x 2 )\ dx 2 .
It is instructive to list a few of the eigenfunctions and energies for some of the
288 Quantum Mechanics
^i,
= —cos cos
a a a 2 \ a I \ m, m2
77,, 277* 2
i//,2 = —cos sin-
a a a 2 \ a j \ m l
m2
2
2 277.x, 7r;t \ ( h7T\ l 4 1
4> 2 \
= — sin cos
2
a a a 2 a
V 1
/here
2
1 t hTT\ ,
lm\ a )
«|/'
n
2
= —cos
77* ,
cos
77X 2
£,, = —
2m
— I
fl7T
a
\
^12 =
.,
¥'21 = ~1k
12/
~
a
C°S
77,,
a
Sm_
.
277,2
a
+ sin
277,,
a
cos —
77,,;
a
\J2 \
1 / Hit
E„ •
5.
2m \ a
a
/
cos
77,,
a
sin-
277, 2
a
277,,
a
cos —
77, 2
a
V2 \
Observe that \p n n
is automatically symmetric, while \j/*
n
is nonexistent,
whenever «, and « 2 are equal.
particle has a wave function ^(x, y, z, t ). The dependence on the three coordinates is
determined by solving a three-dimensional version of the Schrodinger equation. We
deduce the form of the equation by referring to the classical energy relation
—
P
2m
+ V(x,y,z)=E
and noting that the momentum p is a vector with three components. The kinetic
energy therefore consists of three terms:
f_ = p*+p; + p;
2m 2m
The customary rule for representing the momentum by a differential operator applies
to each Cartesian component of the vector p. We employ this technique to introduce
the set of differential operators
h d h d h d d
~* ~ — — E ~~
& ~T~
Px
i
~^~
ox
> Py ~*
i
T~
dy
> A "~*
i
~T~
dz
>
an d *
at
>
all acting on the wave function ^(x, y, z, t ). The energy relation is thereby converted
directly into the equality
~^~\
2m \
T^*
dx"
+ Tl*
dy
+ Tl*)
dz~ /
+ V(x,y,z)* = ih — *.
dt
dx dy 2 dz 2
2
h
-—V
2m
,
2
*+V(x,y,z)<l' = ih —d y. (5-93)
dt
The result expresses the Cartesian form of the Schrodinger equation in three dimen-
sions.
di = dxdydz
and define the expression
2
\^(x, y, z, t)\ dxdydz
so that the particle has unit probability to be found somewhere in all space.
A stationary state of the particle is an energy eigenfunction with energy eigenvalue
E. The corresponding wave function is written as
The spatial eigenfunction \p(x, y, z) and energy E are found by solving the eigen-
value problem conveyed by the time-independent partial differential equation
—
2m
V'^+ V(x,y,z)rP = E^. (5-94)
The dependence of V on its three coordinates has to be specified before any more
progress can be made.
A simple but instructive illustration is provided by the free motion of a particle in a
three-dimensional box. We confine the particle by means of an infinite exterior
potential energy:
I oo otherwise.
Figure 5-27
Rectangular boxes for the confinement of a particle. The energy levels are indicated below by
the quantum numbers n^n.,n i
. The states pass through different stages of degeneracy as the box
assumes higher degrees of symmetry.
y :/
c/ y / a / y
,/
/
a ./ a /
222
222 222
112 112
211 211, 121 211. 121, 112
121
111
111 111
5- 7 / The Three-Dimensional Box 291
The allowed solutions of Equation (5-94) must vanish outside the box and satisfy
c
x = + —
H
~ 2
, and + -
~ 2
We can assemble such functions of (x, y, z) by using our solutions \p n ( x) from Section
5-4 in each of the three variables. We denote the one-dimensional particle-in-a-box
eigenfunctions as
a a b b c c
.v in y in and z in
2> 2 2' 2 2' 2
We then take products of these solutions to form the desired eigenfunctions in three
dimensions:
t n] n nS X > y> Z )
= ^n (
X )^n {y)^n,( Z )- (5-95)
2 l 2
(A similar argument has led us to the expressions for the standing electromagnetic
modes in Equations (2-17).) Each factor on the right side of Equation (5-95) is
independently indexed by its own quantum number, and so a set of three quantum
numbers is needed to label all the requisite functions of the three Cartesian coordi-
nates. We can write an explicit formula for these eigenfunctions inside the box as
Tn n n -,
(5-96)
\ sin \ sin
,
i
v sin b c
where the notation { } ( ) tells us to choose the cosine or sine function of the indicated
variable when the relevant quantum number is odd or even.
The energy eigenvalues are labeled by the same three quantum numbers. We
determine the allowed energies by direct use of Equation (5-94) inside the box:
2m
2 2 2
h* 1 d d d \
2 2
2m\ dx dy az J
'
3
=
h
2m
2
-
I
I
—
n TT'
x
a ,
,
1
2
Yn x
tt
2n3
/
1
n 2 ir\
^ 1
2
T n| n 2 n 3 f n.n 2 n.
2 2 2
tl TT
(
n \
V +
(
n 2\
+
/"3\ 2
T n n n
2m (7) (7) (7). ^ »
t
292 Quantum Mechanics
It follows that the energy eigenvalues are given in terms of the three quantum
numbers by the formula
2 2
/n 2
— -"^7 \2 « 3 N2
h 2
ir [/ n, \
£ +
7 \ (5-97]
=J, - iE-
^ ( X y 2)e l -2'3 t / h .
These few steps produce a complete set of solutions to the problem and do so without
the need for any new mathematical machinery.
An interesting new feature appears in our results when the box has two or more
dimensions of equal length. Figure 5-27 includes the special case of a second box with
lengths (a, a, c), in which the energies are found from Equation (5-97) to be
2
2
m2 +
1m —
h I n nl n\
r^ + -
\
£ WJ
'
2 J =^H \ a c
• (5-98)
The significance of this case resides in the fact that the formula for the energy levels is
states whose energies E„ „ „ and E„ „ „ are the same. These states are indeed distinct
because the two wave functions ^„ „ „ (x, y, z.t) and ^ „ „ „ (x, y, 2, t) differ in their
dependence on x and y. This set of circumstances exemplifies a general quantum
phenomenon of far-reaching importance. The situation is called degeneracy, and the
states are said to be degenerate, whenever a quantum system has two or more distinct
energy eigenfunctions with the same energy eigenvalue. In the case at hand, we
associate the degeneracy of the states Sk n „ and ^n n n with the symmetry of the box „
E„ in
1
,
-
ni
'
=—
lm\
nv\
— 1 /
a j
',
» +«| + »i). (5-99)
The complete symmetry of the box causes the expression for the energy to be
completely symmetric under any interchange of the three quantum numbers. The
resulting degeneracy of the states is maximal since all the different states
,^„
ty„ „ „ have the same energy, regardless of the order in which the
„ „ ,-
quantum numbers occur in the eigenfunctions. A numerical example is given below to
illustrate how successive degrees of symmetry of the box result in sequential stages of
degeneracy.
We have encountered degeneracy before, and not taken note, in our discussions of a
free particle. It is clear that the monochromatic wave functions
Hkx «o *«'<-**-«0
e and
represent distinct degenerate states. These right- and left-propagating waves have
different momentum eigenvalues +M, but the same energy hu.
511 The Three-Dimensional Box 293
Example
The following calculations demonstrate the evolution of degeneracy for the three
systems indicated in Figure 5-27. First, we let the dimensions of the box have
different, nearly equal lengths (a, -^a, jffl)- Equation (5-97) determines the
energies as
ftV I 81 121 A
2 =
lOOnf + 81«1 + 121«|
n: + n.,
2
+ ;
two of the lengths be equal by choosing dimensions (a, a, jytf), and we use
Equation (5-98) to find
100
Finally, we let the box be a cube with lengths (a, a, a) and use Equation (5-99)
to obtain
100
n, n2 n3
£ ri,n 2 n 3 Q Tn,n 2 n 3
a
cos
a
a
cos
77
a
sin
'Ittz
a
X
sin
a
cos
7TZ
a
my
2 1 1 602 621 600 sin —
Ittx
—
mz
cos cos
a a a
2my 2m
1 2 2 908 984 900 cos —
77 x
a
sin
_
a
sin
a
2mz
2 1 2 965 -» 984 -» 900 sin
277X
a
cos —
77^
a
sin
a
2my
2 2 1 845 921 900 sin
2 77 x
sin cos —
mz
a a a
Problems
* = a,^, + a2 % + a3 * + 3
• •
,
in which the a's are constants, and show that ^ is also a solution. What mathematical
step needs to be justified when the construction of ^ involves an infinite series of
functions?
j(x,t) = —
2im
ft (
\
**~idx
<9* d**
^*
d>
n
(9
ox at
What condition must the potential energy V(x) satisfy in the derivation of this result?
3. Suppose that the state of a particle is described by the monochromatic wave function
Calculate the probability current density for this state and interpret the result. Replace ^
by the wave function in the example of Section 5-1 and repeat the calculation.
2
4. Show that the normalization integral /? oc >^(-«, 0| dx is time independent |
for the case of
a nonfree particle. What condition must V(x) satisfy to ensure this result?
~' E /h ~' E *'/ h
be two stationary-state wave functions with real-
{
5. Let )p l
(x)e \
and \p 2 (x)e
valued eigenfunctions \p i
and i//
2 - Suppose that the state of a particle is described by the
wave function
2 2 2
|*(x,0| = KI 2 (^,(x)) + KI 2 (^(*))
+ 2|a,||fl 2 |^ I
(x)^ 2 (x)cosl - t - $, +<#> 2 I,
6. A particle in a one-dimensional box is confined to the interval [ —a/2, a/2] and is in its
first excited state. Calculate the probability of finding the particle in the subinterval
(a/8,3a/8).
Problems 295
traveling waves that propagate in opposite directions inside the box. Obtain the phase
velocity of these traveling waves.
9. Suppose that the wave function for a particle in a one-dimensional box is given by the
superposition
ty = c% + c'%,,
where ^„ and ^„. represent any two of the normalized stationary states of the particle.
What condition must the complex constants c and c' satisfy in order for ^ to be a
normalized wave function? Interpret this result.
10. Suppose that the wave function at / = for a particle in a one-dimensional box is given
by
1 / 2irx 3nx
*(x,0) = -f=\ sin + cos
\Ja \
a a
What is the subsequent form of the wave function ^(.v, ()? Use this form to compute the
probability density, and interpret the time dependence of the result.
11. Show that the probability of finding a classical oscillator in an interval dx between —A
and +A is given by
dx
TT\ A' - X
i
where A is the amplitude of oscillation. What is the probability of finding the oscillating
2
* = e -fi^* /2h e -,u </2
unnorma ii zed),
(
and show that ^ has points of inflection at the extreme positions of the particle's classical
motion.
13. For an oscillator in its ground state, calculate the probability of finding the particle
between x = and x = A/2, where A is the classical amplitude of oscillation. Obtain a
numerical result and compare with the answer to Problem 1 1
14. Assume that an atom in a metallic crystal behaves like a mass on a spring. Let the spring
constant for a copper atom correspond to angular frequency con = 10' 3
rad/s, and
calculate the atom's amplitude of zero-point motion. Take the Cu mass to be equal to 63
H masses.
H„^2iH„--
by showing that (2£//„ — //„') obeys the differential equation for H n +
296 Quantum Mechanics
2(k- n)
(k+ l)(A + 2)
where V and x
(j (j
are positive constants. Sketch a graph of the function and determine the
equilibrium position of the particle. Deduce the form of the harmonic-oscillator ap-
proximation for this potential energy.
19. Let E be the total energy of the particle in the previous problem, and obtain a formula
for the turning points of the classical motion. Consider the results for E < and for E>
as separate cases.
/oo
* 2*(*)*o(*)*-0,
A W-shaped potential energy function V(x) produces a set of bound states, the three
lowest of which are arranged as shown in the figure. Sketch the qualitative behavior of the
three corresponding eigenfunctions.
V(x)
V(x)
N Mx)
Problems 297
which the probability density falls by the factor \/e. Deduce the formula for d and
23. Assume that the indicated square-well potential energy is capable of producing at least a
ground state and a first excited state, as shown. Sketch the behavior of the eigenfunction
V(x)
Ground state
24. Show analytically how to determine the energy eigenvalue for the first excited state in
Problem 23.
25. Write out the proof of the formula for the square of the uncertainty in x,
2
(A,) = (, 2 >-<,>2 ,
using explicit integral expressions for all the relevant expectation values.
26. Prove that (p) and (E) are real by showing that (/>)* = (p) and (£>* = (E).
27. Let the wave function ^(x, I) be expressed as the superposition of two stationary states
having different energy eigenvalues,
¥ = c% + c'%.,
and obtain a formula for the energy uncertainty AE in this state. As a special case let ^
be an equal-parts admixture of ^ n
and ^ n , and interpret the corresponding result for
AE.
28. Assume that a particle in a one-dimensional box is in its first excited state, and calculate
the expectation values (at), (x ), (p), and (/>'). Evaluate the uncertainties Ax and A/>,
29. Use the wave function ^(x, t) for the superposition of states in Problem 10, and evaluate
the expectation values (x) and (p). Verify that these quantities satisfy the relation
(p) = md(x)/dl.
30. Write down a suitable transition state for a particle in a box describing transitions from
the n = 2 level to the n = 1 level. Evaluate the corresponding dipole transition ampli-
tude.
31. Repeat the steps of Problem 30 for the case of particle-in-a-box transitions from n = 3 to
n = 2.
32. Generalize the considerations of Problems 30 and 31 to allow for transitions between
arbitrary pairs of levels, and deduce the appropriate selection rule for electric dipole
transitions. Obtain a general formula for the nonvanishing dipole transition amplitude,
and compare with the results of Problems 30 and 31.
298 Quantum Mechanics
33. Refer to Figure 5-22 and solve for the nodes of the single-step eigenfunction
where k x
= \J2mE /h and k 2 = \J2m(V — E) /h. What happens to this solution in the
limit E -> V ?
34. A particle with energy E is incident on a single-step potential energy barrier with step
height V Incidence
{)
. is from the left and E exceeds V , as indicated. Determine the form of
the eigenfunction that describes this situation, and interpret the various contributions to
the wave function.
V(x)
O^ V
35. Use the results of Problem 34 to derive expressions for the transmission and reflection
36. A particle with energy E is incident on a rectangular potential energy barrier with height
V ()
. Assume E< V as indicated, and let the particle be incident from the left. The
transmitted wave function has the form
_
= AMk,x-ut)
Ae in the region x > a.
*. ran s
Show that the probability density in the region of the barrier < x < a is given by
*** = \A\<
2
cosh £ 2 (a - x) + { M
—
2
2
sinh*A,(a - x)
k 2)
\
where k {
and k, are defined in Problem 33.
V(x)
Problems 299
'
/ k2 k \
T= 1
1
+ — — + —x
smh'k.,a
4 \ ft, k,
J
T= 16—
M 1
^o
38. A 50 g particle slides with speed 20 cm/s toward a bump 1 cm high and 2 cm wide.
definite parity.
V(x)
40. A particle is free to move inside a pizza box with dimensions (a, a, a/10). Determine the
six lowest energy levels, and identify the quantum numbers of the degenerate states at
each level.
41. A particle moves freely inside a cookie box with dimensions (a/5, a/5, a). Determine the
ten lowest energy levels, and tabulate the results according to the quantum numbers of the
QUANTIZATION
OF
ANGULAR
MOMENTUM
new quantum concept that does not arise in the restrictive framework of one
dimension. The entire chapter is given over to the development of this fundamental
topic.
A limited version of three-dimensional quantum mechanics has already been
introduced at the end of Chapter 5. The Cartesian coordinate system of Section 5-11
must be set aside, however, because such a choice of coordinates would be very ill
advised for the situation of interest here. We are concerned in this chapter with the
behavior of a system under the influence of a central force. We know from our
experience with classical physics that the general central-force problem exhibits
conservation of angular momentum and that the solution of the problem is expedited when
we build this property into the analysis. Our objective is to learn how the concept of
angular momentum appears in quantum mechanics and how the conservation law-
takes effect as a quantum principle.
It would seem that the central force embraces a rather restricted class of dynamical
problems. In fact, the ideas associated with angular momentum prove to be surpris-
ingly general and make their appearance on many different occasions throughout
modern physics. The topic is a vital ingredient in the theory of atoms, since the
Coulomb interaction between an electron and the nucleus of an atom is a prime
example of a central force. We treat the problem of central forces in generality so that
our can be regarded as comprehensive. We can then apply our conclusions
results
any other appropriate quantum problem.
specifically to the atom, or to
Our treatment is based on the Schrodinger equation in three-dimensional polar
coordinates. We are faced with certain complications when we adopt this coordinate
300
61 Centra! Forces 301
system and proceed to solve the resulting differential equations. We give these
equations serious consideration so that we can appreciate the interpretation of the
solutions. Our efforts are rewarded in the end as several very important principles
emerge from this investigation.
Let us begin by recalling the classical treatment of the two-body central force. We
suppose that the two masses m and M are separated by a variable distance and that
r
the center of mass is at rest, as in Figure 6-1. The force on each particle is assumed to
be oriented along the line defined by the separation vector r. We have learned in
Section 3-6 that this two-body problem is completely equivalent to a one-body problem
in which a reduced mass ju. is attracted or repelled by a fixed center of force located at
the origin. We recall that the reduced mass is given by
mM
M+ m
and that the separation vector r is the coordinate vector for the mass /x.
It is also assumed that the force on ju. is conservative and hence derivable from a
potential energy V. The central nature of the force implies that the potential energy
depends on the magnitude, but not the direction, of the vector r. We therefore
introduce a unit vector r and write the central force as
dV(r
F = (6-1)
dr
This expression tells us that F points away from the origin if dV/dr is negative and
toward the origin if dV/dr is positive. We take the function V( r ) to be completely
arbitrary to ensure the generality of our conclusions.
The force law in Equation (6-1) allows us to identify a constant of the motion. We
know from classical mechanics that insights are always gained whenever a conserved
quantity is revealed, and we expect the same experience to carry over in quantum
physics. It is evident that the angular momentum L is conserved since the force F
Figure 6-2
Figure 6-1
the angular momentum L. The momentum p
has polar components p and p±
Two interacting particles and the r
.
~~IP.
302 Quantization of Angular Momentum
dr
p = ju — and L = r X p
dt
dh dp
=>
dt
=
dr
—
dt
Xp + rX —
dt
=rXF = 0.
Since L is a constant vector, it follows that the classical orbit of the particle lies in a
fixed plane through the origin, perpendicular to the direction of L. In that plane the
momentum p has components along r and normal to r, as shown in Figure 6-2. These
components are expressed in terms of the radial distance r and angle x as
dr dX
Pr
= V- and Px =,r-,
L = rp ± .
We use this result to rewrite the expression for the kinetic energy,
2
P Pr + P\ Pr
+
2ju 2n 2n 2jur~
2
L
Pr
%- + -— + 2
V(r)=E. (6-2)
2jU 2jur'
This equation is our starting point for an examination of the Schrodinger equation in
Ku(r)= P(r) + — L2
j. (6-3)
and thus assumes a form just like the energy relation for one-dimensional motion. We
can use this result to find the radial turning points for the orbit of a particle, as in the
following illustration.
6-2 The Schrodinger Equation in Spherical Coordinates 303
Figure 6-3
Effective potential energy for an attractive Coulomb force. The two shaded regions indicate
allowed ranges of r for classical motion with total energies £, and E,. A bound orbit results for
Example
b L
V(, (b>0) so VM {r)= - - +
r 2jur
Figure 6-3 shows a sketch of the effective potential energy for some chosen value
of the constant L, along with two selected values of the constant E. Observe that
the centrifugal part of Feff is always a repulsive contribution. The figure tells us
that this term dominates V(r) at small r and prevents the particle from
approaching the point r = 0. The quantity E— V eff
cannot be negative because
of Equation (6-4), and so the equality E= V efr
(r) defines as turning points the
minimum and maximum radial distances for the given energy E. The shaded
domains in the figure indicate the allowed physical regions for classical motion
in the variable r. For E < x
we have a bound orbit with two turning points,
and for E2 > we have an open orbit with one turning point. Pictures of these
two orbits are also shown in the figure.
We have made the transition from classical to quantum mechanics on several previous
occasions. This time the appropriate form of Schrodinger's wave equation is to be
found from the classical energy relation in Equation (6-2). It is expected that the
terms in this equation should turn into derivative and multiplicative operations acting
on the wave function ty. We know in advance that E becomes the differential
operator ih d/dt and that the Schrodinger equation has the general three-dimensional
structure
V 2^ + V<V = ih —
dt
*, (6-5)
-V
where V = 2
d
2
/dx 2 + d
2
/dy 2 + d
2
/dz 2 The
. central nature of the potential en-
ergy function V is our main consideration.
.
Figure 64
Cartesian and spherical coordinate systems.
We are concerned with a general situation where V depends only on r. The radial
variable is a rather complicated function of the Cartesian coordinates:
2
lx +y 2 + z
2
(6-6)
Hence, it is obvious that the Cartesian variables (x, y, z) are not suitable and that
spherical polar coordinates (r, 0, </>) should be adopted instead. Figure 6-4 shows how
x = r sin v cos
z = r cos 6 (6-7)
guidance because the terms indicate a clear separation of radial and angular compo-
nents according to the classical construction in Figure 6-2.
We determine the form of p 2 by considering the special case of a wave
differential
function ^(r, / ), which has no dependence on and <p. The derivation begins with the
derivatives
d^ dty dr 3*
dx 8r 3;
and
d
2
* 1 3* 2
d* x- d
2
*
+
17 r dr "~d~7 ?- Ih 2
6-2 The Schrodinger Equation in Spherical Coordinates 305
Substitution of y for x and z for x then yields the other two derivatives:
d
2
* 1 3* y
2
d* y
2
d
2
*
1 3
dy r dr r dr r dr~
and
d
2
* 1 8* z
2
d<P z
2
d
2
*
+ -x
dz r dr r dr r dr
2d*
—*y.
2
d
V 2
*(r,t) = -—- +
r dr or
V z*(r,t) = - —d [r
\
r~ or
l
\
2 —
d*
dr
2
1 d
= -—^(r<k). (6-8)
r dr"
The rule p~ -» (h/i)"V " can now be used along with the first of these expressions to
write
2
/ h \
l d d
as the radial differential operator that acts on the wave function *{r,t) in the
Schrodinger equation. We is correct to express L" with no radial
then argue that, if it
2
derivatives and p with no angular derivatives, it must therefore be correct to express
2
p by the purely radial differential operator in Equation (6-9), acting on any wave
function ^(r, 8, (/>, t).
We expect the formula for L2 to contain no radial derivatives because the classical
construction in Equation (6-2) and Figure 6-2 has the effect of separating changes in
angle from changes in r. Let us clarify this point, and secure an important formula for
later use, by examining a specific component of the angular momentum. We choose
the z component
Lz = xPy ~ yP*
hid
— —
d \ hi
——
d
—
d
L z —* — x y = — \
r sin 8 cos <j> r sin 8 sin </>
i
\ dy dx J
t
\
if
y dx
d dx d dy d dz d d d
= 7~ + 77 T~ + 77 7~ = ~ r sin e sin ^7~ + sin e cos -
77
09 77
d<p dx d<p dy d<p dz dx
r
^Tdy »
-
and we then compare the two differential expressions. The result is an important new
representation rule:
Lz -- —
d h
• (6-10)
; o<f>
The z component of L is distinguished because the z direction has been selected as the
polar axis in Figure 6-4. The other two components of L also become purely angular
differential operators. We quote the relevant formulas as
7
h
\
— sind>—
r —
dd
d
--
cot cos <f>
— d
and
L -»
hid
- cos<f>—- - cot 0sin<J>—
d
\
(6-11)
/ \ OV <l() I
L2 = L\ + L; + L\.
2 2 2
/ h\ i h\ l l d d l d \
Note that a special symbol A"' is introduced in this formula to denote the differential
portion of the operator. We explore the detailed structure of A later on, when we
examine the form of the wave function.
The two rules in Equations (6-9) and (6-12) can now by employed to convert
Equation (6-2). The operators act on the wave function ^(r, 6,<f>, t) to generate the
Schrodinger equation in spherical coordinates:
2
h d d d
-^r~A
2\lt -T
or
r
i
\
V or
,
* r
+ A *
\
}
+ V{r)^ = ih—^f.
ot
(6-13)
It is interesting that the kinetic energy term in Equation (6-5) has turned into such a
complicated differential expression. The complications arise because the choice of
coordinate system has been tailored to the behavior of the potential energy term. We
pay this price willingly in order to take advantage of the fact that V depends only on
r. The mathematical strategy pays enormous dividends when we turn to the problem
of solving Equation (6-13).
Our next step is to seek solutions for wave functions of the form
^= ^(r,6,$)e-' E,/h ,
(6-14)
ih —
dt
Sk = E* (6-15)
and therefore has the properties of an energy eigenfunction. Thus, the proposed state
^ is a stationary whose wave function gives a time-independent probability
state,
density l^l
2
=
and whose energy occurs with zero uncertainty. The spatial
|i|/|
2
,
2
h
— — r
2
xp + Afy] + V(r)4> = Exp. (6-16)
2jur
Our previous experience in one dimension has taught us to expect a set of allowed
solutions for the eigenvalues and eigenfunctions.
It should be noted that the passage from Equation (6-5) to Equation (6-13) by way
of the classical energy relation has led to a derivation of V " in polar form, and a
great deal more. Our goal is to understand the concept of angular momentum in
quantum mechanics, and our approach has kept that quantity in view throughout.
The main conclusions are embodied in Equations (6-10) and (6-12), along with
Equation (6-13). We should regard the operator formulas for L. and L~ as essential
adjuncts to the Schrodinger equation. The eventual interpretation of the angular
momentum stems from this association.
Detail
Let us sketch the algebra involved in the derivation of the operator expressions
for L x and L . We begin by using Equations (6-7) to derive relations between
the sets of operators (d/dx, d/dy, d/dz) and (d/dr, d/dd, d/d§). The chain
rule has already been used to compute d/d<j>, and may be used again to find
Tr
8 dx
— —— + ——— + ——— =
d dy d dz d
sin 9 cos
d
§ —— + sin 6 sin
d
$ —— + cos 6 —
d
or ox or oy or oz ox oy oz
and
d dx d dy d dz d
~ + +
He JoTx YeYy YeYz
d
= r cos 9 cos 4>
—- + r cos 9 sin <p
—d
dx dy dz
The results for the operators (d/dr, d/dd, d/d<j>) can be assembled in matrix
— —
form as
8 3
sin 6 cos (p sin sin <p cos
Tr
d
—r
T r cos 6 cos $ r cos # sin </> si
Ty
d d
—r sin # sin </> r sin # cos (j>
8$
The square matrix can then be inverted to obtain expressions for the operators
(8/8x, d/dy, d/dz):
d sin 6 d
cos
Tz T$>
(It is a straightforward matter to multiply the two square matrices and get the
unit matrix as the expected result.) We proceed next to the calculation of the
desired components of L:
hid d \
•"*•-*'"'
7
1
'IT
z
Ty )
h d sin0 d
( n \
t sin u sin <b\ cos u
1
\ dr r dd)
1 d cos 6 sin <j> d cos <p 8 \
and
Lv = */>, ~ *fc
5 jt dz
d cos # cos
r cos 9\ sin cos <i) — +
4>
—d —
sin <£>
—
<9
dr r d6 rsind 8<p
8 sin 8
r sin 9 cos $ cos 9 —
|
dr ~TTe
These expressions reduce in one more step to the formulas quoted in Equations
(6-11).
Angular momentum takes on several curious properties in the passage from classical
to quantum mechanics. The peculiarities are fundamental to the new quantum
interpretation of this important physical quantity. We encounter some of these
6-3 Rotational Motion 309
Figure 6-5
P\
(6-17!
2n 2ixR-
We have introduced the angular momentum, noting that L, is the only nonzero
component of L.
The quantum particle has a wave function ty{<t>, t) and a probability density
P(<p, t) =
1^(0, 01 We obtain the Schrodinger equation from Equation (6-17) by
2
-
2 2
d
h
2jU,r?
r
2
r
2
^= ih —8 ^
dt
d<f>
2
d 2^ 2fiR
This familiar differential equation has complex exponential solutions of the form
^ = Ae
±,x *
with A2 = ——
2ixR
2
E.
We choose these functions instead of cos \<$> and sin \<j> to prepare the way for a
further property of the stationary states.
310 Quantization of Angular Momentum
At this point we bring forward the essential distinction between the circular system
and the particle in a box. The coordinate in <J>
the problem at hand is a cyclic variable
that repeats itself periodically after the basic interval [0,277]. Therefore, if we want
the stationary-state wave function ^ to be single valued, we must impose a periodicity
condition on the eigenf unction:
This requirement applies to our complex exponential solutions with the following
implication:
A = m, an integer.
2 2
h h
-X 2
2
so Em = -m 2 . (6-19)
2fxR 2iiR'
The resulting quantized energy Em defines a set of discrete energy levels in which the
integer m appears as an azimulhal quantum number.
These remarkable stationary states have an additional property with regard to the
angular momentum L,. We note that the complex exponential solutions are eigenfunc-
twns of the angular momentum operator:
h d
-—Ae ± m *=±hmAe ±,m '
*. (6-20)
i o<j>
momentum interpretation and have therefore not been employed to describe the
stationary states of the particle. Finally, note that the allowed values of Em in
Equation (6-19) are insensitive to the sign of m, so that every energy above the m =
level has a two-fold degeneracy. In other words, clockwise and counterclockwise
motions have the same energy.
The multiplicative constant in the solution is determined by normalizing ^ to unit
probability. The probability of finding the particle in the azimuthal interval d<p is
1 = / |^|
2
^= /
\A\
2
d<f> = 2w\A\'
6-3 Rotational Motion 311
,im<p
* = ''*-'/*
with m = 0, ±1, ±2, (6-21:
/277
Equation (6-18) has played the key role in these arguments. We have deduced
quantization of energy and quantization of angular momentum from the single-val-
ued property of the wave function. Let us reconsider this condition and acknowledge
that the property is not as self-evident as it may seem. We realize that the wave
2
Thus, we see that |^| is already single valued and continuous, whether or not A is
equal to an integer. Of course, this assertion is restricted to situations where * is a
stationary state. Let us reapply the requirement of single-valuedness in the context of
a localized particle and write the wave function as a superposition of stationary states.
^ = A e O#e -iEt/h + A ,
e
i\'$
e
-iE'l/h^
2
= \A\
2
+ 4*4',-«-(X-X')V(*-*')//»
|*|
We replace <J>
by <f>
+ 277 and find that |*|
2
becomes
e
2v,(\-\') _ j Qr x _ X' = an integer.
We can implement this conclusion by taking every A to be integer valued and thereby
recover the results of the preceding paragraphs.
This entire illustration of rotational motion is flawed from the outset by the
presence of the constraint. Our assumption of a precisely circular path for the particle
violates the uncertainty principle and causes the fixed direction of L to be a violation
of basic quantum principles. These difficulties are handled properly in the full
three-dimensional quantum treatment of rotational motion.
Example
Figure 6-6
the rod and write the classical energy relation in terms of the kinetic energy-
alone:
2
L
—
2/
= E.
The angular momentum L and the moment of inertia / appear in this formula.
The quantum mechanical rigid body is described by a wave function ^(6, t), <f>,
where 6 and <j> refer to the orientation of the rod as shown in the figure. We take
2
the operator rule for I from Equation (6-12) and apply the familiar operator
rule for E to obtain the Schrodinger equation for the system:
2
h d
A 2 * = ih— ¥.
2/ dt
2
h
A-ty = Exp ,
21
a partial differential equation in the variables (6, <j>). We must learn to analyze
this kind of equation in order to finish the problem. When we are able to return
to the solution we find that quantization of energy and quantization of angular
momentum are again obtained as joint results. In this case, the behavior of the
angular momentum is found to be consistent with the uncertainty principle.
6-4 Separation of Variables 313
of the time-independent equation first, thereby establishing the spatial behavior of the
stationary states. This complete family of energy eigenfunctions can then be used to
assemble a unique solution ^ for any given description of the state at t = 0. We take
the same approach in three dimensions and look first at the time-independent
equation for the allowed spatial eigenfunctions \p(r, 6,<j>). Equation (6-16) is the
starting point for this investigation.
We solve the partial differential equation for \p by appealing again to the method
of separation of variables. The straightforward procedure runs into complications this
time, because of the intricate nature of the differential operations in spherical
coordinates. The first step is to propose a product form of solution, where the radial
dependence and the angular dependence occur in »// as separate factors:
The motive for this construction can be seen in the following rearranged version of
Equation (6-16):
2
3 3 2fxr
or or h"
We note that a purely radial operation appears on the left side of the equality, while a
purely angular operation appears on the and usefulness of this
right. The validity
observation hinge on the fact that the central potential energy V depends only on r.
Hence, the adoption of the product form \p = RY causes the radial and angular
operations in the equation to act separately on the radial and angular factors in the
eigenf unction:
d
— —Y +
dr dr
r
2
dR
—r (E- V(r))RY =
2ur 2
h
"
-RA 2
Y.
+ —r (E-
dR 2ur 2 KY
2
R
d
-r 2
dr
— dr n
V{r))R
Y
and we observe that the left side of the result depends only r, while the right side
depends only on 6 and (p. Such an equality can hold between functions of different
variables onlyif each side of the equality is a constant. We therefore set the two sides
equal to a separation constant A and obtain the following pair of equations for Y
and R:
and
2
dR 2ur
- + ——(E- V{r))R = XR. (6-24)
dr dr
314 Quantization of Angular Momentum
Note that the two differential equations are linked together because the separation
constant appears in each equation. Thus, we conclude our first step by learning that
eigenf unctions exist in the product form \p = RY, provided properly behaved solutions
of Equations (6-23) and (6-24) can be found.
Our main goal in this section is to determine the angular dependence of \p. We set
aside the equation for R(r) so that we can concentrate on Equation (6-23), the partial
differential equation for Y{0,<$>). For this purpose we need to recall the detailed
structure of the differential operator A2 , as found in Equation (6-12). We apply this
-
d Y
—
d<p
j = sin — d
da
sin —
dY
da
+ X sin
2
Y. (6-25)
This equation is set up for our second step in the separation of variables, since the
equality involves a pure <p operation on the left and a pure operation on the right.
The next move is to look for solutions in which the function Y depends on these angles
through two separate factors:
We insert this construction into Equation (6-25) and divide through by 04> to find
d 2$ d®
1 1 /
= m2 . (6-27)
the right. We
argue as before that each side must be equal to a constant, and we build
2
this property into Equation (6-27) by introducing the indicated parameter m as a
second separation constant. We then conclude that solutions of Equation (6-23) exist
in the product form Y = 0$, provided properly behaved solutions can be found for
d 2 <b
+ m2 <S> = (6-28)
# 2
and
1 d d€)
sin d0
sin i
d0
+ (
X - —^
sin-
I
= 0. (6-29)
2
Equation (6-27) has been divided through by sin in the second of these results.
We have seen differential equations like Equation (6-28) many times and are well
aware that the solutions are oscillating functions. In fact, the <J>-dependent part of this
analysis can be identified immediately with the purely azimuthal problem of Section
6-3. Accordingly, we choose the desired solutions of Equation (6-28) to have the
complex exponential form
and we reject the alternative choices provided by cos m<$> and sin m<p. Section 6-3 has
taught us that these solutions have proper azimuthal behavior only if the functions are
periodic in <J>
with period 277. This requirement implies that m must be integer valued
in Equation (6-30), as it is in the case of the rotational wave functions in Equation
(6-21). Thus, m appears again as an azimuthal quantum number.
The 8 dependence of the eigenfunction is found by solving Equation (6-29). This
part of the problem presents a type of differential equation that has not come up in
any of our previous studies of quantum systems. Let us make the following brief
remarks about the solutions of the equation, without proof, and reserve the mathe-
matical details for later discussion. We note that the domain of the polar angle 6
spans the interval [0, -n] and that the differential equation contains infinities at the
endpoints owing to the zeros of the factor sin 6 at 8 = and m. The only solutions for
0(#) that are finite and single valued over the whole interval are those for which the
separation constant A has the special values
Thus, the integer m is joined by another integer { in the determination of the allowed
solutions of Equation (6-29). These two integers are linked by a further condition, as
each given nonnegative value of / is assigned a set of allowed values of m in the
range
The resulting properly behaved solutions are labeled by the two integers as © /m ( 6)
The labels are attached to reflect the fact that the functions vary with c" and m
through the explicit appearance of the two parameters in the differential equation.
Again, we emphasize that the unfamiliar assertions in this paragraph are presented
without proof and are not supposed to be self-evident. The particular statements in
Equations (6-31 ) and (6-32) are very important results; their content is essential to our
understanding of angular momentum. We provide some of the mathematics behind
these conclusions at the end of the section.
Our main interest in the quoted results lies in the physical interpretation of the
stationary states. Let us set the stage for this discussion by combining the acceptable
solutions for 0(0) and $(<£) and forming a family of functions of the two angular
variables:
The elementary <$> dependence in Equation (6-33) tells us that Y(m must also satisfy
1 d
-T7Y, m = mY, m - (6-35)
i o<j>
The members of this set of finite single-valued functions are indexed according to the
316 Quantization of Angular Momentum
scheme indicated in Equations (6-31) and (6-32). We arrange the indices represented
by m in steps of increasing ( as follows:
{= m =
t=0 m= -1,0,1
e= 2 m= -2, -1,0,1,2
The labels / and m are known as the angular momentum quantum numbers. The physical
meaning of these symbols is the primary topic of interest in Section 6-5.
We do not intend to write down specific expressions for the functions Q fm {8).
Instead, we follow established practice and present the 8 dependence along with the $
dependence in the combined form of the spherical harmonics Yf m (6,<§>). Table 6-1
furnishes a partial list of these functions for the £ values 0, 1, and 2. The tabulated
expressions are subject to a normalization condition and include certain multiplicative
constants as a result of this property. According to universal convention, all the
spherical harmonics are normalized to unity by the requirement
i = [ \Y, m {e,<i>)\
2
dsi, (6-36)
•',11 o
/= m = *»-M*
t= 1 m = 1
m =
m= -1
/= 2 m = 2
2
m = (3cos - 1)
m= -1 F, ,
= 1/
1
—
877
sin 8 cos 8 e "*
m= -2 2fl<,-2«f>
6-4 Separation of Variables 317
where the integration ranges over < 8 < tt and < $ < 277, and where the element
of solid angle is given by
d2 = sin OdOdQ.
tabulation, has the properties expressed by Equations (6-34), (6-35), and (6-36).
One very significant point should finally be emphasized. We observe that the
potential energy V(r) has no influence whatsoever in the solution of Equation (6-23),
and we conclude that the determination of the angular behavior is independent of the
specific nature of the central force. This conclusion means that the dependence of the
wave function on 8 and <|> is always given by the spherical harmonics for every
central-force problem. We also note that V(r) appears in Equation (6-24), the
equation for the radial function R(r), and that the energy eigenvalue E makes its
appearance in the same radial equation. Thus, the energy and the radial behavior of
the eigenfunction are controlled by the choice of central force, but the angular
behavior is controlled by the mere fact that the force is central.
Detail
@(8) = p(n.
The result is a differential equation for the function P(£) of the form
d dP I m2 \
n i-n-rr
'4 y
•
+ U
\ i-r
P=
dP
—d (1 - f
2
)— + \P= 0.
dr 'tf
Let us assert, without proof, that the only finite single-valued solutions of this
equation are polynomial functions. We define the integer ( to denote the order of
318 Quantization of Angular Momentum
This highly abbreviated expression suffices because only the highest power of £
is needed in the next few steps. We return to the differential equation and
compute the derivatives
dP
+ ^,?'- 1
and
dP
—d (i -J )— =2 —d •••+^,(r ,
-r ,
)i
•
+ <f(Y- l)^'"2 - /(/ + \)a^.
A = <f(<f+ 1),
Detail
d dP m
AY+ 1)
i - V
and
d dP,
-l-J 2
-/
d£
+ ^+l)^0,
«S
and make a series of swift remarks without further demonstration. The equation
d2 d'" d dm dm
(l - f
2
)—
'
d$
7
2
P,-
f 'H(m + 1)
m P,+
((-
'
V
m)U+
A m + l)-—
m P,= '
d$
0.
df" d<, dl
64 Separation of Variables 319
2 m/2
A newfunction can also be identified by defining the expression (1 — £ )
m m
d Pe/dK, The new equation can then be used to show that the new function
.
satisfies the original differential equation for P. These arguments tell us that a
whole family of solutions is obtained for each / and for every positive m by the
construction
2 m/2
JV-tt) - (l - f ) MS)-
dV
Example
The (0, 4> )-dependent solutions are easy enough to confirm as specific cases. Let
us turn to Table 6-1 for the spherical harmonic with /= m = 1,
K„ = -l/ —
077
sin***,
and verify that this function satisfies Equation (6-34). We recall Equation (6-12)
to perform the differentiation:
A *»-v^
/ 1
sin
d
86
sin —
d6
+
i
sin'0 dp
d
sin v e
5
^— —-sinflcosfo'* +
1 1
t-t (-*'*)
sin0 do sin
e* /3
= V
V
/ 3
8tt sin 6
-(cos
2
- sin
2
- 1) = -2i/
V
—
8tt
sin0*'* = 2K, '
1
~
i
— d
<9<fj
Y„ = -i/--sin0
V 877
,"t> }"
(
•'all £2
\Yn \
2
dto= f
•'0
2
" d<t> /""sin
•'O
</0
\
— 877
sin
2
3 4
- 1
4-3 '
.,
using f = cos to transform the integration. Note that the minus sign in Y l ,
plays no part in these exercises. This sign is due to a phase convention and does
not need to concern us.
We can interpret these results at once if we recall the angular momentum representa-
tion rules
L2 -» -h 2A 2 and L, -»
h
-
i
—
d4>
d
-h 2A% m = h
2
t(S+ l)Y, m and
l
-
i
—o<j>
Y, m = hmY, m . (6-37)
The two equations tell us that the spherical harmonics are simultaneous eigenfunc-
tions of the L 2 operator and the Lz operator. Equation (6-22) transfers these
properties directly to the stationary states, since the spherical harmonics provide the
(0,<j>) dependence of the various stationary-state eigenfunctions.
Let us summarize our observations in the following two statements. The stationary
states are eigenfunctions of the square of the angular momentum such that
The stationary states are also eigenfunctions of the z component of the angular
momentum such that
These statements express the first principles of angular momentum quantization. The
occurrence of integers means that the allowed values of L and L, are discrete. The
eigenvalue properties also imply that the allowed values occur in the stationary states
6-5 Angular Momentum Quantum (lumbers 321
This property of the angular momentum stems from fundamental quantum principles
and deserves a closer examination. Let us identify the three Cartesian components of
L according to the rule
h
L^ -A
i
by writing
d d d d d d
A x =y-
oz
z—
ay
, A = z-
ox
x—
oz
, and A = x-
2
ay
y—
'ox
. (6-38)
It is not difficult to prove that the three differential operators obey the following
multiplication formulas:
A X A,-A,A,= -A z ,
A A, - A,A,= -A,,
v
A,A,-A,A,- -A r (6-39)
The proof of these remarkable relations is left to Problem 9 at the end of the chapter.
We wish to show that the uncertainty principle is at work in Equations (6-39) by
demonstrating that exact nonzero values cannot be assigned to all three components of
L. We proceed by assuming the existence of a state ^ in which L has the precisely
specified vector value
h(<?x x + yy
<? + ri).
The factor h is included for convenience, and the { 's are introduced as dimensionless
parameters. This hypothesis about the state Sk can be formulated in terms of
eigenvalue equations:
ft
h
-A.* = i^t. (6-40)
i
Let us select the last of these statements and use the first of Equations (6-39) to obtain
Figure 6-7
\f2-h.
Uncertain
We conclude that the parameter c*z vanishes in the given state. Similar procedures
can also be used to obtain vanishing results for t"x and t* in the same state. It follows
that the vector L is allowed to have a precise vector value only if the state has angular
momentum equal to zero. Only one of the observables L L and L. can be specified x
, ,
with zero uncertainty in more general circumstances, and L. has been taken as the
chosen component.
Our quantization statement concerning L, tells us that each value of the quantum
number t° is assigned 2c" + 1 values of the quantum number rn between — ( and c"
precise direction for L is and so the vector may lie anywhere on a cone
not defined,
whose axis points in the z direction and whose apex angle is determined by the
quantum numbers c* and m. There are 2c" + 1 such cones corresponding to the
different m values for the given c° value. Figure 6-7 illustrates this state of affairs in
the t" =
There are three uncertain orientations for L, with three definite values
1 case.
of L z and with random values of L x and L
, subject to the length constraint ,
y
smallest angle between L and the z axis is equal to cos~' JS/(t+ 1) , an angle that
approaches zero only for very large values of the quantum number { . In this limit the
states having m = +£ are as close as possible to the case of a classical L vector with a
fixed direction. Figure 6-7 shows a situation far from this limit, where €= 1 gives a
minimum angle of 45° between L and the z axis.
The central-force problem is said to have rotational symmetry since the dynamics is
would be unrealistic to insist (or deny) that the angular momentum must have discrete
values, and to claim that the direction of L must be uncertain, for such a classical
object. However, we can expect to find firm evidence for these notions in submicro-
scopic systems, where angular momentum quantization becomes a vital consideration.
Example
Let us recollect the rigid-body problem from the example at the end of Section
6-3 and show that a connection develops between the quantization of angular
momentum and the quantization of energy. This direct link comes about
because the problem is defined by angular variables alone, as in Figure 6-6. The
stationary-state eigenfunctions are known to satisfy the partial differential
equation
21
A 2^ 2^-
</7 m = Y, m (6,<!>)
Ee( = —
2I
/(/+ v 1).
Note that the energy depends only on ( and that each / has 2/+ 1 different
wave functions corresponding to the different values of m. Thus, each of the
quantized energy levels Ef has 2/+ 1 degenerate states. Equation (6-36) tells us
that the wave functions are automatically normalized to unity over the total
solid angle:
= ( \%J
2
dQ.
•',11 o
Equations (6-37) imply that the stationary states are eigenfunctions of the Lr
operator and the L, operator:
h d
-h 2A2 *, m = hW+ l)% m and ~^-% m =
i o<j>
fim% m .
2 2 2
\%J = \Y,J = \G, m (0)\ -
This simple observation has deeper roots in the uncertainty principle. We note
that the L. operator (h/i)d/d<f> is related to <£ as the momentum operator
{h/i)d/dx is related to x. Therefore, we might suppose that an uncertainty
principle for <f>
and L z should hold by analogy:
h h
AxAft
!x
>- => A<J>AL.z >-.
2 2
3 3
l*u|
2
= l*.-,f
2
= ^sin 2 and |* 10 |
2
= —cos 2 0.
space by rotating these drawings around the z axis, as suggested in the figure.
We let the distance from the origin to the resulting surface represent the value of
2
I^J in that direction for the given value of t and m. Therefore, the quantity
2
\^/„,(&, <j>, t)\ dQ, expresses the probability of finding the rigid body in an
element of solid angle dQ, oriented in a direction defined by the angles (6, <p).
Let us suppose that observations of this orientation are made for a large number
6-6 Parity 325
of systems with energy E( . Since E( has a (2/+ l)-fold degeneracy, the results
of the measurement must conform to the average of the 2^4- 1 probability
densities for the set of degenerate states with the given energy. This quantity
always turns out to be a constant, independent of angles for any { . To illustrate,
let us consider our results for ( = 1 and compute the corresponding average:
1(3
1
{i*, 1^
* HI I
2
+ 1^
T I
-
3 \
—
4tt
(sin
2
+ 2
cos 0)
7
\
= -
477
1
The rotational symmetry of the problem is the basis for this conclusion. The
result cannot depend on angles because the dynamics does not identify a spatial
direction to use as a natural choice of orientation for the z axis. The (2/+ 1 )-fold
degeneracy of the energy levels is the manifestation of this symmetry in
wave-function language.
6-6 Parity
unique to quantum physics and has no natural place in classical mechanics. We have
326 Quantization of Angular Momentum
Figure 6-9
Space inversion of the coordinate axes. The coordinates transform from (x, y, z) to
(-at, -y, -z).
seen the idea of parity in one dimension. In particular, we have learned that a
symmetric potential energy V(x) leads to eigenf unctions \p(x) with either even or odd
behavior under the reflection x —> — x. The parity operation in three dimensions is
additional symmetry of V means that parity offers other information about the states
in addition to the angular momentum interpretation. Figure 6-10 shows how the
angular coordinates are affected by space inversion. Since the radial aspects of the
solution have nothing to do with this operation, and since the general spherical-
harmonic form of the angular solution is already known, it follows that the parity
properties are available for immediate inspection just by looking at the behavior of the
spherical harmonics.
Figure 6 10
and the azimuthal angle is measured from the new x direction as w + <£>. The radial coordinate
r is the same in the two coordinate systems.
6-6 Parity 327
Figure 6-10 shows that the spherical coordinates (r,0,<]>) are transformed in the
new spherical coordinates (r, m — 6, v + $). Let us apply this
inverted system into the
transformation to a stationary state with angular momentum quantum numbers if
and m, and with angle dependence given by F/m ( #,(£). The parity operation causes
the spherical harmonic to undergo the replacement of variables
The new function turns out to be unchanged in value except for a possible change in
sign. We refer back to Equation (6-33) and obtain a sign-changing factor through the
relation
Y, m {ir - d t
v + *) = {-\)% m {e,+). (6-41)
This formula tells us that the stationary states have definite parity given by the factor
(— 1) j in addition to the angular momentum properties associated with the quantum
numbers £ and m. The parity is determined solely by i such that states with even /
have even parity, and states with odd ( have odd parity.
Our deductions about parity are going to seem rather sterile until some physical use
for the concept arises. Parity makes its appearance here as part of a classification
scheme for certain types of wave function. Let us accept this notion on kinematical
grounds and wait for dynamical applications to come up in due course.
Detail
Let us establish the relation between @^ OT (7r — 0) and Q fm {6), and supply the
missing link leading to Equation (6-41). We return to Section 6-4 and recall that
our solutions for 0(0) have the form
,/'"
^) = (i-n n/2
^(n, r = cos
f
—> — £ since cos(tt — 6) = —cos 6.
/>,(-{) = (-i)^(f).
328 Quantization of Angular Momentum
m
>\""/2
d
^ m (o-/v m (-n = (i-n d(~0 M-i:
</'
Example
Let us test the parity formula by checking the behavior of the spherical
harmonic K n :
-\
V
—
877
sin0(- (0 +)
The result agrees with Equation (6-41), since the parity of K n is supposed to be
odd.
The central-force problem is solved in two stages after the separation of variables is
introduced in Equation (6-22). All aspects of the angle dependence are established
first on grounds of rotational symmetry, without any regard for the actual nature of
the central force. The specific dynamics enters the problem through the choice of
potential energy V(r), only when consideration turns to the solution for the radial
dependence. This next phase of the procedure also includes the determination of the
allowed energies of the system. Thus, the two-stage approach demonstrates quantiza-
tion of angular momentum and then quantization of energy, as the investigation
proceeds from angular behavior in Equation (6-23) to radial behavior in Equation
(6-24).
6-7 Quantization ol Energy 329
Our attention shifts to the differential equation for the radial factor R(r) in the
stationary-state eigenfunction. When we take Equation (6-31) into account and
rearrange terms, we find that Equation (6-24) becomes
2
dR h
2jur
2
dr dr
V(r) + --JV+
2/xr"
1) R = ER. (6-42)
An alternative radial equation can also be obtained by using the second version of the
2
formula for V given in Equation (6-8). If we substitute this expression for the radial
derivative in Equation (6-42) and multiply through by r, we generate the equivalent
radial differential equation
fi
2
d2 h
2
Our interest is drawn immediately to the close resemblance between the second form
of the equation and the classical energy relation in Equation (6-2). We see that the
radial differential equation follows directly from the classical formula when the two
replacements
—
n
—d\ and L -> hY(£ + 1) (one dimension)
i dr J
are introduced as operations acting on the radial function rR(r). Note that the
substitutions are specifically labeled as one-dimensional rules, intended only for
application to the radial dependence of the stationary states. These replacements
should not be confused with the three-dimensional operator formulas for p~ and L~
found in Equations (6-9) and (6-12).
We collect the terms in brackets in Equations (6-42) and (6-43) and define the
effective potential energy for the quantum central-force problem as
2
h
ya t(r)= V(r)+ -—J{{+
2jur~
1), (6-44)
by analogy with the classical expression in Equation (6-3). This ^dependent quantity
provides the means by which the given potential energy V( r ) makes its entrance in the
problem. The energy eigenvalue E also makes its appearance along with V(r) in each
of the two versions of the radial differential equation. The structure of Equation (6-43)
is especially interesting because this differential equation for rR( r ) is identical in form
to Equation (5-25), the differential equation for the eigenfunction \p(x) in one
dimension. The parallel between these equations suggests that our insights about the
solution ip(x) for a given potential energy V(x) can be transferred virtually intact to
the solution rR(r) for a given effective potential energy Veff (r). Of course, we must
allow for a certain distinction between the variables, since x varies over ( — oo, oo) in
one dimension while r varies over [0, oo) in three dimensions. This observation tells us
that the question of evenness or oddness does not apply to the r dependence, and it
problem, since the explicit appearance of the quantum number £ in the radial
equation causes the form of the differential equation to vary with the choice of £ . It
follows that every £ has its own sequence of allowed results for E and R(r) and that a
must be introduced to label all the radial solutions. We see that
pair of integer indices
£ needed as one of the labels to define the differential equation through the £
is
dependence of Veff and that another new quantum number, denoted by n, is also required
to list the quantized solutions for a given choice of £ . We identify the family of radial
functions and energy eigenvalues by the notation
R„s(r) and EH „
2 2
h d d h
-r 2 - +
\
2
V(r) + -—>£(£+ !)*„,= E n ,R n ,. (6-45)
2fir dr dr 2jir
This formula indicates a different operation on the left side of the equality for each
integer £ and produces a correspondingly different set of allowed solutions as a result.
We use £ to label the various sets, and we use n to index the members of each set.
We have proceeded to the solution of the central-force problem by following a
series of steps beginning with Equations (6-14) and (6-22). The result of these
procedures is a collection of stationary-state wave functions with the general structure
We note that the states of this three-dimensional problem are specified by the
assignment of three quantum numbers. The angular momentum quantum numbers £
and m
have already been interpreted in Section 6-5, and especially in Equations
(6-37). Let us return to these two eigenvalue formulas and observe that the eigenfunc-
tion behavior can be transferred directly to the stationary states:
-h 2A2 %, m = h
2
£(£+l)%, m (6-47)
and
h d ,
i o4>
The stationary states and energy eigenvalues are also designated by the third
quantum number n, whose meaning has been described in general terms in the
6-7 Quantization o( Energy 331
ih — %, m = E
ot
nf%, m ,
(6-50)
<
whereby the wave function \f
nfm is identified as an energy eigenf unction with energy
eigenvalue E nf .
We may assume that n labels the energy levels for a given ( such that the energies
increase with n, as is the case in one dimension where n is the only quantum number.
The one-dimensional problem has taught us that the larger energy eigenvalues belong
to the eigenf unctions \p(x) with the greater numbers of nodes. The same must be true
for the function rR{r) in three dimensions, since $(x) and rR(r) satisfy similar
differential equations. This correspondence suggests that we call n the radial node
quantum number and identify the integer with the number of nodes of the radial
function R n {. We illustrate these remarks in the example below.
The main conclusion of the central-force problem is the fact that wave functions
exist as simultaneous eigenfunctions of the energy, the square of the angular momen-
tum, and the z component of the angular momentum. We have just observed that the
stationary states ^n(m are endowed with these remarkable properties. Such findings
are noteworthy indeed, because the situation enables us to determine all three
independent physical quantities at the same time with zero uncertainty. Again, the
rotational symmetry of the central force is the essential ingredient behind this
It should be noted that the quantum number m is not among the indices for the
quantized energy in Equation (6-50). This integer does not appear anywhere in the
radial differential equation, and so the absence of m from the allowed solutions for E
and R(r) an expected result. The existence of a degeneracy is signaled by the
is fact that
E n( does not depend on m. Degenerate states occur for a given choice of n and c"
R{r) -» Ar e as r -» 0, (6-51)
1
Figure 6-1
.« =
^3
where A an unspecified constant. We verify this behavior in Equation (6-45) by
is
satisfy the differential equation. The proposed power law results in the equality
d dR
—
dr
r
2
—
dr
= t(t+ \)Ar'= t(t+ \)R near r = 0,
and so the first and third terms achieve the desired solution for small values of r.
(Note that a 1/r power law also satisfies the equality but introduces an unaccepta-
ble divergence at the origin.) Figure 6-1 power behavior of R n {{r) for
1 illustrates the
Example
Let the potential energy be a linear function of r and consider the /= case, so
that the effective potential energy is written as
Vef! = V=cr.
We are concerned with radial solutions of the form rR(r), and so we turn to
Equation (6-43) and rearrange terms to get
— (rR)=-^[E-V(r)](rR).
2
The analytical solutions of this equation are rather unfamiliar for a linear
6-8 Observables in Spherical Coordinates 333
Figure 6-12
Energy levels and radial functions rR(r) for a linear potential energy with £= 0.
V=V,
potential energy. Let us proceed graphically instead and deduce the shape of
rR{r) for the two lowest /= energy levels. The differential equation describes
the curvature of the solution relative to the value of the function at each point r.
The function rR(r) is subject to the condition that the solution must vanish at
crease, the corresponding solutions exhibit more curvature over larger intervals
and display more nodes in the region where nodes are allowed. These arguments
tell us that the first solution should have only the one node at r = and that the
second solution should have a second node. Figure 6-12 shows Vcff along with the
two lowest energies E nf for n = 1 and n = 2 in the tf= case. A qualitative
sketch of the two ^=0 solutions is also included to illustrate our remarks about
curvature and nodes. Note that each energy gives a single classical turning point
where the curvature changes sign and that each solution decays without
of rR(r)
nodes for values of beyond the turning point. The £— case is special because
r
the centrifugal contribution does not appear in the effective potential energy.
We leave the /=£ situation to Problem 14 at the end of the chapter.
The main goals of this chapter are fulfilled in Equation (6-46), the expression for the
stationary states. Thesewave functions exhibit quantization of angular momentum
and quantization of energy as compatible properties of the central-force system. The
methods can now be used to evaluate and interpret certain physical quantities in
terms of the angular momentum framework.
We begin with the probability properties of a general wave function ^! . The
2
corresponding probability density |^(r, 6, <$>, t)\ represents a probability per unit
volume in spherical polar coordinates. Figure 6-13 shows that the element of volume
in these coordinates is given by the infinitesimal quantity
dr = r
2
dr sin0 dd d<$> = r
2
drdtt,
where dQ, denotes the familiar element of solid angle. The wave function is normal-
334 Quantization of Angular Momentum
Figure 6-13
r sine d<t>
1 = \*\
2
dr.
j
all space
2
Thus, the element of probability \^(r, 0, <£, /)| d-r expresses the likelihood of finding
the particle in the volume di at the location (r, #, </>) at time /.
Let us examine the normalization condition for the special case of a stationary state
of the kind described in Equation (6-46). The multiplicative dependence on the
variables enables us to split the three-dimensional integration into separate radial and
angular factors:
all space
00 r
2 2 :
Jr r dr dSl\R n Ar)\ \Y, m (6,<i>)\
n •',11
'all O
a
If we then recall the normalization of the spherical harmonics from Equation (6-36),
we find that the normalization condition for a stationary state reduces at once to a
single integration over r:
/•OO
2
1 = /
\R n Ar)\ r
2
dr. (6-52)
2
Pn,(r) = r \R n ,(r)\
2
,
(6-53)
1 = CP Ar)dr.n
We interpret the differential element Pnf {r)dr as the probability of finding the
6-8 Observables in Spherical Coordinates 335
We note that Equation (6-53) contains no dependence on the time, and we emphasize
that P„/-(r) is defined only for a stationary state (hence the indices n and c"). Attention
2
should also be drawn to the explicit appearance of r in Equation (6-53), since this
factor causes the radial probability density to vanish at the origin for every <^ n ^ m .
2lf+2 2 '
Equation (6-51) tells us that P„^(r) behaves as r while l^/J 2 behaves as r , ',
near r = 0. This distinction in behavior at the origin arises becauses the radial
probability density is defined with respect to the thickness of a spherical shell, whose
2
surface area supplies the extra factor of r .
t/h
dependence of ty„f m are given by the simple phase factors c""* and e^' En ' These .
functions have unit modulus, and so the probability density is independent of 4> and /
in such a state:
22 2
\%*J -\Rn*(r)\ \Q< m (*)\ - (6-54)
The evaluation of any physical quantity depends on the state of the system and
involves the determination of an expectation value. We evaluate a given observable
by performing measurements of the quantity for a large number of separate systems in
the same state \K The average of these measurements corresponds to the expectation
value of the observable in the given state. We follow the rules established in Chapter 5
and express this expectation value in terms of an integral
f^*Q^dT,
reduces to a simple exercise in each case, because of the angular momentum properties
of the states.
We look first at the expectation value of r in the stationary state ^„ /m :
all space
The calculation simplifies immediately when Equation (6-46) is inserted and Equation
(6-36) is employed:
2
(r>= / r dr rffl r\Rjr)\ 2 \Y, m (6 , <|>)|
2
/
•'all Q
/•oo /-oc
= / r
3
\R„,{r)\
2
dr = / ri>fl/ (r)<fr. (6-56)
•'o •'o
Note that the final result can be given in terms of the radial probability density and
that the expectation value of any function of r can be similarly expressed:
when ^ is not a stationary state. We note that Equation (6-58) may depend on t,
while Equation (6-57) must be time independent and stationary, as expected for a
stationary state.
Let us turn next to the computation of expectation values for some of the important
angular operators. We start with the evaluation of L in the state ^„/ m and recall the
operator formula from Equation (6-12) to find
(L 2 ) = h
2
at+ l)f%%%, m dr
= h
2
t(t+ 1). (6-60)
We proceed in similar fashion to the expectation value of L, in the same state and use
the operator formula in Equation (6-10) to write
= hm. (6-62)
1
Equations (6-60) and (6-62) tell us what the measured values of L and L z should be,
2
<U 2
)
2
> = j%*, m {-h 2K2 ){-h 2K2 )% fm dr = [hW+ i)]
and
h d ( h d
< L-> = /*"*- *„a„^ = (ton)'
\1T*
2
<(L 2 ) ) = (L 2 ) 2 and {L]) = (L Z ) 2
for a stationary state, and hence confirm the claim that the uncertainties AL~ and
AL, are equal to zero in any such state. We conclude that the determination of L
1
and L z always gives the answers expressed in Equations (6-60) and (6-62) for every
measurement of every system described by ^,„/ m The examples at the end of the .
The last item on our agenda pertains to the orthogonality of the stationary states.
The proof of this property involves the use of vector calculus and is left as Problem 16
at the end of the chapter. Orthogonality has already appeared as a mathematical
concept in one dimension and is formulated as before in terms of the spatial
eigenfunctions in the stationary states
iE -' t/ ''.
%, m = t n , m (r,e,4>)e-
We let 4*n s m and *P„ t m be any two different members of the family of eigenfunc-
tions and write the orthogonality condition as
/ \L* , \b , dr = 0. (6-63)
all space
This integral vanishes whenever the two sets of quantum numbers («,/,?«,) and
(n 2 f2 m 2 ) are not the same in all three integers. The spherical harmonics have a
338 Quantization of Angular Momentum
whenever (^m,) and (t2 m 2 ) are different pairs of quantum numbers. These integral
formulas are used frequently as calculational tools to supplement the normalization
properties of the various functions. Their usefulness is illustrated in the following
exercises.
Example
We have concentrated on the properties of a pure state *„/„, and should now
consider a more general kind of wave function. We know that a superposition of
these states is a solution of the Schrodinger equation, so let us construct our wave
function by combining two pure states with different L, quantum numbers:
,,,/ *-
* = ^f (*... + *n-i) = ii(F " + ir
i-i)'""
7r*
We note that *,,, and *,, ,
are degenerate, so that the t dependence of * has
a unique frequency. It follows that a measurement of the energy in this state
must yield the result
2 2
/|^| rfT=-/|^ ul 2 rfT+-/|^ u _ | 1 |
rfT=- + -.
We observe that the cross-terms drop out of the calculation because of the
orthogonality property, and we recall that every *„^ m is normalized by the
condition
1=/ I*./.!
2
*.
^all space
2
-h 2A 2 * = 2h
2
V and ( -h 2A2 )(-h 2A 2 )* = (2a
2
)
*
and apply these eigenvalue relations along with the normalization condition to
obtain the expectation values
and
2 2
<(L 2 ) ) = [**{-h 2A2 ){-h 2A2 )*dT = (2h
2
)
.
6-8 Observables in Spherical Coordinates 339
The uncertainty in L, is not zero for a particle in the state ty. To establish this
property we first consult Equation (6-48) to find
h 8
i o$
and
h d \( h d .
d
(L z ) = f**-—*dT=-f(%*u + *,!_,)*(*,„ - *„_,)rfT = o
and
^/""(^(t^H
1
2
(AZ-J = (I 2 ) - (L z ) 2 = h
2
so AL = 2
h.
We consult Table 6-1 and find that the angle dependence of the wave function
is given by
= —i\ —
V 277
sin 6 sin d> = — M/
V 277 r
.
y\2
\Yn + Y 1
_,|' = —3
Z77
sin'0 sirnf, = r
277
3 /
\ r
340 Quantization of Angular Momentum
This probability density is not symmetric about the z axis, unlike the situation
for a pure eigenf unction of L,. Instead, the distribution has lobes of maximum
density aligned along the ±y axes, so that the probability of finding the particle
is somewhat localized in the two directions 4> = 77/2 and <J>
= 377/2. Some
localization in <p is expected since the uncertainty in L, is not zero. The fact that
2
l^l behaves as (y/r) 2 tells us that the probability distribution is symmetric
about the y axis and suggests that our state is an eigenfunction of L y It is .
therefore apparent that a choice of polar axis in the y direction would yield a
simpler description of the state. We should recognize from the outset that we are
considering a rather contrived wave function. The state ^ is stationary since
l^l
2
is not time dependent, even though ^ is not a pure state of the form ^nfm .
Example
\U —
/?"
(<i,
°°
+ \U \ =
F)
(
*
p
10
v
°°
f-'
E \«'/''
^
+ R
nnV / io e
-' E \x'/ h
\
)•
d
'
10 °
"*"
"())
= ^10 MOO "*"
^UMIO
rit
and
I ih
^T: ih
T: (moo + *no) = £ 2omoo +
i ^?1*1 10-
dtj\ I
dt
I
(E) = [**ih
J
—8 *d
dt
at
1 1
f
Problems 341
and
2
(Ais) = (is
2
) - (is)
2
= \(El + E n2 ) - \(E l0 + E u f - \{E n - E w )\
and so
A£=i(£ n -£ 10 ).
Our conclusions should seem reasonable for an equal-parts wave function, since
the average energy (is) lies midway between the two levels E u) and En , and
the energy uncertainty A is gives the deviation in energy on either side of the
average. Let us apply the same logic to the calculation of (E 2 ). We have an
equal-parts admixture of £= 1 and /= eigenfunctions, and so we expect to
find
(L 2 ) = l
2
(2h
2
+ Oh 2 ) = h
2
and M 2
= l,(2f) 2 - Oh 2 ) = h
2
.
Hence, measurements of L z must always yield the value zero for any system in
the state ^ '.
Problems
1. Determine an expression for rmin , the distance of closest approach on the orbit of a
classical particle, for the case of a repulsive Coulomb potential energy
b
V(r) = -, b>0.
r
b
V(r) = --, b>0,
r
and determine expressions for the radial turning points when E negative and when E
is is
positive. Note that, for E< 0, there is a minimum possible E for each nonzero value of
the angular momentum L.
3. Use the operator results for L x , Ly and , L,, as found in Section 6-2, to derive the operator
formula for the quantity L2 = L2 + L2 + L2 . ,.
342 Quantization ot Angular Momentum
4. The particle in Figure 6-5 has a wave function ^(^>, /) and a probability density
P(<f>, t) = 1^(0, t)\
2
. The wave function satisfies the equation
2
f>- d d
2 2
2iiR <9</> dl
in which V(<j>) is included as a real- valued potential energy. A probability current density
j(<p, I) can be defined such that j and P obey the probability conservation law
5. Set V = in the previous problem, and let the wave function have the form
where m is an integer. Use the result of the problem to compute j(<f>, I) for this state, and
interpret the answer.
6. Consider a bead constrained to move on a circular wire, as in Figure 6-5, and assume the
existence of a barrier at <j> = that the bead cannot cross. (Imagine an inaccessible region
— <£ < () <f>
< <j>
()
in the limit <j>
()
—> 0.) What condition on the wave function SE'(^), /) does
this stipulation imply? Determine the allowed energies and the corresponding eigenfunc-
tions. Are there any degeneracies among the states?
7. A particle of mass \i moves freely on the surface of a sphere of radius R. Express the
classical energy relation in terms of the angular momentum L, and use the operator
formulas to deduce the Schrbdinger equation for the system. Let the wave function ^ be
an energy eigenfunction with energy eigenvalue E and spatial eigenfunction \p. What is
8. Refer to Table 6-1 for the spherical harmonic Y22 , and verify that this function satis-
the system, and indicate the quantum numbers and degeneracy of each level. Let the
separation between the hydrogen atoms in the molecule be 0.075 nm, and calculate the
2
numerical value (in eV) for the energy scale unit h /I, where / denotes the moment of
inertia. Determine the allowed transition energies if the transitions obey the selection rule
A<?= ±2. Calculate the emitted wavelength corresponding to the transition t= 2 -» /=
0. (Symmetry principles prevent transitions in which £ changes by one unit.)
11. Refer to the rigid-body example in Section 6-5, and consider the state obtained by
superposing two degenerate normalized £= 1 wave functions:
Problems 343
Write out the explicit expression for ^ and show that ^ is normalized. Calculate the
expectation value
, ft d
(L.) = /** ^dQ,,
J i d<p
and determine the uncertainty AL. in this state. Show that the probability density
depends on <J>,
and interpret this observation in terms of the uncertainty principle.
12. Refer to the rigid-body problem at the end of Section 6-5, and calculate the average of the
five probability densities for £= 2:
2
\
5
I l*2 J -
m=-2
Interpret the result.
13. Determine the parity of the spherical harmonic X,, by examining the explicit behavior of
the function under the angle-inversion operation (0,4>) —* (w — #, 77 + $).
14. Consider a linear potential energy V(r) = cr, and assume a nonzero value for the angular
momentum quantum number t '.
Make a drawing of the effective potential energy and
identify the regions in which classical motion is not allowed. Consider the two lowest
energy levels, E x/
, and E2/ , and sketch the behavior of the solution rR(r) for each case.
15. Consult Table 6-1, and determine the values of 6 for which the probability density van-
ishes in each of the five (=2 stationary states. Sketch the resulting conical surfaces of
constant 0.
\b* , \L , dr = 0.
/ all space
Prove this result for the case of two nondegenerate eigenfunctions by proceeding as
Assume a real-valued potential energy, and use an identity from vector calculus to
complete the proof.
17. Verify the orthogonality properties of the pairs of spherical harmonics K2 ,
and Y20 and ,
K2I and Yu by , explicit calculation of the integrals fY2 ^Y20 dQ, and fY2 *Yu dQ..
344
7-1 The Radial Differential Equation 345
Let us generalize the hydrogen problem immediately to include two features that have
already appeared in the Bohr model. We assume that a single electron is bound to a
nucleus of charge Ze, and we treat the atom as a two-body system in motion with
center of mass at rest. The groundwork for the quantum construction of this problem
has been laid in Chapter 6, beginning with our discussion of Figure 6-1. For the
electron-nucleus system with masses m and M, we
e
define a reduced mass
M
r e
M+m e
and then cast the analysis in terms of an equivalent one-body problem to describe the
behavior of a particle of mass fi. The correction factor M/(M + m e )
is very near
unity; nevertheless, this generalization is kept in the problem (at no extra cost)
because the reduced-mass effect is subject to experimental test. The binding force is
due to the Coulomb attraction between the charges Ze and — e. Therefore, the
potential energy of the system is given by
Ze 2
nr)=-~47T£ r
,
(7-1)
The angular momentum quantum numbers c" and m convey the angular momentum
properties of these states, as discussed in Section 6-5, while the third quantum number
n remains to be clearly identified. We can determine the r dependence of a wave
function with angular momentum quantum number t° by solving the differential
equation found in Equation (6-42):
2 2
dR
ft
2jur- dr
d
r2 —dr
+ V(r)
h
+ —<<(<>+
2jir-
1) R = ER, (7-3)
where R(r) and E denote the radial function and energy eigenvalue appearing in
Equation (7-2). The Coulomb potential energy in Equation (7-1) is employed for V(r)
to give the explicit form for this radial equation. The result is an unfamiliar
second-order ordinary differential equation whose solution must be investigated in
some detail.
We gain considerable insight into the radial behavior if we first restrict our
attention to wave
functions ^fnim with no dependence on angle. Necessarily, these
spherically symmetric solutions have angular momentum quantum numbers
£= and m = 0,
346 The One-Electron Mom
since Y00 (6, <$>) is the only possibility for a constant angular function. We insert this
information along with Equation (7-1) into Equation (7-3) and find that R(r) satisfies
2
h l 2 \ Ze 2
\R" + -R'\ R = ER. (7-4)
Since the solution is expected to approach zero as r — > oo, it is reasonable to try an
exponentially damped radial dependence:
where the parameter a has units of length and sets the scale of distance to describe the
damping. This trial function provides a solution to Equation (7-4) for all values of r.
To prove our point we compute
A R
R' = - -e~ r/a = - -
a a
and
R" = —A e-^
2
/a
= —R 2
.
a a
Then, when we put these results into Equation (7-4) and divide by i?(r), we obtain
2
h t 1 2 \ Ze 2
= E.
2ju \ a ar J
4ire r
The equality holds for all r, provided the coefficient of \/r vanishes:
2
h Ze 2
= 0.
jUfl 47T£
a =
Ze
—
4<7TE
7,
2
ti
.
We consult Equation (3-51) so that we can rewrite the formula in terms of the Bohr
radius a :
2
m
a=
e
a
-, where a = —
4-7TE h
. (7-5)
ju Z e'm e
The energy eigenvalue is also obtained by returning to the equality and comparing
7-1 The Radial Differential Equation 347
2
h
2
2fia
2 2 2 2 2
2
h l M e ) e n'
e
~ >'"( 1
2f x\m e
47re
j
47re
0/ 1 2/T
We recognize this result as the Bohr formula for the energy of the ground state. Let us
introduce the Rydberg energy E into the expression by writing
2 \
2
/
E= Z EQ 2
, where E = (7-6)
y
47re j
2^ 2
as in Equation (3-53).
These conclusions are reassuring since our attempt at a simple solution has
produced an £= wave function with the correct ground-state energy for the atom.
The full wave function for this stationary state is written in the form
-r/a
e ^B.ihr
Y I00 / 7 e ' V' ' /
yira*
Inn
°° '477
and we observe that our result can be expressed according to Equation (7-2) by
identifying the radial function and energy eigenvalue as
R w -)l '-"-
7
and
u
F = ZT Bohr _ 7 2p
m.
Other (= m = wave functions with n > 1 also exist as solutions of Equation (7-4)
for other discrete values of E. We proceed more systematically to find all the radial
solutions, for tf= and for ^# 0, in Section 7-2.
Let us conclude these opening remarks by quoting a useful integral formula:
/•OO
r/r
= +l
/ r"e- °dr n\rj . (7-8)
Example
2r/a
/l*,ool
2
^ (duTr
Jn
2
dr-
na~
4 77
-:
[ r
2
e~
2 '/ a
dr = -2\ - = 1.
Tra
We are concerned with the discrete bound states produced by the force of Coulomb
attraction in the one-electron atom. The corresponding energy levels are found by
solving the radial differential equation with V(r) given by the Coulomb potential
energy. This mathematical problem has a remarkable solution with a very interesting
physical interpretation.
We begin by adopting an equivalent alternative to Equation (7-3). Let us return to
Equation (6-43) and insert the Coulomb potential energy to obtain
fi
2
d2 Ze'
2
R Y(Y+ \)R = ER. (7-9)
2\ir dr 477e r 2/x;
We have learned in the Bohr model, and again in Section 7-1, that the Bohr radius a
and the Rydberg energy E ()
are natural parameters to use as scales of length and
energy in the atom. It is fruitful to introduce these scale parameters in Equation (7-9),
since the result is a greatly simplified expression for the differential equation. To this
end we refer to Equations (7-5) and (7-6) and define a dimensionless variable p and a
dimensionless eigenvalue tj by writing
2
4iTE h
"/' (7-10)
Ze fi
and
E=
m
Z E 2
r) = -- —
l\xa~
jtj. (7-11)
r
/r
-apR
r R + e(e+ \)r = j^,
3 2 2 2 2
2ixa p dp \xa p 2[ia p 2\i.a'
2 2
and cancellation of the factor —ti /2\ia p produces the final form
d2 /M+
— pR + 2R-
dp
1)
-R = -qpR.
7-2 Solutions of the Radial Equation 349
Thus, the use of the quantities p and tj enables us to remove all the excess constants
from the radial differential equation.
# = ,-Ap-1^. (7-12)
—
'
d* d2 r p = —(e-fi
rp r- r d
1
pR = —e-fi F F' - J^e-^ P F)
/ \
dp dp- dp
and so the differential equation can be stripped of the factor e~v np to read:
2 SU+ 1)
F" - 2yft]F' + -qF + -F F= t\F.
P p-
These maneuvers lead us to the final version of the differential equation for F(p):
2 <f(7+ 1)
F" - 2Ji]F' + F=0, (7-13)
P P
F(0) = 0. (7-14)
We impose this requirement so that the radial function R in Equation (7-12) remains
finite at the origin.
It is not obvious that any real progress is made by this procedure, since the
unknown function F obeys a differential equation that is just as unfamiliar as
the original equation for the desired function R. Let us look again at the solution for
the ground state to relieve some of this skepticism. We know from Section 7-1 that the
radial function has the form
for this state, in clear agreement with Equations (7-13) and (7-14). Thus, we see that
F is a simpler function than R in the case of the ground state, and we might suppose
that this property of F relative to R persists for the other states of the atom.
350 The One-Electron Atom
The quantum number { can have any integer value in Equation (7-13). It is
instructive to isolate the ^=0 case and begin the investigation with the differential
equation
We may not recognize this as one of our familiar differential equations, but we can
still solve the equation by means of standard methods. The results of these procedures
include a determination of the dimensionless parameter tj in terms of an integer n:
A = -- n
(7-16)
This integer arises from the fact that F must be a polynomial in p, where n defines the
order of the polynomial. Thus, the integer n serves as a quantum number to label a
whole family of /= solutions. We examine the mathematics behind these conclu-
sions in more detail at the end of the section.
Equation (7-16) is our main result for the <f = case. The result tells us, through
Equation (7-11), that the energy eigenvalue is given by
u En
En0 =-^-Z*-i (7-17)
m t
n
in the nth (=
state. We follow the notation of Equation (7-2) and label the values
of by the integers n and t, taking <f = 0. This conclusion is very striking because
E
the formula reproduces the one given in Equation (3-52) for the «th energy level of
the Bohr model. Let us draw attention to this point by setting
Bohr
E n0 = £„ (7-18)
and writing
* n oo
Taken together, these expressions give the eigenvalue and eigenfunction of the energy
in the nth /= state. The radial function must have the form
according to Equations (7-12) and (7-16). The details at the end of the section tell us
why F must be a polynomial and how n defines the order of the polynomial. We see
that the ground-state wave function of Section 7-1 is found among these results when
the n = 1 level is chosen.
The solution of Equation (7-13) proceeds in similar fashion for the /=£ states.
The energies allowed for each £ are independent of the value of t'.
7-2 Solutions of the Radial Equation 351
We express the general result for the energy levels of the one-electron atom in the
Ene = --Z^=E^\
m n
(7-19)
e
Thus, only the one quantum number n is needed to determine the energies, as in the
Bohr model, even though two quantum numbers are normally required to label the
energy eigenvalues in a central-force problem. Of course, the radial functions vary
with t* as well as n, since the differential equation for F(p) contains an explicit
^dependent term.
We have introduced the integer n in the stationary state n /m
of Equation (7-2) by ¥
defining n as the order of the radial polynomial F. This definition of a third quantum
number sets the hydrogen problem apart from the general central-force problem of
Chapter 6. Recall that in the general treatment the third index is introduced as a
radial node quantum number. Hydrogen is a special case, and so n is given a different
meaning and a special name. In the case of hydrogen we refer to n as the principal
quantum number, and we note that n alone determines the energy. Equations (7-12)
and (7-16) define a collection of radial functions
FAp)
-p/n JltlLL
r
R ,=n
e wit hp=-. (7-20)
P a
Note that F[p) bears the labels n and f to display the fact that both of these
quantum numbers control the composition of the polynomial. If we examine R n( {r)
for various n and c", we see that the principal quantum number n does not count
8
Table 7-1 Radial Functions for the One-Electron Atom
n = 1 /= R l0 =
l2a' \ 2
'= 1
*2i = 7T=rP'- p/2
2V6a
« = 3 /= tf 30 = |1- -p
-o +
2
—-p 2
2
\e- p/3
3v/3a
3
\ y 27
4
i c n 32 . p e
SlV^Oa 3
The variable is denoted by p = r/a and each of the functions obeys the normalization condition
coo, r.
f(r(R„,fr
^2 Jl =
2 j„
dr= .
1.
352 The One-Electron Mom
radial nodes. Several of the radial functions are listed in Table 7-1, and graphs of the
functions are shown in Figure 7-1. It is easy to infer from either the table or the figure
that each R ni has
, n — £— \ nodes in the interval (0, oo). Hence, the combination of
integers n — /— 1 plays the role of a radial node quantum number for the one-elec-
tron atom. We emphasize once again that n has a new meaning in this special
problem and that the energy depends only on n. We also observe, from the (/
behavior of the functions in Table 7-1, that each R n( has an Ah-order zero at r = 0,
as required by Equation (6-51).
Figure 7-1
sla* Rn(
7-2 Solutions of the Radial Equation 353
Figure 7-1
Continued.
J L J L
n= 3
e = 1
0.1-
J I L I I
10 15
The entries in the table contain certain multiplicative factors to ensure the
normalization of the wave function ^„^ m We know from Equation (6-36) that the
.
spherical harmonics Yfm are normalized separately over solid angle, and we recall
from Equation (6-52) that the radial functions R nf also satisfy their own normaliza-
tion over r:
\= C{RjrUr (7-21)
354 The One-Electron Atom
Finally, we observe that the radial functions obey the orthogonality property
00
Jr R n( R n( r 2
dr — when n ¥= «',
o
Detail
Equation (7-13) can have only polynomial solutions for F(p) because, otherwise.
the associated radial function R would not remain finite in the limit p —» oo.
Let us accept this statement without proof so that we can determine these
polynomials as our next step. The differential equation for the f=Q case is
the polynomial has order n and satisfies Equation (7-14). The first two deriva-
tives of F are
F' = A(\
2
+ 2a 2 p + 3a 3 p + • • •
+na n p"-*)
and
~
(2a 2 + 6a 3 p + • • • +n(n - l)a n p""
2
)
- 2^(1 + 2a 2 p + • • • +na n p n x
2
+ -{p + a2p
2
+ ••• +a n p") = 0.
P
This string of terms in powers of p can vanish for all values of the variable only
if the assembled coefficients are equal to zero for each power. We examine the
highest power and find
_1
-2\fr)na n + 2a n = from the p" term.
The stipulation in Equation (7-22) rules out the possibility that a n vanishes. The
important result is the determination of tj quoted in Equation (7-16):
]fn = -.
n
The lower powers of p in the equality provide the following further information:
We can solve these equations for the unknown coefficients in ascending order.
7-2 Solutions of the Radial Equation 355
_ =
1
2^-1 1/2
U.rx - 1 - - 1
3 3 «
2( k/n- 1
a,. (7-23)
k{k + 1)
Detail
The general case for <f # is considered next, so that F is now supposed to
satisfy Equation (7-13). We generalize the nth-order (= polynomial in
Equation (7-22), and thereby accommodate the additional independent term in
the differential equation, by adopting an F function of the form
2
+2
+ -[p'+1 + a,+ 2 p' + ••• +a n
p"}
<f(<f+ 1)
2 — .
+1
+ "t+2 p'
+2
+ ^+3 p/+3 + • • • +ay] = o.
,,
Note that the highest and lowest powers are p"" and p'', and that the terms in 1
f~
p cancel because of the choice of leading term in Equation (7-24). Again we
'
as well as
The first of these equalities is the same as in the £= case. This most
remarkable result tells us that Equation (7-16) holds for any value of /. The
remaining equations can be solved as before to yield the coefficients of the
polynomial in ascending order:
] fn(<?+ l) - l l l
V+2 '
f+ 1 n f+ 1
fi{{+ 2) - 1 1 / £+ 2 \( 1
^+3 + 2
2£+ 3 /
2/+ 3 \ n ~j\n £+ I
2{k/n - 1)
+,= (7 " 25)
fl
*
a-(a- +0-^+1)^
This formula generalizes the £= expression in Equation (7-23). We leave the
derivation to Problem 3 at the end of the chapter and illustrate the use of the
formula as follows.
Example
Let us pick quantum numbers n = 2 and £ = 1 and construct the wave function
^2\m with the aid of Equation (7-25). We must begin with the definition
a f+x = 1 to adhere to Equation (7-24), and so we set a 2 = 1 in this application.
Equation (7-25) gives the next coefficient as
a,=
3
—6-2
2(2/2 - 1)
-U) = 0.
F2 \(p) = A P 2 -
% Xm
= —e~ r/2a Y Un (e,<$>)e-' E ^' ,/h with m = 1,0, -1.
a
A2 /-oo
/•OO A"
A'
/ r\- r/a dr= —4! a
5
= 24a 3A 2 ,
a ->o
for the normalizing factor, and we observe that the resulting expression for R 2X
agrees with the entry in Table 7-1.
7-3 Degeneracy
It is remarkable that the main features of the hydrogen atom can be rigorously
understood with so little difficulty. We have been able to solve the Schrodinger
equation exactly and obtain convincing results from a detailed study of the radial
solutions. Let us move past this important mathematical phase of the problem and
show that the physical interpretation is even more interesting.
We have found that the energy levels in the Schrodinger theory agree with the
Bohr formula and vary only with the principal quantum number n. This unique
feature of the one-electron atom is expressed in Equation (7-19). We know from the
general central-force problem that every stationary state of the atom is assigned a set
of three quantum numbers (n/w). It is expected that the energy eigenvalue should be
independent of m on grounds of rotational symmetry, but it is surprising to find that
the energy does not vary with ( for a given value of n. This circumstance implies an
unusual degeneracy among the states, since a choice of n fixes the same energy for
several possible states with different values of /.
We can establish the nature of the degeneracy if we examine the structure of the
radial portion of the wave function
%, m = e~^"^^-Y, m (e,^e-
iE
p
^ withp--.
a
(7-26)
This rewritten version of Equation (7-2) incorporates Equations (7-19) and (7-20) and
358 The One-Electron Atom
introduces E n
as shorthand notation for the Bohr energy £' Bohr
n
. A glance at Equation
(7-24) reminds us that the polynomial Fn/ {p) has its lowest and highest powers in the
+
terms p and p". It is obvious from the construction that n cannot be smaller than
'
n = f+ 1,/+ 2,
A sequence of levels of increasing energy arises with this list of n values for the given
/. The degeneracy pattern is easier to see if the bookkeeping scheme is turned around
and hence a particular
so that a particular energy level, n, is chosen. The / values
with the same energy must be those for which
n > t+ 1,
namely
£= 0,l,2,...,n - 1. (7-27)
The combined array of allowed values of 4 and m embraces a collection of states ^fn f m
of the form expressed in Equation (7-26), all with the same Bohr energy E n We .
dn = n\ (7-28)
an expression for the number of distinct states with the same energy En .
„ = 2 £=0 m = % 00
= R 20 Y00 e- E ?'/>>
four
degenerate StatCS
/= 1 *- 1,0. - 1
% lm = R 2l Ylm e-^l
= * l/h
„ 3 /=o ™ = * 300 = RwYm e- iE '
iE3 /h nine
4-\ m= 1,0,-1 %i m = R 3l Ylm e'
'
\
degenerate states
'=2 E /h\
m = 2,1,0,-1,-2 * 32m = R 32 Y2m e~"> it
7-3 Degeneracy 359
Figure 7-2
Energy levels and degeneracies of the one-electron atom. There are n' distinct states ^tnim with
the same energy E„. Spectroscopic notation is used to label the various shells and orbitals. The
nth shell contains n orbitals, and the n£ orbital consists of 2/+ 1 states.
- n = 4 ——— 4s
l
. 1— 4p - — - Ad —— - Af N shell 16 states
(' =
Is K shell 1 state
spdfgh
01 2345
The single-electron energy levels are called shells, with labels
KLMN
corresponding to the n values
1 234 ••
360 The One-Electron Atom
Example
The n = 2 level of the one-electron atom is known as the L shell. The figure
indicates the existence of degenerate s and p orbitals in this shell, comprising
one 2 s state and three 2p states. The four different normalized wave functions
have the same energy E 2 and appear in Table 7-2 as follows:
where
Y200
= "20*00
= " 1
'
12a
1
2 /477
r2i + l
= "2\Y\ ± = pe -P/2 sin ue
l
3
2\/6a
T2I0
= "21M0 = TP e -P/2 cos
2vW 4w
for the 2p states. Entries from Tables 6-1 and 7-1 have been used to assemble
the (r,d,(j>) dependence of these energy eigenfunctions.
74 Probability Distributions
The Bohr model of the atom is based on the classical notion of planar electron orbits.
We know from the quantum treatment of angular momentum that this picture has
only qualitative validity, and then only in the classical limit of large angular
momentum quantum numbers. The proper way to visualize the atom in any of its
2 2
\%,J dr= [(R n ,)\ dr][\Y, m dQ]
2
\
(7-29)
Figure 7-3
Pn*(r) - r
2
(R m <? (7-30)
This quantity is introduced only for stationary states and has a different structure for
each central-force problem. We interpret P nif
(r)dr in the case of the atom as the
362 The One-Electron Atom
74 Probability Distributions 363
in marked contrast with the plane circular orbit of the simpler Bohr model. We can
2
integrate l^ool over the surface of a sphere of radius r to find the radial probability
density
2 ' /a
Pjr) = V'~
(or we can take R l0 from Tableand apply Equation (7-30) to obtain the same
7-1
answer). This radial distribution shown in Figure 7-4 to have a sharp peak at p = 1,
is
where r = a. Thus, we make contact with the Bohr model to the extent that the most
probable distance of the electron from the nucleus is equal to the radius of the first
Bohr orbit.
The figure also describes the radial probability distributions for the other states
listed in Table 7-1. We note that the existence of nodes in R n/ -
causes the occurrence
of peaks in Pnf and that the distributions with a single peak are those for which
{= n — 1. These states of maximum / correspond to the circular Bohr orbits, since a
single most-probable radial distance is defined in each case.
The average distance between the electron and the nucleus is given by the
expectation value of r. In a stationary state we find
2
(T)-
J**/m
r%, m dr- fr'iR.,) di rrPn ,(r)dr, [7-31)
as in Equation (6-56). General formulas exist for R nf , with arbitrary values of n and
t, and so genera! expressions also exist for (r) and for other expectation values
involving functions of r. We are not concerned with the derivation of these quantities,
so let us just quote the following results:
1 /(•+ 1)
<r> -A* 2 1 + - 1
- (7-32)
2
' (7-33)
an
2 3
(7-34)
a n (2S + 1)'
3
(7-35)
a «Y(/+ 1)(2/+ 1)
Note that the appropriate power of the length scale a appears in each expression. We
can apply Equation (7-32) right away and calculate (r) for n = 1 and /= to find
the average distance (r) = ^a for the ground state. The point (p) = (r)/a is
364 The One-Electron Atom
Figure 74
Radial probability distributions for the radial functions in Figure 7-1. The graphs describe the
dimensionless quantity aPn ^ = i
a p
1
(R n/^) 1 versus the dimensionless variable p = r/a. The
expectation value (p) = (r)/a is shown for each of the states. The distributions satisfy the
normalization condition \^aPn( dp = 1.
0.1 -
a/' ,,,
aP 30
0.1-
<P>
J I*
10 15
„/\
0.1 -
<P>
I T J L
10 15
7-4 Probability Distributions 365
DDBD
Distributions of probability in various states of the one-electron atom
3d m • 3d * > 4f
QEJ©D 4f
m ° 2S
m m id
3p o
4(j 4d '
"
sd m ' sd
indicated for this case, and for each of the other radial probability distributions, in
Figure 7-4. Equation (7-33) can also be used immediately to determine the expecta-
tion value of the Coulomb potential energy:
Ze 2 / 1 \ Ze
<F>= -
47T£ \ r
J 477e a«"
Example
Let us confirm our observations about the ground state. We can obtain the
probability density by computing
Pw = 47rr
2
I*
1MOOl
2
I
= —4 3
r
2
e~
2 r/a
.
a
366 The One-Electron Atom
dP 10 4 / 2
2r r\\ ~ 2r/a
dr a'
a\ 4 3
(r) = j rP l0 dr=-f rV 2
'"«fr = -3! -
2
inagreement with the result from Equation (7-32) for this state. Note that we
have again found use for the integral formula in Equation (7-8).
Example
2 2 2 2
Ze' Ze jti e m r
\
(V) 2 2 2
4Tre n a 47re n m r
4 nt n h
f
477£ I h~n
ju Z
-- —
Zli
,2 \2
Z2 { e
2
V M
<£>= TE
= 2 2 2
m. n 47TE
°/
2h 477£ I h n
'
The two results for (V) and (E) satisfy the equality
(V) = 2(E),
W W .<K>-- W
- -
Z"
T |— j^.
I JLI
These relations among the average energies (V), (E), and (K) hold also in
classical mechanics, where the result is a special case of the virial theorem.
The states of the atom reveal their existence by the radiation emitted in transitions
between pairs of energy levels. These radiative processes cause the instability of the
excited states and generate the emission spectrum of the atom. Figure 7-5 reminds us
7-5 Electric Dipole Selection Rules 367
Figure 7-5
-0.85 - n AT shell
-1.51 - n M shell
Paschen
series
(infrared)
.,,
-3.40 L shell
Balmer
series
(visible)
E n (e\J)
"-
1360 n= 1 K shell
Lyman
series
(ultraviolet)
of the organization scheme for the familiar spectral lines of hydrogen, in which each
series of lines corresponds to a complete set of transitions downward to a particular
energy shell. We have learned in Section 5-8 how quantum mechanics predicts the
and determines the intensities for the various
probabilities for the different transitions
emitted wavelengths. Selection rules emerge from these predictions to tell us that the
transition mechanism allows only certain changes in the quantum numbers between
the initial and final states of the atom.
We convert our one-dimensional treatment of transitions in Section 5-8 to the case
of the one-electron atom by introducing transition states for the three-dimensional
system. Let us construct such a state by superposing two stationary states of the atom:
ncm n c m
= C*n<m<-
iE " t/h
+ f'W^ With E„ > E,. (7-36)
This wave function represents a situation in which the atom is found in both
2
stationary states at once, with probabilities |c| and \c'\ 2 and thus describes an atom
,
by the interaction atom with the electromagnetic radiation field. The perturb-
of the
ing effect of the radiation is time dependent, and so the probabilities \c\ 2 and \c'\ 2
vary with time to indicate the course of the transition. The oscillatory time depen-
dence of the probability density is the main feature of this construction. We know
368 The One-Electron Atom
from Section 5-8 that |^|" oscillates with the Bohr frequency
n n
(7-37]
and we conclude that the expectation value of the electric dipole moment of the atom
oscillates with the same frequency. Our semiclassical picture of the interaction of the
atom with the electromagnetic field identifies this system' of oscillating charge as a
source of electromagnetic waves emitted in a characteristic electric dipole pattern.
Hence, the construction describes the emission of electric dipole radiation at the Bohr
frequency v.
We express the electric dipole moment of the atom as — ?(r), where r denotes the
coordinate vector of the electron relative to the nucleus. (Actually, the electric dipole
expression is given exactly by — ex only in the case of the hydrogen atom or in the
limit of infinite nuclear mass. This subtle point is addressed in Problem 1 1 at the end
of the chapter.) We then use Equation (7-36), and set co = 2ttv, to obtain
-*(r) = -ei-ty**^ di
in analogy with Equation (5-69). The coefficient of the time dependence e~'°" may be
taken to represent the amplitude of the oscillating dipole. The squared modulus of this
complex quantity determines the probability for the transition between the initial and
final atomic states, labeled respectively by the quantum numbers (nfm) and (nY'm').
The amplitude coefficient contains the integral factor
as its principal contributor. This important quantity is called the dipole transition
amplitude for the transition (n£m) -» (nY'm').
We observe that this integral carries all the essential quantum mechanical informa-
tion about the two atomic states involved in the electric dipole transition. The integral
defines a vector quantity in which r has the Cartesian components
*„, B (r,M)=* B
,(r)y, m (0,*)
JY&wxB<x»lYim Xk
2
f™R n .,.rR n ,r dr
JYfm .smO sin+Y, m dQ (7-40)
[YAs*»$Y,m JQ.
7-5 Electric Dipole Selection Rules 369
Note that only the angular part of the integration differs from one component to
another. The selection rules for the radiative process follow upon inspection of these
integrations, since the integrals over and <f>
are nonvanishing only if the quantum
numbers (^m) and (£'m') are related in a special way.
The <f>
integration presents the more straightforward part of this analysis because of
the elementary $ dependence of the spherical harmonics:
V' m V m *</4>.
_|_
e e e e
f~"e -»»'+ m4, %-'""'
e' d<l>, [ :
e'^dQ, and /
Jq 2 •'o 2* •'o
/"
•At
2
V '
, m >, ±,
Vm *</<f>
= unless m - m' + 1 = 0,
and
2
( " €-""'*€""+ d$ = unless m - m' = 0.
The first result implies a restriction on m and m for the x and y components of the
dipole amplitude, while the second result applies only to the z component. We
summarize these restrictions by writing
Aw = 0or±l. (7-41)
This selection rule expresses the allowed change m — m for the azimuthal quantum
number of the atom in a transition accompanied by the emission of electric dipole
radiation.
The 8 integration is not so easy to evaluate in general terms. The three components
of the transition amplitude contain the following two integrals in the polar angle:
We may assume that the quantum numbers m and rri are already constrained as in
Equation (7-41) when we consider these expressions. Evaluation of the integrals for
arbitrary quantum numbers involves the recursion relations and orthogonality proper-
ties of the Legendre functions /^ m (cos 6 ). We have looked briefly at these functions in
Sections 6-4 and 6-6, but we have not developed the mathematics far enough to
perform the desired integrations. The general result is easy enough to state, however:
We express this selection rule for the orbital quantum number by writing
and we illustrate the result by evaluating certain specific integrals in the example
below.
370 The One-Electron Atom
Figure 7-6
Ad 4/
E3 -3d
- n= 2
- n= 1
It should be emphasized that radiative transitions are not strictly forbidden if the
quantum numbers of the initial and final states fail to obey Equations (7-41) and
(7-42). Such transitions are actually allowed to occur, but not with the emission of
radiation characteristic of an oscillating electric dipole. These other kinds of radiative
processes are highly suppressed whenever the wavelength of the radiation is very large
compared to the size of the radiating system.
The hydrogen transitions in Figure 7-5 can be displayed in more detail if the
degeneracy with respect to ( is included and if the A/ selection rule is taken into
account. Figure 7-6 provides such a display in which all the wavelengths of the
Lyman series are seen to arise from transitions of the type np —> Is. Figure 7-7 goes on
to show that all the wavelengths of the Balmer series are due to transitions connecting
Figure 7-7
Aw selection rule can also be confirmed if the experiment is able to split the
degeneracy of the states with different values of m for the same n and /. The means
to these ends are found Chapter 8.in
Let us return to the transition amplitude and inspect the parity properties of the
integral j4/ *y m TXPnt'm ^ T We observe that a nonvanishing result is obtained only from
-
integrands that are even under the inversion r —» — r. (Odd integrands contribute with
opposite signs for volume elements in opposite directions. These contributions to the
integral cancel pairwise in the integration over all space.) We then recall from Section
6-6 that each factor in the integrand has a well-defined behavior under the parity
operation. The vector r is odd by definition, while the eigenfunctions i/v^'m' ar*d ^ n (m
have parities given by (— Vf and (— 1) , respectively. Therefore, inversion produces
an overall-even integrand only if the two atomic states have opposite parity. We can
state this conclusion as a selection rule:
It follows that the quantum numbers c" and (' must differ by an odd integer, in
agreement with the more restrictive result in Equation (7-42).
It is instructive to put these mathematical selection rules in terms of conservation laws
involving parity and angular momentum. Let us describe the radiation process as
(nc*m) -* (n'c°'m') + y ,
in which y denotes an electric dipole photon. We then assign an odd parity to y and
use Equation (7-42) to find that the overall parity of the atom-radiation system is
conserved according to the multiplicative law
(-l)'=(-lf(-l).
The overall angular momentum of the system must also be conserved. In fact, the
rules in Equations (7-41) and (7-42) tell us that the electric dipole photon carries away
a single h unit of angular momentum in the radiative process. We express the vector
conservation law for the angular momentum of the system in the schematic form
{= t + 1,
where the three vectors have lengths y7(<?+ l) V V"(/' + l) and \/2 with ( and , , ,
('
constrained to differ by one unit. This interpretation of the selection rules by means of
conservation laws is used often in atomic and nuclear physics.
Example
The radial integral is the same for each component in the calculation. We
consult Table 7-1 and carry out the following integration over r, again with the
aid of Equation (7-8):
2 1
/ R l0 R 2l r 3 dr = -=«"" -=p,-P/ 2a y dp
f
a ,oo a / 2\ 5
128 ^
The Aw selection rule implies that the angular integration of the z component is
nonvanishing only for the m = case. We use Table 6-1 and Equation (6-36) to
perform the relevant angular integration:
and blends the result with the similar (j> dependence of the spherical harmonics.
Thus, the angular integrations for the (x ± iy) components assume the general
form
JY^smOe^+Y^dSl.
We again refer to the Aw selection rule and observe that a nonzero result follows
for the case at hand only if we take m= — 1 for the (x + iy) component and
m = 1 for the (x — iy) component. The angular integrals in the two instances
are evaluated with the aid of Table 6-1 and Equation (6-36):
and
f
Y»™ e <-* Y»*~f^[-{r '
j for * - iy.
Problems 373
We combine the radial and angular portions of the three integrations as follows:
IY 256
for the ( x + iy ) component if m = — 1
V 3 243
128 IT 256
v/oV
~ a for the {x — iy) component if m= 1
243 V 3 ~ 243
1 128 ^
for the 2 component if m= 0.
, fi 243
Vanishing results are found if the three components of the dipole transition
amplitude are evaluated for any of the other possible values of the quantum
number m.
Problems
r/2a
R(r)=A\\ - —I
is a solution of the radial differential equation for the one-electron atom in the f = case.
3. For any /, the radial solutions for the one-electron atom have the form
, HP) r
R = e
p/ " with p = - ,
P a
F{p) = A £ aK p
k
with fl
%
, ,
= 1.
* = /+!
k/n - 1
4. Use the recursion relation in Problem 3 to construct the radial polynomial in the case
where n = 2 and { = 0.
5. Use the recursion relation in Problem 3 and apply the normalization condition to
construct the wave function ^ 31m .
6. List the degenerate orbital states that belong to the M shell, giving the explicit (r, 6, <J>)
expectation values (r) and (F) by explicit integration, and compare the results with the
general formulas given in the text.
8. Repeat the calculations of Problem 7 for a state with quantum numbers n = 3 and t= 1.
9. Let the one-electron atom be in its ground state, and calculate the probability of finding
10. Consider a one-electron atom in a stationary state, and derive a formula for the rms speed
of the electron relative to the nucleus. Compare the answer with the result obtained in the
Bohr model.
11. Derive a formula for the electric dipole moment of the one-electron atom shown in the
figure, taking the origin to be at the center of mass. The result should be proportional to
— er and should reduce to — er if Z= 1 or if M— » oo.
rlii e
"n
( * ' '*e - 'V ra
* d<f> = unless m - m = ± 1
Jn
'0
and
lv
( e ""'V"" f>
d<j> = unless m' - m = 0.
These calculations appear in the demonstration of the selection rules for electric dipole
transitions.
13. Show by direct calculation that all three components of the dipole transition amplitude
vanish for the forbidden transition (« = 2, /= 0) — » (n' = 1, £' = 0).
14. Evaluate the z component of the dipole transition amplitude for the Balmer transition
3p -> 2s.
15. Evaluate the z component of the dipole transition amplitude for the Balmer transition
3d-> 2p.
16. Stationary states of the one-electron atom can exist in the form of superpositions of
degenerate states. An example is given by the wave function
,£2 ' / *-
*=-^(^200 + V'210)'"
Comment on the parity properties of ^, and compute the components of the electric
incomplete until such behavior is taken into account. One of our goals in this chapter
is to introduce relativistic corrections as secondary contributions to the leading
approximation. We are interested in these corrections because we can detect their
influence in experiments of sufficient accuracy.
The Schrodinger theory has also been restricted to a treatment of the electron in
which the spatial coordinates are the only variables. Our main goal in this chapter is
to show that the behavior of the electron cannot be fully understood unless the particle
is endowed with an additional intrinsic variable known as electron spin. This new
Lorentz interpreted this effect in terms of his classical electron theory and predicted
that the magnetic field should split each spectral line into a fixed number of separate
components with different frequencies and definite polarizations. Line splitting was
later observed when the sodium lines were examined with improved resolution;
however, the number of split components did not agree with Lorentz's predictions.
This puzzling result became known as the anomalous Zeeman effect because the
375
376 Spin and Magnetic Interactions
Normal Lorentz triplet in zinc and anomalous Zeeman patterns in sodium and in zinc
No field
111
Normnl Triplet
HI 111
Anomalous Patterns
Weak field
. •
eld
inn mi
Anomalous Patterns
Weo* "f'ela
observations differed inexplicably from the normal effect associated with the classical
theory. Normal splitting could be detected only for certain types of lines in certain
elements. The early quantum ideas were also applied to this phenomenon, but the
application shed light only on the normal effect. Thus, the observed anomaly in the
Zeeman effect remained an unsolved problem through the period of the old quantum
theory.
Improvements in spectroscopic resolving power also revealed the splitting of lines
into multiplets of closely spaced components without the influence of an applied
magnetic field. This behavioratoms was given the name fine structure. It was
in free
discovered that the split multiplets were the ones that developed an anomalous
Zeeman pattern when an external magnetic field was applied. Hence, a link was
believed to exist between the phenomenon of fine structure and the anomalous
Zeeman effect, indicating the possibility of a common solution for the two problems. It
was also found that the spectra of the elements in a given column of the periodic table
displayed multiplets with common
features. This connection between the structure of
multiplets and the periodicwas the crucial piece of evidence that inspired W.
table
Pauli to make the first contribution toward the eventual discovery of spin. Pauli was
able to explain the occurrence of multiplets by letting the electron have a nonclassical
two-valued property, which appeared as a fourth quantum number in the description of
every atomic electron. Others later identified this peculiar quantized variable as the
electron spin. All these remarkable developments preceded the coming of the
Schrodinger equation and were carried over without difficulty into the new formula-
tion of quantum mechanics.
We confine our discussion in this chapter to the role of electron spin in the
one-electron atom, and we extend the treatment to atoms with many electrons in
Chapter 9.
Faraday believed that magnetism should influence the light emitted by atoms, but he
could not develop apparatus sensitive enough to demonstrate an observable effect. His
expectations were not realized until Zeeman's experiment was performed on the
8-1 Atoms and Light in a Magnetic Field 377
Figure 8-1
Classical Lorentz-triplet predictions for an atomic source in a magnetic field. The source emits
a spectral line of frequency V and the application of the field splits the line into three
,
frequencies v - 8v, v , and + Sv, where the shift 8i> depends on the field strength B. Light
v
observed along the direction of B has only the two shifted components with circular polarizations
in opposite directions. Note that the up-shifted frequency v + 8i> corresponds to circular
polarization in the same sense as the current in the electromagnet. Light observed transverse to
B has the two shifted components with linear polarizations perpendicular to B, and the one
unshifted component with linear polarization parallel to B.
Electromagnet _
H\ £7\Circular polarization
the emitted light to the oscillation of charge within the atom and showed that the
oscillation would be modified by the application of a magnetic field. His theory
selected a given spectral line, with frequency P in the absence of the applied field,
and predicted a splitting of the line into three different frequencies owing to the
interaction between the field and the atom. He determined a frequency shift 8v such
that this triplet would occur as v — 6V, vQ and v + 8i>. The members
of frequencies ,
of the triplet were expected to have certain polarization properties depending on the
direction of observation of the light relative to the direction of the applied field, as
described in Figure 8-1.
The classical theory of the Zeeman effect is interesting because the analysis
identifies the basic parameters of the interacting system. A similar parameterization of
the interaction between the atom and the applied field appears in the quantum
version of the problem. Lorentz's model associates the effect with the oscillation of the
electrons in the atom and treats such an oscillation at frequency v as a source
emitting classical radiation of the same frequency.
The application of a constant magnetic field B fixes a direction in space to which
the electron motion can be referred. Let us look first at the oscillation of a single
electron in a plane perpendicular to this direction. Any such linear oscillation at
frequency v (with no field) can be resolved into clockwise and counterclockwise
circular motions in phase, each with angular velocity 2ttp , as shown in Figure 8-2.
We can attribute the oscillation of the electron to a simple-harmonic binding force of
the form — b and define the force constant by means of the frequency formula
277 1/ m.
.
Figure 8-2
The magnetic field subjects the electron to the additional Lorentz force — ^v X B,
which acts in opposite radial directions for the two counter-rotating motions of the
oscillating charge. This effect produces two different values for the new angular
velocity 2tTv and for the new orbit speed
2mrv
m.v'
= A^2„2
4ir v rm t ,
and we assume a small Lorentz force so that we can take the radius of the orbit to be
unaffected by the field as a first approximation. Figure 8-3 describes the centripetal
Figure 8-3
kr
2nvreB
Clockwise
.' rrirrH
kr
©,
Counterclockwise
81 Atoms and Light in a Magnetic Field 379
force as
\m 2 v 2 rm = 2mvreB +
e
kr
\ rn 2 v 2 rm e = —2'rrvreB + kr
for counterclockwise motion. We employ Equation (8-1) to eliminate k and rewrite the
results as quadratic equations for the unknown frequency v.
eB
v — Vq = (clockwise)
and
eB
v
2
+ v — Vq = (counterclockwise).
277 772,
v = vQ + —— eB
for the clockwise case
,
(8- 2a)
x
477171.
and
eB
v = vQ for the counterclockwise case. (8-2b)
The two electron motions are therefore supposed to generate classical radiation at
these two shifted frequencies. Note that the frequency shift has the form
eB
8p = (8-3)
477W f
in each case.
If we view the source along the direction of B, as suggested in Figure 8-3, we expect
the observed light to have circular polarization in the clockwise sense for v = p + 8i>,
and in the counterclockwise sense for v = v — hv. This conclusion agrees with
Zeeman's observations of the light from his broadened spectral line. Note that the
result also confirms the fact that the sign of the oscillating charge is We have
negative.
viewed transverse to the direction of the applied field. All these conclusions from
Lorentz's theory are illustrated in Figure 8-1.
We should note the presence of the factor e/m e in Equation (8-3). This aspect of
the analysis was appreciated at the time of Thomson's cathode-ray experiments,
because it was possible to measure the frequency shift in a magnetic field of known
380 Spin and Magnetic Interactions
strength and thereby determine the charge-to-mass ratio for the radiating charge in
the atom. The resulting determination agreed with the value obtained for Thomson's
cathode rays and thus supported the idea of the electron as a universal constituent of
the atom.
Example
The frequency shift 8v is quite small compared to the line frequency v , even
for rather large values of the magnetic field. We illustrate by choosing B = 1 T
in Equation (8-3):
T m2•
V=
C C V-s J-s
—
kg
T=
kg m-
=-
kg •
m 2
The quantum version of the Zeeman effect involves familiar observable quantities and
makes no use of the oscillatory motion described in the classical model. Quantum
mechanics teaches us to recognize a shift in the frequency of a spectral line as evidence
for a change in the transition energy of the emitted photon. This change in turn
implies a shift in the quantized energy levels of the atom owing to the application of
the magnetic field. Hence, the observation of Zeeman splittings in spectral lines
translates directly into the deduction of Zeeman energy splittings among the states of
the atom.
We are concerned with the additional energy of the atom associated with the
presence of the applied magnetic field. We treat the atom as a system of moving
charges and identify a magnetic dipole moment |i, so that we can introduce a magnetic
interaction energy
F„=-|i-B (8-4)
as an added contribution to the potential energy of the system. The magnetic moment
vector parametrizes the magnetic structure of the atom, and the applied magnetic
8-2 Orbital Magnetic Moments 381
Figure 8-4
Angular
velocity 2ttv
field acts as an external probe of this structure. We take B to be constant in space and
time and observe from Equation (8-4) that the magnetic configurational energy is
minimized when \i is aligned along B. The dynamical behavior of \i is determined by
angular momentum considerations, since the magnetic moment of the atom is directly
related to the angular momentum. This section is devoted to the relation between
these quantities for the case of the one-electron atom. We are dealing with small
energy shifts, and so we can ignore the small correction for nuclear motion and simply
replace the reduced mass by the electron mass throughout. (Thus, the symbol ju. for
reduced mass does not appear in this chapter and cannot be confused with the new
symbol \l for magnetic moment.)
We consider a circular Bohr orbit for an electron around a fixed nucleus, and we
take the angular velocity of the electron to be 2 77 p. The current in the electron orbit
2
has magnitude ev, and the orbit has area 7rr , so that the product of these quantities
gives the magnitude of the magnetic moment vector
The direction of |X is shown along with the various parameters of the orbit in Figure
8-4. We recall that the magnetic field of such a current loop is similar to that of a bar
magnet, at distances far from the respective dipoles. The electron orbit also represents
mechanical motion with angular momentum of magnitude
L = m e
vr = lirr m e
v.
2rn
The figure shows that the vectors |i and L have opposite directions for an orbiting
particle of negative charge. We take this observation into account and relate to L by
|i,
L (8-5)
2m.
The final result holds for elliptical as well as circular electron orbits.
382 Spin and Magnetic Interactions
ft.
p S
2Af
where the details of the rotating distribution determine the value of the g-factor
denoted by the symbol g. We are using Q= —e for an electron orbit in Equation
(8-5), and so we identify the corresponding orbital g-factor as
g= I-
where
eh
Ma-TT-
lm
(8-7)
t
This combination of physical constants is known as the Bohr magneton. The dimensions
of ju
fl
are the same as those of \i,
b serves as a natural unit for the magnetic
so that jx
moment of the atom. We express the numerical value of the Bohr magneton in two
ways,
HB = 9.274 X 10" 24 A •
m = 2
5.788 X 1(T 9 eV/G,
and observe that the second figure has the especially useful units of energy per
magnetic field strength.
We can use the relation between fi and L to determine the classical behavior of |i.
—
dh
dt
= a
P X B.
—
dL
=
dt 2m
e
Lx B
e
= to X L, (8-8)
co = —
2m,
B. (8-9)
8-2 Orbital Magnetic Moments 383
Figure 8-5
The dynamics described by Equation (8-8) is exactly the same as that for a spinning
top in precessional motion about a fixed axis defined by the direction of to. Figure 8-5
summarizes the physical properties of the oppositely directed vectors p. and L. We
observe that the magnitude of L is constant:
dL
-L l
= -L • L = 2L • — - = 0,
dt dt dt
using Equation (8-8) in the last step. We then use conservation of the energy |x • B to
argue that the polar angle 6 between L and to also remains constant. It follows that
the angular momentum varies only with respect to the azimuthal orientation of L
about the to axis. We consider a time interval dt and identify an increment in the
azimuthal angle
dL
L sine/
according to the figure. We can then express the precessional angular velocity of L
and |x by writing
1 dL u>L sin
= to,
~dt L sin 6 dt L sm8
again using Equation (8-8). This behavior of the magnetic moment vector is an
example of Larmor precession, a phenomenon in mechanics treated by a general
theorem due to J. Larmor. Equation (8-9) gives the frequency of precession as
U eB
2tt ^irm.
an expression identical to the formula for the frequency shift bv in Equation (8-3).
Thus, the classical dynamics magnetic moment presents a picture in which the
of the
applied field affects the orbital motion in the atom by superimposing a precession at
.
the Larmor frequency about the direction of B. A schematic rendition of this effect is
shown in the figure.
The main result of this section is the relation between fx and L in Equations (8-6)
and (8-7). We are aware that (X and L have been handled classically, and we should
question whether the resulting formula can be carried over to quantum mechanics. A
proper quantum approach would show how electrodynamics is incorporated in the
Schrodinger theory and would prove the validity of the relation between fx and L as
quantum observables. We leave the demonstration of these steps to a more thorough
treatment of quantum mechanics. It should also be noted that precessional motion is a
classical phenomenon involving the behavior of a well-defined L vector. We must
reinterpret the word if we wish to speak of precession as a concept in quantum physics.
Example
Let us prove that Equation (8-5) holds for any elliptical Bohr-Sommerfeld orbit.
We consider an element of angle dO swept out by the electron in a time interval
dt and identify the area of the resulting sector as
2
r
dA = — dO.
2
dq dO
r = dA
da — = dq.
dt 2 dt '
2
The force on the electron is a central force, and so the quantity r dd/dt is a
constant proportional to the angular momentum:
L = mr 2 '
—= dt
constant
We use this observation to obtain the formula for the magnetic moment:
!' f
J,„w,,
rbit
du = —
Am
2m.
/.
J
[ dq =
2m
/..
Example
Let us also verify the value quoted for the Bohr magneton:
_,9 34
eh (1.602 X 10 C)( 1.055 X 10" J •
s)
^B ~ 31
2m e '
2(9.110 X 10" kg)
= 9.274 X 10' 24 A •
m 2
24
9.274 X 10 J/T -
- .
ir . . , .
=1 7rt<"t
3 /OO
.
v
A in
1U
9
eV/G,
19 4
(1.602 X 10~ J/eV)(l0 G/T)
These quantization properties are passed along to the magnetic moment p., since |x is
directly related to L. The magnetic moment determines the behavior of an atom in an
applied magnetic field via the p. B interaction, and so the quantization of
• accounts |jl
for the Zeeman splitting in the energy levels of the atom. Thus, the Zeeman effect
provides evidence for the quantization of L as a result of the quantization of |i.
Let us suppose that the one-electron atom is in the stationary state ^fn f m in the
2
absence of an applied field. The quantity L has the value h c*(i* + 1), and so the
magnetic moment of the atom has magnitude
gf-B
(L z ) = mh
These remarks suggest that we employ the interaction of the magnetic moment to gain
access to the angular momentum properties of the state through the dependence of
(ju..) on the quantum number m.
The applied B field is a constant vector whose fixed orientation in space offers a
natural direction to choose as the axis of quantization. We align the z axis along B for
the definition of L z and we specify the state ^nfm with azimuthal quantum number
,
m according to this choice. The interaction energy in Equation (8-4) then assumes the
simpler form
VM =-ii z B, (8-11)
Figure 8-6
Normal Zeeman splittings in the energy levels of the one-electron atom.The magnetic field
breaks the degeneracy of the states with respect to the quantum number m. States ^„^ m with
adjacent values of m are spaced in energy by an amount SE U = gfi B B, for any choice of n and
/. The scale of the Zeeman splitting is grossly exaggerated in the diagram.
(=2
e=i - 2
1 1
- n= 3 m =
<L
f= 1
-*
i
n = 2 m= m = -r-
6EM
-1 *
f =0
n = 1 m =
in OT7 n<£orbital system. The sign of the energy shift (VAf ) is the same as the sign of
m, and only the m = states are unaffected by the field. We let each of the split levels
shown in Figure 8-6 refer to a distinct precessional state of the atom, whose energy
maintains the fixed value
E + ms Bm
n
in the presence of the constant magnetic field. The azimuthal quantum number m is
from the atom. We determine the shift by identifying the allowed radiative transitions
according to the selection rules discussed in Section 7-5. The rule A/= + 1 has
already been used to pick out the electric dipole transitions for the Lyman and Balmer
series in Figures 7-6 and 7-7. The other rule requires
Aw = or + 1
8-3 The Hormal Zeeman Effect 387
Figure 8-7
•= 1
1
(= o e=
E: n = 2
i
t <5£\
m =
-1
T
n=l -1-1-1 m =
e = o f=
and gives more pertinent information about the Zeeman effect, as illustrated in
Figures 8-7 and 8-8. We observe that all the indicated transitions involve only three
distinct emitted-photon energies,
where A£ denotes the usual Bohr transition energy without the applied field. A triplet
of spectral lines is the predicted result, implying a shift in frequency to either side of
the usual line by an amount
8EM g^B B eB
Sf = '
2'ITh 4:77171,
in agreement with the classical formula in Equation (8-3). The illustrations refer
specifically to the Lyman a line (where n = 2 -* n = 1) and the Balmer a line (where
Figure 8-8
=
t 2
2
(= i
1
t = o t = i e = 2 f
-n=3
SE,
, n m = m =
1 1 -1
-2
1 I 1
-n=2 • m = -m = m =
( = o e = i e = o -1 -1
t = 1 ( = 1
388 Spin and Magnetic Interactions
n = 3 —> n = 2). It is clear, however, that the triplet prediction is quite general, since
the normal Zeeman splitting 8EM is common to all nt° orbitals. The figures tell us that
the photon energies AE — 8EM and AE + 8EM are associated with the Am = + 1
and Am = — 1 transitions, and that the unshifted photon energy AE is due to the
Am = transitions. It follows that the Am = + 1 linesw correspond to the shifted
frequencies in Figure 8-1. We know that these lines exhibit circular polarization when
viewed along the direction of the field. We also know fromSection 7-5 that the single
unit in Ac" and Am represents a single h unit of angular momentum carried away by
the radiated electric dipole photon.
The predictions of the normal Zeeman effect are not borne out in observations of
the spectrum of hydrogen, as anomalous splittings are seen instead of the expected
triplets. The classical results have not been altered by taking a quantum approach to
the problem. We have included the flawed classical treatment as background in order
to point out the fact that the preferred quantum solution also fails to resolve the
anomaly. We are forced to concede that our picture of the magnetic structure of the
atom is incomplete.
Example
The Balmer a processes in Figure 8-8 have transition energy AE = 1.89 eV and
emitted wavelength
he
X = = 656.1 nm
AE
with no applied field, as in the calculations at the end of Section 3-6. Let us
consider the abundant collection of normal Zeeman transitions for the illustrated
case
The Aw selection rule allows nine different transitions, as shown in the figure,
AE - 8EM AE AE + 8EM
, , and .
h h h
he
X + 8\
AE- SEM
'
and we use the condition AE s> 8EM to make the following approximation:
A + 5A= --
AE\
he I
1 + —
8EM
AE
\=\
-
\
I
1 +
8EM
—
AE
^ \
- 8X = \-
8EM
AE
I I
For B = 1 T we find
and
5.788 X 10- -5
8X = (656.1 nm ;
— n noni nm,
u.uzui m-n
1.89
a very small departure from the zero-field value for the Balmer (V line.
Example
Let us prove that the stationary-state wave function of the atom is unaffected by
the application of the constant magnetic field, while the energy of the state
undergoes the normal Zeeman energy shift. Without the field, the stationary -state
—V
2/i
2
+ V(r) 4>nfm = E nt n / m ,
additional term:
gV B B gV-B B h
-fi.B = +
h
,
L, -»
h i
:
—d
d<f>
1
h Q&rB d
2ju i d<j>
1 3
^ gV B B
— V
2/i
,
' + V ( r ) +
i
3
^ntm = Er&n(m + £M B Bm 4> n(m-
d<t>
We may therefore take \p = ^ n/m and claim the new energy eigenvalue to be
E = En + gV-B Bm ,
It was known by 1920 that spectral lines were split into multiplets without the
influence of an applied magnetic field. The phenomenon was originally attributed to a
force within the atom, where the outer electrons interacted with a proposed " magnetic
core" consisting of the inner electrons and the nucleus. This interacting system of inner
and outer magnetic moments was supposed to explain the structure of multiplets in
the absence of a field, as well as the anomaly in the Zeeman effect in the presence of a
field.These ideas were tested in 1922 by O. Stern and W. Gerlach. Their classic
experiment employed a magnetic field of special design to study the behavior of a
beam of atoms. Their observations demonstrated the quantization of the magnetic
moment and also appeared to confirm the magnetic-core hypothesis. Soon, however, it
became necessary to reject this idea and reinterpret the outcome of the experiment as
unequivocal evidence for the spin of the electron.
The Stern-Gerlach experiment examines the dynamics of a magnetic dipole in a
nonuniform magnetic field. Let us refer to Figure 8-9a and recall that a uniform field
produces a torque but exerts no net force on such a dipole. Figure 8-9b goes on to
show that the forces on the two poles of a magnet yield a net force, as well as a torque,
if the field is not uniform. Let us assume that the nonuniform field varies appreciably
in the z direction component of the net force according to the
and consider only the z
details shown in the figure. We express the z component of the field at the locations of
the N and S poles of the magnet by the expansions
BM = B, + z —
8BZ
oz
and B Z
(S) = Bz0 - z —
dBz
oz
to first order in the indicated distance z . The coefficients Bz0 and dBJdz denote the
field strength and the field gradient, evaluated at the center of the dipole. We let the
magnet have hypothetical pole strengths +g , and we write the net force as
F =
z g BM - go Bz (S)
= 2 go z —
3BZ
oz
= M, —
dBz
(8-14)
Note that the z component of the magnetic dipole moment appears in the last step.
The force causes a beam of magnets entering the field along the y axis to experience a
vertical deflection, up or down depending on the sign of ju.., as illustrated in Figure
8-10. The classical quantity \i
z
is continuous in value because of the arbitrariness in
the observable orientation of the dipole. The beam therefore produces a continuously
distributed image beyond the region of the field.
in a detector
Very different results are found in such an experiment if the beam of magnets is
replaced by a beam of atoms. Each atom has a certain probability to be in some
quantized state of the angular momentum component L,. The corresponding mag-
netic moment ji z is then observed with the same probability to have one of several
discrete values. If we consider a beam of hydrogen atoms as an example, we expect to
(ii z)
= ~iii B m with w =-/,..., /
8-4 The Stern- Gerlach Experiment 391
Figure 8-9
Forces on the poles of a magnet in a uniform field and in a nonuniform field. The field gradient
exerts a net force on the magnet in the second case.
(a) (b)
for atoms in definite Lz states. The force F, becomes a discrete-valued quantity in this
phenomenon space quantization because we imagine the effect to occur via discrete
trajectories in space. The result is actually observed as a discrete distribution of the
beam image in a detector beyond the region of the field, as sketched in Figure 8-11.
Hydrogen atoms with orbital quantum number c* are expected to display a (2f+ 1)-
fold splitting of the beam, and no deflection is supposed to be seen for a beam of
ground-state t"= atoms. Complex atoms are also expected to undergo an odd
Figure 8-10
Beam of
magnets
1
Figure 8-1
Space quantization for a beam of hydrogen atoms. An odd number of deflections is expected for
a purely orbital magnetic moment, but a twofold splitting of the beam is observed in a
Stern-Gerlach experiment.
number of discrete deflections if the magnetic moments are due entirely to orbital
electron motion.
The employed a beam of neutral atoms ob-
original Stern-Gerlach experiment
tained by evaporating silver in a heated oven.The silver atoms were passed through a
strong transverse field gradient and were deposited on a glass plate where their
deflections could be measured. This image of the beam was found to be discrete rather
than continuous, in striking agreement with the notion of space quantization. In fact,
the beam was observed to form a two-fold image, the result predicted on the basis of
the prevailing magnetic-core theory of the silver atom. was The core interpretation
discredited, however, when a subsequent experiment
same design was per- of the
formed with a beam of hydrogen atoms. It was expected that hydrogen should show
no deflection for atoms in the ground state and should give an odd number of beam
splittings for excited atoms if the magnetic moment of the atom were attributed solely
to the orbital motion of the electron. Instead, the beam of hydrogen atoms again
displayed a two-fold splitting, so that expectations and observations conflicted in the
manner described in Figure 8-11. The " magnetic core" could not account for this
result, since the core consisted of the nucleus alone and the nuclear magnetic moment
was too small to explain the observed effect. (It was recognized that the natural unit
for nuclear moments should be the nuclear magneton eh/2M instead of the Bohr
magneton eh/2m f The substitution of the proton mass
. for the electron mass m e M
made the scale smaller for nuclear moments than for atomic moments by three orders
of magnitude.)
The hydrogen experiment demonstrated that (/a.) had a nonvanishing two-valued
quality in the ground state of the atom.The two-fold splitting indicated a quantiza-
tion of the magnetic moment in which two, and only two, quantized states contributed
probabilities to the discrete deflections of the beam. It was apparent that this aspect of
the magnetic moment of the atom was attributable to the electron but was indepen-
dent of the orbital motion of the electron. Thus, a new intrinsic property of the
electron seemed to be in evidence as a quantized two-valued variable. The bifurcation
of the beam of silver atoms in the original Stern-Gerlach experiment also appeared to
be due to this same two-valued magnetic moment for the outermost electron in the
silver atom.
8-4 The Stern - Gerlach Experiment 393
Stern-Gerlach experiment, the fine structure of spectral lines, and the anomalous
Zeeman effect.
Example
Equation (8-4) offers an alternative route to the expression for the force on a
magnetic dipole. We write out the interaction energy as
VM = -P x Bx -ti y By -n 2 B z
oz
find
dB,
as in Equation (8-14). The deflection of the beam can be predicted on the basis
of familiar classical arguments. We use the temperature of the sample to obtain
the beam velocity according to the relation
M
— 2
3
= -k„T.
v
B
2 2
where M is the mass of the atom and k B is Boltzmann's constant. then let y We
denote the beam distance across the field as in Figure 8-10 and identify the time
of traversal as t = y/v. The vertical acceleration FJM is a constant, and so the
.
1 Fz 1 dBs y 2 dBz y
:M z4 "= =
2 M 2M' 3z v'
? rz
P,
dz 6k B T
The predicted deflections follow for a given field gradient and a known
magnetic dipole moment. We take the latter quantity to" be quantized when we
apply the formula to a beam of atoms.
Sz = hm s . (8-15)
We identify this variable with Pauli's formal two-valued degree of freedom, and we
attribute the results of the Stern-Gerlach experiment to its discrete properties. The
vector Sbe treated like the quantized angular momentum vector L. We know
is to
that L, has 2/4- 1 discrete eigenvalues for some choice of the quantum number /,
and so we argue that there must exist a number s such that m s
has the analogous set
of 2 s + 1 allowed values
m =
s
—s,. . , s.
The quantization rules also require S to have a fixed magnitude determined by the
eigenvalue
S 2 = h 2s(s + 1) = ffc
2
. (8-17)
These properties are illustrated schematically in Figure 8-12. The z direction has been
selected arbitrarily, and so the quantized behavior of S holds for any chosen direction
in space along which the component of the vector may have either of the two definite
values +h/2. The figure reminds us that only one component of S can be specified
with certainty, as must be the case for any angular momentum vector. The two
illustrated orientations of S describe the spin-up and spin-down states of the electron.
We call the electron a spin-^ particle to summarize the quantization properties
contained in this description.
8-5 The Properties of Electron Spin 395
Figure 8-12
Uncertain
The quantum number m becomes part of a revised picture for the states of the
s
atom. Now that we have adopted the symbol m to define the discrete values of Sz we
s
,
must go back and refine our previous notation for the magnetic quantum number m
associated with L,. Hereafter we denote the eigenvalues of L, as
and employ the two independent indices m ( and m s to specify states according to the z
components of L and S. A stationary state of the one-electron atom is then assigned a
set of four quantum numbers {n£m ( m s ) to determine a complete configuration of the
quantum system. We express the corresponding wave function ^nfm ,„ for the given
energy state of the atom, with definite orbital description in space and definite up or
down orientation of electron spin, by writing
%, m , m ,
= R„Ar)Y, m ,(9,4>)e- E ^Hl or 1). (8-18)
We use the rightmost symbol to convey the meaning of the new electron quantum
number; spin up ( T ) denotes m =
s
~, and spin down ( J, ) denotes m = —
i \. This new
two-valued property doubles the number of states that appear in the energy level
diagram of Figure 7-2. Thus, Equation (8-18) is the basis for the revised scheme
shown in Figure 8-13, in which the energy levels depend only on the principal
quantum number n, and the degeneracy of the states at energy En is equal to 2n 2 ,
Figure 8-13
Energy levels E n
and degenerate states ^f„/ m m for the one-electron atom. The dynamics of the
atom is governed only by the Coulomb force, as in Figure 7-2. The spin of the electron may be
either up (T) or down (i) for each assignment of the set of quantum numbers (nc'm^).
e=o e = l < = 2 e=3
I or 1 t or l t or I i or 1
n = 4 4s 4p Ad "4/ N 32 states
(1 x 2) (3 x 2) (5 x 2) (7 x 2)
K-. n=3
(1 x 2)
3s
(3 x 2)
3d
p
(5 x 2)
3d M 18 states
- n = 2
(1x2)
2s
(3x2)
2p 18 states
- n = 1 Is K 2 states
(1 x2)
The Stern-Gerlach experiment implies that both L and S have their own unique
magnetic moments and g-factors. Let us rewrite the orbital magnetic moment from
Equation (8-6) in the form
L
= 8 " 19 )
V-l -SlVbJ (
as before.We then introduce the new spin magnetic moment of the electron by the
analogous relation
S
V-S
= -gsPBj- ( 8 " 2 °)
We regard this formula as the defining equation for the spin g-factor g s a parameter to ,
about the value of this quantity. In the relativistic quantum theory of the electron,
however, it is possible to show that the spin g-factor should be given by
fo-2.
This number is interesting because the predicted value is exactly twice that of g L and,
furthermore, is in good agreement with experiment. It is customary to express
measurements of g s in terms of the deviation from unity for the ratio g s/2. The
current value for this deviation is quoted to astounding accuracy as
fa- = 0.001159652193.
Example
Figure 8-12 provides a way to visualize the two electron spin states in which Sz
has the values h/2 and — h/2 with zero uncertainty. The spin vector S makes a
fixed angle with the z axis, given by
h/2
cos = cos 54.7° in the spin-up state.
fih/2
Notice that S has a completely random azimuthal orientation in each case. The
spin magnetic moment p, 9 is aligned antiparallel to S, according to Equation
(8-20). The magnitude of this vector is given in Bohr magnetons by the value
This fixed result should be contrasted with the magnitude of the orbital
magnetic moment for a one-electron atom,
OJ,
^0 and VMn = -V-' Bo
2m.
The energy VM is altered, and the angle between and B is changed, if a second
\l
weaker field is imposed so that its direction varies to accommodate the precessional
motion of p,. Figure 8-14 illustrates the effect of a second field B^, which oscillates with
frequency u/2tt along the indicated y axis. The additional torque \i X Bw acts in the
left part of the figure to give an increase in the polar angle 6. If to is chosen to equal
to the direction of Bu reverses after half of a Larmor cycle and acts in the right part
,
(V M ) = g^B B m -
Figure 8-14
Figure 8-15
&vbB„
(We refrain temporarily from specifying g and m so that we can discuss magnetic
resonance as a generic process.) Figure 8-15 shows two such levels for successive values
of the magnetic quantum number, with energy spacing
8E Mn g^B B
eh
hu = 8EM = g—t-B =ghuo, (8-21
so that the result is the upward transition indicated in the figure. (Note that c" may
remain unchanged since the electric dipole selection rules do not apply in this
instance.) We know from our calculations in Section 8-3 that the magnetic splitting is
The resonance condition in Equation (8-21) can be used to make precise g-f actor
determinations from precise measurements of the resonance frequency. Of course, a
prior calibration of the field B is necessary in such an experiment. The resonance
technique can be employed in a situation where the transition occurs between the
spin-down and spin-up states of an atomic electron. In this case the relevant g-factor is
and the process is called electron spin resonance. The quantity g s is very
identified as g s ,
Figure 8-16
Signal
Vapor
chamber
T
A/, M 2
Resonance
Magnetic
Stern-Gerlach field resonance Stern-Gerlach field
fields
dBz 3B
gsVB m s
(8-22)
oz
for an atom in a definite m state. We let the field strength increase vertically as
i
course. The figure shows how the slit S, selects two such orbits, one curving down and
one curving up in A/,, the first Stern-Gerlach field. These paths cross and continue
into a second nonuniform field M 2
where the field gradient is reversed relative to v M
The split beams with m = + s \ and - \ are steered through 2
M
as shown and are
and a horizontal oscillating field B w This part of the apparatus does not introduce
.
transitions and does not affect the paths of the atoms unless the frequency w is at
resonance. The magnetic-resonance effect flips the electrons from spin-down to
spin-up, so that atomic transitions from m = - \
i
to m = +
s \ occur where the
beams Those atoms that flip
cross. from - \ to +
\ are deflected up rather than
down in the field
2
and M
fail to take the proper path for detection at S 2 Hence, fewer .
atoms are collected at resonance, and a sharp dip in signal is observed in the detector.
A high-resolution measurement of the resonance frequency and an accurate calibra-
tion of the constant field BQ can then be used to give a precise determination of the
g-factor (gs in this case), with the aid of Equation (8-21).
The first studies of directed beams of neutral molecules were carried out in 1911 by
L. Dunoyer, and the earliest molecular-beam techniques were developed subsequently
by Stern. The magnetic-resonance method was built into atomic- and molecular-beam
experiments by I. I. Rabi in 1938. This innovation enabled Rabi and others to make
precise measurements of various kinds of magnetic moments, first for nuclei, later for
molecules, and finally for atoms. P. Kusch developed another resonance technique
and used it to measure the spin magnetic moment of the electron.
specifically at the properties of spin- \ particles and are " performed" with the aid of
8-7 Spin-' Thought Experiments 401
Figure 8-17
dz I
Beam of
•ground-state
H atoms
Stern-Gerlach fields. These studies are interesting because they illustrate some of the
basic language and notation of quantum mechanics, without the intrusion of differen-
tial equations.
We suppose that the beam particles are hydrogen atoms in the ground state. The
atoms have orbital angular moment ^=0 and exist in two quantum states labeled by
the quantum number m s . Let us first consider a beam directed along the horizontal
axis of a Stern-Gerlach magnet, where the field gradient points in the negative z
direction. We find from Equation (8-22) that atoms in the m = +
s \ state curve
upward and atoms in the m = —
s \ state curve downward, as indicated in Figure
8-17. Note that the orientation of the magnet defines the z axis for the specification of
the Sz eigenvalue hm s
.
We can guide the two split components of the beam back together by introducing
other Stern-Gerlach magnets in the manner of Figure 8-18. The three indicated field
sections are aligned in tandem with their z axes parallel to the same direction and are
assigned field gradients of equal magnitude and alternating sign. The central section is
twice as long as each end section so that the curvature of the trajectories causes the
two paths to rejoin at the left end of the apparatus. (We let the beam travel from right
to left to accommodate the quantum mechanical notation introduced below.)
It is clear that this system of fields has no net effect on the composition of the beam.
Atoms enter together on the right and follow discrete orbits up or down with random
probabilities. The same atoms rejoin and leave together on the left, as long as the
device contains nothing in between to affect one or the other of the two m i
pathways.
We can convert the instrument into a polarizing filter if we insert an absorbing
obstacle in one of the paths. Figure 8-18 indicates how such a barrier removes atoms
in the m = —
s
-, state and transmits a polarized beam in which every outgoing atom
has spin up with respect to the z axis defined by the direction of the fields. The figure
also shows a simplified version of the apparatus, with and without the barrier, for use
in the following thought experiments. We emphasize that the indicated signs of the
quantum number m i
refer throughout to a particular choice of z direction. We denote
this direction by the symbol Z in the figures as a reminder of this important point.
Let us now examine what happens when we send the beam of atoms through two
such devices in series. The polarizing filter of Figure 8-18 is used to prepare a
polarized beam for transmission into a second apparatus with the same coaxial
alignment. We assume that the fields are designed with equal strengths and equal
gradients. The fields are oriented in this part of the experiment so that the z axis
defined in the second device has the same direction as the z axis defined in the first.
402 Spin and Magnetic Interactions
Figure 8-18
Stern-Gerlach fields with alternating gradients.The nonuniform fields divide and rejoin a spin- 7
beam in the first apparatus. The second apparatus acts as a polarizing filter to transmit only
m = +
t
7 atoms. Schematic pictures of these transmitting and polarizing devices are shown at
the bottom of the figure.
Figure 8-19 shows the results of three simple thought experiments. The transmitting
apparatus in (a) presents no barrier to upward- or downward-curving trajectories and
simply transmits any polarized beam without modification. The filter at the second
stage in (b) allows only m = +
s ^ atoms to pass and has no effect on the given
polarized beam. The second filter in (c) terminates thebeam because the apparatus
passes only m = - s \ atoms. Note that the quantum number m retains its meaning
s
from the first filter to the second, since the z axis has the same direction in each
device.
It is instructive to associate quantum mechanical probabilities with experiments of
this kind. We begin by defining the complex-valued quantity
X(d,p)
as the quantum mechanical amplitude for an atom, prepared in the state p by the first
apparatus, to be detected in the state d by the second apparatus. We then express the
8-7 Spin-' Thought Experiments 403
Figure 8-19
Stern-Gerlach apparatuses in series. The z axes have the same direction in each case, as
indicated by the common label Z.
(a)
(6)
(c)
2
\x(d,p)\ -
(Note that the preparation and detection states read from right to left in these
expressions, while the atoms proceed from right to left in the accompanying figures.
The final <— initial notation for a quantum process follows a commonly adopted
convention. We have already used this notation in the transition amplitude of
Equation (5-70).)
The experiments in Figure 8-19 are described in terms of states specified by the sign
of m s , where the quantum number is defined according to a certain direction for the
axis of quantization. We are using the label Z to indicate this direction in the figure,
so let us represent the experiments in parts (b) and (c) with the aid of the same
notation. We introduce the amplitudes
an atom with spin up and down along Z, given that the atom is
for the detection of
prepared with spin up along Z. These two particularly simple situations have rather
obvious probabilities:
X ( + Z, +Z)\ =
2 2
\
1 and \ X (-Z, +Z)| =
in case (b) and Another simple thought experiment can also be performed
in case (c).
with a beam of atoms prepared in the m s = — \ state. The resulting probabilities are
\ X (+Z,-Z)\
2
= and \ X (-Z, -Z)\
2
= 1
Figure 8-20
Coaxial Stern-Gerlach devices. The quantization axis in the analyzer makes an angle /? with
the quantization axis in the polarizer.
_, 2
^ •
z
Analyzer Polarizer
2 2
\X(+Z\ +Z)\ # 1 and \x{~Z', +Z)\ *
when we deviate from the simple /? = situation. The two alternatives are shown
schematically in Figure 8-21. Again, note that the indicated signs specify values of m s
peculiar to the particular quantization axis. Only these two detection alternatives
exist, and so the sum of the two expressions must give unit probability:
If we let the polarizer prepare a beam of atoms with spin down along Z and conduct
another thought experiment, we find another pair of probabilities whose sum must
obey a similar relation:
lx(+z\ + X (-Z',-Z)| 2 =
|
1. (8-24)
These equations are constraints on the four amplitudes parametrized by the angle /?.
It is reasonable to suppose that the amplitudes are periodic functions of /?. We already
know the probabilities for the special case fi
= 0, and we can also deduce the
Figure 8-21
Thought experiment with nonzero angle of rotation between analyzer and polarizer. Atoms
prepared with spin up along Z are detected with spin up and with spin down along Z'.
8-8 Addition oi Orbital and Spin Angular Momenta 405
probabilities for the special case of a rotation with angle (i = it. It is possible to use
these logical arguments, and no other mathematics, to obtain the following general
results:
| X (+Z',+Z)|
2
= X (-Z',-Z)| 2 =
|
J (8-25)
and
sin
J-.
2
(8-26)
Example
| X (-Z',-Z)|
2
= cos
2
- =0 and | x( + Z\ -Z )|
2
= sin
2
- = 1
for finding the atoms with spin down and up along Z'. We might have
anticipated this result on logical grounds and put the argument to use in the
deduction of the general equations. If we take fi
= tt/2 as another case, we find
the probabilities
77
Hence, we conclude that there is a 50-50 chance for an atom with spin up along
some direction Z to be found with spin up or down along another direction Z'
at right angles to Z.
The angular momentum of the one-electron atom includes both orbital motion and
electron spin. Each contribution has its own magnetic moment, and hence its own
interaction in an applied magnetic field. We have been able to concentrate on the spin
of the electron by selecting states in which the orbital angular momentum is equal to
zero. The quantized vectors L and S must be added together in situations where more
general kinds of states are considered.
m
We define the total angular momentum of the atom by the addition of vectors
J = L + S. (8-27)
Equation (8-27) is intended to hold for orbital and spin angular momenta with
quantized magnitudes and orientations. The resulting vector J is itself a quantum
mechanical angular momentum, and so the quantities J and Jz are supposed to
obey quantization quantum properties of L and S. Hence, there must
rules akin to the
exist a discrete nonnegative number j such that the quantized values of J~ occur as
eigenvalues of the form
2
h j(j+ 1).
The numerical variable j is called the angular momentum quantum number. Each
total
set of J. eigenvalues
hm )
with m = -j,-j + l,...,j-
/
1 , j.
These rules employ the quantum numbers j and m to express the quantized ]
magnitudes and orientations of the vector J, just as the quantum numbers £ and m f
determine the quantized behavior of the vector L.
The quantum number m takes on 2j + possible values in integer steps between
f
1
tn
j
m,+ s
since J =
2
L z + Sz . (8-28)
half-integral, so that j itself must range over the positive half-integers \, |, |, . . . . The
allowed values of j vary with the orbital quantum number /. We note that L makes
no contribution to J in the special £ = case, and so we find
The determination of j for nonzero £ requires the addition of two quantized angular
momenta L and S to yield a third quantized angular momentum J. Only two possible
values of j result from this procedure:
We look more closely at the addition of these vectors in the example below.
The vector-addition problem is described schematically in Figure 8-22. We choose
for illustration the £= 1 states of the vector L and combine these configurations with
the spin-up and spin-down states of the vector S. This combination of angular
8-8 Addition of Orbital and Spin Angular Momenta 407
Figure 8-22
Vector addition of orbital and spin angular momenta. The factors of h are suppressed, and the
£= 1 case is chosen for L. The sum 1 + \ produces the
results § and \ for the vector J.
momenta can be represented (without the h factors) according to the addition formula
1 + \ = | or £
as stipulated in Equation (8-30). It is understood that the lengths of the vectors in the
figure are L/k, ]/3 /2 for S/h, and either /l5 /2 or
]/2 for /2 for J/h. In part (a) $
add = 1 and m s = 2 to § et m = i an<^ obtain j = A as the only possible
, ' ,
result. In part (b) we show that the combination of m ( = 1 with m, = and [he-
combination of m { = with m = ^ contribute together to m = \. Consequently,
s
both of these vector sums can be arranged to produce a j= configuration. In part '
(c) we find that the same orbital and spin states can also be combined in another
arrangement with m- = \ to form a j = % state. Note that the pictures incorporate
'
the feature of azimuthal uncertainty, as required for any angular momentum with
definite z component. This aspect of the vector-addition problem is rather difficult to
408 Spin and Magnetic Interactions
Figure 8-23
IVA
<-i
new scheme is based on the construction of wave functions of the form ty„f m where ,
with definite assignments of m f and m Figure 8-22 suggests how such constructions
s
.
are shown together as a (j = \, m = L,) state in configuration (b) and are again
shown together as a (j = \,m-= \) state in configuration (c ).
'
A transformation of the wave functions takes the original set of quantum numbers
{n^m f m s ) into the alternative set (n^jm^. We observe that a given energy level En
has orbital quantum numbers
c°= 0,1,2,..., n - 1
j=f+ \oxt- \,
Figure 8-24
En atom. Each assignment of
Energy levels and degenerate states
%y m for the one-electron
each level. The spectroscopic notation nL is used to designate the states. This scheme is an
j
alternative to the one described in Figure 8-13. The Coulomb force provides the only
interaction in each of the two figures.
(= 1 (=2 (=3
i ._ 3 5
;
=
2
'- 2 >= 2
EA 4Sy2 4 '%-
78T 4F ^
n = 4- 4Pi,
74T 4P % -4D3/ -4D5/
(2) (2) (4) (6) 76T
n=3- 3Sy2
1W 3D ^ Wb
(2) l2T 3P '/
2 (4)
3P3L
~W l2
-2P*
T2T 2P %-
- n=2-
l2T 2Sl /2 (4)
- n= 1' •lSi
(2)
This part of the designation is equivalent to the use of the symbols s, p,d, /, . . . for
single-electron states.
It should be emphasized that the energy levels in the figure are still those
determined by the effects of the Coulomb force alone. We have yet to be given a
physical reason for setting aside the quantum numbers {n£m f m ) s
of Figure 8-13 in
favor of the quantum numbers (nfjm ) of Figure 8-24. We may continue to use mf
and m as long
s
as L, and 5. maintain their status as separately conserved angular
momenta. The necessity for the alternative scheme based on j and m emerges in
Section 8-9.
Example
Let us examine the reasoning that leads to the possible va lues of J in Eq uation
(8-30). We consult Equations (8-28) and argue that 3 cannot exceed ^+ l
,
because
01
We may test this constraint by inserting any value of j in the quantized range of
possibilities
s
7 '
2 » 2 ' 2 > 2 >
'
• *
'
The proof that j cannot be smaller than ( — 7 is left as Problem 13 at the end
of the chapter.
The energy levels of the one-electron atom deviate from the results shown in Figure
8-24 because of two additional dynamical effects. Both of these further contributions
to the energy of the atom are relativistic in origin and secondary in influence
compared Coulomb potential energy. One of the effects is associated with the
to the
spin of the electron and is known as the spin-orbit interaction. This source of
interaction energy introduces a fine structure among the degenerate states in the figure
and splits the levels into multiplets of states with slightly different energies. The
multiplet structure of the energy levels is observed in the emission spectrum of the
atom and is interpreted as direct evidence for electron spin.
A coupling occurs between the spin of the electron and the orbital angular
momentum, owing to the existence of a magnetic field internal to the atom. We employ
Figure 8-25 to deduce the nature of this interaction. The figure shows an electron in
orbit around a nucleus at rest and also shows the motion with respect to a reference
frame moving with the instantaneous velocity v of the electron. The nuclear charge Ze
has velocity —v in this frame, and the resulting current produces a magnetic field at
the instantaneous location of the electron. We can use the Biot-Savart law to write the
field in terms of the variables indicated in the figure:
Mo Ze{-\) X r
B — ;
3
.
477 r
Ze r
E = 3
47re r
8-9 The Spin - Orbit Interaction 41
Figure 8-25
Atomic orbital motion in two frames. The nucleus is at rest in the original frame, and the
electron is instantaneously at rest in the transformed frame.
Mo e o 2
c
v X E
B = =—
Ze r X v
B internal 2
i
477C,i C r
3
477£ m ec 2 r 3
We note that the final result contains the orbital angular momentum L = m r X r
v.
The electron spin magnetic moment interacts with the internal B field at the site of
the electron, with the same interaction energy as in the case of Larmor precession. We
make a provisional definition of this potential energy by setting
Km = ~~ V"S' B internal'
Ze
l\si
(*"T' 4:TT£ m e
c
2
r
3
Equation (8-7) is recalled and the value g s = 2 is inserted in the following steps:
Ze
VSL =
m e
4WE »?/V
Ze 2 S • L
2 2 3
(8-31)
47T£ m e
c r
412 Spin and Magnetic Interactions
Ze 2 S • L
vsL=-7—rm-
477£ 2m e
c r
( 8 - 32 )
Figure 8-26
Random orientations
8-9 The Spin- Orbit Interaction 413
Figure 8-27
Coupling of spin and orbital angular momenta owing to the spin-orbit interaction. The effect
total angular momentum of the system is conserved as long as the system consists of
the isolated atom. We therefore turn to j and m as good quantum numbers and use
these parameters instead oi m f and m s
to designate the stationary states. The wave
functions *&„?.„ have been introduced and Figure 8-24 has been drawn in anticipation
indicated configurations are composed of two distinct (m^m^ states with the same
value of m-, where the combinations of L and S are arranged to form states with
j = c* — j = i + l Let us picture the synthesis of the two states with the aid of
'
\ and ' , .
Figure 8-27. The influence of the spin-orbit interaction is indicated in the figure as a
precession of the coupled vectors L and S around
the vector J, while J assumes an
uncertain azimuthal orientation about the axis of quantization. Precession is used in
this instance as a schematic device to represent the behavior of the two (m^mj states
in the construction of the two possible values of j. We are about to learn that these
constructions of the quantized total angular momentum correspond to states of the
atom with different values of the energy. This mechanism causes the degenerate states
nfjn in Figure 8-24 to split apart so that atoms with the same n and c" acquire
slightly different energies for different choices of j. Let us also observe in passing that
the properties of L, and Sz in Figure 8-26 become interesting again in cases where the
spin-orbit coupling has a negligible effect. This situation prevails when the atom is in
an external magnetic field whose strength greatly exceeds that of the internal magnetic
field leading to Equation (8-31).
The spin-orbit interaction is one of two relativistic effects that contribute to the
fine structure of the atom. We recall from Section 3-6 that a useful dimensionless
parameter is furnished by the fine structure constant
a = '
Aire hc
and now we are able to appreciate the meaning of this parameter. We incorporate a
into Equation (8-32) to get
h S-L
VSL = Za 2
2m c
4
J =
2
(L + S) • (L + S) = L 2 + 2L • S + S2
J -L -S
2 2 2
h
VSL = Za ir-r -3 .
(8-33)
\m 2c
When the expectation value of this quantity is evaluated in the state ^„^. m , the
expression takes the form
The quantum numbers appear in this result as a consequence of the relabeling scheme
for the wave functions. Each wave function is denoted by the quantum numbers
2
(ntfjm^ so that the observables J , L2 , and S2 have the definite values
h j(j + 1), /iV(/+ 1), and \h in the given state. The expectation value (VSL ) is
interpreted in terms of Figure 8-24 as the amount by which the spin-orbit interaction
shifts the energy levels for the various assignments of these quantum numbers. Note
that the square-bracketed quantity in Equation (8-34) vanishes for {= states, where
j can only be equal to £ , and is double-valued for / ¥= states, where j can be either
/+ l or
, t- \.
The computation in Equation (8-34) is completed by inserting Equation (7-35)
from Section 7-4:
3
/ aW(/+ 1)(2/+ 1)
(We observe that the expectation value of any function of r in the state ^ n ^ ]m can
only depend on n and ( the quantum numbers for orbital motion.
, The calculation of
(1/r ') therefore reduces immediately to the integral
/;
-P Ar)dr n
as in Equation (7-31), where P„f(r) is the radial probability density for the given
state.) We also recall Equation (7-5) and again use the fine structure constant to get
2
477£ ^
Ze m. Zamc
(The reduced-mass correction m e /\i. is replaced by unity for this purpose.) The desired
expectation value is then written as
1 \ / Zam r
c
r
1) " \ hn j /(/4 1)(2/
8-9 The Spin- Orbit Interaction 415
for /^ states. We insert this result in Equation (8-34) and obtain the final formula
for the energy level shift due to the spin orbit interaction:
Zam/
<ySL ) = za j(j +l)-/(7+l)- -
4m~c hn f(f+ 1)(2/+ 1)
zv j(j+ i)-/(/+ i)
(8-35)
3
2n ' /(/+ 1)('_V+ 1)
This version of the formula gives the energy shift in terms of m e c the rest energy of ,
the electron. We can refer back to Equation (3-54) and substitute the Rydberg energy
unit instead. We use the equality
m c
and find
=
ZV j(j+ 1) -/(/+ 1)-
Wsl) '•^o
(8-36)
e(t+ \){2t+ i)
This version of the formula gives the shift in terms of E , the natural scale of energy
2
in the atom. The spin-orbit energy is proportional to <x EQ and is therefore a rather
small correction to any given energy level.
These results can be applied to the states with nonzero ( in Figure 8-24. We see
that the values of the quantum numbers (n£j) are indicated at each of the energy
levels in the diagram, and we note that j is equal to either tf+ ^ or ( — \ for each
selection of n and /. Equation (8-36) determines an energy shift at every /=£ level.
Z A
a 2E
j(j+ l)-S(S+ 1)- 7 =£ and (VSL ) =
«V+
'
1)(2/+ 1)
= (— \ has
Z*a 2E n
j(j +!)-/(/+ 1)- - = -€- 1 and (VSL )
nV(2/+ 1)
The spin-orbit splitting is found by taking the difference of these two expressions:
M.i^i.
'
z * a 2E ° l
8ESL I (8-37)
n\2t+ 1)\ t+ 1
" {)" «V(/+ 1)'
Hence, the coupling of L and S causes the state nL,+ l/2 to lie higher than the state
nhf_ x/2 and results in a doublet of states with the same n and ( as shown in Figure
8-28. The remaining contribution to the fine structure of these levels arises from
another relativistic effect, to be discussed in Section 8-10.
.
Figure 8-28
one-electron atom.
nl
i'<-\
Example
Let us apply Equation (8-36) to the 2P states of the atom and compute the
magnitude of the splitting illustrated in Figure 8-28. The energy shifts for n = 2
and (— 1 are
=
ZV f -2-
— J
=
Z'a'% 3
= -
(VSI
sl/) E for j
\
8 ° 1-2-3 48 2
and
=
ZV -\-1~\ =
Z 4 a% 1
= -.
<Fe,>
\ w./
E° for j
8 1 •
2 3 24 2
We take Z= 1 in the case of the hydrogen atom, and we find the difference of
the two shifts to be
a 2E eV
8E S/ = a'E.A
/
—
1
48
+ —
1 \
= -
16
()
13.6
r
2
= 4.53 X 10~ 5 eV.
I 24 J 16(137)
The smallness of this splitting is due to the smallness of the factor a'. The
spin-orbit interaction has an even smaller influence on the states of the
one-electron atom for larger values of n and (
The Bohr energy levels reveal a doublet substructure when the analysis of the atom is
Z2 I.
—M
E =
.
and observe from Equation (8-36) that the ratio of (VSL ) to E n is of order Z a We
2 2
.
know (from Problem 14 in Chapter 3 and Problem 10 in Chapter 7, for example) that
v/c is proportional to Za for orbital motion in the atom. It follows that the spin-orbit
energy shift is a relativistic correction of order (v/c) relative to the Bohr energy. The
other fine structure effect is of the same order in v/c and should therefore appear to
the same order in powers of Za.
This second correction involves the relativistic treatment of the kinetic energy of the
electron. We recall that the Schrodinger equation for a particle of mass m is associated
with the classical energy relation
—P
2m
+ V= E,
in which the energy E excludes the rest energy mc and the kinetic energy varies with
p in nonrelativistic fashion. The relativistic formulas in Equations (1-35) and (1-37)
give the desired version of the kinetic energy as
p + m c — mc
2 2 2 A 1
\c .
The expansion of this expression in powers of (p/mc) 2 yields the following result:
1/2
—rr-j
2
p
1 1 p*
+ - -^
\
mc~\ 1 + I
- mc 1 = mc' 1 )
1
~ ~
o —r-7
4
+ l>
• •
m
<-
m~c 2 m c o c
1
P P"
2m 8mY 2
We recognize the leading term as the familiar nonrelativistic kinetic energy, and we
regard the next-to-leading term as the first relativistic correction. The leading contri-
bution turns into the usual differential operator — {h 2 /2m)\7 2
acting on the wave
function in the Schrodinger equation. We define the correction term for application to
the atomic electron by the formula
4
P
*rel=-7-TT- ( 8 " 38 )
8m c e
2
It is easy to verify that this correction is of order (v/c) relative to the leading
nonrelativistic term.
The extra kinetic energy # rel
is treated as another small interaction to be added to
the energy of the atom along with VSL . We compute these corrections by obtaining the
expectation value of K kX in the state
- lE ,/h
^ncjm =
, xb ,
T nejm e »
and by combining the result with the already-derived evaluation of (VSL ) in the same
state. The formula for (A' re ,
) is found in the derivation at the end of the section:
ZV 3
^m/ —
/ 1 \
(^rel) = 3 f \ c
•
(8-39)
rc 2/+ \ 1 8n
\
This expression contains the same power of Za as occurs in Equation (8-35) for (VSL ).
We then determine the total fine structure shift in energy for any /¥= state by adding
the two corrections:
^ 4« 4 Jj(j+ !)-'('+ 0- f 2 3
<ySL ) + <* rel > = ^r^|- /(/+1)(2 , + 1)
-
^TT +
^
This combination of terms simplifies considerably to become
ZV / 2 3
<^> + <^> = -^r^ (^-TT-^)- 2
< 8 - 40 >
ZV /
—— - —
2 3 \
Enj = En
J rE . (8-41)
n
\
\ +
2j 1 in J
The corresponding energy level diagram is shown in Figure 8-29. We magnify the
small effect of the fine structure so that we can see the emergence of a remarkable
pattern of degeneracies. We observe that the energies of the states nL, depend on the
values of n and j, but not on the value of /. The resulting degeneracies appear
,
throughout the figure in the pairs of states (26'
1/2 ,2/ 1/2
),(35'
1/2 ,3/ 1/2 ),
)
(3P 3/2 ,
3D 3 /2 )>> where each pair has a certain energy for the given n and j, without
regard for /.
The fine structure of the energy levels affects the wavelengths and splits the spectral
lines in the emission spectrum of the atom. The electric dipole selection rules
determine which initial and final states of the atom are linked in the various radiative
transitions. We have studied these rules in Section 7-5, prior to our introduction of
electron spin. The properties of the electric dipole vector are spatial and do not
pertain to spin, so that the spin quantum number plays no part in these selection rules.
We know that atomic states can participate in a given transition if a change in parity
and a change in the quantum number c" occur such that
We also know that the emitted electric dipole photon carries away one h unit of
angular momentum. Therefore, the initial and final quantum numbers j and j' in the
process
(nSjmj) -» (n'l'j'm'i) + y
in order for the total angular momentum to be conserved. It follows that changes in j
810 Results from Helativistic Quantum Mechanics 419
Figure 8-29
Energy levels En for the one-electron atom. Each choice of n and j gives a pair of degenerate
nL with two different values of /. The fine structure
states
t
effects shift the levels away from
their positions in Figure 8-24. The greatly exaggerated splittings indicate how the shifts
diminish with increasing values of the quantum numbers.
e= (= 1 e = l f = 2 6=2 ( = < = 3
= ._ 3 ._ 5 7
} 1 1-2
- n
- n
= 4
= 3
4Si/ 4J>i/
2
3Si/ 3Pi/
2
-
- -4P3/ 4Z)3
2
3/2
-
-
-Ws/AFs,
3D
W
51
AFi.
2 2
-2P3/,
n = 2
'2
- n= 1- lSi/.
Figure 8-30
-2P3
I
IP, '
15 i ::
and 2P 1/2 l.S
1/2)
resulting in two distinct wavelengths for the two different transition energies. The
difference is small enough to require high-resolution interferometry as a means of
distinguishing the spectral lines.
A consistent fine structure theory was developed by Pauli in 1927 as part of his
method for incorporating spin in the Schrodinger equation. This treatment of relativ-
istic corrections belonged to the framework of nonrelativistic quantum mechanics and
offered no theoretical explanation for the origin of electron spin. Another theory of the
was put forward with more spectacular consequence by P. A. M.
relativistic electron
formalism based jointly on the principles of relativity and the principles of quantum
mechanics. His differential equation for the wave function employed the relativistic
relation between energy and momentum and assumed a form entirely different from
Schrodinger's wave equation. This new relativistic theory automatically allowed for the
spin of the electron and reduced to Pauli's theory in the nonrelativistic limit.
810 Results from Relativislic Quantum Mechanics 421
Paul Dirac
Let us recount the successes of Dirac's theory and leave the elegant formulation to
a more advanced treatment of quantum mechanics. The theory starts with the free
electron and imposes the classical relativistic energy relation appropriate for a particle
of mass m:
2„4
c~p~ + m^c
Since the relativistic momentum p and energy E appear with the same exponent, the
corresponding differential operators d/dx, d/dy, d/dz, and d/dt occur to the same
order in the differential equation for the wave function ^(x, y, z, t). A first-order
time derivative is assumed so that the wave function evolves in time from a
specification of the state at / = 0, as in the Schrodinger theory. The Dirac equation is
therefore of first order in the spatial derivatives of ^ . It follows that the differential
equation for ^ can be properly constrained by the relativistic relation between p and
E only if ^ has a certain multivalued structure. The wave function has a two-valued-
ness corresponding to the spin-up and spin-down properties of a spin- -, particle, as
described by the Pauli quantum number m s The wave . function also has another
two-valuedness associated with the characteristics of particle and antiparticle . Thus, the
free-particle Dirac equation predicts the existence and describes the behavior of
spin- 7; particles and antiparticles (such as electron and positron) with spin up and
down.
An B can be introduced in Dirac's theory of the electron.
external magnetic field
The result an interaction with B via a spin magnetic moment whose g-factor is
is
exactly given by g s = 2. The Dirac equation can also be applied to an atomic electron
in the Coulomb field of a nucleus. When the interactions of the electron are examined
in powers of v/c, the relativistic corrections describe K rel as given in Equation (8-38)
and also VSL as written in Equation (8-32). The latter result automatically expresses
the spin orbit interaction in correct form, complete with the Thomas factor of l t
. The
Dirac equation can also be solved exactly to determine the energy eigenvalues for the
422 Spin and Magnetic Interactions
hydrogen atom. These solutions contain the effects of relativistic motion to all orders
in v/c and all orders in the fine structure constant a. The resulting exact energy levels
depend only on the quantum numbers n and j and reduce to the energies E n in
Equation (8-41) when the solution is expanded through terms of order ct 2E .
Dirac's theory was verified decisively in every relevant experiment. The prediction
of antiparticles was especially significant as a bold new innovation in theoretical
physics. The proposed existence of the positron was finally confirmed by the discovery
of the antiparticle in 1932. The theory p rovided the general basis for a complete
understanding of the relativistic quantum behavior of any spin-^ particle and was
hailed as one of the great accomplishments in 20th century physics.
Figure 8-31
Highly schematic plan of the Lamb-shift experiment. A beam of ground-state hydrogen atoms
is passed through an electron-bombardment region where the atoms can be raised to excited
states by electron collision. Atoms excited by the 15, ,
2
-* 25, ,., transition are left in a
metastable state where the electric dipole selection rules prevent radiative transitions back to the
ground state. The excited atoms proceed through the waveguide to a detector designed for the
collection of metastable atoms. A field is applied to the beam in the waveguide
radio-frequency
and induces transitions from 2S
2 l
, when the frequency corresponds to the difference in
to 2-P,/
energy between the two nondegenerate states. The radiative transition 2P1/2 lS 1/2 is allowed
so that excited atoms in the 2/>1/2 state radiate to the ground state before they reach the
detector. This depopulation of the metastable 25, /2 state is observed as a decrease in signal at
the detector for an rf frequency around 1060 MHz.
$
<&+[
-» Waveguide Detector
H atom source
r
$ "I
Electron bombarder
?S:
-2Pu
rf-induced
transition
Collisional excitation
e+ H - e+ H*
IS:
810 Results from Relatimtic Quantum Mechanics 423
The multiplet structure of hydrogen has been presented in Equation (8-41) and
Figure 8-29. This system of levels was investigated experimentally with high-resolution
radio-frequency techniques by W. E. Lamb and co-workers during the late 1940s. In
studies like the one sketched it was found that the predicted
in Figure 8-31,
degeneracy of the (n 2, j = was =
actually broken and that the 2S ,2 state
r,) states X
had a slightly higher energy than its partner the 2P, /2 state. This measurement
became known as the Lamb shift. The observation was important because it indicated
the influence of phenomena that were not yet included in the relativistic quantum
theory. The departure of the spin g-factor from the value g s = 2 was another
slight
property of the electron that had no explanation in Dirac's theory. These discoveries
took place in the next period of exciting developments in quantum physics.
Detail
We derive the formula for the extra kinetic energy in Equation (8-39) by first
1 1
K„,=
- r.-l 2
o(e- vy
2m c 2m, 2m c
(* e.) r
((E 2 ) - 2(VE) + (V 2 )),
2m c
lE » t/h
to be evaluated in the stationary state ^ = ^ n fjm e . We are using a state
with energy eigenvalue E n , and so we find
2\ = ZT2
<^> = e:
and
The Coulomb potential energy appears in the second and third terms of (K rel ).
We employ Equations (7-33) and (7-34) to obtain the expectation values
Ze 2 I 1 Ze<
(V) =
4we \ r 4iTE an~
and
Ze' 2
(V 2 ) 2 i
47T£ f 477£ a n (2f+ l)
= ahc ,
477£ r
424 Spin and Magnetic Interactions
and we ignore the reduced-mass factor \i/m f to rewrite the Bohr formulas:
Z" Z1 m e
c h
-EQ = —a" and a =
1
n n 2 Zamc
Zahc Zamc Z 2
a2
and
,,(
2
Zamc\ 2
2 Z 4a 4 m 2c'
(V 2 ) = (Zahc) l—r-) =2-
3/0/i
3 ,, 3
'
h J « (2/+ 1) « (2/+ 1)
7?
1
1
/
Z-74 a 4 m,c
2 4 /
I Zamc
7 2 2 2 \
\
/
/ Zamc- 3 2
<**> = -7T-2—
2m.r 4n
T-4
4
2
2n
+2 J
n (2/+ 1)
4
Z
— ^mc
a4
2
/
\
1 1
)n 2n 2tf+ 1
The formula quoted in Equation (8-39) follows immediately from this result.
Example
Let us investigate the small shifts in wavelength for the Lyman a transitions
shown in Figure 8-30. We set Z = 1 in Equation (8-41) and obtain the following
energy levels:
a2 / 1 3 \ a2
E(2P.
v i/2)
i/2 )
= E,2 EA - -
- = E2 - -E°Q
64
,
8 \ 2 8/ "
2
a2 5a
E(2P X/2 )=E2 - -E \\-
1
-j
3 \
=E 2
- —E ,
3 a2
E(lS l/2 ) = E, - a 2 E Q ^\
1
-
-j =£,-
\
-£ .
15a 2
£(2P 3/2 ) - £(lS 1/2 ) = £2 -E x
- a 2£
(
— --
-j = A£ 4-
64 °
and
11a 2
£(2P I/2 ) - E(lS 1/2 ) = E2 -E x
- a 2£
(
—-- j
= A£ 4-
64
F
°'
811 The Zeeman Effect 425
AE = E2 - E = l
10.2 eV.
he he I
l5a L E t)
\(2P3/2 - lS l/2 ) = 1
and
2
In hi 1 la £
X(2/> - lS l/2 ) =
AE{\ + lla%/64AE) AE 64 A£
he 1240 eV •
nm
= 122 nm.
Je 10.2 eV
2
15a'£ he 15/ 1 \ 13.6
(122 nm) = 2.02 X 10" 3 nm
64 A£ A£ 64 137 10.2
and
1 \a'E he 11 1 \-13.6
(122 nm) = 1.48 X 10
(
nm.
64 A£ AE 64 \ 137 / 10.2
We conclude that the two wavelengths must be measured with at least six-figure
accuracy to be resolved as separate lines.
The Stern-Gerlach phenomenon and the multiplet structure of the atom demonstrate
the influence of electron spin. Further evidence for spin is found when the spectral
lines of the atom are observed in an applied magnetic field. We have discussed these
observations of the Zeeman effect early in the chapter in Sections 8-1 and 8-3. We are
now prepared to learn why the prediction of triplet line splitting is not confirmed, and
why the so-called anomalous pattern is seen instead. Once the Zeeman splitting is
Both orbital and spin magnetic moments contribute to \i, but only the former
426 Spin and Magnetic Interactions
contribution appears in our earlier treatment of the Zeeman effect. We obtain the
total magnetic moment of the one-electron atom by recalling Equations (8-19) and
(8-20) and inserting the g-factors g L = 1 and g s = 2:
We then determine the magnetic-interaction shift for a particular energy level of the
atom by calculating the expectation value
(vM )= -</*,>£
This formula employs the usual choice of z axis along the direction of B and calls for
the evaluation of (ju.) in a state that describes the atom without the application of the
field.
Let us first assume conditions in which the applied B field is much stronger than
the internal magnetic field responsible for spin-orbit coupling. This situation has
already been mentioned in our discussion of Figure 8-26. We neglect the spin-orbit
interaction and let the vectors L and S be decoupled so that the orbital and spin
magnetic moments perform independent Larmor precessions in the strong applied
field. The calculation of the magnetic interaction is then based on states of the atom
defined by m e and m as good quantum numbers. We use the wave function ^nfm m
v ,
A given orbital configuration has quantum numbers n and £ and contains 2(2/ -t- 1)
different magnetic substates. These magnetic energy levels are shifted by the energy
(VM ) according to the values of m f and m s We emphasize that we cannot label the
.
states by these quantum numbers unless the spin-orbit interaction VSL is dominated
by the magnetic interaction VM . This strong-field phenomenon is called the
Paschen-Back effect, a special case of the Zeeman effect, named after the spectrosco-
pists Paschen and E. Back.
interesting case arises when both VSL and VM come into play, particularly
The more
where the latter is the weaker of the two interactions. Our solution of this
in situations
while the state refers to the total angular momentum J = L + S. Figure 8-32 shows
that this feature of the problem prevents the vectors |X and —J from pointing in the
8-11 The Zeeman Effect 427
Figure 8-32
J = L + S and \i = \i
L + \i
s
. The g-f actors
M7
=!-r- :
-tt(l + 2S)-(l + s)
J hj
= - —
Mb
(L 2
+ 2S 2 + 3S-L).
Mb
M/ L 2 + 2S 2 + -(J 2 - I - 2
S2)
My
= ~—(3JJ y
2
+ S2 -L 2 '
(8-47)
2hJ
M, = \V-j + V-±j + \l
±b \cos6, (8-48)
J* =Jcos
428 Spin and Magnetic Interactions
Figure 8-33
Precession around B
average over the two periodic motions and obtain vanishing results for both vectors |i ± ,
and |j.
± H.This argument implies that Equation (8-48) can be replaced by the simpler
expression
M.-
the replacement is
2
(m,7 > = (JJiij)
OAV 2
+ s2 -l 2
)).
2h
811 The Zeeman Efleet 429
The two sides of this equality are to be evaluated for a level nL-, where eigenvalues
are specified for the observables J
2
, L 2 and S 2
, . We use these eigenvalue properties to
substitute
(JAV +
2
S
2
- L 2 )) = h
2
[3j(j + 1) + s(s + 1) - t(/+ l)](Ja )
on the right side. The resulting equality can then be rearranged to read
3j(j+l) + s(s+l)-S(t!+l)
<^>--2*
, x
ii B
JJTT) a >
= -EHb^t-, (
8 - 5 °)
n
where
j(j + i) + s(s+\)-n/+i) . .
2jU + U
This last coefficient in the final expression for (ju.) is called the Lande g-factor after A.
Lande, a pioneer in the period of the old quantum theory. We take note of the fact
that the spin g-factor has been explicitly written as g s = 2 from the beginning of the
calculation.
Equation (8-50) takes the place of Equation (8-10) in all applications of the
magnetic moment. The revised formula is more involved since the Lande g-factor is
not just a constant like g L or g s . We get g = 1 from Equation (8-51) and thus
reproduce the orbital g-factor g L if we arbitrarily , set s = and let J and L become
identical. Of course, we must take s = ~ and write
j(j+l)-t(t+l) + *
s= l + 8 " 52 )
2j(j + n
oY <
1)
in the case of the one-electron atom. This expression reproduces the spin g-factor
g s = 2 if the given state has quantum numbers /= and J
= L
,. Equation (8-50) can
be recast in vector form as
<H>=--/<J>^ (8-53)
since only the z component can have a nonvanishing expectation value in the state
^nfjmj- This result tells us that the observable magnetic dipole moment vector is
directly proportional to the total angular momentum, the only vector available to
characterize the state of the atom.
The Zeeman energy shift is deduced directly from Equation (8-50). We evaluate
the expression
430 Spin and Magnetic Interactions
in the state ^„^„, and use the fact that the wave function is an eigenfunction of J. to
write
a> = k-
The conclusion follows at once:
a result to be compared with Equation (8-12) for the fictitious normal Zeeman energy
shift. The actual situation pertains to a typical level nL
]
whose 2j + 1 magnetic
substates have the same energy E n) in the absence of an external magnetic field.
Equation (8-54) represents an energy shift due to the application of the field, causing
the magnetic substates to split apart and acquire different energies for each value of
the quantum number m . These Zeeman splittings differ from one nL to another
because of the variation of the Lande g-factor with £ and j. We therefore predict more
than three different transition energies for the various transitions allowed by the electric
dipole selection rules. The anomalous pattern of Zeeman spectral lines is observed as a
result, instead of the normal Lorentz triplet. The following illustration demonstrates
this effect.
Example
where each level nL- is split into its 2} + 1 magnetic sublevels. The Lande
g-factors and Zeeman energy shifts are given by Equations (8-52) and (8-54)
according to the values of m- as follows:
in the 2P W2 state g = 1 4-
The resulting sets of split levels are shown in Figure 8-34. Note that the Zeeman
812 Hypertine Structure 431
Figure 8-34
Zeeman splitting in the ground state and first excited states of hydrogen. An applied magnetic
field splits the levels for the various values of w ;
. The Lyman a fine structure transitions
™ 3
2P %: 2 1
2
3
2
1
2Si 2/»i
2
W V
_i
lSi 2 Mfi B W V W
I ZP 3/2
- IS i 2Py2 -lS 1
6 lines 4 lines
krrij = or + 1
asshown in the figure. It is apparent that all the transition energies are different
and that the fine structure doublet of Figure 8-30 turns into six-plus-four distinct
Zeeman lines.
The proton is a spin-^ particle, like the electron, with its own magnetic moment due
to spin. In fact, every atomic nucleus with spin is likewise endowed with intrinsic
magnetization determined by the distribution of constituents in the nuclear system.
The implications for the one-electron atom are that the nuclear magnetic moment sets
up a permanent magnetic field inside the atom, and the electron magnetic moment
432 Spin and Magnetic Interactions
experiences a weak Zeeman interaction with the internal field. This additional
magnetic atom is known as the hyperfine interaction. Its
effect within the influence is
x 1
2
such that the magnitude of the nuclear spin is given by the eigenvalue property
I
2
= h
2
i(i+ 1). (8-55)
The assignment of the quantum number i depends on the particular quantum state of
the given nuclear species. A magnetic moment can be defined for any nucleus by
analogy with Equation (8-20), the formula for the magnetic moment due to electron
spin. First, we reverse the sign of the charge and alter the mass scale of the Bohr
magneton in the equation. These steps replace the factor — fi B by the nuclear magneton
eh m
M" =
^ =
JT/''
(8 " 56)
The magnetic moment and the nuclear spin are then related by the vector equality
V-l
= + gifl NT- ( 8 " 57 )
— = 2.792847386
2
12 2
1
-.
Vss= 7 zT^'M-zV (8-58)
4i7e c 5 r
The two spin vectors appear in the formula when Equations (8-20) and (8-57) are
inserted:
2
1 2 / e m r
1
= \
^—\ M S Iv
*
2oSsgi\
~~a4tt£ c* 3 \ 2m TT r
•
q f j p
812 Hyper fine Structure 433
Note that Equations (8-7) and (8-56) are also employed to eliminate the magneton
factors in favor of fundamental constants.
The treatment of S • I in the expression for Vss parallels the technique applied to
F = S + I (8-59)
F2 = S 2 + I 2 + 2S • I
2
? t ?, rn,
Vss --- 1 / e \
2
-S 2 -t 2 )V 2 --
1
4ire () c 2^nU-
lm 3 \ £ J
TT(F
M p
r
(8-60)
The hyperfine energy shift in any ^=0 state is determined by computing the
expectation value of this expression. The state in question has a grand total quantum
number / associated with the angular momentum behavior of the vector F, such that
quantized values of F2 are given in terms of / by the eigenvalue property
F2 = h
2
f(f+ 1). (8-61)
We also specify the state by the familiar principal quantum number n, orbital
quantum number £ = 0, and total angular momentum quantum number j '
= s = l
,.
i +i
and yields two allowed values for the grand total quantum number:
f= + i £or z- \. (8-62)
These two / values refer to two different states of the atom with two different energies,
split by the small effect of the hyperfine interaction. This splitting is present in every
nS x/2 energy level and is of greatest interest in the \S 2 ground level of the hydrogen X
,
atom.
We calculate (Vss ) in an (= state with quantum numbers n and / by first
identifying definite values for the angular momentum factor in Equation (8-60):
F 2 -S 2 - I
2
= h
2
[F(f+ 1) - f
- i(i+ 1)]. (8-63)
The expectation value of Vss also requires an evaluation of the remaining factor
V '( 1 /r ). We examine this last detail at the end of the section and deduce a
prescription in terms of the radial portion of the wave function, evaluated at the
origin:
V -j = -\RJ0)\\
2
(8-64)
These two contributions are combined to produce the general formula for the ^=0
M
f gsSi
<Vss) = /(/+ i)- " -i'(*+ i; RJ0)\~. (8-65)
477e 3 \ 2m e
c j M p
Let us specialize to the n = 1 case and consider the hyperfine splitting of the ground
level. We consult Table 7-1 to find R 10(0), and we also express the Bohr radius in
terms of the fine structure constant a to obtain
Zam,
|* 10 (0)| = -j =4
SsSi Zamx
= ahc /(/+ 1)- - -i(i+ 1)
(V.ss)
2m,c J M
= Z a — in /(/+1) i(i+l) (8-66)
M
i
3 k
This final result is valid in the /= ground level of the one-electron atom for any
given nucleus.
The hyperfine energy shift depends on the value of the grand total quantum
number /. Two possibilities appear in the bracketed factor in Equation (8-66):
i for f= + i
We see that the state with the larger energy is the one with the larger value of /. The
difference in energy between the two states determines the hyperfine splitting
8Ey «4 -^/(2i+
M 3
1). (8-67)
2 m e
8E, (8-68)
Transitions between the lS 1/2 hyperfine levels of hydrogen have been observed in the
laboratory in the radio range of photon frequencies. The measured frequency is
812 Hyper fine Structure 435
change the orbital quantum number ( and does not have the behavior of a radiating
electric dipole. The probability for the transition is severely suppressed because of
these considerations.
The famous 21 cm line has played an important role in radio astronomy ever since
its is detectable in the radio spectrum of galactic
discovery in 1951. This radiation
hydrogen clouds, despite the extreme suppression of the transition probability. Detec-
tion is made possible by the appreciable abundance of hydrogen atoms in the galaxy
and by the inappreciable attenuation of radio waves by interstellar dust. Observations
of the intensity of these 21 cm emissions are employed to map the distributions of
hydrogen in the galaxy, and measurements of the Doppler-shifted wavelengths are
used to determine the radial motion of the emitting sources. Such techniques are in
current use in the investigation of galactic structures.
Detail
Let us outline the steps leading to the formula for the hyperfine spin-spin
interaction. We turn to Problems 2 and 3 at the end of the chapter and use the
relation ju. £ £
2
= 1 to find the magnetic field due to the nuclear magnetic
moment ji
7
:
1 '
B = -
4tte c
^v x U,xv-
/
r
\
I T 1 / 1
2
fX/V" V U,'V-
47re c
This internal B field interacts with the spin of the electron in an (= state
through the spin magnetic-moment interaction
vss = -V-s'K = -.
4ire c-
2 {v-s'V-F
2
r
(|i. s ' V) hi/* V- r
{ \
1 2
Vss= ~.
j{ TV-s' V-F
-
^v-s'v-N - iv-s'^ I
v-r v-
and remark without proof that the expectation value of the terms in square
brackets vanishes in any ( = state. The remaining portion of V
ss is quoted in
the text as Equation (8-58).
436 Spin and Magnetic Interactions
Detail
,
= *.o(0
^ nr-
V '- =
1
— —d1
r
1
d 1
= 0.
r r~ dr dr r
/ \ c 1 1 t [ \ d d \\
V - = Jl**V -VdT=
2 2
/ xP*\——r' U4wr"-'<fr
\ r / r •'r-o \ r~ dr dr r
j
.,
r
Id d \\
2
-I^0)|'(,^i) = -|*.o(0)| .
r =
Example
Kf = 3
« 2g S gpJfEo-
19
4 (2)(2)(2.79) 13.6 eV 1.60 X lO" J/eV 9
Vu, = J4
= 1.42 X 10 Hz,
J
3 (i37) 1836 6.63 X 10 J •
s
Problems 437
and so we obtain
8
3 X 10 m/s
K= 1.42 X 10
9
Hz
= 0.211 m
Problems
1. Obtain the formula for the magnetic field due to a circular current loop and for the
electric field due to a pair of opposite charges, in each case at a location on the axis a
distance z from the origin. Show that for large z the two results are of identical dipole
form.
1
i
2. The general formula for the magnetic field due to a magnetic dipole moment \i is
Mo I
-V
477
X /
\
p. x v- r
Let \i = fiz and show that at the axial point (0,0, z) this expression reproduces the large--
V X (C X v/) =c(v •
V/) - V(C- V/),
where C is a constant vector and / is a scalar function. Apply this identity to the
magnetic dipole formula in Problem 2, and reduce the form of the result.
4. A uniform solid cylinder of mass M and total charge Q rotates about its axis. Determine
the g-factors relating the magnetic moment and the angular momentum, assuming that
the charge is uniformly distributed in two ways: (a) over the cylindrical surface and (b)
through the cylindrical volume.
5. A rectangular current loop is oriented with respect to a uniform applied B field so that the
magnetic moment p, makes an angle 6 with B. Show that the loop experiences a torque
given by \i X B, and use this result to determine the work required in order to increase the
angle between \i and B by an amount dO.
6. The Paschen a line in the hydrogen spectrum is due to the transition n = 4 —* n = 3.
Sketch the normal Zeeman splitting for the 4p and 3d energy levels of the hydrogen
atom, and compute the magnitude of the splitting in a 2 T applied magnetic field.
438 Spin and Magnetic Interactions
Identify the allowed Ap —* 3d transitions and determine the shift of the Paschen a
wavelength for each case.
deflected beams upon emergence from the magnetic field. Why is it valid to assume that
the hydrogen atoms are in the ground state?
ground state. A constant field B splits the magnetic energy levels of the atoms, and an
oscillating field Bu is tuned to the frequency corresponding to transitions between these
levels. Calculate the value of the resonance frequency for a field B = 2000 G.
10. A beam of hydrogen atoms in the ground state passes through two coaxial Stern-Gerlach
filters in series. The analyzing filter makes an angle B with the polarizing filter, as shown
in Figure 8-20. Let the polarizer transmit atoms in the spin-up state with respect to the z
axis defined by this first filter. Deduce the probability that any such atom is then detected
in the spin-up state with respect to the z' axis defined by the analyzer.
11. An electron is known to have spin down along a certain direction Z. Calculate the
probability of finding the electron to have spin up along another direction Z' at a 60°
angle with Z. What is the probability of finding spin down along Z'? Repeat the
are possible for these tn-= \ states? Draw sketches to show how L and S may be added in
the two given states in order to produce the allowed values of j.
must hold for £= 1,2,3,... . Prove (using a suitable graph) that, out of the range of
possibilities
only the £ + \ and (— \ cases satisfy the inequality for every integer (> 0.
14. Choose a 3D state of the one-electron atom and carry out the explicit integration required
and 3Z)r)/2 . These states are degenerate if relativistic effects are ignored. Evaluate the
spin-orbit energy shifts for the 3P and 3D states, and calculate the spin-orbit splittings
16. How many AF states of hydrogen have the same Bohr energy £4 in the absence of fine
structure effects? Evaluate the spin-orbit energy shifts for the two possible values of j at
the AF level. Identify the degeneracy that remains among the 4F states after this splitting
17. Prove that the energy shifts owing to the spin-orbit interaction and the relativistic
ZV / 2 3
\ SL/ rel/ i
\
2n
'
^ 2> + 1 An
The formulas in the text are derived for £+ states; however, the final result is valid for
any nL state.
18. Identify all the electric dipole transitions that contribute to the Balmer a line in the
hydrogen spectrum. Determine the fine structure shifts for each of the n = 2 and n = 3
19. Use the value of the Lamb-shift frequency, quoted in Figure 8-31, to compute the
difference in energy between the 25,,,, and 2P, ,
2
levels in hydrogen. Compare this result
20. The 3D level of the one-electron atom comprises ten states whose energies are equal if
relativistic effects are ignored and if no magnetic field is applied. Let an external B field
be introduced, and consider the case in which the strength of B greatly exceeds the
internal magnetic field responsible for spin-orbit coupling. Determine the energy shift for
each magnetic substate at the 3D level, and draw an energy level diagram in which the
split levels are labeled by the quantum numbers of the states. Is the tenfold degeneracy of
21. Reconsider the atom of Problem 20 for the situation in which the applied B field is weaker
than the internal magnetic field. Determine the energy shift of the 3D states resulting
from the magnetic interaction. Draw an energy level diagram indicating the quantum
numbers of the states and the magnitude of the splittings. Is the tenfold degeneracy at the
22. Let the atom of Problem 20 have the fictitious g-factor gs = 1 (instead of g s = 2) for the
spin magnetic moment, and consider the effect of a weak applied B field. Derive an
expression for the Zeeman energy shifts at the 3D level in terms of the quantum numbers
of the 3D states. Draw an energy level diagram labeled by these quantum numbers. Is the
tenfold degeneracy at the 3D level completely removed under these circumstances?
23. Calculate the fine structure splitting at the AF energy level of the hydrogen atom.
Determine the magnetic splitting of the 4F states when a weak B field is applied to the
atom, and calculate the value of the magnetic energy shifts in a 5 G field. Draw a
diagram of the final 4F array of energy levels, identifying the quantum numbers of each
state.
NINE
COMPLEX
ATOMS
of atomic structure furnish the background for the Bohr theory of the atom, the
concept of electron spin, and the Schrddinger theory of quantum mechanics. The
primitive atom with one electron has provided our first opportunity to test the
predictions of the quantum theory. The complex atom with many electrons is our next
system to analyze for further evidence of quantum structure. We find that the analysis
is complicated by the large number of degrees of freedom and that an exact solution is
out of the question for any atom with more than one electron. must approximateWe
the many-particle dynamical problem, and so we obtain a less-quantitative body of
predictions as a result. Our main goal is to understand the complex atom in terms of a
set of quantized energy states with a suitable assignment of quantum numbers. The
We then turn to a more detailed picture of the complex atom based on further
considerations of quantum mechanics.
We concentrate on the neutral atom and use the atomic number Z to locate the species
in the periodic table. The corresponding system consists of Z electrons bound to a
nucleus of charge Ze, assumed to be at rest. We must approximate a certain essential
440
9- 1 The Central-Field Model 441
Wolfgang Pauli
aspect of this problem, and so we regard any allowance for nuclear motion as an
unwarranted correction.
It is instructive to return to the hydrogen atom for purposes of reference. The
Coulomb potential energy and the stationary-state wave functions are given by the
well-known expressions
Ej/h
V= and ^ / = ib / (r)e
lE "
Attest
We recall that the angle independence of V makes possible the separation of radial
and angular variables in the eigenfunction nfm m (r). The results for the energy
i^
eigenvalue En are found in Figure 7-2 and are transferred to Figure 9-1. Note that the
scale of energy is shifted in the new figure so that the excited levels are displayed
relative to the position of the ground state. The figure shows the familiar n /-orbital
labels for the single-electron configurations as well as the spectroscopic notation for
the states.A left superscript is added to these designations to indicate the doublet
property of the states nL- with j values equal to either t° + L or c* — ~ (The states of
, . .r
the one-electron system are labeled as ~S doublets, even though j = \ is the only '
possible choice. This usage anticipates a development yet to come.) The figure also
shows transitions allowed by the electric dipole selection rule t\( = ±1 and includes a
scale of energy in inverse nanometers along with the usual electron volts. We relate
these units with the aid of the Rydberg definition in Equation (3-59),
R,
he
442 Complex Atoms
Figure 9-1
Grotrian diagram for the hydrogen atom. The fine structure of the doublet levels is too small to
be seen on this scale. Electric dipole transitions are labeled by the emitted wavelengths in
nanometers.
.
R +e
2
S 2p 2
D 2
F
2e< 2e<
K= -
—
(9-1
477£ r, 4:ire Q r2 477£ |r, r2 |
9- I The Central-Field Model 443
Figure 9-2
in which the differential operators V{ and V 2 " refer to the two sets of coordinate
variables (r,, #,, <£,) and (r2 , #,, <£>.-,). We observe that the electron-nucleus contribu-
tions to V have the familiar central form and do not complicate the solution for »//.
is clear at the outset that the third term in Equation (9-1) prevents the separation of
the one coordinate from the other.
We are faced with a worsening situation when we consider atoms with larger values
of Z. The general Z-electron problem involves an eigenfunction ^(r,, . .
.
, rz ) depend-
ing on Z independent coordinate vectors. The eigenfunction for energy E satisfies the
differential equation
2
Ze
V=
I 1
+ —1
47re r
\ >
2
e
+ + -
477E IM r > l
z-i
\
Ze
2 z 1
z
477e
I
4™ .<.._, |r, - r ;l
(9-4)
,= 1
we must adopt a simplifying model of the atom if we are to make any progress toward
a solution of this problem.
Let us suppose for the moment that electron-electron repulsion can be regarded as
a secondary correction so that the eigenfunction ip and the eigenvalue E can be
obtained in first approximation from electron -nucleus attraction alone. This proce-
dure decouples the electrons from one another and treats each electron independently,
with its own central Coulomb potential energy for every independent coordinate
vector r,. The approach allows the application of separation of variables and produces
a solution in which the eigenfunctions for the one-electron atom are used to describe
each of the Z Of course, the approach is valid only if the noncentral
electrons.
Coulomb enough to be ignored. It is clear that the attractive potential
forces are small
energy between the nucleus and the electrons is the main effect since the binding of
the electrons is the net result. Equation (9-4) represents a competitive situation where
a given electron makes a central-attractive contribution with an enhancing factor of Z
and also makes Z — 1 noncentral-repulsive contributions to offset the binding effect.
The repulsive terms add up to an appreciable correction whose order of magnitude is
not much smaller than that of the overall nuclear attraction.
It is possible to improve the approximation and still retain the independent-particle
approach. We observe that each atomic electron is shielded from the nucleus by the
other Z— 1 electrons, and we argue that every independent electron experiences a
screened nuclear attraction attributed to an effective nuclear charge smaller than Ze. This
screening of the nucleus is expected to vary with the radial distance to the given
electron and is be representable on the average by a spherically
supposed to
symmetric potential energy function. Thus, our improved first approximation to the
eigenfunction for the atom is based on an independent-electron model in which every
electron interacts with a central field and has a potential energy V (r) with
c
the limiting
behavior
2
Ze
as r —»
4wc r
K(r)
as r -> oo
I
477£ n r
2
h
-T- V.V.-r VXrM, = E,^, (9-6)
2m f
to obtain an eigenfunction ^,(r, ) with energy £,. We let the index i run over all Z
9-1 The Central-Field Model 445
electrons and express the total potential energy of the system by the sum of terms
V=tv c
{ri ). (9-7)
product form,
^(r,,...,rz )
= ^ 1
(r,)--- * z (rz ), (9-8)
i=i
The product construction for the eigenfunction of the atom in Equation (9-8) is the
basic feature of the independent-particle model.
The Vc is central, and so the single-particle eigenfunction ^,( r ) is
potential energy ,
Tnis solution of Equation (9-6) for a single electron is called a spin orbital. The
eigenfunction for the entire atom is then constructed as a product of Z such factors in
themanner of Equation (9-8). The familiar spherical harmonics appear in \p a with
indices { and m r bearing the usual angular momentum interpretation. The assign-
ment of a unique angular momentum to each individual electron is a consequence of
the central-field property of the independent-electron model. Equation (9-7) conveys
this dynamical property by excluding any explicit coupling between the electrons. The
radial function R n/P (r) obeys a differential equation like the one for the hydrogen
atom, except that the purely Coulombic potential energy is replaced by the central-field
function V (r):
c
2
h ac*+ 1)
VR nS + K(r)+ ,
R nf =E nf R nf . (9-10)
2m r dr" Im.r
either since spin effects are not included in the potential energy of the atom. This
procedure is expected to result in single-electron functions and energies different from
the analogous solutions for the hydrogen atom because of the non-Coulombic nature
of Vc .
this problem exists in the form of a method, originally proposed by D. R. Hartree and
eventually improved by V. Fock and J. C. Slater. Their solution treats the average
behavior of each atomic electron and also incorporates the identical-particle symmetry
of the multielectron system. The whole procedure is supposed to be self-consistent,
Example
The ground state of the helium atom has energy — 79.0 eV (on an energy level
diagram where the zero level refers to a He + + ion plus two electrons, all at rest
with infinite separation). Let us make a primitive attempt to understand this
number. The crudest approximation to the He eigenfunction is found from
Equation (9-2) by retaining only the first two terms from the potential energy in
Equation (9-1). The resulting ground state is described by the product of two
hydrogen-like n = 1 eigenfunctions,
- (r, + r2 )/<z
g
rp(r ,r 2 t )
= i//
1(l0
(r, )<//„,„( r,) = ,
ita
in which the spins of the two electrons are left unspecified and the radius
parameter is given by a = a /2 for Z= 2. The ground-state energy in first
E = 2E = -2Z 2E = X
-8(13.6 eV) = - 108.8 eV.
We should not expect this result to be close to the correct answer since we have
ignored the repulsion of the two electrons. Let us estimate the repulsive
contribution by making a gross classical calculation. If we replace the electron
separation |r, — r2 |
in the neglected term of Equation (9-1) by the constant
value 2a = a , we find
477e a
e
2
= ahc —- =
am.c
a2m c
2
= 2E n = 27.2 eV.
The sum of the two numbers — 108.8 and 27.2 yields —81.6 eV as an estimate
for the ground-state energy. This computation is inadequate for a quantum
system and serves only to suggest the relative sizes of the two competing effects.
Electron spin has proved to be essential for the understanding of multiplet structure
and Zeeman splitting in the one-electron atom. Complex atoms also reveal the
influence of the spin quantum number, particularly in the periodic table of the
elements. This revelation of a special role for electron spin in many-electron atoms is
deduction of the spin degree of freedom originates in these observations, and the
exclusion principle provides the underlying deductive framework.
The exclusion principle is a statement, due to Pauli, about the assignment of
electron quantum numbers in the complex atom:
No more than one electron is allowed to occupy a given quantum state specified
by the complete set of single-particle quantum numbers (n^m f m s ).
This prohibitive rule acts as a constraint to exclude the existence of certain states in
Z= 3,11,... .
Figure 9-3
Transitions of outer- and inner-shell electrons in optical and x-ray spectra. The excitation of the
atom is by electron collision in each case.
Optical transition
Excitation Deexcitation
ition
electrons in the ground state of a given atom were not simply assigned to the
innermost shell of minimum energy.
Pauli was an ardent advocate of the notion of closed shells. He argued that these
distributions of electrons did not participate in the optical properties of the atom
because the closed shells did not contribute to the total angular momentum and
magnetic moment. His argument drew only on the Bohr model and therefore did not
address the basic question of shell closure.
Another problem with Bohr's model was its failure to predict all the lines observed
in the x-ray spectra of the atoms. This defect was remedied by E. C. Stoner, who
suggested on empirical grounds that a fully occupied shell should contain twice the
number of electrons deduced on the basis of Bohr's three quantum numbers. The
suggestion influenced Pauli to propose that the doublet structure of the alkali spectra
and the anomalous Zeeman effect could be explained if a nonclassical two-valued
degree of freedom was associated with the electron. The new variable turned out to be
the spin quantum number and the new idea proved to be the inspiration for electron
spin.
Pauli went on to observe that four quantum numbers were needed to specify a state
for each electron in the atom. He then showed that the shell structure of the atom
followed as a natural consequence if no more than one electron occupied any given
single-particle state in the many-electron system. His proposition of the exclusion
principle resolved the problem of shell closure and explained the organization of the
periodic table. The original statement of the principle was put forward in 1925.
Pauli's idea was later recast in more general quantum mechanical language and
became one of the cornerstones of quantum mechanics.
9-3 The Ground Slates of Atoms and the Periodic Table 449
because the potential energy of each electron is spherically symmetric and spin
independent. Hence, there are 2(2/+ 1) degenerate states with the same energy En( ,
corresponding to the two possible values of m s and the 2/+ 1 possible values of m t .
This definition of n agrees with the radial-node properties of the hydrogen solutions
found in Section 7-2.
Each value of n determines an electron shell for a given atom. A shell consists of n
subshells labeled by n and /, as/ ranges from to n — 1, and every «/ subshell
contains 2(2/4 1) spin-orbital states as observed above. Thus, the single-electron
energies in the central field depend on / and n according to a scheme in which the
principal quantum number retains the same meaning for all atoms. The radial
functions and probability distributions for a single electron can therefore be given the
same interpretation from one atom to another, in the manner of Figures 7-1 and 7-4.
We wish to know the ordering of the energies Enf so that we can assign electrons to
subshells of increasing energy. The determination of these single-electron energies is a
technical problem that must be solved anew for every choice of Z. A representative
sample of the energy levels for a particular Z might resemble the diagram shown in
Figure 9-4. we compare this picture with
If the analogous diagram for hydrogen in
Figures 7-2 and 8-13, we can observe certain points of similar and dissimilar behavior.
The energy E nf increases (becomes less negative) with n for fixed /, reflecting the
usual relation between the energy eigenvalue and the number of nodes of the radial
function. The energy also increases with / for n fixed in a given shell. We can
450 Complex Atoms
Figure 9-4
Energy levels and subshells for a single electron in a non-Coulombic central field. The number
of spin-orbital states is given in parentheses for each of the nf subshells.
(=0 ( = 1 e = 2 ( = 3
Ebp E, 4/
Ess d 5p
5s (6)
•43 (14)
(2) 4p (10)
E*s Eid '
4s (6) 3d
(?) (10)
Eip
(6)
3s
(2)
-.7> 2p
ib)
2s
(2)
Eu
(2)
understand this new feature by considering the following arguments. The smaller
find eigensolutions R n f{r) and Enf from Equation (9-10). We must then assemble the
state of the given atom by invoking the exclusion principle and assigning the collection
of Z electrons to the single-particle energy levels. The resulting probability distribu-
tion for the system of particles determines an electron charge density from which a
classical electrostatic potential can be computed. The potential determines a potential
energy for each electron, and the average of this result over the state of the atom
generates a predicted central potential energy Vc {r), to be compared with the
hypothesized quantity at the beginning of the cycle of calculations. Self-consistency is
achieved when satisfactory agreement is obtained between the input and output
versions of V( r ). This procedure summarizes the technique developed by Hartree.
The subsequent modification of the method by Fock and Slater incorporates the
properties of identical-particle symmetry to produce the best possible eigenfunctions
for the atom.
Figure 9-5 contains a few graphs of the single-electron energy E nf versus Z, taken
from calculations based on the self-consistent method. We can use this sort of
information to and construct an energy level diagram like
order the subshell energies
the one in Figure 9-4. The exclusion principle allows no more than 2(2/+ 1 electrons )
competing subshells at higher energy. Let us summarize the solution of the ordering
problem by listing the subshells with increasing energy as follows, noting by parenthe-
ses the subshells of nearly equal energy:
We can use this list to build up the ground-state configuration for any atom in the
periodic table.
The lowest Is subshell is occupied by as many as two electrons to account for the
elements hydrogen and helium ( Z= 1 and 2). We describe the electron configurations
of these atoms by the following notation:
H He
2
Is Is
Li Be Li Be
2 2 2
°r 2
\s 2s ls 2s 2 [He] 2s ...2s
452 Complex Atoms
Figure 9-5
One-electron energy En( versus atomic number Z for the lower subshells. The energies are
expressed on a logarithmic scale in terms of the hydrogen-atom ground-state energy. The data
are taken from tables compiled by F. Herman and S. Skillman.
100
£i,(l)
Note that the symbol [He] refers to an inner helium-like closed shell in the structure of
each atom. It should also be noted that the Be configuration constitutes a filled
subshell but not a closed shell. The 2p subshell follows with a maximum occupancy of
six electrons. We fill these states by adding one electron at a time to generate the
configurations of the next six atoms from boron to neon (Z = 5 to 10):
B C N O F Ne
2 2 2 2 :i 2 4 2 5 2 6
\He]2s 2p ...2s 2p ...2s 2p ...2s 2p ...2s 2p ...2s 2p
9-3 The Ground Stales of Atoms and the Periodic Table 453
Each of these structures contains an inner closed [He] and an inner filled 2s 2
shell
subshell. The Ne configuration at Z= 10 closes the L shell, and the eight L-shell
atoms from Z = 3 to 10 make up the second row in the periodic table.
The 3s and 2>p subshells are filled in similar fashion by two and six additional
electrons. This construction produces the ground states of the eight atoms from sodium
to argon (Z = 11 to 18), corresponding to the eight elements in the third row of the
periodic table. The configurations of these atoms are listed in Table 9-1, along with
other pertinent information for all the ground states through atomic number 54. Note
that the second and third fully occupied configurations at Z= 10 and 18 are
abbreviated in the table by the symbols [Ne] and [Ar]. We use these abbreviations
when we fill the succeeding subshells beginning at Z = 11 and 19.
The Ar atom closes the third family of subshells, even though the 3d subshell
remains unoccupied. This type of closure is akin to the closing of the K and L shells
is just like the two previous gaps at the Is and 2p levels where the K- and L-shell
closures occur. The 4s subshell is actually the next to be filled after 3p because the
screening effect of the [Ar] core tends to favor As over 3d for atoms in this region of
the periodic table. We
from Table 9-1 that the fourth family of subshells opens
see
with the beginning of the As subshell by the K atom at Z = 19 and closes with the
filling of the 4/> subshell by the Kr atom at Z = 36. The 3d subshell is occupied in
ten steps along the way, at interior values of Z in the fourth row of the periodic table.
Notice that this resolution of the (4s 3d) ambiguity undergoes a reversal at Z = 24
and again at Z= 29. In these special cases the Cr and Cu ground states prefer to
keep only one 4s electron and add two more 3d electrons instead.
The same pattern of subshell formation is reenacted in the fifth row of the periodic
table. The 5s subshell is opened by Rb at Z = 37, and the 5p subshell is closed by Xe
at Z = 54. Occupation of the 4d subshell occurs at interior values of Z, and several
reversals in the resolution of the (5^ 4d) ambiguity take place along the way.
The periodic table unifies these observations about the filling of subshells. Figure
9-6 shows how the shell structure of the ground-state atoms is organized by rows in the
table and how the various rows are subdivided according to the occupation of the
successive subshells. Closures occur at the ends of rows, at the noble-gas locations
Z= 2,10,18,36,54
These special atoms are characterized by the closing of a p subshell (except in the case
of He), followed invariably by the opening of an s subshell in a shell of higher n.
Each successive row in the table refers to states of electrons assigned to lie outside closed
subshells. These assignments of n and c* determine the outer electrons of each atom
because the corresponding spin orbitals give weight to the larger distances between
electron and nucleus. Columns of the periodic table contain different atomic species
with the same numbers of electrons outside closed subshells. These atoms are expected
to have similar physical properties since their electron configurations have similar
patterns ofquantum numbers. The alkali elements in the first column are prime
examples. The common properties of these atoms are clearly demonstrated in their
hydrogen-like optical spectra.
Table 9-1 includes a list of experimental values of the ionization energy for each
neutral atom Z. This quantity is defined as the negative of the ground-state energy on
an energy level diagram in which the zero level denotes an ionized system consisting of
454 Complex Atoms
o V
•;^f^oioo-iin-'Ocooco*coomMO*coo)a)i -iinn
s ^«i>;?:qaiooh;cqq-r« cnoqto-<(Ni«'tiy)inc>is(n(Oo*'-
, '*,
i
g i I ~H ~- ^^
oocoooo
-ft, "Vi^ "*<, "*"V| "^t, "^"**
"*"S
^* TJ^ ^7^ Tj* ^^* ^* io lo in in in io
a <S! <u - o
J3
n o a > s _a
Ih
m >i N 1 2 CU M
N M
k
eg
, C,
<m
o — CM en
en co en en
i -r
cn
m
en
CO
en en en
o -r C 1 en -r
-t 1
-f
m r-~ co a-
-r
3
i.-. IT
CM
m m
B
(eV
in en
CI
en en C
o
10 en C
IO
e
1 m
1 i io
m
-r
10
C7,
01
m or, (O
en
i
cr,
-
i -
*
en
-H IN
-h in cq
* *
r--
1^ •* r^ (O
r~.•«r CO CO
13.60
C I
m en SO -f en
2
- i
C i
m i - m CO O o r i
m
e Energy
a
o
-
^^
CM CM rCM CM CM CM
«*. <», «», -a, •<&, -c^ •<&,
en en en en en en
«*, •«, ^ "S3 "<3 "S3
en en en "« en en en
"S3 "S3
I C^i ^l ^ I ^t ^4 '"M CM CM
<o
CM CM
«5
CM
"3
CM
<5
CM
<3
CM CM CM en cm
<s
CM CM CM CM CM CM CM i
en en en en en en en ,
^T^ ^mP ^nP ^T^ ^7^ ^T^ ^r ^^
(j j: z ^
CO
53
'5
.« if-
U^^Uc^H>uSfeCJ
|.HMeM^if)(Ol^COO)0- ' CM CO -f iT) (D NCOCflO-i(NCO'tin«3N
—<—<-hcmcmcmcmcmcmcmcm
9-3 The Ground Stales ol Atoms and the Periodic Table 455
Figure 9-6
Periodic table of the elements from Z = 1 to 54. Atoms in the same column have similar
physical and chemical properties. The rows are divided into blocks of atoms whose outer
electrons occupy the indicated subshells.
Is
1
2
H II.
2s 'ip
3 4 5 6 7 8 9 10
Li Be B C N O F Ne
3s 3p
11 12 13 14 15 16 17 18
Na Mg Al Si P S CI Ar
4s 3d If,
19 20 21 22 23 24 25 26 J 7 28 29 10 31 i2 !3 !4 35 36
K Ca s, Ti V Cr Mn Fe Co Ni Cu Zn Ga Ge As Se Br Kr
bs u< <P
37 38 39 41 12 43 11 15 In 47 48 19 JO il i2 1
Kl> Sr Y Zr Nb Mo Tc Ru Pd Ag Cd In Sn SI, Te 1 V
an electron at rest farfrom a ground-state Z" ion also at rest. In practical terms, the
ionization energy represents the minimum photon energy that the atom must absorb
in order to free a bound electron. We plot the tabulated energies against the atomic
number in Figure 9-7 so that we can display the evidence of periodic shell structure.
The largest values of the ionization energy are observed for the noble gases, while the
smallest values are found for the alkalis at the opening of each succeeding shell. We
can understand this behavior if we recall how the screening effect operates. Electrons
in the same subshell tend not to be screened by each other since their spatial
distributions are equivalent. Consequently, their binding is governed by an effective
nuclear charge that grows with Z to a maximum at the noble-gas closure. The next
electron must be added with a larger n assignment, implying a larger average distance
from the nucleus. The result is an abrupt rise in screening accompanied by an abrupt
fall in nuclear attraction for the added electron, as seen in the figure. Hence, we
identify a closure to be the most tightly bound structure, and we recognize a single
electron beyond a closure to be the least tightly bound constituent.
Electrons in unfilled outer subshells are called valence electrons. These weakly
bound particles control the chemical properties of the atom and determine the
interaction of one atom with another. The noble gases have no unfilled subshells in the
ground state, and so these elements tend to be chemically inert. On the other hand,
the alkalis are chemically active because their ground-state configurations contain a
456 Complex Atoms
Figure 9-7
Ionization energy versus atomic number up to Z= 54. Extreme values of the ionization energy
occur as indicated for the noble gases and the alkali elements.
Ionization energy (eV)
He
Ne
,'(!
At
Kr
.«:,-
10
Li Ne
Rb
10 20 30 40 50
single weakly bound valence electron. These atoms are easily ionized and readily give
up their lone valence electron in the formation of molecules. The chemical characteris-
tics of the elements recur across the periodic table in a columnwise pattern of
regularities, as discovered originally in the design of Mendeleev's table. This pattern is
Example
The ionization energy is then given by E /n 2 the Bohr expression for hydrogen, ,
with principal quantum numbers n = 2 for Li, n = 3 for Na, n = 4 for K, and
so on. The resulting energies are
——
eV
13.6
= 3.40 eV, 1 .51 eV, and 0.85 eV for n = 2, 3, and 4.
//
These predictions are not close to the actual values 5.39, 5.14, and 4.34 eV listed
for Li, Na, and K in Table 9-1. It is obvious that a better model is needed to
include the screening effect.
94 XRay Spectra 457
Example
Oxygen, sulfur, and selenium occur in the same column of the periodic table at
O Se
2 4 2 4 2 I0 4
[He]2/ 2/> [Ne]3j 3/> ;Ar]4j 3rf 4/>
These elements can fill their vacancies by accepting electrons from two hydrogen
atoms to form the molecules H_,0, H.,S, and H 2 Se. The two vacant sites in O, S,
and Se have angular probability distributions characteristic of an /= 1 subshell.
We can interpret the three m f assignments for £ = 1 in terms of large distribu-
tions of probability along three perpendicular directions. Therefore, we expect to
find right-angle shapes when we attach two H atoms to O, S, and Se to make
the three triatomic molecules. This expectation is borne out since the hydrogen
bonds are at angles of 105°, 93°, and 90° in H.,0, H >S, and H ,Se. (The angle
tends to exceed 90° because of Coulomb repulsion between the two hydrogens
and approaches 90° in the largest molecule where the repulsion is weakest.)
The periodic table organizes the states of electrons outside closed subshells as the basis
for the low-energy regularities of atoms. The underlying shell theory also governs the
inner shells of the atom where the electrons engage in higher-energy processes such as
the emission of x rays. This inner-shell behavior can be examined in a given atom by
analyzing an x-ray spectrum like the one shown in Figure 9-8. We have discussed the
production of x rays in Section 2-6, and we have taken a preliminary look at
characteristic x rays as a source of inner-electron information in Section 3-7. We now
return to the study of x-ray spectra with a proper quantum theory at our disposal and
find that the independent-electron model provides an especially suitable approach.
Let us reconsider our previous picture of x-ray emission in Figure 9-3 and visualize
the excitation and deexcitation of the system from the viewpoint presented in Figure
9-9. The revised picture describes these processes in terms of the single-electron energy
levels in the central-field model. We represent the collisional excitation of the atom by
the creation of a vacancy, or hole, in one of the fully occupied inner subshells. The
Figure 9-8
X-ray lines in the K and L series of tungsten. The heights of the lines are indicative of the
observed intensities. The Ka line at 0.0209100 nm is used to define an x-rav wavelength
]
standard.
<>
K series L series
Figure 9-9
resultis the formation of a highly excited ionic state with initial energy E
v then We
represent the radiative deexcitation of the system by letting an electron from a higher
subshell fill the hole and emit an x-ray photon. The emitted photon has energy
hv = E - E2 x ,
(9-11)
where E2 is the energy of the excited final state in which the vacancy appears in the
higher subshell. Observe from the figure that the x-ray transition occurs in the ion
Let us also consider the related quantum phenomenon of x-ray absorption. This
process is an example where the absorption of an
of the familiar photoelectric effect,
x-ray photon excites the atom above and ejects a bound electron. A
its ionization level
quantum mechanical probability can be introduced to describe the photon-atom
interaction, and an absorption cross section can be defined to account for the behavior
of a beam of x rays incident on the atoms in a sample of matter. We measure
absorption in the laboratory by observing the attenuation of an x-ray beam in its
passage through a thickness of material. The fractional decrease in intensity — dl/I is
related to the element of thickness dx by the proportionality
dl
= u x dx,
I *
dl
,,
Jr
-=- r
,
ii x dx =* ln-=-/x^
I
=>
/=V" M ' X
.
I Jo I
The absorption coefficient varies with the material and depends on the wavelength of
the x rays. We can use measurements of the attenuation to determine this dependence,
and we can then infer the related behavior of the absorption cross section for the given
element.
94 X-Ray Spectra 459
Figure 9-10
K and L absorption edges of lead. The wavelength thresholds occur where the x-ray photon
energy becomes insufficient to eject a K- or L-shell electron. Emission lines of lead in the K and
L series are also shown.
Absorption
coefficient
k (nm)
K series
minimum needed to ionize the atom and leave a vacancy in the shell. When A K
becomes larger than A K the x-ray photon energy becomes too small to free a A'-shell
,
electron but remains large enough to eject an electron from an L (or higher) shell. We
again observe a steady growth in absorption as the wavelength continues to increase
until we reach one of the indicated L absorption edges. The various absorption
thresholds are tabulated along with the characteristic x-ray emission lines. Both
features provide a signature of the particular atom, and both give an indication of the
energy levels of the system. We include the emission lines of the K and L series in the
figure so that we can note the positions of these spectral lines relative to the absorption
edges.
X-ray absorption can be interpreted with the aid of our representation of the atom
as a collection of occupied subshells. Figure 9-1 1 describes the process in these terms at
two different wavelength thresholds. The wavelengths A, and A., correspond to
photons whose energies just suffice to eject an electron with no kinetic energy from the
particular subshells shown in the figure. We leave the excited ion in states of energy E x
E > E2
x
for A, < A ->•
1
Figure 9-1
The figure tells us that the wavelengths and energies obey the formulas
hi k
E ~
\ ^atom and E, - E» (9-12)
A, A,
where EaUm denotes the energy of the initial atom. If we compare Figures 9-9 and
9-11, we see that the two ionic energies are the same as the energies E x
and E., in
hv (9-13)
A, A
This formula relates the wavelengths for two absorption edges to the frequency for a
certain emission line in the spectrum of a given element. The equality implies an
inequality of the form
he ht
or A > A,
A
for an emitted x ray of wavelength A = c/v. This observation explains why the lines
in a given series have wavelengths above the corresponding absorption edge, as
indicated in Figure 9-10.
The absorption and emission of x rays reveal a new problem that requires an
amendment to the central-field model. Figure 9-10 shows the existence of three L
absorption edges instead of the two expected for the 2s and 2p subshells in the L
shell. This behavior is a clear indication that the nc° assignments of the model are not
adequate to describe the energy levels of the independent electrons. The effect is
Single-electron subshells including the effect X-ray emission associated with the transition
A' —> L lu An electron in the L
of spin-orbit coupling. .
m subshell fills
Electron a vacancy in the K shell while the hole makes
n( occupancy j Subshell a transition from A' to L m The standard .
Ka t
line of tungsten corresponds to this
transition.
5
M\V
3rf<T 2
3
3
2
Z
Mlv !° n
3p^ '
' '
'
2 i
Mm
—
: ; ~ i 2 m„
3s 2 M,
2p< ^
^
— «—
I
1
*m
L\\
^^l!^
7
Hole transition
2s ~ 1
" i!
l.s
spin-orbit coupling experienced by each electron in addition to its interaction with the
central field. We discuss some of the details of this coupling at the end of the section.
Figure 9-12 shows how these considerations alter the independent-electron picture
of the atom by the introduction of the quantum number j. We let each energy level
with subshell quantum numbers n and ( be split for /# in the manner of Figure
8-28, and we recall the familiar quantum numbers (nfjm^ to specify the new
single-electron states. We also label the energy levels by the x-ray spectroscopic notation
K L LnLm
x
M Mu M m M M
x lv x
according to the tabulation given in the figure. The exclusion principle is built into
thisnew scheme of states by stipulating that no two electrons are allowed to have the
same four quantum numbers (nt*jm Hence, each of the split nc"j subshells has a
).
states for the given value of j. (We note in passing that the figure orders the energies
of the inner subshells in ascending fashion as
Is 2s 2p 3s2p3d ....
462 Complex Atoms
This ordering of values of Enf holds for the inner electrons of a large-Z atom.) We
note especially that the occurrence of the three split L subshells in Figure 9-12 is
directly correlated with the existence of the three L absorption edges in Figure 9-10.
Let us now return to our picture of x-ray emission in Figure 9-9 and revise the
diagrams to account for the splitting of the subshells. We illustrate the result in Figure
9-13 in terms of a particular x-ray transition. The indicated process shows the filling
of an inner-shell hole by an electron from a higher subshell, along with the corre-
sponding transition of the hole in the opposite direction. Since the initial and final
states of the ion are associated with the location of the hole, it is conventional to
represent the process as a hole transition. The convention employs an energy level
diagram which the highest energy state refers to a hole in the K shell, and in which
in
the zero level refers to the ionic ground state where the hole occurs just beyond all the
electron subshells. This presentation of the energy states of the ion inverts our view of
the levels found in Figure 9-12 and produces an x-ray level shown
diagram of the kind
in Figure 9-14. The figure lists the values of the hole quantum numbers (n£j) and
indicates the allowed hole transitions. These radiative processes are governed by the
electric dipole selection rules, so that the single-particle quantum numbers are
Figure 9 14
X-ray levels of lead and transitions allowed by the electric dipole selection rules. The energies
are plotted on a logarithmic scale.
Energy (eV)
Pb (Z= 82)
10 5 -
K series
n= 1 f =
i=2
.
L series ( = >=\ 1
1
hi n = 2 3 2
^III 2
Hr
...
'
'
'
'
!-
94 XRay Spectra 463
as Equations (8-42) and (8-43). The various transitions are organized in the
in
diagram into the different series of emission lines. We can employ this same general
scheme whenever we wish to analyze the x-ray spectrum of any element.
Detail
S • L \ dV
VSL
2m e
c r dr
/here
2
Ze 1 dV Ze 2
V = and =
3
47re r r dr 477e n r
in which
1 1 dV
Lm f
c r dr
J -L -
2 2
s-
<ySL ) = uo
2
h
j(j+ 1)-/{S+ 1)- - <U0>
2 4
2
h for j"
= t+ \
= y<€c(0>
-{- 1 (orj = + /- \.
This energy shift affects the energy level of an electron in an r\{ subshell
according to the value of j and causes a splitting in every subshell with /# 0.
464 Complex Atoms
Example
The K and L m absorption edges of lead in Figure 9-10 are observed at 0.01408
and 0.09511 nm. Let us consult Equation (9-13) and use these numbers to
X(KL m )
1111
compute the wavelength emitted in the K —» L lu transition:
\K \ Lm 0.01408 nm 0.09511
1
nm 0.01653
1
nm
'
This emission line can also be predicted from the energies of the K and L m
x-ray levels of lead. We use the tabulated values EK = 88.00 keV and EL =
13.03 keV, and we find
he 1.240 keV-nm
A( KL m = 1,1 )
= = 0.01654 nm.
'
E K -E. 74.97 keV
Both calculations agree with the wavelength listed in the tables for the KL m line
of lead.The line and the transition are included among the information given in
Figures 9-10 and 9-14.
We have been able to implement the exclusion principle and explain the periodic
table with only minimal use of quantum mechanics. The exclusion principle itself has
been expressed in a limited context, based entirely on the assignment of electron
quantum numbers in the central-field model. We now want to examine the fundamen-
tal quantum nature of Pauli's principle from the viewpoint of electron indistinguishabihty
*(r ly S l
„r2 ,S2 „t) = *(l,2,t).
We then determine the probability of finding the two particles in two volume elements
c/t, and dr.) at time t by evaluating the expression
2
|^(l,2, t)\ dTydT2 .
The electrons in the system have identical physical attributes of charge —e, mass m e,
Figure 9-15
O^
tides has been discussed previously in Section 5-10. We are reminded in Figure 9-15
that any pair of classical particles can always be analyzed in terms of distinguishable
particle orbits, but we recognize that such a treatment violates the uncertainty
principle in the case of identical quantum particles. Since the two particles in the
figure are identical, their indistinguishability allows us to prepare a state of separately
identified colliding particles but prevents us from ascertaining the identity of the
detected particles after the collision. We build this property of electrons into the
probability density by imposing the requirement of identical-particle symmetry:
This statement follows Equation (5-90) and says that 1*1" is not altered by tin-
exchange of the two sets of space and spin variables in the wave function.
The exchange symmetry of |¥|" can be realized in two independent ways. These
alternatives correspond to mutually exclusive symmetry properties of the wave func-
tion itself. The two-particle state may satisfy either the exchange-symmetric condition
A system containing more than two identical particles obeys a similar relation with
regard to the exchange of any two sets of degrees of freedom in the many-variable
wave function. We emphasize that the symmetry or antisymmetry of ¥ is an
additional quantum property to be applied to solutions of the Schrodinger equation
and that the choice of sign + for symmetry or — for antisymmetry) is not meant to
(
Pauli's original exclusion principle follows directly from this statement by arguments
to be demonstrated below. The Pauli principle more comprehensive because its is
implementation is not tied to any specific treatment of the electron system, and
especially because its implications are immediately applicable to any particles certified
as fermions. Thus, the shell structure of atoms is only one indication of the influence of
this new quantum property.
Let us see how the exclusion principle is contained in the statement of antisymme-
try by looking first at the case of the two-electron atom. We consider a stationary-state
wave function of the form
fi// *
¥(1,2,0 =^(l,2)«-'
and note that the Pauli principle requires the eigenfunction ^(1,2) to be antisymmet-
ric under the exchange of variables 1 «-» 2. We determine \p from Equation (9-2) by
applying the central-field model to the pair of independent electrons. We know from
Section 9-1 that solutions for \p exist as products of two single-particle eigenfunctions,
each having the spin-orbital form
(Recall that we are using a single Greek index to represent a complete set of four
spin-orbital quantum numbers for a single particle.) Thus, if a and /? denote any two
spin-orbital states, it is clear that
are degenerate solutions of Equation (9-2) since both product expressions have the
same energy eigenvalue Ea + En. Any linear combination of the two expressions is
also a solution with the same energy, and so the requirement of antisymmetry can be
met by constructing the particular combination
This two-electron eigenfunction has the desired antisymmetric behavior under the
exchange of variables 1 <-> 2:
(We include the factor l/v2 to secure the normalization of \p. This point
9-5 Electron Antisymmetry 467
antisymmetry of Equation (9-17) that \p vanishes whenever a and /? refer to the same
quantum numbers. We
set of single-particle conclude that no such two-electron state
and we thus affirm the outcome of the exclusion principle.
exists,
These arguments are readily extended beyond the case of the Z = 2 atom. We
observe that Equation (9-17) has the structure of a determinant, and we write
*(1,2) =
*«(2) *,(2)
1
* o 0) M M1
)
1
)
MO
*«(2) *„(2) ^(2)
^(1,2, Z) = (9-19)
lz\
Mz )
for the Z-electron atom. These expressions for the eigenfunctions are known as Slater
determinants. The general algebraic properties of determinants imply that the expres-
sions change sign when any two variables are exchanged and vanish when any two
spin-orbital indices denote the same state. Consequently, Slater determinants auto-
matically satisfy the Pauli principle and furnish the basic solutions for use in the
Hartree-Fock self-consistent procedure.
We have discovered the joint concepts of spin and fermion antisymmetry as new
quantum mechanical properties associated with the behavior of electrons. It is
Example
-
-fdrJdr^lUDfl V2)T' " ^(1)^(1)^(2)^(2)
2
-^(1)^(1)^(2)^(2) +|^(1)| |^(2)
= \ {
1 -0-0+ 1}.
/ I
^«( 1 ) |
dt\ = 1 and similarly for \pp
/^(1)^(1)«/T, =0 if a *fi.
Example
and »|/. A non-antisymmetrized eigenfunction for the system has the form
f'(l,2) =^(1)^(2)
2 2
P4(l,2)=\f(l)\ \$(2)\ .
2 2
P(l,2) = ±{|if(l)| |#(2)| " lf*(l)*(l)**(2)if(2)
2 2
+l^(i)l u"(2)i -r(i)j(i)r(2)t(2)\,
96 The Helium Atom 469
as in the previous example. The probability of finding e (an electron in state i|>
2 2
P=-(
2 At
J\t
\f{l)\ drJ\$(2)\ dT2 - - f «j>(l)£(l)rfT,/ ^*(2)if(2)rfr2
J I At •'At
+
1
2
f \${2)\ drJ\t{\)\
dr
2
x
--( r(2)H2)dr.2 f ^(D^(l
At J 2 At At
At At At
2
= f |ftl)| rfT, " [ f*(l)^(l) dT,
At At
We have used the fact that »// is a normalized single-particle eigenfunction to get
the first line of this result. Note that the orthogonality of \p and ip does not cause
the integral of the product of these functions to vanish unless At refers to all
2
P*= f |^(l)|-^T,f|^(2)|^T, = ( |lf(l)| rfT,.
At J At
At
for the states \\/ and Such a circumstance arises when e and e are
\p is negligible.
an appreciable distance apart. The overlap
electrons in separate fields of force,
is small in this case because the eigenfunctions \p and \p do not have large
The Grotrian diagram for the He atom is shown in Figure 9-16. The striking feature
of these energy levels and transitions is the apparent existence of two distinct He
varieties, denoted in the figure as parahelium and orthohelium. The two species
present themselves in separate families of spectral lines since the transitions occur
within, but not between, the para and ortho systems of energy levels. This behavior
indicates the dynamical influence of a certain quantum number that distinguishes the
two forms of helium. One of our main objectives is to show how this new quantum
number arises from considerations of the Pauli principle.
470 Complex Atoms
Figure 9-16
Energy levels and electric dipole transitions of helium. Emitted wavelengths are given to the
nearest nanometer, as in Figure 9-1 All states below the ionization level have Is n rf configurations
.
denoted by the indicated n£ assignments. The levels are organized by the notation at the top of
the diagram according to total orbital and total spin quantum numbers.
He +e
ortho-He
We know from Section 9-3 that the He ground state is a closed-shell system of two
Is electrons. The higher-energy states must therefore involve configurations in which
at leastone of the electron orbitals is beyond the n = 1 shell. In fact, only one particle
can be excited beyond n = 1 since the energy required to excite both particles exceeds
the 24.6 eV ionization threshold for the ionized system He + (l.r) + e. We can therefore
classify all the discrete energy states of the bound He atom as singly excited \s n£
configurations. Accordingly, the levels in Figure 9-16 are labeled by the n^orbital
designation of the excited electron, and the energy values are given by the sum of
L, + L,. (9-20)
9-6 The Helium Atom 471
A total orbital quantum number l° can then be associated with L by the usual quantization
rules. Since the one electron is never excited, it is clear that the value of ( in the He
atom is always equal to the £ value of the lone excited electron. We indicate this
assignment of the total orbital quantum number by the familiar spectroscopic label
S, P, D,. . . at the top of each column of energy levels shown in the figure.
We get to the heart of the He problem when we consider the imposition of the
Pauli principle on the two-electron system. Let us start with the ground state and
2
rewrite Equation (9-17) in the form of a Slater determinant for the Is configuration:
*u>('i)loo(0i»*i)T, *io('i)Ioo(0i»*iHi
*(1,2)
f2 Rjr 2
)Y00 (02 ,4> 2 )T 2
R [O (r2 )Ym (02 ,<t> 2 )l 2
The expanded result exhibits a separation of the space and spin dependence into a
spatial eigenf unction multiplied by a spin eigenf unction:
/ 1
*(1,2) = {(/? 10 (r 1
)K00 (« ,* 1 1 ))(/? IO (r2 )yoo (fi2 ,* 2 ))}|^-(t 1 i 2 - i , t 2 ]
(9-21)
fashion. The first bracketed quantity is obviously symmetric under the exchange of the
two spatial coordinates r, *-* r2 , while the second is antisymmetric under the exchange
of the two spin orientations 1 *-» 2.
U s
(r ,r 2
1 )
"
!
X (l,2)
<Ml,2) = I or (9-22)
l^(r lf i
r 2 ) X *(l,2).
and
1
r
^(r„r2 ) = ^[*„(r,)* 4 (r 2 ) + ^ A
(r, )f (r 2 )]
;
(9-24)
1
4
X' (l,2)= -^(t,4 2
- 1,T 2 ) (9-25)
472 Complex Atoms
and
X f(l,2) = T,t 2 ,
xg(l,2) = -^(T,l 2
+ 1,T 2 ),
v2
X-i(l,2)= i,i 2
. (9-26)
v
f (r,,r 2 = )
^(r,)^(r 2 )
*(1,2) - ^[(VoJ.fVJ. T+
~7j |A'*!0 / 00M A 20 / 00</2 f^A,)^,,^),]
^'Jo'lNl/l^ill'oo^l \ ^(Til2- Ma)}
~fK
(9-27)
4 /
( 1 >2) = {
-^[(#i *oo)i( #20*00)2 ~ ( #20*00 )i( #10*00)2]
T,T,
\ -7^(t,i 2
+ Ma) (9-28)
1 1 i 2
for the triplet states. (We adopt the abbreviating subscripts 1 and 2 to denote the
spatial variables r, and r 2 .) The total orbital quantum number has the value €= for
each of these eigenfunctions, and so the resulting singlet and triplet states appear in
9-fi TAe Jfe/wm /l/om 473
3
Figure 9-16 with the assignments S and
l
S at the top of the energy level diagram.
Singlets and triplets are also indicated in the figure for other configurations and other
values of tf.
The singlet and triplet states are distinguished by their exchange properties under
the abstract concept of fermion antisymmetry. Let us attach a more concrete meaning
to the two kinds of spin eigenfunctions by recalling our interpretation of electron spin
as an angular momentum vector. The two-electron system has spins S, and S.,, and so
a total spin angular momentum can be defined by the vector sum
S = S, + &,. (9-29)
l
The individual spin vectors have the eigenvalue properties of separate spin- ,
par-
ticles, as discussed in Section 8-5:
3
2
s, = h\(s + i) = -h' S„ = /zw = + —
y
4
* - 2
and
3
sj = h%(s.2 + 1) = -ti1
4
^= /?w
'-'
= H
" 2
The total spin S is supposed to obey the same general quantization rules in terms of
the quantities S" and Sz Hence,
. there must exist a nonnegative numerical quantity s,
called the total spin quantum number, such that quantized values are given for the square
of S by the eigenvalues
2
S2 = h s(s +1). (9-30)
Each possible s then implies a set of 2.$ + 1 quantized values for the z component of
S, given by the eigenvalues
Sz = hm 3
with m = s
-s,..., s. (9-31)
These equations describe the familiar behavior of an angular momentum in which the
S vectors have definite magnitudes determined by s and discrete 2-axis projections
determined by m s
.
m = m + m .
It follows that m s
must be integer- valued, since m and m s are half-integral, and that s
5 = and s = 1
since any larger s would permit w to exceed unity, the maximum allowed (
for the sum
of m s
and m s The respective z-component quantum numbers are
.
m = s
for s = and w = —
(
1 , 0, 1 for s = 1
These four assignments of quantum numbers (smj refer precisely to the four spin
474 Complex Atoms
Figure 9-17
Vector addition of two spin-^ angular momenta. The sum S = S, + S_> is constructed, without
factors of h, for the case m %
= m %}
= h. The resulting spin state has s = 1 and m = 1
1, as
described by the symmetric spin eigenfunction x'\-
s = 1
eigenfunctions identified in Equations (9-25) and (9-26). The total spin quantum
number and the spin-exchange property are correlated such that
= A
s denotes the antisymmetric singlet \
and
S
Note that the x' states in Equations (9-26) are already labeled by their m s
values in
anticipation of this development.
Figure 9-17 shows how the addition of two quantized spin-^ vectors S, and S 2
produces a quantized result for S. We illustrate the selected spin state xf by the sum
of two spin-up vectors, and we argue that x^ is represented by an inverted version of
i
the same picture. Any combination of spin-up and spin-down states can be taken to
A
form an m =
s
state. The particular construction x corresponds to a total S vector
of vanishing magnitude, while the Xo state describes a combination of spin up with
spin down in which the total S vector has magnitude v2 h.
The new quantum number s has dynamical as well as notational significance. We
recognize the total spin S to be a conserved angular momentum for two electrons
subject to the Coulomb interactions described in Equation (9-1). We also make a
similar claim regarding the total L vector defined in Equation (9-20). These state-
ments mean that we can regard the associated total quantum numbers (sm s ) and
{£m f ) as good quantum numbers, provided we ignore spin-orbit coupling. This
interaction is of some interest since it causes a small splitting of the helium levels with
nonzero f in Figure 9-16. If we take account of the effect we find that L, and S, are
no longer conserved, as in the one-electron problem, so that m f and m do not remain s
J = L + S,
and we use the corresponding quantum number j along with / and s to label the split
energy levels. The complete spectroscopic designation for a He level has the form
2j+1
L;.
This notation includes a value of n for the excited electron and assigns a superscript
9-6 The Helium Atom 475
1 % with n = 1, s = 0, <f = 0, j = 0,
and we denote the \s2s excited states in Equations (9-27) and (9-28) by
2 \ with n = 2, s = 0, /= 0, j =
and
2 5,
3
with n = 2, j = 1, (= 0, j'= 1.
2i+1
Only the partial notation L is employed to classify the helium levels in Figure
9-16. Notice that the same scheme has already been adopted for the hydrogen levels in
Figure 9-1 by setting s = \ and 2s + 1 = 2 for every state in the one-electron system.
We observe from Figure 9-16 that the singlet and triplet helium levels separate into
the two varieties parahelium and orthohelium. We also note that the triplet always lies
below the singlet in a given electron configuration, except for the special case of the Is
configuration where the triplet possibility does not exist. To choose a typical example,
i
consider again the ls2s system and note that the level 2 S i
lies lower than its partner
2
X
S . We know that the solutions of the central-field model describe configurations of
degenerate spin-orbital states. We also know that the central field takes in only a part of
the noncentral interaction between electrons. The obvious conclusion is that the
separation of singlet and triplet He energies must be due to some aspect of the
interaction between the electrons, beyond the effects included in the central-field
potential energy.
The repulsive electron-electron interaction favors the triplet over the singlet as a
consequence of the Pauli principle. The two electrons in a triplet state must have a
spatial eigenfunction of antisymmetric form, as given by Equation (9-23). Such a state
vanishes when r, = r 2 , so that there is small probability for the particles to be found
near each other. Consequently, the two electrons tend to remain at a distance where
the repulsive interaction has a reduced effect. On the other hand, a symmetric spatial
eigenfunction governs the singlet state and permits the particles to be found closer
together where they undergo a greater Coulomb repulsion. Thus, electron-electron
repulsion is more effective for the singlet state in a given configuration, and so the
triplet states experience more net attraction and have lower-lying energy levels. These
arguments imply that the choice of spin state influences the spatial distribution of the
particles and simulates a sort of interaction.The effect is called an exchange force keeping
triplet-state particles apart and bringing singlet-state particles together. The coupling
of space and spin variables operates through Equation (9-22) by virtue of the Pauli
principle and generates this important nonclassical contribution to the behavior of
indistinguishable particles.
Figure 9-16 also shows that no radiative transitions occur between para and ortho
states. We understand this situation at once when we recognize that the observed
spectral lines are due solely to electric dipole transitions. These processes are associated
with the oscillations of a spatial vector, the electric dipole moment of the atom, and do
not involve the spin vectors of the electrons. The total spin quantum number s cannot
change in an electric dipole transition, and so the singlet and triplet energy levels can
only participate in transitions among themselves. The electric dipole selection rules
476 Complex Atoms
apply to the orbital quantum number of the excited electron and, therefore, to the
total orbital quantum number t We summarize the constraints on the total spin and
'.
and by noting that all the transitions in the figure obey these conditions.
Example
We have identified all the bound helium levels above the ground state with
singly excited two-electron configurations. Let us examine this claim by describ-
ing the atom in first approximation without the electron-electron interaction.
The energy of a state with independent-particle quantum numbers («, n2 ) is
Z2 Z 2
—E — —+—
/ 1 1 \
E(n ,n 2 )x
= E - = - (54.4 eV).
"l n2 \
n \
n 2 )
n, n 2 1 1 1 2 1 3 1 00 22
/-."(
n,, « 2 ) ' n e^ - 108.8 -68.0 -60.4 -54.4 -27.2
E(n v n2) - £(l,l)ineV 40.8 48.4 54.4 81.6
Note that the (1 oo) level refers to the ionized system He + (1^) + e, and observe
in the third line of the table that the doubly excited energies above £(1, oo). lie
Of course, the numbers in the line cannot accurately approximate the actual
values plotted in Figure 9-16. The point of this simple calculation is that the
neglected electron -electron repulsion contributes a positive energy shift that
elevates all values of E to higher levels. The energy of the (1 oo) level is scarcely
affected by the shift since the electrons stay far apart in this state. Appreciable
upward shifts occur for levels with small n values, however, particularly in the
(11) ground state and even in the (2 2) doubly excited system. We conclude that
all doubly excited states remain higher than the ionization level after the
repulsive shift is taken into account. These systems are known as autoionizing
+
states since they transform spontaneously into the ionized atom He + e.
The optical spectrum of an atom spans the visible range of wavelengths and reflects
the activity of the outer-shell electrons. We have illustrated the excitation and
deexcitation of these optically active electrons in Figure 9-3. This picture of optical
phenomena is clearly realized by the alkali atoms corresponding to the elements
9-7 Alkali Atoms 477
directly below hydrogen in the periodic table. Hydrogen-like qualities appear in the
spectra of the alkali atoms and give evidence that the transitions are attributable to a
single valence electron. The one electron occupies an orbital state outside a core of
closed subshells and thus provides a simple structure to study as a testing ground for
the quantum number n opens a new shell after all available orbitals of lesser energy
are filled by the core electrons. We consult Table 9-1 and find that the ground states
2 2 2 6 2 2 b 2 6
Is , ls 2s 2p , \s 2s 2p 3s 3p , ...
Each core electron has its own individual orbital and spin angular momenta L and (
S Both quantities add up to zero when the sums are taken over all electrons in the
;
.
core. To understand this point we examine the quantum numbers of the core electrons
and recognize that the individual z-component quantum numbers have vanishing
sums
V.
= and Y m, = 0, (9-33)
since every available assignment of the quantum numbers is taken to assemble the
closed subshells. It follows that the overall core angular momenta
must have vanishing magnitudes, otherwise core states with nonzero ^-component
quantum numbers would also exist, in conflict with Equations (9-33).
We summarize this situation by describing a core of closed subshells as a S system l
with total orbital and total spin quantum numbers equal to zero. The result implies
that the single valence electron sees the core as a spherically symmetric distribution of
charge. We should expect the central-field potential energy V (r) to be especially good c
valence electron has large (r) and look for good agreement between the alkali and
hydrogen energies. We meet this criterion in states of maximum / for given n. The
478 Complex Atoms
Figure 9-18
Grotrian diagram for the lithium atom showing wavelengths to the nearest nanometer.
Hydrogen levels for n > 2 are taken from Figure 9-1 for comparison on the right side of the
diagram.
Li + e
2g 2p 2
D 2p
H +
n = 4
n = 3
n= 2
figures tell us that these particular alkali states are in fact quite close in energy to their
hydrogen counterparts. These observations of the lithium and sodium levels are in
keeping with our general comments in Section 9-3 about the n dependence and /
dependence of the energy eigenvalues for a non-Coulombic potential energy.
The angular momentum of an alkali atom is easy to analyze in view of the simple
properties associated with the closed-subshell core. We define the total orbital and
total spin angular momenta of the atom by the expressions
L = E L, + L r
(9-34)
core
(9-35)
where Y. e and Se refer to the valence electron. The core makes no contribution to L
and S, and so the whole atom assumes the angular momentum properties of the one
electron outside the core. Thus, each atomic state has total spin quantum number s = ^
and total orbital quantum number / given by the particular n ^orbital state of the
valence electron. The resulting spectroscopic notation for the alkali levels has the same
9-7 Alkali Atoms 479
Figure 9-19
Grotrian diagram for the sodium atom showing spectral-line splittings of at least 1 nm. Levels
of the hydrogen atom for n > 3 are included as in the previous figure.
2
S 2p 2
D 2
F H +
Na + e e
5- -0.004
n = 4
4rf / 4/
4
-0.003
4p
s SI n= 3
3d
r& /
3- 4s /
**
*
/
/ i
-0 002 *
/
vo
f
2- 3p
/ /
-0001
1-
1/
/
-l
eV nm /
3s
doublet form as in the case of hydrogen. Therefore, the same 2 L designations appear at
the top of the energy level diagrams in Figures 9-18 and 9-19 as in Figure 9-1.
Every alkali doublet level with nonzero ( has a fine structure like that observed in
hydrogen. (The effect is not large enough to be seen on the scale of the two figures for
lithium and sodium.) This splitting of the energy levels is due to the spin-orbit
interaction of the valence electron. Spin-orbit coupling has a small effect on the levels
ofhydrogen and becomes more appreciable in complex atoms with increasing atomic
number. The interaction has already been applied to the behavior of an inner-shell
electron in our discussion of x-ray spectra. We reconsider its influence in the case of
alkali spectra and again restrict the interaction to a single particle, the valence
electron.
We express the spin-orbit interaction for a single particle in the central-field model
by the formula
1 d_K
VSL = S-U (r) c
where |,(r) =
o
lm,c
2 i
dr
(9-36)
These equations are carried over from our treatment of x-ray levels at the end of
Section 9-4. We
then introduce J, the total angular momentum of the atom, as the
sum of the quantities L and S defined in Equations (9-34) and (9-35). Again the core
makes no contribution, so that the quantization properties of J are the same as those
480 Complex Atoms
Figure 9-20
--
2.11
i
0.0021 eV
J
C
2.10 , o
//
eV
//
',s
1 2
of the single valence electron. As usual, the eigenvalues of J are equal to h j{j + 1),
where the total angular momentum quantum number j takes on either of the two
possible values {— ~ for every nonzero <f. The spin-orbit interaction in
{+ l
, or
Equation (9-36) causes the energy of a given n^j-orbital level to shift by an amount
that depends on the values of the three quantum numbers. The formula for the shift is
the same as that found in our detailed remarks at the end of Section 9-4:
h' forj = /+ |
This result implies a fine structure splitting in the form of a doublet for every ( ¥=
level in Figures 9-18 and 9-19. The final outcome is a splitting of spectral lines that
bears a certain resemblance to the line splitting in the spectrum of hydrogen.
Example
figure above the energy of the unsplit 'S ground state at the 3^ level. The i
spectroscopic notation
2
i( L.
summarizes all the quantum numbers (ntsj) needed to identify any of these
states. The figure also quotes the wavelengths of the radiation emitted in the two
transitions
2 2
3p P3/2 3.r o 1/2 and 3p P ' -
3s S 1/2"
9-8 Angular Momentum Coupling 481
These radiative processes obey the electric dipole selection rules appropriate to
the transitions of a single electron:
The central-field model describes the complex atom in leading approximation, and the
Pauli principle defines the electron configurations of the atom within that context.
Two additional dynamical effects also influence the structure of the atom. One of
these corrections arises from the fact that the central field includes only part of the
noncentral Coulomb repulsion between the electrons. The spin-orbit coupling of the
electrons provides the other correction. These secondary considerations have already
been encountered in our discussions of certain atoms.
We wish to organize the corrections to the central field on a systematic basis and
apply the resulting scheme to all electrons outside filled subshells. We continue to be
concerned with optical excitations, and we recognize that every outer electron
participates equally in the optical activity of the atom. The configurations of the atom
are known in terms of the orbital and spin angular momenta of each electron. Our
intention is to sort out the secondary corrections in stages with the aid of conservation
laws among these quantities.
Let us examine the construction of the central field more closely so that we can
specify the first correction to the model. The Coulomb potential energy for the
Z-electron system consists of central and noncentral contributions, as written in
Equation (9-4):
Ze
2 z 1 e
1 z 1
477£; n ,= ,
1
r,
',
4-7TE
,•</=!
We know that the noncentral repulsion between electrons is not small enough tobe
treated as a secondary effect. We proceed instead to extract a central approximation from
this part of the potential energy by rewriting V in the following manner:
V=LK('i)+ K„ (9-37;
where
LK(r,)= -T—E- +
477£ r 477£ o |r,
(9-38)
\ ,< ;
and
V.. = (9-39)
4-ne ,< j,Ti-*j\ 4 ^«>
\ ,<j |r,- - Tj\
482 Complex Atoms
Figure 9-21
Note that a new quantity is added and subtracted to make this construction. We
define the new expression by an averaging procedure in which an ith particle is
singled out, as in Figure 9-21, and an average is taken over all other particle
coordinates and over all directions of the vector r ;
. The result for the ith particle is the
desired spherically symmetric potential energy V c
with only one remaining variable,
the radial distance r
r Equation (9-39) then defines the residual electrostatic interaction Vrr
as the leftover part of the repulsion between electrons, after the removal of the central
approximation. Our treatment of the model in Equations
independent-electron
(9-6)-(9-10) is based on the leading central contribution in Equation (9-37). We have
given the main role to this part of V, and so we regard the residual term Vee as a small
perturbing effect.
The spin-orbit couplings of the electron account for another secondary interaction.
We employ the central-field function £/r) in Equations (9-36) and express the sum of
these couplings by the formula
The two Vee and Vsl perturb the central field with different relative
corrections
strengths depending on the position of theatom in the periodic table. We have learned
that the spin-orbit effect is quite weak for small Z and grows much stronger with
increasing Z. We therefore identify two broad regimes of atomic number, and we
argue that Vei dominates over VSL for low-Z atoms, while VSL dominates over Vf( for
high-Z atoms. Different conserved angular momenta are associated with the two
effects, and so different procedures are needed to analyze the corrections in the two
regimes.
We devote most of our attention to atoms at low Z and apply the so-called
Russell -Saunders coupling scheme. (The method is named after the astronomers H.
N. Russell and F. A. Saunders for their pioneering interpretations of complex atomic
spectra.) We temporarily ignore the spin-orbit interaction VSL , and we use the
Coulomb potential energy in Equation (9-37) to find the states of the atom, in first and
second approximation. Figure 9-21 tells us that the noncentral Coulomb interactions
of the ith electron cause the orbital angular momentum L, to be a nonconserved
quantity. The forces produce torques, but the torques sum to zero over the whole
9 8 Angular Momentum Coupling 483
atom, so that a conservation law holds for the total orbital angular momentum
L=EL,. (9-41)
The interactions in Equation (9-37) do not affect the spins of the particles, and so
another conservation law also holds for the total spin
IS,. (9-42)
The separate conservation of L and S implies that the states of the atom can be
specified by the associated total orbital and spin quantum numbers (fm^-) and (sm We s
).
define these parameters in terms of the quantities L2 , L,, S2 , and S: by the following
eigenvalue assignments:
S. — hm
2 2
S -» h s(s + 1) and >
s
.
The total j-component quantum numbers are given as sums of the individual
^-component quantum numbers,
in keeping with Equations (9-41) and (9-42). The allowed values of the quantum
numbers <? and by the rules of vector addition and angular
s are determined
momentum quantization. This method of combining angular momenta produces a set
of good quantum numbers (c"m^sm ) for each atomic state. Since the procedure is s
based on the construction of L and S, the more suggestive name for the method is the
LS coupling scheme.
We interpret LS coupling according to the roles played by the two parts of
Equation (9-37). The central contribution Y.,V-(r ) gives the familiar electron con- :
figurations in which the good quantum numbers refer to the individual angular
momenta L, and S,. The noncentral interaction Vee violates the conservation of L, and
S, but maintains the conservation of L and S. This violation affects the configurations
in the following way. A typical configuration may contain many assignments of the
c
individual quantum numbers (n c m^m The corresponding states can be associated t l i
).
superposed in linear combinations to form states with definite values of c" and s.
Electrons in such a configuration experience the noncentral and assume Vet effect
different energies for different orientations of their angular momenta. Thus, the
correction Vtt produces a splitting of degenerate states in the given configuration so
that separate energy levels emerge for different pairs of total quantum numbers ( and
s. We have seen a simple example of this splitting in the case of the \s2s configuration
of the He atom. Equations (9-27) and (9-28) represent product combinations of
{n x
^ x
m^m 5 ) and {n 2 t"2 m f m s ), with ^=0, in which the m =
i
eigenfunctions
separate into an (c*= 0, 5 = 0) state and an (/= 0, s = 1) state. The energies of the
states are different because of the Vee effect. We have given these He energy levels the
3
names 2s
X
S and 2 s S in Figure 9-16.
.
J = L + S.
h j( j + l)fory~' and hm )
for J,
2s + X
L
and is called a term. The (2/+ \){2s + 1) degenerate states of a given term are
designated by the final spectroscopic notation
2s+ \r
when the states are sorted according to the quantum number j. We have already-
3
state and the S and S
X
employed this notation to describe the
X
S ground t
excited
states of helium. We emphasize that it is meaningful to assign and spin
total orbital
(2j + 1 )-fold degeneracy with respect to this quantum number for each assignment
of j.
Figure 9-22 summarizes the LS coupling treatment of the two corrections to the
central field. We begin with the central potential energy £,F(r ;
) and specify con-
figurations by sets of independent-electron orbital quantum numbers («,^,). We then
9-8 Angular Momentum Coupling 485
Figure 9-22
2s +
Development of atomic energy levels 'L
;
in the LS coupling scheme. The corrections to the
central field break the degeneracy of a given electron configuration in two stages.
CM)
Multiplet
(s
2s+1 L,.
2sTT^ 1
v (2; + 1)
^- Degeneracies —*
Fine structure
Configuration Terms components
?V c
(r£ )
+ V ei 'SL
add the residual electrostatic interaction Vtt and divide each highly degenerate
configuration into term levels with quantum numbers ( and s. We finally include the
spin-orbit interaction and split each term into its j-dependent fine structure compo-
~ J +
nents. The final collection of all levels
1
L ;
constitutes a multiplet originating from a
single configuration. Note that if and s continue to be used through the final stage of
the procedure, even though these quantum numbers are only approximately good in
the presence of weak spin-orbit coupling. It is clear that the utility of f and s, as well
as the validity of the entire LS scheme, must eventually break down as the spin-orbit
interaction grows in strength with increasing Z.
We turn to another method known as jj coupling when VSL becomes dominant over
Vee The two
. corrections have this property for atoms in the high-Z part of the
periodic table. The states of such an atom are determined in first and second
approximation by the central-field and spin-orbit interactions
LKW + VtSI •
while the weaker electrostatic correction Vee is set aside until the final stage. We see at
once that the conservation laws for the various angular momenta take a different
route under these conditions. The leading potential energy E,Fc(r;) conserves L and S ( ;
for each electron. The next-to-leading contribution VSI does not conserve these
angular momenta individually but does conserve the sum
J,
= L, + S,.
2
h2J, (Ji+ 1 ) for J, and hm h for Jiz
to introduce j and
i
m as good quantum numbers, and we can label the states of the
electrons as (j x
m )
j 2 m Ji • • •
) in the absence of the residual interaction Vtt The
. total
486 Complex Atoms
Figure 9-23
L, L2 ... —^ L
\ I
angular momentum of the atom is given by the sum of the individual J vectors
J=IJ,
and is conserved whether Vet is included or not. Consequently, we are able to use the
already-defined total angular momentum quantum numbers j and m along with the
individual quantum numbers j i
for each electron, as an alternative set of good
quantum numbers (j j2 l
" " "
jm ,)- We are then in position to use this specification of
the states when we finally include the correction Vee .
These brief remarks summarize all that we intend to say about the method of jj
coupling. Figure 9-23 shows a schematic plan of the LS and jj coupling procedures. It
These exercises in angular momentum addition are quite similar to the vector-addition
problems encountered in Section 8-8 and Figure 8-22. We begin with the observation
that electrons in filled subshells form a S core and make no contribution to L and S.
X
It follows that the vector sums E,L and £,S pertain only to the optically active
; (
electrons in the atom. Let us be content to examine the case of two electrons outside
filled subshells. We recall that S has already been constructed for two electrons in
Section 9-6 and that the results s = and s = 1 are obtained when we add the two
spin-
]
- vectors. The remaining constructions
L = L, + L2 and J = L + S
then reduce to a single problem involving the addition of two integer-valued angular
momenta.
9-8 Angular Momentum Coupling 487
The quantized vectors L, and L 2 have magnitudes h jtf (if + ) and ti^2 { tf2 + 1 ] ] l
1 ) ,
This parameter has its largest value when m f = /, and m, = £2 and , so the total
orbital quantum number { cannot exceed «f, + £2 . Every lesser integer down to
\£x — t?
2 \
is also allowed in the addition of the two quantized vectors. We therefore
obtain the following list of possibilities for the total orbital quantum number:
t=\t - x
t2 \> K, - <,i + i
,
• • •
,
t +
x 4- 1 > 4 + lv ( 9 - 44 )
As usual, m f ranges from — £ to ( in integer steps for each of these values of ( Thus, .
£= with m;= 0,
t= 1 with m t = -1,0,1,
{= 2 with m f = -2, -1,0,1,2.
These nine different states describe all nine possible orientations of the selected pair of
vectors L, and L 2 .
The vector addition of L and S proceeds along exactly the same lines to a similar
conclusion regarding the total angular momentum quantum number j. For given
values of t and s we find the following list of possible results:
Each choice of j admits a range of values for the ^-component quantum number m-
from —j to j in 2j + 1 steps. It should be clear that Equation (9-45) holds equally
well for integral or half-integral values of the total spin quantum number s. We add
in closing that these same rules for combining angular momenta are readily adaptable
to the jj coupling scheme.
Example
Let us apply LS coupling to an atom with two optically active electrons, taking
the configuration to consist of one p electron and one d electron. The spin-orbital
assignments for the two particles are (/, = 1, s {
= ^) and {(2 = 2, s2 = -,). We
combine the spins in the usual way to form total spin states with s = and 1,
and we use Equation (9-44) to obtain the values /= 1, 2, and 3 for the total
orbital quantum number. The terms in this configuration are classified as
singlets (2s + 1 = 1 for s = 0) and triplets (2s + 1 = 3 for s = 1). The three
choices of / generate the singlet terms l
P,
X
D, and ]
F and the triplet terms 3P,
3 3
D, and F. Each term level has fine structure components with values of j as
given in Equation (9-45). The s = terms must have j = /, and so the singlet
components are
l
'P,, 'A, and F..
488 Complex Atoms
J = 0, l,and2 for (- = 1,
1
= 2, 3, and 4 for^ = 3.
3n 3
ro, 1 ,2'
Z)
^1,2,3) and r 2,3,4-
Thus >, the entire multiplet consists of 12 different energy levels in all.
The dynamics of LS coupling produces a multiplet from a configuration via the two
steps shown in Figure 9-22. This procedure has clear implications for theory and
experiment, since the breakdown of configurations into multiplets is a predictable
phenomenon with observable spectroscopic consequences.
We are concerned with the evolution of multiplets in electron configurations of two
types. A configuration is said to describe equivalent electrons if the same orbital
quantum numbers are assigned to all the electrons outside filled subshells. Examples
2 2 2
are the Is configuration of helium and the ls 2s 2p configuration of carbon. The
electrons are called inequivalent if their («,-/,-) assignments are different, as in the He
configuration ls2s and the C configurations ls 2s 2p3s
2 2
and ls
2 2
2s 2p3p.
We find that particular attention must be given to the Pauli principle whenever we
analyze a system of electrons in the equivalent category. The analysis isassisted by
three observations, put forward originally by F. Hund during the early period of the
quantum theory. These empirical rules refer specifically to the application of LS
coupling in ground-state configurations containing equivalent electrons. The first two
of Hund's rules pertain to the Vee effect and the splitting of a configuration into
A-term levels:
and
for maximum s the lowest-energy term occurs for the largest possible t '.
These guidelines can be used to address the problem of finding the lowest-energy level
in a given configuration. The statements do not imply that every term level of
maximum s must lie below all those of the next largest s and do not imply that the
maximum ( must
term of have the lowest energy when s is not maximum. We
illustrate the limited applicability of the rules later in this section.
The first Hund an extension of our arguments about the energies of the
rule is
singlet and triplet states in the two-electronatom. We know from Section 9-6 that
there is less repulsion between electrons in the spin-symmetric triplet states of helium
because the accompanying antisymmetric spatial eigenfunctions give smaller probabil-
ities for finding the two electrons close together. An atom with several electrons is in a
state of maximum s when the system has parallel spins. Such a state is spin
9-9 Spectroscopic Aspects ol LS Coupling 489
configuration. The two spin- particles can form singlet and triplet term states with
77
principle. The exercise becomes more involved for equivalent electrons because the list
posed by equivalent electrons. To analyze this case we could list all the possible values
for the electron quantum numbers {m ( m s m e m s ) and then strike those entries with
the same assignments for {m / m and (m ( m $ ). i
) Instead, we adopt a more efficient
way of treating two electrons antisymmetrically by following the approach introduced
in Equations (9-22). These constructions sort out the choices for m and m into
singlet (5 = 0) and triplet (s = 1) combinations and associate antisymmetric and
symmetric behavior under exchange of the two electron spins. The spin states are
accompanied by spatial eigenfunctions whose exchange properties are appropriate for
the overall antisymmetric description of the two electrons. We can take advantage of
2
this procedure and organize our list of
to reduce possible np states. The detailed
arguments are discussed at the end of the section.
The main result of our analysis of the np 2 configuration is the list of values of (
and s in Table 9-2. We find that (= and f= 2 occur only for s = and that f= 1
occurs only for 5 = 1. Hence, the only singlet terms are '5 l
and D, and the only triplet
3
term is P. The six s = entries in the table can be organized so that one state
belongs to 'S and five states belong to D. X
The 3P term contains nine states since there
are three mf values for £= 1 to m s values for s = 1. Thus, the
be coupled with three
3
P terms of the np 2 configuration represent 15 different states in all,
l
S,
l
D, and
appreciably less than the number found in the np n'p configuration.
490 Complex Atoms
m, m,c
'1 2
= mf + mf f m,<l
m'V,
'2
m /= m A + m f, *
1 1
'
1
&
1
1-1"
& 2,0
--1 1 .
2,0
"
- 1
&
-1
1 -1 -2
A third Hund rule is also available as another guideline to use in the final stage of
multiplet formation. We recall from Figure 9-22 that the spin-orbit correction VSL
acts as the last step to split the various A-term levels into their j-dependent fine
structure components. The empirical rule identifies an incomplete subshell of an atom
as either less than or more than half-filled and arranges the fine structure in either
normal or inverted order:
We offer the following remarks to give some theoretical justification for this rule.
Equation (9-40) describes the spin-orbit interaction VSL in terms of the coupled
orbital and spin angular momenta of the individual atomic electrons. It is possible to
prove that electrons in filled subshells make no contribution to this expression. It is
also possible to reexpress the Vsl effect in terms of the total orbital and spin angular
momenta I, and S. In the end we find that the expectation value of Equation (9-40)
turns into a simple formula for the spin-orbit energy shift:
= 4<S-L>. (9-46)
The expectation value is taken in a term state with quantum numbers {^sjm
t
) and
produces a proportionality constant A whose value is independent of j. We use the
familiar inequality
and we immediately reduce the formula to a final expression involving the quantum
numbers t s, and j: ',
Ah 2 r
This result predicts a splitting of fine structure components in a given A term and
orders the energies monotonically with j for the given choice of i and s. It is easy to
derive two conclusions from Equation (9-47):
and
It follows that the shift to the lowest energy occurs either in the state with j = j mm if
rule.
= —
Ah 2
[j(j+ !)-(>" lb'] =Ajh 2 . (9-48)
The result confirms Lande's interval rule, another relic from the early era of atomic
spectroscopy:
applicable.
Inverted fine structure is seen instead of normal fine structure when the incomplete
subshell is more than, instead of less than, half-filled. These situations are connected
by the fact that a fully occupied subshell has vanishing angular momenta and
vanishing spin-orbit interactions. We can relate occupancy and vacancy in a partially
filled subshell if we interpret summations of the type found in Equations (9-40) to
(9-42) as
n N N— n N-n
E
electrons
= I
electrons
E = - E ,
holes holes
where jV denotes the number of electrons needed for full occupation of the subshell.
Thus, n electrons and N—n
holes make equivalent contributions to quantities like L,
S, and (VSL ), except for the change in sign. Fine structure inversion follows from this
observation when the more-than-half-filled case n > N/2 replaces the less-than-half-
492 Complex Atoms
Figure 9-24
Schematic diagram of terms and fine structure components for an np n'p configuration and for
an np' configuration. Fine structure splittings are greatly exaggerated relative to the energy
differences between terms.
s=0 f=l ;= i
np n p np
^±Jl2is _ili_3o
7=2 j
= 2
3P 2 3
\s=l -p 2
f=l \s=i e=i
;'= l
3
p
<>
\ ;
=
1
7"
Pn
>=3 ,
D
W_f=2 3 ,
>= 2
;'= i
filled case n < A^/2. The argument also implies that no fine structure is expected
when the subshell is exactly half-filled.
We can use Hund's rules to deduce the state of lowest energy in a normal multiplet
by selecting the largest s, the largest /, and the smallest j, in that order. Let us also
imagine a conjectured extension of this procedure in which we order all the levels of a
multiplet with increasing values of the energy first by decreasing s, next by decreasing
l
£ and last by increasing j. This strategy is applied in Figure 9-24 to the terms S, P,
l
,
1), 'A', 7\ and I) in the np n'p configuration. Note that all s = 1 terms are predicted
1
I)
1,2,3 0,1,2
l
D, %
The figure goes on to show the altered prediction of levels for the equivaletit-electron
configuration np 2 . We have just learned that the Pauli principle excludes half of the
above terms so that the resulting multiplet contains only the levels
3d 17)
1Ji
"0,1,2
9-9 Spectroscopic Aspects of LS Coupling 493
2
It is instructive to count the possible values of m- for each assignment of j in the np
case:
3
P , 2
has m- = 0, m = —
]
1 to 1 , and m — — )
2 to 2,
while
l
D
2
has m = —2 to 2, and 2S has m ] = 0.
The exercise demonstrates again the existence of 15 different states in this equivalent-
electron system.
The proposed ordering of energy levels in Figure 9-24 is actually realized by many
atoms, at least in equivalent-electron situations of the type np'. Figure 9-25 shows the
extent to which the carbon atom fulfills the predictions. We know that the ground
state of carbon has the configuration Is 2s 2p and we see in the figure that the ,
lowest configurations of the two optical electrons are 2p 2p3s, and 2p3p. It is clear ,
that the actual levels of carbon agree with the predictions for 2p~ but disagree
considerably with the ordering proposed for 2p3p.
The LS coupling scheme is based on certain good angular momentum quantum
numbers in the system of interacting electrons. Parity furnishes another good quantum
number to associate with the levels of the complex atom. This property is expected
because the dynamics of the atom is governed by the interactions
LK(r,)+ K + e
v,
SL
and because these quantities are not affected by the parity operation r -> — r. We use
the orbital quantum number c*
t
to obtain — 1)
( as the parity of the ith electron. We
then use all the independent-particle quantum numbers ((*](*, • • f7 ) to establish the
parity for an entire multiplet of states. Thus, Figure 9-22 tells us that the space-inversion
property of every term and component is decided at the configuration level and
remains good even after the conservation of each L ;
is broken by the secondary
corrections. A fully occupied subshell contains an even number of identical t"
l
assignments and contributes even parity, so that the optically active electrons de-
termine the overall parity of the multiplet system. As examples, both of the multiplets
for np n'pand np~ have two odd-parity electrons in their starting configurations, and
soan assignment of even overall parity follows for every level sketched in Figure 9-24.
Note that the evenness or oddness of the total orbital quantum number ( has no
bearing on this conclusion.
The energy levels in Figures 9-16, 9-18, 9-19, and 9-25 show many differences in
energy of order 1-10 eV. Radiative transitions among these levels generate spectral
lines in the visible range of wavelengths. We can attribute such optical transitions to
oscillations of the electric dipole moment of the atom, and we can assume the validity
of the electric dipole selection rules. We have argued that these rules act as conserva-
tion laws for the system of atom-plus-radiation, in which an electric dipole photon
away one
carries unit of vector angular momentum. Changes in
ft the total angular
momentum quantum numbers of the atom must therefore obey the conditions
as in Equations (8-43) and (8-44). The parity of the atomic state must undergo a
change, either odd -* even or even -» odd, because the emitted photon also carries
away odd parity. We can go beyond the domain of strict conservation laws and
494 Complex Atoms
Figure 9-25
Energy levels of the carbon atom. Singlet and triplet terms are shown for the configurations
2/r, 2p3s, and 2p3p.
»S »P l
D '" 3
S 3
P 3
D
C * e
11
-
... ...
10 -
% l
D2
9 3
J\n.2
^X
2p3 Pj
3
«i 3
£,
/':
;
8 -
3
2p3s^ *0.l.:
L
s
2
2P
eV ^0.1.2
include the quantum numbers LS coupling scheme in these selection rules. The
of the
properties of the electric dipole moment are independent of spin, and so the total spin
quantum number is not affected in the transition. This observation implies that the
condition on j becomes an immediate condition on the total orbital quantum number
and results in the additional pair of selection rules
These constraints on / and s hold as long as <f and s are sufficiently good quantum
9-9 Spectroscopic Aspects of LS Coupling 495
numbers for the description of the atom. The independent-electron quantum numbers
(^,^2 • •
/z ) have a similar status in the radiation process. If the radiation can be
attributed to a single electron, then the electric dipole moment refers to that electron
A< = +1.
(Note that the possibility A/, = is ruled out because the parity of the atom must
change.) We observe that the A = 5 rule forbids singlet -triplet transitions in atoms
with two optical electrons, as in the case of the para and ortho systems in helium. Our
diagram for the carbon atom in Figure 9-25 is patterned after the helium levels in
Figure 9-16 so that attention can be drawn to this analogy.
We have been assuming that the atom is isolated from any applied external field.
We take the z axis in the direction of B and express the magnetic interaction by the
usual formula
VM = -n-B= -ix z B.
The magnetic moment of the atom is constructed from the orbital and spin contribu-
tions of each electron in the manner of Equation (8-45):
)
). We
write the formula for the energy shift due to the magnetic interaction in terms of the
expectation value of VA{ in this state:
(VM )= - B ( ti z ) = ^-(L Z
+ 2SZ ).
We then recall the derivation of (/*,) in Equation (8-50) and realize that the same
techniques can be carried over by substituting the quantum numbers of the many-
electron atom. The final formula for the energy shift can therefore be quoted directly
from Equation (8-54):
These results admit the real possibility for an s = atom to exhibit a normal
Zeeman effect. An
atom is in a s = singlet state with j = ( and g = 1, just like the
hypothetical hydrogen atom without spin in Section 8-3. The selection rules allow
transitions only to other singlet states, and so the application of a magnetic field causes
the emission lines of the singlet-state atoms to trifurcate in the classical manner
predicted by Lorentz. This possibility is left for further exploration in Problem 24 at
the end of the chapter.
Detail
Table 9-2 lists the quantum numbers m f and m ft for two equivalent electrons
configuration. The compilation is separated into columns for s =
2
in an np
entries in the table correspond to the following spatial eigenfunctions for two
electrons with quantum numbers «, = n 2 = n and ( = = 1 x
(.-,
for \m f — 1 mf = l]
mf = 1 mf =
for
mf = mf
A
mf = 1 mr =
for
777^- = mf = 1
Note that the angle-dependent factors control the exchange behavior of these
expressions in a manner consistent with the definition of the functions \p and
"V =
A
\p in Equations (9-22). It is clear that the symmetric function i/^ nas -
and /= 2, given the assigned values m f = m e = 1. Both functions »//f, and i^fj
have ^2^= 1, in view of their {m f m r assignments, and can belong to either
)
must also have £— 2, so that the antisymmetric partner ^f, must have /= 1.
We have anticipated the demonstration of these total orbital quantum numbers
by labeling the functions as 4> S/J. Other entries and { values in the table can be
understood in similar fashion. (We note that our use of the eigenfunctions i|/^
is a device intended to impose the Pauli principle on the two outer electrons
alone. Thus, our procedure is not a rigorous application of the Pauli principle to
the entire Z-electron system and is meant to be applied only for the determina-
tion of the term quantum numbers £ and s.)
9-9 Spectroscopic Aspects of LS Coupling 497
Example
The ground-state configurations at Z= 12, 13, and 14 are given in Table 9-1 as
2
[Ne]3/-' forMg, [Ne]3s 3p for AI, and [Ne]3j 2 3/> 2 for Si.
term with s = ^ and /= 1. The possible j values are \ and so that the P 77
term splits into 2P 1/2 and 2P 3/2 components. The P 1/2 state is predicted to have
the lower energy. The Si atom has two equivalent p electrons outside filled
subshells. The lowest levels should be just like those sketched on the right side of
3
Figure 9-24, where the P state is found to have the lowest energy.
Example
equivalent to a subshell with four d holes. The largest possible value for the total
spin quantum number corresponds to the case of four parallel spins, where s = 2
and 2s + 1 =
happens that the \D term has the lowest energy. The
5. It
form an inverted system in which the 5 Z)4 component occurs at the level of least
energy.
Example
Let us conclude the section and the chapter with some remarks about the
He-Ne laser to add to our previous description of the device in Section 3-9. We
illustrate the relevant transitions of the two atoms in Figure 9-26. Operation of
the laser begins with a pumping process that raises helium atoms from the
ground state to the singlet and triplet ls2s excited states. Note that the electric
dipole selection rules do not affect these transitions since the excitation of atoms
in the gaseous medium of the laser is collisional rather than radiational. The
excited He states are metastable because the selection rules inhibit spontaneous
transitions back to the ground state. Helium and neon atoms transfer their
energies of excitation via the collision process
He* + Ne -» He + Ne*.
Figure 9-26
Energy levels and transitions of He-Ne laser. Two four-level laser systems operate in neon
5
through population inversions in the 2p^5s and 2/r 4s states with laser transitions to the 2p 5 3p
levels.
21
ls2s
20
L9
Pumping transitions
17
eV
Ls<
Problems
- i —
as
2. The ground state of the helium atom lies 24.6 eV below the ionization level for the singly
ionized system He + + e. Deduce the position of the helium ground state relative to the
t +
double-ionization level for He + 2e.
-—
2m
Vf + V./H-
477f r,
+
r.,
U = £*
e y J
with Z= 2. What would the ground-state eigenfunction and energy be in this case?
Problems 499
The sixth row of the periodic table lists the elements from Z= 55 to 86. What electron
subshells are filled in assembling the ground states of these 32 atoms? Identify the
ground-state configurations for Z= 55 and 56 and for Z= 81 to 86.
Consider an alkali atom in its ground state, and assume that the single electron outside
The MN x-ray lines in the M series of any element correspond to M —> N hole
KL U KL m KM 1U L X
M XU
0.02138 0.02090 0.01844 0.12627 nm
The K absorption edge occurs at 0.01784 nm and the A'-shell energy is given as 69.52
keV. Use these data to calculate the wavelength for each of the L absorption edges and
the energy for each of the L subshells. Compare the computed values with the results
Let the eigenfunction for the three-electron atom be written as a Slater determinant:
*.0) M l
) * Y 0)
Demonstrate the exchange antisymmetry of this construction, and show that \p(l,2,3)
satisfies the exclusion principle. Verify also that \p(l,2, 3) is properly normalized.
9. Show that a definite parity can be identified with any Slater determinant for the
Z-electron atom.
10. Express the singlet and triplet He eigenfunctions in the ls'2s configuration in terms of
Slater determinants.
11. Let there be a spin-spin interaction for two electrons given by the formula
vss = fs, • s,
in which f is a constant. Evaluate the interaction energy in the singlet and triplet states of
the two electrons.
12. Refer to the wavelengths given in Figure 9-18 for the spectral lines of lithium, and
calculate the energies above the ground state for the lithium levels shown in the same
figure.
13. The spectrum of sodium includes lines with wavelengths (in nanometers)
corresponding to the 3d —> 3p transitions indicated in Figure 9-19. Use the spectroscopic
notation n£ L: to specify the states involved in each of these transitions. The deduction
should allow for the fact that the sodium 1
D states form inverted doublets in which j '
= \
lies above j '• = f Use the wavelength data
. to calculate the splittings of the initial and final
doublet levels.
500 Complex Atoms
14. The ~F levels of sodium consist of pairs of states whose closely spaced energies are difficult
to resolve. Assume a normal fine structure ordering of the j = -, and j = j states, and use
a hydrogenic model of the valence electron to predict the spin-orbit splitting in the lowest
F doublet.
15. Consider two optically active electrons in orbital states with quantum numbers (/, = 1,
m/ = 0) and (Y, = 1, m f
- = 0). Use graphical constructions like those in Figures 8-22
and 9-17 to represent the possible results for the total orbital quantum number t.
16. Let an atom have one p electron and one d electron outside filled subshells, and use jj
coupling to determine the possible values of the total angular momentum quantum
number j. Show that the occurrence of j values is the same as in the LS coupling scheme.
17. The lowest excited states of the Be atom correspond to a 2s2p configuration of optically
^ >+
active electrons. What 'Z,
;
components originate in this configuration? Assume that
Hund's rules hold for the multiplet and deduce the ordering of the energy levels. Refer to
a table of beryllium levels to see if the ordering occurs as predicted.
LS
in the coupling scheme. Obtain expressions for the shift in states of maximum and
minimum j for given values of t and s.
19. Examine the terms and fine structure components in the configuration np n'd, and use
Hund's rules to identify the term and component of lowest energy. Extend the scope of
Hund's rules to predict an energy level diagram for the whole multiplet, showing the
ordering of all the various terms and components.
20. The atoms potassium, calcium, scandium, and titanium have atomic numbers Z= 19, 20,
21, and 22. Identify the ground-state configuration of each atom, and organize the energy
levels of each of these configurations according to the total quantum numbers s, /, and j.
Deduce the quantum numbers of the lowest-energy state of each atom, and identify the
";v f
state by the spectroscopic notation L .
21. Predict an energy level diagram showing the terms and fine structure components in the
2p3i configuration of the carbon atom. Compare the prediction with the known results
22. Refer to the energy level diagram for the carbon atom in Figure 9-25 and indicate all the
23. The fine structure components for the lowest term of the iron atom have the following
tabulated energies (in cm _1 units):
5
A '7J j
5
a 5
A f,
A,
415.932 704.004 888.129 978.072
Use these numbers to check the validity of the Lande interval rule.
24. The excited singlet states of carbon 2p3s 'P, and 2/r 'Z)2 are shown on the energy level
diagram in Figure 9-25. The two excitation energies are quoted in tables as 61982 and
10194 cm" '. Let a 1 T magnetic field be applied to the atom and identify all the allowed
transitions between the two split levels. Show that the normal Zeeman effect should be
observed, and calculate the corresponding wavelength shifts.
TEN
MOLECULES
The simplest molecular form is the diatomic hydrogen molecular ion HT , consist-
ing of two protons and a single electron. The next in order of increasing complexity is
As with atoms, the most fruitful way to probe the structure of molecules is by
spectroscopy. Not surprisingly, molecular spectra are often more complicated than
those of atoms. This additional complexity arises because, in addition to electronic
transitions, there are energy changes involving the relative motions of the nuclei that
make up the molecule. Such motions are classified as rotational and vibrational. Because
the rotational transitions involve relatively small energy changes, the spectra occur in
characteristic groupings of very closely spaced lines called bands.
501
502 Molecules
While the most mundane features of our lives depend on molecular processes,
molecules are by no means restricted to our local environment. Astronomical observa-
tions at radio-wave frequencies have shown in recent years that molecular clouds are
spread throughout the universe. Molecules that do not occur naturally on Earth have
been discovered in space. Molecular clouds are found to be closely associated with
regions giving birth to stars in galaxies. It would seem that molecules may be equally
fundamental to our understanding of nature on every scale from the cosmological to
the biological.
+
10-1 Binding by Quantum Tunneling: H 2
A molecule is a collection of two or more nuclei and their associated electrons with the
whole complex bound together by electromagnetic forces. Solving the problem of a
complex atom is difficult enough; here we have an even more complicated multicenter
problem so that the electrons can be associated with one or more of several nuclei. To
simplify the difficulties inherent in studying molecules we look primarily at the
diatomic (two-atom) system. Furthermore, we begin our discussion with the simplest of
these, H.^, which consists of two protons and just a single electron.
Electrons are considerably less massive than protons so that electron rearrange-
ments occur much more rapidly than those of the more ponderous nuclei. This fact
results in a useful technique, introduced by Born and J. R. Oppenheimer and known
as the Born-Oppenheimer approximation. The nuclei are assumed to be at fixed points
and the electron energies and wave functions are then found as functions of the fixed
nuclear positions. The total electronic energy is then used, together with the direct
internuclear forces, to form an effective potential energy for the nuclear motion.
Figure 10-1 shows the arrangement of the two protons in H^; both are on the z
axis, one at — R/2 and one at R/2, so the distance between them is R. The electron is
at arbitrary position r. The nuclear positions are taken as fixed so we need write the
Figure 10-1
Figure 10-2
Potential energy seen by the electron in Hj as it moves along the z axis. The dotted line gives
that part of the potential arising just from the proton at z = - R/2. The dashed line shows the
same for the proton at z = R/2.
2
ki- ke ke'
+ - - }xb (r) = e.iMr), (10-1
2ra,
r
|r+ R/2| |r- R/2| R '
where m t
is the electron mass, k = l/47re , and ee is an energy eigenvalue of the
electron. The first term in curly brackets is the electron kinetic energy operator. The
second and third terms are the attractive potential energies caused by the protons at
— R/2 and R/2. We have also included the last term, the proton-proton repulsion,
as a part of the electronic potential energy for convenience.
Figure 10-2 shows a cut through the potential energy function as one travels along
the z axis.The potential energy diverges at z = — R/2 and z = R/2 as it must for
the Coulomb force. Near z = the function is a bit lower than it would be if there
were only one atomic potential energy curve. The potential energy is symmetric about
this point. This double-welled potential energy function is analogous to that consid-
ered in Chapter 5 (Figure 5-25) and the wave functions have properties similar to
those found there.
Suppose that only one of the potential wells is present (e.g., the one around
z = R/2 as indicated by the dashed curve). Then the problem is quite familiar and
the ground-state wave function is just ^^(r — R/2), the hydrogenic Is state centered
at r = R/2. Similarly, a particle in the potential well centered at r = —R/2 has
wave function ^(r +
R/2). Because both potential wells are the same, just shifted in
Now consider the situation in the combined problem represented by the solid line in
Figure 10-2. While the electron is in the neighborhood of r = R/2 we expect the
ground-state wave function to be approximately equal to <p u (r
— R/2) because in
much of that region the dashed line and the solid line coincide. While the electron is in
the neighborhood of r = —R/2, the wave function is approximately u (r + R/2). A <j>
further necessary property of the wave function arises from the symmetry of the
potential energy about the point r = 0. We expect the probability P(r) of a particle
504 Molecules
Molecular wave functions that are even under a parity change (the upper sign) are
odd are ungerade. (These are the German words for
said to be gerade; those that are
"even" and "odd.")
Eigenfunctions that satisfy both of the above sets of requirements are given by
and
When -
r is
u (r + R/2) * lt (R) is small and $ ± (r) * (1/ y/l )<$> u (r
near R/2, 4> <f>
72
From Chapter 7 we know that the l.r state for hydrogen is also even under parity
inversion, that is,
<M-r) =*,,(*)>
Figure 10-3
Graph along the z axis of the approximate eigenfunctions \p _ (r) (Equations (10-3)) for H^
101 Binding by Quantum Tunneling: H2+ 505
(e) = e
±
= jr ±
— h
2
v; + V(r) xp
± dr. (10-4)
2
h
lm e
2 2 2
ke ke ke
v — V ~ vR =
|r-R/2|'
b
|r + R/2f Y 2 '
V(r) = K+v b
+ vR .
Note that the \p ± are real because <pa and <p h are real. Putting the ip
+ of Equations
(10-3) into Equation (10-4) gives for the energy estimate
e
± =-f(<t>a ±<t> b )[T+ Va +V b
+ VR ](<j>a ±<t> b )dr
=
-f(<t>a ±<P b )[(T+ Va )4>a ±(T+ Vh )4> b }dr
'
1 , 1 ,
+
J Jti( y> + V^ dr + 2
}ti {V° + V ^ dr
(T+ Kh a
= ^A and (T+ V )$ = b h
e is<j> b .
corresponds to the average value of the Coulomb interaction between the electron and
nucleus on site a (at R/2) and the nucleus on b (at — R/2) or vice versa.
,
506 Molecules
deal smaller than the direct energy terms, such as e u or G, which depend on 4% or 2b <f>
.
Classically, a particle is trapped in a potential well (e.g., the electron near r = R/2
can not get into the well at r « — R/2 if its energy is less than the central barrier
height as indicated in Figure 10-2). But because of the quantum mechanical barrier
penetration effect, the wave function does enter this classically forbidden region,
resulting in a nonzero overlap integral. Thus, we obtain
e
±
=e u + G±S (10-7)
and hence becomes exponentially small. It can be shown that G also becomes
S,
exponentially small. Then we have e + —* e, However, for smaller separations the two 5
.
energies e + and e_ are not equal but are split by 2S. The exchange integral 5 turns
out to be negative so that e + (corresponding to ip + the eigenfunction with no nodes) ,
is lower than e_ and is the ground-state energy. More importantly, even though G is
V± = e
- eu
±
as the nuclear potential energy functions. Written in this way, the potential energy
goes to zero as R —> oo. V _ is always repulsive but V + is attractive until R gets so
small that the Coulomb repulsion between the protons dominates. There is a mini-
mum in V + defining the equilibrium separation of the two protons in the molecule.
,
The binding of the H^ molecule arises from the exchange integral S in Equation
(10-7). The physical origin of this term can be seen from the behavior of \p in .
Equations (10-3). From the plot of these two functions in Figure 10-3 we see that ^ +
is larger than either 4> = <|>,,(r — R/2) or = <p u (r + R/2) in the region between
a b <f>
the two protons. The probability density associated with the two functions is
2 2 2
l* ± l
= i[l*.l + l*»l ±2fc* 4 ]. (io-8)
The last term is the contribution from the overlap region. In \ip
+ \' this extra electron
10-1 Binding by Quantum Tunneling: Hf 507
Figure 10-4
density implies a region of extra negative charge, which attracts each proton to it and
partially shields one proton from the repulsion of the other. This is the effect causing
molecular binding in H^; hence \j/
+ is said to be a bonding molecular orbital. On the
other hand, l^^l" has less electron density than it would have without the overlap
term (note \p _ vanishes at the midpoint between the protons), and \p_ is said to be
antibonding.
The G and
1
Figure 10-5
9<L
ruined the spherical symmetry of the electron's potential energy. The problem is no
longer one having a simple central force. Thus, molecular orbitals cannot be classified
according to if values.
Molecular wave functions for diatomic molecules like HT do have cylindrical
Detail
We can easily show that \p + is an eigenf unction of L.. Using the coordinate
system of Figure 10-5, write
l/2
|r - R| = = (r- + R2 - 2rR cos o\
a )
h d
-
L. -»
1 1$
so that
M +
= 0,
and \p ^
therefore corresponds to X = 0.
The H.J molecule discussed in Section 10-1 is useful in showing the elements of
molecular binding in a reasonably straightforward way. It is the "hydrogen atom" of
molecules. However, the simplest neutral molecule is H.,, which carries two electrons.
H 2 However, there
Figure 10-1 continues to describe the proton configuration for .
;Hl,2)=4> l5 (r 1
)<|>
li
(r 2 )x"(l,2), (10-9)
where x A (h2) ls the antisymmetric or singlet spin function given by (see Equation
(9-25))
4
= ^(T,1 - l,T (10-10)
X' (1,2) 2 2 ).
The spin function must have one electron with spin up and the other with spin down
because both electrons are in the same atomic orbital. In this way eigenfunction
i^(l,2) satisfies the Pauli principle; that is, it is antisymmetric and i//(2, 1) = —^(1,2).
Next, let the two protons be drawn apart along the z axis, one to R/2 and the
other to — R/2. We might expect one electron to be dragged along with each proton.
Then Equation (10-9) would be transformed into
This might seem a good initial guess at a molecular eigenfunction except for two
important deficiencies. First, Equation (10-11) has not incorporated the physical
features found to be necessary for molecular binding in the HT analysis. Each
electron should have a probability of being at each proton, thereby spending extra
time in the intermediate region between the protons. This extra electron density yields
the net attraction that binds the molecule. A second deficiency arises from the fact
that the proposed function does not obey the Pauli principle; it is not properly
antisymmetric as it must be for fermions.
^ L
(l,2) = -^KCr, - R/2)<Mr 2 + R/2) + *„(r2 - R/2)* If (r, + R/2)]
X X ^(1,2), (10-12)
There is an alternative antisymmetric wave function that we can form from the Is
atomic orbitals. First, we construct triplet spin states from the spin eigenfunctions as
done in Chapter 9, Equation (9-26). These are
s '=
xi T 1 1 2 (10-13)
,X-i = i i
i 2- (10-15)
Since these states are symmetric under electron exchange, the spatial part of the wave
function must be antisymmetric. By analogy with Equation (10-12) we have
*" L == ^k^i r
" R /2H l5
(r 2 + R/2) - *„(r2 - R/^Jr, + R/2)] X Sm/
(10-16)
The interference term (the last one, like that of Equation (10-8)) depends on the
overlap of electronic wave functions in the region halfway between the nuclei, and
implies that there is extra electronic charge there in the singlet case (plus sign) and
diminished charge for the triplet (minus sign). While the added charge density of the
singlet wave function does enhance electron -electron repulsion, this effect is more
than offset by the added attraction between the extra charge and the two protons. It is
this effect that provides the covalent bonding in the singlet case. The triplet wave
where the upper sign refers to the singlet case and the lower to the triplet. The
exchange integral S' depends on teo-particle overlap integrals and contains terms like
where F(r,,r 2 ) can be any one of the various terms occurring in the energy operator.
G' is also a two-particle quantity somewhat analogous to the G of Equation (10-5). S'
turns out to be negative so that the singlet energy is lower than the triplet. The singlet
energy has a minimum as a function of R and exhibits binding while the triplet has no
bound state.
L
As mentioned briefly above, the approximate Heitler- London eigenf unction \p s
L
^(1,2) = *? (1,2) + Y^[<Mr, - R/2)*u(r2 - R/2)
4
+ « lj (r + R/2)* u (r2 + R/2)] X
1
'
(1,2), (10-19)
where y is an adjustable coefficient. The first term of the new addition has both
electrons near the same nucleus at R/2; the second has both near that at —R/2.
While each of these terms increases the average value of the electron-electron
repulsion, it also gives the system an ionic structure in which there is some probability of
the formation of the ion combinations (H + ,H~) or (H~,H + ). The ions attract one
another and add to the molecular binding. At the equilibrium separation of the
molecule (the minimum of the electronic energy), it is found that y = 0.2, implying
that only a small percentage of the binding results from this process. Note that the
configurations (H~, H+ ), and (H + ,H ), with both
with both electrons on proton 1,
on proton 2, are equally likely so that H 2 does not have a permanent electric dipole
moment. In Section 10-3 we study systems that owe their bonding to the existence of a
permanent separation of charge and the resulting electric dipole configuration.
It is possible to construct bonding or antibonding molecular states from excited
atomic orbitals as well as from Is states. For example, simply replace the Is orbitals in
e s —> 2e u which is the electronic energy associated with the two isolated hydrogen
,
atoms. The bonding state constructed of two 2s states would have the limit 2e 0s as
R — * oo. A bonding excited molecular state such as £ 4 in Figure 10-6 has a minimum
at some R value and does not necessarily dissociate (i.e., separate into its atomic
constituents) even though its R -» oo limit is greater than that of the ground state. On
the other hand, excitation of the molecule from the ground state to an excited state
such as e2 or e 3 in Figure 10-6 would result in dissociation.
As a we consider why a closed shell (noble gas) atom does
last topic of this section
not interact covalently with any other atom. Consider the case of hydrogen and
helium. The bonding molecular wave function has been seen to have a singlet spin
configuration with one spin up and one spin down. Consider a covalent bond
involving the hydrogen Is electron with spin down, for example, and one of the
helium Is electrons with spin up. The wave function involves an exchange term that
brings the down electron over to the He, which then has two spins down and violates
the Pauli principle. This is a somewhat simplified view; a rigorous argument would
construct three-electron wave functions for the two nuclear centers. These wave
functions can be shown to vanish because of the Pauli principle for all but antibond-
ing molecular orbitals. Thus, the molecule H — He does not form.
While similar arguments can be made to show that two noble gas atoms, such as
He — He or He — Ar, cannot interact covalently, molecules such as Ar, do exist.
512 Molecules
Figure 10-6
These occur because there is another attractive force mechanism known as the
van der Waals interaction, which is discussed in Section 10-4.
Hydrogen has a single electron, and the alkalis (Li, Na, Rb, Cs) have a single electron
outside a closed shell. When any one of these react with any one of the halogens
(F, CI, Br, which are one electron short of completing a shell, they form tonic bonds.
I),
Basically, what happens is that the single s electron of the alkali element (or of
hydrogen) is taken by the halogen to fill its shell; the resulting positive and negative
ions attract one another and the molecule becomes bound.
Consider the molecule lithium fluoride, LiF. The primary question to be answered
+
is why an electron would be pulled off the lithium atom to form Li and move over to
fluorine to form F . The case rests mainly in the fact that all the electrons in a single
shellhave roughly the same average atomic radius. The electronic structure of fluorine
is
22
\s 2s2p\ The n = 1 shell is complete and the n = 2 shell is missing one electron.
All the n = 2 electrons reside at approximately the same average radius, which, of
course, is larger than that corresponding to the n = 1 electrons. Each of the seven
n = 2 electrons is thus screened efficiently from the nucleus by only the two \s
electrons. The resulting Z value seen by each of the n = 2 electrons is not much less
than 7. An extra tenth electron borrowed from the lithium atom sees roughly the same
effective Z value and is bound. The binding energy of this electron to form a negative
ion is known as the electron affinity. This electron affinity, however, is usually smaller
than the ionization potential associated with forming the positive ion. In the case of
10-3 Ionic Bonding: LiF 513
2
lithium (whose structure is \s 2s) it costs 5.4 eV to pull off the 2s electron (the
ionization potential), and in forming the negative fluorine ion 3.4 eV is released
(the electron affinity) — a net cost of energy of 2.0 eV. However, the energy released
by the binding of the resulting ions (about 8 eV) more than compensates for this
energy expense. This ionization process goes only in the direction indicated, as is easily
+
seen by considering the energy required to form (Li~F ). The electron affinity for
lithium is 0.6 eV; the ionization potential of fluorine is 17.4 eV. The energy
requirement is 16.8 eV, while the net gain through binding of the two ions is the same
+
as for (Li F~).
Note that homonuclear diatomic molecules (those made up of like nuclei, such as
H 2 )
have constituents with equal attractions for the electrons and so do not form
permanent ions within the molecule, although they may have ionic structure in their
wave functions as discussed in Section 10-2. Only heteronuclear molecules (those made
up of unlike nuclei, as HF) can have the permanent charge separation described
above.
There is a simple test by which we can judge the accuracy of the above idea of the
ionicbond in LiF or in any diatomic molecule. If the lithium's electron spends all its
time on the fluorine atom, then the electric dipole moment of the molecule, from
Figure 10-7, is
R\ R
— \
+ e— = eR.
2 / 2
of the electric dipole moment is p =2.11 X 10 -29 C m, 85% of the above model •
value.The lithium's electron does maintain some probability of being on its home-base
atom in addition to spending time at the fluorine; this reduces charge separation from
the model value. There are also other small effects that tend to reduce the polariza-
tion.
The above result indicates the bonding of LiF is only partly ionic; it is also partly
covalent. The lithium's 25 electron pairs with one of the fluorine's 2p electrons to form
this covalent bond. To see how this takes place consider the possible angular parts of
the 2p wave function. There are three degenerate states with angular components
3~
10-20a;
and
(l0-20b)
(Refer back to Table 6-1.) Figure 6-8 shows how the angular lobes of the probability
density corresponding to these states are positioned. Combinations of the F, + ,
functions have lobes in the xy plane. The Y w function shown in Figure 10-8 has lobes
along the z axis. The upper lobe is positive and the lower one negative. Of the three
functions, the one expected to have the largest overlap with a 25 function centered at
a distance R away along the z axis is F 10 . We pair this so-called 2 p. function of
fluorine with the lithium's 25 function for a covalent type of wave function.
514 Molecules
Model of an ionically bound diatomic Angular function F10 -a factor in the 2p,
molecule. Atom B has the greater electron wave function of fluorine.
affinity and takes an electron from atom A.
The two atoms form an electric dipole with
charges +e and —e separated by distance R.
^- y
2 2 2
where normalized by taking \ A + B + C = 1 (see Problem 9 at the end of
\p iS
the chapter). The term with coefficient A is the covalent part of the wave function
and those in B and C involve the ionic structures. Unlike the case of H 2 we have no ,
reason to take B = C and indeed the ionic bonding mechanism we have described
requires B =£ C. Because we know the electrons may double up on fluorine but are
very unlikely to do so on lithium, it is reasonable to set B= as an approximation.
It can be shown that the electric dipole moment of the molecule may be related to
the parameters in Equation (10-21). The result, for the case of arbitrary A, B, and C,
is
p = eR(C 2 - B 2 ). (10-22)
p = eRC 2 . 10-23)
The expe rimental ratio of p/eR = 0.85 for LiF gives us C= v0.85 = A =
0.9,
Figure 10-9
^R
the molecule. However, a more direct approach is one in which we guess the form of
the electronic energy on the basis of general physical principles. Such a "phenomeno-
logical" form contains coefficients that vary from molecule to molecule and are
determined by fitting the predictions derived from the potential energy function to
experiment.
The potential energy function usually chosen for a general ionically bound alkali
halide molecule has the form
aR
V( R =
) ae (10-24)
4ire Q R
The second term obviously represents the Coulomb attraction, at large separation R,
between the positive and negative ions. When the ions are so close to each other that
their closed-shell electronic clouds overlap, the electronic energy becomes repulsive
due to the Pauli principle as represented by the first term. The constants a and a,
which control the strength and slope of the repulsive part, are determined from
comparison with experimental data as shown in the example at the end of this section.
A qualitative plot of V(R) is shown in Figure 10-9. While V(R) looks much like V + of
Figure 10-4 or £ of Figure 10-6, it falls off at large R as \/R, which makes it much
longer ranged than either of those energies, since each drops off exponentially.
The equilibrium separation of the nuclei in the molecule is very near the position
R of the minimum in the potential energy. (Zero-point motion changes this slightly.)
This position satisfies
dV
~aae' aR + <>
- =0
~d~R R = R„ 4ite Rq
or
//.'..
(10-25)
477£ aR^
-
576 Molecules
K= —
d2
dR
V
2
= a
2
ae- aR » -
477e
2e
2
/t
R=«
KRl\
a= — 1 /
2 + 4^-^- . (10-26)
Spectral data give R and K directly, as we see in Section 10-8. Putting these values
in Equations (10-25) and (10-26) yields a and a.
The dissociation energy is the work needed to pull the two atoms completely apart
into isolated neutral atoms. If we were to separate the ions to infinity the work needed
would be — V(R ), the depth of the potential energy. The removal of the electron
from the negative ion and its return to the positive ion net a positive energy
d=I -A ,
(10-27)
where /,, is the ionization potential and A is the electron affinity. The dissociation
energy then is related to the potential energy by
Example
For the molecule lithium fluoride spectral data yield the values of the inter-
atomic separation R =f)
0.156 nm and the potential energy function curvature
K= 248 J/m2 . From these values Equation (10-26) yields
3
2
1
(248 J/m )
(1.56 X lO^'m) \
6.13
2 +
0.156 nm (8.99 X 10
9
N -m 2 /C 2
)(l.6 X 10~ 19 C)
2
/
0.156 nm
\
39.2 nm -1 .
(8.99 X 10
9
N m /C 2 )(l.6
•
2
X 10' 19 CV 6Ai _ in-16
l0
' .6.13
-e = i
1.1
1 ss 10" lb
J
6.13(1.56 X 10~ m)
= 688 eV.
/ 1.44 \
39 2 *
V(R)= |688<-
-
eV,
1.44
F(i? )
= 688^- ,392)(0156) -
0.156
= 1.5 - 9.2 = -7.7 eV.
Atoms of the noble gases, having closed electronic shells, do not interact with one
another via covalent or ionic forces. However, there is a weaker attraction between
such atoms, called the van der Waals force. This force is also present in molecules that
do interact covalently or ionically but it is much weaker and of lesser importance.
Despite its relative weakness the heavier noble gas atoms — neon, argon, and so
on — are able to form molecules via the van der Waals interaction. Helium liquifies at
The Dutch physicist J. D. van der Waals proposed that all molecules have
attractive forces that, at sufficiently low temperatures, can cause liquification. The
force we discuss here has thus been named after him. However, it was London who in
1930 first explained the physical origin of this interaction, which is occasionally called
the London force.
The an induced dipole-dipole effect. We consider here a
attraction arises from
classical model is not totally adequate but does have many of the
of the force that
elements of the quantum mechanical picture. Suppose two neutral H atoms approach
one another, and assume no covalent force. At any instant in time an electron and the
nucleus that it is orbiting form an electric dipole. This dipole creates an electric field,
which is felt by the second atom whose positive charge is pulled one way and its
negative another. It becomes polarized; that is, it develops an induced dipole moment
as shown in Figure 10-10.
We can easily find the behavior of the electric field as a function of distance R
from the first atom. Consider for simplicity a point A on the perpendicular to the
dipole as shown in Figure 10-11. At this position only the vertical components £,sin#
of each field do not cancel. From the figure we see that sin satisfies sin 6
\a/ \JR
2
+ (a/2)" . When the field point is far away from the dipole so R » a, we
518 Molecules
Figure 10-10
2Q sin0 2Qa a
Ed = (10-29)
4ire 2
R + (a/2)
2
^e R 2R 2
4ire R3
'
where p x
= Qa is the dipole moment of the first atom. From this we see that E
d drops
3
ofT as l//? as we move away from the dipole, a result that holds even when the point
A is not on the perpendicular.
The second atom in the field Ed acquires an induced dipole moment p 2 having size
«E //5 (10-30)
where a is called the polanzability of the molecule. This is a kind of Hooke's law
approximation in which the separation of the charges is proportional to the applied
force. (See Problem 11 at the end of the chapter.)
Figure 10-11
Electric field E rf
due to a dipole p, . The field is the sum of the electric fields E + and E of the
individual charges + Q. Position A in the xy plane at a distance R from the dipole is
considered. The charges + () are separated by distance a. The distance from either charge to A
2 2
is //? + \a .
»
104 Van der Waals Interaction 519
U= -p,Ed . (10-31)
To derive this note that if a field is along the z axis, then while a dipole rotates from
the xy plane into alignment with the field, the field does work QEa/2 on the charge
+ Q and work - QE( — a/2) on the charge - Q. Thus, the total dipole energy is
-2Q<La/2)E= -pE.
If we combine Equations (10-30) and (10-31) and then use Equation (10-29), we
find
.? 1 aQ2a 2 1
U = - <xE; = - a
(4we )
2
-K
b
(4we )
2
Rb
a = — qt
— = 47re/ J ,
477C /2
where / is some length in the problem. For a dipole the appropriate length is the
charge separation a, which in this case is approximately the atomic radius. So we
substitute
a = Aire Q a
z
. (10-32)
U= - —^
477£ <2 \
-
R
• (10-33)
J
The leading factor is of the same order as I , the ionization potential of the atom.
Thus the van der Waals potential energy function has the form
520 Molecules
a (nm) e (eV)
He 0.256 8.79 x 1(T 4
~
Ne 0.275 3.08 X 10 3
~
Ar 0.340 1.05 X 10 2
"
Kr 0.368 1.44 X 10 2
~
Xe 0.407 1.94 X 10 2
Source: Data from E. R. Dobbs and G. 0. Jones, Rep. Prog. Phys. 20: 516 (1957).
Vu (R) = 4e (10-36)
R \ R
The constants e, a, and n are parameters to be determined to fit gas, liquid, and solid
data. (Even though the energy function is sufficiently weak that the so-called noble
gases are indeed gases at room pressure and temperature, they do liquify and solidify
at sufficiently low temperature.) The most frequently used value of n is 12, in which
case we have the Lennard-Jones 12-6 potential. It is easy to verify that the potential
depth is f and that a is the value of R for which V\AR) = (see Problem 13 at the
end of the chapter).
Some values of e and a for the 12-6 potential for various noble gases are given in
Table 10-1. Note that as one progresses to heavier atoms the polarizability, which
depends on atomic volume, increases and the strength of the interaction also increases.
Molecules with one member an alkali atom (which has a high polarizability) and
the other a noble gas atom can be bound by the van der Waals interaction and can be
observed spectroscopically. Molecules made up completely of noble gas atoms are
difficult to observe but have been detected by use of mass spectrometers.
Example
The noble gases neon, argon, krypton, and xenon are able to form molecules
because of the van der Waals interaction. However, they are weakly bound by-
comparison with molecules held together by other forces. For the ionically
bound molecule LiF we know that the dissociation energy- is about 6 eV. For the
heavy noble gases the dissociation energy is approximately equal to the well
depth. The Lennard-Jones 12-6 potential has a depth equal to the parameter e
given in Table 10-1. For Xe, e = 0.0194 eV. A thermal energy equivalent to e
0.0194 eV
TXe = r = 225 K.
8.62 X 10" 5 eV/K
At this temperature, which is a little below room temperature, most of the
10-5 Polyatomic Molecules: H2 and CH4 521
6eV
TUF
L,F
= 1
= 7 X 10
4
K.
8.62 X 10~ 5 eV/K
Obviously the van der Waals interaction weak by comparison. For a
is quite
4
pair of helium atoms the well depth 10
is eV. The equivalent
only 8.8 X
thermal energy is only 10.2 K. However, below this temperature we do not find
bound He 2 molecules. Quantum mechanical zero-point motion is large enough
to prevent these light atoms from forming any bound state at all. (See Problem
19 at the end of the chapter.)
i i i
The electronic configuration of oxygen is ls 2s 2p . As discussed in connection with
Figures 10-8 and 6-8 the p functions have threefold degeneracy; one (call it 2p x
) has
lobes along the x axis, another (2p ) along y,
y
and the third {2pz ) along z. If the four
2p electrons occupy only two of these three states, with two electrons each, then each
of those two states is full. Then, according to the discussion at the end of Section 10-2,
none of the electrons is available to form a covalent bond. However, if the four
occupied states are, say, 2p:2px 2p y then two of the states each have only one electron,
,
which is said to be unpaired. Such a situation allows the possibility of the formation of
two covalent bonds, one with each of the unpaired electrons. In water this results in
the bonding with two hydrogen atoms. A first-approximation model of this molecule
involves simply the one hydrogen s function overlapping with an oxygen 2p and x
the
other with 2py as shown in Figure 10-12. This model has the bonds at a 90° angle.
Figure 10-12
Methane molecule CH 4 .
> *
.
522 Molecules
The repulsion between the two hydrogen ions causes the angle in the actual molecule
to increase to 104.5°. We have seen this example briefly at the end of Section 9-3.
The bonds in H 2 have a fixed angle. Further examples of such directed bonds occur
in the many compounds of carbon. One such molecule is methane, CH 4 Carbon's .
2 2 2 2
electronic state For a 2p configuration of 2px 2p it seems that carbon
is \s 2s 2p .
should form just two directed bonds like oxygen. However, carbon can bond to four
hydrogens by a kind of trick. One of the 2s electrons goes" into an excited p state so
2
that the configuration becomes \s 2s2px 2p 2pz There are then four unpaired elec-
.
It can be shown that these states are directed toward the vertices of a tetrahedron with
the carbon at the center as shown in Figure 10-13. These four states are completely
equivalent to one another in contrast to the original four functions. The ^ (
of
Equations (10-37) are known as hybrid orbitals . The energy cost of having a 2s electron
move to an excited 2p state is more than offset by the binding energy of the four
hydrogens whose \s functions each pair up with one of the states of Equations (10-37).
It costs 8.3 eV to excite the 2s electron but the binding of the four hydrogens yields an
energy of 25.3 eV.
106 Rotation
In the previous sections of this chapter we have assumed the nuclei are fixed while
solving for the electronic states. Starting with this section we consider how the nuclei
move if we allow the electronic energy found in the previous sections to act as an
interaction potential energy function between the nuclei. In a stable state the
electronic energy has a some internuclear separation R This distance is
minimum at .
scratch.
The time-independent Schrodinger equation for the two nuclei of a homonuclear
diatomic molecule is
- tHv, 2
+ V-/ *(R„R 2 )
+ V(R l2 ) + (R lt R 2 ) = £*(R„R 2 ), (10-38)
2m
where R,, R 2
are the positions of the two nuclei and R x2 = |R, - R2 |. If we separate
10-6 Rotation 523
F — +
2
2
A
— —
h
R 8R
2/x \
I
8R
1
2
8
2
8
—}
R
+ V(R) ^ rel
(^,0,$) = £ re ^, (i?,0,$), el
(10-39)
where R, 0,$ are the spherical coordinates of the vector R 12 is the reduced mass ,
ju
m m 2 /(m + m equal in this case to m/2, and A 2 is the angular operator (Equation
i l
)
(6-12))
2
i d d d
— —— sin0—— +
1
A2 = (10-40)
sin© 50 8@ sirr'0 (9<t)
L>
where Y^ m (S,^) is a spherical harmonic. The operator A" acting on Yim yields
— €({+ 1), where £ = 0,1,2,... with t* the nuclear orbital angular momentum
quantum number. That is, / is a measure of the angular momentum of the pair of
nuclei rotating about an axis perpendicular to the line between the nuclei.
In general, the total angular momentum of the molecule is the sum of the nuclear
plus electronic angular momenta. In the case where the electrons are in a state of zero
angular momentum, the nuclear quantum number f may be replaced by the total
momentum) 2 with the difference being absorbed into the electronic energy V(R). In
either case we end up with the relative motion equation
2
d d
h
27
— -R —
R dR
1
dR 2
2
R2
+ V(R)\F(R) = E^F(R). (10-42)
Given a functional form for V(R) we could now solve this equation for F(R) and
then find the average value of the rotational energy (h 2 /2n)j(j + \)(l/R 2 ). How-
ever, a simpler approach is possible. V(R) has a minimum at R so that the
equilibrium separation of the nuclei, that is, the mean value of R, is approximately
R . The kinetic and rotational energies are usually small enough that they do not
significantly alter this estimate of the mean value. So the molecular rotation energy is
very nearly
2
h j(j+ 1)
Etot i = 0,1,2, (10-43)
2M A*;
The quantity jiR^ in Equation (10-43) is the moment of inertia of the molecule.
E rot
is the energy associated with the rotation of a rigid dumbbell as obtained in
Section 6-5. The energy levels represented by Equation (10-43) lead to spectral lines
whose separation is so small that they are often unresolved and appear as bands, as
discussed in Section 10-8.
524 Molecules
Example
E rol
(1.7
— (1 X 10" 34 J
X 10- 27 kg)(0.7 X 10- 10 m)-
- •
s)
1
2
r = 1.2 x 10" 21 J
or
1.2 X 10 "- 1
J
£ = 7 X 10" 3 eV.
rot
1.6 X 10-',„9 J/eV
which can be shown (see Problem 16 at the end of the chapter) to be of order
h
2
=
M
— h h
2
E. = (1.8 X 3
10 )(7 X 10
3
eV) = 13 eV,
''
m R2
e
m M R t p
2
Q
where m and r
M are the electron and proton masses, respectively.
10-7 Vibration
The problem of the relative motion of two nuclei in a diatomic molecule (as described
by Equation (10-42)) is that of a particle in a potential well, V(R), having a
minimum. If the well is deep and narrow, it is reasonable to assume that the nuclear
rotation is the most easily excited type of motion. However, we know that a particle in
—
any well for example, in a parabolic well vibrates about the position of the —
minimum, even in the ground state. The harmonic oscillator treated in Section 5-5 is
our prime model in this regard. We now consider molecular vibrations and show this
problem can usually be reduced to the case of a one-dimensional harmonic oscillator.
Returning to Equation (10-42), we write
where E vib is the vibrational energy that we want to determine and E rot
is the
rotational energy of Equation (10-43). We then have
2
dF\
h
- — d
—\R
R dR
1
2
I
2
—\
dRj
+ V{R)F(R)
K V
= E v,b
vlh
F(R).
V
'
'
'
2n \
In analogy with the hydrogen-atom problem we let U{R) = RF(R). This results in
10-7 Vibration 525
Figure 10-14
the equation
ti
2
d 2 U(R)
+ V(R)U(R) = E vib U(R). (10-44)
2/i dR 2
(10-45)
where A, a, and R are parameters varying from molecule to molecule. The form of
this function is basically just an intelligent guess, as in the cases of Equations (10-24)
and (10-36), but it does do an adequate job of correlating many molecular properties.
For only a very few molecules, notably H 2 can accurate theoretical potential energy ,
potential energy falls off with distance and curvature of the well.
For small oscillations about equilibrium such that the nuclear separation does not
deviate very far from R ,
it is reasonable to approximate V( R) by a parabola centered
on R Such a parabolic potential energy
. is shown as the dashed curve in Figure 10-14
and is denoted as V(R). We can write
where the constant K is to be determined so that V(R) and V(R) have the same
curvature at R . This means the second derivatives of V(R) and V(R) have to be the
526 Molecules
same:
V"(R ) = K= V"(R ) = 2
2a A. (10-47)
2
h d 2 U(R) 1
"2^"^" +
2
K{R ~ Ro) U{R) = ^ vib ~ F(*o)M*)-
2 2
ti d u{y) 1
-" 2
y-r- + -Ky u{_y) = eu( y). (10-48)
This equation is almost the Schrodinger equation for simple harmonic motion, as
treated in Section 5-5. The one small difference is that the radial coordinate R is
restriction because if the particle motion had such an amplitude that it got to
invalid anyway. The gaussian wave functions for the low states of the harmonic
oscillator restrict y to avoid this forbidden region and we can simply forget the
restriction. On the other hand, the highly excited states in the oscillator well have
much wider amplitudes than the ground state and they should not be expected to be
reliable approximations to the true states of V( R ).
The radial Schrodinger equation with the exact Morse function in Equation
(10-45) can actually be solved analytically. However, the mathematics involved is
sufficiently complicated that we lose sight of the simple physics of the vibratory
motion if we attempt to review the exact solution here.
The energy eigenvalues derived from Equation (10-48) are just those of the
harmonic oscillator (see Equation (5-46)) with quantum number v:
e„= fiLo (v + l
2 ), v = 0,1,2,...
where
u (10-49)
We therefore obtain
These are the energy levels corresponding to oscillations in the length of the diatomic
molecule, in the approximation in which we think of the two nuclei as being
connected by a stiff spring. For highly excited states this approximation fails and the
Morse or other appropriate potential energy must be used to find Evib .
Example
K= 2a A
2
= 2(27 nm"' )"'(5.2 eV) = 7600 eV/nm 2 = 1.2 X 10
3
J/m 2
Note that the reduced mass used is one-half the oxygen atomic mass. This
vibrational energy should be compared with typical electronic and rotational
3
energies of about 10 eV and 10 eV, respectively, as computed in Section 10-6.
Example
The dissociation energy D, the energy to pull the molecule apart, is approxi-
mately A, the depth of the potential energy well. This is not exactly right
because the molecule has zero-point vibrational energy i,hu . Thus, we have
L
D ,hco .
For O, we obtain
10-8 Spectra
molecule is more complex than an atom, molecular spectra are often more
inherently
complicated than atomic spectra. From the spectral lines the various effects caused by
electronic, vibrational, and rotational transitions must be disentangled. The energy
scales of these three types of transition are quite different, as we have seen above, with
typical values being of order 10, 10" ', and 10 3 eV, respectively. Because the
rotational energy is so small, the spectra often appear as bands of very closely spaced
lines, the most noticeable characteristic of molecular spectra.
We examine three main spectral types: pure rotational, rotation vibrational, and
electronic. To
keep the discussion simple only those transitions between molecular
states having the angular momentum quantum number A = are considered. Our
analysis is therefore not quite as general as possible but does illustrate most of the
essential elements.
528 Molecules
Figure 10-15
V~i>=2
v= 1
where
B = (10-52)
2fiRl
Figure 10-15 presents a schematic illustration of transitions involving all three terms in
Equation (10-51).
The first transition to be considered is a pure rotational change of state. Such a
change is shown in Figure 10-15 as I. Because of symmetry considerations, a pure
rotational transition, that is, onewhich j changes but the electronic and vibrational
in
For absorption spectra j — > j + 1 and from Equation (10-51) the frequencies are
given by
Figure 1 0-1
Spectral form for a pure rotational transition as I in Figure 10-15. The lines are equally spaced
with energy interval 25.
-2B-
-^- hv
2B AB 6B SB \0B 1ZS
0-1 1-2 2-3 3-4 4-5 5-6 y'-y + 1
In both cases the frequencies vary linearly with j, which means the lines are evenly
spaced with frequency separation
2B
Ar = (10-54)
h
hv p = Au (l + L
2
)+B(j- 1)>- Lhu -Bj(j+ 1)
As indicated by the subscripts on the v 's, these spectral lines are as the R and
known
P branches. Each of these branches gives a group of lines much due to the pure
like that
rotational transitions except that the frequencies are in the range about hco so that ()
530 Molecules
Figure 10-17
Spectral form for a vibrational- rotational transition. There are two branches with a missing
line at hu f)
. The corresponding transition in Figure 10-15 is II.
P branch R branch
-> hv
7H
Figure 10-17 shows how the vibrational -rotational spectrum splits into the two
branches in accord with Equations (10-55) and (10-56). Note that there is a gap
between the two bands at the vibrational frequency go /27t. This exists because
A7 = is not allowed and because in Equation (10-56) j = does not occur. (That
would correspond to the transition j = to j = — 1.)
Transitions in which there is an electronic change of energy are represented by III
in Figure 10-15. The transition frequencies can again be deduced from Equation
(10-51). However, now note that co and B are not generally the same for the initial
and final electronic states. B depends on the molecular moment of inertia and hence
on the interatomic separation R which varies with the electronic state. In Figure
,
10-15, for example, the minima in e, and e., occur at different values of R. The
vibrational frequency w depends on the curvature of the electronic energy in the
neighborhood of its minimum. (See Equations (10-47) and (10-49).) The curvature is
also expected to be different for different states. For absorption the change in
hv. es ,
s
+ h^{v'+ 1) -ha (v+ i), (10-57)
an R branch corresponding to j
'
— j + with energy changes
> 1
Because B =£ B' the quadratic terms in j no longer cancel out and the spectra are
not made up of evenly spaced lines as in the previous situations. We can write
Figure 10-18
Spectral form for an electronic transition. The electronic- vibrational transition energy is kvev
The band due to rotational transitions has a band head at the left (red) and degrades to the
right (blue).
-> hv
IBand head
hr,
Figure 10-19
Pure rotational absorption spectrum of HC1 gas. Note the even line spacing as indicated
schematically in Figure 10-16. The frequency axis is v = v/c in units of cm~ .
20 40 60 80 100 120 140 160 180 200 220 240 260 280 300
Hem" 1
Figure 10-20
r n 1
R branch P branch
532 Molecules
Figure 10-21
Electronic spectrum of the molecule AlO. See Figure 10-18 for a similar schematic version. Note
the bands caused by the small spacing of the rotational states.
and
region the lines are also more closely spaced than at high j values; at high j the
spacing becomes greater because of the increasing influence of the j 2 term. A
schematic drawing of this spectrum is shown in Figure 10-18. The band is said to be
degraded to the blue and to have a band head (the
(i.e., the high-frequency end)
minimum) at the red (low-frequency) end. If B' band head is at the blue end < B, the
and the degradation is toward the red. The details of just how the spectrum comes to
look like that of Figure 10-18 are given in the example below. Such a band appears
for each possible v -» v' transition. The spectra appear as a series of adjacent bands.
Examples of real spectra of the types discussed in this section are shown in Figures
10-19, 10-20, and 10-21.
Example
To see how the frequencies in Equations (10-59) produce a spectrum like that
shown in Figure 10-18, rewrite the equations in the form
yR {j) = 10
2
| 1) ={b' - b)j
2
+ (3b' - b)j + (2b' - b) (10-60a)
and
yp {j) = 10
2
—
''/
-!=(&'- b)j
2
- (b' + b)j, (l0-60b)
where
W 2
B 2
10 B'
b = and b' = .
Then let b and b' take on the typical values b = 3, b' = 4, so that
}'r(
2
j)=J + 97 + 5 and yP { j )
= j2 - Ij.
Problems 533
Figure 10-22
Frequency functions yR {j) and yr (j) of Equations (10-60) for 6 = 3 and b' = 4. The y values
corresponding to integers are the allowed frequencies and are indicated by arrows on the y axis.
The band head is at the left end corresponding to the minimum in y,,.
dyr
= 2j - 7 = at j = \ = 3.5.
dj
Problems
sufficiently large, good approximations to the eigenfunctions for the two lowest states are
*±"
jf(*«±*-«)'
where
.
2«\''/4
(a) Given the above approximate eigenf unction, show that the energies ? + of the two
lowest states can be written in the form of Equation (10-7), where
G= ftf_ a
V(x)dx and S = jV( *) £_ a
dx
with F= -/:(* + |jc|)«, and where eu is replaced by e = t#w > the ground state
of a single oscillator,
(b) Evaluate S and show that the separation between the two lowest states is propor-
2 "" 2
tional to e" .
The ammonia molecule NH ., has a tetrahedral structure as shown in Figure 5-25. The
nitrogen atom has two equivalent positions either on the right of the plane of the 3
hydrogens or in the "inverted" position on the left. It can tunnel between these two
positions and so has a wave function analogous to that of the H.J electron or to the
particle in Problem 1. The splitting between the two lowest energy levels is e_— e+ =
r
9.8 X 10 '
eV.
(a) Find the time it takes the nitrogen atom to invert.
(b) Use the potential energy of Problem 1 as a model for that seen by the nitrogen
atom. Take a = 0.038 nm. What is the height V of the potential barrier separating
What is the electronic ground-state eigenf unction of He + ? Show that \p+ of Equation
(10-3a) does not reduce to it for R —» 0.
(a) By what constant should the eigenf unction of Equation (10-19) be multiplied in
(b) For H 2 , y is about 0.2. What is the probability of both electrons residing in a single
nucleus?
(b) Express this eigenfunction in terms of the Is atomic orbitals and compare with the
form of Equation (10-19). In particular, determine the value of y that governs the
(c) What is the probability of both electrons residing on a single nucleus in this
approximation?
In the binding of HJ a portion of the electronic charge is shifted from the regions
immediately around the nuclei to a position in between. A simple model of this is shown in
Problems 535
the figure. The total charge in the system is +e. What portion of electronic charge Se
must be placed in the intermediate position for binding to result? [Hint: Note that binding
implies a negative total energy of the system.]
7. Other molecular orbital combinations, besides that in Problem 5, may be formed out of
products like 4' + (r )ip_(r2 l
) and 4'-( r \ )4'-( r2) times singlet or triplet spin functions. Five
additional functions, all representing excited states, may be formed. What are these
functions? Be sure they are all antisymmetric in electron coordinates. By writing these out
in atomic functions and examining the form of the covalent part of each function,
determine which are bonding and which are antibonding.
8. (a) The ionization potential for hydrogen is 13.6 eV. The electron affinity for fluorine is
0.8 eV. Evaluate the Q_ value of Equation (10-27) for HF. Compare this with the
value for LiF as worked out in Section 10-3. From this result do you expect HF to
(b) The equilibrium separation of the two nuclei in HF is R = 0.092 nm. The
experimental value of the electric dipole moment of this molecule is p = 0.6X
10 C •
m. Calculate p/eR. Does the result here confirm your answer to the
question in part (a)?
(c) Determine the coefficients A and C (assume 5 = 0) for this eigenfunction by use of
the value of p/eR found in Problem 8(b).
determine the parameters a and a in Equation (10-24), and find the dissociation
energy D. How does your result compare with the experimental value of 4.22 eV?
(b) Find the zero-point energy correction to the calculation of D.
11. As a classical model of atomic polarizability consider a positive point charge +e (the
(a) Show that if the nucleus is displaced by a distance z from the center of the sphere it
(b) Show that if this system is put into an electric field E it develops a dipole moment
given by p = aE with a = 4fre a
i
.
12. Show that a hydrogen atom placed a distance R from H* has a potential energy U of the
form U ~ — I^a/R)* where , /n and a are the ionization potential and the atomic radius
of the hydrogen atom, respectively. How should the phenomenological potential energy,
Equation (10-24), for ionically bound molecules be modified to take account of this effect?
13. Show that the Lennard-Jones 12-6 potential of Equation (10-36) has a minimum of —e at
15. The Lennard-Jones interaction between a carbon atom and a helium atom is given by
Equation (10-36) with n = \2, f = 1.5 X 10 eV, and a = 0.3 nm. Suppose a helium
atom is at position r = (0,0, h) above the flat surface of a piece of solid carbon. The
origin (0, 0, 0) is on the surface and the carbon solid extends from — oo to oo in x and y,
and — oo to in z. The carbon density is n .
(a) Give a brief argument that shows that the helium atom has potential energy
£/(r ) = n /^Lj(l ro
~~ r l
) dr, where the integral extends over all the carbon, which
is taken as continuous.
(b) Carry out the above integral and show that U has the form
a b
(c) Find the value of the helium position h for which U is a minimum when the carbon
density is n = 113 particles/nm
!
. What is the potential energy function depth
there?
(d) To what value must the temperature be lowered for a layer of helium atoms to
begin forming on the surface?
16. In the example at the end of Section 10-6 the expression £d = h^/m R^ f
is used to
estimate typical electronic molecular energy. Justify this formula by use of (a) the energy-
17. The Born-Oppenheimer approximation, the basis of studies of molecules, claims the
nuclear motions are so much slower than electronic that the electronic energies can be
computed for fixed nuclear positions. Compute the orders of magnitude of the periods of
rotational and vibrational nuclear motion for H , and compare them with the period of an
electron in a Bohr orbit. The Morse potential parameters for the H-H interaction are
EmX , and £vib are electronic, rotation, and vibrational energies, respectively, m is the
electron mass, and M is a nuclear mass. Evaluate these relations numerically (with
E ~
rt
10 eV) and compare with the estimates given in the examples at the ends of Sections
10-6 and 10-7.
19. (a) By determining the curvature K at the minimum of the Lennard-Jones 12-6
potential, evaluate the harmonic oscillator approximation \fiuQ to the zero-point
energy of the argon molecule Ar>. Compare this value with the depth of the
potential and estimate the binding energy of Ar,. See Table 10-1.
(b) Repeat the above for the He_, molecule. What does your result suggest about the
possibility of ever forming a He_, molecule?
20. The Morse potential for two interacting hydrogen atoms has the parameters given in
Problem 17. From this compute the vibrational frequency wn and the dissociation energy
22. The separation between spectral lines in the pure rotational spectra of HC1 is 6.35 X 10"
Hz. Use this information to find the internuclear spacing Ru .
23. Draw a diagram analogous to that of Figure 10-22 for the case b = 2.5, b' = 2. Is the
band head to the red or blue? In which direction does the degradation take place?
24. Suppose you are a molecular spectroscopist and you have collected the data shown in
Figure 10-18. The values of the first few y's (defined in Equation (10-60)) are —4.4,
-4.2, -3.4, -3.0, -2.0, - 1.4, 0.8, 1.6, and From these data
3.6. find the values of the
molecular rotation constants b and b' . [Hint: What value of the initial state quantum
number j corresponds to the band head at y = —4.4?] Give the values of the j's
corresponding to each of the quoted transitions and tell to which branch each belongs.
ELEVEN
QUANTUM
STATISTICAL
PHYSICS
billion times any result not very close to an equal number of heads and tails (within
~ 10~ 3 %) is quite surprising.
The use of a billion coins to determine the probability of tossing a head or tail is an
example of the ensemble method of determining the averages. This technique is
introduced and used in the present chapter to study the thermal properties of
collections of particles having three different kinds of probabilistic behavior: those
obeying Maxwell-Boltzmann statistics, Bose-Einstein statistics, or Fermi-Dirac statis-
tics.
538
//-/ Particle Indistinguishability 539
after the collision. However, for microscopic particles, we now realize that the
uncertainty principle makes it impossible to figure out which particle was incident and
which particle was target. This feature of particle indistinguishability implies that all
interpreted as describing the energy of n photons each of energy hv. This interpreta-
tion is consistent with Einstein's treatment of the photoelectric effect. Photons are thus
found obey Bose-Einstein statistics.
to
Blackbody radiation certainly did not provide the only evidence for the need to
modify classical mechanics. The classical theory of solids predicted the heat capacity C
to be independent of temperature. While this result held for high temperatures, at low
temperatures C was found to drop toward zero. Einstein explained this phenomenon
by modeling the solid as an independent set of quantized oscillators. This model was
improved upon in 1912 by Debye who considered the solid as a set of particles
coupled harmonically to one another. This theory, which gave the correct low-temper-
ature behavior, has turned out to be remarkably similar to that of blackbody
radiation. Indeed, the concept of massless Bose-Einstein particles called phonons
traveling through the crystal lattice has proved to be very analogous to the photon
idea.
Fermi and Dirac are responsible for the development of the statistics obeyed by
electrons, protons, neutrons, and a host of other particles. The statistics forms the basis
of the Pauli principle according to which one Fermi-Dirac particle is allowed per
quantum state. The thermal properties of electrons in atoms and metals, protons and
neutrons in nuclei, the atoms in liquid ''He, and even the nature of white dwarf stars
can be understood with Fermi-Dirac statistics, as described in this chapter.
If two identical particles A and B pass within a de Broglie wavelength of each other,
then after the encounter we are unable to tell which of the two particles is A and
)
>«(D^(2) (11-la)
M 1 '2) = \
<Ml)* a (2) (11-lb)
2
V, + V(i) ^(0 = ^„(0,
2m
as in Equation (9-15).
We know from Chapter 9 that particles with half-integral spin values (\Ti, |A, . .
.
have antisymmetric eigenfunctions and are known as fermwns. These include electrons,
protons, and neutrons, as well as composites of odd numbers of particles having
3
half-integral spin such as He atoms, and others. All these particles are said to obey
Fermi-Dirac statistics.
Figure 11-1
Intuitive description of the terms in the probability density Equation ( 1 1 -6). The coefficient g
has the value g = 1 for bosons, g = - 1 for fermions, and g = for Maxwell-Boltzmann
particles.
2 +
l* ai8 (l,2)l
=
}t*a(l)* a (l)*p(2)* /
8(2) ^(D^(D*o(2)* a (2)
©
Let us now examine more closely the properties of these two-particle eigenfunc-
tions. The measurable quantity, the probability density, is, for both cases simulta-
neously,
2
1^(1,2) |
= ^(1)^(1)^(2)^(2) + ^(1)^(1)^(2)^(2)
distinguishable with, for example, particle 1 painted blue and particle 2 painted red.
The new effects due to symmetry are contained in the third and fourth terms, the
quantum interference or "exchange" terms. The third term has particle 1 starting out
in /? and particle 2 in a. However, after the encounter the two switch places so that 1
is in a and 2 is in ($. The last term describes a similar process with 1 and 2
interchanged in roles. We have already seen how such interference terms can affect
the energy of a system in our treatment of the H 2 molecule in Chapter 10. In Problem
2 at the end of the chapter we consider some further aspects of this phenomenon.
Particles of the same species, yet somehow distinguishable, are described by either
of the Equations (1 1-la) or (1 1-lb). We can tell the difference between the two forms;
for example, the blue particle is in a and the red is in /?, or vice versa. Such particles
do not actually occur in nature but are useful conceptually. They are said to obey
542 Quantum Statistical Physics
difference between the two forms in Equations (11-la) and (11 -lb), and still have the
Maxwell-Boltzmann case in Equation (11-6) while keeping both with equal ampli-
tudes, as long as we do not let the two forms interfere.)
We
can see when quantum effects become important by considering the pair wave
function in the case that 4> a{i) and \pp(i) are localized, or peaked up at some position
in space. We assume each of ^ a (l), ^(1), ^P a (2), and ^(2) has the same spatial
width d. It is fairly obvious that the appropriate de Broglie wavelength with which to
characterize such a localized state is A ~ d. (This can be proved but we are not going
(11-6) vanish; that is, where ^„(1) is large, ^(1) is zero, and vice versa.
Maxwell-Boltzmann statistics is then valid. On the other hand, if A > a the single-
particle eigenfunctions overlap, the interference term is nonzero, and quantum effects
are present. In Section 11-3 we introduce the thermal de Broglie wavelength A th ,
^(1,2) = 0.
Fermions are antisocial. Bosons, on the other hand, are gregarious. They actually
show a preference for being in the same state or place when compared to distinguish-
able particles. To see this take both particles at the same position, that is, r L
= r2 = r,
2 2
k a/J
(r,r)| =|^(r)r|^(r)| (l+ 5 ).
For fermions g = — 1 and this expression vanishes as one expects. For the
Maxwell-Boltzmann case g = and (1 + g) = 1, while for bosons (1 + g) = 2. This
preference that bosons have for being in the same place (or state) is important in
4
understanding several physical phenomena including the superfluidity of liquid He.
As by the Slater-determinant eigenfunctions of Section 9-5, it is possible
illustrated
to generalize our symmetry principles to functions for more than two particles. To get
the boson version of the Slater determinant, expand it into a series of terms and
change all the minus signs to plus signs. The result is a function symmetric under the
interchange of any pair of the variables.
Example
Suppose we have three particles in three states a, ft, y. The fermion version of
the eigenfunction for this case is written in Problem 8 of Chapter 9. The boson
/ 1-2 Thermal Distribution Functions 543
+ ^(1)^(2)^(3) +^ a (l)^(2)^(3)
+ ^(l)^ y (2)^(3)+^ y (l)^ a (2)^(3)}.
(In the fermion version the second, third, and fourth terms are negative.) The
indices a, /?, y on the right appear in all possible permutations of three quanti-
ties so it is not surprising that, when two variables are exchanged, or when any
possible permutation of the three variables 1,2,3 occurs, the eigenf unction is
unchanged.
A distribution function n t
gives the average number of particles to be found in an energy
state e, at temperature T. If a system is heated(T raised) particles are redistributed to
higher energy states and « ;
is altered. We also expect n t
to depend on the nature of
the particles. For fermions n t
must be zero or one because of the Pauli principle; for
bosons n i
behaves in a manner consistent with the particles' tendency to be in the
same state.
2 somewhat before we can apply them to quantum particles. The need to do this
provides us with the opportunity to look more closely at the ensemble concept that is
at the heart of all statistical reasoning.
In the introduction to this chapter, we mention the flipping of a coin a very large
number of times to determine the probability of finding a particular head or tail
forexample, a smoker or has had a heart attack. The ensemble consists of people of
known medical history all of whom have died so that an average age of death may be
computed. Whether our individual dies much earlier or much later than the average is
not terribly important to the finances of the company. On the average they are going
to collect sufficient premiums to cover both possibilities.
The last sentence tells us that a statistical analysis gives information not only about
average values but about fluctuations about those values. An instructor may construct
a bell-shaped curve by using the grades on an exam; the curve is described by a width
as well as a mean or most probable value. An interesting aspect of physical systems
containing very many particles is that fluctuations of most variables about their
averages are so small that we need concern ourselves only with computing the mean
values.
We are interested in the behavior of a system composed of N particles. To measure
any quantity, such as pressure, we put a gauge on the system and follow its readings
over a sufficiently long period. A
same quantity might in
theoretical calculation of the
principle be carried out by solving the time-dependent many-body Schrodinger
equation and following the evolution of the wave function. This is extraordinarily
difficult; what is done instead is to examine the average behavior of a great number of
similarly constructed systems at one single time. There are reasonable assumptions
built into the latter approach which make it much the simpler procedure.
Suppose our system is a box containing a gas of N particles. We assume there are
energy levels e,, e
2 , e3 , . . . for each particle in the box. This assumption is usually
possible only if the particles are nomnteracting. The energy levels are, at any one time,
occupied by n ,, ™ 2 « 3 , ,
particles just as in the notation of Section 2-3. To find the
average values of the n i
a large number M of apparently identical boxes, each with N
particles, is examined. In accord with the fact that it is impossible to follow individual
particle motions, no attempt is made to select M systems all with identical sets
Figure 11-2
Ensemble of M identical systems. Each system contains an average of TV particles and has
single-particle energy states e,, e2 , e3 , .... Any state e, of system y contains ny' particles. The
total number of particles at level e, for all the members of the ensemble is T.
y
n\
y) = N,.
3
_ y M
System number >
/ 1-2 Thermal Distribution Functions 545
differfrom system to system because of the random variation of the states of particle
motion from system to system.
If each system has a total of exactly particles the sum over energy levels (a N
vertical sum in Figure 11-2) yields
oo
£ n^ = N.
i= i
However, in the following derivation we are going to relax this condition somewhat.
The particular procedure we use results in an average of N particles per system, with
fluctuations about the average that are quite negligible. With this in mind we replace
the last equation with
00
£ n\
y) = N {y)
,
(11-7)
where N <- y)
is the actual or instantaneous number of particles in system y. Since the
average value of jV (y) is N in each of the M systems, the following relation holds:
M
£N {y) = MN. (11-8)
y=l
If we consider a given energy level e, and sum horizontally in Figure 1 1-2, the result
we have
M
X>! T) = ty- (11-9)
y=\
The quantity of interest is the mean of the n\ y) averaged over all the systems of the
ensemble. This average is denoted simply by n without the superscript y and :
is
n, = — (11-10)'
'
M V
Another important attribute of each system is its total energy. Just as in the case of
particle number we relax the restriction that each system have exactly energy E and
assume only that it has that value on average. System y has actual energy E iy) so that
E" (
,
Y)
e, = £ (Y \ (H-ll)
which is another vertical sum in the scheme of Figure 11-2. Since the average value of
E {y) is E in each of the M systems, we have
N
£,E™ = ME. (11-12)
y=l
(i) Maxwell - Boltzmann Statistics Let us begin by reproducing the Maxwell-Boltzmann results
of Section 2-3 in order to illustrate the method. systems each with an average of A* M
particles imply a total of NM
particles. We distribute them all throughout the entire
ensemble without worrying about getting exactly of them into each system. The N
number of particles in all energy levels £, throughout the ensemble is there are X, N x
;
particles in all levels e 2 and so on. We want to count the number of ways to distribute
,
N t
particles horizontally in level e t
across the ensemble. The first of the N t
particles
can be put in any one of M
systems so that there are ways of placing it. There are M
also M
ways for the second particle. For each of the ways of placing the second
particle there remain the original M
ways for the first so that there are 2
ways of M
positioning the two particles in the M
sites. One of the ways involves, for example,
two particles in level e of system 1. The method counts this situation only once; an
;
interchange of those two particles does not produce a new configuration even when
they are distinguishable. For the third particle placed in the ensemble on a level e,
W£fi = MN >.
The total number of arrangements of particles on both levels £, and £ is the product
W^W^ J)
. If this process is repeated for all energy levels the number of all possible
horizontal arrangements is found to be
w h
= w^wpwp .
horizontal interchanges on each of the levels e,; these have already been considered in
the calculation of h
W
To remove them from the total number we must divide by the
.
number ways of interchanging N particles on level e,, given by A7,!, and the
of x
number of ways on level e 2 given by N2 \, and so on. It follows that the total number
,
of vertical or diagonal interchanges that can be made to obtain new microstates from
one of our original configurations is
(MN)l
Ko N \N2 \N3 \
X
Figure 11-3
Ensemble with three Maxwell- Boltzmann particles in two systems. Each system has two energy
levels with TV, = 2 and N, = 1 (In this example each system does not contain the same number
.
of particles.) The particles are distinguishable and so can be labeled 1, 2, and 3; microstates 2
and 3 are then distinct. There are 24 possible configurations of which four are shown. We can
generate other configurations in two ways by interchanging particle 1 with 2 or with 3 for
3X4=12 microstates. The other 12 microstates are found by putting any single particle in
energy level e , of system 2.
© 0) © ©
® ® (D © ©©
System 1 System 2 System 1 System 2 System 1 System 2 System 1 System 2
Since there are W vd new configurations for each original one previously considered,
the total number of microstates that can occur for a fixed set of the numbers
Nu N N 2 ,
3, . . . is the product of W h and W vd :
AT' M N;
W= W W vd
= W vd Wi Wt
l) 2)
= {NM)\
h
N x
\ N 2
\
M'
W= (NM)\]~[ (Maxwell-Boltzmann), (11-13)
•V
of particles in each system; that condition has been relaxed as mentioned above.
Equation (11-13) tells us that the total number of configurations is
2 1
2 2
3! 24.
2! 1!
As in Section 2-3 we find it easier to maximize In W rather than W itself. This process
gives the same result because the logarithm of any function has its maxima and
minima same locations as those of the function itself.
in the
Unfortunately, we cannot maximize In directly because not W all the Nt
's are
independent variables; there are two relations involving the N t
's. One such constraint
A 1 .
is that the sum of all particles on all energy levels is the total number of particles in
the ensemble:
oo oo M M oc M
E^ = E=
1=1 i 1
E"! T) =
y= 1
E
y= 1 i
E»! Y) = I,N {y) = MN,
= 1 y= 1
(ll-14a)
which follows from Equations (11-7), (1 1-8), and (11-9). Similarly, the total energy of
the ensemble is given as
oo oo A/ M oo M
£.*M= E^,E"! Y, = E= Ev4 Y) = L£ {y)
= ME, (n-i4b)
/=1 i =1 y= y
= l' l
Y
= '
which makes use of Equations (11-9), (11-11), and (11-12). By using Equation (11-10)
we can put these constraints into a slightly different form for later use:
N= T:E^=E", (H-15a)
M
and
F(N ,N2{
a, fi) = \nW - d^N, - NAf) - fil Ec.-ty - EM ] ,
where a and /?, often called Lagrange multipliers, are as yet undetermined. We then
apply the conditions
3F
--0 (11-16,
F= ln( NM\) + £ {
r
,ln M - A> A[ + N, - aN, - (3N,e, } + aNM + (SEM.
11-2 Thermal Distribution Functions 549
dF
0= = \nM -InN,- a - fie r
dN
M .
Z-E*-**, (11-18)
then
AT
e~° = — (11-19)
Putting this back into Equation (11-17) gives the result for the most probable N t
value,
MN
N, = e-K.
Z
AT
»,.= —*-* (Maxwell -Boltzmann), (11-20)
(ii) Fermi - Dirac Statistics We again suppose our MN particles have been spread over the
levels so that N i
are in all the e ,
levels, N.> in all the e levels,
2
and so on. We ask how
many ways a horizontal dispersal of Nt
particles can be made. Since these particles
are fermions, no more than one can be put in a single energy level of one system.
Obviously, we must have > Nr M In any configuration we have N filled levels t
(particles) and M
— N empty onest
("holes") as shown in Figure 11-4. To develop
new and holes.
microstates interchange particles The number of ways all objects M
can be interchanged M\; however, this involves interchanges of particles with
is
particles and holes with holes which do not generate distinct microstates for indis-
tinguishable particles. Consequently, we must divide by the number of interchanges of
each of these quantities, given by, respectively, A^! and (M — A',)!. We find then that
>
Figure 11-4
Possible configuration of N:
Fermi-Dirac particles in M systems on energy level e t
. A new-
configuration can be generated by interchanging a particle and a hole (empty
12
_o o_ _o
3
;;;
N,
o_
level).
E .
'
1 2 3 4 -
M-\ M
System number —
Ml
ryfo =
an expression that the reader may recognize as the coefficient of the general term in
the formula for the binomial expansion.
In the Maxwell -Boltzmann case the next step is to consider vertical or diagonal
rearrangements. However, here the particles are indistinguishable so that an inter-
change of a particle with energy e, with one of energy e does not produce a new state. •
Ml Ml
W=
N X
\(M - N x
)\ N l(M- N2 2
)l
^=11
„ Ml
;
r (Fermi-Dirac). (11-21)
Figure 11-5 illustrates this counting scheme for the example of M= 2, TV, = 2,
N 2
= 1. Equation (11-21) gives
2! 2!
= 2.
0!2! 1! 1!
In this case there are only two ways of interchanging particles to generate distinct
microstates.
Next we maximize In subject W to the constraints. From Equation (11-21) and the
Stirling approximation we find
F = £{lnM!-(Af- A0ln( M - N, ) + (
M - N, )
- N,\n N,
dF
=
dN
= ln(v M-N ,))
- In N - a - fie,
/ 1-2 Thermal Distribution Functions 55!
Figure 11-5
o o
o o o o
System 1 System 2 System 1 System 2
Microstate 1 Microstate 2
so that
M- N,
a
= e e^
N
The solution for the most probable value of TV, then gives N /M as
t
' pt
(Fermi-Dirac). (11-22)
„<»„P£
e"e -
+ 1
a p *'
The quantity e e is nonnegative for all values of the parameters. This property
leads to a result expected for fermions, namely, that for all energy levels £,,
«,< 1.
Instantaneously, no state can hold more than one particle; on the average, there is
Now that we have gone through the procedure of deriving the fermion distribution
function it is possible to see why the simpler method used in deriving the
Maxwell-Boltzmann distribution in Chapter 2 could not have been used here. That
derivation assumes that n t
\ can be represented by the Stirling approximation, which
requires n !
^> 1 for its validity. This condition does not hold for fermions. The
derivation presented here requires only that jV, » 1, which can be satisfied for
particles of by making
any large enough.
statistics M
a
The constant e in Equation (11-22) is not as easy to eliminate as in the
Maxwell-Boltzmann case. For this system a has important physical significance, as
we see in Section 1 1-6. Nevertheless, it is still related to the particle-number condition
— !
Figure 11-6
Possible configuration of N, Bose- Einstein particles in M systems on energy level e,. New
configurations of indistinguishable particles can be generated by interchanges of the N :
particles
and M— 1 partitions.
• •
6
'66ol 61 ooool lo-l —
System number
I
12
,
1 t^l
^Partition
I
3
,
1
1^1
4
l^J
5
...
K= E", = E ,^p ,^ t ,
(Fermi-Dirac). (11-23)
In certain limiting cases this equation can be solved analytically and a can be
expressed in terms of N, as we show in Sections 11-3 and 11-6.
(iii) Bose - Einstein Statistics In this case we are concerned with indistinguishable particles
but are not limited to assigning just zero or one particle per energy level. Horizontal
counting is a bit tougher here, but there is a trick that can facilitate the task. A
possible configuration of N l
particles among the M systems is shown in Figure 11-6.
The systems are separated from one another by partitions. We generate all configura-
tions by considering all possible interchanges of the partitions and particles shown in
the figure. For example, if we interchange the second particle and the first partition
we get a new microstate having one particle in system 1 and three in system 2. The
number of ways of making interchanges of all N particles and all — 1 partitions is t
M
(N + t
— M
1)!; however, this includes the N interchanges of particles among them- t
\
selves and the (M — 1)! interchanges of partitions among themselves. To get the
correct count we must divide out these interchanges. It follows that the number of
horizontal configurations on the th energy level is /
{N, + M- 1)!
W U) =
N,\(M - 1)!
Again we need not consider interchanges of particles on different energy levels for a
given N,, N 2 , . . . because of particle identity. So the total number of states is found by
1
taking the product of all the Wf '
to get
oi
„(N;'+M-
— 1 )
Figure 11-7
Ensemble with three Bose- Einstein particles in two systems. Each system has two energy levels
with jV, = 2 and Af, = 1. The particles are indistinguishable but more than one can be in a
state. Three of the possible configurations are shown. The other three are obtained by changing
the single particle in e, from system 1 to system 2. Note that an interchange of the
two e, particles in the middle configuration does not generate a new state, unlike the
o o o
E, -
oo o o oo
System 1 System 2 System 1 System 2 System 1 System 2
If we now repeat the maximization procedure with use of Equation (1 1-24), we find
n. = (Bose-Einstein). (11-25)
e"e^ 1
(The proof is left to Problem 8 at the end of the chapter.) The Lagrange multiplier a
is again determined by the condition in Equation (1 l-15a) so that the average number
of particles in each system is given as
*—> „«„/3f,
(Bose-Einstein). (11-26)
1
To see the effect of statistics on the distribution function in this case let us consider
the lowest energy level, which we can take to be e, = by adjusting the arbitrary zero
of energy. Equation (11-25) gives
(11-27)
e
a - 1
Since n x
is number it must be positive; Equation (11-27) then tells us that
a particle
the parametera must be positive for a Bose system. It turns out that at a sufficiently
low temperature a becomes quite small and n then increases dramatically. This {
sudden congregating of particles into the lowest state is called the Bose -Einstein
by Einstein alone). This phenomenon is a vivid example of our
condensation (discovered
claim that bosons prefer to be in the same state, and is used in Chapter 1 3 to illustrate
some aspects of the superfluid behavior of liquid He at low temperatures.
.
We end this section by stressing the limitations of our results. The derived
distribution functions are useful in treating independent, or noninteracting, entities
(e.g., elementary particles, atoms, molecules, phonons) that have well-defined single-
particle energy states. Many (perhaps most) systems are made up of particles that
interact with one another so that single-particle energy eigenvalues are not identifi-
able. There are procedures for dealing with the thermal properties of these systems;
however, we do not treat them here.
There are circumstances in which quantum indistinguishability effects are not particu-
larly important. Our discussion of eigenfunctions in Section 11-1 shows that this
occurs when exchange terms (as in Equation (11-6)) become small. The special case
given there, of localized single-particle functions each with de Broglie wavelength
small compared to the interparticle separation, illustrates one way
this could happen.
Such a situation actually arises in a tightly bound solid where strong interatomic
forces hold the particles close to their lattice positions and the eigenfunction for any
given particle is highly peaked.
For a system of independent particles like those described by our thermal distribu-
tion functions, we can define an average or thermal de Broglie wavelength A th This .
To see how thishappens refer back to Equation (11-19) where the Lagrange
multiplier a in the Maxwell Boltzmann case is given by
Z
e
a = —
By evaluating the partition function Z, defined in Equation (1 1-18), we can show that
a
the conditions of low density and high temperature make e very large. This gives us
a clue about the appropriate limit to take to find classical behavior in the Fermi and
Bose distributions. In the example at the end of this section we show that Z is given
by
V
(11-28)
where X th is given by
l 2
th
\2itmk B T] '
V 1
(11-30)
*K
which we see obeys e
a
;» 1 if we have small density N/V or high temperature.
11-3 High-Temperature. Low-Density Limit 555
a t
If, in the Fermi or Bose distributions (Equations (11-22) or (11-25)), we take e e^ '
dE=Y n de J i i
. (11-31
This increase in energy comes from the work done on the system by the external agent
compressing the system. If the gas is in a box having edge of length L, the length is
dW = -FdL.
(The minus sign occurs because positive work is done on the system when L
decreases.) The force F is opposed by the pressure of the gas on the walls of the
container, each of which has area A. Since pressure is force per unit area we have
F= PA
or
This work dW must be equal to the change in energy in Equation (11-31) so that
the pressure becomes
P=-E«,^. (H-33)
i
For an ideal three-dimensional gas in a cubical box of length L the energy levels
are given by Equation (5-99),
2 2
«W,= ('l + '2 + 'l)T-7
LmL~
(11 " 34 )
with ff
l5
nf
2,
«f
3
positive integers. Since L is V x/i we can write e, = C V~ 2/3 where C
t
,
t
556 Quantum Statistical Physics
—=
de
rfF
t
2
--C,F" 5/3 =
3 '
--C
2
3 '
F-2/3
V
=
2
3
e,
V
L
.
Thus, for any ideal gas, whether classical, Fermi, or Bose, we have
21 2 E
P = --Y,n,e, = , (11-35)
3 V; ' '
3 V
(e) = \k B T. (11-36)
E= N(e)
so
PV=Nk B T, (H-37)
a pt '
(11-38)
IU
'
e"e '
±1 1 + e~ e- '
where the upper sign refers to fermions and the lower to bosons. The Maxwell
distribution results from taking the denominator of the last form in Equation (11-38)
equal to unity. This is equivalent to e'^'^ 1 ' <s: 1. Suppose we go one step further
and make a binomial expansion in small x using
= 1 + x + x- + ••• , (11-39)
1 + x
„.= e-^-^il + e- e-
a pf -
+ ). (11-40)
The terms given are the first two in an infinite series, called the virial expansion, which
is powers of (X\ h N/V). This factor is small at low densities and high temperatures.
in
Note the sign of the correction term. The upper sign in Equation (11-41) refers to
fermions, the lower to bosons. The pressure for fermions is higher than for Maxwell
particles, confirming what we expect from the Pauli principle. Because two fermions
cannot be in the same spatial location, each fermion moves in a smaller volume than
if the particles were distinguishable so that the pressure of the gas on the walls of the
container is increased. Thewhat occurs if there is a force of repulsion
result is just
between Maxwell -Boltzmann Bosons, on the other hand, prefer to be
particles.
located in the same state. Their statistics gives a result analogous to a Maxwell-Boltz-
mann gas with an attraction; the particles do not hit the walls as often, so the pressure
is reduced slightly over the high-temperature limit.
In Section 11-6 we show the effect of statistics on the Fermi gas pressure in the
Detail
To compute a sum over single-particle energies of the form L /(e), where /(e) is f
an arbitrary function, we need to carry out a procedure that has been discussed
in the example at the end of Section 2-3. We briefly repeat portions of that
derivation here to establish notation for later use and thereby introduce the
density of states function. We consider free particles in a large box with sides of
h 77 /2mL~, is very small so that sums over the ( can be converted with t
where the factor g enters because we have extended the /, integrations over
negative as well as positive values. Just as in the case of blackbody radiation, we
can consider the £ as components of a three-dimensional vector t whose length
{
f depends on e according to Equation (11-34). Since /(e) varies only with the
magnitude of vector /, we change from the Cartesian integration of Equation
(11-42) to spherical coordinates and then change to an integration over e:
_
E/(e) = —
477
8
/
/.oo
Jn
dM f(e)2
= -
77
I
r oo
J(\
de^-f(e)
di
(it
(11-43)
2
with t= ^{[ + c°
2
+ t°l .The 477 in Equation (11-43) comes from integrating
over all directions of / The density of states function is defined as the number of
levels per unit energy in the range e to e 4- de. This quantity is denoted by Z)(e)
in the relation
E/(*)= pe/)(e)/(e).
\
1/2
77 de 77 / 2mL 2 e 2mL ' !
\ f
2\ aV J { nv ) 2^
or
m 3/2
/2
DU) Vc' (11-44)
/2W .
1/2
Note that D(e) is proportional to e , a result that holds for nonrelativistic
particles of any statistics in three c imensions.
Detail
This quantity is easily computed by use of the density of states function derived
in the previous Detail. We have
2 3
^0 /2 77 ^ ^0
taking x = fie. The dimensionless integral has the value V77 /2 so that
V
Z=tt-, (H-46)
A th
"- 47)
x
* = [^f) •
(
n, *-*
/'(£,) = — = (11-48)
«"* 2
p(e) = D{e)— = -frrfiW2*-* (11-49)
z yw
11-4 Photon Statistics 559
< e>
= (°°deep(e) = -^ 2
f A^"'' = -^(k B T) 5/2 ^ 2
C dxx^e-
2 Zfr 3
=
7=(* B :r)—---k B T. (n-50)
In this calculation we again use the substitution x = fie to make the integral
dimensionless. The final result is the same as that found in Section 2-3.
Einstein pointed out that this argument was inconsistent because the part of the
derivation involved in finding the relation between the radiation energy density u v
and the average energy (e) of an oscillator, given by
2
877- v
", = ~^-( £ )'
was completely classical; that is, it did not take into account the quantization of the
oscillators emitting the radiation. It was only in writing down the correct expression
for (f) that the oscillator quantization was invoked.
We have avoided this pitfall by considering the radiation field itself to be
quantized. The radiation field energy can be thought of as that of distinguishable
oscillators of frequency v, which can be in states nhv.
This view is quite valid but can be improved by making greater use of the photon
picture involved in the photoelectric effect and in Compton scattering. Instead of
thinking of the radiation field as having, for example, one oscillator of frequency v in x
an excited state nhv and another of frequency v2 in state mhv>, we envision there
x
being n photons each of energy hv and m of energy hv 2 and so on. This is the view
]
,
fih.
e
560 Quantum Statistical Physics
hv
<«>=«>= -^ (11-51)
in agreement with Equation (2-38). The rest of the discussion of blackbody radiation
follows precisely as in Chapter 2.
11-5 Phonons
It was Einstein (again!) who recognized that the quantization of the oscillators in the
blackbody cavity walls had implications concerning the thermal properties of solids
themselves as well as the radiation with which they were in equilibrium. Deviations
from the classical laws for the heat capacities of solids had already been discovered.
The explanation of these deviations was another step toward the verification of the
new quantum ideas.
That the thermal properties of radiation and of solids are explicable by the same
quantum ideas comes from the very close analogy between photons and the quantized
sound waves in a solid.
We first consider a solid by a simplified treatment known as Einstein 's model. In
most crystals each molecule is bound tightly in place by the forces resulting from the
surrounding molecules; a molecule oscillates about its lattice site. Einstein's model lets
every molecule oscillate harmonically with the same frequency v as if each particle is
attached to its lattice site by a single spring with the spring constant the same for
every particle. A more accurate picture is one in which all the particles are intercon-
nected by springs. We see below how this changes the results, but first let us consider
the Einstein picture.
If the particles are highly localized about lattice sites, their wave functions do not
overlap much. Hence, particle indistinguishability need not concern us and we can use
Maxwell-Boltzmann statistics. Each oscillator has energy levels nhv and the average
energy per oscillator (e) is calculated in precisely the same way as in Equation (2-42),
so that we find
< e>
X /
= —-— with0=
3hi>
-Si
e^"" 1 kBT
1
.
(11-52)
E= N(e).
The heat capacity C is defined as the total heat required per degree of temperature rise
in the entire system. (The specific heat is the heat capacity per particle, C/N ). Here the
heat added is equal to the increase in internal energy E so the heat capacity is
C= —
dE
dT
=
3N(hv
T kB
a)
,
2
e
e^""
P»'»-\f
-. (11-53)
{
Figure 11-8
3» T
in agreement with the classical law of equipartition, Equation (2-20). The heat
capacity becomes
This is the law of Dulong and Petit (P. L. Dulong and A. T. Petit, 1819); the heat
capacity is independent of temperature at high T. Deviation from this law at low
temperatures was another element along with blackbody radiation in the case against
the classical theory. It was found experimentally that C dropped below Nk B at low
enough T.
We can see how quantization gives a cure for the inadequacy of Equation (1 1-56).
Figure 11-8 shows a plot of Equation (11-53). As the temperature drops C begins to
fall for k B T < hi> ; here the thermal energy is of the order of the energy level
separation and the discreteness of the energy levels becomes important. When
kB T <§: hv , the exponential in Equation (1 1-53) is large so
which goes to zero as T —* 0. At T= all oscillators are in their ground states and if
the system is to absorb energy some oscillators must make the transition to a state hv
higher in energy. Until k B T approximates hv , this is unlikely so C is exponentially
small in this model.
Measured values of C in real solids do show C — Nk B > as T becomes large, and C
does indeed drop toward zero at low temperatures in agreement with the Einstein
model. However, at low temperature it is found that C does not obey the exponen-
tially decreasing form given by Equation (11-57).
562 Quantum Statistical Physics
Figure 11-9
^ — ~ ~ ~ X
The explanation for this deviation from the Einstein model was given by Debye
who assumed that the molecules in a solid were connected together by harmonic
forces. The solid was visualized as a set of masses in a kind of three-dimensional
bedspring.
One of the characteristics of such a system is that longitudinal and transverse waves
can travel through it. These modes are the solid-state version of sound waves. Such
waves can have very long wavelengths ~ length of sample) and very short wave- (
lengths ( ~ separation of molecules). For long wavelengths where the lattice discrete-
ness is not important, the modes obey a standard wave equation that shows the
frequency and wavelength of the standing waves to be related exactly as they are for
blackbody radiation in Equation (2-18). However, here c becomes the velocity of
sound in the crystal.
While the theory of sound waves becomes quite analogous to that of the radiation
discussed in Chapter 2, there are three differences between sound and electromagnetic
modes: (1) The number of modes is not infinite. Very short wavelengths (very high
frequencies) are not allowed; the minimum wavelength is equal to twice the distance
between particles in the lattice. (2) A relation like Equation (2-18) holds for low
frequencies but deviations from it occur for higher frequencies. This phenomenon
corresponds to dispersion of the sound waves (higher frequencies have smaller veloci-
ties) and occurs because the system is not continuous but is made up of discrete masses
connected by springs. (3) just two polarizations but three. For some
There are not
directions of sound waves through a crystal these correspond to two transverse modes
and one longitudinal.
A one-dimensional model of a crystal is shown in Figure 11-9 in which a large
number of particles are attached together with springs. In this model we consider only
longitudinal motion. We can show that in the limit of very large N the frequencies of
this system are
vn = v\\ -cos
/
—
mn \
j
'/ 2
(11-58)
For small n/N, the cosine in Equation (11-58) can be expanded and only
lowest-order terms need be kept. We use
2
x
cos x = 1 — — +
2
7 1-5 Phonons 563
Figure 11-10
I 2
l / irn
-
+
1
ib'
vf m
n «: N. (11-59)
ft N
'
This equation describes the initial linear part of the curve in Figure 1 1-9. If we reduce
the discussion of electromagnetic waves summarized in Equation (2-18) to one
dimension by taking n, = n, n2 = n3 =
Equation (2-13)), we see that (or refer to
Equation (11-59) has the same form with the velocity of light now replaced by the
velocity of sound given by
c
s
= \f2TTP a. (11-60)
We have taken the length of the crystal to be Na with a the equilibrium interparticle
spacing. The group velocity of waves along this chain of particles is proportional to
the slope of the j'-versus-H curve, as illustrated by Equations (11-59) and (11-60). The
decrease in the slope of the curve in Figure 11-9 as n/N —* 1 corresponds to
dispersion; higher-frequency waves move more slowly.
Let us next consider a three-dimensional model of a solid, with each particle
connected by springs to nearest neighbors in all directions (bedspring model). Then
the frequencies v may be characterized by three integers «,, « 2 , n 3 , and for small n t
/ I
<L
i, + i
n2 + i
n 3
(H-61
21
The total number of frequency modes is still jV if we allow only longitudinal waves to
564 Quantum Statistical Physics
move through the crystal. If we assume a crystal with a cubic shape having length L
on a side, the number of particles along one edge is
L
N x
= - = N l/ \
a
and each n t
then ranges over
I < n,< Nr
Small n t
in Equation (11-61) means n t
<sc N x
. For large «, Equation (11-61) is no
longer valid and the frequencies begin to show dispersion.
For each set (n {
, n 2 , n3 ) three polarizations of waves can occur, which in the
simplest situation are two transverse and one longitudinal as mentioned above. In
most situations the sound velocities of the three polarizations differ. We ignore this,
consider c to represent some average sound velocity, and simply multiply our results
i
by 3.
The analogy of sound waves to electromagnetic radiation is so strong that the idea
of a phonon may be introduced. A mode v (characterized by the three integers
f„ = n v hv.
We say that such a situation corresponds to the presence of n v bosons called phonons,
each of energy hv, that move through the crystal. These massless "particles" are
actually collective modes, that is, cooperative motions of large groups of particles. These
particle-like objects move through a real "aether," the crystal lattice. The
phonon-photon analogy carries us quite far. Particle scattering is often used to study
the structure of crystals; for example, neutrons shot through a crystal interact with the
nuclei at the crystalsites. In the scattering processes we can see analogues of Compton
scattering of phonons by neutrons; the neutron can absorb or emit phonons while it
passes through the crystal.
At temperature T the average number of phonons is given by the Bose-Einstein
distribution function (with the Lagrange multiplier a set equal to zero as for photons)
"
e^ - 1
'
n l ,n 2 ,n 3 on which v depends:
e
phv _ ! = l + p hp + . . . _ x „ p hv
£= 3 £ —
15
hv
hv
- =3Nk B T, (11-64)
where the last equality follows because the total number of integers n x
,
n2 n3 , is N.
This is the same classical result that occurs in Einstein's model, Equation (1 1-55), and
leads to the Dulong-Petit law, Equation (11-56), which is experimentally correct at
high temperatures.
At lower temperatures, such that k B T < hvm we cannot get away with such an ,
easy derivation. However, as we change one of the integers n, by a single unit while
summing over «,, n,,, « 3 the summand in Equation (11-63) varies very little. This is
,
large number. Thus, it is always valid to convert the sum to an integral, so that we
have
-. (11-65
where A/ ,,
is another density of states function defined as the number of modes at
frequency v per unit frequency interval. To find this function we need detailed
knowledge of how v depends on n ,n 2 ,n 3 which for an arbitrary crystal
l
is often
known only numerically and is not just a simple generalization of Equation (11-58).
If we are interested only in the very- low-temperature behavior of E and C, we can
proceed further. For very low 7", many of the higher frequencies satisfy hv ;§> k B T or
fihv 3> 1. For only these frequencies, it is true that
1 1
nv = -^ « -jr- = e-
pi"
« 1 (^ » k R T)
so that these frequencies contribute negligibly to the sum in Equation (11-63). The
conclusion is that only frequencies such that hv E and, if the < kB T contribute to
temperature is low enough, only the low-energy nondisperswe frequencies given by
Equation (1 1-61) are important. The value of A^„ associated with this relation between
v and the n t
is now essentially the same as in the blackbody case. We find, by
examination of Equation (2-19),
2
\2-nv
566 Quantum Statistical Physics
E= — 1277
cs
V
^oo
Jq
dvv2
n^—
e—H
hv
T
1
(
k sT«h Vm ). (11-67)
We are able to replace the upper limit vm on the integral by infinity because, at low
temperatures, the higher frequencies do not contribute anyway. Let us make the
substitution
z = fihv
\2ttV ,oo
~
.
E = -iir(* fl
7 ')7 dz
4
The integral has the value 77 /15 so the heat capacity dE/dT is given at low
temperatures by
Solids, at low temperatures, should have heat capacity contributions due to their
phonon excitations that are proportional to T h
(rather than exp( — fihv ) as in
Einstein's model) and this is the result that is found experimentally.
Debye was the first to derive the T !
result. He also defined a convenient
characteristic constant for a solid, called the Debye temperature, given by
3
he Af\'/
0n= —
kB
I
677
2 -
v
\
C= — W *M^-T\
12 /
. (11-69)
Example
Solid helium can be made only by pressurizing liquid helium at very low
temperature. It has a Debye temperature of about 30 K as determined by heat
capacity measurements. Its sound velocity is, by the definition of @D ,
_
kB @D 1
'
2 ,/3
* (6t7 A7F)
//•<? Low-Temperature Fermi- Dine Systems 567
For comparison, the velocity of sound in air is 330 m/s. Most solids have sound
velocitiesat least an order of magnitude larger than this. Solid helium is
anomalous in many other ways as well.
Example
vm = —
2L
4n 2 * ±
C
The vel ocity of a longitudinal sound wave in copper is about 5000 m/ s and the
average interparticle spacing isa ~ 0.2 nm. Thus, we find
3
5 X 10 m/s
vm ~ = 2 X 10 13 Hz.
2 X 10" 10
m
This va lue is considerably above audible frequencies It corresponds to wave-
lengths on the order of an interatomic spacing.
Many common assemblies of particles in nature are degenerate Fermi systems. This
term means that the particles have crowded into their low-lying energy states; this in
turn implies that temperature is small on a scale that is explained below. Examples
3
are nuclei, metals, white dwarf and neutron stars, and liquid He. Before considering
these specific examples we examine the general low-temperature Fermi system.
The distribution function for fermions is, from Equation (11-22),
(3
= H-70)
+a
,j8t,
+ 1 kB T
a= -0e p (T),
where e F (T) is called the Fermi energy. Equation (11-70) then becomes
,/8(e,-f A
(11-71)
)
+ 1
568 Quantum Statistical Physics
Figure 11-11
>£,-
f,.(0). To see this let us consider a very small positive value of T; note that for
e, < eF, exp[/?(f ;
— e F )] has a negative argument and goes to zero as T— > or
/?
—> oo. For those e, values n :
is unity. However, for e :
> eF, exp[/3^ ;
— e F )] has a
positive argument and goes to infinity as ft
— » oo. In this region n(e) is zero. Raising
the temperature from zero removes particles from the levels e ;
< e^(0) and excites
them to higher states.
Let us investigate further the situation at T= 0. The average number jV of
particles in the system is given by Equation (11-23):
A^=I>,= £ ,£(£,
which determines e F in terms of jV and T. To evaluate the sum we use the density of
states function given by Equation (11-44) and obtain
Y £«,= 2 /"%/)(£,>,,
Jn
The factor of 2 in this equation arises from the fact that fermions are spin- \ particles.
In zero magnetic field each energy state is doubly degenerate and can contain two
particles, one with spin up and one with spin down just as in atomic orbitals.
Because n t
is a step function that cuts off at e, = e F (0) for T= 0, the result is just
2i/2 m 3/2 V
N= 2T/)(e) de = ' 2
(o). (11-72)
•'n
'o 3tt-' /r
2
h
a> \2/3
MO) (3^ p) (11-73)
1m
11-6 Low-Temperature Fermi- Dirac Systems 569
where the particle density is p = N/V. The Fermi energy is seen to depend on the
two-thirds power of the density so that, as more particles are placed into the system,
the highest occupied state eF increases. As the temperature is raised slightly from
T= 0, £ Fno longer the highest occupied state but is the energy for which n = l2
is t \
(which implies that all states contain on the average less than one-half of a particle).
The total energy of the A^-particle system at T= is
i/2
£=!>,", = 2/
f eF
deD(e)e=—T
2/2
—
m
r Ve /
5 2
(0).
J 5?T h
so that the average energy per particle E/N is three-fifths of the maximum energy eF .
We have seen in Section 11-3 that the Pauli principle causes an increase in the
considerably more difficult than at T= Rather than carrying out these computa-
0.
tions we give a heuristic argument for the temperature dependence of one of the most
important of the experimental quantities, the heat capacity. As the temperature is
raised particles in states having energies within an interval k B T below eF are raised to
states of order k B T or less above eF , as can be inferred from Figure 11-11. If
I k T\
kE~ N\—\k B T (A a r«f,.).
It is easily seen that the Fermi temperature provides a scale to measure the degree
of degeneracy. If r« 7F the system has strong particle-indistinguishability effects
and the low-temperature formula in Equation (11-76) holds; for T » TF the classi-
cal-limit formulas become valid.
In the following examples we present some applications of the Fermi formulas to
specific systems.
Example
2 V3 2 2
eF (3 W )
h h
2
kH 2 kBm k B ma
(i x io- ;i4
j- s r 5
J
= 8 X 10 K.
23 31 10
(1.38 X 1(T J/K)(9.1 X 1(T kg)(lO" m)
(11-77)
carried out at temperatures that are below both the Debye temperature QD and
the Fermi temperature TF , the total heat capacity of a metal has the form
C = Nk B (11-78)
Example
10
7> - 10 K.
The nucleons in a nucleus are obviously degenerate under any normal condi-
tions. Only in the early stages of the universe, soon after the Big Bang, could
temperatures of this order exist.
Problems 571
Example
Electrons in a white dwarf star: Despite the high temperatures found in stellar-
interiors, the electrons in a late-stage star known as a white dwarf are quite
degenerate because the value of TF is so large. When a star burns all its
Ec GM 2
1
Pg ~
T 1"?' ~
N k
2
I N\ 2/ 3
Pf'~
F^\F
where m e
is the electron mass and the number of electrons is actually 2N y
twice
the number of helium nuclei. Equating Pp and PG gives the radius of the star:
R ~ ^ (11-79)
Gm e
m^M x
We have dropped all numerical constants from this expression to get just an
order of magnitude estimate. we put in numbers we get R = 7 X 10 2 km for
If
M = Mq, the solar mass. This means a white dwarf is a very compact object.
However, it is not quite that compact. Equation (1 1-79) is derived on the basis of
nonrelativistic theory. We show in Problem 21 at the end of the c hapter that an
R given by Equation (11-79) results in a Fermi velocity (v F = J2e F/m) large
enough that relativistic effects must be taken into account. Quite a different
radius-mass relation then results. While the basic idea of our calculation
remains valid, the relativistic treatment yields the interesting result that, for
stellar masses greater than about 1.4 MQ , the Pauli pressure can no longer stop
the gravitational collapse. The star continues its infall, a supernova occurs, and
ultimately a neutron star or a black hole is formed. This critical mass is known
as the Chandrasekhar limit, discovered by S. Chandrasekhar in 1934.
Problems
1. Verify by explicit calculation that the three- particle boson eigenfunction given in the
example at the end of Section 11-1 is symmetric under the interchange 2 <-> 3 and the
permutation 1,2,3 ** 3, 1,2. Write down the equivalent Fermi-Dirac function (see
Problem 8 in Chapter 9) and test its symmetry in the same two cases. Explain your
fermion results.
572 Quantum Statistical Physics
2. A single spinless particle has potential energy V(l), and has two real eigenstates ^ a (l) and
v/^(l) with energies ea and e^, respectively. Two such particles have potential energy
V= V(l) + V(2) + Vmt (l,2), where VmX represents an interaction between the two par-
ticles. Suppose that
/^ B (r)^(r)rfr-0
(a) Evaluate the energy expectation value in the four two-particle states
(b) Which of these states is allowed if the two particles are identical bosons? fermions?
distinguishable particles?
(c) Explain why, for the fermion case, the introduction of an intrinsically positive
interaction (J > 0) leads to a reduction of the two-particle energy.
3. How many distinct throws of a pair of dice can occur? How many ways can a four be
thrown? What is the probability of throwing a four? What number has the highest
probability of occurring in a throw of a pair of dice?
4. Consider an ensemble made of two systems ( M= 2) each having two energy levels f , and
f ,. Energy level f, is occupied by a total of two particles (jV, = 2), as is level e_, (A
:
,
= 2).
Find the total number of ways W of placing the particles in the states if the particles are
(a) distinguishable, (b) fermions, or (c) bosons. Use Equations (11-13), (11-21), and
(11-24) and also find W by explicit counting.
partition function. Use this formula to derive the result of Equation (11-50).
6. Evaluate A/(/?')' / " and show that the result, up to constant factors, is equal to A tl
.
7. (a) Use Equation (5-60) (with position x replaced by energy as a variable) to show that
given by
2
(A£) =^((£2 >-< e >2 ),
(b) Show, from the definition of the partition function Z, that the mean square
single-particle energy deviation for Maxwell- Boltzmann particles is
(c) Use theresults of parts (a) and (b) to show that the fluctuations in energy of a large
JF) ~ V 3W
'
8. Carry out the details of the maximization procedure for the function W in Equation
(1 1-24) to prove that the Bose-Einstein distribution function is given by Equation (1 1-25).
9. Consider a system of jV distinguishable particles in which each particle has two possible
energy levels e, = and e2 = A. An example of such a system is a crystal that can form
(a) Find the distribution functions n, and n 2 and plot the ratio n 2 /n ]
as a function of
the parameter t = e~^, which has range < t < 1, while the temperature varies
(b) Find the average energy (e) and the specific heat c = d(e)/dT. Plot (e)/A and
c/k H as functions either of t or of k s T/A. Explain the behavior of these functions.
10. (a) Consider a two-level system as in Problem 9 but containing spinless "Fermi-Dirac"
a
particles. Find n t
and n.,. To do this the Lagrange parameter e must be
eliminated by use of Equation (11-23). Carry the solution out for the only two
possible cases N= 1 and N= 2. Why is N > 3 not allowed? Plot the ratio n 2 /n ]
(b) Repeat with Bose-Einstein particles for the cases N= 1,2, and 50. The elimination
of the Lagrange parameter e" is not as easy here and may best be achieved by
numerical or graphical methods. Plot n .,/«, as a function of t for each of the above
values of N. Can you generalize the trend to N —» oc?
(c) Compare the plots of n _,/«, from Problem 9 and from parts (a) and (b) of this
problem. Explain the physical basis of the behavior. Why do even the N= 1 Fermi
or Bose distribution functions show quantum behavior?
11. A container filled with H, gas has a pressure of 1 atmosphere (1.013 X 10 5 Pa) at room
temperature. To what temperature must the gas be cooled for quantum effects to become
important? Estimate this temperature by equating the thermal de Broglie wavelength to
the interparticle spacing. Use the ideal gas law to estimate the density. Repeat for the
conduction electrons in a metal for which the average interelectronic spacing is fixed at 0.
nm.
12. Equation (11-49) gives the Maxwell-Boltzmann probability density for finding the
+ de as p(e) ~ e ""exp( — /?e). Find the most
single-particle energy in the range e to f '
likely energy by maximizing p(e). Compare with (e) of Equation (11-50). Find the
13. Find the total energy E and heat capacity C of blackbody radiation in a cavity of volume
V. Show that C has the same temperature dependence as the low-temperature phonon
system. Find the average total number of photons present in the cavity as a function of
temperature. Estimate this number for room temperature (300 K) and a volume of 1 cm'.
14. Atoms adsorbed in a monolayer (a layer one atom thick) on a surface at sufficiently high
density sometimes behave like a two-dimensional solid. The low-frequency sound waves
obey a relation of the form of Equation (11-61) with n 3 = 0. Find the heat capacity of
such a solid at high and low temperatures. [Hint: A new density of states A^, appropriate
to standing waves in two dimensions must be derived. Paraphrasing the derivation for
574 Quantum Statistical Physics
radiation in Section 2-2 or even that for particles in the Detail at the end of Section 11-3
may be helpful.]
15. A scale of temperatures for solids, by which we judge whether T is high or low, is set by
the velocity of sound and density, through the Debye constant © D What
. is the corre-
sponding characteristic temperature in Einstein's model of a solid?
17. Aluminum has a Debye temperature of 420 K. Use this to find the velocity of sound in
aluminum. What is the specific heat of the phonon system at 300 K? Assume the
low-temperature formula in Equation (11-69) is sufficiently accurate.
18. Calculate the Fermi temperature TF for metallic aluminum. What is the specific heat of
the conduction electron system of aluminum at 300 K? Compare this result with the result
of Problem 1 7.
19. Show that the condition T « TF for the degeneracy of a Fermi gas is equivalent to the
statement A th » a, where a is the mean interparticle spacing.
20. Liquid 'He is a system of fermions that interact quite strongly. Nevertheless, a useful
first-stage model is that of a noninteracting gas of helium atoms. Find the Fermi
temperature, for a mean interparticle spacing of a = 0.4 nm in the liquid. Find the
thermal de Broglie wavelength for
f
He at T= 1 K and compare with the given value of
1 1-6 must be treated relativistically. Do this by computing the Fermi velocity for a radius
22. The rotational motion of diatomic molecules contributes to the specific heat c = d(e)/dT
in a gas of molecules. In this problem you are to calculate this contribution by considering
a set of distinguishable rigid rotators each with moment of inertia / and energy levels
Bj=j(j+l)B, J = 0,1,2,...,
where B = 2
h /2I.
(a) Show that the partition function is
1 /•«> d
J
BB J dj
Discuss the approximation. Find (f) and C by using the theorem proved in
Problem 5.
(c) Evaluate Z, (f), and C for low temperature (BB » 1) by keeping only the first
SOLIDS
tin to form bronze and to mix the right combination of iron and carbon to make steel.
As in these examples, fundamental knowledge has often followed from the need to
improve technology. In more recent times the order has often been reversed and
technology has entered quickly into areas where new knowledge has been found. In no
area of science has this interrelation been more apparent than in solid-state physics.
The outstanding examples of this are the developments of the transistor and the
integrated circuit, which have brought us into the computer age. These inventions
followed directly from the understanding of the nature of semiconductors and could
not have occurred without that basic understanding.
As important as solid-state physics is to the development of technology, even more
so is its fundamental contribution to our basic understanding of nature. Einstein's
explanation of the breakdown of the Dulong-Petit law of heat capacity in solids is
only one of several first examples of this. Furthermore, progress in solid-state physics
has often stimulated new lines of basic research in other areas. For example, after
years of study of the magnetic transition in iron, recent breakthroughs have led us to
an understanding of many kinds of phase transitions in a wide range of substances.
575
576 Solids
Several recent advances in other parts of physics are based on ideas used to describe
phase transitions in condensed-matter systems.
The passing of ideas from one area of physics to another is a common theme. We
know from our study of diatomic molecules in Chapter 10 that two atoms of hydrogen
bind together because the electrons are able to hop from one atom to the other; this is
the basis of the covalent bond. The periodic potential energies of electrons in regular
crystals also can have sufficiently low barrier heights so that electrons can tunnel from
one atom to another. This effect contributes to binding and so the solid behaves
somewhat as a gigantic many-atom molecule.
This rapid hopping results in a highly characteristic structure for the energy levels
of electrons in solids. The states fall into bands of very closely spaced energy levels
separated by forbidden energy regions called band gaps. The basic behavior of
crystals depends greatly on how the electrons fit into these states, which are filled
according to the Pauli principle. On this basis we are able to understand how
substances fall into the categories of metals, insulators, and semiconductors.
To understand the detailed distinctions among the various kinds of solids, experi-
mentalists can probe the materials by measuring such static properties as their
pressure-volume curves, heat capacities, and magnetizations. They also can examine
their transport properties by setting up nonequilibrium flow conditions. For example,
if we establish a voltage across a sample of metal, electricity is conducted through it
and we can easily measure its electrical conductivity. When we maintain a tempera-
ture difference between the ends of a sample of material, heat flows and we can
determine the thermal conductivity coefficient. The dependence on temperature, and
on other variables, of these coefficients tells us about the characteristics of the carriers
of electricity and heat, and also tells us with what objects these carriers may be
colliding as they traverse the solid.
Insulators are able to conduct heat, though not electricity, by means of lattice
vibrations. The associated phonons have been introduced previously in Chapter 11.
Several properties of phonons have only been quoted there in order to discuss heat
capacity as an example of the methods of statistical physics. In this chapter we justify
those results by deriving them.
Since the first awards in 1901 the Nobel Prize has gone to many physicists for their
work on solids. The achievements have been as significant as any in physics. More
research is published each year in solid-state physics than in any other field. Many of
the papers are reports of technological applications and many are quite basic. These
forms of activity demonstrate to every student of physics the importance of under-
standing the physics of solids.
We cannot define a solid simply as a substance that is hard. Suppose that we put
water into a closed container with a plunger on one end and then attempt to squeeze
it. We would find it to be "hard" as well since under the application of ordinary
The molecules of a gas are basically free particles that only occasionally feel the
effects of intermolecular forces as they collide with one another. The molecules of a
liquid, such as water, constantly feel forces due to the other molecules. They are in a
many-body bound state. A bit of water floating freely in the space-shuttle stays
together as a roughly spherical blob. On the other hand, the total kinetic energy
shared by the molecules in water is still sufficiently great that each molecule does not
find itself bound permanently to a given set of neighbors. Such a particle almost
always has an escape route in one direction or another as it rattles about trying to
move to a new neighborhood. Even if the molecule momentarily finds it has
insufficient energy to escape from a certain small region, collisions are so frequent that
it soon has enough, so a new lower energy pathway opens up because of the other
particles' random motions. However, if one of the particles tries to leave the surface of
the liquid, it usually is unable to do so, because the forces from the other molecules
near the surface pull it back. Occasionally, however, some particles do have sufficient
kinetic energy to evaporate.
If we try to compress a difficulty because there is so much empty
gas, we have no
space. But liquids are far denserand compression causes the hard cores of the particles
to touch and create resistance. The same is true of solids. A shear force applied to
water does not try to push the water into a smaller volume like a compression does. It
simply slides the particles relative to one another somewhat like the playing cards
mentioned above. This costs very little energy; it is essentially what the individual
molecules are already constantly doing on their own.
We are led then to the idea of a solid, with its shear strength, as resulting from the
fact that each particle is bound in a potential energy well formed by its neighbors
from which it rarely has enough energy to escape. Shear forces are resisted because it
takes energy even to slide particles relative to one another. A solid can usually be
picked up without a container because of its shear strength.
One way that every particle can be made to sit in a potential energy well formed
by its neighbors is to construct a perfect array, that is, a crystal lattice. In the simplest
such situation every particle sees precisely the same arrangement of neighboring
particles. If one particle is bound then all of them must be.
Consider a situation in two dimensions
temperature. Suppose the
at absolute zero
particles are xenon atoms, each pair
which has a potential energy of interaction
of
described by the Lennard-Jones form, Equation (10-36). If we neglect the small
amount of zero-point motion, a pair of these atoms sits at the distance a of the
minimum of the potential energy curve and has energy — e, equal to the depth at the
minimum.
The best thing for a third particle to do is to position itself so that the three atoms
form an equilateral triangle having sides a. The energy of this trio is — 3e, correspond-
ing to the three bonds formed as shown in Figure 12-1.
A fourth particle can now be added, in our two-dimensional example, at any one of
three possible positions so that it similarly has two neighbors each at a distance a. If
we neglect the interaction energy between the two particles that are not near
neighbors, the total energy of the two triangles is now — 5e. We can continue in this
manner until whatever amount of two-dimensional space available is filled with
triangles as in Figure 12-2.
Note that every particle in this figure is the center of a hexagon so that it has six
neighbors. If we were arranging Ping-Pong balls on a table so that they were touching
one another, the most that we could pack about any center ball would be six. Those
six would form a hexagon made up of equilateral triangles. We would have packed
578 Solids
energy — e. The X 's mark possible positions and 2 are nearest neighbors; particles 1 and 3
for a fourth particle. are second neighbors.
O-O ® o o
o o Q-® o
o cAiA) o
O O O-O o
o o o o o
the balls as closely as possible to one another. More balls could be added to fill
Figure 12-3
Close packing of Ping-Pong balls. Each ball is at the center of a hexagon and touches all six of
its nearest neighbors.
12- 7 The Structure of Solids 579
o o o o
o o o o
o o o o
lowest energy possible for any structure in two dimensions for an isotropic interaction
like the Lennard-Jones function.
An alternative two-dimensional structure is the square lattice shown in Figure 12-4.
It is easy to see that the energy of this structure is — 2cjV, which is a third less negative
than the energy of the triangular lattice. Also its density is considerably lower.
The above discussion of crystals in two dimensions is far from purely pedagogic.
Gaseous xenon and many other elements, when bound to certain carefully chosen
planar solid surfaces, behave like two-dimensional gases, liquids, or solids.
We now try to treat three-dimensional systems in the same manner as we have
handled two-dimensional solids. One way to start out is by forming an equilateral
Figure 12-6
Ping- Pong balls stacked to make as many tetrahedra as possible in an object consisting of a ball
and its near neighbors. The geometrical shape is a distorted icosahedron.
bound in a potential well. Generally, there is, for these materials, some lattice
structure having lower energy, but the atoms cannot get to that state because they are
each trapped in a local potential energy minimum.
Since nature cannot make regular solids by filling three-dimensional space with
tetrahedra, as it can fill two-dimensional space with triangles, it usually proceeds to
make regular lattice structures in other ways. Rather than maximizing the number of
tetrahedra, we look for a way
maximize the density of the entire system such that
to
all nearest neighbors of any atom are at some distance a. It is speculated, but has
never been proved, that the densest such arrangement is provided by the face-centered
cubic (FCC) lattice. This crystal structure is easily constructed by starting with the
two-dimensional triangular lattice of Figure 12-2. The particles in this layer are
designated by B's, as indicated in Figure 12-7. Particles are next added in layers
above and below this layer. Particles in the next layer above the first go in at sites
A third layer is placed below the first at the positions indicated by the C 's.
Additional layers are added so that the sequence • • • ABCABC -is formed.
If we look at the arrangement of a particle and its 12 nearest neighbors in the FCC
lattice, as shown in Figure 12-8, we see how
from the distortedthis grouping differs
structure of Figure 12-6. The seven second-layer particles are all on the same level,
whereas the levels of these same particles in Figure 12-6 are seen to alternate. Not so
many perfect tetrahedra are formed in this argument, but that is more than com-
pensated by the regularity of the packing throughout all space.
We
can easily compute the energy of the FCC lattice if we include only nearest-
neighbor interactions. Since each particle has 12 neighbors, this energy is — (v,) X
12eN = —6eN, where the factor l avoids the double counting of the bonds. This
,
Figure 12-7
The FCC lattice gets its name from viewing the structure from a different angle.
Figure 12-9 shows this alternative view. The demonstration of the equivalence of the
two views is left to be shown in Problem 3 at the end of the chapter.
There are many other possible lattices. A simple variation of the FCC case is the
three-dimensional hexagonal close-packed lattice. Instead of placing a particle in a C
layer as shown in Figure 12-7, it is placed directly over a particle in the A layer
resulting in an • • •
ABAB pattern. With this arrangement each particle again
has 12 neighbors. It differs in energy from the FCC only in interactions involving third
nearest neighbors. Other possibilities are the simple cubic (SC) lattice shown in Figure
12-10 and the body-centered cubic (BCC) of Figure 12-11.
Figure 12-8
Figure 12-9
Figure 12-12
The types of binding that occur in molecules also hold solids together. A solid can
often be considered simply to be a large molecule. Covalent bonding occurs in the
diamond The NaCl crystal shown in Figure 12-12 is ionically bound, with Na
crystal.
the positive ion and CI the negative.
The hydrogen atoms in a water molecule are bound to the oxygen atom covalently.
Each hydrogen can also be attracted to an oxygen of another water molecule, thus
forming a second bond called the hydrogen bond. Ice then consists essentially of water
molecules connected together by such hydrogen bonds.
Hydrogen itself is another example of a substance that retains its molecular identity
in the crystalline state. The H 2 molecules attract each other by the van der Waals
force and form what is called a molecular solid. Other molecular crystals include, for
example, 2
and the solid states of the noble gases helium, neon, argon, and so on.
Under very high pressure, it is expected that the diatomic molecular bonding in solid
hydrogen gives up in favor of a homogeneous bonding among all atoms so that the
solid undergoes a transition to a metallic phase. This important effect is being looked
for inmany laboratories around the world.
The binding that occurs in a metal is somewhat analogous to that occurring in the
hydrogen molecule, with electrons hopping among all atoms instead of just between
two. We investigate this binding and the other properties of metals in the next two
sections.
Example
The NaCl crystal is ionically bound. We can try (unsuccessfully) to estimate the
contribution of its Coulombic energy to the binding of the crystal by summing
over the pair interactions in the lattice. Any given Na + atom is surrounded by 6
near-neighbor CI" atoms at a distance R , 12 second-neighbor Na + atoms at
+
v2/? , 8 CI" atoms at ]/3 , 6 Na at )/4R , and so on. The contribution to
584 Solids
12 8 6
E=
R \
+
w -)•
The bindin g energy per particle is N times this result, v\ 'here N is the number of
sodium ions. The sum in parentheses converges very slowly. Its value upon
truncating it after a given numbers of terms, is shown n the following table:
3 -2.1
1 0.9
5 '1.9
6 -0.7
If the com plete sum is done, the resu It is - 1.75 ; spec rial mathematical proce-
dures must be used to evaluate it.
the surface of a crystal represents one of a parallel family of such dominant planes.
From Figure 2-15 we recall that constructive interference occurs when the path
difference between the two rays in the figure is equal to an integral number of whole
12-2 Bragg Scattering 585
Figure 12-13
O O O O O O
wavelengths. According to Equation (2-49), this happens when the angle 6, between
the crystal planes and the incoming or scattered beam, satisfies
2 77
|k| = Ik'l = k = (12-2)
Ak = 2*sin0 h.
Substituting from Equation (12-1) for sin#, and from Equation (12-2) for A., gives
Ak = — mh = G,
2 7T
d
w = 0,1,2,... (12-3)
Figure 12-14
Photographic record of the x-ray diffraction pattern due to Bragg scattering from an NaCl
crystal.
The infinite, but discrete, set of these G vectors can be shown to define a lattice,
called the reciprocal lattice. Each point in this ficticious lattice corresponds to a possible
momentum that can be absorbed by the real lattice of particles. There is a one-to-one
correspondence between the sites in the reciprocal lattice and the spots in a scattering
pattern.
We see below that these vectors show up repeatedly in our analyses of solids. For
example, a conduction electron in a metal can be diffracted constructively by the
lattice of the atoms, just as an x ray is, if the scattering produces a Ak equal to one of
the G's. This phenomenon has important consequences relative to the possible energy
states of electrons in crystals.
Figure 12-15
Example
mX 1 X 0.2 nm 0.2 nm
- (1 IS nm
2sin0 2 sin 42° 2 X 0.67
he 1 .240 keV •
nm — ° L-rV
fi
0.2 nm
Example
If neutrons are used for the Bragg scattering in the above example, each has
wavelength 0.2 nm and energy E related to X by
h h
X = — = , =0.2 nm,
p )/2ME
where p and M are the neutron momentum and mass, respectively. Thus, we
find
34 2
2
h (6.6 x 10 J •
s)
2MX2 2(1.7 X 10
-'-' 7
kg)(0.2 X 10 9
m)
2
3.2 x 10" 21 J
3.2 X 10"-'
J iy
= 0.02 eV.
1.6 X 10" J/eV
Note how much smaller the neutron energy is than that of the photon of the
same wavelength. Neutrons having such low energies are said to be thermal, since
a k B T of this value implies a T near room temperature. Nuclear reactors are
often the source of neutrons for Bragg scattering. Because most such neutrons
originate with energies in the MeV range, it is necessary to slow them down by
many inelastic scatterings before using them in a diffraction experiment.
The most notable properties of metals are their abilities to conduct heat and
electricity. The idea that metals contain electrons that can move freely and can carry
energy and charge was developed soon after Thomson discovered the electron in 1897.
P. Drude first suggested a model that assumed an electron behavior like a classical
gas. In Chapter 1 1 we have examined this assumption and have found that the
electrons must be a highly degenerate set of fermions. In 1928 Sommerfeld modified
the Drude theory to take degeneracy into account. The resulting model did a
surprisingly adequate job of describing many of the properties of metals.
588 Solids
Figure 12-16
Some of the static properties of a degenerate electron system have been described in
Chapter 1 The most important one for metals is that the electron heat capacity has
1 .
dT
Jt kt (12-4)
where K T is called the coefficient of thermal conductivity. The minus sign in Equation
(12-4) indicates that heat travels from high temperature to low temperature or in the
direction of negative dT/dz.
In a similar manner we define the electrical conductivity ke as the constant of
proportionality between an electric field E established in a metal and the charge
current density
JE =« E E, (12-5)
giving the charge per second per unit area moving in the material. From elementary
electrostatics, the field vanishes in a metal at equilibrium. The situation established
here is a nonequilibrium one. The charge is flowing in an attempt to set up a
distribution that cancels out the field in the metal. If the charge is removed from one
end and reinserted in the other continuously, a steady-state nonequilibrium situation
can be established as it is in any simple DC resistive circuit.
The transport properties of electrons are usually discussed by considering the states
of the particles in momentum space. We need to digress briefly to see what this space
12-3 The Free-Electron Theory of Metals 589
is and how it is filled by a set of degenerate free particles. We know from Chapter 1
that the energy states for a Fermi system at T= K are filled up to the Fermi energy
eF and states above that are empty. If we calculate the energies of traveling-wave states
3
in a cubical box of volume L , we find
i 2
e = + +
^AA -
2mL71 \
o I 2 ^3 )•
Pi=
K
— , < = 0, ±1, ±2, ±3,... (12-6)
where / is 1,2,3 or a-, y, z. These results for the energy and momentum are not quite
the same as the previous forms of Equations (5-99) because we are now considering
traveling rather than standing waves. We consider these differences in more detail in
Section 12-4; for now just note that both positive and negative momentum values are
allowed corresponding to particles traveling in opposite directions.
Each of these discrete momentum states may be occupied by one electron of each
spin type. The three momentum coordinates allow us to construct a space of states
similar to the one illustrated in Figure 2-7 for standing waves of electromagnetic
radiation. At T= K, the states in p space occupied by particles form a sphere,
known as the Fermi sphere, as shown in Figure 12-17. The radius of the sphere is pF ,
pF =fime F . (12-7)
Figure 12-17
Pz
.
590 Solids
vF =—m . (12-8)
dv
m —
at
= -eE - av, (12-9)
where a is a constant determined by the nature of the frictional forces. Obviously, the
frictional force operates only when the electron is moving and increases in size with
the velocity, much as the retarding force of air resistance on a moving automobile.
Suppose we consider the which the electron has been given an initial
situation in
velocity v in the absence of an electric field (and in the absence of any other
electrons). Then the electron's equation of motion is
dv a
dt m
which has solution
v *-
(fl/m) '.
v = (12-10)
m
t= — (12-11)
a
This constant is called the relaxation time for obvious reasons. By means of microscopic
calculations one can show that r is not much longer than the collision time, the time the
electron travels between interactions with impurities, ions, or whatever it is that is
^E ei
v = = E. (12-12)
a m
If other electrons are present, a given electron is not able to relax to an arbitrary
velocity or momentum because of the Pauli principle. In particular, in zero field most
electrons certainly cannot relax to zero velocity as implied by Equation (12-10)
because that state is probably already occupied. However, the energy loss processes
can relax a system of electrons to the many-particle ground state, which involves a
filled Fermi sphere centered on zero momentum.
12-3 The Free-Electron Theory of Metals 591
Figure 12-18
When there is an external field E every particle cannot have the same small
nonzero velocity implied by Equation (12-12), since that violates the Pauli principle.
What happens is that every particle in the Fermi sphere is shifted in velocity by an
identical amount Sv equal to the v of Equation (12-12). So we have
Sv = E. (12-13)
m
Thus, the entire Fermi sphere is shifted over by 5p = m 5v, as shown in Figure 12-18.
Note that because this shift of origin of the sphere is usually quite small, most of the
momentum states that were occupied before the shift are still occupied. However, a
thin crescent of states at the left-hand edge of the figure has been emptied and another
at the opposite side has been filled.
This kind of effect is characteristic of degenerate Fermi systems; only the particles
near the Fermi surface take part in the process. We know from Chapter 1 1 that the
specific heat capacity involves absorption of energy by particles within k B T of the
Fermi surface.
In general, a current density or intensity of a beam is particle density times particle
velocity, as we illustrate in Problem 9 at the end of the chapter. Thus, the electrical
current density is charge density times particle velocity:
JE = -e8pvF ,
(12-14)
where 8p is the density of particles involved in carrying the current; these are the
particles in the crescent at the Fermi surface. All these particles move with velocity
very nearly equal to the Fermi velocity v F defined by Equation (12-8).
We can find the density 8p of current carriers by using the relation for the density
of states developed in the first Detail at the end of Section 1 1-3. We have seen that the
number of energy levels per unit energy in the range e to e + 8e is
D(e)= -
=^-Vt^\
1 2 3
]f27r h
592 Solids
3 P Ve ^
2 1
D(e)
4 4/2 '
Since, for fermions, there are two particles per energy level, the density of particles per
unit energy R(e) is obtained by multiplying D(e) by 2/V to give
2
3 pe'/
*(*)=«^7T- (12-15)
The density of current carriers is then the density of particles per unit energy'
evaluated at the Fermi energy times the energy width of the crescent:
3p
8p = R(e f )8e = 8e. (12-16)
2e F
de 1 d(mv 2 )
8v = -
8v = mvF 8v. (12-17)
2 do
Combining the results of the last few equations, we have for the current
p er e~rp
JE - ev F —mv F —E E, (12-18)
eF m m
where we have dropped constant factors of order unity because our estimate of 8p
contains inaccuracies of that order. From Equation (12-5), the electrical conductivity
is
2
e rp
(12-19)
Although only the electrons near the Fermi surface contribute to the electrical current,
the density p corresponding to all electrons, even those away from the Fermi surface,
appears in Equation (12-19). This feature arises because the number of electrons that
are forced by the Pauli principle to be at the Fermi surface depends on the overall
density.
For classical particles (which need not obey the Pauli principle) Equation (12-12) is
valid. Furthermore, for such particles, the current is simply JE = — epv instead of that
given by Equation (12-14). Curiously, the combination of these two equations also
leads to Equation (12-19), even though such a simplified derivation cannot be valid
for electrons.
In order to make contact with experiment we need to specify the behavior of t in
Equation (12-19). The major effect involved in slowing the conduction electrons is
their interaction with vibrating positive ion cores in the solid. We see in Section 12-4
that if the ions form a perfect crystal, the electrons move through the lattice essentially
12-3 The Free-Eleclron Theory of Metals 593
as an ideal Fermi gas, which has been the basic assumption of this section. It is when
the lattice ions vibrate thermally that interactions with the electrons occur. A good
picture of the process is obtained by considering an electron as absorbing and
reemitting phonons as it moves through the lattice. The relaxation time t can then be
supposed to depend on np ,
the number of phonons of frequency v present in the
lattice. We know from Section 11-5 that n v is given by
n„ = (12-20)
e
fih> _ i
•
The fewer phonons that are present, the longer t is expected to be, so that we can take
-(?")
1 kB T
1 + fihv + ••• -1 hv
r~T, T»Q D .
I T
L*. - T«@ D .
t~t\ r«e D .
At very low temperature, the number of phonons may become so small that
impurities in the crystal may account for the frictional forces on the conduction
electrons. Such impurities may be foreign atoms, vacancies (absent atoms), and other
imperfections in the crystal lattice. For sufficiently small samples the boundaries of the
crystal may even be a factor in determining T. In most of these cases, this contribution
to t is independent of temperature so that r approaches a constant at the lowest
temperatures.
A somewhat similar treatment can be given to the thermal conductivity of metals.
The heat current is now driven by a temperature difference. One end of the metal
sample is hotter than the other. Although we assume that there are equal numbers of
594 Solids
Figure 12-19
Distribution of electrons in a cut through the Fermi sphere along the p. axis. The distribution is
that at a single point in real space. Particles with momentum + p. have just come from a hotter
region; those moving with momentum —p. have just come from a cooler region. These
distinctions show up in the slight differences in the spreads of the distributions around +p F .
electrons flowing in both directions along the z axis in Figure 12-16, so that a charge
does not build up at one end, those flowing from the hot end carry more energy. There
is a flow of heat in the absence of a net flow of particles. (Actually there can be a
thermoelectric effect in which temperature differences give rise to voltage differences,
but we neglect such details here.)
Again it is the electrons near the Fermi surface that are responsible for thermal
conduction. Consider the cross-sectional view of the Fermi sphere shown in Figure
12-19. The distribution function n fi
in the figure is the number of electrons having
momentum p. The
deep within the Fermi sphere have no effect on
electrons
conduction because they come in matched pairs; for every one moving toward +z
there is one moving toward — z.
The heat current density is the number of heat carriers per unit volume times the
velocity of a carrier times the heat carried per particle. The first of these factors is the
density of conduction electrons p multiplied by the fraction k B T/e F of the particles
that are thermally excited. Only this fraction can carry heat. So we have
kB T
number of heat carriers per unit volume = p .
£F
velocity of a carrier = vF .
The final factor, the heat carried per particle, is a bit more subtle to compute. The
energy carried by a particle moving in the +/>. direction is of order e F + kB T+ and
that in the — p, direction is of order e F + k B T_, where T+ is a bit larger than 7'_ as
explained above. The net energy transported is then kB ST = k B (T + — T _). To
compute 8T we note that particles coming from the left are cooled down by collisions
with other particles so that they tend toward the local temperature. Particles moving
from the right are heated up by collisions. Since the average time between collisions
for a particle is assumed to be about t seconds, the distance between collisions is
/= v f t, a quantity known as the mean free path. We can assume then that a
temperature change of order 8T occurs every distance / and that the gradient in
temperature is
dT ST
~ ~~7
~~dz~ '
12-3 The Free-Electron Theory of Metals 595
or
dT
8T = -TvF —-.
dz
dT
net energy carried per particle ~ ~k B TV F — dz
.
p dT
2
JT k B Trvj—. (12-21)
eF dz
If we set e F — mvF and compare with the general form, Equation (12-4), we find the
coefficient of thermal conductivity to be
kt = -k\TT. (12-22)
m
It was noted very early in the history of the study of metals that good electrical
conductors are also good conductors of heat. If we take the ratio of k t to k e from
Equations (12-19) and (12-22) we find
k
2
T
*r= -4-f
e
£ >
(12-23)
a rule named the Wiedemann- Franz law after G. H. Wiedemann and R. Franz (1853).
Over certain temperature ranges Equation (12-23) is found to be obeyed quite well.
However, the validity of the relation depends on an implicit assumption, namely, that
the relaxation times, the t's appearing in Equations (12-19) and (12-22), are the
same. It turns out that the phonon processes that relax electrons in electrical
conductivity are not identical to those involved in thermal conductivity. The t 's then
are not quite the same and the Wiedemann-Franz law often breaks down.
In this section we have investigated how we can understand some of the properties
of metals on the basis of a free-electron model. The model is found to work pretty
well. But surely electrons must interact strongly via the Coulomb force with the ion
coresand not just weakly with core vibrations. Interaction with other electrons ought
tobe present as well. How are we to understand the origins of a free-electron model?
The answer to this question is the subject of Section 12-4.
Example
596 Solids
m 9.1 X 1CT 31 kg
T = 2 3
'V (1.6 X l(T 19 C) [l/(2 X 10 ,0
m) ](l. 7 X 1(T 8 B m
• 2 X 10
H s.
In Problem 8 at the end of the chapter the reader is asked to compute the mean
free path, the distance an electron travels in this time.
The sodium atom has a single 3^ electron outside a neon-like closed electronic shell
^iven by Is Is 1p . When a large number
sodium atoms are assembled they form
of
a bound metallic crystal. We this result by examining a simple
attempt to understand
crystal model. Our primary result is that the outermost atomic states no longer have
the energy values found in atomic sodium but are split into bands of energy levels.
These states correspond to delocalized electrons that are no longer fixed to individual
sodium atoms but are able to move through the entire crystal. The electrons in states
of a band behave, in many ways, like free particles as claimed in Section 12-3.
We do find fundamental alterations to the free-particle picture. Between bands of
allowed energy states there are forbidden energy regions called band gaps. The
existence of these gaps leads to the possibility of understanding the nature of insulators
and semiconductors as well as metals.
Just as the quantum tunneling of an electron from one hydrogen atom to the other
results in the binding of H.j , so also the electrons hopping among the many ions of a
metal can lower the total energy of the system.
Equation (10-7) illustrates what happens to the electron energy levels when two
hydrogen nuclei are placed close together. The tunneling of the electron through the
small barrier midway between the two nuclei results in a splitting of the I* atomic
level into two unequal energy levels. The wave functions corresponding to these states
no longer describe an electron situated on just one or another atom; they are
delocalized. Nevertheless, they are, to a good approximation, combinations of the
localized atomic states. The energy banding in metals is basically the same effect;
tunneling splits the levels.
Figure 12-20 illustrates the potential energy seen by a single electron in a sodium
crystal. One of the crystal edges is shown at the right end of the figure. Note that the
potential energy inside the crystal is lower than its value at the edge or at infinity. The
Figure 12-20
Schematic diagram of the potential energy curve seen by an electron in metallic sodium. Also
shown are the atomic 2s and 3s levels of the sodium atom.
/YVY\
124 Energy Binds in Solids 597
reduced interparticle barrier is what allows the electron to move from nucleus to
nucleus. At the edge the electron sees the higher barrier and is unable to escape from
the interior.
The 15, 2s, or 2p states, see hardly any change in the
core electrons, those in the
potential energy when the atoms are put together and so their energy levels are
affected very little. Only the 3* and higher levels are changed.
We now consider the details of a model in which an electron moves along the x axis
in just one dimension. The potential energy function centered at one site R n is the
Coulomb potential energy
V = V(x-R )= n
.
(12-24)
47teJ.v-.KI
Note that the charge on the sodium ion is just + e. The electron has a total potential
energy W{.x) given as a sum of all such Coulomb functions as
W{x) = V + V2 + x
••• +VN (12-25)
for N ions.
If our model crystal has boundaries like a realistic one, the Cn values at and near
the right and left edges of the crystal have a fundamentally different character from
8
those near the center. In a very large crystal, say, one with 10 sites along an edge,
these boundary positions play a relatively unimportant role in determining the energy
of a given state; an electron spends only a negligibly small percentage of its time near
an edge. To remove the complications of these edges without fundamentally affecting
the physics, we use instead what are known as traveling-wave or periodic boundary
conditions. Rather than demanding that the electron wave function be zero to the left of
site 1 or to the right of site N, we assume that the wave function begins to repeat itself
when it reaches a boundary. Thus, as shown in Figure 12-21, when the electron passes
site N by one interatomic distance, denoted by a, it finds itself back at site 1.
Similarly, if the electron goes to the left of site 1 it finds itself at site N, and so on. This
arrangement can be thought of as the crystal having itself wrapped in a circle and tied
head to tail as shown in Figure 12-22. While such a circular arrangement is difficult to
visualize in three dimensions, we need not worry about it because it is only a
mathematical artifice used to make the solution easier to find. The type of boundary
condition used, as noted above, makes negligible difference to the energy levels for a
very large system; it does make a large difference in the amount of work necessary to
find the energy levels.
598 Solids
Figure 12-21
N - 1 N 1 2
[T+ V + x
V2 + ••• +V„]+(x) = E+(x), (12-27)
where the kinetic energy operator is T = —(k 2 /2m) d 2/dx 2 We can make an .
immediate simplification if we use the fact that each <£„ is a 3s function corresponding
to the potential energy Vn located at R n that is, we have ;
[T+ Vn ]<t> n
= E <t>„, (12-28)
where E is the energy of the 3s state. Then when the quantity in square brackets in
Equation (12-27) acts on «f>, in \p(x), we get [E + V2 + + VN ]. When it acts on • • •
4> 2 we get [V + EQ + V3 +
x
+ VN ], and so on.
• • •
C,(£ + v2 + • + vN )* + c2 (r, + E +
x
V, + • • • + VN )ct> 2
+ ••• +CN {V X
+ ••• +VN _ + E X
)4> x
= £(C>, + C2 4> 2 + +CN N <i> ). (12-29)
What we now want to do is to develop algebraic equations for the Cn 's from this
Figure 12-22
N- 1
12-4 Energy Bands in Solids 599
simplifications and approximations are possible in the resulting equations. Since, for
example, § x
is large only in the neighborhood of site 1, and V2 in the neighborhood of
site 2, and so on, we expect any integral involving more than one site to be relatively
small. These integrals are analogous to the overlap integrals encountered in Chapter
10. Integrals involving three different sites involve the "leakage" of a function into a
region two sites away so that, for example, we have
~ dx ~
f^y^ dx or /"<f>,K>4> 3 0.
Integrals involving only two different sites that are nearest neighbors cannot
normally be neglected. Thus, the integral
J = faVfadx
X2
(12-30)
is a measure of the probability that an electron hops from one site to its neighbor, and
must be kept in our equations. All near-neighbor J 's have the same value, which we
denote simply as J:
J 2=J2 =Jn»-l=J-
l i
(12-31)
There are two other near-neighbor overlap integrals that arise in the operations we
are considering. These are of the form
d= jW^i + v3 h^ 2
and
that occurs in the normalization integral of the eigenf unction \p. While Q and / are
corrections that ought to be included in any rigorous calculation of the electronic
energies, they do not change the basic physics and we simply drop them in this
qualitative discussion.
Finally, we assume that each of the atomic functions <f> n
is normalized as
hi dx=\,
It follows easily from the above discussion of overlap integrals that the result of
multiplying Equation (12-29) by <#>„ and integrating over x is to give the equality
Cn E +C _J+C + J =
n n
EC n
or
Cn (E -E) + (C n _ l
+ Cn+l )J=0 (12-32)
These results have a very simple structure. If J= 0, the solution for the energy- is
just E , the localized atomic energy. However, quantum tunneling of the electron to
nearest-neighbor sites is represented by the overlap integral J. Each Cn is coupled to
those of the neighboring sites, Cn _ and Cn+
,
by these small hopping terms. Equation
,,
We can guess that such solutions are provided by choosing the form
hR *.
Cn = Ce' (12-33)
The variable k turns out to be the wave vector for the electronic motion, and C is a
normalization constant. By substitution we show that this choice does indeed satisfy
Equation (12-32).
The ion sites are taken to be at 0, a, 2a, ... ,(N — 1 )a, or
where a is the distance between lattice sites. With the ansatz of Equation (12-33)
plugged into Equation (12-32), we find
u" "-'i)a
Ce'
,)a
(E - E) + C{e'
k(
+ e'
k "a
)J =
01
The guessed solution, Equation (12-33), has worked and the result for the energy is
precisely the same energy. This equality has now been removed by the possibility of
tunneling of an electron from one ion to the next, so that we now have N distinct
energy levels each characterized by a different value of k. Before we can further
understand the nature of these levels we need to determine the values taken on by the
quantum numbers k.
The k 's are determined by the periodic boundary conditions. When we go to site
N + 1, we assume that we are back at site 1. From Equation (12-33) this implies
oi
= 1, (12-36)
so that
k = —
277
Na
t, {= 0, ±1, ±2,... . (12-37)
J 2-4 Energy Bands in Solids 601
Figure 12-23
Plot of the electronic energy levels Ek for a one-dimensional model of a metal. We assume
J< and find the width of the band to be 4[/|. The band is centered on the atomic energy E .
We see that for N large, the allowed k values are very densely spaced just as are the k
values of Equation (5-30) or (12-6) for a particle in a large one-dimensional box of
size L = Na. However, these k values are not precisely equal to those of the particle in
a box treated in Chapter 5. The difference is in the factor of 2 in Equation (12-37)
and the fact that negative as well as positive { 's are allowed. These distinctions arise
only because of the different boundary conditions used in the two cases. In Chapter 5
the infinite potential energy well forces the wave function to be zero at the ends of the
box, thereby producing standing waves in the box. In the present case we have used
traveling-wave boundary conditions. If we use these conditions on a free particle in a
have N
distinct states when the ions are close enough for tunneling to take place.
However, there are an infinity of k values listed in Equation (12-37). On the other
hand, the energy levels of Equation (12-35) are periodic in k. What we intend to show
is that certain k values are exactly equivalent to other k values so the set given by
Equation (12-37) has multiple redundancy. To see this redundancy we take as our
basic set the values
where we have assumed N is even so N/2 is an integer. There are exactly Nk values
,
602 Solids
in the set given. If k is inside this basic set, the vector k + G, with G defined by
G= —
277
a
(12-39)
is always outside the set. This is easily seen by considering several explicit choices of
wave vector. For k = we see that k + G = {2"n/Na)N corresponds to ("= N, which
is certainly outside the k set given by Equation (12-38). For k = {2m/Na)( — N/2) we
have
-N
k + G= —
277
Na
•
2
+ —277
Na
N=
277 TV
Na 2
,
corresponding to c" = N/2, which is just barely outside the set of allowed k values of
Equation (12-38). G is the width of the allowed /:-vector set. Next, consider the
eigenfunctions for the various k values. Upon substituting the Cn values of Equation
(12-33), the general eigenfunction ip(x) of Equation (12-26) is
-
4> t (x) = C[<f>, + *'% + e
i2ka
<t> 3 + +*'< JV 1
>% 1
. (12-40)
we see that
The k values outside the basic set do not lead to distinct wave functions but simply
reproduce those corresponding to k values in the basic set. Obviously, we also have
Ek + G = E k
. Additional redundant sets of k 's can be generated by considering multi-
ples of the G value of Equation (12-39).
The basic set of k values is said to be inside the Jirst Brillouin zone, a concept devised
by L. Brillouin. Other k values are in second or higher Brillouin zones. The vectors in
each zone are perfectly equivalent to those in any other. The unit G and its multiples
make up a set of wave vectors known as the reciprocal lattice that we have discussed in
Section 12-2. The values of these vectors in three dimensions are given in Equation
(12-3). The G's that we have been considering can also be defined by the relation
>g„r„
= 1, (12-42)
which is equivalent to the one-dimensional version of Equation (12-3). The fact that
the reciprocal lattice vectors, originally associated with Bragg scattering, also arise in
the study of the states of electrons in crystals is no coincidence. The relation is
We now examine the energy levels for an electron in a crystal a bit more closely.
The energies of Equation (12-35) are centered on the atomic energy level E ,
which in
the case of sodium metal is the 3s energy. They are spread in a band about E . If we
124 Energy Bands in Solids 603
Figure 12-24
zone.
assume that this band has J < 0, the energy corresponding to k = has the lowest
energy, namely, E — 2\J\. The largest energy occurs at the two edges of the Brillouin
zone where k = +ir/a, because there we have
The energies for k values in the first Brillouin zone are shown in Figure 12-24.
The energy minimum around k = is parabolic so that we are able to show the
relation of our band energy to free-particle states. To see this, expand the band
energies of Equation (12-35) for k around k = 0. We use the Taylor series result
(kaY (ka]
cos ka = 1
- +
2! 4!
E, = (E -2\J\) + \J\a-k\ k ~ o.
2 2
h k
(12-43)
2m'
where the electron moves as if it had an effective mass identified by the relation
2
L/l« = ir-z
or
tr
2
(12-44)
2\J\a
Note that this mass has nothing directly to do with the real electron mass; it depends
mainly on the tunneling integral J. For small k the particle moves like a free particle
but its inertia depends on the overlap of neighboring atomic wave functions and the
604 Solids
resulting tunneling rate rather than the real mass. In some circumstances the effective
mass can even be less than the true mass m. However, if the tunneling is large because
the atomic energy level happens to be very close to the top of the potential energy
barrier, the effective mass is nearly equal to the real mass m.
When the tunneling is very large it may
start off with an assumed be incorrect to
eigenf unction of the form given by Equation (12-26). Such a wave-function ansatz is
known as the tight-binding approximation in which it is assumed that the atomic energy
,
is a good first estimate of E and that the electronic wave function is reasonably close
k
277 2lT
k ir /a
The wavelength is now commensurate with the lattice so that strong backscattering is
present, and, more importantly, the incident and scattered waves strongly interfere.
To see this, consider the condition of strong reflection from two near-neighbor nuclei,
as illustrated in Figure 12-25. Two rays of the wave incident from the right are
reflected, one from site 1 and one from site 2. There is strong reflection when the
wavelengths are such that the two waves' components interfere constructively. This
occurs when their path difference, which is 2a, is equal to one wavelength, or A = 2a.
Thus, as the electron's k vector approaches the zone boundary, the electron is more
and more strongly reflected. The effect is equivalent to an electron possessing a
negative effective mass. The electron gains energy but slows down as it does so.
Obviously, this discussion is just the one-dimensional version of Bragg scattering where
the scattering plane has reduced to a single particle. It is easy to see that the
momentum transferred to the lattice in this case satisfies Equation (12-3). Values of k
outside the Brillouin zone are not needed because every time an electron reaches a
zone boundary it is backscattered to the other end of the zone. Its k value is trapped
in the zone.
The above discussion illustrates an important point; the quantity hk of an electron
in a crystal often behaves like a momentum but really should not be considered to be
one under all circumstances. This situation arises because pushing on the electron may
cause the entire crystal, and not just the electron, to gain momentum via interference
effects and interaction with the lattice. The vector hk is often called a quasimomentum
for this reason. Nevertheless, hk behaves sufficiently like a momentum that the
free-particle picture often remains reasonable.
Up to now only a single energy band has been considered. We must consider bands
corresponding to other atomic orbitals. The energy band under discussion for sodium
has been the one spread out around the 3s state. Lower energy levels, the Is, 2s, and
12-4 Energy Bands in Solids 605
Electron reflection from the ions of a one- Some energy bands in a one-dimensional
dimensional crystal. The waves are shown at model Note that the value of Jm
of a metal.
an angle for clarity. The rays reflected from increases with m band is
so that the higher
sites 1 and 2 are in phase when the path wider. The forbidden region between bands is
difference 2 a is exactly one wavelength. called a band gap.
Band gap
2p states, are so tightly bound that tunneling rarely takes place; the crystal energies
are almost unchanged from their atomic values. On the other hand, the higher states,
3p, 3d, and so on, are much more loosely bound. Tunneling is very large between
different sites. These states form bands that are broader than the 3s band. In general,
we have several energy bands given by
where m is the band number and refers to the atomic state, 3s, 3p, on which the
band is based.
Note that the overlap integral Jm in Equation (12-45) depends on the band
number m. This is in accordance with the idea that the tunneling rate should increase
as m increases. In Figure 12-26 we illustrate a typical band structure for two of the
bands in our simple sodium model. The higher band is broader than the lower one.
We have assumed that the sign of Jm alternates as we proceed from one band to the
next. This is not unusual but we do not justify that assumption here.
A further feature of the spectrum is the existence of band gaps. These are the
forbidden energy regions between bands. Obviously, for a single atom, the forbidden
energy regions dominate the allowed regions, which are the discrete levels. The
opposite is usually true for the higher bands in a crystal; states where hopping is easily
possible have allowed regions that are much wider than the gaps. Band gaps are of
fundamental importance to the understanding of metals, insulators, and semiconduc-
tors as we see in Section 12-5.
Example
We consider a metal for wh ich a = 0.2 nm and m* is the true electron mass.
The tunneling constant J is
2 -34
h (1 1 x 10 j •
*f
171 2
= 1.0 eV.
2 ma 2(9.1 X 10"
31
k g )(2xicr
10
m) (l.6x 10" 19 J/eV)
606 Solids
The width of the band associated with this J is then 4\J\ = 4 eV. The time It
for an electron to hop from one ion core to the next can be estimated from the
uncertainty principle as
-34
h 1.1 X 10
a' ~ (,.o.v)(..6x,o-jav)
J •
s
=- 66 x 10
"6
=w s'
the Fermi sphere is shifted slightly in momentum space. On an energy level diagram
like the ones we have been using, this shift is shown as illustrated in Figure 12-28.
There are slightly more electrons with positive k vectors than with negative k and so
there is a general drift of electrons in the positive k direction; a current is set up and
we have a metal. On the other hand, with a full band as in Figure 12-276, there is no
allowed energy region for the tilt to occur because of the gap. Obviously, materials
with full bands like this one are insulators.
Figure 12-27
Filling of bands by electrons. A solid with one electron per atom has a half-filled band as
indicated by the cross-hatching in (a). This material is a metal. If there are two electrons per
atom the band is full, as shown in (b), and the substance is an insulator.
a i
(6)
12-5 The Band Theory of Metals, Insulators, and Semiconductors 607
Example
We list the band gaps of several materials below. Excep : for diamond all are
considered to be semiconductors.
CdS 2.4
Si 1.1
Ge 0.7
Te 0.3
InSb 0.2
608 Solids
Figure 12-30
Conduction band
sc
Valence band
n(R)
12-6 Semiconductors
based on this technology. We discuss these applications in Section 12-7 but here we
outline some of the basic ideas of semiconductors.
Wehave seen in Section 12-5 that semiconductors are characterized by small band
gaps so that charge carriers can be thermally activated from the filled band. This
filled band contains the electrons from the outermost shells of each atom and so is
called the valence band. The normally empty band to which charge carriers can be
excited is then the conduction band.
In an intrinsic semiconductor, all the electrons in the conduction band have been
thermally excited from the filled valence band. Such a situation occurs in pure
crystals. However, semiconductors containing impurity atoms may have localized
energy normally forbidden band gaps. The activation energy of the
levels within the
electron in these states can be considerably less than that of valence electrons. Such a
doped semiconductor is said to be of the extrinsic type. While most technical applica-
tions involve extrinsic semiconductors, we first consider the intrinsic type.
The number of charge carriers in the conduction band is given by the usual Fermi
function
"*=-57—^ > (
12 " 46 )
where e F is the Fermi energy. Figure 12-30 shows the almost-filled valence band and
the partially filled conduction band. The Fermi energy is shown as lying within the
band gap. This may be a bit of a surprise since, for a gas of free particles at T= K,
the Fermi level is the energy of the highest filled level, and for T> K it is the
energy for which n h has dropped to one-half of its maximum value. But the Fermi
energy is more fundamentally the Lagrange multiplier in our statistical treatment of
Chapter 1 1, which determines that the sum over all k of Equation (12-46) comes out
to be equal to the number of particles N. We now show that this condition puts e F in
Assume that the top of the valence band is quadratically inverted downward at
k = 0, indicating negative effective mass. This could just as well be a region of a band
out near the Brillouin zone boundary as in Figure 12-276, rather than at k = as in
Figure 12-30. The results are the same. The energy e c is defined as the bottom of the
conduction band, and c„ is the top of the valence band as shown in the figure. Then
the number of particles in the condition band is
N
t
= I nk . (12-47)
k
If there are N electrons in the valence band at T = K, the number of these electrons
excited out of the valence band by thermal agitation is ./V minus the number
remaining in the band:
N. = N- £ nk = £ (1 -n k ), (12-48)
k k
(**<«) (e*<e„)
where the last equality comes from the fact that the total number of states in the
1 14
conduction band is equal to the total number of electrons in it at T= K when it is
full. In an intrinsic semiconductor all the thermally excited electrons must come from
the valence band so that we have
N C
= N V
. (12-49)
nk ^e-K*-*'\ ek >e c
. (12-50)
The last sum on k is easily done by changing to an integral over energy. We know
how to do this when energy depends quadratically on wave number k (like a free
particle) as it does near the bottom of the conduction band. We have measured
energies from e c so that the sum in Equation (12-51) is
Z= £,-*
£>0
2Trm ( k B T\ 3 / 2
Z = C
2V\ (12-52)
610 Solids
associated with the bottom of the conduction band and we denote the corresponding
sum by Z c
. There is an extra factor of 2 in Equation (12-52) to account for the spin
degeneracy of each energy level. We end up with the result
N t
= e^"-^Z t
. (12-53)
From this expression we see that the missing electron population in the valence band is
described by a sort of Fermi function, but with a minus sign in the exponential instead
of a plus sign as in Equation (12-46). However, this minus sign is rather natural since
the valence band, as we have drawn an inverted quadraticit in Figure 12-30, is
1 - n k -» *-««*-«*>
and
k
(e„-ek >0)
Changing the energy variable from e k to e„ — e k gives us precisely the same sum as for
the conduction band, except that the curvature of the band is determined by an
effective mass m v
(a positive quantity here). We get
N v
= €-*"-'•%„
where
3 /2
2irmk R T\
Z„ = 2V\
fr
Equating A^ and A^, according to Equation (12-49), gives an expression for e^:
g
P(e F -e,)2 _ g
-P(e F -t- c )2
P(e F -e )+ t
\\nm c = -fi(e F -e v )
+ \ In m,
or
eF = l
,(e c + e„) + fk B T\n(m v/m t
). (12-55)
From Equation (12-53), the number of charge carriers in the conduction band is
now seen to be
V - e
/;/
'/
(12-56)
where
Eg = ec -t (12-57)
is the band gap. We can see from Equation (12-56) that if E gets too large, the
number of charge carriers rapidly drops toward zero.
We might think that Equation (12-56) enumerates all possible charge carriers.
However, recall that the valence band is unable to conduct electricity only if it is
completely full. When some electrons are thermally excited out of the valence band, it
can again contribute to the conductivity. It is a curious feature that it is much more
convenient to speak of conduction not by the electrons in the nearly full valence band,
but by the holes left behind when electrons leave the band. The existence of
J=-^Iv„
v
(12-58)
k
vt
kx
= (12-59)
h dk x
i 1
with similar equations y and z components. When ek = h k /2m, as for a free
for the
particle, v kx = hkjm is just the normal relation between velocity and momentum.
When e k has a more complicated dependence on k, as can happen in some parts of a
band, we justify Equation (12-59) by considering the electron as a wave. The usual
group velocity of a wave is derived from the wave's angular frequency w A by
du k
Vk *
= -
~dT
x
band. When the band full the total velocity vanishes. For a band energy
for a full is
no surprise
function e k having a complete symmetric shape as a function of a k, this is
612 Solids
d^^y==i
because for every positive vk there is a negative vk . It can be shown that even if the
band is unsymmetric this sum vanishes. However, if one electron is missing from
momentum state k', the current density is
J
= (12-60)
V k k
(occupied) (all)
The band conducts electricity as if there were a positive charge moving with the
velocity of the missing electron. A full band of electrons is in essence electrically
neutral. Remove one and the equivalent of a positive charge appears. The number of
charge carriers in an intrinsic semiconductor includes the holes in the valence bands as
well as the electrons in the conduction band. In an intrinsic semiconductor these two
numbers are equal.
As it turns out, an extrinsic (or is much more useful than the
doped) semiconductor
intrinsic type discussed above. A
atom has four valence electrons outside a
silicon
closed shell. The partially filled shell could, if filled, hold eight electrons. So in the
solid each atom has four covalent bonds arranged in the same crystal structure as
diamond. We show this very schematically in Figure 12-31. Silicon is a semiconductor
with a band gap of 1.1 eV. Suppose a small percentage of the silicon atoms are
replaced by phosphorus or arsenic, each having five valence electrons. Each of these
elements can form the four covalent bonds that silicon has but there is then an extra
electron left over. That electron is loosely bound to the impurity nucleus as shown in
Figure 12-32. It takes only 0.05 eV to break it away from the phosphorus or arsenic
nucleus. This situation introduces new energy states into the band structure. Suppose
we plot band energy versus the position in the crystal as in Figure 12-33. The states of
the impurity are localized, that is, they correspond to motion of the electron in a
restricted region of space. These new states are somewhat like the states of hydrogen
because the electron orbits a single ion. Since it does not take much thermal energy to
break the extra electron loose, the particle is easily excited into the conduction band
and contributes to the number of charge carriers. Because of its ability to contribute to
conductivity in this way, this impurity is known as a donor. As can be seen from
12-6 Semiconductors 613
Figure 12-33
^
^- x
Figure 12-35
Figure 12-34
Energy versus position with acceptor levels in
Silicon crystal with an aluminum impurity. the band gap. A small amount of thermal
One of the covalent bonds is unsatisfied.
energy can cause the excitation of an electron
to an acceptor level, leaving a conducting
hole behind in the valence band.
E
Conduction band
T»- X
~
614 Solids
Figure 12-36
Properties of w-type and p-type semiconductors. The Fermi level in an n-type system is closer to
the conduction band than to the valence band while the opposite is true in a p-type system. The
energies of donor and acceptor levels are indicated by e d and e a respectively. The relative ,
distributions of electrons and holes are represented by the n(f) functions and by the plus and
minus signs.
»(£)
Semiconductors doped with donor levels are called n-lype because the charge
carriers are negative electrons in the conduction band. Those doped with acceptor
levels are p-type because the charge carriers are positively charged holes. Transistors
and other semiconductor devices depend fundamentally on the existence of these two
types of semiconductor, as we show in Section 12-7.
In order to understand those devices, it is important to know how the presence of
impurities modifies the Fermi level. In an rc-type semiconductor there is clearly an
increase in the number of electrons in the conduction band. The tail of the Fermi
distribution is enhanced. However, holes in the valence band can be filled, at least
partially,by electrons dropping into them from donors. The result of this is that the
Fermi level (which still gives the energy at which the distribution function n k falls to
one-half) has moved up toward the conduction band as shown in Figure 12-36a.
Similarly, in a />-type substance the number of holes is enhanced and the number of
conduction electrons diminishes so that e F moves down toward the valence band as
shown in Figure 12-366.
Example
3/2
V iOE* /2
2tt(9.1 X 10~ 31 kg)(l.38 X 10~ 23 J/K)(300 K)
i = 2e- 2
V 10 34
(6.6 X J •
s)
- 40E /2
= e t
(3 X 10
25
electrons/m
3
) (
Eg given in eV).
~9
For a band gap of 1 eV the exponential factor is 2 X 10 which is a
,
Except for the development of nuclear weapons, no other area of technology has had
such an impact on society as the transistor and related semiconductor devices. After J.
Bardeen, W. H. Brattain, and W. Shockley invented the transistor in 1948, small
solid-state devices soon replaced cumbersome vacuum tubes. These devices have been
miniaturized to a very small and can operate very rapidly. Microelectronics, with
size
and monstrous before the transistor, rapidly shrank in physical size, grew in memory,
and became enormously faster. Someone has said that this electronics revolution that
we are witnessing is much more important than the industrial revolution of the last
century because the latter had amplified only human muscle power, whereas the
present revolution is expanding the range of the human mind.
Of course, it is not possible to review here more than a few of the general ideas
involving these devices. We discuss only simple p-n junctions, solar cells, light-emit-
ting diodes, and transistors.
The diode or rectifier is a system that allows current to flow in only one direction.
Such a property is obviously important when using AC to operate objects like
calculators or battery chargers that require DC. Rectifiers can also be used as
detectors in radio receivers; they aid in the separation of the modulating signal from
the carrier wave. They are used in logic circuits in computers as well as in many other
electronic systems. The p-n junction is a rectifier as we now proceed to describe.
A semiconductor doped with acceptors is placed next to a material doped with
donors to form a p-n junction. Actually, the two regions may be part of one single
crystal, which has had from the two ends, but we
different impurities diffused in
pretend that it is made up of two separate pieces just placed next to one another. Just
before the two pieces touch, the energy level scheme and the electron distribution
functions appear as in Figure 12-36. Note especially that the n end has a larger
concentration of conduction electrons than the p end and the p end more holes than
the n end. Placing the two regions together is somewhat like placing a box of oxygen
atoms and a box of nitrogen atoms next to one another and opening the connecting
wall. The original density gradient in each gas cannot be maintained; the oxygen
flows toward the nitrogen and vice versa.
Where this analogy breaks down is in the fact that oxygen and nitrogen are neutral
while electrons and holes have charge. The p region and the n region are both
electrically neutral to begin with, but the flow of electrons from the n region leaves
positively charged donor behind; the flow of holes out of the p region leaves
sites
negative acceptor sites behind. These charged regions set up an electric field that soon
halts the flow of electronsand holes. The result is a region on the face of each material
which is depleted of local charge carriers, holes on the end of the p side and electrons
on the end of the n side, as shown in Figure 12-37. These two depletion regions each
have the charge of the fixed impurities of each material. An electric field exists only in
the narrow depletion regions. This zone turns out to have a width ranging from 10 to
10 nm,
as determined by the material and the amount of doping of each region.
Wemight ask what happened to the electrons that left the n region and the holes
that escaped the p region. Note that the entire system shown in Figure 12-37 remains
neutral. The number of bare donors equals the number of bare acceptors in the
depletion regions. The electrons and holes from each side have simply recombined and
canceled one another.
616 Solids
Depletion regions in an n-p junction. Thin Electrical potential <M*) seen by a charge in
charged layers form on the contact surfaces of the neighborhood of the depletion regions.
the junction. The layer on the n side is charged The electric field E = — d<p/dx points to the
positively because the depletion of electrons right and is confined to the charged layers.
bares the positive donor ions. On the p side
the loss of holes bares the acceptor ions.
Electric
field
n type ptype \ t
Depletion
layers
Depletion
regions
E. =
dx
Since the electric field is confined to the depletion zone, <£(*) is constant except in that
region, as shown in Figure 12-38. The energy of an electron placed in the potential is
changed from ek to e^ — e<f>(x). Thus, the band edges and the Fermi energies are
shifted relative to one another by
as shown in Figure 12-39. Electrons moving from an n region toward a p region now
encounter a potential barrier that most cannot surmount. Holes moving from p
toward n also find a barrier (seen by viewing Figure 12-39 upside down).
Note in Figure 12-39 that the condition of equilibrium between the two materials is
that the Fermi energy e F is the same on both sides of the junction. This is a very
general thermodynamic principle. Just as heat flows when there is a temperature
difference and volume changes when there is a pressure difference, so particles flow
when there is a Fermi energy difference. At equilibrium e F is the same everywhere.
This principle by itself tells us that energy levels readjust as shown.
We can now see how this junction acts as a rectifier. If we apply an external
voltage V to the device with the circuit shown in the inset of Figure 1 2-40, we enhance
the internal electric field and the potential energy barrier as shown, preventing
electron motion from the n side to the p side and hole motion in the opposite
direction. The system is said to be reverse biased. Very little current can flow in such
circumstances. However, if we provide forward bias as illustrated in Figure 12-41, the
12-7 Semiconductor Devices 617
Figure 12-39
* Wo
C
fir.
F
s- H*>
e
-T-r^
external electric field opposes the internal one, the potential barrier is lowered, and
thermally activated electrons and holes can much more easily surmount the barrier.
As V increases, the barrier decreases and the current flow increases substantially.
Clearly, the system behaves as a rectifier.
It is not difficult to be more quantitative about our analysis of this effect. When
V= 0, electrons and There are as many
holes occasionally flow across the junction.
going one way as another and the net current is zero. A few electrons on the n side
have sufficient thermal energy to overcome the potential barrier and move from the n
side to the p side. This current is called the electron generation current I . If the few
electrons that are on the p side diffuse to the junction, they are swept across by the
electric field in the junction. This flow is called the electron recombination current Ier .
/.„ + /„ = 0. 12-61a)
Figure 12-40
1
Forward-biased p-n junction. The barrier is
±ll 1
lowered and thermally activated electrons and
holes can easily flow across the junction.
e(<t>o + V) n p
-III
V
e(<t>o - V)
++ +
618 Solids
Suppose that a reverse bias voltage V is now applied. The number of electrons in
the conduction band on the n side at energy e is, by Equation ( 12-50), proportional to
the exponential factor exp[-/?(e - e F )]. When the energy is shifted by eV, 7 is
e
therefore reduced by an exponential factor according to
~ 'v
tjv) = ies (P) e p -
( 12 - 62 )
For any reasonable V this quantity becomes quite small. 7er on the other hand, , is not
affected by the biasing so that the total electron current becomes just 7er The hole
.
current also is reduced to 7 hr . In the figures 7er is to the left and 7 hr to the right so
that the effects of these two currents add to give a total current
Under forward bias the two generation currents are each enhanced by the
exponential factor so that, for example,
+/wr
IJ<V) " Ag(0)« ,
(12-64)
h= 'eg(O) + / hg (0)
dominates unity and we have Equation (12-65). If V is negative, the exponential can
be dropped; by using Equations (12-6 la) and (12-61b) we can see that the resulting
current — 7 is correctly equal to that of Equation (12-63). Figure 12-42 graphs the
I-V behavior of the p-n junction. Clearly, much more current flows in one direction
than the other, as one needs in a rectifier. We also see that the exponential growth of 7
with V given by Equation (12-66) allows the p-n junction to act as a part of an
amplifier.
The p-n junction has several other uses as well. Solar cells, known also as
photovoltaic cells, are appropriately designed p-n junctions. Light shown on the
junction region, as in Figure 12-43, produces electron-hole pairs. Electrons on the p
side and holes on the n side are swept across the junction by the internal electric field.
These excess charges provide an external voltage, the photovoltage, between the ends
of the material or between metal conductors attached to the ends. Such cells are used
to power calculators, watches, and spacecraft, and to provide commercial electricity.
The latter use is still very limited because of the high cost of fabricating solar cells.
12-7 Semiconductor Devices 619
Figure 12-42
An inversion of the solar cell concept leads us to the light-emitting diode (LED). By
forward biasing a p-n junction a current of electrons and holes is established across
the junction. Some electrons arising from the n side and reaching the p side recombine
with the holes there, while holes going in the opposite direction annihilate electrons.
Radiation may be released in this process (see Figure 12-44). The doped semiconduc-
tor GaAsP emits red light and is often used as a display in calculators and other
electronic instruments. It does not burn out like a regular light bulb and requires very
little power. It is to build a semiconductor device that acts as a laser and
even possible
produces coherent Such lasers are now commonly used in compact disc players.
light.
While the simple p-n junction diode is obviously an important device, a more
complicated semiconductor system, the transistor, is even more vital. While transistors
come in many forms and perform many functions, we consider only one type and one
use —
namely, the p-n-p structure used as an amplifier.
An amplifier has many uses; its role is to turn a weak signal into a strong one. In
high-fidelity equipment, this might be involved with making the very weak signal
from a phonograph cartridge into one sufficiently powerful to drive speakers.
Figure 12-45 shows a double junction, p-n-p, and its associated circuit. We show
that the AC voltage input Vm on the left junction is strongly amplified by the junction
Figure 12-43
- i p
n
FT"
+
+ +
+
t b)
620 Solids
Figure 12-45
p-n-p transistor and circuit to illustrate amplifier operation. The forward- biased p-rt junction
is known as the emitter; the reverse- biased side is the collector. The connector to the n region is
called the base. A small change AFm in the input voltage results in a much larger change AI* iul
Emitter Collector
v
U c
h|iHi>
-v) v„
R Base
Rr-
=^v t
on the right. The left junction, known as the emitter, is forward biased so that a large
current flows. By Equation (12-66) the current is
*E J
E0 e >
(12-67)
where VF is provided by the battery shown. What the emitter does is to remove
electrons from the n region and, more importantly, emit holes into it. These holes drift
across the n region and, if the geometry is right, most are picked up at the second
junction, the collector, before they can reach the base. The current at the collector
(which is reverse biased by the battery of voltage Vc ) is, in the absence of the
picked-up emitted part, just the saturation current ICQ Inclusion of that portion of the .
emitter current picked up by the collector gives for the total collector current
where y is a geometrical factor that measures the fraction of emitter current picked up
by the collector. The final approximate equality assumes that the saturation current is
quite small. The fraction y can be very close to unity.
When the input voltage undergoes a small change AFm , it causes a change in the
emitter current A/£ given by differentiating Equation (12-67):
M ~ yME
c .
12-8 Phonon Dynamics 621
= p,v'
T) I e 12-70)
kB T
As we see in an example at the end of this section, this ratio can be more than 100.
Note that the right side is independent of Vm in this approximation. A sinusoidal Vm
results in a sinusoidal output voltage Vonl of larger amplitude; the device is linear.
However, if V
m gets too large the approximation used in deriving Equation (12-69)
breaks down and the output voltage becomes distorted, which is undesirable in, for
example, high-fidelity applications.
Transistors have many other applications, including uses as switches and computer
memory elements, which we do not discuss here. The reader might take an inventory
of the electronic equipment in daily use to see how amazingly dependent we have
become on semiconductor devices.
Example
A familiar light-emitting diode gives off red light. From this information we can
figure out the band gap E in this material, because we know that E = hc/X.
Since the wavelength corresponding to red light is around 650 nm, we have
1240 eV •
nm
1.9 eV.
650 nm
Example
-5 3 _1
7j * (I0 3 fi)(l0 ^)(10 )(40F )
= 400.
Our discussions so far in this chapter have been concerned with systems involving
electrons. We now switch emphasis in this section to some properties of solids not
dependent on the presence of mobile electrons but
dependent rather on the motions of
the much more massive ion cores. While these atomic structures do not generally
diffuse throughout the crystal they do vibrate about the lattice sites and create
622 Solids
Figure 12-46
fe o
,v l2 * v 23
collective wave motions that we have called phonons. In Chapter 1 1 we have seen the
influence of phonons on the thermal properties, especially specific heat, of solids.
There we use some properties of phonons that are just quoted. In this section we
provide further justification of those properties.
The Born-Oppenheimer method mentioned in Section 10-1 in the treatment of
molecules is based on solving the electronic Schrodinger equation by assuming fixed
nuclei. The electronic energy then acts as a potential energy function for the
interaction of the nuclei. This same principle is important for solids. The forces an
atom in a crystal feels from neighboring atoms are caused mainly by electronic
interactions.
Suppose we consider a one-dimensional line of atoms as a model of a crystal, as we
have done above to investigate the electronic structure. Now we want to use the model
to study the motion of the entire atom. Figure 12-46 shows the curves that might
describe the potential energies of an atom due to its two nearest neighbors in a
covalent or van der Waals crystal. The total potential energy is seen to be symmetric
and to have a minimum at the equilibrium position of the atom. Such a picture
applies to each atom in the crystal (except for the atoms on the surface whose
potential energies are not symmetric).
Generally, the motion of an atom about equilibrium is small so that it is reasonable
to expand the potential in powers of the distance from the equilibrium position. This
argument has been used to discuss diatomic molecules in Section 10-7. There the
atoms in a diatomic molecule are considered to be bound harmonically. Here the
same result is true; the bottom of the potential energy well of any particle can be fit
by a parabola. However, that parabola is determined by interaction with many
neighbors, not just one other atom as in the diatomic case.
In this way we justify, to a certain extent, the Einstein model of a crystal treated in
Chapter 1. But particles 2 and 3 in Figure 12-46 do not remain stationary; they also
1
Figure 12-47
12-47. The particleon the right of the iVth is the first, and that to the left of the first is
the A^th. We can consider the entire crystal as strung out in a large circle with the
ends tied together by a spring.
We want waves on our jV-body one-dimensional
to solve for the frequencies of
system. We can do this classical equations of motion or the
by solving either the
Schrodinger equation. The allowed frequencies turn out to be identical in the two
approaches. The former approach is simpler and we proceed that way.
The spring constant is K, the mass of an atom w, the position of the nth particle
xn , and the equilibrium separation of the particles a. A particle's displacement from
its equilibrium position R n
is
un = xn -R n
(12-71)
We have taken the spring to be at equilibrium (/?,, = 0) when the particles are
separated by a lattice distance a, that is, when x .,
— x ]
= a. In terms of the u 's, we
have
Figure 12-48
Displacement of a particle in a one-dimensional crystal. In the system shown, all particles are at
their equilibrium positions Rn except particle 2, which is displaced a distance u 2 = x., - R ,.
x7
I <h
Ba
624 Solids
Figure 12-49
Example of a sinusoidal wave at one instant of time. The equilibrium positions are indicated by
the vertical lines. The wavelength is 12 lattice spacings.
(J)
10
—^H-OJO^O-^^^H^f^T-^H— c| (J)
The plus sign in front of the last term means that the force on particle 2 due to
particle 3 is pulling it to the right when u 3
— u 2
is positive. Since x 2 = "_>• Equation
(12-73) can be written completely in terms of the u's.
There is nothing special about particle 2 or any particle in the system when
periodic boundary conditions are used. Hence the equation for the nth particle is just
like Equation (12-73) and can be written as
ikR
un = se 'e-
iut
t
(12-75)
where k is the wave number and s is the amplitude. We could use sines and cosines
but the exponential form is easier.
— se' na ^' wl
Dividing out e gives
cc
2
m = K{\ - e-'
ka
)
+ K{\ - e'
kn
)
= K(2 - 2 cos ka).
X 1
sin
2
— = — ( 1
— cos .
2 2
we have
4 A' ka
or
2
= •
sin'
2 —
m 2
or
= 2l/-
[* ka
—
to sin (12-76)
V m 2
where the absolute value signs occur because a frequency is always a positive quantity
by definition.
12-8 Phonon Dynamics 625
For small wave numbers (or long wavelengths according to the relation k = 2ir/\)
the discrete character of the system, that is, the fact it is made up of many individual
masses, is not noticeable to the wave. Then Equation (12-76) can be simplified by
using the approximation sin x ~ x so that
(12-77)
V m
In this long-wavelength limit the phase velocity co/A and the group velocity dic/dk
are identical and equal to
Ka 2
.
(12-78)
m
The velocity does not depend on wave number so there is no dispersion in this limit.
For shorter wavelengths the approximation of Equation (12-77) is no longer valid and
waves of different frequencies travel at different velocities; the dispersion arises
«o
= un (12-79a)
and
u N+l =u 1 ,
(12-7%)
since to the left of particle 1 is particle N and to the right of particle N is particle 1.
and
ikNa
e = 1
so that
2tt
k=—{, (= 0, ±1, ±2, ±3,... . (12-80)
Na
These are precisely the same k values as those allowed by Equation (12-37) for
electrons moving through a crystal. And as in that case there are limits on k space.
Values of k out of the first Brillouin zone are equivalent to those within it. This is easy
to see by adding the vector G of Equation (12-39) to any k value:
u n
(k + G) = se
,{k + G) " a
e-'"' = (»«'*"««-«'•"
y 2 »" = u n (k).
Thus, any region of k vectors having a width G is adequate to specify all possible
waves. We choose these vectors to be those in the first Brillouin zone as given by
Equations (12-38) and (12-80).
Note again that there are exactly N of these vectors in Equation (12-80); this as is it
should be because there are N particles and one expects to find exactly N distinct
waves corresponding to the N degrees of freedom of the system.
626 Solids
Figure 12-50
The k values of Equation (12-80) are spaced quite closely together because of the
larger number N in the denominator. In plotting to versus k from Equation (12-76),
we can treat k as a continuous variable. Such a plot is shown in Figure 12-50. There
are two branches, one corresponding to positive k and one to negative k. Because we
have used periodic boundary conditions, we can have traveling waves on our chain;
the waves do not meet a wall and undergo reflection resulting in standing waves.
Positive k corresponds to waves traveling in the positive direction and negative k to
oppositely directed waves.
In Chapter 1 1 , we have derived the contribution of phonons to the heat capacity of
crystals. Our discussion there considers a chain of particles connected to walls, rather
than having periodic boundary conditions. The use of wall boundary conditions
results in standing waves. Negative k has no meaning in such a situation and the
spectrum shown in Figure 11-10 has only a positive k branch. However, there are no
is only one branch in the
differences in the thermal properties because, although there
wall-boundary case, the k values are twice as densely distributed. This discussion is
identical in spirit to the one given for electrons right after Equation (12-37).
The use of a complex value for the displacement u n in Equation (12-75) may
bother some readers since displacements must actually be real. However, since
solutions for plus and minus k correspond to the same frequency, the complex
conjugate of Equation (12-75) is also a solution for that frequency. Furthermore, since
the sum or difference of two solutions corresponding to the same frequency is also a
solution, we have, for example, as another solution
ikR nB + iul
-I- se~ 2scos{kR n -at),
= C V V 2T, (12-81]
where Cv is the heat capacity per unit volume, v is the velocity of the heat carrier, and t
12-9 Magnetism in Solids 627
is the relaxation time (time between collisions). We find in Problem 1 1 at the end of
the chapter that k t of Equation (12-81) reduces to the proper result, Equation
(12-22), when applied to a degenerate electron system. While phonons differ from a
usual gas of particles (such as electrons) since they can be created and destroyed,
Equation (12-81) still applies to them.
From Equation (1 1-68) we have, for low temperatures (ignoring factors of order 1),
(k B Tfk k
C vv ~ 3
(*0 '
where cs is the velocity of sound in the crystal. The heat carrier velocity in Equation
(12-81) for phonons is also cs .
k%LT 3
•
h\2
AT 3
temperature dependence of k t is indeed observed in insulators at the lowest
temperatures.
Because the atoms that make up a crystal can have electrons with unpaired spins, the
solid as awhole can have a net magnetic moment. Because the atoms are in the solid
between them can often alter the character of this magnetism
state, the interactions
where [i
B is the Bohr magneton and g s is the electron g-f actor. In a magnetic field B
the possible energies of the spin are, from Chapter 8,
(12-83), spin-down states, that is, those that have a = — 1 and are antiparallel to B.
have a lower energy than those aligned with the field. Thus, the more probable states
in a thermal distribution have more particles with spin-down states than with spin-up
states. The entire collection of electrons then has a net magnetization, which can be
detected.
The magnetization per unit volume of a collection of electrons is defined as
M= M (p + -p_), (12-84)
M = X B, (12-85)
Z= £ e
frBa = e
foB + g
-frB = 2 cosh fifiB.
a= ± 1
P+ - p_=p(o), (12-86)
where p is the density of all electrons. The average value of a in Equation (12-86) is
oe foBa e
PnB _ e
-PfLB
M = ixptanhBjiB. (12-88)
12-9 Magnetism in Solids 629
For very high temperature, such that uB «: k B T, the argument of the hyperbolic
tangent is small. For small x a Taylor expansion gives
tanh.v * x, x «: 1, (12-89)
so that we get
X =
A
M
—
5
= —
pur
kB T
(12-91)
This expression is known as Curie 's law, after P. Curie, giving x for a classical electron
weakens the ability of the external field to align the spins. Many solids exhibit
Curie-law behavior.
Because the hyperbolic tangent goes to unity for large values of its argument,
Equation (12-88) gives, for low T,
M r-»o
> up,
which corresponds to all particles aligned with the field. Of course, if Fermi statistics
has become important before the low-temperature condition k B T <^ [iB has been
satisfied, the last limit is not valid.
The magnetism of electrons in a metal cannot be described according to the
treatment just given. Those electrons are highly degenerate since the temperature
satisfies T <s: TF (the Fermi temperature is 10
4
or 10
5
K in most metals). We can take
the temperature to be essentially K. With no magnetic field, any filled momentum
state contains one spin up and one spin down and then the net magnetization M is
zero. When a magnetic field is turned on, the Zeeman energy of Equation (12-82) is
lowered if some spins align along the magnetic field. But such spins cannot just flip
into a different alignment, because then there would be two identical spins in the same
momentum state. The spin that is flipping must also be promoted to a higher
unoccupied momentum As further spins are turned over by increasing the
state.
external field they must be given even more additional kinetic energy to avoid
violation of the Pauli principle. Even at T = K a degenerate Fermi system has a
finite susceptibility, unlike the Boltzmann spin system. The behavior of the magnetism
Figure 12-51
aEfe.
Example
M pB
P=
HP
24
(9.3 X 10 J/T)(1 T)
" _23
=
"
6.6 x 10~ 3 .
(1.4 X 10 J/T)(100K)
Problems 631
Problems
1. (a) Compute the number density in terms of the nearest-neighbor distance a for the
two-dimensional triangular lattice, the square lattice, and the honeycomb lattice
(b) Assuming only nearest-neighbor interactions of strength -e, find the energy per
particle of the three lattices in (a) and of an arbitrary two-dimensional lattice in
o o o o
o o o o
o o o o
o o o o
o o o o
o o o o
o o o o
2. (a) In the triangular lattice of Figure 12-2, particles 1 and 2 are nearest neighbors, and
particles 1 and 3 are second- nearest neighbors. Each particle has six near neighbors.
How many second neighbors does each particle have? How many third neighbors?
(b) Find the ratio of total second- neighbor interaction energy to total first-neighbor
energy if the potential energy falls off like the van der Waals interaction, — l/r b . Is
numbering the particles in each of your own versions of the two drawings show that the
two views are equivalent.
4. (a) Find the volume per particle in terms of the nearest-neighbor distance a for the SC
lattice, the FCC lattice, and the BCC lattice,
(b) What is the energy per particle in each of these lattices if only nearest-neighbor pair
interactions, each of energy — e, are taken into account.
the example at the end of Section 12-1. Attempt to estimate this energy by
considering a large number of terms in the series.
(b) Evaluate the sum exactly by comparing it with the Taylor series expansion of
ln( 1 + x ).
where the n, are arbitrary integers and i,y, z are Cartesian unit vectors, gives the
,
632 Solids
R = (a/2)(n k + i
n2y + n 3 z),
with B,, b 2 ,
"i integers whose sum is even?
(b) Find the set of vectors representing all the lattice points of a BCC lattice.
R= a( «,u + n v)
where n ]
and n 2 are arbitrary integers and u and v are unit vectors given by u = x,
(b) The reciprocal lattice vectors for the triangular lattice are given by
G = {2m/a)(m ]
\i* + m.,v*),
where m {
and m, are integers and u* and v* are vectors defined by u* v = v* •
•
u = 0, and u* • u = v* v = 1. Find
• u* and v*. What kind of lattice does the set
of G vectors describe? What is the "nearest-neighbor distance" in the reciprocal
lattice?
8. (a) The Fermi energy of copper is about 7 eV. What is the Fermi velocity?
(b) Given the value of the relaxation time t in the example at the end of Section 12-3,
find the mean free path of a conduction electron. Compare this with the average
interatomic spacing.
9. Show that the current density (intensity) of a particle beam (number per unit area per
unit time) is particle density times particle velocity.
10. The thermal conductivity of copper at room temperature is 400 W/m K. Use this value •
to estimate the relaxation time T. Compare with the value obtained from the electrical
conductivity in Section 12-3.
11. Show that Equation (12-81), when applied to a degenerate Fermi system, reduces properly
to Equation (12-22).
12. (a) Repeat the derivation of Equation (12-32) for the case N= 3. Here the periodic
boundary conditions require that particle 3 be a near neighbor of particle 1. Find
the energy levels and solve for the Cn 's by setting the determinant of the coefficients
in the equation to zero,
(b) What are the k values that make up the first Brillouin zone for N= 3? Using these,
compare your values for the energies from part (a) to those of the general solution of
Equation (12-35).
13. Consider a free particle in a one-dimensional "box" of length L where the boundary
condition is periodic, that is, i//(0) = 4>(L). Show that the allowed k values satisfy
Equation (12-37) with Na replaced by L.
14. Show that the product of the number of conduction electrons N c
and the number of holes
Nr
in a semiconductor is independent of the Fermi energy eF . The relation for NN
( r
is
Problems 633
called the law of mass action. Since the relation is valid for any eF , this product is the same
independent of degree of doping. If the number of conduction electrons is increased by
adding donors, the number of holes must decrease correspondingly. Evaluate N N /V~
C V
for diamond, silicon, and germanium at room temperature by using the gap data given at
the end of Section 12-5. Use effective masses equal to the electron mass.
The bulk of solar radiation has a wavelength less than 1 fim. What is the minimum
energy gap that a solar cell should possess to take advantage of this? Is silicon ap-
propriate?
A small change AFm in the input voltage of the transistor circuit of Figure 12-45 produces
a change A/3 ,, in the emitter circuit power and a corresponding change APC in the power
delivered to the resistor R (
. Show that the power amplification ratio satisfies
APC 2y
2
fieIE Rc
APE 1+ ln(/£//£0 )
Evaluate this ratio for the conditions given in the example at the end of Section 12-7.
17. A positively charged arsenic donor ion in silicon attracts an electron much like a hydrogen
nucleus. However, the intervening silicon behaves like dielectric material having permittiv-
ity constant e = 12e , where e is the permittivity in vacuum. The electron also has an
effective mass in silicon of 0.2 times the electron mass. The ionization energy of this system
is the separation between the lowest donor level and the conduction band as shown in
Figure 12-33. Estimate this ionization energy by using the appropriately modified hydro-
genic energy levels. Compare your result with the accurate value given in Section 12-6.
18. Write down the set of equations analogous to Equations (12-74) for a ring of three
particles connected by springs. Substitute u n (t) = e~'"' u n (0), and solve the resulting set of
linear equations for the three possible values of co by the method of setting the
determinant of the coefficients to zero. Compare your results with the general solution
given by Equation (12-76). One of the solutions is u> = 0. To what values of the «„(0),
and what resulting motion of the particles, does this special solution correspond?
19. Using the general form of Equation (12-81), estimate the temperature dependence of the
observed they were not recognized or were thought to be due to experimental error.
When H. Kamerlingh Onnes succeeded in liquifying helium in 1908 it became
possible to observe these effects. Soon after he reached the liquification temperature of
4.2 K, Kamerlingh Onnes was able to cool this very light and transparent material
below 2.2 K, the transition temperature to the superfluid state. He must have observed
the cessation of boiling, the most apparent signal that this new state of matter has
been reached. And yet he made no mention of the effect. It was not until 25 years
later that a comment about it finally appeared in the literature. Soon, other strange
properties were discovered, some by Kamerlingh Onnes, but most by other workers.
Kamerlingh Onnes used helium as a refrigerant in order to study a variety of
substances at very low temperatures. In examining mercury in 1911 he found that its
electrical resistance disappeared around 4 K. Although he first assumed that the result
was an experimental error, he was able to find the effect in other substances and
coined the term superconductivity to describe it.
the supercurrents. All these effects are now understood to some degree. For superfluid
helium, there is an excellent phenomenological theory, known as the two-fluid model,
634
13-1 Experiments I Characteristics of Super fluids 635
4
theory of superfluid He as there is for the superconducting state. While a two-fluid
model of a superconductor was used with some success for many years, in 1956 a
microscopic theory was finally put forward by Bardeen, L. N. Cooper, and J. R.
Schrieffer. This explanation, known as the BCS theory, gave such good agreement
with experiment that it was felt that the problem of superconductivity was solved.
Liquid
3
becomes a superfluid around 2 mK, as discovered by D. D.
He also
Osheroff, R. C. Richardson, and D. M. Lee in 1972. Despite the fact that this atom is
an isotope of helium carrying no charge, the transition in this case is much more like
4
that in superconductors than in He. Because much of the theory existed before the
experimental discovery, we now have a rather complete picture of this material.
Superconducting and superfluid systems are properly treated together in the same
chapter because of their many similarities. Both involve phase transitions to rather
special ordered states that allow friction-free flow. In neither case is the order a spatial
one as in a crystal; the substance remains a fluid. (The "fluid" is a gas of electrons in
the case of a superconductor.) Both transitions are macroscopic manifestations of the
laws ofquantum mechanics, and in each case particle statistics, Fermi or Bose, plays a
Both phenomena reveal themselves at low temperatures because then the
crucial role.
thermal de Broglie wavelength becomes large enough to make quantum and statistical
effects evident.
4
The properties of liquid He that we describe in this section are unique to the
substance. The reason for this is not that the fundamental causes are absent in other
substances. It is mainly that every other material freezes at a temperature higher than
the transition point to the superfluid state. Despite the seemingly wide variety of
effects that we describe here, all are shown to be closely related.
4
In the introduction we have noted one characteristic of liquid He below the
superfluid transition — namely, the cessation of boiling. In normal liquid helium and
in other liquids, local hot spots cause bubbles of vapor to be formed. Below the
superfluid transition temperature of 2.17 K, the heat conductivity of the liquid
becomes so large that the hot spots necessary for the formation of the bubbles of
boiling cannot occur. Under the proper conditions the thermal conductivity of the
superfluid can be as large as 2000 times that of copper at room temperature. A drop
in temperature of only 1 K in going from the normal liquid to the superfluid state
leads to an increase in conductivity by a factor of several million.
As the temperature is lowered in any liquid, we expect to reach the solidification
point. This is true in all substances except helium. The condensed phases of both
helium isotopes remain liquids all the way to absolute zero. The only way to make
solid helium is to pressurize either of the two liquids. At very low temperature, solid
4
He finally forms at 25 atm, and solid 3
He around 30 atm. A diagram showing the
4
phases of He is given in Figure 13-1.
Helium atoms
interact with a potential energy approximately described by the
Lennard-Jones function. Equation (10-36). The strength of the interaction (as mea-
sured by the well-depth value e/k B =10 K) is very small compared to other
substances. Also the mass of the helium atom is so small that quantum zero-point
motion is large. If the separation between atoms is a, the lowest energy of a particle is
of order E ~ tr /ma
2
, the result of considering the lowest level of a particle in a box.
636 Superfluids and Superconductors
Phase diagram of 4 He. Lowering the Specific heat of liquid He. The transition
temperature at atmospheric pressure (along temperature is called the X-point because of
the dotted line) takes the system from gas to the shape of the curve. At low pressure 7\
normal liquid to superfluid. For temperatures occurs at 2.17 K and decreases slightly at
above the critical point, liquid and gas are higher pressure.
indistinguishable. The solid does not form, C
even at T= K, until a pressure of 25 atm is
Critical point
This energy is so large in helium compared to the potential energy that it melts the
solid at low pressure. When enough pressure is applied the potential energy grows
faster than this kinetic energy until the material is finally able to solidify.
When the normal fluid is cooled through the transition at 2.17 K, the specific heat
has a spectacular increase as shown in Figure 13-2. Because of the shape of the curve
the transition temperature to the superfluid is known as the X-pomt. The peak has
been studied in great detail and is thought to become infinitely high if the number of
atoms in the system is infinite. The transition is of second order since there is no latent
heat. At low temperatures the curve shows aT 3
dependence that is characteristic of
phonon systems. We know from Chapter 1 1 that the specific heat of solids depends on
temperature in this way. At somewhat higher temperatures, an extra exp( — A/k B T)
behavior is observed, a form characteristic of an excitation spectrum with a gap A, as
discussed in Problem 9 in Chapter 11.
As mentioned above, the superfluid has a very high thermal conductivity. There is
yet another closely related peculiarity; heat can flow in the form of a wave known as
second sound. Under normal conditions heat diffuses; it moves away from a hot spot
much like molecules spreading away from an open bottle of perfume. However, in
superfluid helium, a pulsed heater causes a temperature pulse to travel, largely
undistorted, across the container where it can be detected by a thermometer. A heater
that is cycled sinusoidally produces a sinusoidal temperature wave that travels across
the liquid.
The viscosity of a liquid is related to its resistance to flow. One of the standard
ways of measuring this quantity is with a torsion oscillator made up of a plate
suspended by a thin rod. As the plate oscillates about the rod axis it drags along any
viscous fluid; the rate of damping is easily related to the viscosity. This technique
applied to helium shows that the viscosity just below the X-point is about the same as
above, although it then apparently decreases toward zero as the temperature is
On the other hand, experimenters using very narrow channels, such as those in very
have found that above 2.17 K the fluid is unable to flow through, while at
fine filters,
any temperature below the A transition the liquid pours through as if the viscosity
were zero. This extraordinary flow property is one of the main reasons for applying
the name "superfluid" to this substance. Narrow channels that allow superfluids, but
not normal liquids, to pass through are known as superleaks. If the flow velocity
exceeds a critical velocity v
c,
viscous drag occurs.
The two kinds of viscosity experiments seem at first sight to be quite irreconcilable,
with one showing a viscosity and the other showing none at all. Nevertheless, we see in
Section 13-5 how the two-fluid model nicely explains the problem.
When helium is in a container, the vapor above the liquid coats the walls with a
thin film that is usually several atomic layers thick. This in itself is not unusual and
occurs for any enclosed liquid. However, the superfluid film shows some unique
properties. If a breaker holding some of the liquid is raised above the general liquid
level, fluid travels via the film flow over the edge of the beaker, accumulates in drops
on the bottom, and then drips back into the main body of liquid. This continues until
the beaker is empty. This creeping film has, in effect, siphoned the bulk liquid out of the
small container.
The film can be used for another experiment that shows the amazing flow
properties of the superfluid. Suppose the film coats a ring of glass. The film can be
made to flow continuously around the ring. Once the flow has started it continues
without dissipation indefinitely. Similarly, bulk fluid made to rotate by spinning a
bucket continues to flow without friction. Such friction-free flow patterns are known as
persistent currents.
Figure 13-3
Heater
Superleak
638 Super fluids and Superconductors
called the Meissner effect after the work of F. W. Meissner and R. Ochsenfeld.
If the external field is increased, superconductivity is destroyed at a critical fie Id Bc
.
Excluding the field has cost the superconductor an energy per unit volume equal to
the energy density of the excluded magnetic field 5 2 /2ju , where ju is the permeabil-
ity of the vacuum. When this energy exceeds the difference in energy between normal
and superconducting states, the system reverts back to the normal state. An amazing
~8
feature is how small this energy turns out to be, only 10 eV per atom. Consider-
ing that the various energies involving electronic interactions are all around
1 eV, including some that are not so accurately calculated, we might think it hopeless
to explain such a small effect. Fortunately, it is not.
Like helium, superconductors have a specific heat increase at the transition
temperature. However, the peak in superconductors is a finite jump as we see in
Figure 13-5. At low T the specific heat falls below the linear dependence of a
degenerate Fermi gas and is proportional to exp( — constant/ 7"). Such a dependence
is, like the situation in the superfluid, characteristic of a spectrum with a gap. If the
superconductor is placed in a magnetic field with B> B (
, the specific heat reverts to
its normal linear behavior.
Another indication of the existence of a gap in the energy spectrum is given by
absorption of radiation. A superconductor reflects radiation of frequency less than
some critical value lying in the far infrared. Radiation above that frequency is
Figure 13-4
(a)
13-2 Experimental Characteristics of Superconductors 639
of which is made up of one isotope of the same element, depends on the atomic mass
M of the isotope according to
T. ~ AT",
The seeming flaw in this argument is that the superconducting transition tempera-
ture is generally orders of magnitude lower than the Debye temperature. Nevertheless,
we see that an effective force between electrons caused by lattice vibrations acting as
intermediary is indeed basic to the occurrence of superconductivity.
One of the most exciting periods in the history of condensed matter physics began
in late 1986 and continues to the present: high-temperature superconductivity has been
discovered in certain oxides. In order to keep up with fast-breaking advances in the
subject physicists have had to read current news releases as well as physics journals. At
present, the highest critical temperature liesabove 100 K and occurs in materials
having a layered structure containing planes of copper and oxygen atoms. Such high
transition temperatures are technologically remarkable because they can be reached
easily and cheaply by using liquid nitrogen, at 77 K, rather than liquid helium at 4 K.
This opens the door to a myriad of applications. Room-temperature superconductivity
is by no means out of the question.
Example
2
(0 01 T)
= 40 J/ m3 .
28
Because the density of aluminum is 6 X 10 atoms/m 3 ,
this becomes
40 J/m3
28
= 4 < 10" 9
eV/atom
(6 X 10 atoms/m3 )(l.6 > 10- 19
j/eV)
640 Super fluids and Superconductors
Example
~4
Element rc
(K) Resistivity ( 10 Q / m)
Ag — 1.6
Cu — 1.7
Au — 2.4
Mo 0.9 5.7
Ga 1.1 17.4
Al 1.2 2.8
Sn 3.7 11.5
Ta 4.4 15.5
Pb 7.2 22.0
Nb 9.2 12.5
We have noted in Sections 13-1 and 13-2 how certain experimental data, including
the specific heats of both superfluid helium and superconductors, indicate the ex-
istence of an energy gap in the energy spectra of each of these substances. Such an
energy gap leads to the possibility of friction-free flow and persistent currents.
The explanation is basically quite simple. Suppose we consider a sphere moving
with velocity v through a fluid. The sphere feels a drag when immersed in a normal
fluid. This is equivalent, by the principle of Galilean relativity, to the typical
experimental situation in which the fluid flows with friction at velocity —v past some
stationary object, in this case the sphere. The frictional forces arise from the transfer of
energy from the moving sphere to the liquid. When there is a gap in the spectrum,
then, for sufficiently small velocities, no energy can be transferred — because there are
—
no fluid states available and we have a superfluid. For sufficiently large velocities,
enough energy can be transferred to the liquid that the gap can be surmounted. Then
frictional forces are felt by the sphere.
In superfluid flow the walls of the channel through which the helium flows, or other
obstacles, play the role of the sphere. For superconductors phonons and impurity
atoms can cause drag on the electron fluid.
To make the issue more quantitative, we use a discussion similar to one by L. D.
Landau. Denote the energy excitations of the fluid by e^ as a function of momentum
p. (In a gas of free particles we have, of course, e^ = p /2m; we consider other
2
functional forms as well.) Let the mass of the sphere be M. When the sphere causes an
excitation, its velocity changes to v' and the liquid absorbs energy e and momentum
p
13-3 Super/low and the Energy Gap 64
My = My' + p (13-la)
and
\Mv 2 = \Mv' 2 + e
p ,
(13-lb)
where v' is the final sphere velocity. Solving Equation (13-la) for v' and substituting
into Equation (13-lb), we obtain
2
P
e„ + v •
p =
H 0.
" 2M
Since M is a macroscopic mass the second term is negligible and
vp = e, (13-2)
the fluid, which is then viscous. If Equation (13-2) cannot be satisfied, the excitation
does not occur and the sphere moves through the fluid without friction.
With the cosine of the angle between v and p, frictional flow requires that
jti
i>ii= —. (13-3)
The quantity e
p /p
is a characteristic of the fluid and, since it is always positive, takes
on values extending from some minimum Suppose the minimum value is to infinity.
larger than zero. Then, if v\i is smaller than this minimum, Equation (13-3) cannot be
satisfied and no excitation of the fluid can take place. With ju set at its largest value,
namely unity, there is some smallest value of v, call it vc for which there can be an ,
vc = minimum of — . (13-4)
P
vc = minimum or(( \ ,
\ 2m I
which, of course, is zero for p = 0.That is, a normal gas of free particles is not a
superfluid; excitations can be created at any velocity.
Suppose, however, that there is a gap in the spectrum. For example, suppose
A
P
"
2m'
642 Superfluids and Superconductors
Excitation spectrum with a gap. The system Excitation spectrum for superfluid He. At
behaves as a superfluid at velocities less than small momenta the curve is linear,
the critical velocity, which is given by the corresponding to phonon excitations. The
slope of the line through the origin tangent to sound velocity is denoted by cs . At larger
the curve. momenta there is a minimum with gap A.
G, The excitations here are called rotons. The
dotted line shows the critical velocity-
construction.
2A
2
P dp P
>i\
de
p
V (13-5)
" dp
Figure 13-6 shows a hypothetical curve of e versus p. Equation (13-5) is the equation
p
of a straight line passing through the origin and yet somewhere tangent to the
excitation curve as indicated by the dotted line in the figure. The critical velocity vc is
There is no need for the lowest energy of the excitation spectrum z to occur at
A. Despite the existence of states between zero and A, the tangent construction still
gives a nonzero critical velocity. The linear region of the spectrum is typical of a
phonon spectrum with cs the sound velocity, as we have seen in our discussion of solids
in Chapter 12. Why there are phonons and no single free-particle modes is fundamen-
tal to understanding the nature of superfluidity and is discussed in Section 13-5. The
A2 + {p 2 /2mY
This form looks much like that shown in Figure 13-6. There is obviously a nonzero
critical velocity.
Example
The excitation spectrum of helium shown in Figure 13-7 has the parameter
valuesA/k B = 8.65 K and p /h = 19 nm The sound '. velocity is cs = 238
m/s. The critical velocity is then approximately
least one other kind of order can occur — in momentum space. The Bose-Einstein
condensation of an ideal gas involves a sudden collapse of a macroscopic number of
particles into the lowest single-particle momentum state.
N=I t
n (l ,
13-6)
644 Superfluids and Superconductors
and
= (,*,+« - I)"
1
. (13-7)
Obviously, we must have a > so that n is finite and positive for bosons. (Recall
that for fermions at low temperatures a is negative since it is proportional to minus
the Fermi energy.) If a is very small, and in particular if a = a/N, where a is a
number of order unity, a Taylor series expansion of the exponential shows that
A"
That is, the number of particles in the lowest allowed state of energy and momentum
can be a substantial fraction of the entire set of particles. Bosons, as we have shown
previously in Chapter 1 do not avoid one another like fermions; they actively prefer
1 ,
to be in the same state. What we are going to show is that, while for high temperatures
n is no larger than any other occupation number, when we lower the temperature
below a value Tc a becomes of order jV~ and the lowest state fills until it contains a
,
'
substantial fraction of the total set of particles. We then have had a transition to a
phase that is partially or, at T = K, completely condensed into the zero-momentum
state.
It is easy to show that n for any state other than the ground state, is very much
smaller than n . This is true even for the first excited state as shown in Problem 3 at
is
the end of the chapter. Although any individual n is small compared to « the sum ,
N= n + £'n,,
P
where the prime on the sum means that we omit the ground state. Since, in the sum,
no single state is significantly larger than the rest, it is possible to change from the sum
into an integral by using the standard technique of the density of states function. (It is
always a good approximation to change a sum into an integral if the summand does
not vary much from one term to the next. This is the case once we remove « This .
/•OO
N=n + AV
•'n
dee l/2 (e fic
+a
I)"
1
-
z = fie
so that
/2
N= n + AV(k B Tf G(a), (13-9)
where
+a - l)~\
G(a) = (™ dzz^2 {e* (13-10)
Jo
V7T
G(0) = 2.612. (13-12)
,,, fr
nQ = N-AV{k B TY"—
3/
2.612 (13-13)
ut
3/2
« = iV (13-14)
where
J !
_'.V 277^'- i
rs (13-15)
A-
fi \ ^^(2.612) kBm \ 2.612
Occupation number n for the lowest energy Plot of the Lagrange multiplier a for an ideal
state of the ideal Bose gas. At T= K, Bose gas. This quantity is of order \/N below
n = N; macroscopic occupation continues up T {
but takes on values of order unity above
to a critical temperature Tc
. Above T, n is that temperature. The discontinuous nature of
negligible compared to the total number of its behavior in the limit of an infinite system
particles and is shown as zero in the figure. is characteristic of a phase transition.
3/2.
N= AV(k B T) ^G(a)
Here, we have used the fact that n Q is very small (of order 1) and have dropped it
from Equation (13-9). The reader is asked to show in Problem 5 at the end of the
chapter that the result for very large a is just the classical high-temperature, or ideal
gas, case (equivalent to that of Equation (11-30)).
Figure 13-9 shows how a behaves over the entire temperature range. The im-
portant point is that it has a discontinuity (in first derivative) at Tc
. This is a sign that
a phase transition occurs at this temperature. As the temperature is lowered through
T c , the transition is one in which particles begin cascading into the zero-momentum
state. This is the Bose-Einstein condensation. It is an ordering in momentum space
and not in real space.
Other thermodynamic quantities of this ideal Bose gas system have discontinuities
at T(
. The specific heat C/N is shown in Figure 13-10. At the transition C has a cusp
of finite height. The temperature dependence of this quantity below Tc
is T ' .
at the transition and is proportional to T 3/2 below it; the specific heat of helium has
an infinite peak at 7\ and is proportional to T s
at low temperature. The ideal Bose
transition is of first order — since it has a latent heat — rather than second order as in
helium.
More importantly, the ideal Bose gas does not exhibit superfluidity. The spectrum
of excitations depends quadratically on momentum corresponding to free particles. As
we have seen in Section 13-3, this gives a zero critical velocity.
The atoms in helium are constantly interacting with one another and certainly
cannot be considered an ideal Bose gas. It is impossible that n could ever be equal to
/ 3-5 The Two-Fluid Model of Superlluid Helium 647
Figure 13-10
at T c
but has a discontinuous derivative. At
low T, C is proportional to T 3/2 . It
high temperature.
repulsive core of the potential energy curve, we know that the wave function for liquid
helium cannot be a constant but must vanish whenever any two particles approach
too closely. Nevertheless, it is possible to have a partial Bose condensation into the
zero-momentum state in liquid helium. Theoretical calculations predict that n /N «
0.1 at absolute zero. Experiments to detect n are very difficult and have often been
controversial; however, recent results tend to confirm the theory.
Although the ideal Bose gas is not a good model for liquid helium, the Bose nature
of the helium atom and the presence of a condensation are thought to be responsible
for the characteristic behavior of the superfluid. We discuss this point more fully in
Section 13-5.
Example
For the atomic mass and density of helium, Equation (13-15) gives the Bose
condensation temperature as
2w(l.05 X 10~ 34 J •
s)" 2.2 x 107m 3 \ 2/3
While liquid helium cannot be described as an ideal Bose gas, it is commonly believed
that the Bose character of the helium atoms is fundamental to the nature of the
superfluid state. Although there is as yet no truly comprehensive microscopic theory of
helium, numerous theoretical calculations have indicated how important it is to
include Bose statistics in the description. We describe briefly and qualitatively some of
the approaches that have been used. Unfortunately, most of the details of these
theories are beyond the level of this text; however, there is a less fundamental
648 Super fluids and Superconductors
to those appearing in the small-momentum region of the plot in Figure 13-7. This
behavior is what allows a nonzero critical velocity so that superfluidity can occur
according to the Landau argument of Section 13-3. Weak interactions and the
existence of a large condensate are essential to the Bogoliubov approach so that this
theory, while very suggestive, is not directly applicable to helium either.
It is perhaps not too hard to understand qualitatively why the single-particle
<x< itations, which would destroy superfluidity, are no longer present in the interacting
Base gas. Bosons prefer being in the same state with one another, so that if one atom is
pushed on by an external force, all the particles within a de Broglie wavelength A
(which is large at low temperature) want to move in the same way. The collective
motion of a sound wave allows this while the single-particle motions are frozen out by
this tendency.
R. P. Feynman was He showed on the basis of mathemati-
able to go even further.
cal as well as physical arguments that helium should have a phonon spectrum and
moreover that the roton spectrum, the region of the minimum in Figure 13-7, is just a
natural continuation of the phonon part. The excitations in the phonon region are
longitudinal compressional waves, but the rotons are pictured as mini-vortices. The
wavelength X = h/p corresponding to a roton is small, about 0.3 nm, which is
approximately the average interparticle distance. Thus, a roton very likely involves a
small number of particles. One model has them moving in a way somewhat analogous
to a microscopicsmoke ring. Feynman's calculations of the spectrum are in good
agreement with experiment and so this aspect of helium is thought to be well
understood.
Superfluid helium is condensed into an ordered state in momentum space although
we have shown that it cannot be a state with nil particles in the lowest single-particle
level even at 7=0K. However, at absolute zero the entire system is certainly in its
superfluid is nonviscous and carries no heat. The percentage of superfluid present goes
from 100% at T = K to 0% at and above Tx .
The two kinds of viscosity experiments mentioned in Section 13-1 are now easily-
understood. The torsion oscillator experiment actually measures not simply viscosity
but the product of density and viscosity. As the plates oscillate in the fluid they feel no
friction from the superfluid component but the normal fluid has density p„, which
/ 3-5 The Two-Fluid Model of Superlluid Helium 649
Figure 13-11
4
Superfluid density p of liquid He as a
%
function of temperature. At T = K, p is (
exerts a drag and is carried along as the plates oscillate. This drag diminishes as the
temperature is lowered because the normal density decreases; the result is an apparent
decrease in fluid viscosity. By a slight modification of the torsion oscillator experiment,
the Russian physicist E. L. Andronikashvili was able in 1946 to measure directly the
normal density p n The superfluid density is
. given by p = p — i p„, where p is the total
and can be plotted as shown in Figure 13-11.
density,
Two misconceptions must be avoided here. First, despite the similarity between the
curve for ps in Figure 13-11 and that for n in Figure 13-8, the two quantities are
quite different. Recall that at T= K perhaps only 10% of the particles in real
helium are condensed into the zero-momentum state, and not 100% as in the ideal gas
of Figure 13-8; however, the entire helium system is superfluid then, so that p = p.
s
Second, it is incorrect to think of the superfluid and the normal fluid as made up of
two different sets of atoms. Because a phonon is a collective motion, it involves many
atoms throughout the liquid, and these same atoms can also be involved in the
superfluid component.
The second kind of viscosity experiment discussed in Section 13-1 involves flow
through a small channel known as a superleak. The superleak is so small that the
normal fluid is almost unable to move through the channel; however, the superfluid
flows readily. Since almost everything that gets through is superfluid, this measure-
ment of viscosity gives zero. Normal fluid carries the heat and is largely left behind.
This fluid is now and shows an increase in temperature if the
deficient in superfluid
entire system is not maintained at a given value by the external refrigerator. Because
of the removal of the excess heat, some of the normal fluid is quickly converted to
superfluid, which then flows through the superleak. The observer gets the impression
that none of the liquid has viscosity.
The thermomechanical effect also involves having a superleak that allows the
superfluid through but not the normal fluid. When the heater in the chamber on the
right side of Figure 13-3 is turned on, the rise in temperature creates normal fluid at
the expense of the superfluid on that side. There is then a mismatch of superfluid
concentrations on the two sides of the apparatus. This is as though a gas is on the left
at a certain pressure and a partial vacuum of the same gas is on the right. Superfluid
flows through the superleak from left to right. There is also a mismatch in concentra-
tions of normal fluids on the two sides, but since normal fluid cannot readily flow
650 Superfluids and Superconductors
As the electron passes through the lattice it attracts the neighboring positive ions
toward it. Another electron nearby sees the positive grouping and is attracted to it.
The resulting attraction can even overcome the bare electron -electron Coulomb
repulsion so that the net interaction is attractive.
There is an equivalent alternative view of this interaction. When an electron passes
by a positive ion core it interacts with it via the Coulomb interaction and can set the
13-6 Cooper Pairs and the BCS Theory 651
Figure 13-12
q and — respectively.
1,
p +q = -I
I =p -
q
Electron 1 Electron 2
ion vibrating about its site. This in turn sets up lattice waves so that the electron can
be said to have emitted a phonon. The vibrating ions can affect the motion of a second
electron; the second electron absorbs the phonon. A simple diagram representing this
process is drawn in Figure 13-12. As is shown in Chapter 16, analogous diagrams can
be drawn for every force that occurs in nature, especially the fundamental ones like
the Coulomb force. The force that results from phonon exchange is attractive and
allows the formation of bound pairs of electrons.
In order to understand the various terms that occur in the final results we look at
the effective interaction between electrons a bit more closely. Consider the wave
function corresponding to a pair of free particles, the first having momentum p, and
the second momentum p2 . Call it (f». Jr,,^), where r, and r 2 are the coordinates of
the two particles. The total momentum of the system is P = p, + p>, which corre-
sponds moving with a center-of-mass
to the pair velocity through the crystal while
they are orbiting around each other (if indeed we find that they are bound together).
For the moment we can just consider the situation in which the center of mass is at
rest, that is, P = 0, so that p 2 = — p,. The resulting pair wave function can be
represented by the simpler notation
(Of course, Cooper pairs can have P =£ 0; indeed, as we see below the pairs are
responsible for carrying the supercurrent, so such moving states are essential.)
If the effective attractive interaction between the electrons is written as Vn —
F(r,,r 2 ), an important quantity is given by
eP = fjM &P dx \
(13-16)
It turns out that it is this quantity that is actually represented by the diagram in
Figure 13-12. A pair of electrons in momentum state [p] = (p, — p) exchanges a
phonon that carries momentum q. The momentum of the one electron goes from the
652 Super fluids and Superconductors
value p to t= p — q, while the other electron that absorbs the phonon goes from — p
to — p + q = —{. The final pair state is then [f\ = (t, — t) as shown. (Because of V(p ,
the simple momentum state 4> is no longer an eigenfunction. The true eigenfunction is
p
a sum of all the different <£ 's combined in a manner representing a localized bound
state.)
The magnitude of Vf depends on a variety of things. For example, it certainly
depends on the strength of the interaction an electron has with an ion that must be
wiggled to get a phonon excited. Furthermore, we know that the two electrons that are
interacting must be near the Fermi surface. Otherwise, their interaction with one
another does no good. That is, when one electron interacts with another it feels a force
and normally might then change its momentum. However, if it is deep within the
Fermi sphere a small kick has no effect because the electron cannot go anywhere in
momentum space because of the Pauli principle; all the nearby momentum states are
already occupied by other electrons. However, the maximum energy that can be
transferred from one electron pair to another by the phonon exchange interaction is
where 0^, is the Debye temperature. Phonon energies are quite small compared to the
Fermi energy; S D is a few hundred kelvins, while TF is of order 10 5 K. Thus, only
electrons very near the top of the Fermi sphere can interact successfully with one
another.
Therefore, we might write
v = I~ F >
e, and e, in the interval [e F , e F + e^]
(13.17)
I 0, otherwise,
where F is a positive constant that represents, among other things, how strongly an
electron is coupled to the ions. The minus sign signifies an attractive potential. This
interaction allows electron pairs to interchange energy with one another only in a very
small band of energies around the Fermi surface.
Cooper put all these ideas together, solved the Schrodinger equation, and found
that therewas a bound state for arbitrarily small F. If E is the pair energy then, when
there is no interaction, we expect E = 2e f since both electrons are at the Fermi
,
surface. A bound state has an E less than this by a bit. The binding energy- can then
be defined as
E = c
2e F - E
2 R F
E = (
2e D e- / » ,
(13-18)
where R is the density of particles per unit energy (Equation (12-15)) evaluated at
the Fermi energy e = e F This says that the binding energy can be much smaller than
.
temperature. The exponential factor easily accounts for that size reduction.
/ 3-6 Cooper Pairs and the BCS Theory 653
The BCS theory goes far beyond the consideration of just Cooper's treatment of a
pair of electrons. It takes into account the cooperative nature of the pairing. At T>
K single electrons are thermally excited into momentum states above the Fermi
surface. Because of the Pauli principle, these states are then not available to be
involved in the formation of pair states. As the temperature rises, there is a cooper-
ative blocking of the formation of the bound state causing the binding energy to be
temperature dependent. At some temperature T c,
the binding energy E. goes to zero.
This is the transition temperature for the superconducting state.
In the superconducting system the pairs are in a highly coherent state; the
formation of a few encourages the formation of others in a cooperative way. This
tendency is quite analogous to what happens in a Bose condensation. However, the
average radius of the Cooper pair state is quite huge, on the order of 1 /im. This
means that the centers of approximately a million other Cooper pairs sit inside the
volume encompassed by one of them. While an even number of bound fermions often
behaves like a boson, as in the case of the helium atom, the severe entanglement of the
Cooper pairs implies that we cannot very well think of the superconducting transition
simply as a Bose condensation of Cooper Such a picture would be valid only if
pairs.
the Cooper pair radius were small compared to the average interpair separation.
The BCS theory predicts that the ground state involves many electron pairs
occupying states in the shell of states about the Fermi surface. A single-particle
excitation has an energy (measured relative to this ground state) given by
ep = n+A .
which bears a strong resemblance to Equation (13-18) for the Cooper pair binding
energy. When a pair is broken, two electrons are excited so that the smallest energy
involved in an excitation is the energy gap,
1/R " F
2k = 4e D e- . (13-19)
There are no states between the ground state and this energy. However, there is a
continuum of states above this energy. In this way superconductors differ from
superfluids, which have the phonon modes all the way to zero energy.
For T > K, the gap becomes temperature dependent, as we have mentioned
above, and diminishes until it finally reaches zero at T. This transition temperature is
given by
F
kB T = lA4e D e- l/R
c
«
.
(13-20)
In Section 13-2 we have pointed out that the isotope effect hints at T being
proportional to the Debye temperature. From Equation (13-20) we see that this is
We have now briefly seen a few of the elements of the theory of superconductivity.
In Section 13-7 we take a closer look at the theoretical interpretation of the
experimental data.
Example
1 1
~
ln(7yi.l40 o ) " ~ ln(l.2K/1.14 X 420 K)
~
A simple form of the BCS theory, the weak-coupling case, gives the results
quoted in this section. This case holds when R F is less than unity, as occurs for
aluminum. The combination of Equations (13-19) and (13-20) gives us a simple
relation between two experimentally measurable quantities, T and the gap 2 A.
We find
2A 4
3.51.
kB T
c
1.14
Experimentally, this ratio is found to range from 2.8 to 4.6. For aluminum
2A/k B is 4.2 K so that the ratio is 3.4.
In Section 13-3 we have seen how the existence of an energy gap explains superflow in
liquid helium. The condensation of Cooper pairs also provides a spectrum with an
energy gap so that we can discuss nonresistive flow in superconductors in a very
analogous way. (We point out, however, that this argument is not complete and the
question of nonresistive flow and persistent currents is more complicated than we have
indicated. However, we are unable to go into the details here.)
The energy gap, equal to 2 A, also gives rise to the experimentally observed specific
heat dependence, which has roughly the form exp( — A/k B T) below T The theoreti- (
.
cal values of A give good agreement with the experimental results from specific heat
and radiation absorption experiments.
Because a magnetic field above a critical value B(
can destroy superconductivity,
the energy density of the critical field, given by the value B 2 /2jx
c , must represent the
difference W Q in energy densities between normal and superconducting states. It is
easy to calculate the value of this in terms of the energy gap. We make use of methods
similar to those used in Chapter 12. Since the pairing involves electrons in a thin shell
of states at the Fermi surface, the density of pairs is one-half the number R of
particles per unit volume per unit energy interval times the width of the energy shell,
given by the gap parameter A, or
/? A
pair density =
/ 3-7 Theoretical Interpretation of Superconducting Experiments 655
In the superconducting state, each of the pairs is lowered in energy by — A, so that the
total reduction in energy density is
W =-^=-2R eD
2 e-VW ( 13 . 2 1)
By setting this equal to the magnetic energy density, we can evaluateB as we do int
an example at the end of this section. The results are quite good. The amazing thing is
8
that Wq is so small, ~ 10 eV, as calculated in Section 13-2. There are many
energies in normal metal physics that are known to much less accuracy than this. It is
only that those effects are the same in both normal and superconducting states that
allows us to ignore them and focus on this small but obviously vital superconducting
effect.
We next look into some of the physics of the Meissner effect. Our discussion
involves the use of the differential version of Maxwell's equations in electromagnetic
theory.
The Meissner effect is the expulsion of the magnetic field from the interior of the
superconductor as shown in Figure 13-4. Surface currents are set up that establish a
magnetic field opposing the external field. The field deep inside is canceled out
exactly. It turns out that it is not sufficient just to assume that the system is a perfect
conductor. A perfect conductor has the property that once a field is established the
internal magnetic field is unchanging in time. Thus, it is trapped at the value it had
before the material went superconducting; it does not necessarily vanish as the
Meissner effect requires.
In Section 12-3 we have treated an equation for the particle velocity of a charge in
a resistive medium. In the case of no resistive forces , Equation (12-9) combined with the
definition of current density, J =~pe\, gives
dj pe*
-T- = E.
dt m
If J is
X, the induced current density in the superconducting sample, this equation
must be modified because the charge carriers are Cooper pairs; e is replaced by 2e
and m by 2m to give
17 = E. (13-22)
dt m
Since we consider an external field to be present, there must also be a current density
J, that acts as a source for that field. The total field is given in terms of both sets of
currents according to the Maxwell equation
V XB = /i
(J, + J,). (13-23)
We are interested in the value of the field at the position of the superconductor; we
assume that the external currents J, are zero there and need not appear further in our
equations.
.
V XE= -— dt
The substitution of E from Equation (13-22) into the last equation gives the result,
d m
i7(
(
B+
^ vx *)= \
' (13 - 24)
This relation is what the Maxwell equations require. Simply having a perfect
conductor obviously requires only that the quantity in parentheses be constant in time.
We see in Problem 10 at the end of the chapter that this also causes the field in the
m
B+ ^~F V x l= °- < 13 " 25 )
2pe~
We accept this equation without proof, and next we see how the Meissner effect
follows from it. The BCS theory does indeed give Equation (13-25). The term
containing J can be reexpressed
t
in Equation (13-25) by use of Equation (13-23).
(Remember that J, is zero at the positions we are considering.) Then we use the
mathematical identity
V X (V X B) = V(V • B) - V 2
B.
B = X2 V 2
B, (13-26)
where
m
X2 =- -. (13-27)
2MoP*
£(*)=A 2
—V- ax
The solution of this equation has B = constant on the boundary at x = 0, while for
x > 0, inside the material, the solution is
This result says that the field decays exponentially as one goes into the superconductor
/ 3-7 Theoretical Interpretation of Superconducting Experiments 657
Figure 13-13
^x
and vanishes deep in the interior. So the Meissner effect is not perfect at the surface;
the field actually extends approximately the penetration depth A into the material. Since
A can be as large as 0.1 jum very thin films having a field extending all the way
through can be made. Since these systems do not expend much energy expelling the
field, they have very high critical fields.
We show via Problem 1 1 at the end of the chapter that the supercurrent J also (
resides on the surface within a penetration depth. For later use we note that this result,
together with Equation (13-22), implies that the electric field E always vanishes in the
interior of a superconductor.
The unusual properties of superconductors make them useful in many different
applications.The recent discovery of oxides that superconduct at high temperature
may mean the practical possibility of superconducting electrical transmission lines,
magnetically levitated trains, and a host of other uses. In Section 13-8 we see further
properties and some of the devices that can be made based on another effect in
superconductors.
Example
Bf R tf
2(i 2
where the density of particles per unit energy at the Fermi surface (e = eF ) is
2e F
658 Superfluids and Superconductors
Thus, we obtain
/3, oP
B c
'
V 2£ F
We recall from the example after Section 13-6 that, for aluminum, \/k B =
2.1 K. Also, since p = 6 X 10
28
atoms/m and 3
/k B =
eF 1.4 X 10
5
K, we have
"
1/2
23
'
1.5(477 X 10
7
N/A 2
)(6 X 10
28
m~ 3 )
B = (1.38 X 10" J/K)(2.1 K)
X 10" 23 J/K)(l.4 X
c
5
(1.38 10 K)
= 7 x 10
3
T,
Example
1/2
m 9.1 X 10~ 31 kg
, /
2 2
28
V 2/i P' 2(477 X 10~ 7 N/A 2
)(6 X 10 m~ 3 )(l.6 X 10~ 19 G)
= 1.1 X 10 8
m = 1 1 nm.
made to differ by the application of a voltage, a net current arises. The existence of
this current not only reveals the existence of quantum tunneling but dramatically
illustrates theremarkably coherent character of the superconducting state.
Back Chapter 5 we have learned that an energy eigenfunction for a single
in
particle corresponding to energy E can be written, for one-dimensional motion, as
/ 3-8 The Josephson Ellecl 659
Figure 13-14
-x^- I
-5>- x
where to = E/h and 4>(x) is the time-independent part of the wave function. Because
of the highly ordered nature of the many-particle superconducting wave function, we
are able to define a similar quantity that is a function of a single variable (in three
dimensions it is a function of vector position r). This quantity behaves like a
single-particle wave function; but it actually describes the behavior of every particle
— actually every Cooper pair — simultaneously. It is called an order parameter. Such a
function is possible only because the condensed pairs of electrons are so coherent; we
cannot push on one without affecting all of them.
2
In this interpretation |i//(*)| , which in a normal wave function is the probability
density, here becomes the density of superconducting pairs
!*( ft
•
a real quantity while \p(x) itself can be complex. Quite generally we can write
where <p Q is the phase of the time-independent part of the order parameter. The full
j= — 2im
h (
\
**-
3*
ox
d**
—*
ox
\
. (13-30)
electron. The two factors of 2 cancel so that the electric current density is given by
J= ~ej.
Next, we show that the current is dependent on the spatial variation of the phase of
the order parameter. We then show how that phase can be controlled by applying a
voltage in order to generate a tunneling current. Suppose that ps in Equation (13-29)
is independent of x. Then combining Equations (13-29) and (13-30) gives
7=—
m
y-.
ehp t
dd>
ox
(13-31)
If we put a voltage V across the junction, the resulting change in electronic energy
then affects the time-dependent phase. We take the zero of potential at the center of
the gap, x = 0, in Figure 13-14. Then the left side of the junction is described by
where now
The potential on the left side is — V/2, which is multiplied by ~2e, the pair charge,
to get the associated potential energy given in Equation (13-33). The order parameter
on the right side of the gap is described by an equation analogous to Equation
(13-32), with 1 replaced by 2 and with the phase on the right side given by
on both sides of the junction. We have also assumed that the zero-potential frequency
to is the same on both sides of the junction. What the potential V has done is to alter
the frequency in a different way on the two sides of the gap so that the phase has a
gradient; a current is then established according to Equation (13-31). Since the phase
difference is time dependent, the current is AC. The only reason that we can maintain
such a voltage is because the oxide layer is insulating.
Is any current that might want prevented from doing so by the insulating
to flow
layer? For thick layers gap is thin enough quantum tunneling through
it is, but if the
it allows the current to proceed. We assume that the oxide layer represents a constant
¥,(*,*) = P y e-
2 !
The decay constant a is determined by the size of the potential barrier as shown in
Chapter 5. Similarly, pairs originating on the right side of the gap are described by
/2 + a(x - a)
%(*, = pl e-'^e ,
x < a. (13-36)
13-8 The Josephs on Effect 661
Figure 13-15
>X
This function decays as we move from x = a to smaller x values. The values of ^P,
We expect, from our treatment of the binding of the diatomic molecule in Chapter
10, that an electron pair capable of being in each of two regions has an overall wave
function in the intermediate region given by either of the two functions
We can work with either of these functions; both give the same result for the current.
We use ty + .
We find the current in the gap by substituting ^ + into Equation (13-30). After a
small amount of algebra we reach the result
/here
2eV
(13-39)
and
ap/h
7o
=
The details of the calculation are left to Problem 12 at the end of the chapter.
There are several things to note about the Josephson current density of Equation
(13-38). If the potential V vanishes, there is no current. (We have assumed that <p is
the same on both sides of the gap. If it is not — and we have no control over this phase
— there is a DC current dependent on the difference in the two <J>
values.) If V is not
zero an AC current results. The frequency of this current is
2eV
V = (13-40)
J ~h~'
For V= 1 juV, this frequency is 484 MHz. Microwave radiation originating from this
oscillatory current has been detected. This phenomenon is known as the AC Josephson
effect.
662 Superfluids and Superconductors
Figure 13-16
Behavior of a ring of superconducting material when the ring is placed in a magnetic field and
the temperature is lowered below Tc
. If the field is then turned off, magnetic flux is trapped.
The path of integration for Faraday's law is shown in (a). The trapped flux is illustrated in (b).
(a) (b)
A very interesting device can be made from the Josephson junction. Before
considering it, we need to look at the phenomenon of flux quantization. Suppose we
form a ring of superconducting material, place it in a magnetic field, and then reduce
the temperature below Tr While the Meissner effect excludes the field from the
superconducting material itself, the field is still able to thread through the hole in the
ring. If the external field is turned off, the flux through the hole is unable to decrease.
The reason is Faraday's law. The rate of change of flux $ through an area is the
integral of the electric field E around the perimeter of the area:
^E •
dt- (13-41)
~Jt
If, as illustrated in Figure 13-16a, we take the circuit of integration around the ring
within the material, then, since Ewe must have
is zero inside any superconductor,
d<b/dt = 0. That is, the flux is trapped as Figure 13-166 indicates. It is maintained
by the flow of surface currents in a thin layer of depth A in the surface of the
superconductor. An important feature here, that we have not derived, is that the
amount of flux threading through the ring is quantized. It is known that
2irnh
<D =
where n is an integer and q is the charge on the carrier of the supercurrent. Since the
charge carriers are Cooper pairs, the value of q is 2<\ The result, that the minimum
13-8 The Josephson Effect 663
Figure 13-17
Josephson junction
7rk
(13-42)
e
and not 27rh/e, has been verified experimentally. London originally predicted this
effect before the BCS theory and assumed q = e. The result q = 2e is thus a strong
element in the verification of the BCS theory.
Quantized lines of flux can also be trapped within the interiors of certain forms of
superconductors. The Meissner effect is not really violated here because the "hole"
through which the field is threaded is a line of normal material. Such superconductors
are designated as type-II. The critical fields in these materials are much higher than in
the usual type-I form of superconductor, which does not allow such penetration by
magnetic fields without destruction of the superconducting state.
in Figure 13-17. The resulting object is the superconducting quantum interference device or
SQUID. Analysis shows that the current density in the ring, which necessarily involves
tunneling through the junction, has the form of Equation (13-38). Now, however, the
phase difference 8 depends on the magnetic field and is given by
277$
«IV
where O is the total magnetic flux through the loop. This total flux is made up of the
flux due to any external field as well as that resulting from any surface currents in the
ring. We do not enter into the details of the theory or the use of this device, but only
mention that it has become a very useful tool in condensed-matter physics as well as in
other areas. It can provide an extraordinarily sensitive measurement of magnetic
fields. A flux may be measured to within 10~ $
4
which is equivalent to a sensitivity
,
of order of 10" 13
T!
As we have seen, Josephson junctions are illustrations of fundamental principles in
solid-state physics and quantum mechanics, and at the same time are extremely
practical devices.
664 Superfluids and Superconductors
Example
h V 1
*o
= = 2.068 X 10" 15 Wb.
2e Vj 4.836 X 10
14
Hz/V
~6
For a loop of radius — 1 mm and area 10 m 2
this corresponds to a magnetic
field of magnitude
2.068 X 10~ 15
— Wb -9
B = 5
= 2.068 X 10 T.
10" 6
m 2
4
Other substances, besides He, have the potential for superfluidity. As we have
already mentioned, most boson systems solidify before they have a chance to become
4
superfluid. The only other commonly available material, which, like He, does not
solidify even at T= K, is
3
He. Although
3
He is a fermion system it is found to have
a superfluid transition, but at a temperature of 2.7 mK, almost three orders of
4
magnitude lower in temperature than the transition in He. There is also an
"artificial" substance, atomic hydrogen, which is now being studied in hope of finding
a Bose condensation. We do not intend to describe either of these substances at length
but instead present a brief qualitative picture of each.
3
In 1972 Osheroff, Richardson, and Lee were running an experiment on solid He
and found two "glitches" in the melting curve. These were ultimately attributed to the
transitions to two different superfluid states in the liquid. After 70 years, superfluidity
4
was no longer restricted to just He.
3
The He system is a set of fermions as are the electrons in a metal. The onset of
3
superfluidity in this system involves the pairing of He atoms analogous to the
superconducting phase change in metals. There is little relation to the A transition in
4
He. However, there are some considerable differences from the superconducting state.
Obviously, 3 He atoms do not carry any charge and so electrical supercurrents do not
exist; instead, the viscosity of the liquid is considerably reduced. In superconductors
the electron pairs are in an = singlet state. However, in 3 He, the pair state has a
c"
new vector into the problem makes this state quite complex. The ( vectors of the 3 He
atoms can align so that the liquid develops a directional quality known as texture.
More than one kind of pairing can take place so that there are two superfluid
states, one as described above and the other isotropic. In an external magnetic field a
atoms of hydrogen, approaching each other with antiparallel spins, feel a very strong
attraction.
On the other hand, it turns out that two hydrogen atoms approaching one another
with parallel spins (the triplet state) feel at most a very weak van der Waals attraction
at a distance of several Bohr radii (weaker even than the He-He attraction) and a
strong repulsion at short distances. A gas of hydrogen atoms all with parallel electron
spins is called spin-polarized hydrogen or H[ . The interatomic force is so weakly binding
that, like helium, no solid forms, even at T= K. Even more astonishing is the
prediction that the liquid state does not condense at any temperature; the material is
expected to remain a gas all the way to absolute zero at low pressures.
Furthermore, the hydrogen atom, because it is a combination of two fermions, is a
boson. In principle then, it is possible to cool HI until it undergoes a Bose
condensation. Because this system is a dilute gas, its theoretical description is a much
simpler problem than that for the dense He liquid. Much of this theory already
exists.
The experimental problem with atomic hydrogen is stabilizing all the atoms into
the parallel-spin state. A strong magnetic field is used to align the electronic spins.
(The electron spin has its lowest state in the direction opposite to the magnetic field;
this is the reason for the downward pointing arrow in H J, .) Unfortunately, there are
several interactions in the gas and on the container walls that can flip spins leading to
the destruction of H J,
through the formation of H2 . For this reason the highest density
24
achieved to date is only about 5 X 10 atoms/m 3 which , is two orders of magnitude
too low to have the Bose condensation at the convenient temperature of 1 K. For
temperatures much lower than this the gas becomes adsorbed onto the container walls
where it rapidly recombines. To overcome these problems experimentalists are devel-
oping magnetic and away from walls
laser "traps" that confine the particles to regions
and simultaneously provide unusual cooling procedures. Considerable interest has
been shown in this system and there is a continuing effort to reach the conditions
necessary to observe the Bose condensation and the presumed accompanying transi-
tion to a new superfluid state.
Problems
( p - PoY
rotun
= £ +
4
with &/k H = 8.65 K, p /h = 19 nm" ', and ju. = 0.16m m = ( the He atomic mass). Use
this form to find the numerical correction to the approximation vc ~ &/p = 59.8 m/s for
the critical velocity calculated in the example at the end of Section 13-3.
=A 1
-
*P
A, / I Ao
3. In considering the Bose condensation of an ideal gas, we argue that n is of order N for
T< T c , while n, , n2 , . . . are all much smaller so that we can separate off n and treat the
.
sum of the remaining terms as an integral. Using the expression in Equation (12-6) for the
momentum and assuming the form a = a/N, show that the occupation numbers n, , n2, . .
for the very low excited states are all of order N 2/i . This is indeed much less than N
when N is very large. How many terms having this size are there and of what order is
4. Compute the total energy and heat capacity for the ideal Bose gas for temperatures less
5. (a) When a is large the exponential of Equation (13-10) is much greater than unity.
Show that Equation (13-11) follows. In this case n can be neglected compared to
the second term in Equation (13-9). Show that the result for a is that of the classical
ideal gas of Equation (11-30).
6. An early form for the excitation spectrum of rotons, due to Landau, was
/'
£
roton = A +
2p.
Compute the heat capacity implied by this spectrum when kB T «c A. Note that rotons,
like phonons, have a Bose distribution n = (e^ 1 ? — 1)"'.
p
7. The radius of a Cooper pair state is about 1 \im. By using the data for aluminum given in
the text, calculate how many other Cooper pairs have their centers within the volume
occupied by one pair.
10~ 4 T. The mass density of tungsten is 19.3 g/cm3 and its Debye temperature is 310 K.
(a) Evaluate the energy gap 2 A. (b) Evaluate the energy density of the superconducting
state by two different methods: (i) compute B^/2fi and (ii) evaluate W in Equation
(13-21). To carry out (ii) you need to estimate eF by an appropriate formula from
Chapter 11.
9. Evaluate the penetration depth A of tungsten by using the data given in Problem 8.
10. The London equation of BCS theory, Equation (13-25), leads to the Meissner effect,
Equation (13-26). Show that the simpler assumption of perfect conductivity, as embodied
in Equation (13-24), leads instead to the result
B = A'v'B,
where B = dH/dt. Prove that this implies that the field value becomes fixed at the value
it had when the material became a perfect conductor rather than at zero.
11. By using the relations developed in Section 13-7, prove that the current in a superconduc-
tor resides within a penetration depth A of the surface. (You also need the equation of
continuity V '
J,
= dp /dt,
f
the right side of which vanishes in a steady-state condition.)
12. Carry out the steps needed to derive the Josephson relation, Equation (13-38).
13. The flux through a superconducting ring is <I> , the basic unit of quantized flux. To what
average magnetic field does this correspond, if the ring has a diameter of 2 mm?
14. When a time-dependent potential is placed across a Josephson junction, the phase eV t/h
is replaced by ejV(t) dt/h. Suppose that V(t) = V + fcos^yi, where v «: V . Use the
Josephson relation, Equation (13-38), and the approximation sin(x + 8x) ~ sin x +
8x cos x (for Sx «: x) to show the DC Josephson effect. That is, show that J has a
nonzero time average if the impressed frequency is y = 2eV /h.
24
15. Spin-polarized hydrogen has been concentrated to a density of 5 X 10 m~ 3 What . is the
Bose condensation temperature for this density if you assume this system is an ideal gas?
FOURTEEN
PROPERTIES
AND
MODELS
OF
THE
NUCLEUS
emission of two new forms of radiation, and he gave the names alpha and beta to the
two types of emitted particle. His studies of a radiation identified the a particle as a
doubly charged helium ion. These emissions were used as beam particles to probe the
atom before the coming of the accelerator. The decisive experiments on the scattering
of a particles by atoms began in 1909 under Rutherford's direction. The results
established the existence of the nucleus and supported the nuclear model of the atom.
Subsequent experiments in Rutherford's laboratory found that protons were pro-
duced when a particles collided with the nitrogen atoms in a gas target. These
observations of the splitting of the nucleus came in 1919 and gave the first indications
of nuclear substructure. Rutherford drew upon this evidence to propose the existence
of the neutron, a neutral particle supposed to occur along with the proton as a second
fundamental constituent of the nucleus. He and his associate J. Chadwick undertook
an experimental search for the neutron in order to verify his bold prediction of a new
type of nuclear particle. Rutherford's laboratory was dedicated to these explorations
of the nucleus at a time when other investigators were focusing their attention in
another direction toward the understanding of the quantum theory.
Rutherford's foresight was rewarded when the neutron was finally discovered by
Chadwick in 1932. Several other important discoveries also came to light in the same
"miraculous year." The chemist H. C. Urey identified the deuterium atom as a heavy
form of hydrogen. The deuterium nucleus, the deuteron, provided the simplest
667
668 Properties and Models of the Nucleus
proton-neutron system for the study of the force between nuclear particles. Accel-
erated beams of protons were employed for the first time to cause the disintegration of
the nucleus. Accelerators made it possible to probe the structure of the nucleus at
controlled energies higher than any obtainable from radioactive sources of a particles.
Thus, in a single year, the second constituent of the nucleus was detected, the basic
two-body nuclear system was found, and the examination of the nucleus at increasing
energy was begun.
Inquiry into the experimental and theoretical problems of nuclear physics quick-
ened after 1932. The pioneering Rutherford passed his inspiration along to the next
generation whose leaders included Bohr, Heisenberg, and especially Fermi. These
proponents of the quantum theory of the atom advocated the extension of quantum
mechanics to the theoretical treatment of the nucleus. It was realized that the nuclear
particles were governed by an unknown interaction, and it was believed that the
properties of the unknown force could be deduced from experiments on nuclear
scattering and nuclear binding. The lack of an underlying theory drove the investiga-
tors to adopt a phenomenological approach where models could be used to interpret
the accumulation of experimental facts. Several of these models have enjoyed a degree
of success within their limiteddomains of validity.
Nuclear models have developed on two separate levels. The basic force between
nuclear particles and the application of the force to complex nuclear structure fall into
separate areas of speculation. Progress has been made in these areas through the use of
different kinds of models. A truly fundamental theory of the nuclear force is only now
beginning to germinate by way of an underlying theory of the elementary particles.
properties of the nucleusand the nuclear force. We take advantage of our experience
with atoms and apply our knowledge of quantum mechanics to introduce some of the
prevailing models. The phenomenology continues in our treatment of nuclear decays
and nuclear reactions in Chapter 15. The specific nature of the fundamental force
between the constituents of the nucleus remains submerged until the question can be
brought back to the surface in the final chapter on elementary particles.
A preliminary picture of the nuclear size, charge, and mass was in existence by 1920.
The a-particle scattering experiments found the radius of the nucleus to be of order
10 '
m, four orders of magnitude smaller than the size of the atom. Moseley's
analysis of characteristic x rays established the connection between the nuclear charge
and Mendeleev's atomic number Z. The smallness of the electron mass meant that
almost all the mass of the atom was concentrated in the nucleus. It was recognized
that the atomic mass was close to a whole number of hydrogen mass units, and it was
also noted that this mass number A was approximately twice as large as the correspond-
ing charge number Z for most atoms.
The first tentative nuclear model assumed a bound structure of protons and
electrons, the only known It was thought that the nucleus contained A
particles.
protons to account for the observed mass number, and A — Z electrons to give the
total charge of the system the value
Ae — (A — Z)e = Ze.
The electrons were believed to be present in the nucleus because the /?-ray process was
known to involve the emission of electrons from the core of the atom.
14-1 Nuclear Particles 669
Figure 14-1
5 MeV
Paraffin
detector
The proton -electron model had several fatal flaws. It was difficult to understand
why the magnetic moment of a nucleus containing electrons should have an order of
magnitude equal to the nuclear magneton instead of the Bohr magneton. It was also
impossible to reconcile certain nuclear spins with the spin- ^ properties of the proton
and electron. (For instance, the N atom was known from studies of molecular nitrogen
spectra to have an integer-spin bosonic nucleus, while the mass and charge numbers
A = 14 and Z = 1 called for a nuclear constituency of 14 protons and 7 electrons.
The odd total number of fermions was not consistent with the existence of a composite
boson.) Finally, it was unrealistic to suppose that the electron could be localized in
such a small region of space because of the large kinetic energy implied by the
uncertainty principle. Rutherford's proposal of a massive uncharged nuclear particle
opened the way for a satisfactory picture of the nucleus, and Chadwick's discovery of
the neutron removed the electron from consideration as a permanent nuclear particle.
These developments made it necessary to regard the emission of /?-ray electrons as a
separate phenomenon unrelated to the binding of the nucleus.
The discovery of the neutron required a special technique for detecting neutral
particles. (Neutrons would not leave visible tracks in a material medium because they
would not interact electrically with the atoms in the material.) Chadwick's method
relied on the observation of the secondary charged particles that appeared when the
primary neutral particles reacted with the atoms in the medium of his detector. His
experiment used a radioactive polonium source to generate a beam of 5 MeV a
particles and employed a beryllium foil to provide a target, as shown in Figure 14-1.
Neutral "Be radiation" emerged from the target and passed through sheets of
paraffin, ejecting protons from the hydrogenous material with kinetic energies as large
as 5 MeV. At first the neutral particles were hypothesized to be photons so that the
process in paraffin was presumed to be the Compton scattering of y rays by protons.
The kinematics of the Compton process implied that the incident y energy must have
been as large as 50 MeV to explain the ejection of protons with the observed 5 MeV
kinetic energy. Chadwick did not accept the view that a 50 MeV photon could be
emitted in the collision of a 5 MeV a particle with a Be nucleus. He also rejected the
y-ray hypothesis because the detection of protons in paraffin greatly exceeded
predictions based on the cross section for Compton scattering. He argued instead that
the energetic protons were ejected by an unknown form of Be radiation consisting of
neutral particles with mass approximately equal to the mass of the proton. This
alternative hypothesis enabled him to conclude that the 5 MeV a particles produced 5
MeV neutrons in beryllium and that these neutrons then transferred their energy to
protons in the paraffin detector. Similar studies were also performed with nitrogen gas
substituted for paraffin. Tracks of recoiling nitrogen nuclei were detected in this
version of the experiment, making the y-ray hypothesis even more untenable and
670 Properties and Models of the Nucleus
leaving the neutron as the only satisfactory interpretation for the unknown Be
radiation. These investigations were convincing enough to establish the existence of
the neutron.
Ever since the discovery of the neutron, the accepted view of the nucleus has been
to treat the proton and neutron as the basic nuclear particles and to regard the
existing species of nuclei as bound configurations of various numbers of these
constituents. The positively charged proton and the uncharged neutron are spin-^
fermions with only slightly different masses:
M p
= 938.27231 MeV/c 2 and M„ = 939.56563 MeV/c 2 .
In fact, most of the nuclear characteristics of the proton and neutron are identical, and
so the generic name nucleon has been given to the two particles.
Each nucleon has a magnetic moment associated with its spin. These two quantities
are known from experiment to have the values
in terms of the nuclear magneton defined in Equation (8-56). Let us recall that the
general relation between the nuclear magnetic moment ji
7
and the corresponding
nuclear spin I is expressed in Equation (8-57) as
In the case of the proton and the neutron we use the vector I to denote a nucleon spin
with quantum number '= ^, and we quote the results for the magnetic moments as
i
The nuclear g-factors of the nucleons are therefore given by the values
(The proton g-factor has already come up in our discussion of hyperfine structure in
Section 8-12.) These experimental results for the proton and neutron are rather
different from the g-factor for electron spin, g s/2 = 1.001 .... and are also quite
different from the values predicted for charged and neutral spin- particles in Dirac's -,
n p
— = 1 and — = (Dirac theory).
The magnetic moments may be taken as evidence that the nucleons are structured
particles, unlike the electron, where the structure is not simply described by the Dirac
equation.
The force that binds nucleons together in the nucleus is very different from the
electrostatic force responsible for the structure of atoms. It is clear that the nuclear
force must be very complex to explain the observed size, shape, stability, level
structure, and reaction behavior of all nuclei. The interactions among the nucleons
can be associated with a complicated two-particle potential energy acting between all
14-1 Nuclear Particles S7I
Figure 14-2
Atom
pairs of constituent particles. It is known that the nuclear forces between proton and
proton, proton and neutron, and neutron and neutron are essentially identical. Evi-
dence for this important simplifying property of the nuclear force has been gathered
from proton-proton and neutron-proton scattering experiments and from many other
sources. The small size of the nucleus is a qualitative indication that this two-body
force has a very short range. Figure 14-2 shows a highly schematic (and dispro-
portionate) comparison of the orders of magnitude for the atomic radius, the nuclear
radius, and the internucleon range.
Figure 14-3
V(MeV)
k r(fm)
672 Properties end Models of the Nucleus
It is obvious that the attractive force between nucleons must be very strong at short
range because the nuclear force between protons overwhelms the destabilizing Coulomb
repulsion of like charges. The strength of the two-nucleon interaction varies with the
separation of the particles and can also depend on such other variables as the
momentum and spin of each nucleon. The graph in Figure 14-3 describes a possible
model of the nuclear potential energy for a pair of protons in a state with zero total
spin. The figure also shows the Coulomb potential energy to indicate the dominance of
nuclear attraction for ranges smaller than a few fermi. (The natural scale of length in
nuclear physics is defined by the unit 1 fm = 10" 15
m. This length is written as one
femtometer and is read as one fermi.)
The units on the vertical axis in Figure 14-3 indicate a nuclear interaction energy 7
in the MeV range. Excitations of the states of nuclei require similar amounts of energy
to reach levels much farther apart than those involved in the analogous atomic
processes. This upward adjustment in the scale of energy is commensurate with the
reduction in the scale of distance suggested by the descent from the atomic to nuclear
size in Figure 14-2. The change of the scale of energy has certain physical implica-
tions. We witness the excitations of atoms in the optical phenomena of ordinary life,
but we observe the excitations of nuclei only under extraordinary circumstances such
as high-energy collisions or high-temperature environments. Our initial discussion of
the nucleus is restricted accordingly to the properties of the lowest energy state.
Example
Let us calculate the electrostatic repulsion between two protons and compare the
two effects shown in Figure 14-3. We express the Coulomb potential energy in
terms of the fine structure constant as
ahe
Vcouiir)
AttEqT r
he = 1240 MeV •
fm and he = —=
he
277
197.3 MeV •
fm.
1 197 MeV •
fm
*o™.i
= = L44 MeV -
0,1,1
137 1 fm
The figure shows that Vnud dominates V^^ at 1 fm by almost two orders of
magnitude.
Example
We can use the de Broglie wavelength to demonstrate the relevance of the wave
nature of nuclear particles. A proton with mass M p
and kinetic energy K has
wavelength
h h he
\= - = . = .
P fiMp K pKM/~
14-2 Nuclear Systematic* 673
1240 Me V- fm
\ = = 12.8 fm
/2(5 MeV)(938 MeV)
and
/
— (12.8 fm) = 5.72 fm.
These values are of the same order as the nuclear radius, so that a proton beam
with kinetic energy in the range 5-25 MeV is expected to undergo appreciable
diffraction in collisions with nuclei. Note that X represents a scale of distance for
probing the structure of the nucleus, and recall that this scale diminishes with
increasing beam energy. The calculation also tells us that protons with kinetic
energies of order 5-25 MeV can be bound in nuclei by attractive potential
energies of realistic strength, since the wavelengths of the wavefunctions fit
The nuclei of atoms exist in a multitude of different species known as nuclides. These
systems cannot be charted on the periodic table of the elements because the assign-
ment of atomic number alone does not suffice for a classification of nuclei. Every
element corresponds to a specific atom with a particular number of protons, and each
may have any one of several possible nuclei with different numbers of constituent
neutrons. Nuclides are called isotopes if they bear this relation to one another. These
varieties of nuclei have different values of the mass number A for the given choice of
atomic number Z.
The isotope concept was proposed in 1913 by F. Soddy, another of Rutherford's
many colleagues. The name "same place" in the
implied that atoms could occupy the
periodic table and acknowledged that atoms could be chemically identical and still be
physically distinct. Soddy's idea was put forward, almost 20 years before the discovery
of the neutron, in an effort to understand why different types of radioactive behavior
should be observed for the same element.
The numbers A and Z are used to systematize the properties of the nuclides. We
identify the mass number A and the charge number Z for a given nucleus as the
number of nucleons and the number of protons, and we define N to be the number of
neutrons in the nucleus according to the relation among the three integers
A = Z+ N. (14-1)
where X is the chemical symbol for the atom with atomic number Z. Of course, it is
redundant to employ all four symbols since X and Z convey the same information,
and since A, Z, and N
satisfy Equation (14-1).
sample of the unstable nuclei decays to half of its original population. A naturally
occurring radioactive isotope has a measurable abundance on Earth and must either
decay with a very long half-life or exist as part of a disintegration chain originating
from some other long-lived decaying nucleus. We include a few half-life data in
Figure 14-5 to convey a sense of the enormous range of variation of this quantity.
Let us return to Figure 14-4 and note that the most conspicuous property of the
plotted nuclei is the near equality in the numbers of protons and neutrons. We explain
this trend later on when we apply the exclusion principle to the nucleus. The figure
also shows a secondary tendency for nuclides to fall below the line Z = toward the N
region where the neutrons outnumber the protons. This preference for neutrons stems
from the Coulomb repulsion between protons and grows with increasing nucleon
number. The Coulomb effect causes protons to experience less nuclear binding than
neutrons, particularly in the larger nuclei where the nucleons are more likely to be
farther apart.
The locations of the stable nuclides are clearly marked in Figure 14-4. (By
convention, a nuclide is said to be stable if there is no known decay, or if there is an
extremely long half-life expressible only by a very large lower bound.) We see at once
that the radioactive entries are in the majority and that the stable species cease to
occur at beyond a particular mass number. The first stable nucleus on the chart is
all
209
'H, the proton, and the last is Bi. The figure shows that the stable nuclei occupy
central positions in each isotopic or isobaric family, while the radioactive members of
the family lie to either side. We can visualize by drawing an
this distribution
imaginary valley of stability through increasing values of the mass number from A = 1
to A = 209. The stable-nuclide path between 'H and
209
Bi has two noteworthy gaps,
at A = 5 and A = 8. We interpret the nonexistence of stable candidates for these
Figure 14-4
Distribution of stable and radioactive isotopes. Data are taken from Chart of the Nuclides, 13th
edition.
14-2 Nuclear Syslematics 675
J^
100 -
^n
i 1
90-
80
130 140 150
J c
j£r
I
J LU-
/(i
"I
60- -cz r
n_c
50 £="
Trrr
80 90 100 110 120
n_q n
40;-
TT'
30 ^ :
:••
:••
-
•
i
>-
H 1
•
-r
H 1
• :•- m I
• • • •
20-
F=F
I
10 " , F a Stable
^=^~ Radioactive
,i", i u
10 20 30 40 50 60 70
TV
676 Properties and Models of the Nucleus
Figure 14-5
Stable and radioactive isobars at low mass number. Isotopic abundances are quoted for the
stable nuclides, and radioactive half-lives are given for the unstable isobars.
io
c . . .
19.3 s
io B
199
10
Be
. . .
1 6*10 6 y
Radioactive half-life
s = second
m = minute
y = year
The stable odd-A nuclei exist in roughly equal numbers of even-odd and odd-even
varieties. This observation is a hint that the nuclear force does not distinguish between
protons and neutrons. The distribution of the stable even-A nuclei is more remarkable
14 -2 Nuclear Syslematics 677
since almost all these nuclides fall into the even-even category. The odd-odd entries
2
H, 6
Li,
10
B, and l4
N at low A
r,0
V and l80
Ta at higher A.
We take the imbalance between even-even and odd-odd nuclei as evidence that the
nuclear force has a pairing property. The evidence tells us that the force between
nucleons in the nucleus has a strong preference for paired-proton and paired-neutron
configurations. This feature of the nuclear force must be incorporated in the building
of nuclear models.
Figure 14-4 suggests another numerological exercise that also has implications for
model building. If we distribute the 268 stable nuclei over approximately 100
elements we find that an average of two or three stable isotopes is expected for each
element. In fact, an inspection of the nuclear chart reveals some very marked
departures from this expectation. If we follow the valley of stability in the figure we
encounter unusually large stable populations of nuclei along the succession of lines
N= 20, Z= 20, N= 28, Z= 28, N= 50, Z= 50, and N= 82. The most striking
occurrence of stability is seen at Z= 50 where the element tin is found with ten stable
isotopes. These patterns of exceptional stability in nuclei are reminiscent of the shell
closures observed in atoms. The numbers 20, 28, 50, and 82 are among the magic
numbers associated with the shell structure of the nucleus.
We might wonder whether any stable nuclei exist beyond the range of the current
nuclear chart. Nuclear stability tends to terminate with increasing numbers of
nucleons because of the destabilizing influence of Coulomb repulsion among the
growing numbers However, the shell theory of the magic-
of protons in the nucleus.
numbers indicates the possible existence of an "island of stability" off the chart, at
coordinates given by the predicted magic numbers 114 and — 184. New Z= N
superheavy elements ought to exist in this vicinity if these predictions are correct. No
trace of any such element has yet been found in any samples of naturally occurring
material. An attempt to synthesize superheavy products in nuclear reactions is also
underway using accelerated beams of heavy ions (such as 48 Ca or even 2iH \J) incident
on heavy targets (such as 248 Cm). These studies have not produced any positive
evidence in their early stages of investigation.
Example
4
He 2 +*Be 5 - 12
,C f) +X-
Note that conservation laws are imposed on the sum of the charge numbers and
the sum of the mass numbers to identify the carbon nucleus in the final state.
678 Properties and Models of the Nucleus
4
He+ 9 Be^ ,2
C+V
or in even more streamlined fashion as
9 12
Be(a,rc) C.
We include the superfluous left and right subscripts Z and N in the first version
of the reaction to draw attention to the conservation laws for the total charge
4 9
and the total number of nucleons. Let us also note in passing that He, Be, and
l2
C are stable nuclides and that the only unstable participant in Chadwick's
experiment is the neutron itself. This instability causes the neutron to decay and
transform into the proton via the /2-radiation process. We see from Figure 14-5
that the transformation is between isobars and that the quoted half-life is 10.5
min.
Example
The first determinations of nuclear size came from the Rutherford scattering experi-
ments. These estimates of the nuclear radius were obtained by observing deviations
from the Rutherford cross section and attributing a deviation to the effect of a nucleus
of definite volume.
More refined measurements of nuclear structure became feasible with the develop-
ment of accelerators. These machines were able to produce beams of particles with
de Broglie wavelengths small enough to probe the details of the nucleus at short range.
Protons, deuterons, a particles, and other ions could be accelerated in primary beams,
and neutrons could be extracted in secondary beams, all for the purpose of bombard-
ing nuclei at variable energies. The accelerators came into use, along with the
different kinds of detectors, to provide the "microscopes" for a systematic exploration
of the stable nuclei.
All the probes just mentioned are nuclear particles whose reactions with nuclei are
governed by the nuclear force. Such beam particles are ideally suited for the
unknown interaction. On the
investigation of this other hand, studies of nuclear size
and composition are more clearly interpreted if the electron is chosen as the beam
particle because the interactions of the electron are dominated by the well-known
electromagnetic force. The probing electrons see mainly the protons in the nucleus
since the main effect is the electrostatic force between charges. (Electrons also have a
weaker magnetic interaction with the neutrons in the nucleus. This effect can be
observed in domains where the electric interaction is suppressed.) Unique information
about the distribution of the nuclear charge is obtained by performing diffraction
14-3 Electron Scattering and the Nuclear Radius 679
Figure 14-6
nator
Collimator ^^>/^
Magnet
Scattering chamber
Slit -
==-/— Target
(^ J
Magnet
Shielding Spectrometer
experiments in which electrons are scattered by the nucleus. These data can be used to
determine the nuclear radius.
The electron-scattering experiments had to wait for the construction of high-energy
electron accelerators. A comprehensive series of investigations of nuclei was finally
undertaken by R. Hofstadter and his associates in 1953. Eventually, these studies were
extended to include measurements of the internal electromagnetic structure of the
proton and the neutron. Thus, the whole range of electron-scattering experiments gave
a description of the constituents of the nucleus as well as the nucleus itself.
e + X -» e + X.
The equipment includes an electron accelerator and deflecting magnets to prepare the
high-energy electron beam, a scattering target of species X, and a spectrometer to
detect electrons scattered elastically in directions given by the indicated scattering
angle 6. This apparatus constitutes an elaborate high-energy device for the study of
electron diffraction, since the angular distribution of the scattered electrons has the
appearance of a diffraction pattern. We represent these observations by means of the
differential cross section do/dSl for elastic electron scattering, an angle-dependent
quantity analogous to the Rutherford cross section for the scattering of a particles.
Experimental values of do/dQ, are plotted in Figure 14-7 for a single beam energy
and for several nuclear targets. We see the characteristic features of a diffraction
pattern in each of the graphs, as the cross sections fall rapidly from the forward
direction at 6 = and exhibit small peaks at other angles.
The behavior of do/dQ, is similar to the diffraction of light by a spherical obstacle
with a dense interior and a diffuse surface. A good characterization of electron
scattering can be given in these terms by adopting a spherical model of the nucleus in
which the nuclear charge density has the form
p{r) = (14-2)
1 + e
(r-*)Ai
Kr
680 Properties and Models of the Hucleus
Figure 14-7
Differential cross sections for elastic electron scattering at 183 MeV. Data are plotted versus
scattering angle 6 for calcium, indium, and gold targets.
do
(barn/sr)
-2
10
10
10
10
10
10
10 8 (deg)
r
30 ,0 70 90 110
This expression is like a Fermi distribution in which the two parameters R and z,
P(0)
=
R/z
1 + €
so that p, and p(0) are approximately equal for R » zv We can interpret the
significance of these features with the aid of Figure The illustrated charge
14-8.
density falls through the value p,/2 at r = R, dropping from 90% to 10% of the
maximum density over a small distance given by the indicated surface thickness t.
Figure 14-8
-Surface
thickness
t
formula
R = R A X/3 (14-3)
over the whole survey of nuclei. Other methods of determining the nuclear radius
confirm the A dependence of Equation (14-3). In general, these techniques employ the
parameter R alone and yield values of R in the range 1.18-1.40 fm.
The decrease of the central charge density p(0) is a noteworthy feature of Figure
14-9. This behavior opposes the tendency for neutrons to outnumber protons with
Figure 14-9
Nuclear charge densities deduced from electron scattering. The cases illustrated correspond to
the nuclei considered in Figure 14-7.
p(10" C/fm 3 )
r(fm)
682 Properties and Models of the Nucleus
increasing values of A. We can blend these two opposing effects and obtain an
effective nucleon density with practically the same central value for all nuclei. We define
this quantity by noting that the density of protons in the nucleus is p(r)/Ze and by
assuming that protons and neutrons have the same distributions. The result is a
density of nucleons given by the expression
A
-p(0-
Ze
The approximate uniformity of the central value (A/Ze)p(0) over all nuclei suggests
the approximate uniqueness of the mass density for all forms of nuclear matter.
The same conclusion can be drawn from the A l/3 behavior of the nuclear radius in
Equation (14-3). If we compute the nuclear volume as
t^7tR — -^ttRqA
M p
A 3 Mp
'
3
±77/? 477 R%
Example
The following two computations of the nuclear mass density illustrate our
conclusions. We take 0.17 nucleons/fm
3
as a reasonable estimate of the central
nucleon density, and we multiply by the proton mass to obtain
A
—
Ze
p(0)M b
=
,
= 2.8 X 10
17
kg/m 3 .
Alternatively, we use the result obtained from the A x/Z behavior of R to find
3 M 3 1.67 X 10~ 27 kg
4tt R6
h
f
=
4tt (i.07 x 10" 15 m)
%= 3.25 X 10
17
kg/m3 .
3
enormous value compared to the representative figure 10 kg/m for ordinary
atomic matter.
Figure 14-10
magnetic field
Ion collector
Amplifier
3»i:!=rrs^-~Jfr. and
recorder
inaccurate terminology) to denote the average mass of any naturally occurring element.
It was possible to use Thomson's neon masses along with an estimate of the two
abundances and obtain an average atomic mass of 20.2, a value consistent with the
known atomic weight of neon. Eventually, the isotope concept was put forward and
the meaning of atomic weight was made clear. It was found that the atomic weight of
any element could be explained by averaging over the abundances of the correspond-
ing stable isotopes.
Thomson's experiments employed the acceleration and deflection of positive ions by
electricand magnetic fields. An improved version of his apparatus was developed in
1919 by F. W. Aston in the design of the first mass spectrograph. The instrument
separated isotopes according to their masses and gave accurate mass determinations
for the observed ions. Aston used these mass spectra to analyze the isotopic composi-
tions of more than 50 elements. He showed that the isotope masses were nearly equal
to integral multiples of the hydrogen mass, and he found that the small deviations
from whole-number multiples could also be measured in more refined experiments.
Measurements mass provided direct information about properties of the
of the atomic
nucleus. The measured deviations from whole numbers proved to be especially
significant as clues to the binding of the nucleus. Mass spectrometry flourished in the
1930s because of improvements in detector design and vacuum technology. The mass
spectrograph demonstrated its practical value as a separator of isotopes in 1935 with
the discovery of the rare long-liveduranium nuclide 235 U.
Figure 14-10 shows the design of a mass spectrometer for the isotopic analysis of
gaseous elements. The apparatus admits a sample of gas at low pressure and
bombards the atoms with electrons to convert the sample into positive ions. Electric
and magnetic fields guide the charges to an ion detector where the ions are collected
separately according to mass. The ions are accelerated to speed v by an applied
voltage V and are then deflected in a circular path of radius R by a magnetic field of
strength B. Nonrelativistic ions with mass M
and charge e acquire a kinetic energy
M
— 2
v = eV
2
684 Properties and Models of the Nucleus
Figure 14-11
Mass spectrum of xenon indicating the relative abundances of the nine stable isotopes.
Xe
134
136
128
126 124
A. A —k—
Mv'
= Bev.
~R~
Ml — e
v = BR - BR
M M
The quantity of interest in the last equality is the mass-to-charge ratio
M (BR)
2
(14-4)
t IV
The mass spectrometer operates at fixed values of B and R and employs a varying
voltage V in order to collect ions with various M/e ratios. This technique produces
mass spectra like the one sketched in Figure 14-11. Note that the output signal from
the ion detector provides a measure of the relative abundance of each isotope.
Masses can be measured more precisely by comparing unknowns with certain
carbon-bearing calibration standards. Let us illustrate by considering a sample
containing atomic and molecular ions of hydrogen, deuterium, carbon, oxygen, and
methane. The resulting mass-spectroscopic lines include the following three M/e
doublets:
+ + + +
('H'H) - 2H + , (
2
H H 2 2
H) - 12
C++ , and (
12
C H H'H H) - 16
, 1 1
,
+
where the symbols ( • • •
) refer to singly ionized molecules. We note that the two
144 Huclear Mass and Binding Energy 685
members of each doublet have the same nominal value of M/e, and we find that a
small line splitting appears in the mass spectrum at the location of each pair. We can
measure these three splittings and use the results to deduce the masses of 'H, 2 H, and
16 12
relative to the mass of C. Problem 5 is included at the end of the chapter to
illustrate this procedure.
These investigations of ions in mass spectrometry provide the means of determining
the masses of neutral atoms, each with its full complement of atomic electrons. A
representative listing of atomic masses is provided along with other pertinent nuclear
data in Appendix A. By convention, the quoted values have atomic mass units (symbol
I2
u), defined such that the commonly occurring neutral carbon atom C has the
ground-state mass value
M{ X2 C) = exactly 12 u.
2
uc = 931.49432 MeV.
Eb =Y,M c -Mc t
2 2
. (14-5)
£ 6
(atom) = [Af(nucleus) + Zm - M(atom)]c 2
r
. (14-6)
686 Properties and Models of the Hucleus
This atomic binding energy has values like 13.6, 13.6, and 79.0 eV for hydrogen,
deuterium, and helium and becomes as large as hundreds of keV for atoms of much
larger Z. In all cases we can justifiably ignore such comparatively small quantities
whenever we examine the binding properties of the corresponding nuclei. Equation
(14-5) tells us that the binding energy of a nuclide with Z protons and A r
neutrons is
£ A
(nucleus) = [ZMp + NM n
- M( nucleus) ] c
2
. (14-7)
We can eliminate the nuclear mass between the last two relations and replace the left
£ 6
(nucleus) = [ZMp + NM n
+ Zm - M(atom)]c 2
e
.
The combination M p
+ m can e
be set equal to the mass of the hydrogen atom since it
is safe to neglect the small 13.6 eV of atomic binding energy. The final formula for
the binding energy of the nuclide
A
X then assumes the following explicit form:
E b(
A
X) = [ZM('H) + NM n
- M^X)]^, (14-8)
in which the symbol M{ A X) refers to the mass of the neutral atom. This version of the
formula expresses the desired quantity in terms of masses found directly from mass
spectrometry. These atomic masses are listed as neutral-atom properties in Appendix
A. We can extract the corresponding nuclear masses if we wish by simply subtracting
the masses of Z electrons.
Figure 14-12 illustrates an interesting systematic property of the nuclear binding
energy. The graph shows the binding energy per nucleon E /A
b
for the various nuclides,
plotted as a function of the number by the fact that an
of nucleons. We are struck
approximate plateau is reached around the value 8 MeV per nucleon for all nuclei
beyond A = 16. This behavior indicates a saturation phenomenon that reflects the
short-range nature of the nuclear force. A long-range force would subject a nucleon to
binding interactions with all the other A — 1 nucleons in the nucleus and would cause
the binding energy of each nucleon to grow with A. Instead, the binding energy per
nucleon saturates around a particular energy and tells us that each nucleon experi-
ences binding interactions with only a limited number of nearest neighbors in the
nucleus. We also observe a slight decline in the plateau at larger values of A. This
secondary feature indicates the growing effect of the Coulomb repulsion between
protons to reduce the nuclear binding in larger nuclei. We have already seen such
behavior in the tendency for neutrons to outnumber protons at the higher values of A
in Figure 14-4.
The formulas for the binding energy can be employed to introduce the useful
concept of neutron separation energy. This quantity is defined as the binding energy of
the last neutron in the assembly of nucleons
A- I
X + n^iX.
We adapt Equation (14-5) to this definition and write the separation energy accord-
ingly:
~l
E„(
A
X) = [M( A X) + M n
- M( A X)]c 2 . (14-9)
144 Nuclear Mass and Binding Energy 687
Figure 14-12
Binding energy per nucleon versus nucleon number. A smooth curve is drawn through the
plotted points indicating the positions of several of the stable nuclides. Values of E b
are taken
from tables compiled by A. H. Wapstra and K. Bos.
%Mev) 56 84
40 Ca
JX Kr 120 Sn
197
Au 208p b
o 3 He
o2H
Note that the species X maintains its identity throughout since Z does not change with
the removal of a neutron. We can use Equation (14-8) to convert the formula for E n
into the difference of nuclear binding energies for the two isotopes:
EH ( A X) = {[ZM( H) + l
(A - 1 - Z)M,y - Eh (
A - l
X)} + M n
c
2
- {[ZM('H) + (A -Z)M n
]c
2
- Eb ( A X)}
= Eb ( A X)-Eb ( A - l
X). (14-10)
Both formulas for E n are of some interest. The second version is a clear statement of
the distinction between the binding energy of the last neutron and the nuclear binding
energy. The statement tells us that the latter quantity is the larger of the two energies.
Example
2
The atomic mass unit i i is defined in terms of the carbon standard as £Af(> C).
Let us convert the unit to kilograms with the aid of Avogadro's number:
1 12 g/mole
u = (io-
3
kg/g) = 1.6606 X 10"
21
kg-
23
12 6.0221 X 10 /mole
688 Properties and Models of the Nucleus
ik
2 =
(1.6606 X 10" 27
1.6022 X 10" 13 J/MeV
8
kg)(2.9979 X 10 m/s)~
^-p: — = 931.50 MeV.
This conversion factor is used over and over in nuclear physics calculations.
Example
Carbon appears on the nuclear chart with the following data for the two stable
12
isotopes C and 13 C:
atomic mass (u) exactly 12 13.00335482
isotopic abundance (%) 98.90 1.10
This result agrees with the quoted value for the atomic weight of carbon.
Example
Equation (14-8) may be used in conjunction with the atomic masses in Appendix
A compute the binding energy for any
to of the tabulated nuclei. In the case of
4
He we take Z = N = 2 and obtain
£ 6(
4
He) = [2(1.007825) + 2(1.008665) - 4.002603] (931. 5 MeV)
= 28.30 MeV.
For
12
C we take Z= N= 6 and get
£,(
12
C) = [6(1.007825) + 6(1.008665) - 12] (931 .5 MeV) = 92.16 MeV.
Note that large cancellations take place inside the square brackets and that both
calculations employ the conversion from uc~ to MeV. The binding energies per
nucleon in the two cases are
E b
28.30 MeV
= = MeV
T( 4He )
A 4
7.075
and
92.16 MeV
= 7.680 MeV.
A 12
These results appear to be well on their way toward the approximate universal
4
value 8 MeV. (Actually, the He figure is unusually large, lying well above the
rising portion of the data plotted in Figure 14-12.)
14-5 The Semiempirical Mass Formula 689
Example
Finally, let us illustrate the idea of neutron separation energy by applying the
formulas to some of the isotopes of cadmium. Equation (14-9) is particularly
suitable because the relevant masses are listed in Appendix A. We consider the
separation energies for the two cadmium nuclides
113
Cd and 114
Cd. For A = 113
we find
£„(
113
Cd) = [M( 112 Cd) + M n
- M( 113 Cd)]c 2
= (111.902758 + 1.008665 - 112.904400)(931.5 MeV)
= 6.542 MeV,
£„(
1,4
Cd) = [M( 113 Cd) + M„ - M( ni Cd)]c 2
= (112.904400 + 1.008665 - 113.903357)(931 .5 MeV)
= 9.043 MeV.
We use these two results again when we continue this illustration at the end of
Section 14-5.
The binding energy of a nucleus defines the position of the ground state on a nuclear
energy level diagram. This state, and all the higher excited states, should be de-
terminable from a theory of the nuclear force. We recognize, however, that any
attempt to solve the quantum problem of the binding of many nucleons would be a
most ambitious undertaking. Our approach takes a more deliberate course and turns
first to some rather elementary models. We aim these models at the nuclear ground
state and direct our attention to the systematic features displayed on the nuclear chart.
Let us begin with an early model, introduced von Weizsacker, in in 1935 by C. F.
which the nucleus is compared to a classical liquid droplet A certain resemblance exists .
between the two systems since the density and binding energy per nucleon are
essentially independent of the number of nucleons in the nucleus, while the density
and latent heat of vaporization do not vary with the number of molecules in the
liquid. Weizsacker's model also takes account of the Coulomb forces among the
nucleons as well as the quantum effects associated with nucleon spin and fermion
antisymmetry. The end result is a parametrization of the atomic mass as a function of
A and Z. The predictions pertain to the nucleus as a whole without reference to the
individuality of the nuclear constituents.
We construct the Weizsacker formula by identifying five different contributions to
the nuclear binding energy E The
b
. first effect is analogous to a heat of vaporization,
written as Lv M mo]ecul(,n, where n denotes the number of molecules in a drop of liquid
with latent heat per unit mass Lv . In the case of the nucleus we associate a similar
number dependence with the plateau behavior of E /Ab
in Figure 14-12, and we write
E = b\
a \
A
since the linear dependence on A is related to the volume of a sphere whose radius
varies as A i/3 in the manner of Equation (14-3).
We
have already interpreted the A independence of E b /A in terms of a fixed
number of internucleon bonds for a nucleon in any nucleus. The nucleons on the
nuclear surface are not surrounded by as many of these bonds, and so the binding
energy is reduced by a surface correction of the form
Eh2 = -a 2 A 2/\
(Zef 3
V= +-- — (14-11)
5 47TE R
and we turn again to the relation between R and A in Equation (14-3) to obtain
Z2
2
(A/2- Z)
*4
= ~° A '
energy as nuclides deviate to either side of the line Z = N, while the denominator
preserves the necessary linear dependence on A. We develop a special model in
Section 14-6 to explain this term in context with the meaning of the line Z= N.
Wehave also noted the overwhelming preference for stable even-/l nuclides to
occur in the even-even, rather than the odd-odd, category. This observation has been
taken as evidence for the pairing property of the nuclear force. Let us provide for
pairing in phenomenological fashion and include a contribution to E b
called the
pairing energy, which applies only for even A and favors specifically the even-even
14-5 The Sent/empirical Mass Formula 691
'5 even-even
± ^3/4 odd -odd
Ebb £5
It has been found that this formula gives a good empirical representation for the A
dependence of the pairing effect.
The nuclear binding energy is assembled from these five terms:
E b(
A
X) = a x
A - a2A
2
^- a3
Z2
-^ - a,
(A/2-Zf
— + e5 . (14-12)
M( A X) = ZM('H) + (A - Z)M„
(A/2- Zf 2
2/3
{
A - a2 A - «3^T7J - aA /c . (14-13)
(Actually, the fifth term in the binding energy is a more recent addition to the original
model.) The formula is called semiempmcal because the various constants are de-
termined by securing a best fit to the atomic masses and not by invoking any further
theoretical arguments. An excellent fit is obtained with the following parameters:
a, = 15.76 MeV,
a2 = 17.81 MeV,
a3 = 0.7105 MeV,
a4 = 94.80 MeV,
a5 = 39 MeV.
A
Xy and *YX .
These species are related by the interchange of proton and neutron numbers Z and iV
and are known as mirror nuclides. The interchange operation Z <-> N affects only the a 3
term in the formula for Eb and so the difference of binding energies turns into the
,
expression
2 2
Hence, the difference in binding energy is the same as the difference in Coulomb
692 Properties and Models of the Nucleus
energy for any pair of mirror isobars. Furthermore, the nuclei in question have equal
radii corresponding to the given value of A. We can take advantage of this
observation and determine the common radius by using measurements of E b
in
conjunction with the formula for the Coulomb energy in Equation (14-11). We
illustrate the procedure in the second example at the end of this section.
The mass formula offers many useful insights into the stability properties of nuclei.
We can establish a criterion for stability if we organize the nuclides as isobars and
examine the variation of the masses with regard to the (A, Z) dependence predicted
by the formula. We select a set of isobars by choosing a fixed value of the mass
number A, and we note that the chosen nuclides have nearly equal masses. Equation
(14-13) describes these species by a quadratic function of Z, which minimizes as Z
varies for constant A. The minimum is determined by the vanishing of the partial
derivative
dM . . Z Z-A/2
-c 2 =
,1/
[Af('H) - M„]c 2 + 2a 3 -^
A l/3
+ 2a,-
A a4 + [M„-M( l
H)]c 2
Z = ZA = (14-15)
a4 + a^A 2/z
Thus, we predict a stable isobar at ZA , where the lowest isobar mass occurs for the
given mass number A. Of course, this minimizing value of Z should not be expected
to coincide with a true integer-valued atomic number. The actual state of affairs is
illustrated for three consecutive choices of A in Figure 14-13. The e5 term makes no
contribution to the formula when A is odd, and so Equation (14-13) represents a
single parabola opening upward on the corresponding graph of versus Z. When A M
is even the e5 term adds a constant shift to the other contributions on the graph,
lowering the even-even masses and raising the odd-odd masses relative to the central
Figure 14-13
Isobar masses versus atomic number for three consecutive values of the mass number. The
stable isobars are ™Mo and ™|Ru for A = 98, ^Ru for A = 99, and '$Mo and '^Ru for
A = 100.
M M M
even^ odd
V v-odd
-even
97 91 - 9891 9991
V\Tc
Mo " Ru
A = 98 A = 99 A = 100
1 1 l
1 ,
1 1 I I III z
38 40 42 44 39 41 43 45 40 42 44 46
14-5 The Semiempirical Mass Formula 693
parabolic curve.The figure employs two parabolas in this case to show how the
measured masses hop sequentially between the even-even and odd-odd isobars. The
minima on the curves lie close to the indicated integer values of Z at which the stable
nuclides occur.
Equation (14-13) determines a mass surface in the variables A and Z. Figure 14-13
shows three slices at constant A through the surface, with minima corresponding to
the quantity Z 4 (Obviously, there are three surfaces: one for odd A, and two more for
.
the even-even and odd-odd varieties with even A.) These graphs furnish a clear
picture of the valley of stability to supplement our previous discussion of Figure 14-4.
Let us conclude our observations about stability with the following incidental
remarks. We note that nuclides of technetium appear in all three parts of Figure 14-13
and that none of these species is stable. In fact, no stable jTc nuclide exists for any
value of A. Technetium at Z = 43 and promethium at Z = 61 are unique in this
respect since these are the only elements with no stable isotopes in the domain of
stable elements below bismuth at Z= 83.
Example
The constant a 3 is the only parameter in the mass formula to be associated with
a specific theoretical prediction. If we compare Equation (14-11) to the expres-
sion for E h3 and use Equation (14-3), we find
3 ahc
,1/3
5 4we R A*
/:i
5 4ire R 5 R
(The fine structure constant has been introduced here to simplify the numerical
work.) Since a 3 is regarded as known from the empirical fit, the relation yields a
determination of the radius parameter:
Example
The A = 15 mirror nuclides 'yNg and 's0 7 have atomic masses 15.000109 and
15.003065 u, respectively. The difference of binding energies is obtainable from
Equation (14-8):
A = £ A ( 15 N) - £„(
15
0)
= [7Af('H) + 8M - M( n
,5
N)]c 2 - [8M( H) + l
lM - M( l5 0)]c 2
n
We have identified this quantity with the difference in Coulomb energy. Let us
consult Equation (14-11) and take Z= 1 and 8 to get
3 e cchc
V8n - V = -(64
V n
7
- 49) = 9
5 477e tf R
as an alternative expression for the quantity A. Note that R has the same value
for the two isobars since R depends only on A. We calculate the radius from the
result obtained for A,
R =
()
—Rl/3
7-R =
3.666
m— l/3
fm
= 1-487 fm.
A 15
Other pairs of mirror nuclides also give values for R in the vicinity of 1.5 fm.
Example
\[E b{
A
^X) + i^X)] - £,('
4
X) = -^ + ... .
(The proof is left as Problem 1 1 at the end of the chapter.) The left side of this
relation can be written in terms of two particular neutron separation energies as
follows. Equation (14-10) gives the equalities
E„(
A + l
X) = E b (
A+
>X)-E b(
A
X) and E„(
A
X) = E ( A X) - E ( A ~
b b
l
X),
H^ + ,
-[£„(
.
U4 Cd) - EB ("'Cd)] =
. 9.043
~ — - 6.542
-MeV= 1.251 MeV=
T ^
a5
7?
.
The result, a b = 43.35 MeV, is a bit larger than the average value given for this
constant in the text.
14-6 The Fermi Gas Model 695
Nucleons behave as fermions and obey the Pauli principle. Hence, the protons and
neutrons in the nucleus are influenced by the same constraint of fermion antisymmetry
as the electrons in the atom. We know that the Coulomb force and the exclusion
principle determine the structure of the atom, and we suppose that a similar
construction works just as decisively in the case of the nucleus. Nuclear binding is a
very different dynamical problem since the constituents attract each other with a
strong short-range force and the system as a whole has no obvious center of attraction.
Nevertheless, we are able to adopt an independent-particle central-field approach as
one of the avenues in our study of the nucleus. We find that this familiar outlook
enables us to draw several important phenomenological conclusions.
Let us begin with a qualitative argument to explain why the nucleus prefers a
composition of equal numbers of protons and neutrons. The nuclear force is not
needed in any detail since the exclusion principle is the main ingredient in this
explanation. We know that the Coulomb repulsion between protons distorts the
systematic Z=N property in favor of neutrons for the larger nuclei, so let us consider
the smaller nuclei and ignore the Coulomb effect. An individual proton or neutron is
then subject to the same average force arising from all the other nucleons. Figure
14-14 shows a short-range central potential energy to represent the influence of this
force on a single nucleon. Also shown are the resulting single-particle energy levels,
analogous to those found in the central-field model of the atom. (We ignore the
angular momenta of the levels since these additional properties do not affect our main
conclusion.) The exclusion principle allows us to fill each level with no more than two
protons and two neutrons corresponding to spins up and down. The example in the
figure describes two occupation schemes for A nucleons with different assignments of
Z and N. We see that the binding energy of the last nucleon is greatest for the
Figure 14-14
. , .
&
vy
fn\ f^l
~\nj-\nj
-<&-
-<£H?M»H|)- —(J)-(n)
—(p> -®" —®~~®—
{P)—{P) —(n)—(n) —(p> -0- —®~~vV—
—(p> -®- —©-vy
Z=10 N=ll z = 6 N= 15
:
configurations closest toZ = N, and we conclude that these circumstances are the
most favorable for nuclear stability.
We can employ fermion antisymmetry, and little else by way of input, to justify the
expression given for the symmetry energy in Section 14-5. Our procedure retreats
temporarily from the independent-particle picture and takes an average over the
detailed dynamics of the nucleons. The nucleon number A
assumed to be rather is
large for this purpose so that the nuclear system becomes vast enough to warrant the
use of quantum statistics. We treat the nucleus as a large collection of protons and
neutrons moving freely in a spherical enclosure defined by the nuclear volume, and we
describe the system as a degenerate Fermi gas in which the nucleons occupy their
lowest energy states consistent with the exclusion principle. To be specific, we let the
zero-temperature Fermi-Dirac distribution function be the sole vehicle for the descrip-
tion of the nuclear particles. The proton and the neutron are distinguishable from
each other, and so the exclusion principle and the methods of Fermi-Dirac statistics
apply to the two types of nucleon separately. Thus, the resulting nuclear system is
regarded as a mixture of proton and neutron Fermi gases.
We can adapt this picture to the independent-nucleon point of view with the aid of
Figure 14-15, in which we introduce a constant potential energy well for the free
motion of each individual bound particle. Since A is large we must incorporate the
effect of Coulomb repulsion for the proton and employ wells of different depth for the
two types of nucleon. A large number of energy levels is assumed in each case, and all
are presumed to be fully occupied by the gases of neutrons and protons up to the
indicated Fermi energy levels. Note that the Fermi energy and the number of particles
are larger in the case of the neutrons because the Coulomb effect diminishes the
nuclear attraction and elevates the well in the case of the protons.
The Fermi gas model of the nucleus follows directly from our zero-temperature
results of Section 11-6. We only need to modify the formulas for separate application
to neutrons and protons. The nucleus has a spherical volume determined by the mass
number A
V = \mFO = \mR\A.
We express the nucleon numbers in terms of the respective Fermi energies as in
Figure 14-15
Neutron and proton potential energy wells in the Fermi gas model. The proton potential energy
is elevated by the effects of Coulomb repulsion.
Vn (r) Vp (r)
Equation (11-72):
i/2
(2M)
z= .,.
3
Ve 3 / 2
for protons (14-16)
and
(2M) V2
N= , Vs
3
/ 2
for neutrons (14-17)
(The proton and the neutron are assigned the same mass throughout this M
discussion.) We
then introduce our expression for the nuclear volume and invert these
relations to solve for the Fermi energies:
b
> 2M
I 3v 2 h 3
V
z\
\
J
2/3
= —
h
2M\
2
-A
I
-.
—^z\
3tt
±itR
3
2
A
\
2/3
977 Z \
2 /3
(14-18)
2MRl\ 4 A
and
!r , 9v7 A \
2/:)
Equation ( 1 1-74) gives the total energy of the particles in each gas as a function of the
number of particles and the Fermi energy. We use this result to obtain
2 2/3
3 3 h (9ttZ\
Ey=
z —ZipF = -Z\
5 " 10 MR 2
\ 4 A
-V 3 2 5 /3
3 / 977 \ h I Z\
10 \ 4 / MR~ \ A
3 / 977 \
2/3
h
2
IN\
v
~ To t) A/^f It
A A
N= - + £ and Z= - - £
698 Properties and Models of the nucleus
and evaluate the relevant factors in Ez and EN by the following binomial expansions:
+
n 5/3
/ 2f\
5/3
/ 2f\
5 /3
1
" ^ + l+
a) U) 2 J [I j [ AJ
M 5/3
r 5 2i
+
5/2f\ 2
2/ [
" 3 A 9\T
5 2f
+ 1 +
3 A
+ -
9\ A
5 /
—
2f\
2 3 2
1\ / / 20 f
1 + +
2 \ Ta->
The total energy of the system of nucleons then takes the form
2 2
3 / 9tt\ 2 / 3 k ( 1\ 2 / 3 / 20 £
2
3 ,„ h 20 £*
-(9tt);
V3 A + (14-20)
40 MR* 9 /I
to second order in f. This quantity contributes to the rest energy of the nuclide ^X
and must therefore appear in the mass M( A X.) as parametrized in Equation (14-13).
We use f = A/2 — Z and rewrite the second-order term in Equation (14-20) in the
form
/J
(9irr h
2
(A/2-ZY
MR*
This final result reproduces the contribution of the symmetry energy to the atomic rest
nuclear reactions, where the higher energy permits access to the unfilled higher states.
14-7 The HucleonHucleon Interaction 699
Example
2
Let us connect the £ term in Equation (14-20) with the a 4 term in Equation
(14-13) by identifying the constant coefficients in the two expressions:
2 3
(9tt)
MR*'
If we set R = 1.2 fm and take Mc 1 = 939 MeV, we find that our statistical
model predicts
.2/3
/3
(Sir)"* (he)' (9ttY (197 MeV -fin)'
4
- = 44 MeV.
a \a-1d2
Mc'Rq 6 (939MeV)(l.2fm)'
Recall from Section 14-5 that the atomic masses are fit with a A = 94.80 MeV.
The agreement is only qualitative, as befits the crudeness of the model.
Example
* 977 \
2 ^
. -
F I
2MRl\
2/3
(9tt) h* 3
Ef ~ ~ -a 4 = 33 MeV,
8 MRl
using the result of the previous example, Figure 14-15 then tells us that the
depth of the potential energy well must be of order 40 MeV if a typical nucleon
has separation energy around 7 MeV.
The models in the last two sections are concerned with the global properties of all
nuclei and not with the details of nuclear structure. Let us set these first observations
aside now and take up the structural approach, beginning with the most primitive
nuclear system.
The fundamental problem of nuclear physics is the determination of the force
between two nucleons. We can probe the basic two-nucleon system to study this
unknown interaction by scattering nucleons from a proton target and by examining
the properties of the nucleon-nucleon bound state. The deuteron is the only existing
A = 2 nuclide in the latter category. This bound system of proton and neutron is
unique since there are no excited pn structures and no analogous pp or nn counter-
700 Properties and Models of the Nucleus
parts. We find that several important features of the nucleon-nucleon interaction can
be inferred from the deuteron, while many other details of the force can only be
learned from the higher-energy processes of nucleon-nucleon scattering.
Let us compile the following principal characteristics of the deuteron (symbol d)
and interpret the various properties afterward:
Each of these quantities is known from a number of experimental sources. One way to
measure the deuteron binding energy is by observing the energy of the y rays emitted
in the np capture reaction
n + p -» d + y.
The deuterons are formed in this process when slow neutrons from a reactor are
absorbed by protons in a hydrogenous target. The small value of Eb corresponds to
just over 1 MeV per nucleon, the lowest point plotted on the graph in Figure 14-12.
The nuclear spin and magnetic moment are determined from measurements of atomic
hyperfine structure and from magnetic-resonance experiments. The quadrupole mo-
ment is obtained by other applications of the beam-resonance technique. We define
the quadrupole moment below and interpret the quantity as an indicator of the
nuclear shape. The nuclear radius is measured in electron-scattering experiments as
discussed in Section 14-3. Recall that these measurements also provide a picture of the
nuclear charge distribution.
We assemble the nuclear spin vector of the deuteron by adding the spins of the
proton and neutron to the orbital angular momentum L of the proton-neutron sysem:
I = L + S, + S„. (14-21)
A highly schematic picture of this construction is shown in Figure 14-16. Note that S.
and S„ are nuclear spin vectors themselves, each with quantum number i = | (or,
equivalently, s = r,), while the deuteron has nuclear spin quantum number 7=1. We
can generalize Equation (14-21) immediately and give the following formula for the
nuclear spin of any nucleus consisting of Z protons and neutrons: N
We already know that the vector I and the quantum number are related by the rules i
I
2
= h
2
i{i+ 1),
Iz = hm l
with m = — i,l
. .
.
, i:
in integer steps.
14-7 The Hucleon- Hucleon Interaction 701
Figure 14-16
M
L
(-)
This construction endows a nuclide with half-integral spin if A is odd or integral spin
if A is even. We include a listing of nuclear spins in Appendix A, and we observe that
the deuteron is an even-A nuclide with three quantized spin orientations for nuclear
spin i=l.
We recognize the expressions in Equations (14-21) and (14-22) as exact analogues
of the formula for the total angular momentum J in the theory of complex atoms. We
know from Section 9-8 that we can describe an atomic state in the LS-coupling
scheme by the notation 2s f '£,, provided we can regard / and s as good quantum
numbers for the total orbital and spin vectors L and S. It is valid to adopt the same
spectroscopic notation for nuclei, under analogous dynamical conditions, and simply
substitute the quantum numbers i and m in place of j and m t
•.
the state of the bound nucleons, although the interpretation in terms of nuclear
structure is likely to be quite complicated for most nuclei. We can interpret the result
for the deuteron at once, however, by noting the near equality between \i d and the
sum of the magnetic moments of the proton and neutron:
Ax^ = 0.8574 ju
A . and /i, + ju„ = 0.8798 fi N .
(The data for the second figure are taken from Section 14-1.) It would appear that
almost all of the deuteron moment can be explained by assuming a parallel-spin s = 1
configuration for the proton and neutron in an tf= orbital state. The /=
assignment is expected if the ground state of the two-body system is governed by a
central force. The combination of (= with s = 1 is also consistent with the known
value i = 1 for the nuclear spin. If we can treat ( and s as good quantum numbers
+
particles subject to the nuclear force, then we can invoke the notation
:
for two L,
+
and refer to the 1
3
deuteron as a 5, two-nucleon state. Note that the /=
assumption agrees with the assignment of positive parity.
702 Properties and Models of the nucleus
Two spin- nucleons can form states with total spin quantum numbers s = and
-,
s = 1. The nuclear interaction between the two particles is evidently spin dependent
since an / = bound state exists for s = but not for s — 0. To model this situation
1 ,
we might assume a central interaction for the main part of the nuclear force and add
an extra spin-dependent piece to make the nuclear attraction greater for parallel
spins. The added feature is analogous to the hyperfine effect discussed in Section 8-12.
We have seen how the interaction of spin magnetic moments in Equation (8-58)
reduces to a spin-spin coupling, and we might consider the adoption of a similar
coupling here. An expression of the form S^, • S„ can produce the desired energy
splitting for the two total spins s = and 1 in the (= proton-neutron state. Of
course, the situation at hand is not exactly like the atomic hyperfine splitting since the
underlying dynamics is not of electromagnetic origin.
An {° = orbital state describes a spherically symmetric probability distribution
and a spherical shape for the corresponding nucleus. The small deuteron quadrupole
moment d implies a small deformation of the basic spherical shape of this nuclide.
Q
Let us explore these observations with the aid of the relevant classical and quantum
formulas.
We define a classical electric quadrupole moment in terms of a given charge
density p by the integral
It is clear that () has dimensions of charge times area, and it is convenient to express
Q, for nuclei in e • barn units. We can interpret Equation (14-23) more readily if we
rewrite the polynomial in the integrand as
2
/ x +y—2
3z
2
-x -y 2 -z 2 =
2
2\ z
2
This expression samples the shape of the charge by weighting the amount distributed
along the z axis against the amount distributed around that axis. A positive (or
negative) value of Q. ' s therefore an indication an elongated (or
of flattened)
deformation. We add some further classical remarks about the form of Q in the
example at the end of the section.
The quantum mechanical electric quadrupole moment is defined by the expecta-
tion value of the quadrupole polynomial, taken in the state of maximum /,. In the
case of the deuteron the coordinates in the polynomial are those of the proton:
<Q> = ej**(3z; - r
2
)^dT = - jV(3z 2 - r
2
)*dr. (14-24)
Figure 14-17
that s = 1 and c* = 2 can combine to preserve the = 1 quantum number for the i
nuclear spin. The assignment of a small probability to the t°= 2 correction results in a
fit for Qd and also accounts for the small discrepancy between the magnetic moments
\>-
d and \i
p
+ [i
n
.
We
have remarked in Section 14-1 that the interaction between two nucleons is the
same for pp, nn, and pn pairs of particles. We have just learned, however, that the
deuteron stands alone as a pn bound state. These seemingly inconsistent observations
are reconciled by the Pauli principle. We note that the deuteron is in a spin-symmetric
state because of the s = 1 assignment and is also in a space-symmetric state because of
the admixture of the two even values of /. The Pauli principle forbids this combina-
tion of symmetries for the identical-fermion pairs pp and nn but has no immediate
bearing on the deuteron since the pn system iscomposed of distinguishable particles.
The admixture of c° = and /= 2 in the wave function of the deuteron has
interesting implications for the nucleon-nucleon interaction. Since the physical state is
not characterized by a unique value of (" , it follows that c° is not exactly a good
quantum number fundamental pn interaction must not be due to a purely
so that the
central force. We conclude that an additional L-nonconserving effect, called the tensor
force, is also present in the dynamical problem. We have already cited the evidence for
a two-nucleon force involving the spins of the two nucleons. The new tensor interac-
dependence on the angles between the nucleon spins and the
tion introduces a further
two-body coordinate vector r. All these complicated aspects of the nucleon-nucleon
interaction are apparent in the properties of the primitive A = 2 bound system.
We identify other properties of the interaction when we probe the two-nucleon
system at higher energy in processes such as pp and np scattering. An experimental
finding of some interest is the existence of a large spherically symmetric component in
the distribution of protons scattered from protons. This effect is attributed to a strong
repulsive core at very short range in the nuclear interaction between the two particles.
Figure 14-3 shows how such a repulsive interaction dominates the attractive nuclear
potential energy of two protons at small values of r. We see this strong short-range
phenomenon in spherically symmetric c*= states, and we find that the effect is
masked by the repulsive centrifugal potential energy in states with nonzero c".
Signs of a repulsive core are also revealed in electron-scattering studies of the
deuteron. A pronounced dip is observed at the center of the deuteron charge
distribution, unlike the situation for the larger nuclides in the range considered in
704 Properties and Models of the Nucleus
Figure 14-9. This electron-scattering view of the proton-neutron bound state indicates
a strong tendency for nucleons to repel and avoid each other at very close range.
It is obvious that a very complex interaction exists between two nucleons. We treat
Example
Let us offer the following construction as background for the classical quadru-
pole formula in Equation (14-23). First, consider a point charge e at the origin
and recall the well-known formula for the electrostatic potential at distance r:
•Mr)
47TE r
(0,0, d/2) and a charge — e at (0,0, —d/2). The corresponding potential is the
sum of the two point-charge potentials:
•Mr)
47T£ 2 _
+y 2 + (z- J /r>\ 2 2
+ y2 +
rl
[ p
,/ , .,2 , /
d/2) ]jx (z + d/2)
where x 2 + y 2 + z
2
= r
2
. We expand this expression for d <^ r and obtain the
following result to first order in d:
ell zd
l \ -1/2
zd d2 \
->/*,
+ +
4we„r
0' \ r
z
\r^ 7< ^) )
Attest
(I
l\
\ \
1 + —+ Zd
2r ^
zd
+
-)>^
\\
477e
ed z
r
o
J
1
Let us denote the electric dipole moment by the quantity p = ed and express the
dipole potential by the familiar formula
P z
"
*(0 = J-
477e r
Finally, consider two such dipoles with opposite signs and separate locations,
taking p to be at (0,0, d/2) and -p to be at (0,0, -d/2). The resulting
potential is the sum of the two dipole potentials:
~ d/ 2
fS =
<Mr)
Pi z z + d/ 2 \
- d/2) 2 /2 2 2 V2
4^o 1 [x
2
+y + 2
(z
Y [x
2
+y + (z + d/2) \
14-8 A Simple Model of the Deuleron 705
Figure 14-18
Monopole Dipole
" 3/2
( zd d2
shir\ '
1
- - + —A \
477e r
J
\\ -J) {
i
r
2
4r'/
1 d\( zd d 2 ]' 3 ^]
+ + +
i' 2)l
1
^ 4?) )
"/ \"
3 zd \ ( 3 zd
p !
3
477e r \
d \
--[(! + .
..) + (l + .v)]j
pd t z
2
\ pd 3z
2
- r
2
The properties of the deuteron can be understood with the aid of the Schrodinger
equation. The binding energy is especially interesting because the smallness of this
quantity can be explained from a simple description of the proton-neutron interac-
tion.
Let us argue that the spin-dependent aspects of the interaction are only secondary
considerations and concentrate on the main effect of a two-body central force. We must
adopt a model to describe the unknown central potential energy V(r). The selection of
the model is not a critical matter since the binding of this nucleus is not very sensitive
to the detailed shape of the potential energy function. We find that the deuteron is
706 Properties and Models of the Nucleus
Figure 14-19
Square-well model for the binding of the deuteron. The function rR(r) satisfies Equation
(14-27), and the f— stationary-state eigenf unction \p depends only on r. The dashed curve in
the figure shows the limiting form of rR(r) for zero binding energy.
V(r) rR(r)
adequately treated with the use of a square well, parametrized by a well depth V and
a radius r . The parametrization of this model is illustrated in Figure 14-19. Note that
the energy level of the deuteron is found near the top of the well and that no higher
excited states are supposed to exist.
We ignore the neutron -proton mass difference and assign the same mass M to
each particle. Consequently, the reduced mass in the central-force problem has the
value n = M/2, and the coordinate vectors in Figure 14-17 satisfy the relation
'-2' (14-25)
We let the deuteron be described as a pure £, state with energy eigenvalue — E and
b ,
The function R(r) satisfies the general radial differential equation, as formulated in
Equation (6-43), for the special situation
situ; where jn = M/2, /= 0, and E = —E We
b
.
— (rR)- jM
d2
1
r
[V(r) + E b
](rR) = 0. (14-27)
Since V(r) is a discontinuous step function, the differential equation holds in piecewise
fashion:
M
i(^) -(Vo -E b
)(rR) = iorr<rQ
d,
14-8 A Simple Model of the Deuteron 707
and
M
— (rR)-—E
d2
2
(rR)=0 b
for r > r .
ME
*=— M(V -E
— n
b
and K= 1
n
b )
,
(14-28)
(rR)" + K 2
(rR) = for r < r
and
a sin Kr
rR(r) = { l _ kr . (14-29)
be
(The function cos Kr cannot be used inside the well because an infinite result would
+ ir
follow for R(0). The function e cannot be used outside the well because a
divergence would occur as r -> oo.) A sketch of ihe final solution is included in Figure
14-19. The figure also shows the stationary-state eigenfunction
R(r)
IAtt
and
These results represent two conditions in the determination of the three unknown
constants a, and Eb The normalization of the wave function provides the third
b, .
take the ratio of our two conditions in order to remove a and b from this part of the
problem. The result is a single equation relating E b
to the parameters of the model:
The second form of this relation follows from Equations (14-28). The usual methods of
quantum mechanics would treat the energy Eb as an unknown quantity to be
708 Properties and Models of the Nucleus
adopted.
We can penetrate Equation (14-30) most effectively if we look first at the limiting
situation where Eb —> 0. This limit is not far from the physical case, if indeed E b is as
small compared to V as Figure 14-19 suggests. The range-depth relation has the
limiting form
cot r = 0,
,a/j; 77 m 2h 2
p
>0
- V
K 0'0
r
2
= (£, = 0). (14-31)
h 2 AM
We examine this result numerically in the first example at the end of this section and
learn that the estimated well depth is at least an order of magnitude larger than the
measured binding energy, given a realistic choice for the radius parameter. Equations
(14-28) reduce to the limits
{MV»
k -» and K-+
h
-> —77
2r
as E b
-* 0.
77r
a sin — r < r
R(r) - 2r
in the limiting case. The behavior of this solution is indicated by the dashed graph in
the figure. Note that when Eb is quite small the solution rR( r ) just turns over inside
the well. The corresponding broad shape of the eigenfunction \j/ implies that this state
is not able to probe the details of the two-nucleon force with very much resolution.
Hence, the square well is able to give an adequate picture of the binding interaction.
The deuteron is said to be barely bound because of these properties.
The range-depth relation can be analyzed by graphical means when E b is allowed
to have its actual nonzero value. Let us rewrite Equation (14-30) for this purpose,
setting Kr = u:()
u u
— cot u = -k => tan u =
kr
Figure 14-20 shows graphs of the left and right sides of the final equality, plotted as
functions of u. The deuteron solution is determined by the value of u at the first
V(r)
rc r
particular model.) The figure indicates the following bounds on the solution:
tt JM{V -E b
< Krn < 77 or — <
Krn — > — as k
in the limit of zero binding energy. This result agrees with the conclusions of the
previous paragraph.
Our model for the nucleon-nucleon interaction has not taken account of the
short-range repulsive core discussed at the end of Section 14-7. We can make way for
this effect if we substitute the modified square well in Figure 14-21 for the original
well in Figure 14-19. The problem is altered by the infinite repulsion near r =
because the modified eigenfunction must vanish everywhere inside the proposed core.
This new property gives the probability distribution a vacant region at its center, in
qualitative agreement with the electron-scattering results mentioned in Section 14-7.
The remainder of the analysis goes through as a straightforward exercise, which we
include as Problem 20 at the end of the chapter. The crude model in Figure 14-21 is
interaction is expected to have the same general appearance with a somewhat deeper
attractive well.
710 Properties and Models of the nucleus
Example
The range-depth relation for zero binding results in the following numerical
expression of Equation (14-31):
2
77
2
{hcf m 2 (197 MeV- fm)
IVo
2
= - —7- = = 102 MeV fm •
2
if E.* = 0.
4 Mc 2
4 939 MeV
We can use this calculation to constrain the values of the square -well parame-
ters. Thus, a typical radius r = 1.6 fm implies a well depth
102 MeV fm 2
V =
(1.6 fm)
•
=
2
—= 40 MeV,
Example
Let us return to Equation (14-29) and carry the solution for rR( r) one step
further. The constants < i and b obey the equality
b = a sin Kr •
e
kr
°,
1 K 2
si
K + • *
2 2
° 1 + cot Kr k
:
The quantity A> is known to lie in the interval ( tt/2, w), and so the positive
root is chosen to give
K K
sin Kr = c k 'n
2 2 2 2
yjK + k Jk + k
The result enables us to rewrite the radial sol ution in terms of a single
multiplicative constant:
/ sin Kr r < r
o
ft 1
R(r) = -< K
> r
1 1
\ U ,
2
+ k
2
i r
o-
This expression is put to use in Problems 18 and 19 at the end of the chapter.
Figure 14-22
20 40 80 80
Magic
20M28) -(82
numbers
illllllllllllllllll
?0
lllllllllllll III llll
40
llllllllllllll
60
lllllllll llllllll
80
llllllll lllllllill
100
llllllllilllil lllllll
120
III
N
independent nucleon experiences the average force of all the other nuclear particles.
We have invoked the exclusion principle in Section 14-6 to argue the merits of this
point of view. The central field and the exclusion principle have been brought
together successfully in the shell theory of atoms. We find the same methods fruitful
again, but to a lesser degree, in the theory of nuclei.
Hints of nuclear shell behavior have already been noted in Figure 14-4. These first
2 8 20 28 50 82 126...
We present the same observations again in Figure 14-22 in the form of distributions of
the stable nuclides, plotted as functions of Z and N. The figureshows the more
obvious accumulations of nuclei at (tin), and at jV = 20 and
Z= 20 (calcium) and 50
82. We note that Ca is doubly magic with Z N
40 = = 20, and we discover that such
4
other doubly magic examples as He (Z = N 2),
= [6
(Z= N= 8), and 208 Pb
(Z 82 and N
= = 26) are species with unusual stability. The binding energies of
1
magic configurations of nucleons tend to be larger than average, enough to stand out
as marked deviations from the smooth graph of E b /A in Figure 14-12. These
fragments of systematic evidence are taken as suggestions of shell closures in nuclei.
We find more compelling evidence of closed-shell behavior when we look at
nucleon separation energies in the vicinity of the magic numbers. These quantities are
712 Properties and Models of the Nucleus
Figure 14-23
Neutron separation energy versus neutron number for several families of isobars. The data are
from the tables of Wapstra and Bos, the source used to construct Figure 14-12.
A
15
En
A = 40 A = 54
10
A = 17
A = 91
(MeV)
A = 140
A = 209
5
Magic
numbers
20) (28
analogous to the ionization energies of atoms. Figure 9-7 has taught us to read the
number as a clear indication of atomic shell
variation of ionization energy with atomic
structure. We present nuclear data of the same sort in Figure 14-23 by plotting the
neutron separation energy E n versus N for several groups of isobars. Only a limited
sample of graphs needs to be shown since the results for the selected values of A are
typical of many different isobar families. We find in every case that E n
varies with N
and drops abruptly when TV changes from N*, a magic number, to N* + 1. The
graphs tell us that the binding of the last neutron is relatively large when the nucleus
contains N* neutrons and becomes unusually small with the addition of one more
neutron. We conclude that a closed shell of neutrons occurs at the neutron number
N*. A parallel study of the proton separation energy leads to a similar conclusion for
protons. We recall again that the ionization energy of atoms demonstrates exactly the
same behavior (somewhat more dramatically) whenever the atomic number corre-
sponds to any of the noble gases. We know from Equation (14-10) that E n
is equal to
a difference of nuclear binding energies:
A -l
E„( X) = Eb ( A X) - E (Ab
X).
A+
n+ A X l
X + y.
This process is also called an (n, y) reaction, where the transformation of the nucleus
is expressed as
A
X(n,y) A + X. l
14-9 Magic Numbers 713
We define the cross section a for this type of collision by analogy with the definition
given in Section 3-4 for the elastic scattering of charged particles. In the case of
neutron capture, we interpret the cross section as the effective area presented by a
single target nucleus to a single beam neutron. This quantity can be used to measure
the probability for the nucleus to capture the neutron. The («,y) cross section
becomes quite large when the neutron number of the target nucleus is one unit less
than a magic number N* and becomes quite small when the target neutron number is
equal to jV*. These observations constitute good evidence for a closed shell of neutrons
at the magic neutron number TV*.
Example
o
y
= 2.6 X 10
6
barns for
l35
Xe(«, y) 136 Xe
and
136
o
y
= 0.26barn for Xe(«, y ) 137 Xe.
Xenon has Z= 54, and so the two situations involve target neutron numbers
equal to N* — 1 and N*, with N* = 82. We can appreciate the enormous size
of the one cross section relative to the other if we recall the definition
-28 2
1 barn = 10 m 2
= (10 fm) .
2
2.6 X 10
6
barns = (16000 fm) 2 and 0.26 barn = (5.1 fm) .
The figures in parentheses may be compared with the scale set by the nuclear
radius. We recall Equation (14-3) to obtain
R = RoA 1
/3 = (l.07fm)(136)' /3 = 5.50 fm
135
for A = 136, and essentially the same A = 135. Obviously,
result for Xe is
136
very hungry for neutrons while Xe is somewhat satiated.
Example
The nuclide
88
,Sr has a magic number of neutrons, = 50. therefore expect N We
the neutron separation energy for this nucleus to be somewhat larger than the
value predicted by the semiempirical mass formula. The actual value of En can
be determined with the aid of Equation (14-10):
£„(
88
Sr) = £ A(
88
Sr) - £ 6(
87
Sr) = (768.47 - 757.35) MeV =11.12 MeV.
(The binding energies are taken from the tables of Wapstra and Bos, the
reference used to construct Figures 14-12 and 14-23.) The prediction from the
774 Properties and Models of the Nucleus
En ( A X) = «, - M 2/3
(l " S
2
)
Z2 / 1 4Z ;
+ ^3/4'
A(A- 1)
at the end of the chapter.) For A = 88 we have £ = 0.9962, and so we get the
following string of terms using the a coefficients quoted in Section 14-5:
£„(
88
Sr) = (15.76 - 2.67 + 0.88 - 5.82 + 1.36) MeV = 9.51 MeV.
The actual value exceeds the predicted value by 1.61 MeV, a substantial 17%
deviation.
The last three sections have prepared us for a theory of nuclei based on the
independent-nucleon approach. We follow the example of the theory of atoms, and we
employ the central-field approximation and the exclusion principle as the two main
pillars of this investigation. The states of a single nucleon are found by adopting a
central-field model to describe the interaction between the nucleon and the other
A — 1 The exclusion principle governs the protons and neutrons
nuclear particles.
separately and controls the occupation of the single-nucleon energy levels for all A
nucleons. These two ingredients of the theory are enough to predict a nuclear shell
structure. We find, however, that the nuclear shell model must include other dynami-
cal considerations if the theory is to reproduce the expected magic numbers.
We assume a central potential energy V(r) for each nucleon in the nucleus so that
we can apply the properties of angular momentum quantization to the states of each
particle. This procedure enables us to label the states by the single-particle quantum
numbers (n(m e m s
). Hence, the eigenf unction for a nucleon in a stationary state has
the familiar form
The radial function R n/f (r)and the associated energy eigenvalue Enf depend on the
choice of model for V(r). We have discussed these aspects of the general central-force
problem in Section 6-7, and we have formulated the radial differential equation for
and E n( in Equation
R ni,(r) (6-43). Let us rewrite this equation as
d2 2M
-j~S rR ^ + ~^^- r
Ku(r)](rR n ,) = (14-32)
and recall that the effective potential energy includes the ^dependent centrifugal term
along with V(r):
2
h
V*(r)- V ^ + ^^^ + 0- (14-33)
(Note that the nucleon mass M is substituted for the reduced mass in both formulas.)
according to the interpretation given in Section 6-7. Thus, the choice of / fixes the
function Veff (r), and the index n then enumerates the ascending energy levels and
counts the nodes of the corresponding radial solutions. We are reminded of these
properties in the example at the end of the section.
Let us emphasize that the original definition of the quantum number n is in use in
this problem. We should not confuse n with the terminology adopted in the theory of
atoms, where the same symbol is used by convention to denote the principal quantum
number. It is clear that the notion of a principal quantum number has no logical
place in the treatment of nucleons.
The model requires the selection of a function V(r) to describe the nuclear central
field. A possible candidate has already been suggested in Figure 14-14. It is possible to
parametrize this potential energy by the formula
V (r)=
v ' T^ZT,
(r-R)/a '
(14-34)
V /
_|_
J g
where V and a do not vary appreciably from one nucleus to the next, while R varies
1/3
as (Observe the similarity between the shape of the potential energy in Figure
/1 .
14-14 and the shape of the nucleon density in Figure 14-8. We appeal to this analogy,
and to Equation (14-2), when we write V(r).) It is more expedient to assume a
square-well approximation for V(r) because the corresponding differential equation
for R n f(r) and E n( admits an exact analytical solution. A possible strategy might be
to start with a preliminary exact solution for a square-well model of the interaction
and then tune in a more realistic numerical solution based on the rounded model of
Equation (14-34). The first stage of this procedure generates a collection of energy
levels resembling those shown in Figure 14-24. Note that the values of En( are
organized according to columns of different ( , as in the presentation of the analogous
atomic problem in Figure 9-4. The second stage of the procedure shifts these energies
to new levels at positions not very far away. Figure 14-25 shows the square-well and
rounded-well levels columns to represent the results of such a method. It is
in parallel
obvious that these must vary with the choice of mass number A. We describe
results
this implementation of the independent-nucleon model as only one possible strategy,
of energy levels in Figure 14-25. We are able to deduce a shell structure of the nucleus
on the basis of this list of numbers and energies.
Let us consider the process of building up the nuclear ground state by taking Z
protons and N neutrons to populate the lowest unoccupied single-nucleon levels. The
716 Properties and Models of the Nucleus
Figure 14-24
Single-nucleon energy levels Enf in a hypothetical square-well model of the nucleon potential
energy.
,\ e= 1 2 3 4 5 6
2m
3p
2/
1/7
3s
2d
1*
'»
1/
2s
1,7
Lp
Is
14- 10 The Nuclear Shell Model in
Figure 14-25
Shell-model levels and degeneracies for a square well and a rounded well. The cumulative total
of protons or neutrons is supposed to reach a magic number at each of the larger energy gaps.
Only the first three encircled predictions agree with the known magic numbers.
nt
I
2g 2g
1 56
26 (138)
if
2/ V 6 112
2/
14 106
\h \h
2? (92)
3s
3s
2d
2 70
2,1
10 68
lg
U'
IS! (58)
2p
2p
1/
1/
14 (34)
Is
2s
2 (20)
\d
1,1
10 18
lp
Lp
Is
Is
exclusion principle permits no more than 2(2/+ 1) occupants in any shell-model level
E nif
for both species of nucleon. This maximum allowed number is given level-by-level,
for protons and for neutrons, according to the list of degeneracies in Figure 14-25.
Another column on the right side of the figure shows a running total of these maximum
occupancies, accumulating upward from the lowest level. We predict a closed shell of
nucleons of either type wherever we encounter a fully occupied level followed by an
appreciable jump to the next higher energy state. The gap between levels implies a
reduced binding energy for the next added nucleon. This same effect is observed in the
building up of the ground states of atoms, where the analogous gaps are found at the
atomic numbers of the noble-gas elements. The nuclear model in the figure predicts a
shell closure when either Z or N reaches 2, 8, and 20, in agreement with the first three
magic numbers, but fails to continue in the proper sequence thereafter. We are forced
to conclude that the nuclear shell model is either incorrect or incomplete in its present
form.
The difficulty cannot be resolved by choosing a different central field. A new type
of additional ingredient is put forward instead in the form of a nuclear spin-orbit
interaction. We recall from Sections 8-9, 9-4, and 9-7 that the spin-orbit coupling in
atoms causes a splitting of the single-particle energy levels E nf for all /# orbital
states. The splitting occurs because the atomic interaction contributes a different
energy shift for the two allowed values of the total angular momentum quantum
number j, as indicated in Figure 8-28. We have expressed this interaction in terms of
the central potential energy V (r)
c
in Equations (9-36):
dV
VSI (atom) = —-yj - —
1
-S
1
(
• L.
2m e
c r dr
We know that the atomic spin-orbit effect has a secure theoretical basis in relativistic
quantum mechanics, and we also know that the energy splitting in atoms is rather
small especially for small values of the atomic number. In contrast, the nuclear
splitting is assumed a priori to be large and also inverted, in the manner shown in Figure
14-26. We ascribe this behavior to a nuclear interaction of the form
a~SL dV
VSL (nucleus) = S • L, (14-35)
r dr
in which the central-field function V(r) appeals along with the orbital and spin
angular momenta of the nucleon. The minus sign accomplishes the required inversion
Figure 14-26
energy level.
; = <?-
n(
i
= e*
1410 The Nuclear Shell Model 719
Figure 14-27
Shell-model levels including the effect of nuclear spin-orbit splitting. The degeneracy is given
by %j + 1 at each level Enf) The
. accumulated population of nucleons corresponds to a magic
number at every one of the larger energy gaps.
£ nf and£„
2g • • • ...
- liu 148
h 12
-2gy2 10 136
(126)
120
14 1 18
104
LOO
10 92
(82)
12 8(i
68
6 64
8 58
K) (50)
2 40
4 38
6 34
(28)
© 18
14
Is
l«y2 2
© 1
Rounded — with — Spin-orbit n(j Degeneracy Cumulative
well coupling total
720 Properties and Models of the Nucleus
of the split levels, and the phenomenological constant a"SL produces the desired
amount of energy splitting.
Figure 14-27 shows how the large inverted splitting influences the sequence of
shell-model levels. The interaction splits a given single-nucleon energy E nf (with
/=/= 0) into a pair of energy levels E nf The
. figure uses the notation nc* to designate
the states with j = t+ ^ and j = {—
\, and assigns the lower state to the larger value
of j. Note that the splitting is largeenough to produce rearrangements in the final array
of energies. An energy level with quantum numbers (n(*j) is comprised of 1j + 1
degenerate states corresponding to the various assignments of m for the given value
of j. The figure lists these degeneracies at all the levels and includes a running total of
nucleon populations similar to the scheme employed in Figure 14-25. In this case the
Example
Figure 14-28 describes a square-well model for the nucleon potential energy
V{r). Unlike the similar model of the deuteron in Figure 14-19, this illustration
admits more than one bound-state energy level. Let us consider only the / =
case so that Vcfr (r) reduces to V(r) in Equation (14-33). Equation (14-32) then
becomes essentially the same as Equation (14-27) in the treatment of the
deuteron. (The reduced mass and the number of levels are the only differences
between the two problems.) Figure 14-28 shows a well large enough to accom-
modate three bound states, with energy levels Eu , E2i , and E 3t (in E nf
notation). The corresponding solutions of Equation (14-32) can be sketched by
following the guidelines established in Section 6-7. An application of the usual
arguments about curvatures and nodes results in the three graphs of rR n/,(r)
shown in the figure. We see that rR u (r) reproduces the shape of our previous
graph for the deuteron in Figure 14-19, and we note that each successive
function has one additional node. In fact, all three of these £= radial solutions
behave as sine functions inside the well and have parametrizations just like
Equation (14-29). We find the two higher energy levels by solving for the values
of the variable u at the second and third intersections of the two graphs in
Figure 14-20. The qualitative results of this example are extended to the /= 1
case in Problem 22 at the end of the chapter. A much larger family of energy
levels E n( can be obtained for each / if the square well is made sufficiently
large. We have assumed such a system of levels in Figure 14-24.
The nuclear shell model makes many other predictions beyond the explanation of the
magic numbers. The model is based on a procedure for assigning angular momentum
quantum numbers, and so the results are expected to include properties pertaining to
14-11 Spins and Moments in the Shell Model 721
Figure 14-28
Square-well model of the nucleon potential energy with three ^=0 bound states.
V(r)
J = L + S.
We extend the same addition of angular momenta over all the A nucleons and express
the nuclear spin vector as
Note that the and spin angular momenta are added according to the jj-coupling
orbital
scheme. We from our discussion of Figure 9-23 that we are supposed to follow
recall
this plan whenever the spin-orbit interaction has a strong effect.
The sum over nucleons in Equation (14-36) behaves like its analogue in atomic
physics and reduces immediately to a smaller number of terms. Each level nc"j in
The pairing effect plays a decisive role in these residual interactions. Two protons
or two neutrons with opposite values of m. in a given subshell have a greater
probability to be found at small separation where the particles can experience a
greater degree of nuclear attraction. The effect produces maximal binding when even
numbers of like nucleons pair off with canceling angular momenta. These paired
+
protons and paired neutrons make a contribution to the total angular momentum
and parity of an unfilled subshell. Thus, the sum over nucleons in Equation (14-36) is
reduced even further in the state of lowest energy, so that only the last unpaired proton
and neutron are left as surviving contributors. This consequence of the pairing effect is
I=J, + J„
This coupling problem follows the rules given in Section 9-8 and results in an
inequality involving the proton and neutron quantum numbers:
\j ll -j n \<i<j p + Jn (14-38)
as in Equations (9-44) and (9-45). These upper and lower bounds constitute a
prediction for the nuclear spin. An accompanying prediction for the parity is given as
(
— \Y'{ — 1)^", the product of the orbital parities for the proton and neutron. All these
expectations are illustrated by specific tests of the shell model in the first example at
the end of the section.
Our description of the shell model is based on a hypothetical treatment of the
central field. We should therefore be prepared for deviations in the actual ordering of
the shell-model energies relative to the levels in Figure 14-27. The exact sequence of
levels may vary with the mass number A and may also differ between the two species
of nucleon. Table 14-1 provides separate lists of shell-model levels for a proton and for
exhibit the expected shell closures at the magic numbers. We use some of the entries in
example below.
this table in the first
The
shell model also makes predictions about the magnetic moments of nuclei. The
limitations of the model can be grasped by concentrating on the case of an odd-A
nucleus in the ground state. We visualize this system in terms of (A — l)/2 pairs of
nucleons with antiparallel angular momenta, plus a single unpaired proton or
neutron. We know from Equations (14-37) that the overall nuclear spin and parity are
14- 11 Spins and Moments in the Shell Model 723
odd-proton odd-neutron
2^7/2 8 162
1*11/2 12 154
3 d 5/2 6 142
2 Sg/ 2 10 136
"13/2 14 126 -^
3 P 1/2 2 112
2/5/2 6 110
^P-S/2 1 104 3^3/2 4 104
2/7/2 8 100 1^9/2 10 100
l"9/2 10 92 2/7/2 8 90
OS l/, 2 2 82 l"ll/2 12 82 -^
2 "3/2 4 80 2^3/2 I 70
\h n/2 12 76 1/2 2 66
2d 5/2 6 64 1^7/2 8 64
1
St/2 8 33 2d 5/2 6 56
U9/2 Ki 50 1^9/2 10 50 -*
2 P\/2 2 40 2 P\/2 2 40
1/5/2 6 38 1/5/2 6 38
32 32
2^3/2 1 2/>.
V 2
1
1/7/2 8 28 1/7/2 8 28 -+
W 3/2 •1 20 Ws/a 1 20 -*
2^1/2 2 16 2^/2 2 16
lrf
5/2 6 1
1^5/2 6 11
l
P\/2 2 8 1/^1/2 2 8 -^
1^3/2 4 6 1^3/2 4 6
\s l/2 2 2 2 2 "m
"1/2
maximum running maximum runnin g
nCj
nt? «/
" 6
occupancy total J occupancy total
determined by the state of the odd nucleon, and we attribute the magnetic dipole
moment of the entire system to this lone unpaired particle. Both orbital and spin parts
of the magneticmoment contribute if the particle is a proton, while only the spin
magnetic moment contributes if the particle is a neutron. The following analysis treats
both possibilities at the same time.
The prediction is immediate when the odd nucleon is in an {= orbital state.
Only the spin contributes in this circumstance and gives
ju, = either \i
p
or \i
n
The revised expression employs the substitution of magneton units — nB — > fi N and
contains the explicit orbital and spin g-factors for the nucleon. in the proton case we
take
gL = 1 and gs = gp ,
St = ° and Ss = Sn-
We then let the measured magnetic dipole moment correspond to the expectation
value of ju.. in the nuclear spin state for which m = The evaluation of (fi.)
i
i.
Vn Si. 3 Ss
- -
00 - + + f(S+ + +
,
- i(i 1) 1) H f
i(i 1) /(*?+ 1)
i + 1 1 2 1
(14-40)
for either type of nucleon in a nucleus with quantum numbers i and /. We relegate
the derivation of this formula to a few detailed remarks at the end of the section.
Equation (14-40) can assume two different forms since the nuclear spin can satisfy
either i= { + \ or = (— \. Let us eliminate £ and examine the resulting depen-
i
Vn Sl 1\ 31
</0 = '" + - )-
i + 1 2 2 4
.Ss l 31
K)
I \
+ i(i+ 1) - i- +
2 4
f*JV
2
i + 1
|(2* + /-1) + |(*+1)
1
\ Ss
= Mw SiJ * ~ I + "
('-'+*), (14-41)
Vn iL
<iO
(
t(i'+ 1) + I 2
'+ - \[i + -
i + 1
+
gs
i(i+ 1)
'
+
s)('"
+
fK
ft* -
2
(2z + 3z) + |(-0
i'+ 1 |
gL\* + (i=*-\). (14-42)
I + 1
14- I I Spins and Moments in the Shell Model 725
Figure 14-29
Magnetic moments versus nuclear spin for odd-proton nuclei. The plotted points fall between
the Schmidt lines.
m
*N
odd-even
j =(+ -i-
.^
6 ^^^93 Nb °
5
^^
^ 51 V °
"5|n°
«Sc°
4
^ 141p r o 209 B| °
27 A |o
"7 Li
°
3
w ^^^^
19po 127|° 139
La°
175
Lu°
2
153 o
7
5As° Eu
31 po
1
39
K°-
-^
1
1 1 1
107
Ag 2 3
/2 % % %
(;-I + *)
2 2/
oo>«
M/v
I
i/3 >
Z
i
1
*/.
for '±±, (14-43)
i + 1 \ 2 2
A',
Vn-
00. for i = /+ i, (14-44)
1
gn
Mv
i'+12
in the two possible cases where the odd nucleon is either a proton or a neutron.
Our formulas yield four classes of results for the magnetic moments of the odd-,4
nuclei. We present the predicted dependence on the nuclear spin in Figures 14-29 and
14-30 for each of the four different sets of circumstances. The predictions lie on curves
called Schmidt lines (after T. Schmidt, an originator of the idea that the moments
might be associated with the properties of a single odd nucleon). Experimental values
of the moments odd-^ nuclei are also plotted in the figures. We note a
for several
tendency for the experimental points to fall between the Schmidt lines; however, we
observe a general failure of the simple theory to give a very good agreement with
experiment. Some comparisons with experiment are discussed in the second example
below.
726 Properties and Models of the Nucleus
Figure 14-30
Magnetic moments versus nuclear spin for odd-neutron nuclei. Almost all plotted points lie
1L
i~\
even-odd
o 33
S
3
l
/2 k
29 Si o 179 Hf o
"Cr° 173 °
Yb 73
49 Ti° Ge°
9
Be° 91
Zr
o 43
Ca°
3 17
He° i =( +
It is clear that the elementary shell model is not entirely satisfactory as a quantum
theory of the nucleus. The model adheres
an independent-particle theory in which
to
a fixed spherically symmetric nuclear core interacts with an independent nucleon. An
improved self-consistent treatment would allow every nucleon to participate collec-
tively in an interdependent determination of the nuclear state. Some aspects of
nucleon interdependence can be included in the independent-particle theory by letting
the interaction with the nucleon deform the spherically symmetric core. The collective
model of the nucleus builds on the shell model and incorporates this added feature of a
deformed spheroidal nuclear core. The improvement injects some of the philosophy of
the liquid-drop model since the spheroidal core has bulk properties like the liquid
droplet. The rotational dynamics of the core enriches the theory with new rotational
degrees of freedom and thereby opens the way for better agreement with experiment,
particularly in the area of magnetic dipole and electric quadrupole moments.
Detail
Equation (8-49) expresses the crucial part of the derivation of (ju.) in the case
of the one-electron atom. Let us transfer the same construction to the magnetic
moment of the odd- A nucleus in Equation (14-39). We introduce the nuclear
spin vector
I = L + S
in place of the total angular momentum J for the atom and write
2
Oi,/ >«<|i •!/,>.
(8-47):
li-I-yUiX + feSML + S)
Ma/ 2 2
[gLL + (g L + g s )S-L + g s S ]
h
8L 8s f T 2 -
g j} + ^ (/ - L* - S2 ) + gsS'<
h
— (I 2
+ L -S2 2
) + -(I 2 - L2 + S 2 )
h
2
Our next move is to insert this result into the expression for (jjl,I ) and take the
expectation value in the state defined by the eigenvalues
I
2
= i(i + l)fi
2
, L 2 = t(t+ \)h
2
,
S 2 = \h 2 , and /, = iti.
0O»0'+ 0»
!
i(i + 1) + /(/+ 1) /r
h { 2
g.s
+ i(i+ 1) - t(/+ 1) + ^
2
m
Example
Let us compare our shell-model predictions with a very small sample of the
known nuclear and parities. The doubly magic species 16
spins and 40 Ca have
+
completely filled proton and neutron subshells and are observed to be nuclei
as predicted. The odd-,4 nuclides 'gO g ^K 20 and '|F I0 contain an unpaired ,
,
nucleon and furnish a more interesting test of the shell model. Table 14-1 tells us
that the odd neutron in "O is in a \d b/2 subshell and that the odd proton in
39
K
is Equations (14-37) give the corresponding i p assignments
in a \d i/2 subshell.
in terms of the quantum numbers j and ( of the odd nucleon. Thus, we expect
+
17
to have i
p = 'i since j '
= -, and ^=2, and we expect
9
K to have i
p = )
*
since j '
= % and t= 2. These conclusions agree with experiment, according to
the data in Appendix A. On the other hand, the nuclide 19 F has an odd proton
in a 1^5/2 subshell, and so the shell-model quantum numbers J = \ and 2 /=
+ +
would imply | A i
p = .
\ assignment is observed instead, again as recorded in
Appendix A. The odd -odd nuclide 'yN 7 has an odd proton and an odd neutron
and therefore involves a different test. We consult Table 14-1 and find that both
of the odd nucleons are assigned to l/> 1/2 subshells. Since £p = <fn = 1, the model
predicts an even parity from the product of the two odd-parity factors. Since
Jt,
= Jn = 2> tne m °del predicts a nuclear spin given as either i = or i' = 1
+
from Equation (14-38). Appendix A tells us that
14
N is a 1 nucleus, in
agreement with these predictions.
728 Properties and Models of the Hucleus
Example
We can easily convert Equations (14-43)and (14-44) into the graphs of Figures
14-29 and 14-30 if we proton and neutron g-factors from
recall the values of the
V-n 2
A table of nuclear data lists the experimental value as — 1.893, a result quite
39
close to our prediction. In the case of K. we have an odd-proton nucleus whose
quantum numbers c° = 2 and i = A
, obey the relation i = t'— ~2 . The lower
version of Equation (14-43) predicts
u it 3 eh \ 3
1
i+ —2 = -(3 - 2.793) = 0.124.
fi N i + 1 \ 2 ) 5
The listed experimental value 0.391 is somewhat further removed from our
prediction in this instance. We can see the positions of these nuclides with respect
to the Schmidt lines by inspecting the relevant portions of the two figures.
Models are the means of describing the complexities of the nuclear force, in the
interaction of two nucleons and in the binding of many nucleons. We know that the
force between two nucleons is not purely central and is not independent of spin.
However, the two-nucleon problem is not as complex as we might expect, since the
nuclear interactions are known from experiment to be the same for pp, pn, and nn
pairs of nucleons. Evidently, the nuclear force does not depend on the charge of the
interacting particles. This property of charge independence represents a new type of
symmetry principle and conservation law in nuclear physics. Our interest in charge
independence is focused particularly on the effects of the symmetry in the primitive
two-nucleon system. We are also concerned with the manifestations of the symmetry
in the structure of complex nuclei. Our main objective is to introduce charge
independence as a simplifying ingredient in nuclear physics.
The pp, pn, and nn interactions are said to be identical only with regard to the
nuclear characteristics of the interacting particles. This assertion presumes that the
obviously charge-dependent electromagnetic interactions have already been taken into
account wherever these effects may arise. Thus, the proposed nuclear symmetry
defines a relationship between the proton and the neutron and allows a substitution of
the one for the other as far as the nuclear force is concerned. Of course, the two species
of nucleon are not identical in every respect. The particles are distinguished by such
electromagnetic properties as charge and magnetic moment so that their electro-
magnetic interactions are rather different. Hence, the symmetry principle and the
conservation law associated with charge independence are supposed to hold in
circumstances where electromagnetic effects can be neglected relative to the strong
14-12 Charge Independence and Isospin Symmetry 729
nuclear force. This exact symmetry breaks down and becomes a good approximate
symmetry in the presence of electromagnetism. It is reasonable to regard the small
difference in mass between the neutron and the proton as a measure of the deviation
from exact symmetry. It is then a matter of conjecture that the difference in mass
might be due solely to electromagnetic effects.
Charge independence has played a part in the history of nuclear physics since 1932,
the year of the neutron. The concept originated with Heisenberg and E. U. Condon,
who proposed independently that the proton and the neutron should be treated as
differently charged quantum states of a single generic nucleon. They viewed the
charge of the nucleon as a mere label with no meaning as a physical parameter in
the absence of electromagnetic forces. Their idea was inspired by the near equality of
the nucleon masses and was reinforced by the direct observation of charge indepen-
dence in experiments on pp and np scattering. The notion of a nuclear symmetry was
implemented by powerful mathematical techniques in the subsequent work of E. P.
Wigner.
Let us begin to appreciate the symmetry between protons and neutrons by
considering pairs of mirror nuclides related by the interchange of Z and N. We recall
from Section 14-5 that the mass formula predicts the same binding energy for these
isobars if the distinguishing effects of Coulomb repulsion are disregarded. We there-
fore expect the interchange Z «-» N
have no influence on the level of the nuclear
to
ground state whenever the electromagnetic interactions can be ignored. The mirror
nuclides have the remarkable property that the approximate equality of energies
persists level-by-level up two related systems. An excellent
into the excited states of the
example is 4 and
provided by the pair of nuclides]Be 3 in Figure 14-31. The two
,Li
level schemes exhibit parallel patterns of states with the same nuclear spins and
parities, and with much the same excitation energies above the respective ground
states. We
can argue that the Coulomb effect elevates all the levels of the isobar with
the larger of the two values of Z. We recognize, however, that the mirror symmetry of
energy levels tests only the equality of the pp and nn forces. The comparison gives no
information about the relative strength of the pn force because the number of pn
bonds is exactly the same in the two isobars. This finding establishes a property of the
nuclear force known as charge symmetry. Since the equality of forces is tested in only
two of the three possible combinations, the symmetry under the interchange Z <-» N is
less comprehensive than the full symmetry associated with charge independence.
We can compare all three forces if we select a family of nuclides in which the pp,
pn, and nn bonds occur in varying numbers. This opportunity is offered by any
collection of isobars with an even mass number A. We illustrate the arguments in
Figure 14-32 by drawing sketches of the internucleon bonds in two proton-neutron
systems with odd and even values of A. Figure 14-33 presents physical evidence for
the equality of the three interactions in the form of energy levels for the even-A isobars
18
0, 18 F, and 18 Ne. The parallel columns of levels contain states of each isobar for
which the i p quantum numbers are the same and the corresponding excitation
energies are very similar. Nuclear states with this property are known as isobanc
Figure 14-31
7 7
Energy levels and i* assignments for the mirror nuclides Li and Be. The excitation energies
are given in MeV. The Be ground state is unstable and undergoes a /8 transition to the ground
7
state of Li, accompanied by an energy release of 0.86 MeV.
3
10 _ /2
3
/2
-
10
'Hh
Figure 14-32
Systems of isobars for testing charge symmetry and charge independence. Only the pp and nn
forces can be compared in the odd-/l nuclides. The pn force can be included in the comparison
when A is even because the number of pn bonds does not remain fixed for the different nuclei.
2 JV= 1 Z= 3
PP PP 1
PP PP 1 PP 3
Bonds pn ? pn 2 pn 3 pn 4 pn 3
nn 1 nn nn 3 nn 1 nn
14-12 Charge Independence and Isospin Symmetry 731
Figure 14-33
Energy levels assignments for isobars with A = 18. The three level schemes contain
and i
p
threefold families of isobaric analogue states. Certain additional levels also occur as indicated in
l8
F. Energies of excitation are quoted in MeV. Two of the three ground states are unstable so
«Nt
18 F
18Q
between these two rather different physical systems. To clarify the analogy let us
assume for the moment that the isobaric analogue states have exactly the same
energies. We might suppose that this coincidence is accomplished by the removal of
electromagnetic effects. The hypothesis implies sets of degenerate states with common i
p
situation where the particle has spin, the energy may vary with j but not with the
z-component quantum number m r A multiplet of 2j + 1 degenerate states results for
each j, corresponding to a set of quantized orientations of the angular momentum
vector J. We know that the energy cannot depend on these orientations because the
rotational symmetry of the central field precludes the existence of a preferred direction
for the choice of z axis. The introduction of a constant magnetic field then fixes such
.
an axis in space and causes an energy splitting among the 2j + 1 states. This lifting of
the degeneracy is due to the breaking of rotational symmetry by the application of the
external field.
electromagnetic properties of the nucleon doublet are different for isospin up (the
proton) and isospin down (the neutron).
Let us draw the concepts in the last two paragraphs together and formulate isospin
in quantum mechanical language. We ensure the desired properties of the isospin
vector by imposing quantization rules identical to those enjoyed by the angular
momentum J (or L, or S). Thus, the quantum behavior of the isospin T is expressed in
terms of the quantities T2 and T. by the following eigenvalue conditions:
and
Note that the choice of / may be integral or half-integral and that T, has 2t + 1
quantized values for each choice. (No h factors are used in Equations (14-45) and
(14-46) since T is an abstract dimensionless quantity with no classical interpretation as
an angular momentum.)
Each allowed value of the new isospin quantum number t defines a possible isospin
multiplet consisting of 2t + 1 isobaric members. The nucleon doublet has two states
14-12 Charge Independence and Isospin Symmetry 733
with isospin up and down, so that the relevant isospin quantum number must be / = r,
Exactly the same isospin assignments are adopted level-by-level for the mirror
7 7
nuclides Li and Be in Figure 14-31.
We assign a T, eigenvalue to a particular state of a given nuclide zX N according
to the general formula
Z- N
(14-47)
The formalism then implies that the assigned system has an isospin quantum number
/ and belongs to a multiplet of 2t + 1 isobar states. The power of isospin symmetry
lies in the ability to predict these nuclear systems as isobaric analogue states. An exact
symmetry would imply an exact equality of energy levels for the predicted isobars.
Figure 14-33 shows an array of t = 1 multiplets, or isospin triplets, in which the T,
eigenvalues are
- 1 in '10,0, in
l
|F9 , and + 1 in |«Ne 8 .
We have already noted that 18 F also contains a number of levels without any
+
counterparts in the other two isobars. The figure shows the i p = 1 ground state of
18
F as one example of this special class. These slates occur only for T, = and must
therefore be isospin singlets with isospin quantum number / = 0. The formalism
allows for the existence of singlet energy levels when A is even and predicts that such
levels should stand alone, unrelated in energy to any other isobars.
We have considered the substitution symmetry of protons and neutrons earlier in
this section, and we have associated the interchange of Z and with the charge N
symmetry of mirror nuclides. Equation (14-47) tells us that this lesser symmetry
operation is just a reversal of the sign of Tz . The rotational aspects of charge
independence embody a more powerful use of the isospin degree of freedom. Let us
turn to the abstract isospin space for an interpretation of this property and consider an
isospin multiplet of nuclear states with isospin quantum number t. The multiplet has
2/ + 1 isobaric analogue states, corresponding to 2t + 1 orientations of a vector T
with length \jt{t + 1 ) in the abstract space. Isospin symmetry allows us to rotate the
vector, and thereby pass from one isobar state to another, without affecting the energy
level. This picture has an approximate validity when the symmetry is not exact.
Let us return to the problem of two nucleons and use our knowledge of isospin to
make some detailed observations. The deuteron is one such system, but so are the
nucleonic configurations pp and nn. Let us also keep in mind that all these systems
have eigenfunctions describing the space and spin states of the two particles. If we
concentrate on isospin properties first and refer to Equation (14-47), we observe the
following possibilities for the value of Tz :
pp has Tz = 1 ,
pn has Tz = 0, and nn has T, = - 1
The total isospin vector for any two-nucleon configuration has the form
T= T, + T2 ,
.
where T, and T, are isospin-^ vectors. The analogous addition of spin-^ vectors
results in the quantized vector sums
| + \ = and 1
We apply the same rules to isospin and conclude that all two-nucleon states must have
either / = or / = 1 It is obvious that the pp and nn systems belong to / = 1 triplets.
.
V = -j=(pn-np). (14-48)
v
f ,
= nn. (14-49)
(The notation pn/ y2 means there is probability for finding nucleons 1 and 2 to be -,
p and n, respectively.) Note that the triplet states are labeled by their Tz eigenvalues.
We recognize these four expressions as exact analogues of the states of total spin s =
and s = 1, obtained for two spin- r; particles in Section 9-6. Recall that the corre-
A
sponding antisymmetric and symmetric spin eigenf unctions have been listed as \
and xi '" Equations (9-25) and (9-26).
s
The exchange properties of £
/!
and £' can be used to formulate a generalized
version of the Pauli principle. Equation (9-22) instructs us to write eigenfunctions for
two identical fermions as
VX A or f'x
fyY or
S
+ X S
A S
or ^Vf' or fW,
in which the space, spin, and isospin factors are combined to satisfy the required
overall antisymmetry under the interchange of the two fermions.
The deuteron provides an interesting application of the generalized Pauli principle.
We know from Section 14-7 that this pn system is a superposition of iS and D l x
with regard to the interchange of the two spatial variables. The triplet spin state is
likewise symmetric with respect to the two spin orientations. Hence, we describe the
space and spin degrees of freedom by the eigenf unctions \p s and x'S and we select the ,
singlet £ for the isospin description in order to meet the requirement of generalized
antisymmetry. We therefore conclude that the deuteron has isospin quantum number
t = 0. This result is as expected since the alternative t = 1 assignment implies the
existence of impossible isobaric analogues in the systems pp and nn. (Recall that pp
and nn cannot be symmetric in space and spin because of the Pauli principle in its
original form.)
Isospin gives us a new symmetry to use in the classification of quantum systems.
Complexities are simplified and regularities are revealed whenever such an ingredient
is introduced in the analysis of complicated phenomena. The theory of nuclear
structure spans a class of problems in which isospin symmetry plays an influential role.
Isospin principles are also employed extensively, along with other new types of
symmetry, in the phenomenology of elementary particles.
Example
Example
X, + X, ^x 3
+ x 4
.
We let the four nuclides have isospin vectors T, to T 4 , and we write the
conservation of the total isospin as
T= T, + T2 = T3 + T4 .
This relation among quantized isospin vectors acts as a constraint on the states
of the particles in the reaction. Consider, for instance, the (a', a) process
'H+ 12 C- 10 B+ He 4
in which the deuteron, the ground-state carbon nucleus, and the a particle
participate as t = isospin singlets. Isospin symmetry allows the reaction to
10 10
produce only the / = states in B. The ground state of B is among these
736 Properties and Models of the Nucleus
allowed possibilities. Excited states with / = 1 also exist (as isobaric analogues
l0 10 10
with Be and C), but all these B states are forbidden in this reaction because
of the conservation of isospin.
Problems
1. Suppose that the original yray interpretation holds for the experiment in Figure 14-1 and
that the Compton scattering of y rays causes protons to be ejected from the paraffin
detector. Obtain a formula relating the incident y energy and the kinetic energy K of the
observed proton for the case where K is maximum. What y energy is predicted if K is as
large as 5 MeV?
2. Calculate the de Broglie wavelength for an electron with kinetic energy 183 MeV. What
electron beam energy corresponds to a 1 fm de Broglie wavelength?
3. Consult Figure 14-8 and prove that the indicated surface thickness t satisfies the relation
— t
= 4 In 3,
z \
where 2, is a parameter in the formula for the nuclear charge density. Use the data in
14-9 to estimate a universal value for the density of nucleons in the interior of the
nucleus.
4 Data from the nuclear chart for the three stable neon isotopes 2 "Ne,
JI
Ne, and "Ne are:
Compute the atomic weight of neon from these figures and compare the result with the
5. Three doublets are seen in the mass spectrum of a sample containing hydrogen, deu-
terium, carbon, oxygen, and methane. The splittings observed in the M/e ratios of the
ions are:
+-2
0.001548 uA for ('H'H) H+
0.042306 u/e for (
2
HH 2 2
H) - 12 C ++
+
(Zef
V=
3
—
5 4w £f) /?
for the Coulomb energy stored in a uniform solid sphere of charge Ze and radius R.
(Determine the work done to construct the sphere by moving infinitesimal spherical shells
of charge radially inward from infinity.)
Problems 737
8. Use the semiempirical mass formula to compute the atomic mass of each of the isotopes
mentioned in Problem 6. Compare the results with the corresponding entries in Appendix
A.
formula. Take M("C) to be 11.011433 u and consult Appendix A for the other atomic
masses.
10. Show that the semiempirical mass formula leads to the following prediction for the
neutron separation energy:
4Z-
£„("X) = a, - M V3 (1 - £
2
) + a,j^i - -
l
1 1
-
A(A- 1)
"5
l/i
where ( = [(A — \)/A] . The result takes this form when ^X is an even-even nuclide.
11. Use the semiempirical mass formula to derive the following leading-order relation
involving nuclear binding energies:
-[E h(
t
^X)+E (<>X)} h
- £A ("X) = -^ + ....
Take Z even and A odd as conditions in the derivation, and use binomial expansions such
as
{a-
V i)
_1/3
= —-fi --) = nL( 1 + J_ + ..
A l/i \ A) A'^y 3A
13. Consider the Fermi gas model of the nucleus and assume that the Coulomb effects are
approximated inside the nuclear volume by the constant potential energy aZ" / x
' . Refer
to Figure 14-15 and let the potential energy well for the proton gas be elevated by this
amount relative to the depth of the neutron well. Deduce the following relation among the
numbers of nucleons,
N 1 3
= Z 2/J + bZ 2A /\ [
and identify the constant b in terms of the given parameters of the model.
14. Consider the np capture reaction n + p —» d + y in which both of the particles in the
initial state are essentially at rest. Obtain a formula for the binding energy of the deuteron
in terms of the energy of the y ray.
15. Points (x, y, z) on the surface of the indicated ellipsoid of revolution satisfy the equation
x
2.2
y -I- z
2
a b
Choose a suitable volume element and show that the volume of the figure is \ma~b. Let
the ellipsoid contain a uniform distribution of total charge Ze, and derive the formula
Q_= jZe(b -
2 2
a )
17. Prove that the electric quadrupole moment (() ) vanishes in a two-nucleon state described
by an ( = wave function.
18. The square-well model of the deuteron produces an eigenfunction of the form
sin Kr r < r
4>
= K k(,
/Air I
r )
r > rn
A" + k
2
2 k
1 + krn
The parameters r
Q,
k, and K are identified in Section 14-8.
19. Derive a formula for the expectation value of r in the deuteron state using the results of
the model in Problem 18, and calculate the value of (r) for r() = 1.6 fm.
20. The figure defines the parameters of a square-well model for the deuteron in which the
potential energy includes an infinitely repulsive core. Determine the f— solution of the
differential equation for rR(r) with energy — Eb in each of the three intervals (0, r ),
(r , r(1 ), and (ru ,oo). Use the continuity conditions to derive a relation among V Eb rn () , ,
and r() . Obtain a suitably continuous final form for rli(r) containing a single unknown
normalizing constant.
rc ' •
E,
^b
V(r)
-V
90
21. Make a prediction of the neutron separation energy for the magic-number nuclide Zr
based on the semiempirical mass formula. Compare the result with the value determined
Problems 739
90
from the measured atomic masses. Use M( Rq Zr) = 88.908900 u and take A/( Zr) from
Appendix A.
22. Assume a square-well potential energy as a central-field model in an independent-nucleon
theory of the nucleus. Draw a figure to represent the effective potential energy for the
Sketch graphs of the corresponding radial solutions rR lp (r), rR 2 Ar), and rR ifl
(r), and
compare with the /= results in the example accompanying Figure 14-28.
23. The next magic number for neutrons after N = 126 is supposed to be jV = 184. Show
how the sequence of levels in Figure 14-27 may be continued to generate this prediction.
24. Deduce the predictions of the shell model for the nuclear spins and parities of the odd-A
l:, J
nuclides N, Na, ' Al, and Mo. Are the results in agreement with the listings in
Appendix A?
25. What does the shell model predict regarding the nuclear spins and parities of the odd-odd
h 10 50
nuclides Li, B, and V? Do the predictions agree with the i
1'
assignments listed in
Appendix A?
26. Calculate the values predicted by the shell model for the magnetic moments of the odd-A
nuclides in Problem 24. Compare the results with experiment, using a table of nuclear
data.
27. Consult a table of nuclear data to find evidence for isospin symmetry among the nuclides
with mass number A = 10.
28. Examine the tabulated energy levels of the A = 17 system of isobars and find evidence for
NUCLEAR
PROCESSES
-
ofan atom and is known as y decay. (Of course, the radiation from a nucleus is much
more energetic. The nuclear process results in y rays characteristic of the emitting
species, with typical photon energies in the MeV range.) Radiation from nuclei may
also take the form of ejected fragments or particles. These phenomena are called a
decay or (1 decay, depending on the particular form of emission. The processes of a, ft,
and y decay represent the three specific varieties of radioactivity. Each type of emission
is associated with a certain kind of nuclear instability, and each is accompanied by a
particular sort of transition between the energy states of the nuclear system.
Radioactivity involves the spontaneous disintegration of an unstable atomic nucleus.
The spontaneity of the process implies that the transformation of the system of
nucleons takes place in an isolated nucleus, with no stimulus from any external source
of energy. Radioactive behavior is natural if the emissions are observed for a sample of
nuclides occurring in nature and is artificial if the radiation is induced in the sample
by some sort of external bombardment. Natural radionuclides are generally long-lived,
although species with short half-lives are also found in nature as decay products of
commonly occurring decay chains.
The history of nuclear physics begins with the discovery of radioactivity. Much of
our present understanding of nuclear structure comes from this method of viewing the
nucleus. We devote several sections of the chapter to a full discussion of the various
aspects of nuclear radiation. Historical remarks and general definitions are considered
first, and then the three processes of a, /?, and y decay are taken up in turn.
Nuclear transformations are also observed in the collisions of particles with nuclear
targets. The resulting nuclear reactions are additional sources of information about the
states of the nucleus. We examine the principles behind these investigations later in
740
151 Radioactivity 741
the chapter. Finally, we consider the fission and fusion of nuclei as transformations of
special interest for the generation of nuclear energy.
These phenomena constitute a modern-day realization of the alchemists' goal. The
transmutation of the elements is accomplished as desired, by a change of species
within the nucleus of the atom.
15-1 Radioactivity
The first observations of nuclear disintegration were made by accident in the x-ray
experiments of A. H. Becquerel in 1896. X rays had just been discovered in the
previous year, and Becquerel's laboratory was actively involved in the investigation of
their properties. Incident x rays were known to cause fluorescent radiation in atoms,
and so a reversal of roles fluorescent stimulation was suggested
was proposed whereby
as a means of producing x rays. Becquerel selected a fluorescent compound of
uranium and indeed found that his sample emitted penetrating radiation. However,
he also found that the uranium compound produced the same energetic emissions with
no external stimulation. It was apparent that this new type of spontaneous radiation
was peculiar to the uranium in the sample and was altogether different from the
proposed atomic x rays.
The nature of the radiation was studied by Rutherford and Becquerel, and also by
Curie and his remarkable colleague M. S. Curie. These investigators established the
existence of nuclear radiation in its three forms a, ft, and y. They demonstrated the
deflection of a and /? rays by a magnetic field and concluded that these emissions had
to be charged particles. The a particles turned out to be helium nuclei, and the
negative /? particles proved to be electrons with properties just like Thomson's cathode
rays.
Marie Curie
742 Nuclear Processes
Figure 15-1
8u
Radioactive decay scheme of Br. The nuclear energy levels are connected by /$ and y
+
transitions. The amounts of released energy are given in MeV. The designation y8 actually
represents two distinct processes, as described in Section 15-4.
17.6 m
The Curies devoted much of their collaborative work to the identification of the
radioactive elements polonium and radium. Their achievements were acknowledged
decades later when the element curium was named after them.
The first investigations of radioactivity were conducted with natural sources
bearing uranium, thorium, and other heavy elements. Artificially induced radioactiv-
ity was demonstrated for the first time in 1934 by J.-F. Joliot and I. Joliot-Curie. They
27
used a particles to bombard the nucleus A1 and initiated a nuclear reaction
30
producing the radionuclide P. The observation of positron emission from this
nucleus showed that radioactivity was not restricted just to the heavy elements.
The chronology of interesting events in nuclear physics should be noted. A
four-decade period began when natural radioactivity was first observed, prior to the
discovery of the nucleus. The neutron came later, followed by artificial radioactivity.
Finally, the period of development reached its dramatic climax with the disclosure of
nuclear fission.
half-lives of the energy levels and the energy released in the transitions. The figure
employs a useful convention whereby the initial and final nuclear levels appear in
separate parallel columns and exhibit transitions to the left or right depending on the
sign of the charge of the emitted particle. Note that y transitions stay within their own
column in this scheme.
The total charge and the total number of nucleons must be conserved in any
radioactive decay. We know that an a particle is a 4 He nucleus, and so we express the
a decay X —» Y + a in detail as
+
noting that the charge and mass numbers balance explicitly. The /? and fi decays
151 Radioactivity 743
Figure 15-2
^
M&
involve electron emission and positron emission, respectively. We learn later on that a
new type of neutral particle must also be emitted in each of these processes. It would
be premature to divulge this information now, so let us temporarily refer to the two
types of ($ decay as
The emitted /? systems do not contain any nucleons; consequently, the mass number
A remains unchanged in these transitions. Hence, all /? decays are identified as
transformations between nuclear isobars, a property illustrated by the A = 80 family
of nuclides in Figure 15-1.
Each of these nuclear transformations proceeds from a parent nucleus X to a
daughter nucleus Y. Figure 15-2 shows how we can represent such decays with the aid
of a chart of the nuclides. If the Z versus N system of axes is employed as in Figure
14-4, then the three different nuclear transitions are described by the following
transformations of coordinates on the chart:
Parent and daughter are shown executing these transitions in Figure 15-2.
We can use this method of presenting a and /? decays to describe the radioactivity
232 238
of such naturally occurring sources as Th or U. These heavy radionuclides have
the property that their half-lives are of the same order as the age of the Earth.
10 232 9
(Appendix A gives values of the half-life in the range 10 years for Th and 10
238
years for U.) Each unstable species displays its radioactive behavior in a decay series
where the first nuclear transition in the chain is an a decay with a very long half-life.
The series continues through a succession of a and /? processes from one unstable
nucleus to another until the final stable product of the chain is reached. Figure 15-3
744 Nuclear Processes
Figure 15-3
" 38
Decay series for U. Branchings of a and ft decay modes occur at several locations along
JiH 206
the chain between U and Pb.
Z
238
A = 206
energy release. These characteristics constitute an observable signature for the corre-
sponding radionuclide. The half-life is especially useful because this property gives the
radioactive source a timing mechanism to be used for chronological purposes. A
commonly employed application is radiocarbon dating, which establishes the age of a
carbon-bearing material by comparison with the 5730 year half-life of the /^-active
15-2 The Exponential Decay Law 745
14
isotope C. The technique is based on the fact that all living organisms continually
exchange CO, in the Earth's environment, where a minute fraction of the carbon
14
nuclei are of the C variety. These unstable nuclides are formed when neutrons from
cosmic-ray interactions are absorbed by the nuclei of nitrogen atoms in the atmo-
14
sphere. The processes of C production and decay are supposed to be at equilibrium
14
in the carbon-dating technique. It is known that the amount of radioactive C
relative to ordinary U C in living matter is of order 10" 12
. It is believed that this value
has remained essentially constant for over 30,000 years. Carbon decay in a specimen
of plant life must therefore occur at a corresponding constant rate, which turns out to
be about one disintegration per second 4 g of carbon. The ingestion of carbon
in every
ceases when a plant dies, and so the amount of 14 C in the specimen decays thereafter
at a rate that decreases by half every 5730 years. The time of death (or the age of the
specimen) is then determinable from a measurement of the rate of decay for a known
mass of carbon. The development of this remarkable technique is attributed to W. F.
Libby. A simple numerical illustration of the method is furnished by the following
application.
Example
14
The production of C in the atmosphere proceeds according to the (/?, p)
reaction
n +' 4 N ^ C 14
-r'H.
14
C -* 14
N + 0-.
Let us select a 64 g charcoal sample for carbon dating and suppose that /?
radiation is observed at a rate of 2 disintegrations per second. The decay rate for
a living specimen with the same mass of carbon would be
/ 1 disintegration/s \
(64 g) = 16 disintegrations/s,
using the constant rate quoted in the text. (The derivation of this rate is left to
is
3
2 disintegrations/s 1 / 1 \
16 disintegrations/s 8 \ 2/
l4
The elapsed time since death must therefore be equal to three C half-lives:
3(5.73 X 10
3
y) = 1.72 X 10
4
y.
species, and the half-life is a measure of the lifetime of the nuclide. However,
746 Nuclear Processes
the lifetime is not a statement of the exact time of decay for a single unstable nucleus
since the decay occurs as a random discrete event. Such an act of chance is predictable
only in terms of a probability for the occurrence of a certain decay transition within a
given interval of time. It follows that the transition probability per unit time
determines the radioactive behavior of a sample containing many nuclei. Thus, the
instability of the nucleus describes a statistical phenomenon, and the half-life of the
denotes the emitted radiation. This decay rate refers to the radioactive behavior of a
single nucleus in the probabilistic sense described above. The decay constant is a
fundamental property of the nucleus because, in principle, the quantity is predictable
from a quantum theory of the decay transition.
We can determine y from experiment if we observe the radiation from a source
containing many nuclei of the species X. We proceed by defining N(t) as the number
of radioactive nuclei in the sample at time /. This number is presumed to be large so
that N can be treated as a continuous variable. We then consider the change dN in
that number after an infinitesimal time interval dt, and we identify the positive
quantity — dN as the corresponding number of decays. The ratio — dN/N represents
the probability of decay in terms of the number of decays occurring in the interval dt
and the number of decaying nuclei present at time t. We employ y, the decay
probability per unit time, to express the probability in the interval between t and
/ + dt:
dN
= ydt. (15-1)
N "
V '
dN
—
dt
= yN, (15-2)
a relation deduced first on empirical grounds by Rutherford and Soddy as the original
nuclear transformation law. Since y does not depend on t, it is straightforward to
integrate Equation (15-1) and thereby determine N(t):
-
rN(t)dN
I
JNo
—n = Jq
n
/ ydt => -In
N(t)
n
= yt,
taking N (t
to be the number of decaying nuclei at t = 0. We then solve for the desired
quantity to get
N(t)=N e- yl
. (15-3)
- ("dNU) he-y'dt
15-2 The Exponential Decay Law 747
Figure 15-4
Decay curve corresponding to the exponential decay law N(t) = Nn e Y The indicated mean '.
life t and half-life T, 2 are related to the quantum mechanical decay constant y.
,
N(t)
This lifetime is called the mean life t. The integrations in the definition lead to the
simple result
t= -, (15-4)
N{t) = N e~' /T .
'15-51
The half-life t, i
,., is a different lifetime, already defined as the time required for the
sample to decay to half of its initial population. Appendix A includes a list of values of
t 1/2 for a large number of radionuclides. We show the times t and t 1/2 on the graph
in Figure 15-4, and we observe that the two parameters satisfy the equations
X, a;,
N(r) = and jV(t i/2 ) =
We obtain these results directly from the determination of t in Equation (15-5) and
from the definition of t, The second expression leads
,.,. to a simple relation between
the two lifetimes and the decay constant:
In 2
T 1/2 = Tln2 (15-6)
Y
Equations (15-4) and (15-6) are interesting because they connect experimental and
theoretical quantities, the lifetime of a sample of nuclei on the one hand and the
quantum mechanical decay rate for a single nucleus on the other.
We can measure the activity of a source by counting the rate of emitted charged
particles in a detector. Each observed particle, or count, represents a single nuclear
applied electric field. A particle is counted when the gas atoms in the chamber are
ionized by the passage of the particle and the ions are collected in the applied field.
The rate of observed counts is the same as the rate of decay of the sample —dN/dt,
except for a possible multiplying correction for the known efficiency of the detector.
This detection efficiency may include at least two factors, the fraction of the total solid
angle subtended by the detector at the source of radiation and the probability for the
particle to produce a signal in the counting circuit.
The decay constant is easily determined by such an experiment. We use the fact
that the measured activity depends on the time:
dN ( dN\
where (
— dN/dt ) denotes the activity at / = 0. If we take the logarithm on both sides
of the equality we get
/ dN\ I dN\
Hence, we can deduce the decay constant y if we plot the counting rate versus t on a
semilogarithmic graph and take the slope of the best straight line through the
experimental data.
The rate of decay of a radioactive source is usually quoted in curies (symbol Ci),
defined by the unit
1 Ci = 37 X 9
10 disintegrations/s.
This large unit of activity is approximately equal to the observed disintegration rate
22(
for a 'Ra source with a mass of 1 g.
Figure 15-5
stable
15-2 The Exponential Decay Law 749
Figure 15-6
Solutions of the rate equations for the populations of the three levels in Figure 15-5. The results
dN x
dN2
—^ = Yl2#l - Y23^2>
y23 N 2
. 15-7)
dt
The first rate has the form of Equation (15-2) so that the population N^t) must
behave according to the basic exponential decay law. We note that the rates sum to
zero, and we conclude that the solutions of the three coupled differential equations
must always obey the constraint
jV, + N 2
+ N 3
= N , a constant.
Figure 15-6 shows a set of qualitative graphs for the special class of solutions in which
the initial populations have the values JV,(0) = N N (0)
,
2
= 0, and jV3 (0) = 0. An
even more specialized case is treated analytically in one of the exercises below.
Example
226
The activity of 1 g of Ra is supposed to be approximately 1 Ci. To verify this
we first convert the 1
g sample into ^ mole, and we then find the number of
nuclei from Avogadro's number:
N = (^ mole)(6.02 X 10
23
/mole) = 2.66 X 10
21
.
226
Appendix A tells us that Ra has a half-life of 1600 years. The rate of decay is
750 Huclear Processes
dN N
= yN = In 2
dt t 1/2
(ln2)(2.66 X 10-M )
10
,u „-i
3.65 X 10 s
(1600 y)(365 d/y)(24 h/d)(3600 s/h)
Example
We can use the exponential decay law in conjunction with the quoted half-lives
and abundances of
235
U and 238
U in order to estimate the age of the Earth. Let
us denote these two isotopes as X and X, and express their populations
according to Equation (15-5):
N(t) = N e
- ' /f and N{t) = N e~
l/
\
N(t)
N(t)
\n[N(t)/N(t)] \n[N(t)/N(t)]
l/f- 1/f ln2(l/f 1/2 - 1/t 1/2 )'
using Equation (15-6) to get the final result. The value of / refers to the present
time when the two isotopic abundances are known to be
We consult Appendix A to find the two half-lives and use all these data to make
our estimate of the age of the Earth:
ln(99.27/0.72)
9
X 10
9
y = 5.9 X 10 y.
(In 2)( 1/0.704 - 1/4.468)
Example
The two-step cascade in Figure 15-5 is somewhat easier to analyze for the
behavior of X 2
when X, is very long-lived. We assume y I2 «: y23 and find that
the solution for the X, population becomes
NM = N l0 e-**' -+ l0
dN2
= R l2 - y23 N2 ,
dt
JX R V2 -y,,N2 (t)
N,(i)
dt — 1
In
~ ^12 ~ 723^20
./;
N?0 ^12 Y 2 3^2 •'m
Y2 3
taking N 2i)
to be the initial number of X ;
. nuclei. The solution for N 2(
t ) is found
from this result in two steps:
Rv<-yMt) 723
R - y N20 v2 23
and so
N (t)
2
= —
R„
Y23
+ \N20
\
-—e A',
Y23
Y12
^10+
Y23
/
I
Mo" —^10
Yi'
Y23
l«
Y23'
Our conclusion holds in the interval < / «c 1/y 12 Note that M, may either
.
decay or grow during this interval, depending on the relative size of the ratios
V^io and Yi 2 /Y 23 -
15-3 a Decay
Some of the earliest observations in nuclear physics involved the participation of the a
particle. The emission of a radiation gave evidence of the instability of the nucleus in
Becquerel's discovery of radioactivity. The scattering of a particles gave proof of the
existence of the nucleus in Rutherford's experiments. These first insights into the
nucleus were witnessed with the aid of particles emitted in a decay.
The emission of a particles is generally associated with the heavier unstable nuclei.
An inspection of the nuclear chart reveals that a activity is a main mode of instability
for the isotopes of elements beyond lead at Z= 82 and is only rarely observed in the
lighter species. The emitted particle is a nuclear fragment four times as massive as the
proton. The emission process is energetically favorable for a particles, compared to
752 Nuclear Processes
Figure 15-7
a Source Recorder
other possible nuclear fragments, because of the large binding energy and unusual
stability of the A = 4 nuclide. A tendency for a decay to prevail among the heavier
nuclides is not hard to understand. The a fragment and the residual daughter are
bound together inside the unstable parent nucleus by the attractive short-range
nuclear force. At the same time, the Coulomb repulsion between these two charged
bodies acts as a disruptive influence and contributes to the probability for nuclear
disintegration. Figures 14-4 and 14-12 remind us that these long-range effects of
Coulomb repulsion become increasingly significant in nuclei of larger size. The
heaviest nuclides may also undergo spontaneous fission, another type of fragmentation
process.
Nuclear radiation energy is readily absorbed in matter when the radiation is in the
form of a particles. The moving charges lose their kinetic energy in ionizing collisions
with the atoms in the material medium. Familiar kinematical arguments tell us that
these collisional losses of energy are greater for a particles than for the much lighter /3
radiation. A range in the medium is defined as the distance in which the particle loses
all its kinetic energy. An effective absorber for a particles might be an aluminum foil,
a sheet of paper, or even a volume of air. The range decreases with the density of the
absorbing medium and increases with the energy of the particle. To give an example
at a typical energy, a 6 MeV a particle is observed to travel approximately 5 cm in air
and 0.05 mm aluminum.
in
to obtain a calibration of range versus energy for the particular absorbing medium.
The resulting range-energy curve then gives a determination of the energy of an a
particle directly from an observation of its range.
The typical a decay X -» Y -I- a is initiated in the ground state of the a-active
parent X. The observed ranges in air vary between 2 and 9 cm for the different
emitters, as the corresponding a-particle energies vary between 4 and 9 MeV. It is
also possible in certain nuclei for the transformation to proceed from an excited state
X* to the daughter state Y. The resulting a particles are emitted with excessive
energy, between 9 and 12 MeV, and are characterized by their distinctive long range.
15-3 a Decay 753
Figure 15-8
Kinetic energy of emitted « particles versus mass number for several radioactive sources. The
data for the selected elements polonium, radium, thorium, uranium, and curium are taken from
Chart of the Nuclides, 13th edition. All quoted energies are associated with nuclear transitions
X -» Y from the ground state of X A
to the ground state of Y. few of the lighter nuclei emit a
particles with energies smaller than the values plotted here; these cases include the very
144 148
long-lived nuclides Nd and Sm.
Cm
This type of a decay is quite rare because the emission process X* —> Y + a must
plot of these unique values of Ka versus the mass number A, organized by element,
for a large number of a-emitting isotopes. One of our two main objectives in this
section is to explain the uniqueness of these energies. We see from the figure that
almost all the observed values fall between 4 and 9 MeV, a rather narrow energy
interval. It is remarkable that the corresponding half-lives have an enormous range of
variation, extending over some 24 orders of magnitude. We observe an inverse
correlation between half-life and a-particle energy, where the longer-lived nuclides
have the smaller values of K a and vice versa. We can illustrate this behavior by
754 Nuclear Processes
Figure 15-9
at rest.
©
-p < ( Y K») > P
while
Our other main goal in this section is to understand the correlation between t 1/2 and
A' n , with special concern for the great disparity in orders of magnitude of the one
quantity compared to the other.
It is easy to see why the energies of the a particles are unique. We apply the
familiar conservation laws of momentum and energy to the process
^X-^*Y+*He,
and we note that the decay produces an a particle in a two-body final state. The parent
nucleus is taken to be at rest, as in Figure 15-9, so that conservation of momentum
requires the emitted a particle and the recoiling daughter nucleus to have equal and
opposite momenta. Conservation of the total relativistic energy provides another
relation involving the nuclear masses:
M x c' = M Y c
2
+ KY + M ac
2
+ Ka .
We can add Z electron masses to both sides of the equation and convert the
expression directly to atomic masses:
A criterion for a instability emerges from these relations. The nucleus X a unstable,
is
so that the decay X -» Y + a may occur, whenever the mass of X exceeds the sum of
the masses of Y and a. We define the a-disintegration energy, or Q-value for the decay, in
terms of the difference in mass:
A -4
Q= [M{ A X) - M( Y) - M( 4 Hc)]c 2 . (15-8)
This quantity represents the total amount of released energy to be shared by the two
15-3 a Decay 755
final particles:
d=Ka + Ky . (15-9)
We may use the nonrelativistic formula for kinetic energy and write
2 2
p p
0.
2Ma 2 My
,2
MA I M
The final solution for Ka is a unique prediction for the a-particle kinetic energy, given
in terms of the masses of the particles participating in the decay. We note in passing
that relativistic formulas are required for the total energy because the rest energies
change with the identities of the particles, while nonrelativistic formulas are allowed
for the kinetic energy because the kinetic energies are much smaller than the rest
energies.
Other conservation laws also play a part in the determination of the final state. Let
us refer again to Figure 15-9 and introduce the nuclear spins I x and I Y along with
the orbital angular momentum L for the Ya system. The a particle has spin zero since
4
He is an i = nuclide. Conservation of angular momentum therefore results in the
equality
I X = IY + L. (15-11)
•Yl <<f<z x +z Y
- . (15-12)
The nuclear states X and Y are also endowed with definite parities. The a particle is
known to have even parity and the final orbital parity is given by ( — 1 )*. Conservation
of parity implies a multiplicative constraint among the various odd and even factors:
Figure 15-10
Model of the Coulomb barrier for the a decay X —* Y + a. The potential energy of the Ya
system consists of a strongly attractive nuclear contribution at short distance and a repulsive
electrostatic contribution for larger separation between Y and a. The a-disintegration energy Q
determines the probability for penetration of the barrier.
V(r)
MeV at a separation r of order 10 fm. (We estimate these numbers in the example at
the end of the section.) The Q-value is shown in the figure as the energy level of the
system, since the total kinetic energy is equal to Q at very large Ya separation,
according to Equation (15-9). A typical Q-value is less than 10 MeV, and so a
"classical" Ya system cannot leave the region of nuclear binding and reach the distant
region of large Ya separation because of the high Coulomb barrier. This trapping of
the system at short range corresponds to the existence of the temporarily bound
configuration X. The instability of X is attributed to the finite probability for the
quantum system to penetrate the Coulomb barrier and enter the decay regime at large
r. This line of reasoning suggests a possible theory of a decay based on a solution to
the related problem of quantum tunneling.
The stationary-state wave function at the energy level Q determines the probability
of finding the Ya system in its two configurations on either side of the barrier. The
quantity of interest is the transmission probability for penetration from small r to
large r. The decay constant for the process X— Y+> a is directly proportional to this
quantity. We have encountered a similar situation in Section 5-9, in our treatment of
quantum tunneling for a rectangular barrier in one dimension. We return to our main
15-3 a Decay 757
la
exp l2m(V -E]
is the primary governing factor in the transmission probability. (This result is only
qualitatively germane
a decay, a three-dimensional problem with a sharply peaked
to
Coulomb barrier. The one-dimensional analogue is best suited for transitions X —> Y
in which both X
and Y are even-even nuclides. These species have nuclear spin i = 0,
and so the final Ya state has zero orbital angular momentum, according to Equation
(15-12).) Let us apply the one-dimensional analogy as an approximate guide to the
behavior of a decay and adapt the exponential factor to the picture in Figure 15-10.
We substitute a rectangular step of height V and length a in place of the sharp
barrier, and we argue that the decay constant for X — Y + a
> is proportional to the
tunneling factor
2a
exp
h
Wo-£.
where jjl is the Ya reduced mass and Q is the energy level. This factor decreases very
rapidly with increasing choices of V — Q, the height of the barrier above the ()-value.
We predict a further reduction in probability if we let the choice of a grow with
increasing V — Q, to reflect the fact that the quantum tunnel becomes longer with
diminishing Q_. These features of barrier penetration explain qualitatively why the
observed half-lives grow so rapidly with decreasing values of the a-particle energy.
Since a decay is monoenergetic and since the unique energy of the a particles can
be measured with high precision, it is feasible to observe the energy and gain precise
information about nuclear energy levels. These spectroscopic applications of a decay
Figure 15-11
Decay scheme and a-particle spectrum for 3- Th. Nuclear levels are labeled by
"'
their i
p
quantum numbers and half-lives. The a and y emissions have energies as shown in MeV.
1.40 x 10 10
3.9
1 4 4.1
Ka (MeV)
758 Nuclear Processes
have been employed extensively to study nuclear structure in the heavy elements. A
typical radioactive sample may exhibit a spectrum of a particles in which several
unique energies are observed for the emissions from a single source. A case in point is
!2
Th decay, where the thorium source emits a particles in two groups, at 4.01 and
3.95 MeV. Figure 15-11 presents these data in a decay scheme along with a schematic
display of the a-particle spectrum. The two a groups arise from decays of the type
232
Th-» 228 Ra+ 4 He,
where the radium daughter occurs in its ground state and in one of its excited states.
The shows that the a decay to the excited level is followed promptly by a y
figure
deexcitation to the ground state. Note that the a spectrum maps out the relevant
228
energy levels of Ra, while the observed y-ray energy confirms the spacing between
the levels. Other varieties of a-active parent nuclei exist with greater numbers of
excited daughter states and produce decay schemes of greater complexity.
Example
The ratio of nuclear masses in the final state is also needed in order to learn how
much of Q is carried away by the a particle. We find this quantity from the
atomic masses by including the masses of the electrons (m ec 2 = 0.5110 MeV):
M„ 4.002603-2(0.5110/931.5)
= 0.01755.
M Ra 228.031069-88(0.5110/931.5)
(We might as well ignore the electron-mass correction since the ratio of atomic
masses produces the same answer to five decimal places. In fact, the ratio of
mass numbers 4/228 is just as good to four decimal places.) Equation (15-10)
gives the desired kinetic energy
4.082 MeV
Kn = = 4.012 MeV.
1.01755
This result corresponds to the smallest of the values plotted in Figure 15-8. The
Qj value is small compared to the height of the Coulomb barrier in the Ra-He
system. Let us estimate the latter quantity by calculating the repulsive Coulomb
potential energy at a range equal to the sum of the two nuclear radii (the
center-to-center distance in the Ya system of Figure 15-9). We use Equation
(14-3) to find the range,
1/3
r = (228 + 4 1/3 )(1.2fm) = 9.2 fm,
15-4 $ Decay 759
and we then compute the Coulomb potential energy for that separation:
7 7 -2
V(r) =
4ire r
2
X 9
N m /C 2 2
x 10" 19
C)
=
(2)(88)(9
;
(9.2
10
77
T7
X 10^ 15 m)(l.60
— •
X 10" 13 J/MeV)
)(l.60
77 r = 28 MeV.
This barrier height is seven times as large as the Q-value for the decay. Let us
interpret our calculations with the aid of Figure 15-10. We choose a low-lying
energy level to represent the small value of Q. an ^ observe that a very thick
Coulomb barrier is obtained. The resulting quantum tunnel is supposed to be
232
long enough to account for the long half-life of Th.
15-4 Decay
Nuclear decay has properties and mechanisms quite unlike those associated with the
(5
in which the proton and neutron numbers change by one unit while the mass number
A remains the same, as indicated in Figure 15-2. A more fundamental picture of the
phenomena appears to be in effect, where fi~ decay transforms a neutron into a
+
proton within the nucleus and /? decay reverses this basic transformation. A
+
qualitative definition of the /?" and /2 instabilities emerges from these observations.
The parent nucleus evidently has two much mass, either because of a neutron excess
+
in the case of /?~ decay or because of a neutron deficiency in the case of /? decay.
Each type of /? transition alters the system of bound nucleons so that the nuclide ^X^
transforms into an isobar of lesser mass, one unit removed in both nucleon numbers Z
and N. The transformation proceeds along a line of constant A on the nuclear chart,
in a direction toward the valley of stability.
Wehave mentioned the valley of stability on two occasions in Chapter 14. The Z
versus TV plot in Figure 14-4 shows the tendency for stable nuclides to lie in a central
valley bounded by radioactive isotopes on either side. The semiempirical mass formula
in Equation (14-13) determines a mass surface in the variables A and Z and defines a
stable valley as the locus of points of minimum mass in each cross section of the
surface at constant A. We describe these features by graphs of the atomic mass
M(jX) as a quadratic function of Z in Figure 14-13, and we note that isobars appear
on the graphs for each choice of A. A ^-unstable isobar takes its position along the
parabolic curve at a value of M above the minimum. A /? transition makes its
appearance on the graph as a link between the unstable parent and its less-massive
daughter one unit away. The link runs from Z to Z+ 1 for /? decay or from Z to
+
Z— 1 for /3 decay. Figure 15-12 shows such a display for the case of the ^-active
isobars at A = mass number corresponding to the decay scheme in Figure
80, the
15-1. Note that the graph of M
versus Z has two branches, one for the even-even
760 Huclear Processes
Figure 15-12
79 93
79 92
Se^—^Kr
A 80
31 33 35 37
nuclides at lower M
and another for the odd -odd nuclides at higher M. We recall
from Section 14-5 that the two mass curves are expected for even A because of the
pairing effect. We have constructed a similar set of graphs in Figure 14-13 for the
isobars with A = 98, 99, and 100. The odd- A family at A = 99 is plotted again in
figure 15-13 as a graph of M
versus Z and also as a partial decay scheme. The
various (i processes are indicated in both parts of the figure. Observe that the
transitions Z —> Z— 1 appear in the decay scheme in two different forms, designated
by the notation EC and fi + . We distinguish these processes in due course as we
continue through the subsequent discussion.
The conservation laws of angular momentum and energy introduce an immediate
new development in the interpretation of fi decay. Let us consider angular momen-
tum first and note that a problem arises when we examine the spins of the participat-
ing particles and nuclides. The emitted charged particle is either a spin-^ electron in
decay or a spin-^ positron in /? decay. The nuclear transition X —» Y involves
+
fi
no change in A, and so both nuclear spins x and Y must be given by integers, if A isi z
the case of a decay but are distributed In a continuous spectrum of energies, Figure 15-14
shows an example of a fi spectrum in which the kinetic energy of the electrons varies
from nearly zero up to a maximum value determined by the energy released in the
15-4 £ Decay 761
Figure 15-13
Sequential (i decays in the A = 99 system of isobars. The plot of M versus Z reproduces one of
the graphs in Figure 14-13. Data in the decay scheme include the values of i
p and t i/2 along
with the released energies in MeV. The /? decays descend from the left while the electron-
+
capture transitions and /8 decays descend from the right.
M (u)
98.92
\
\ Pd</
\ Rh
c/(/^
(%/ 15s
Tc Ru
Nb
39 41 43 45
k-z
47
3.64
16.1 d
65.9 h
Mo
+
3%//3
1.361 • • •
1209
• • J
2.13 « 10 5 y
Ru
nuclear transition. This display of events reveals that less than half of the total
available energy is carried off by the detected charged particles. The missing amounts
of angular momentum and energy are explained by arguing that ft decay must
include another unobserved particle in the final state. The argument treats the two
processes
Figure 15-14
Endpoint
1.161 MeV
use the designations v and v, and describe the two nuclear processes as
We invoke anew type of conservation law for nonnuclear particles when we employ
this notation. By convention, e~ and v are called particles, while e + and v are called
antiparticles, so that the total number of nonnuclear particles is conserved, along with
the total number of nucleons, in each of the decays. Conservation of the total charge
requires the neutrino to be a neutral particle. The original problem of the missing
angular momentum is resolved by assuming that v and v are spin-^ fermions. Since
the electron cannot be a constituent of the nucleus, it follows that the two-body yS
+
systems e'v and e v must be created in the nuclear transformation and are not
regarded as "present in the nucleus" prior to the decay.
The problem of the missing energy in /? decay was acknowledged by 1930, before
the coming of the neutron. An explanation for the continuous j3 spectrum was
urgently needed since the observations presented a challenge to the fundamental law
of conservation of energy. Pauli offered a solution in 1931 by proposing the existence
of an unobserved third particle. The neutrino hypothesis was accepted, even though
the particle itself could not be observed, because the violation of the sacred conserva-
tion laws was not a tolerable alternative. The mass of the neutrino was supposed to be
exceedingly small, perhaps to the point of vanishing, to explain the observed /S
+
v + p -* n + e .
15-4 p Decay 763
Antineutrinos were produced with a very large flux from the /2-active fission fragments
in the reactor, and absorption events were recorded whenever the final-state particles
were observed. Positrons and neutrons were detected, through the processes of positron
annihilation and neutron capture, by the observation of photons in the scintillating
liquid of the detector. This experiment established the existence of the neutrino, and
later experiments confirmed the very small prediction for the neutrino interaction
cross section.
The neutrino concept certainly does not require the mass of the neutrino to be
precisely zero. We must turn to experiment for a determination of the mass, as we
should for any particle. The question of a neutrino mass is of great interest in some of
the current problems in particle physics. We can safely ignore this question when we
study events in the main part of the /3 spectrum, because the mass of the neutrino is
known to be very small, much smaller than the mass of the electron.
Let us begin our kinematical investigations by considering /? decay. We apply
conservation of energy to the three-body decay of a nucleus X at rest by writing
Mx c
2
= MY c ' + KY + m e
c
2
+ K +E
e v
. (15-13)
{M x + Zm e
)c
2
= [M y + (Z + l)w,]<-' + A' v + A',, + £„,
or
M{ Az X)c 2 = M( z ^)c 2 + Ky + K + e
E„
d= [M(jX) - M( z ^)]c 2
= KY + K +E
e v . (15-14)
This result identifies a specific total amount of energy to be shared among the three
final particles Y, e~, and v. In fact, the recoil energy of the massive nucleus Y is
(M x + Zm f
)c
2
= [My + (Z - \)m e
]c
2
+ 2m/ 2 + KY + K + e
E„.
M{jX)c 2 = M{ z A ,Y)c 2 + 2m ec 2 + K Y + K + e
E„
+
when we rewrite the result in terms of atomic masses. The Q-value for fi decay is
764 Nuclear Processes
therefore defined as
d= [M( A2 X) - M{ Z ^Y) - 2m e
}c
2
= KY + KE + E ,.
t
(15-15)
Equations (15-14) and (15-15) give the total amounts of energy released in the two
decays. We note that the nuclides X and Y must give a positive expression for () if ft~
or ft
+
decay is to take place. The equations tell us that K e
is strictly less than Q, as
needed problem of the missing energy. The equations also tell us that the
to solve the
maximum K r approaches Q, for zero nuclear recoil, in the limit of vanishing
value of
neutrino energy. This maximum kinetic energy defines the endpoint in the spectrum for
a given ft decay, as in the illustration provided in Figure 15-14.
The neutrino-mass question comes to the surface when the ft spectrum is examined
close to its endpoint. This region of maximum K (
and minimum E v
is sensitive to the
rest energy of the neutrino because the decay probability is a function of the neutrino
momentum
m 2 c*
Pv = •
The fact that Ev and m vc appear together in this expression implies that a test for a
nonzero mv is conceivable if events can be detected at sufficiently small values of £,,.
(We note in passing that pv can only approach zero for mv = 0, because a zero-mass
neutrino cannot be at rest in any Lorentz frame. The ft spectrum distinguishes the
m v = case since the approach to the endpoint varies gradually with K e
only when
pv = E v /c.) This idea is under investigation in a current experiment to study triton ft
decay,
3
H ^ He :i
+ e~+ v,
near the endpoint of the electron spectrum. Indications suggest a lower bound for m vc 2
around 30 eV, a rather small neutrino rest energy. This question remains unsettled,
however, since there are strong arguments for a much smaller neutrino mass from
other investigations.
+
A nucleus may undergo a second kind of ft transformation in the process of
electron capture. This phenomenon is illustrated in Figure 15-15 by the reaction
in which the electron is absorbed from an atomic orbital state to initiate the nuclear
transformation. The process is most favored when capture occurs for a K electron,
because the /= orbital assignment maximizes the probability of finding the electron
in the volume of space occupied by the nucleus. An electron capture is detected when
an atomic x ray is observed; the x ray results from the transition of an outer electron
to fill the inner hole left by the captured electron. We neglect the atomic binding
energy in the initial state and apply conservation of energy to the process by writing
Mx c
2
+ m t
c
2
= MY c
2
+ KY + E„. (15-16)
(Af x + Zm r
)c
2
= [M y + (Z - \)m t
]c
2
+ KY + £„,
15-4 fi Decay 765
Figure 15-15
process.
Atomic orbit
^<Y) ©^
or
M{jX)c 2 = M( z ^Y)c 2 + KY + E v
.
The Q-value for electron capture is therefore given in terms of atomic masses by the
formulas
d= [M(jX) - M( z ^)\c 2
= KY + E„. (15-17)
This amount of energy released to the final Yp state must be a positive quantity, so
that the atomic mass of X must exceed tne atomic mass of Y, if electron capture is to
occur.
+
Equations (15-15) and (15-17) refer to /? transitions between the same parent and
daughter. The difference in Q-values implies that electron capture is energetically
+
allowed whenever /3 decay takes place. The two processes compete under these
+
circumstances, as suggested in some of the transitions shown in Figure 15-13. Since fi
decay gives the smaller Q-value, it is possible to have masses of X and Y for which
Equations (15-15) and (15-17) have opposite signs. In this situation the unstable
+
nucleus undergoes electron capture as its only mode of /? transition. We illustrate
some of the possibilities in the examples below.
Our discussion of ft decay presumes that both X and Y are in their ground states.
A /? transition often leaves the daughter nucleus in an excited state Y*, which then
deexcites by the y-emission process Y* — Y +> y. We observe such a transition by
detecting a y ray in delayed coincidence with the observed /8 particle. The Q-value
formulas determine the total released energy, including y-ray energy, in this situation.
Example
The isobars 272 Mg, 2 ]A\, and ^Si engage in /? processes similar to those of the
A = 99 system in Figure 15-13. A glance at the tabulated atomic masses tells us
27
that A1 is the stable isobar:
27 27 27
26.984342 u for Mg, 26.981539 u for A1, and 26.986704 u for Si.
>
»Mg-»£Al + «"+*,
e~ + f4 Si -»^A1 + v.
We obtain a result larger than twice the rest energy of the electron, and so we
+
also expect fi decay
+
ftSi -*^A1 + e + v.
+
Q(£ ) = (£(EC) - 2m„c 2 = 4.811 MeV - 2(0.51 10 MeV) = 3.789 MeV.
Example
80
Figure 15-1 includes all three /? processes in the decays of the radionuclide Br.
Let us consult the table of atomic masses and calculate the relevant (^-values.
For /T decay,
80
Br ^ 80 Kr + e~+ J>, we find
Q(EC)=[A/( 8?Br)-M(^Se)] C 2
= (79.918528 - 79.91 6521 )(931 .5 MeV) = 1.870 MeV.
For /T decay, 80
Br ^ 80 Se + e
+
+ v, we find
+
Q.(P ) = Q( EC - 2m S 2 = ) 1-870 MeV - 2(0.5110 MeV) = 0.848 MeV.
The first two of these released energies are entered on the decay scheme in the
figure.
15-5 The Weak Nuclear Interaction 767
The /? instabilities of nuclei are caused by a fundamental type of force known as the
weak interaction. We call the force fundamental because its influence applies to the most
basic particles in nature. The weak interaction affects the behavior of nuclear particles
and exhibits properties of strength and range on a scale very different from the strong
nuclear force. Despite its feeble strength, the weak nuclear force is able to reveal many
of its characterizing features through the commonplace phenomenon of nuclear /?
decay. In return, much is learned about the structure of the nucleus through the use of
the /?-decay interaction as a nuclear probe.
The weak force shows its strength in the properties of the neutrino, whose processes
are only of the weak variety. We can appreciate the weakness of the force by
inspecting the extraordinarily small cross sections for the interactions of neutrinos in
matter. We realize some of the consequences when we also learn that neutrinos are
produced in certain nuclear reactions that occur typically in the interiors of stars. The
stars are almost transparent to the passage of neutrinos, and so the weakly interacting
particles easily escape the stellar interiors. The result is a continuing inundation of the
universe with a debris of almost-massless neutral particles.
The primitive weak phenomena in all nuclear /? decays are the neutron and proton
/? processes
n — p +
> e~ + v and p —> n + e
+
+ v inside the nucleus.
The /? decay of the neutron also occurs in the free state and accounts for the observed
instability of this basic nuclear particle. Both nuclear and nonnuclear particles are
engaged in these manifestations of the weak interaction. The proton and neutron have
already been classified by the name nucleon. This subclass of particles is part of a
larger family whose members interact s'rongly with each other. The electron and
neutrino do not participate in the strong nuclear interaction and are called leptons to
distinguish their behavior from that of the nucleons. We define a lepton number L
according to the particle and antiparticle assignments
+
L = + 1 for e~ and v and L = — 1 for e and v.
The nuclear yS decays are then supposed to obey a conservation law for the total
number of leptons along with the familiar law conserving the total number of
nucleons. We have learned to identify p and n as two different charged isospin states
developments began in nuclear physics in 1934 when Fermi proposed his theory of
/?-decay transitions in nuclei. His picture of the nuclear /? interaction had a number of
decays. Several crucial discoveries were made in experiment and in theory during the
1950s. These ingredients became additional aspects of the picture as attention then
turned toward the understanding of the weak decays of the elementary particles.
The Fermi theory draws a close analogy between the weak interaction in nuclear /8
decay and the electromagnetic interaction in atomic y emission. Let us represent the
768 Nuclear Processes
Enrico Fermi
( — er) in a transition state describes the atomic photon-emission process and that the
dipole transition amplitude determines the intensity of the radiation. Let us express
the formula for the amplitude in terms of the electron states in the figure by writing
ftt^a/T,
where a, and a, represent initial and final sets of subshell quantum numbers
(nc"m^m ) according to Equation (9-9). In the Fermi theory there is an analogous
s
15-5 The Weak Nuclear Interaction 769
Figure 15-16
Atomic y emission and nuclear /? decay as bound electron and nucleon single-particle
transitions. The electron states a, and <x
f
denote and final subshell quantum numbers,
initial
while the nucleon states X and Y refer to parent and daughter nuclei.
P(Y)
e(otf)
Time
Space
e(a,) n(X)
4>%4>*dT
/
and final orbital quantum numbers. These £ values label the angle dependence of the
nucleon eigenfunctions in the Fermi transition amplitude. The amplitude vanishes if
the / values are not equal, because of the orthogonality of the spherical harmonics in
the integral J^y^Px ^ T This observation means
- that the nucleon does not change its
orbital state, so that the overall parity of the nucleus remains unaltered in the
transformation. An allowed transition is further defined as one in which the pair of
leptons is emitted with orbital angular momentum €= 0. This type of decay is
favored because there is no centrifugal potential energy barrier to inhibit the emission
of the created particles. The nuclear /? process is said to be forbidden whenever the
/= condition on the leptons is not met. (In this context forbiddenness implies a high
degree of suppression rather than an outright prohibition of the decay. Forbidden
transitions actually occur for many unstable species on the nuclear chart. The degree
of suppression grows with increasing { , and so the half-lives tend to be longer for these
nuclides.)
Let us consider only allowed transitions and look for constraints on the change in
the nuclear spin as X
transforms into Y. The emitted /? system consists of two spin- L ,
this case we express the conservation of angular momentum for the process X Y+
by writing
i
x = i
Y + 1.
The addition of vectors constrains the quantized angular momenta such that the
nuclear spin quantum numbers must obey either i
x = *
Y or ix = i
v + 1. We also
note that the conservation law does not permit transitions from i
x = to i
Y = 0. The
result, that A? '
= or + 1, is a second type of selection rule due originally to Gamow
and E. Teller. We distinguish the two cases s = and s = 1 by calling the one a
Fermi transition and the other a Gamow-Teller transition.
We observe that the Fermi transition amplitude /^y'/'x ^ T contains no operation
to change the nuclear spin. Another amplitude must therefore be introduced to
accommodate the A* = + 1 transitions included in the Gamow-Teller selection rule.
The required spin modification does not affect the nuclear parity, and so the parity of
the nuclear state remains unchanged in both Gamow-Teller and Fermi transitions.
Let us summarize our conclusions about the allowed decays and the emission of /?
s = 0, Ai '
= 0, no (15-18)
for Gamow Teller transitions. Both selection rules are designated "no" to indicate no
change in the parity of the nuclear state. We recall that the total spin quantum
numbers s = and s = 1 refer to antiparallel and parallel spins. Let us apply this
familiar picture to the emission of the pair of leptons and represent the selection rules
schematically with the aid of Figure 15-17. Specific examples of allowed Fermi and
Figure 15-17
Selection rules for Fermi and Gamow-Teller transitions in the decay X— > Y+ e~ + v.
Y i
4
= <y = or *x = »Y ± 1
'x i'x <y
(but ix = -h ;
Y = 0)
75-5 The Weak Nuclear Interaction 771
Gamow-Teller /? decays are given below. We have already remarked that forbidden
decays also occur. These processes can be associated with changes in the nuclear
parity and with changes by more than one unit in the nuclear spin.
We have learned in Section 7-5 that the overall parity of an interacting system
obeys a multiplicative conservation law in the radiative transitions of atoms. We also
know that these processes are governed by the electromagnetic interaction. It is
natural to ask whether the overall parity is similarly respected by the weak interaction
in the fi decays of nuclei. We discuss the experimental and theoretical responses to this
important question in Chapter 16, when we turn to the more recent discoveries in
weak interaction physics.
Example
+
The )8 decay of Sc is described as
+
_>i
Sc -»gCa + e + v,
+
which both
in initial and final nuclides have for their i
!'
quantum numbers.
This allowed — > transition is a Fermi process, obeying the selection rules
given in Equations (15-18). The Gamow-Teller selection rules in Equations
32
(15-19) are illustrated by the f$ decay of P,
<-'p _»32c , - , -
+ 32 +
The i
p values are 1 for P and for
!
"S, implying A; = - 1 with no change
in parity. Most allowed /? decays proceed via a combination of Fermi and
Gamow-Teller The decay of the neutron is a case in point since the
transitions.
emitted leptons e~ may have parallel or antiparallel spins, and the
and v
change in nuclear spin cannot be balanced by the spins of the leptons alone, and
so a nonzero value of / is needed in the orbital state of the leptons. A
36
suppression of C1 decay results, accounting qualitatively for the long half-life
Example
A very crude calculation can be made along the following lines to estimate the
strength of neutrino interactions in matter. Let us examine this question in terms
of the antineutrino absorption reaction v + p —> n + e
+
, as studied in the
Reines-Cowan experiment. We construct the absorption cross section ov as an
effective area presented toan incoming antineutrino by a target proton, incorpo-
rating the probability for an interaction to occur. The primitive weak nucleon
transformation is represented by the (i decay of the neutron n ^> p + e~+ v.
772 Nuclear Processes
The average time of decay is found from the 10.5 min half-life:
Let us use the reciprocal of this time interval to set a rough measure of the
probability per unit time for the p — » n transition in the v absorption process.
he 1240 MeV fm
X = —h = —
E„
=
3 MeV
•
= 413 fm.
pv
13
A 4.13 X 10 in
2I
/ -r- = 1.38 X 10 s.
c 3 X 10" m/s
21
1 1.38 X 10 s
P= t- - = - = 1.52 X 10~ 24 .
t 909 s
The antineutrino collides with a target proton if the radius of the target area A
is within the range of localization of the antineutrino. The collision then causes
the absorption of the antineutrino with a probability given by P. We use these
arguments to estimate the target area
2
A = -n A
2
= tt(4.13 X 10" 13 m) = 5.36 X 10~ 25 m 2
o„ = AP = (5.36 X 10
25
m 2
)(l.52 X 10
24
'
)
= 8.15 X 10
49
m 2
.
Let us pursue the arguments a bit farther to estimate the mean free path /„ for
neutrino interactions in matter. This quantity deduced from the cross section is
',= —1
°,«
.
where n denotes the number of protons per unit volume. We take the absorbing
medium to be liquid hydrogen, and we find n from the density and the proton
15-6 y Decay 773
mass:
70 kg/m 3
= 4.2 X 10
28
rrT
1.67 X l(T 27 kg
1
= =
''
(8.15 x 10-
49
m 2
)(4.2x 10 28 m- 3
2 9 ' X ^^ m "
Thus, the extremely small v cross section translates into an incredibly long mean
free path. This average distance between interactions is large enough to call for
the use of light-year units:
2.9 X 10
19
m
tv = 7i
15
= 3.1 X 10
3
lt-y.
9.46 X 10 m/lt-y
Of course, our estimates are based on an extremely gross calculation. The actual
cross section is somewhat larger, but only by a single order of magnitude. For 3
MeV antineutrinos, a,, is approximately 10
47
m 2
and {v is of order 100
light-years. We offer these figures to demonstrate the extraordinary weakness of
the weak interaction. The observation of antineutrinos begins to become feasible,
as in the Reines-Cowan experiment, when the v flux upon the target is
15-6 y Decay
The electromagnetic interaction affects the nucleus through the process of y deexcita-
tion. Nuclear y radiation is emitted whenever a transition occurs from an excited level
to a state of the same nuclide at lower energy. The prior state of excitation of the
nucleus is often the result of some nuclear reaction or disintegration, as in the example
shown in Figure 15-18. If the excited level is not so high as to allow the prompt
ejection of a nuclear particle, then the radiative process offers the main available
mode of decay. Photons produced in these electromagnetic transitions have energies of
MeV order, a typical scale of energy for nuclear excitation. A particular nucleus
yields a spectrum of y rays characteristic of a certain system of excited states. The
resulting decay scheme is unique and serves as a signature for the radiating species.
Measurements of y-ray wavelengths may be made using crystal spectrometers like
those employed for atomic x rays, provided the energies of the emitted photons are not
far beyond the usual x-ray regime. These Bragg techniques are used for y rays with
energies up to 1 MeV. Radiation of moderate energy may also be detected by
introducing a thin-foil material (called a radiator) in the path of the photons and by
observing the ejection of electrons produced by the photoelectric effect or the
2
Compton effect. If the y energy exceeds 2m e
c , the detection of photons can be
performed by letting the y rays produce electron-positron pairs in a thin-foil
absorber.
774 Nuclear Processes
Figure 15-18
5* 5.27 y
0.318
.Co 3.3 ps
0'
4
2 -
1.173
2*
Y 0.7 ps
1.332
0*
Y stable
Ni
zX* ix + y,
and we note that the final level need not be the ground state. This process is not
fundamentally different from the radiative deexcitation of an atom, although the
energy scale is appreciably larger for nuclear radiation. Other numerical distinctions
come into play when we consider the enormous range of lifetimes found in y decay.
The vastly different probabilities for the various radiative transitions depend on the
y-ray energies and the i p quantum numbers of the initial and final nuclear states. It is
instructive to organize all these decays with the aid of selection rules similar to those
found for the emission of electromagnetic radiation in atoms.
We have taken the electric dipole transition to be the only important mechanism in
the deexcitation of atoms. Sources of radiation other than the oscillations of the
electric dipole moment must also be considered in nuclear y decay. Figure 14-18
shows the electric dipole as one of a sequence of configurations that make up a
complex distribution of charge. Nuclei may also have quadrupole, octupole, or higher
electric moments as nonnegligible contributors to the radiation field. We classify these
rnultipole structures as 2 -poles, taking k= 1,2,3,... for the sequence of dipole,
quadrupole, octupole, . . . configurations. Magnetic multipoles also exist as distinct
Figure 15-19
ci^
and magnetic dipole sources of radiation are shown in Figure 15-19. We associate
electric dipole radiation with the linear oscillation of an electric charge and magnetic
dipole radiation with an oscillating current loop. The analogous k = 1 quantum
systems are described by the expectation values of the electric and magnetic dipole
moments, oscillating in a transition state. Similar descriptions apply to the multipoles
beyond k—\.
The multipole index k is equal to the angular momentum (in h units) carried away
by each photon in a 2*-pole radiation field. This interpretation of k as an angular
momentum quantum number generalizes oui remarks in Section 7-5, where we
consider the special case k = 1 and find that the electric dipole photon radiates one h
unit of angular momentum. The probability for radiative nuclear decay depends
sensitively on the quantum number ';, as large assignments of k imply highly
suppressed rates of decay and long radiative lifetimes. We can illustrate this property
with the aid of the following qualitative list of ranges of the mean life t, organized by
multipole for given extremes of the photon energy e:
~ lb
£1
1
electric dipole 10 to 10
14 12
magnetic dipole Ml 10" to 10 s
We use the conventional notation Ek and Mk to symbolize the various electric and
magnetic 2 A
-poles. Note that t grows through many orders of magnitude as k
increases, and observe that the electric multipoles have the faster decays for each
value of k.
Thus, we associate opposite parities with the Ek and Mk modes for each value of k.
Selection rules in nuclear y decay follow from the conservation of overall angular
momentum in the nucleus-plus-radiation system. We let the participants in the process
X* — X +> y have nuclear spin and multipole quantum numbers i*, i, and k, and
we then use these assignments to denote quantized angular momentum vectors in the
conservation law
i* = i + k.
This familiar vector constraint implies that the multipole index k must obey the
condition
We have already learned that no radiation multipole exists for k = 0. It follows that
the quantum numbers i* = i = cannot satisfy the vector equality for any k, so that
no — * radiative decay is possible.
Radiation may be emitted with or without a change of parity in the nuclear state.
Conservation of the overall parity calls for the emission of an odd-parity multipole
when X* and X have opposite parity, or an even-parity multipole when X* and X
have equal parity. We can be more specific about the multipoles we invoke the if
space-inversion properties of the various electric and magnetic modes. Thus, we find
that parity conservation permits
Our list of lifetimes correlates each of these multipoles with a range of values of t.
The enormous variations indicate vividly the role of the selection rules in nuclear y
decay.
The list of t values suggests the existence of nuclear states with large radiative
lifetimes. These metastable systems may
live long enough to allow the direct measure-
ment of such state properties as the mean life, the nuclear spin, and the magnetic
moment. An excited configuration of the nucleus is called an isomeric state, or a nuclear
isomer, if its mean life is long enough to be measurable. The radiative decay is then
said to occur in an isomeric transition, and the phenomenon is known as nuclear isomerism.
We expect isomeric transitions to be accompanied by radiation in the higher multi-
75-6 y Decay 777
poles, where the probabilities for decay are greatly suppressed, and where the changes
in the nuclear spin are given by several units.
The deexcitation of a nuclear state may also occur by a radwtionless mechanism
known as internal conversion. In this alternative process, the nucleus releases energy in a
transition to a lower state, and the energy is absorbed at once in the electronic
configuration of the associated atom. The exchange of energy results in the ejection of
atomic electrons instead of the emission of a nuclear y ray. An analogous nonradiative
type of transition takes place in atoms through the Auger effect, as discussed in
Section 3-7.
Detail
defined by the ratio of the nuclear radius and the y-ray wavelength:
R
£
=
A/277
We interpret £ as a means of comparing scales of length for the only two lengths
available in the radiation problem. We then
Equation (2-51) and (14-3) so
recall
that we can express £ in terms of the photon energy and nuclear mass number:
e R A l/i e
£ = 2tt/?
he he
Values of this nuclear parameter are less than ^ for most nuclei, while values of
the analogous atomic ratio are two orders of magnitude smaller for all atoms.
(We illustrate this claim in the first example below.) Since £ is raised to a certain
power to determine the probability for a certain type of transition, it follows that
that type of transition is suppressed much less in nuclei than in atoms. This
comparison of scales tells us why more radiation multipoles are considered in the
case of nuclear y decay.
Example
he 1240 MeV •
fm
X = — = = 620 fm.
e 2 MeV
This comparison of lengths is the idea behind the radiation scale parameter £.
Let us look at the upper end of the range of £ by choosing a large mass number.
778 Nuclear Processes
We take A = 6
3
and again set e = 2 MeV to get
R A l/3 e (1.2fm)(6)(2MeV)
£ = = = 0.073.
he 197 MeV •
fm
The probability for y decay depends on £ raised to a power, where the exponent
increases with the multipole index k. The smallness of £ implies small decay
rates and long lifetimes for radiation in the high multipoles. We find an even
smaller scale parameter when we consider the radiative deexcitation of atoms.
Let us estimate £ for atoms by computing a ratio of typical lengths:
£ = —
2tt/?
A
- = 2tt
6
10^
X
10
10~
m
=
7
—m = 1 X 10" 3 .
Electric dipole transitions take the smallest power of £ and give the only
appreciable modes of decay for such a small scale parameter. This argument
justifies the neglect of all but the E\ mode in the radiation from atoms.
Example
l37
The (3 decay of Cs leads predominantly an isomeric state of 137 Ba, as
to
indicated in Figure 15-20. This y level has a measurable half-life of 2.55 min.
+
Its isomeric transition to the \ ground state is labeled as such in the figure. The
transition occurs with a change in nuclear parity, while the nuclear spin
decreases by four units. Equation (15-20) tells us that the multipole quantum
number is in the range 4 < k < 7, and parity conservation then implies that the
allowed multipoles are A/4, E5, A/6, and El. The A/4 multipole has the
smallest value of k and is expected to give the dominant decay mode. Larger
values of Az are not uncommon in other isomeric transitions on the nuclear
chart. The correspondingly longer half-lives are listed in hours, days, and even
years.
Figure 15-20
IJ7
Isomeric transition in Ba following the fi
l!
decay of Cs. Energies for the fi and y
transitions are given in MeV.
;<
3( 3.17 y
I37
v 95% • • •
Cs 51
5%\\P
W"/? 2.55 m
V" +
IT 0.662
1.17\ '/
2
stable
V/2 '
l>
15-7 Resonance Radiation 779
The y-active states of nuclei have radiative lifetimes ranging from attoseconds to
years.The variations span 25 orders of magnitude, and so the methods for measuring
such disparate intervals of time must vary accordingly. We can determine mean lives
longer than 1 jus directly from observations of y-ray counting rates. We can measure
lifetimes greater than 10 ps, for excited nuclei resulting from (5 decay, by detecting
delayed coincidences between the emitted and y rays. Very short lifetimes can be
(5
h
r = -. (15-21)
T
widths are found in nuclear states since some of the lifetimes are extremely short. To
I6
give an example, an El mean life of 10" s implies a natural width around 10 eV.
The natural width of an unstable level can b" obtained, in principle, from the
following idealized experiment. We consider a source particle and a target particle of
the same species and assume that both particles are constrained from moving. The
emission of a photon by the source may then result in an observable resonant
absorption of the photon by the target. Both the distribution of emitted radiation and
the cross section for y-ray absorption have widths that include the natural width of
we can measure the natural width if we eliminate thermal line
the excited state. Thus,
broadening from such an idealized absorption experiment. The main idealization is, of
course, the assumption of no motion for the particles in their interactions with the
photon. A free emitter and a free absorber must undergo recoil and exchange the
photon off resonance.
Let us examine phenomenon of resonance radiation (or resonance fluorescence)
this
be different from A£ and from each other. This mismatch influences the resonance
mechanism to an extent determined by the natural width of the excited state.
The figure describes the kinematics involved in the conservation of energy and
momentum. Let us look first at the photon emission process X* — > X + y and observe
that the energies obey the relation
E*=E + ee + —
P
2
(15-22)
780 Nuclear Processes
Figure 15-21
Resonance radiation and the effects of recoil. The emitted and absorbed photons have unequal
energiese, and e a both different from the transition energy A£.
,
E* E*
\K \E
Emitter Absorber
®
><£h -(*>,
P= -•
This problem has been studied in Section 3-6, and the solution for the energy of the
emitted photon has been given in Equation (3-61). We rewrite our result as
A£ (15-23)
2Mc 2
and we note that the photon energy is less than the transition energy by an amount
equal to the kinetic energy of the recoiling system X. We then turn to the absorption
process y + X —» X* and find that the recoil effect requires the incident photon
energy to be greater than the transition energy by the same amount:
(A£V
e. = A£ + (15-24)
2Mc 2
(The proof is left to Problem 18 at the end of the chapter.) Figure 15-22 summarizes
our conclusions in terms of a schematic spectrum of energies for photons emitted and
absorbed at the transition energy AZs. Note that the energies occur in broadened
distributions instead of sharp peaks, due in part to the natural width of the excited
energy level E*. We see that resonance radiation may occur only when the two peaks
in the figure have sufficient overlap. The criterion is realized if the indicated width T
is large compared to the recoil kinetic energy
(A£T
K (15-25)
2Mc 2
We meet this condition for resonance when we consider the visible radiation from
atoms, and indeed we observe resonance fluorescence as a common atomic phenome-
15-7 Resonance Radiation 781
Figure 15-22
Emission Absorption
A£- K AE &E + K
non. The analogousnuclear problem finds K much larger than T, however. It would
therefore seem that resonance radiation is not an observable phenomenon in nuclei.
The mechanism for resonance can be recovered by letting the emitter approach the
absorber, so that the energy of the emitted photon is Doppler shifted to the larger
value required for absorption. We can determine the desired shift by noting the
connection between Equations (15-23) and (15-24):
1 + AE/2Mc 2 I AE
'1 - AE/2Mc 2
\ Mc 2
relation between frequencies can be cast immediately in terms of the photon energies
e„ and e„ to obtain
c + u c + u
£„ = e. = e = e.\ 1 +
'{?
to first order in u/c. The two expressions for e a tell us that the Doppler effect offsets
AE
(15-26)
Mc
based on the recoil-free property of the emission and absorption of y rays for nuclei
bound in solids. Embedding the emitter and absorber in separate crystals forces the
782 Nuclear Processes
crystal lattice to take up the recoil of the nucleus in both parts of the resonance
mechanism. An equivalent kinematical result is obtained by substituting the very large
mass of the crystal in place of the nuclear mass in the formula for K. The mismatch
between e, and e a disappears under these conditions, as in the idealized case of the
emitter and absorber kept at rest. The recoilless behavior of the Mossbauer system
makes it possible to achieve a resonance for an extremely sharp y transition and
enables us to measure the natural width of the resulting absorption line.
The Mossbauer effect exercises the criterion for nuclear resonance radiation with
great sensitivity. The technique can be used to detect and measure minute shifts in the
energies of photons caused, for instance, by an altered nuclear environment or a small
relative motion within the system. These perturbations on the photon energy are easily
seen in an absorption experiment, since the slightest mismatch between emitter and
absorber causes the Mossbauer device to respond off resonance.
Mossbauer's original observations were made in a low-temperature experiment
l9l
using the 129 keV y ray in the spectrum of Ir. The effect has since been observed in
numerous y transitions of many different nuclides. The capability to detect small
y-ray energy shifts was put to immediate use in several types of precision experiments.
A test of the theory of relativity was among the first applications of the effect. Use of
the technique has also spread into biology and metallurgy. To cite one example, the
57
14.4 keV y ray in Fe has been used to probe the environments of iron nuclei in
samples of organic and metallic matter.
Example
Resonance radiation is best discussed with the aid of numerical illustrations. Let
us start with the relation between mean life and natural width:
Tt = h= 1.055 X 10
- M s = 0.6582 X 10' 15 eV s.
J • •
H
In atoms we expect to find t ~ 10 s and T ~ 10 7
eV, as noted above.
Visible light corresponds to a transition energy around 2 eV, while atomic rest
9
energies are at least as large as 10 eV. If we use these numbers in Equation
(15-25) to assess the recoil effect, we get
2
(A£) (2eV)'
K= = = 2 X 10" 9 eV.
2Mc T
,
2 Q
9 ,
2(l0 eV)
Our estimates find K much less than T and imply that recoil has no adverse
consequences for resonance fluorescence in atoms. The rather different situation
191
in nuclei is nicely illustrated by Mossbauer's nucleus Ir. The 129 keV y ray
comes from a state with half-life t 1/2 = 0.13 ns. The corresponding natural
width is quite small:
0.658 X 10-
|5
h eV s
r = - = -
a
9
= 3.5 X 10" 6 eV.
t 0.13 X 10- s/ln2
Hence, the energy distribution of emitted and absorbed photons forms two
narrow peaks, unlike the broad ones shown in Figure 15-22. The recoil effect
15-8 Introduction to Nuclear Reactions 783
2 6
(0.129 MeV) (l0 eV/MeV)
K" = 4 ' 68 X I0 ~' eV
2(191)(931.5MeV)
for a free Ir nucleus. (The substitution of the y-ray energy for the transition
energy is a very good approximation here.) Resonance fluorescence can be
observed via the Doppler effect if emission and absorption occur at a relative
velocity given by
A£ 0.129 MeV
"°^= (191X931.5 MeV)
C = 7 - 25X1 °"f - 218m/s '
for situations where the reaction yields a two-body final state composed of a residual
nucleus Y and
an outgoing nuclear particle y. The product Y may be either stable or
radioactive. In either event we call the process a two-body reaction whenever X and Y
are different nuclides. Nuclear scattering is then classified as a special case in which X
and Y are of the same species. We divide scattering into its categories, elastic and
inelastic, and we express the two possibilities as
In the inelastic case, X* denotes an excited state and x' refers to a scattered particle
whose energy is reduced by the excitation of the nuclear target.
A typical two-body reaction experiment might entail the detection of y as a
function of the outgoing angle, for a number of choices of the incident energy. Typical
beam particles might be protons, neutrons, deuterons, a particles, or other ions. A
charged projectile must have enough kinetic energy to penetrate the region of
Coulomb repulsion around the nucleus in order to produce a nuclear interaction with
the target. For insufficient energy the outcome of the charged-particle collision is
expected to agree with the predictions of Rutherford scattering. If we consider protons
in collision with light nuclei as an example, and recall our calculation of short-range
Coulomb repulsion at the end of Section 14-1, we find that a beam energy of several
784 Nuclear Processes
MeV is needed to gain access to the nuclear regime. A suitable proton accelerator
must be supplied to meet this requirement. We can probe the nucleus at lower energy-
if we use neutrons instead, since neutrons are not affected by electrostatic repulsion. A
nuclear reactor serves as a suitable source of neutrons for these low-energy experi-
ments.
Nuclear reactions were first observed in 1919 in Rutherford's laboratory, when
energetic particles were found to be produced by a particles passing through air.
These occasional events were interpreted as a-particle collisions with nitrogen nuclei,
yielding protons in the final state:
14
4
He + N -*'£> +[H or
l4
N( a, /?)
17
0.
Radioactive a emitters were the only available sources of energetic charged particles
until the coming of the accelerator. One of the earliest machines was the high-voltage
generator, developed for the acceleration of protons by J. D. Cockcroft and E. T. S.
Walton. Particles from this device were used to initiate nuclear disintegrations for the
first time in 1932. The accelerated protons bombarded a lithium target and produced
pairs of a particles via the reaction
[H + ]U ^ 4
2
He + 4.He.
These observations were of special interest since the reaction gave one of the earliest
A nuclear reaction may proceed from the given initial state xXto any final state,
as long as all the relevant conservation laws are obeyed. The total charge and total
number of nucleons must be conserved, to balance the values of Z and A as in the two
processes cited in the previous paragraph. Energy and momentum must satisfy the
familiar conservation laws in all collisions. Every process other than elastic scattering
has different total kinetic energies in the initial and final states. Such reactions are
called exoergic if
or endoergic if
Z^^final
< i-> -^initial-
A certain minimum, or threshold, energy is needed to initiate the reaction in the latter
circumstance. Conservation of the total angular momentum constrains the orbital and
nuclear spin quantum numbers of the participating nuclides, and conservation of the
overall parity adds further conditions to these constraints. Finally, since charge
independence is a valid symmetry of the strong nuclear interaction, a further
conservation law is also applied to the total isospin.
The conservation laws lead directly to a number of important kinematical rela-
tions. Let us apply these considerations to the general two-body reaction X(x, y)Y,
taking the target nucleus X to be at rest in the lab frame as in Figure 15-23a. The
determining ingredients in the kinematics are the beam kinetic energy K x
and the
energy released in the reaction. This latter quantity is also known as the Q-value. We
express the conservation of the total relativistic energy according to the notation in the
figure by writing
K x
+ M x
c
2
+ Mx c
2
= Ky + M y
c
2
+ KY + My c
2
.
15-8 Introduction to Nuclear Reactions 785
Figure 15-23
Kinematics of the two-body reaction x + X -> Y + y in the laboratory frame and in the center
ofmass frame.
CM
© *~Px Q © *-p'
K' K .
Q=(M x
+ Mx -M -MY y
)c
2
Ky + K Y - A\ (15-27)
Note that the nuclear masses may be replaced by their atomic-mass counterparts in
the formula for (), since the requisite electron masses make no net contribution. Note
also that the Q-value for a reaction may be either positive or negative, unlike the
strictly positive difference of rest energies required for a decay. According to our
remarks in the previous paragraph, we have Q > in an exoergic reaction and Q <
in an endoergic reaction. The remaining Q = possibility applies in the special case
of elastic scattering.
Equations (15-27) suggest a way to determine an unknown nuclear mass, provided
the other three masses in the reaction are known and the final kinetic energies are
measured. We concede that the small recoil of the residual nucleus Y is difficult to
using the quantities defined in Figure 15-23a. We assume that the kinetic energies are
small compared to the rest energies so that we can employ the nonrelativistic formulas
K = Kj ~ and KY
2M '
2m: 2M,
Note that relativistic energy conservation must be applied to obtain Equations (15-27)
and that approximations to the kinetic energies may then be adopted when the
786 Nuclear Processes
My j
- 21
MY -J-K K V x yv
cos6. (15-28)
px // // Mx + M x
= P-
M x
M x
Mx M MX X
and gives an immediate formula for the momentum of x in the two frames:
Mx + M
PX = (15-29)
P'
M„ x
The colliding particles in the CM system have total kinetic energy
P'
2
P'
2
P'
2
Mx + M x
K' = K'x + K'x =
2M X
2M X 2M X
Mx
We use Equation (15-29) to connect this energy with Kx ,
the beam kinetic energy in
the lab:
K= — —
tr p'
2
I A/ x
-— + M \
2
Mx + M x
= =.K'— -. (15-30)
2 A/. 2M. \ Mx Mx
The result shows that K exceeds A" and reminds us that a portion of the beam
x
and Y are produced at rest in the CM frame. This configuration of the final state in
Figure 15-236 exists when K' assumes its minimum value K'lh as determined from the
,
K'th + (M x
+ Mx )c
2
= (M y
+ MY )c
2
.
When we compare with Equations (15-27), we find that the threshold energy in the
CM frame is given directly by the excess mass:
= *.'»,—t: = ".
*,h -Q-^T4 (15-32)
This formula for the threshold of the endoergic reaction in the lab satisfies the
inequality K lh
> K[ h and again reminds us that part of the beam energy is not
available for conversion into rest energy. Thus, it is possible to convert all the kinetic
energy of the colliding particles into excess mass in the final state only in the CM
frame.
Finally, let us return to Equation (15-28) and show how the expression can be put
to spectroscopic use in reactions of the general type X(.v, y)Y. We may take the
residual nucleus Y to be in its ground state, or any of its excited states. Each higher
level has a well-defined excitation energy above the level of the ground state, and so
each excited state has a definite mass value Y M . We let the incident beam have a
fixed energy Kx We then arrange the spectrometer
. to detect the outgoing particle y at
a fixed angle 6, so that we obtain a unique determination of K for each possible state
of the nuclide Y. Hence, the energy spectrum for the detected particle serves as an
image of the energy levels of Y. An example of this form of nuclear spectroscopy is
[H+SX-^X+JH,
A+
in which the excited states of the nucleus '
X are mapped out by the distribution of
outgoing proton energies. We demonstrate some of the analysis in the following
illustration.
Example
Figure 15-24 shows a portion of the energy spectrum of protons produced in the
reaction
27
A\(d, p)
28
A1. The experiment employs a beam of 2.10 MeV deuterons
along with a proton detector set at 90°. The figure also includes a portion of the
28
energy level diagram for the nucleus A1. We refer to Equations (15-27) and
(15-28), and we take
K = Kd =
x
2.10 MeV, Ky = Kp , and 6 = 90°
to obtain
^l 1 +
tl- A ''
788 Nuclear Processes
Figure 15-24
p) 28 AI
?7 AI
(d,
K d =2.\ MeV
.'
0„ = 90°
6 7
Al
KD (MeV)
K d (\ - MJM S
KP = d+
)
1 + M/My
using masses from Appendix A. We then predict the energy for protons detected
at 90° to be
1 + i/28
The two calculated values of Kp appear in the figure as the first and fourth lines
at the high end of the proton spectrum.
A reaction of the general form X(*, y)Y may yield any two-body system allowed by
the conservation laws. Hence, the collision of a beam particle x with a target nucleus
X produces a particular Yy final state as a random event, subject to the probabilistic
principles of quantum mechanics. The probability for the reaction is expressed by
means of a reaction cross section. This quantity is measurable in a suitably designed
experiment and is predictable in a theory of the nuclear reaction. Any theoretical
treatment of the nuclear process necessarily includes a model of the nucleus itself. The
resulting phenomenology brings experiment and theory together for the interpretation
of nuclear structure.
Let us continue to concentrate on two-body final states as we formulate these ideas.
The events in a given reaction x + X —> Y + y are described in principle by a
quantum mechanical amplitude x(X)', *X). (Wc follow the practice adopted in
Section 8-7 and use right-to-left notation to designate the amplitude.) The construc-
tion of the complex function x is such that |x|~ determines the reaction probability.
We cannot be specific about the form of the function without a detailed theory, except
to say that x depends on the beam eneigy K x
and the angle 6 of the detected particle
y. We suppose that the detector subtends an infinitesimal solid angle d£l, as in our
discussion of Rutherford scattering in Section 3-4. The reaction cross section da is then
defined by complete analogy with the Rutherford cross section:
— (Yy,xX)=\ X
da
(Yy,xX)\-.
|2
(15-33)
This quantity a measurable and calculable function of the variables A", and 8. The
is
total reaction cross section a r is obtained by integrating over all directions of the
detected particle:
,. da
ar = a(Yy, xX) = /^(Y?, *X) dSl. (15-34)
The usual interpretation of the cross section for scattering carries over for reactions as
well. The reaction cross section represents the effective target area presented to a beam
particle x by a target nucleus X for a specific reaction leading to the final state Yy.
790 Nuclear Processes
Figure 15-25
Formation and disintegration of the compound nucleus in the reaction x + X —» [W] —> Y + y.
CM
© *- I K) ©— *- • -*—(x)
Collision
energy A7
These equations adapt at once to the special case of elastic scattering, with total cross
section
, do
if = o(Xx,xX) = j— (Xx,xX) dQ, (15-35)
Quantum mechanics makes predictions about the process of interest through the
associated amplitude. In principle, we might argue that x(Y)', vX) should be ob-
tained by solving the Schrodinger equation to find the wave functions for the initial
and final states. In practice, we must acknowledge the difficulty of such a multi-
nucleon problem and turn instead approximate models. A prospective
to the use of
model is viable in this phenomenological approach if the ideas have intuitive appeal
and if the results are subject to direct experimental test.
A useful picture of the interacting nuclear system is provided by the compound model,
a nuclear reaction mechanism proposed by Bohr in 1936. The basic entity in the
model is a temporary composite structure called the compound nucleus. This object is
supposed form in the collision of x with X and then decay into the final system
to
consisting of Y and y. We describe the two-stage process of formation and decay by
the notation
x + X -» fW] -» Y + y.
We visualize the two stages in the lab and CM frames by the sequence of events
shown in Figure 15-25. The model makes three assumptions: the total energy of the
colliding system is rapidly distributed throughout the whole compound structure, the
compound is rather long-lived relative to the time required for x to traverse the region
of nuclear interaction, and the disintegration of the compound is completely indepen-
dent of the circumstances of formation. These premises are understood to have limits
of validity that depend on the energy and mass number of the system.
The model refers to a compound state of the nucleus without specification of the
states of the individual nucleons. These constituents are presumed to execute complex
15-9 Huclear Reactions and Nuclear Structure 791
Figure 15-26
27
Total cross section and absorption cross section of A1 for incident neutrons. Elastic scattering
is the dominant contribution to the total cross section at these energies.
o(n,y)
5 I.
10 mb-
50 keV 50 keV
random motions upon collision and share their energies quickly by virtue of their
strong short-range interactions. This mode of distribution of the total energy permits
the rapid formation of the compound nuclear state and brings about its subsequent
disintegration into any one of several possible final channels. The random selection of
a decay channel is supposed to occur without memory of the formation state and is
assumed to result from the random exchanges of energy among the nucleons. This
collective view of interacting constituents in the compound model is fundamentally
different from the independent-nucleon theory invoked in Section 14-10 as the basis
for the nuclear shell model.
The best evidence for temporary compound nuclear states is the existence of
resonances. We observe these phenomena in a given reaction as well-defined enhance-
ments of the cross section at specific values of the energy. A resonant nuclear state
represents a system of nucleons whose configuration is especially favorable for the
distribution of a definite amount of total energy. We interpret a resonance as a
quantum state of the compound nucleus, and we assign the state a nuclear spin and
parity according to the experimental evidence. The temporary nature of the state
Pr = h.
6
s but must be larger
'
section, and so the short lifetime of the compound state can be found by measuring
the resonance width. Figure 15-26 shows an illustration of a resonance for low-energy
27 2ii
neutrons incident on A1 nuclei. The compound nucleus [ Al] is clearly in evidence,
as a resonant state occurs at the same value of the neutron energy in both the
28
scattering and the capture processes
27
A1(«, «)
27
A1 and
27
Al(n, y) Al. discuss the We
numerical aspects of these graphs in the first example at the end of the section.
A formalism for nuclear reactions has been developed by Wigner and G. Breit. The
theory includes an expression, known as the Breit-Wigner formula, for the cross
section in the neighborhood of a resonance. Let us examine the construction of this
formula from the point of view of the CM frame in Figure 15-25. The initial collision
.
energy and the final released energy are identified in the figure as A' and A",
respectively.At resonance we relate the two energies to AE, the excitation energy of
the resonance, by using conservation of energy:
A" + M x
c
2
+ Mx c
2
= Mw c
2
+ AE = K' + M y
c
2
+ MY c
2
,
(15-36)
where M
w c denotes the rest energy of the compound nucleus in its ground state.
Note that the Q-value in Equations (15-27) can be equated to the energy difference
A" — A" at resonance. We then define the new energy
W= A" + (M x
+ Mx - Mw )c
2
= K'+ (My + MY - M w )c
2
,
(15-37)
h
x = -.
p'
This kinematical quantity varies with A" and therefore depends on W. We use A to
introduce an energy-dependent area
a (W) = J— ,
(15-38)
J
anticipating the appearance of the area as one of the factors in the cross section. The
Breit Wigner reaction cross section is given in terms of o by the formula
o r
= o(Yy,xX) = on (W)- ~ 1
—z.
2
(15-39)
(W- AE) + (T/2)
We note that the second factor achieves a maximum value when the variable passes
through resonance at W = AE. Thus, the formula describes the W dependence of a
resonance peak whose sharpness is controlled by the value of the parameter Y. A
similar result follows for elastic scattering by letting the final state Yy be replaced by
the elastic state X*:
We have identified the parameter V as the width of the resonance. The other
quantities T and
x
I\ are called partial widths for the two states X* and Yy. The sum
of all such parameters over all possible final states is defined to be equal to T, the total
width. Hence, the ratios Tx/T and T y/T express the probabilities for decay of the
resonance into the channels X* and Yy. These properties of Equations (15-39) and
(15-40) give an excellent description of the cross sections near resonance to support the
notion of the compound nucleus as an idealized representation of the interacting
system.
Another useful picture of the system is provided by the direct model, in which the
collision process is described as a direct reaction. This view of the process holds in energy
15-9 Huclear Reactions and Nuclear Structure 793
regimes away from resonances, where the collision energy cannot be so effectively
distributed over the whole system. We assume instead that the incident projectile x
interacts rapidly within a small region of the target nucleus and detaches the ejected
particle y, leaving the product nucleus behind. Examples of direct processes are the
(d, p) stripping reaction
2
H +-4 X -» /I + l
X+ H 1
Figure 15-27
Energy levels of
l5
N. The excited state at 12.49 MeV is observed as a resonance for the energies
indicated in the reactions
14
C(/>, n)
14
N,
,4
N(n, «) M N, and "B(a, «) 14 N.
12
11
a+ n B
n + 14 N
p+ la
C
in
MeV ^N
1
'H + A X A-\
X + 2 H.
Both of these collisions involve the transfer of a single nucleon between the projectile
and the target. Such processes can be analyzed as direct reactions to test the shell
model or some other independent-nucleon theory of the nucleus.
It is apparent that the compound and direct models provide alternative contexts for
the analysis of nuclear structure in a nuclear reaction. Let us take a last look at
resonances and nucleon transfers as the basic alternative mechanisms. We wish to
consider a few of the processes pertaining to the
15
N system for illustration, so let us
display some of the energy levels of this nucleus in Figure 15-27 for reference. We find
15
abundant evidence of excited states in the compound nucleus [
N] by observing
many resonances in the following reactions:
in
a+-'B-^ N l4
+ K or
14
C + />
n
d + C ^ N i4
+ n or
l4
C +p or "B + a
H H
p + C ^ N +
1
/7+ N^ N
14 14
+ k or
14
C+/> or "B + a.
l4 ,4
The thresholds for the states p N, and a + "B are located at the energies
+ C, n +
shown and the resonant states of 15 N are found at excitation energies
in the figure,
above these levels. We examine some of the numerical details of one of these
resonances in the second example below. We also find the transfer of nucleons at work
when we use spectrometers to detect protons and neutrons in such N-producing
reactions as
a + I2
C ^ ,5 N + />,
d + HC -> I5
N + n,
rf + 14
N ^' r>
N+ p.
Example
M, 11
K' = K. (0.035 MeV) = 0.034 MeV.
M x + M., I'M
Equation (15-36):
A' + M n
c
2
+ Mx c
2
= Mw c
2
+ A£.
The resulting excitation energy is found with the aid of the mass table in
Appendix A:
AE = M„ + ( M x - Mw )c
2
+ A"
= (1.008665 + 26.981539 - 27.981913)(931 .5 MeV) + 0.034 MeV
= 7.723 MeV + 0.034 MeV = 7.757 MeV.
appears just beyond this energy, above the range of levels shown in the figure.
Example
=
M x— =
14
—
A' A„ V 1 .779 MeV) = 1 .660 MeV.
M x + M„ 15
(
A£ = (M„ + M x - Mw )c
2
+ K'
= (1.008665 + 14.003074 - 15.000109)(931.5 MeV) + 1 .660 MeV
= 10.83 MeV + 1.66 MeV = 12.49 MeV.
= 12.49 MeV - (4.002603 + 11.009305 - 15. 000 109) (93 1.5 MeV)
= 12.49 MeV - 10.99 MeV = 1.50 MeV,
and by taking X to be
14
C in the second case,
K! = A£ - (Mp + M x - Mw )c
2
Note that the a + "B and p + 14 C thresholds have excitation levels in the 15 N
system at 10.99 and 10.21 MeV, respectively. The beam energies at resonance in
the two collisions are
/C = K' = —
15
11
(1 .50 MeV) = ;
2.05 MeV
and
M x + ML 15
K, = — K'
h
= -(2.28 MeV) = 2.44 MeV.
'
Mv 14
v '
The fission process was found to yield neutrons along with the final nuclear
products. This circumstance suggested the possibility of a chain reaction, where the
neutrons produced in the fission of one nucleus could induce subsequent fissions in
other nuclei in a sample of fissile The prospects for the generation of energy
material.
in very large quantity were appreciated by many as early as 1939. It was recognized
that a chain reaction might be put to practical use as a controlled mechanism in a
nuclear reactor, or as a catastrophic device in a nuclear bomb. The first example of a
controlled chain reaction was demonstrated under Fermi's direction in 1942, and the
Figure 15-28
Schematic evolution of the fission process including the ejection of prompt photons and
neutrons. The residual nuclei are radioactive and emit secondary /? and y radiation.
understanding of this result, let us consult Figure 14-12 and compare the binding
energies of nuclei at intermediate and large mass numbers. The figure shows that a
nucleus has about 1 MeV medium A than at high
more binding energy per nucleon at
A. Consequently, in a system of approximately 200 nucleons, we find the rest energy
to be approximately 200 MeV greater when the nucleons are found in a single heavy
nucleus than when the system is divided into two medium-mass fragments. This excess
rest energy determines the total energy released for distribution among all the final
The fissioning heavy nucleus has a neutron-rich composition so that the fission
fragments contain many more neutrons than protons. The fragments are highly
unstable and deexcite by the prompt emission of y rays along with the all-important
neutrons. A typical fission sequence passes through these early stages according to the
scenario shown in Figure 15-28. The instabilities continue after the neutron-ejection
stage because the residual nuclear products are still rich in neutrons and are therefore
radioactive. The fission products decay to their final stable states by the emission of fi
Figure 15-29
235
Fission yield in percentage versus mass number for the thermal-neutron fission of U.
4-
either of these channels in random quantum fashion, with probabilities favoring fission
over radiation by an approximate 5 : 1 ratio. We can picture this competition of
236
processes in two stages: the formation of the compound nucleus [
U] and the
immediate disintegration of the intermediate state into either the fission or the
radiation channel.
The fission process does not yield a unique pair of fragment nuclides. Instead, we
observe a distribution of mass-number pairs, comprising a "light group" and a "heavy
235
group," as indicated by the fission-yield graph in Figure 15-29. In the fission of U,
there is a probability in excess of 1% for the occurrence of any light fragment with A
between 85 and 107, together with a corresponding heavy fragment with A between
129 and 151. We note that a symmetric yield into equal-mass fragments is extremely
improbable and that the largest probabilities occur for asymmetric pairs in the
vicinity of the mass numbers 96 and 140.
A typical fission event might display the following chain of representative nuclear
products:
n + "S U 5
U ,5
Sr+ 14,
Xe *Sr + l40
Xe + In.
This example illustrates the essential fact that the excess of neutrons in the original
heavy nucleus is passed along as a large unstable surplus of neutrons in each nuclear
fragment. The instability is partially relieved by the ejection of prompt neutrons, and
the remaining neutron excess is eventually stabilized by subsequent /3~ decays. Our
representative example might exhibit the following series of /? transitions:
94 94
Sr -+ Y - 94 Zr(stable) and 140
Xe - ,40
Cs -> ,40 Ba - ,40
La -» 140
Ce(stable).
bookkeeping for the total release of energy can be drawn up roughly as follows:
the attractive nuclear force is not operative and the nuclei are under the influence of
Coulomb repulsion alone.
Let us introduce an r-dependent potential energy of the form shown in Figure
15-30 to represent the regions of attraction and repulsion. The configuration of the
compound nucleus is supposed to vary from deformation to fragmentation and
beyond as r varies across these regions. The potential energy V(r) can be inserted in
the Schrodinger equation, so that a stationary-state eigenfunction can be obtained, to
describe the fission of a given compound nucleus into a specific pair of fragments.
Note that V( r ) has a much like the Coulomb barrier in a decay. The
fission barrier
nucleus disintegrates with a certain decay constant, through the process known as
spontaneous fission, if the stationary state has an energy level below the top of the
236
barrier. Many of the heavy nuclides, including U, exhibit this behavior in their
ground state as a less probable alternative to a decay. Thermal neutrons may also
cause induced fission in the same compound nuclear system, as in the process of main
interest
We identify this state of the system with an excitation energy above the ground state,
and we note that the energy level lies higherthan the fission barrier. Thus, there is a
distinction between spontaneous and induced fission associated with the two energy
levels shown in the figure.
800 Nuclear Processes
Figure 15-30
Fission- barrier model describing the deformation and fragmentation of the compound nucleus
as a function of fragment separation. The potential energy V(r) pertains to a specific pair of
fragment nuclei X and Y.
V(r)
Induced fission
Spontaneous fission
The
old fission theory of Bohr and Wheeler is still useful as a qualitative guide to
the problem of deformation and fragmentation. Their treatment begins with the
liquid-drop model of a spherical nucleus and focuses on the surface energy and the
Coulomb energy as the only radius-dependent considerations. The corresponding
forces of surface tension and electrostatic repulsion are expected to compete with each
other as the excitation of the droplet deforms the original spherical shape. We have
expressed the surface and Coulomb contributions to the semiempirical mass formula
through the terms
Z2
a,
2
y)
A 2/i and a.,—
3^1/3
j-rr
in Equation (14-13). Let us recall the formula for the nuclear radius,
R = R A '\ l
so that we can identify the surface and electrostatic energies by the formulas
V, = AirR
2
s and V = - (15-41)
5 4tte R
In V we
t
introduce a surface energy s per unit area, and in V we
t
use a result quoted in
Equation (14-11). A small deformation of the spherical droplet implies a small growth
in the radius R. We refer to Figures 15-28 and 15-30 and observe that this small
deformation corresponds to a small departure of the separation variable r from an
equilibrium position at r = 0. It is clear from Equations (15-41) that the sum of Vs
and V may r
either increase or decrease with a growth in R, since the one term
increases as the other decreases. Bohr and Wheeler argue that V s
has the larger role, so
that the nucleus resists deformation, if the ratio Z 2/A is less than a certain critical
value. (We compute this number in the first example at the end of this section and
obtain a value approximately equal to 50.) This criterion establishes the existence of
1510 Nuclear Fission 801
This picture accommodates both types of fission, spontaneous and induced, depend-
ing on the energy level of the compound nucleus. We have already associated
spontaneous fission with the problem of barrier penetration for a level below the top
of the barrier. The large masses severely diminish the penetration probability so that
spontaneous fission usually competes very unfavorably with a decay. We have also
associated induced fission with levels above the barrier. We find that thermal-neutron
235 238
fission occurs for U, but not for U, because the thermal-neutron energy level
236
exceeds the barrier for the compound nucleus [
U], but not for the compound
239 238
nucleus [
U]. Thresholds exist in all fission reactions of the type U(«, /) because
of this property.
The Bohr- Wheeler theory does not agree with experiment in several important
The most glaring defect is
respects. its failure to explain the observed asymmetry in
the distribution of fragment masses. This question remains to be solved in a satisfac-
tory theory of nuclear fission.
Let us turn from theory to practice, and discuss the fission process in the context of
nuclear reactors. The design of a reactor takes advantage of the fact that most of the
total fission energy is carried away as kinetic energy by the final nuclear products.
These energetic massive nuclei quickly dissipate their energies in collisions with atoms
in the fuel element of the reactor. The exchanged energy can be drawn off as heat for
the conversion of water into steam, and the steam can be used to turn turbines and
produce electricity.
The essential feature of the fission process in the development of a chain reaction is
the production of neutrons. These secondary particles are able to sustain the sequence
of reactions if each fission, on the average, causes another subsequent fission. The
critical mass defines the minimum amount of fissile material required for a chain
reaction in a given configuration of the material.
The fission sequence may produce an occasional delayed neutron, with about 1%
probability, instead of the usual prompt variety. The delay occurs because the neutron
does not come from a primary fission fragment, as in Figure 15-28, but comes instead
from some other nuclear product in the sequence after one of the intervening /?
decays. The intervening time may amount to a delay of many seconds in the emission
of the neutron. This effect provides a means by which the chain reaction in a reactor
can be controlled. In practice, the shape of a fissile material is designed to allow room
for neutron absorbers (such as boron or cadmium), whose location can be varied to
control the determination of the critical mass. It is clear, however, that no such
mechanical device is fast enough to regulate the prompt neutrons. The design of the
reactor may instead incorporate a configuration of material in which the mass stays
subcritical for prompt neutrons alone and achieves criticality when delayed neutrons
participate in the process.
Fission reactions induced by thermal neutrons tend to have much larger cross
sections than the same reactions initiated by fast neutrons. The reduced cross section
for fast neutrons poses a problem for reactor design because the typical fission neutron
has an energy in the MeV range. A moderator may be introduced in the fission fuel to
solve this problem. The moderating medium contains light atoms (such as deuterium
or carbon), which undergo elastic collisions with the neutrons and thereby degrade the
neutron energies to thermal levels.
802 Huclear Processes
n+ 23y U 239
U + y
2»U-» 239 Np + «-+*
239
'Np 239
Pu + e '+ v.
The Pu nucleus fissions under bombardment by slow or fast neutrons, and so the
use of a moderator is not essential if plutonium is the reactor fuel.
Figure 15-28 reminds us that the secondary nuclei in the fission sequence are
radioactive. Some of these nuclides are long-lived hazardous sources of radiation. The
disposition of this waste material is a very serious consideration for the nuclear reactor
program.
Example
dV d_K 3 Z 2e 2
F= = —87rRs and F. = '
2
~d~R dR 5 4ire Q R
The directions of these two forces are, respectively, inward and outward, as
anticipated. Hence, the equilibrium of the droplet is restorable if the expressions
obey the inequality
2
3 Z'-V
8irRs > 5
5 4tt£ /?
or
Z2 8m
3 '
tf
Let us use the formula R = R A l/3 and rearrange this criterion as follows:
2
Z2 8to Z2 477# J
2
< 2
RlA le /4ire A ' *e*/4ire R
connection between Equations (15-41) and the corresponding terms in the mass
formula:
and
3 Z 2e 2 3 e
2
Z2 Z2
5 4tte R 5 477e /? A ,/3 V
a..
/3 "
2
3 e
AttRis = a.,
2
and = a.,,
3
5 4tte R {)
Z 2
MeV
—
A
\
= 2—
a2
a3
= 2
17.81
0.7105 MeV
= 50.13,
J c
using the values quoted for a 2 and a 3 in Section 14-5. The Bohr-Wheeler
argument tells us that the compound nucleus [^W] has a stable equilibrium
configuration and, hence, a fission barrier ii Z 2/A is less than the critical value.
The choice of numbers Z= 92 and A = 236 gives
2
Z 2
(92)
= 35.9,
A 236
Example
Let us associate the two energy levels in Figure 15-30 with the ground state of
236
U and the excitation state of the system « therma + U. i
235
We can compute the
energies of interest with the aid of a table of masses. (We turn again to
Appendix A and appeal to the original sources of Appendix A as well.) First,
236
[M + M( 235 U) -
n
Af( U)]c 2
We take this result to represent the difference in energy between the levels in the
figure, and we thus neglect the very small kinetic energy of the thermal neutron.
(Note that the curve in the figure plays no role in this exercise. Remember, the
potential energy V(r) refers to the pair of interacting fragments X and Y.) Next,
804 Huclear Processes
and calculate the Q-value for a reaction leaving the residual nuclei in their
ground states:
This calculation determines the total amount of energy released in the form of
energetic residual nuclei, fast neutrons, and prompt y rays. The ground states of
4
Sr and Xe are fi unstable, and so the subsequent decay chains are further
sources of released energy. A final quantity of interest is the following ratio of
released energy and fissile rest energy:
185 MeV
235 2
= 8.45 X 10
M( U)c (235.0)(931.5 MeV)
The result tells us that the prompt release of energy in this particular channel
amounts to a mere 0.08% conversion of mass to energy in the fission of a single
235
U nucleus.
Nuclear fusion occurs when light nuclei combine in a reaction to form a more massive
nucleus in the final state. Nuclear binding energies are such that the fusion of a
typical pair of nuclei at low mass number results in the release of a large quantity of
energy. These reactions are of great interest because of their capability to generate
useful energy on a very large scale. The process has fundamental significance in
astrophysics since fusion is the main contributor to the energy produced in stars. The
The more realistic fusion reactions occur in binary collisions. A good example is
deuteron-triton fusion
d + t -* a + n,
in which the formation of the a particle again results in a large release of energy. This
fusion reaction proceeds rapidly, via the strong nuclear interaction, and generates an
energy of 17.6 MeV. Let us describe the nuclear process by writing
2
H + 3
H -* 4
He + n + 17.6 MeV,
3
He + n + 3.3 MeV
-'H + 2
H 3
H + 'H + 4.0 MeV
and
'H + 3
He -> 4
He + 'H + 18.3 MeV.
This collection of four reactions can conceivably operate together as a combined reaction,
in which the nuclei
3
H and 3 He serve as catalysts for the basic process of ternary
deuteron fusion
3d -> a + n + p.
A net Q- value of 21.6 MeV is obtained in the overall release of energy. To confirm
this result, let us sum the four reactions, cancel the catalysts 3
H and 3 He, and divide
by 2 as follows:
6(
"HJ +
3
H + 3 He -» 2( 4 He + n + 'H) + 3 He + 3 H + 43.2 MeV
2 -* 4
=> 3( H) He + n + 'H + 21.6 MeV.
The practical implementation of these combined processes is among the goals of the
current research program in controlled fusion energy. The fusion reaction generates
less energy than a typical fission process; however, the yield of energy per unit mass of
nuclear fuel is greater in the fusion reaction by more than a factor of 3. Further
advantages in favor of fusion are the nonoccurrence of direct radioactive by-products
and the relatively low cost of fusible fuel.
Every fusion reaction involves a collision of particles with positive charge. The
collision energies must therefore be large enough to overcome the effects of Coulomb
repulsion. Let us illustrate this fact with the aid of Figure 15-31 by considering dt
fusion, d + /
— a +> n. Since the reaction is exoergic, an appreciable nuclear interac-
tion might be expected even for a dt collision at rest. The figure tells us instead that
the reaction cross section is severely affected by the Coulomb barrier for low deuteron
bombarding energiesand that the maximum cross section is finally reached around
100 keV. We can undertake a fusion experiment in a regime of energy favorable for
the reaction, simply by accelerating a beam of particles onto a target. Unfortunately,
such a method is not suitable for the generation of useful energy because the energy
released in the reaction is largely dissipated in the ionization of the target. In
principle, this problem can be circumvented by the use of an ionized target. In
>
Figure 15-31
Kd (keV)
20 40 60 80 100 120
tail of the thermal distribution.) The process is said to be thermonuclear because of this
association between fusion and thermal motion at high temperature.
The confined system of particles fits the description of a fully ionized gaseous
medium known as a plasma. Matter in the plasma state consists of coexisting gases of
nuclei and electrons, the debris of atoms ionized by collisions at high temperature.
Neutral matter on Earth can be maintained in this phase only under extraordinary-
laboratory conditions. The plasma state exists much more commonly in the high-tem-
perature interiors of stars, because of the containment provided by the force of
gravitational attraction.
The origin of the energy emitted by stars was one of the first great mysteries of
astrophysics. The problem was originally put in terms of the Sun, a modest star of
well-known size, mass, and temperature. In time, other information about the Sun,
such as composition, gas density, interior temperature, age, and radiation rate, also
became rather well known. was apparent that a chemical reaction could not
It
account for all these properties. It was especially obvious that a new mechanism was
needed to explain the generation of a very large quantity of energy. H. A. Bethe
found a solution to the problem, for ali the normal stars, in 1938. He proposed the
existence of cycles of thermonuclear processes in which the basic effect was the fusion
of four protons to make an a particle. The resulting proton chain was supposed to cause
15- 1 ] Fusion and Thermonuclear Energy 807
Figure 15-32
Reactions in the proton cycle. The three processes execute the basic proton fusion chain
4p -> a + 2e + + 2v + 2y.
(^
®-^
nuclei by the processes of thermonuclear fusion in stars. The evaluation of the various
reaction rates predicted two specific cycles whose relative influence was expected to
depend on the interior temperature of a given star.
Hydrogen burning proceeds via the proton cycle in stars of the main sequence, for
7
masses up to one solar mass and for temperatures in the range (0.8-1.5) X 10 K.
The proton cycle consists of the following reactions:
+
'H +'H -+ 2
H + e + p,
'H + 2
H ^ He 3
+ y,
3
He+ He^ He
3 4
+ 2('H).
Figure 15-32 shows sequence of processes takes four protons through three
how this
successive reaction stages to yield one a particle along with liberated energy in the
form of positrons, neutrinos, and y rays. We note that two copies of the first two
processes are needed to carry out the cycle, and we find that an energy of 26.7 MeV is
released as the net result. This amount includes the y rays produced when the
positrons annihilate with electrons in the stellar plasma. A 2% share of the released
energy is carried away by the emitted neutrinos and is not observed in the luminosity
808 Nuclear Processes
of the star. We note that the cycle starts at a very slow rate since the first process is
+
governed by the weak interaction. In effect, the first reaction is equivalent to the /8
"H + 12
C ^ N + l:i
y
N ^ nC + e + +
,;i
v
'H + ,:i
C ^ N + y 14
r>
>
H + I4
N -+' + y
O ^ N + e+ +
ls lf,
v
r,
We note that once again four protons are fused in stages to make one a particle. The
other essential property of this cycle is the remarkable fact that C is regenerated,
and not consumed, during the reaction sequence. Thus, carbon participates as a
no large abundance of carbon is needed to sustain the hydrogen-burn-
catalyst, so that
ing chain. The overall reaction rate is much greater, while the loss of released energy
to neutrinos is only somewhat larger, in the carbon cycle than in the proton cycle.
Both of these cycles contribute, although the latter one dominates, in the production of
solar energy.
A star may exhaust its supply of hydrogen and contract, to reach temperatures in
8
excess of 10 K. Helium burning begins in this range of greater temperature, as the
helium nuclei become able to penetrate the higher Coulomb barrier and fuse together
in the formation of carbon. We find that the net result of the cycle of reactions is the
ternary fusion process
4
3( He) -*''C,
and we note that nucleosynthesis is well underway at this stage. Newly formed carbon
acts as a catalyst in the carbon cycle for these temperatures, and carbon fusion also
l2
sets in as the C composition of the star increases. Thus, heavier nuclei appear in a
succession of fusions performed by the (a,y) reactions
4
He + 12
C ^ 16
+ y,
4 16 20
He + T* Ne + y,
4
He + 20 Ne ^ Mg 24
+ y.
At 10 9 K and beyond, carbon burning and oxygen burning commence in the reactions
12
C+ C^ Ne+ 12 20 4
He,
l6
+ 16
-» 28 Si 4
+ He.
The synthesis of nuclei continues via the fusion process until the top of the curve in
56
Figure 14-12 is reached with the formation of Fe.
1511 Fusion and Thermonuclear Energy 809
The building up of nuclei beyond iron takes place through a series of neutron-cap-
ture reactions and occasional /? decays. These stages of nucleosynthesis depend
critically on the availability of neutrons in the stellar plasma. Neutron captures
proceed slowly, allowing time for the decay of unstable nuclei, if the number of
accessible neutrons is small. These slowly progressing syntheses come to an end at the
209
last stable nucleus Bi. Very large numbers of neutrons may also become available
in rarer situations. A rapid succession of neutron captures can then occur, with no
intervening (5 decays, to carry the formation of heavier nuclei to the highest possible
mass numbers. The existence in nature of heavy elements like thorium and uranium is
supposed to be due to such a mechanism.
We have noted that the proton cycle generates energy in stars quite slowly since the
weak interaction controls the first step in the cycle. Laboratory fusion reactions must
rely instead on the strong interactions of the nuclear particles. Candidate processes
include the dt fusion reaction d + t -* a + n, and the deuteron fusion chain 3d — a >
Example
r=(l.2fm)(2 1/3 + 3
1/3
) = 3.2 fm,
V{r) = Z2 Z3 -
e
2
47re n r
= (1)(1)~ —MeV
1.44
—
3.2 fm
-fm
= 0.45 MeV.
810 Nuclear Processes
Actually, the collision energy does not need to be as large as this to give an
appreciable fusion cross section, as Figure 15-31 indicates. Let us ignore the
collision energy by taking the initial dt system to be fused at rest, and let us
construct a variant of Equation (15-10), so that we can predict unique energies
for the final a particle and neutron:
M
—
= K+ K= K \l n
/
+
17.6 MeV
=> K n
= - = 14.1 MeV and K = 3.5 MeV.
1 + 1/4
Such energetic neutrons would be able to escape from the plasma. A shroud of
lithium could be provided to recover the energy and regenerate
3
H through the
exoergic capture reaction
n + 6 Li ^ 4
He + 3 H.
Escaping neutrons would also create other problems, since their interactions
outside the plasma would induce radioactivity and cause radiation damage in
Example
The reactions of the proton cycle generate the proton fusion chain
\p -> a + 2e
l
+ 2v + 2y
in the manner illustrated in Figure 15-32. The released energy is given by the
difference of rest energies (4M., — M a
— 2m )c 2 Let
e
. us reexpress the basic
process by introducing four electrons on both sides:
+
4(p + e~) -» (a + 2e~) + 2(e~+ e ) + 2v + 2y.
e~+ ^
+
— 2y.
We can now identify a total release of energy Q in the form of neutrinos and y
+
rays (including those from e~e annihilation), and also allow for a-particle
recoil, by introducing atomic masses and writing the simple expression
Q.
01
26.7 Me 1
The proton chain supplies the energy radiated by the Sun, with a rate of
26
radiation known to be 4.0 X 10 W. Let us use this number to compute the rate
of proton consumption by fusion in the Sun:
26
4.0 X 10 J/jf
7—————— —, — r- = 3.7 X 10
38
protons/s.
(6.68 Me V/proton)( 1.60 X 10~ l3
J/MeV)
We can estimate the number of available protons in the Sun by supposing that
30
protons constitute the entire 2.0 X 10 kg solar mass:
30
2.0 X 10 kg
7
1.2 X 10" protons.
1.67 X 10 " kg/proton
The Sun can then be expected to exhaust its supply of protons in a time
estimated as
57
1.2 X 10
18
38
3.2 X 10 s.
3.7 X 10 s~'
11
It is comforting to know that the prediction exceeds 10 years.
Problems
1. Positron-emitting radionuclides are produced artificially in (a, n) reactions with '"B and
~' 7
A1 targets. Identify the radioactive nucleus and the corresponding nuclear decay product
for each of these reaction-and-decay processes.
2. The An series begins with the natural radionuclide 23 "Th and ends with the stable nuclide
208
Pb. The chain of decays proceeds through the following steps:
where a branching of a and ji decay modes occurs toward the end of the series. Draw up
a partial chart of the nuclides and make a plot of this decay chain, identifying each of the
nuclides in the series.
t = — and t, ,
= In 2 • t
Y
relating the mean life t, the decay constant y, and the half-life t, _,
for the exponential
14
organic matter, and show that the rate of C decay in a live sample is 1 disintegration/s
for every 4 g of carbon.
6. Let the sequential decay scheme in Figure 15-5 have decay constants such that y, 2
s> y,,,
and assume vanishing initial populations for X , and X 3 . Simplify the rate equations in
this situation, and determine the populations N,(0, N.,(<)> and N s
(/) for < / <K I/Y23.
Sketch a graph for each of these solutions.
7. Let the initial populations for the two-step cascade in Figure 15-5 be such that A', = .Y ;
= at 1 = 0. Obtain the corresponding solutions to the rate equations, valid for any
choice of the parameters y, , and y2j Determine the form of the solutions in the special
.
case Y| 2 ^ Y>-(,
for times much less than l/y^. These limiting results should agree with
calculation is valid.
9. Use data from Appendix A to compute the kinetic energies of a particles emitted in the
decays of Po and U. The results should agree with information plotted in Figure
15-8.
2,
10. Ra is a known a emitter. Recently, this nucleus has been observed to undergo another
14
more rxotic mode of radioactive decay in which C fragments are emitted instead of a
particles. The two decay modes are
223
Ra -> L''''Rn + 4 He and 223
Ra -+ 209 Pb + M C,
where the a decay dominates by a factor of order 10 . Calculate the kinetic energy of the
emitted fragment in each case, using M( 20<, Pb) = 208.981080 u and M( 2l9 Rn) =
219.009485 u along with other mass data obtainable from Appendix A. Estimate the
height of the Coulomb barrier for each of the decays.
i2
11. The nuclide P is ($ active. Identify the decay process and use masses given in
4
$Co ^Ni ^Cu ^Zn (
;
Ga
63.935812 u 63.927968 u 63.929766 u 63.929146 u 63.936837 u
Identify every possible B transition in this system and compute the Q-value for each
process.
40
15. The nuclide K is B unstable. Identify all possible ft processes and compute the
corresponding disintegration energies. The relevant atomic masses are listed in Appen-
dix A.
16. Refer to Figure 15-18 and classify the three indicated transitions according to the selection
rules for B and y decay.
Problems 813
17. Predict the radiation multipole for each of the y-ray transitions shown in the figure.
0.81 s
2*
0'
*
1 3;..
0.80 s
n 1.064
'•/,
15.
2.319
%" >' 0.13 ns
IT
0.909
0.570
'/.
' 0* V
)
J -7
,Pb
Refer to the photon absorption process in Figure 15-21, and show that the incident photon
energy ea must exceed the transition energy A£ by the amount (&E)i2 /2Mc~.
A 14.4 keV y ray is emitted in a transition to the ground state in 57
Fe. The half-life of the
excited state is 98 ns. Calculate the natural width of the excited state and the recoil kinetic
energy of the nucleus. Determine the relative velocity required between emitter and
absorber to meet the criterion for resonance radiation.
20. The Q-value for the two-body reaction in Figure 15-23 can be expressed in terms of
measurable kinematical quantities by the equation
M Jmm
= K\ 1 4- —-\ - K,
\
- 2- -JKKcos
~M~) My V '
'
in which mass numbers may be substituted for the various nuclear masses. Derive this
protons be detected at 9 = 90°. Calculate the Q-value using mass data from Appendix A,
and predict the kinetic energy of the observed protons. Determine the threshold energy
required for accelerator-produced a particles in the same endoergic reaction.
46
22. The energy levels in Sc can be deduced from the energy spectrum of outgoing protons,
detected at fixed angle, in the reaction
45
Sc(y, /?)
46
Sc. Let 6.974 MeV deuterons be
45
incident on a Sc target, and consider proton detection at 37.5°. Predict the energy of the
46 4fa
protons for the case where Sc is left in the ground state and for the case where Sc is
4b
left in an excited state at excitation energy 0.4441 MeV. The atomic mass of Sc is
2 2
(W- A£) + (r/2)
and show that T gives the full width of the resonance peak at half-maximum, as indicated
in the drawing. Refer to the graphs in Figure 15-26 and estimate the full width T and the
27 28
partial widths T„ and Ty using the, fact that, at the energies given, A1 + n and A1 + y
are the only channels open for reactions initiated by n + 27A1 collisions.
IV
sl<
i0
25. Resonances are observed in the reaction ~'A\(oc, p) Si at a-particle bombarding energies
.'5.95, 4.84, and 6.57 MeV. Compute the excitation energy for each of the corresponding
26. A beam of a particles is incident on a Be target, and a resonance is seen at 1.732 MeV
beam energy. Calculate the excitation energy of the corresponding state of the compound
l2
nucleus. The same resonant state occurs in neutron collisions with a C target. Compute
the value of the neutron beam energy at the position of the resonance.
27. Use the Maxwell-Boltzmann distribution of particle kinetic energies to deduce a velocity
distribution function for thermal neutrons. (The second example in Section 2-3 is a useful
place to start.) Determine the temperature dependence of the most probable velocity, and
show that the corresponding neutron kinetic energy is equal to k H T Calculate the value
.
"M
Mo+ m Te + 3* and 94
Sr
143
Ba + 3n,
as well as many other possible sets of nuclear products. Calculate the prompt energy
release in the two instances, given the masses (in u) 93.91523, 103.91358, 132.91097, and
142.92055 for the Sr, Mo, Te, and Ba nuclides.
30. Consider the symmetric fission of the compound nucleus [^Wj into a pair of like nuclei.
Deduce an expression for the height of the Coulomb barrier in the form CZ 2 /A {
'
, and
calculate a value for the constant C.
32. Compute the energy released in each of the four stages of the cycle that make up the
ternary deuteron fusion process 3d —* a + n + p, and confirm the result quoted in the
33. Determine the energy released at each stage of the carbon cycle. Relevant masses not
given in Appendix A are A/(
n N) = 13.005739 u and M( ]
''Q) = 15.003065 u.
SIXTEEN
ELEMENTARY
PARTICLES
Our study of the nucleus has already introduced us to the reciprocal domains of
high energy and small scale. We proceed now to higher regimes of energy as we turn
to the examination of matter and the interactions of particles at shorter range.
Relativity must also play an essential part in any investigation involving collisions at
high energy. We find that, i the- collision energy increases, a succession of thresholds
occurs where the energies are sufficient to create the masses of new particles. Our
small family of elementary particles thus becomes a rather large array of species,
whose ranks still continue to grow in number. Nature has furnished some of the
8'b
816 Elementary Particles
evidence for these particles in cosmic-ray processes, while the great body of experi-
mental information about the known particles has come primarily from accelerators
and detectors, the sophisticated machinery of high-energy physics.
Elementary particle theory has its own elaborate apparatus for treating phenomena
at high energy. The theory is concerned with the principles pertaining to the four
main forces of nature — the strong, electromagnetic, weak, and gravitational interac-
tions of particles. We have already been introduced to the first three forces in such
problems as the nuclear binding of protons and neutrons, the emission of photons by
atomic and nuclear systems, and the /? decay of radioactive nuclei. Our goal is to
understand the distinguishing features and, especially, the unifying characteristics of
these very different interactions.
Particle physics is currently enjoying a period of unprecedented fundamental
progress, following decades of previous developments in phenomenology. The theoreti-
cal discoveries include a general formalism for a theory of the interactions of particles,
a basic principle to underlie all theories, and a demonstrable realization of such a
theory in successful agreement with experiment. Achievements in the laboratory have
been just as momentous. The experiments have given timely confirmation of the most
and continue to point the way toward possible new
crucial theoretical predictions
physics.
We examine these more recent discoveries in the latter part of the chapter, after we
present the necessary phenomenological background. The phenomenology is im-
portant in its own right because the observations describe the properties of the
particles, and because and
the interpretations indicate the organization of the large
growing particle family. A
scheme emerges from this investigation,
classification
bearing new conservation laws and quantum numbers. These attributes of the
particles represent new symmetry properties of the fundamental interactions. We learn
that it is possible to implement the concepts of symmetry by introducing a substructure
for the particles themselves. Thus, a collection of uniquely designed constituents takes
its place in the theory on a scale smaller than that of the observed particles. This
substructure reveals its existence in certain kinds of experiments. The notions of
symmetry and substructure finally culminate in a theory of the fundamental interac-
tions, where the basic principles operate among the new particle subentities at the
Particle phenomena were studied in cosmic-ray processes before the coming of the
high-energy accelerator. Several of the original elementary particles were discovered
in these investigations. The existence of cosmic radiation was proposed in 1911 by
V. F. Hess in order to explain the presence of ionization in the atmosphere. It was
suggested that radioactive elements on Earth might be responsible; however, this
hypothesis was ruled out when it was found that the ionization increased with altitude.
Hess argued on the basis of his experiments that the radiation must be coming from
outer space.
Cosmic rays are classified as primary and secondary forms of radiation. The
primary component refers to the extremely energetic radiation that impinges from
space on the upper atmosphere of the Earth. The secondary component is produced
copiously as the result of collisions of primary radiation with particles in the
atmosphere. The primaries are known to be particles with positive charge because of
their characteristic deflections in the Earth's magnetic field. Most of these particles are
161 Introduction to High-Energy Physics 817
protons, although surprising abundances of other nuclei are also observed. The flux of
the radiation does not vary with time or direction in space as the particles enter the
magnetic field of the Earth. The most striking aspect of the primary radiation is its
energies around tens of MeV. The cyclotron was superseded in 1945 by the invention
of the synchrocyclotron and its immediate successor the proton synchrotron. The
innovative features of these accelerators were developed independently by V. I.
Veksler and E. M. McMillan. Their contributions to machine design made it possible
to accelerate beams of protons to hundreds of MeV and, eventually, to energies far
beyond.
Detectors also had to be devised so that the subnuclear particles could be seen in
the experiments. Two particular types of apparatus, the cloud chamber and the
bubble chamber, should be mentioned because of their place in history and because of
their instrumental relation to each other. The cloud chamber was constructed in 1911
by C. T. R. Wilson and was put to immediate use in nuclear and cosmic-ray
818 Elementary Particles
Ernest Lawrence
experiments. Electronic counters were added to the instrument so that the operation of
the chamber could be triggered by incoming particles. This modification was built
into the design of the cloud chamber by P. M. S. Blackett in 1931. Most of the early
pictures of elementary particle processes were taken with the aid of these detectors.
The bubble chamber was invented in 1952 by D. A. Glaser and proved to be a more
sensitive device for observing the tracks of particles. High-energy physics enjoyed a
time of remarkable productivity as a result of the introduction of the bubble chamber
in the accelerator laboratory.
Each of the high-energy devices operates on the basis of classical principles. The
Cockcroft Walton generator accelerates protons through a large potential difference,
built up by a high-voltage transformer and voltage-multiplying circuits. Machines
with the same basic design are still employed today as low-energy injectors for the
higher-energy accelerators.
The cyclotron is a circular accelerator, shown schematically in Figure 16-1. in
which protons are guided in semicircular orbits by magnetic fields and are boosted to
successively higher velocities by intervening electric fields. The figure shows how the E
field in the gaps of the cyclotron alternates in direction synchronously with each
semicircular pass of the proton, and how each accelerating boost enlarges the radius of
the orbit. Equation (1-45) governs the relativistic motion of the proton and reduces to
the following nonrelativistic expression for the angular velocity:
v eB
eBR
R
Figure 16-1
required to ensure synchronism with the reversal of the electric field. Relativity
eventually takes effect with the replacement of the proton mass M by the relativistic
form yv M, an increasing function of the velocity v. The orbiting time must therefore
begin to grow with v, so that the faster protons arrive at the gaps behind schedule and
the synchronous operation of the cyclotron breaks down.
Synchronism can be recovered by varying the frequency of the applied E field.
The synchrocyclotron incorporates this mechanism to achieve the acceleration of
protons at much larger energies. The strength of the B field may also be varied to
gain the same effect. The relation p = eBR holds for a stable circular orbit and
suggests that, if R is kept fixed, a small increase in B can cause a small increase in the
momentum p. The proton synchrotron makes use of this notion, along with a slowly
changing £"-field frequency to achieve synchronization, and along with certain focus-
ing techniques to stabilize an equilibrium orbit of fixed radius. Accelerators of this
type are not limited in their design energy by any theoretical considerations.
Both the cloud chamber and the bubble chamber are optical detectors in which the
paths of charged particles are seen as visible tracks. The tracks are made in the cloud
chamber from droplets of liquid suspended in a gas. In operation, a volume of air and
vapor in the chamber is allowed to expand and undergo a reduction in temperature.
The vapor becomes saturated so that a condensation of droplets can occur along the
ionized path left by a charged particle passing through the chamber. The supplemen-
tal use of electronic counters enables the expansion of the chamber and the photo-
graph by the arrival of the incoming particle. The tracks
of the track to be initiated
are made in the bubble chamber from bubbles of gas suspended in a liquid. In
operation, the liquid is kept below the boiling point and is then allowed to expand.
The reduction in pressure lowers the boiling point so that localized boiling can occur
to form bubbles along the path of a charged particle moving through the chamber.
The visualization of tracks makes it possible to analyze the kinematics of the observed
particles in both devices. These procedures have been used extensively to reconstruct
events in high-energy collisions.
It should not be assumed that every accelerator is designed for the bombardment of
two beams circulate in opposite directions in separate storage rings and undergo
collisions where the rings intersect. This design has certain advantages from the
standpoint of usable energy. Since the center of mass of the colliding particles is taken
to be at rest, all the collision energy is available in the system for the initiation of CM
reactions and, especially, for the creation of new particles. A most impressive example
of such an accelerator is the proton-antiproton collider at the- CERN laboratory in
Switzerland. The colliding particles in this machine can have up to 270 GeV kinetic
energy each of the two beams. Of course, many more collisions can occur for a
in
single beam incident on a dense target. The energy advantage is overriding, however,
if high energy is the main goal of the design. A comparison of energies can be drawn
up with the aid of a previous example, discussed at the end of Section 1-10. The
formula of interest is the relation between the total kinetic energies A' and A" in the
two frames where the target particle is at rest and where the center of mass is at rest.
The earlier example gives this relation in terms of the two total relativistic energies. A
simple conversion of energies leads us to the desired result:
Note that the formula applies to collisions of equal-mass particles and that A appears
as a quadratic function of A'. Some of the striking consequences of this result are
demonstrated in the following numerical illustration.
Example
Equation (16-1) can be used to determine how large the single-beam energy A
must be in a fixed-target accelerator to produce the equivalent energy of a CM
given colliding-beam accelerator with collision energy A'. Let the colliding
particles be protons in each instance and, for convenience, let the proton mass be
approximated as 1 GeV/c 2 The formula then becomes
.
1 + —
A'
where both energies are expressed in GeV. The numerical results are listed for
/
540
K= 2(540) 1 + - -
\
= 2(540)(136) = 146,880.
The energies of the colliding beams in this case are the same as those for the
protons and antiprotons in the giant CERN collider.
The main goal of particle physics is to understand the interactions of the fundamental
particles. We know that the strong nuclear interaction accounts for the binding of
protons and neutrons at short range, with energies of the order of several MeV per
nucleon. We
have already speculated that the proton and neutron are not fundamen-
tal and that the force between the particles is a demonstration of a more basic
interaction among more elementary entities. The electromagnetic interaction is sup-
posed to be a fundamental mechanism for the emission and absorption of photons and
the behavior of charged particles. The Coulomb force yields the structure of the atom
as one of its manifestations. We know that this force has infinite range and produces
atomic binding energies of the order of several eV. Nuclear /? decay is attributable to
the weak interaction. We have assessed its extraordinary weakness in Section 15-5 by
estimating the cross section for the absorption of neutrinos in matter. The weak force
is known to have a very short range by the fact that leptons are created in the
immediate vicinity of a nucleon in the n —> p transition. We are able to judge the
relative strengths of the strong, electromagnetic, and weak interactions when we
compare the cross sections in typical reactions or the lifetimes in typical decays. It is
obvious that the interactions differ very substantially as to range, strength, and energy
scale. Consequently, the prospects for unifying the forces of nature according to some
common principle may appear to be quite remote. In fact, the most notable
development in the theory of elementary particles has been the achievement of a
fundamental unification among some of the forces.
The theoretical apparatus of particle physics is provided by the relativistic quan-
tum theory of fields. Our understanding of particle theory depends on this kind of
machinery as much as our progress in experiment relies on the accelerator. The
formalism of field theory is too advanced for the purposes of this text. Fortunately, the
theory produces certain graphic techniques that furnish an intuitive picture of the
essential ideas. We wish to adopt these visual techniques so that we can gain a
qualitative understanding of particle behavior. We find that the Schrodinger equation
must be set aside because the Schrodinger theory is limited to the nonrelativistic
treatment of a fixed number of particles. We turn instead to quantum field theory in
order to incorporate relativity with the principles of quantum mechanics and provide
for the creation and destruction of particles.
Relativistic quantum mechanics has been discussed briefly in Section 8-10 in the
context of Dirac's theory for spin-^ particles. We recall that the prediction of
antiparticles is among the conclusions of this remarkable theory. The Dirac equation
2
for a free particle has solutions for any relativistic energy above the value mc ,
or
2
below the value -mc The . antiparticle concept stems
of from the latter class
Figure 16-2
Energy levels for a free Dirac particle. The vacuum is interpreted as a sea of fully occupied
negative energy states, and a hole in the sea is interpreted as an antiparticle with positive
energy.
'
hv > 2mc 2
seems to imply that a particle can make an unending series of radiative transitions to
lower energy and emit photons indefinitely. To prevent this instability, Dirac's
hypothesis defines the no-particle state (the vacuum) as a system in which every
negative level is fully occupied by a spin-up and a spin-down fermion. A particle at
positive energy is then unable to fall below the level E= mc 2 because the exclusion
principle prohibits transitions to the filled levels at negative energy. The argument
goes on to consider the effect on the vacuum state of an incident photon with energy
greater than 2mc . The figure shows that the absorption of the photon excites a
particle to a positive energy level and leaves a hole in the negative energy sea. The
absence of a negative-energy particle is interpreted as the presence of a positive-
energy antiparticle. We can express this absorption process in terms of electrons and
positrons by writing
di
y + vac —> e + e.
Thus, the phenomenon shown in the figure has the same behavior as the process of
electron-positron pair production. Dirac's argument relies only on the Pauli principle,
and so the prediction of antiparticles applies to fermions of all sorts.
Confirmation of Dirac's prediction came in 1932 with the discovery of the positron
by C. D. Anderson. The new particle was found in a cosmic-ray experiment using a
cloud chamber with an applied magnetic field. The events of interest showed tracks
originating in a thin lead plate and curving in opposite directions in the applied field,
as indicated in Figure 16-3. Since no entry track was visible, Anderson concluded that
the process was initiated by a cosmic-ray photon and that the oppositely charged
tracks were due to electron-positron pair production.
The first principles of quantum field theory were introduced before 1930 by Dirac,
Heisenberg, Pauli, and others. Their work was beset by profound difficulties that
delayed the formulation of a consistent theory for many years. The most urgent
16-2 Particles and Fields 823
Figure 16-3
Cosmic-ray photon
Lead plate
Cloud chamber
given to the theory of the electromagnetic interaction. This feat was achieved during
the 1940s through the work of Feynman, J. Schwinger, S.-I. Tomonaga, and F. J.
Dyson. Very precise calculations were performed with the theory, and the results were
compared with measurements of similar precision. The credibility of quantum elec-
trodynamics drew support from the fact that extraordinary agreement was obtained in
every case. Two of the more stringent tests of theory and experiment were the
determinations of the Lamb shift and the g-factor for electron spin, quantities already
mentioned in Section 8-10. Quantum field theory has also been applied to the other
interactions. It would be premature to discuss the consequences until more has been
said about these other forces later in the chapter.
Let us turn our qualitative comments about field theory to better advantage now,
and extract some tangible benefits for later use. The theory assigns a field to each
fundamental particle and casts the interactions of particles in terms of the correspond-
ing interactions of fields. The field varies in space and time and has the quantum
mechanical properties an operator that creates and destroys the associated particle
of
in states of definite momentum and energy. Thus, as an example, the electromagnetic
field (actually, the electromagnetic vector potential) acts as a quantized field and
Richard Feynman
absorbed into the field of the other. The bremsstrahlung photon and the exchanged
photon are said to be real and virtual, respectively, since the one can be observed in a
detector while the other cannot.
Figure 16-5 summarizes the main points to assume in our heuristic introduction to
quantum field theory. The basic diagrams of quantum electrodynamics are drawn as
vertices describing photon emission and photon absorption by a particle of charge e.
Each vertex contains the world-line of a moving charge, and so the corresponding
element of the theory has the form of a current, as indicated in the figure. The coupling
of the photon to the current represents the interaction between the electromagnetic
field and the charged particle. This interaction is proportional to the charge e, since e
included in the figure beside each of the vertices. Powers of e can then be associated
with diagrams in which several vertices are connected together. For instance, there are
two vertices in the scattering diagram in Figure 16-4, and so the corresponding
amplitude for the elastic scattering of electrons must be proportional to e It is .
16-2 Particles anil Fields 825
Current
ii7E hc
and rewrite the dependence on the charge so that a power of a is substituted for every
two powers of e. Graphical techniques based on currents, quanta, and coupling
constants are similarly employed for the other fundamental particle interactions.
Example
Let us recall that the interaction of an electron with an applied magnetic field is
expressed in terms of the magnetic moment of the particle. We can represent the
interaction with the aid of Figure 16-6 by showing the behavior of the electron
current under the influence of the applied field. The indicated diagrams describe
Figure 16-6
Feynman diagrams defining the magnetic moment of the electron. The indicated contributions
to gs are of order a a ,
1
B® -
826 Elementary Particles
Many of the early ideas about particles were motivated by observations taken from
nuclear physics. The theory of the nuclear force was originally regarded as fertile
ground for particle concepts, since the interactions of nucleons were presumed to be
explainable in a fundamental theory of the strong interaction. Quantum field theory
had been used to express the theory of the electromagnetic interaction, and so the
same methods were expected to be applicable to the force between nucleons. The two
conspicuous aspects of this force, which any proposed nuclear field theory would have
to accommodate, were the properties of short range and charge independence.
A new kind of field, called the meson field, was introduced in 1935 by H. Yukawa to
act as a mediator for the interactions of nuclear particles. The field was assumed to be
composed of quanta, (ailed mesons. The exchange of a meson was supposed to furnish
the basic mechanism for the strong force between a pair of nucleons, as in the
exchange diagram shown in Figure 16-7. These notions were patterned closely after
the properties of photons, the quanta of the electromagnetic field. Yukawa made
allowance for the short range of the force by endowing his hypothesized mesons with
mass. Charge independence could also be incorporated in the theory by letting the
mesons exist with different charges. The analogy with quantum electrodynamics went
further to include a mesonic version of the bremsstrahlung process. The theory
predicted meson emission, also shown in Figure 16-7, in which a quantum of the field
was allowed to materialize as a meson whenever a nucleon passed within range of
real
another nuclear where the collision energy was enough to
particle, in circumstances
create the meson mass. In 1947 the predicted meson was discovered as a cosmic-ray
particle in the execution of this particular process.
Figure 16-7
Exchange of a virtual meson between nucleons and meson emission by a nucleon in the field of
a nucleus.
M .^
16-3 Mesons and the Nuclear Force 827
The mass meson and the range of the nuclear force can be related to each
of the
other. Let us use theFeynman diagram for virtual meson exchange in Figure 16-7 to
deduce the relation. The meson is emitted by the one nucleon and absorbed by the
other in the indicated time interval A«. Consequently, there is a temporary increase in
the total energy of the two-nucleon system by an amount at least as large as the meson
rest energy mc 2 This momentary violation of energy conservation cannot be detected
.
since the exchange of the virtual meson cannot be observed. Thus, the energy of the
system is uncertain for a time At by an amount AE > mc 2 and so the uncertainty ,
h h
A£ mc
h
cAt < —
mc
because the velocity of the exchanged particle must be less than c. This argument
serves to define the spatial extent of the meson field, and hence the range of the
nuclear force, as
r =— h
• (16-2)
mc
The result is such that 2irr is equal to the meson Compton wavelength h/mc. We
note that r and m are inversely related. Hence, if the general formula is also applied
d
p
2 — -
> h
2
V 2
and E — ih—> .
dt
In this case the differential operations are assumed to act on a meson wave function
4>(r, t). We obtain the partial differential equation for $ by the familiar substitution:
la
i a 2
m 2c 2
-cY + E 2 = m 2c* => V 2
- - tt^
of-
c
= -TT*-
n
(16 - 3 >
V 2 $=— $ (16-4)
4>(r)=4> .
(16-5)
We leave the proof of this result to Problem 7 at the end of the chapter. Equation
(16-5) describes a static meson field whose exponential fall-off with r is controlled by
the value of r . This scale parameter therefore plays the role of a range, determined
by the mass of the meson in accord with Equation (16-2).
The meson theory of the nuclear force called for three differently charged species of
meson, with charges +e, 0, and —e, in order to allow for the property of charge
independence. Evidence for this feature of the nucleon-nucleon interaction came
cyclotron was used to accelerate deuterons, and these particles were stripped of their
protons to produce a beam of energetic neutrons. Hydrogenous material was placed in
the beam to furnish a dense target of protons, and detectors were supplied to
determine the angle dependence of the scattered neutrons. A regime of energy was
chosen so that neutron diffraction would result, and indeed a large cross section was
observed for scattering in the forward direction. It was especially interesting to
discover that the cross section also showed equally large scattering in the backward
direction.
We show the results of the experiment and interpret the results in the two parts of
Figure 16-8. These illustrations employ the center of mass system as the ideal frame
for picturing the forward -backward symmetry of np scattering. Large backward
scattering is shown in the upper part of the figure as a growth of the differential cross
section toward the 180° direction. The symmetry of this angular distribution is
explained in the lower part of the figure in terms of two kinds of contribution to the
neutron proton interaction. The expected diffraction of neutrons at forward angles is
seen as direct scattering, associated with the effect of a short-range meson field and the
exchange of an uncharged meson. The observation of scattered neutrons at backward
angles is interpreted as charge-exchange scattering, in which the nuclear force transforms
neutrons into protons inside the region of interaction. This feature of the force between
nucleons is evidence for a charge-changing property of the meson field, which necessi-
tates the exchange of charge-bearing quanta between the interacting particles. The
symmetry of the scattering for forward and backward angles implies an equality of
contributions from the exchange of the corresponding neutral and charged mesons.
These observations enable us to express the charge independence of the nucleon-
nucleon interaction as a characteristic ingredient of the meson theory. Evidently,
charge independence is a symmetry property of the interactions of mesons with
nucleons, which constrains the nucleons to emit the differently charged mesons with
equal probabilities. It is interesting that the quanta of the meson field can carry charge
momentum and energy. This notion is
as well as a bold extension of the ideas of
quantum field theory. Quantum electrodynamics has no such behavior, since the
emission of photons does not cause a change in the charge of the emitting particle.
The notion has real substance because charged mesons do exist among the observed
particles and because their properties indeed conform to the symmetrical predictions
implied by charge independence.
16-3 Mesons and the Huclear Force 829
Figure 16-8
Differential cross section for np elastic scattering. The data are plotted versus neutron-scattering
angle in the CM system, and the results are interpreted in terms of direct and charge-exchange
scattering. A
symmetrical distribution of scattered neutrons implies a symmetry of the nuclear
force resulting from the exchange of neutral and charged mesons.
:(.)
90 MeV neutrons
in
60 120 180
e (deg)
cu
Forward angle
Backward angle
Charged
exchange
Meson theory has not been a great success, primarily because the strong interac-
tions of nucleons are far more complicated than the simple exchange of single mesons
suggests. In fact, meson theory is not fundamental at all, as quantum electrodynamics
is, since the particles of the theory are not fundamental and, hence, not qualified to
serve as basic elements in a field theory. The connection between the range of a field
and mass of an exchanged quantum is still a very useful idea. The property of
the
charge independence is also very important as a symmetry to simplify the phenome-
nology of nucleons and mesons. We give more attention to this property in other
sections of the chapter.
Example
Let us examine the range-mass relation and extract a numerical estimate for the
meson mass. Equation (16-2) can be rearranged to give the rest energy:
he 197.3 MeV •
fm
830 Elementary Particles
Then, if we choose a typical range for the nuclear force to be 1.50 fm, we obtain
197 MeV •
fm
mc 2 = = 131 MeV
1 .50 fm
as the predicted value for the rest energy of the exchanged meson.
New particles began to make their existence known in cosmic-ray events in the 1930s,
starting with Anderson's discovery of the positron in 1932. The muon was the next to
be seen in 1937, in experiments performed by Anderson and S. H. Neddermeyer.
These studies of the secondary cosmic rays found evidence of penetrating charged
particles whose radiative behavior in matter was not understood in terms of properties
of the known particles. The anomaly was at first thought to be due to a failure of
quantum electrodynamics, the theory of the radiative process. The experiments
indicated a more constructive alternative, however, in which the observed penetrabil-
ity of the particles in matter was explained by proposing a new species of particle,
with charge + e and with mass two orders of magnitude larger than the electron mass.
The new was temporarily given the name mesotron, or fx meson, because it
particle
was quantum of Yukawa's field theory. This
originally believed to be the predicted
belief collapsed when it was demonstrated that the particles penetrated matter with
little interaction, so that the candidate could not qualify for identification as a strongly
interacting meson. Instead, the observations established the existence of the two muons
f
ju and ju. as new particles unrelated to the nuclear force. The detailed properties of
tin muons were eventually determined in the course of accelerator experiments during
the 1950s.
The oppositely charged muons are spin- ^ fermions with mass 106 MeV/c". We
can rule out the possibility that these are Yukawa's mesons because the quanta
exchanged in Figures 16-7 and 16-8 must be integer-spin bosons in uncharged as well
as charged varieties. The muons are unstable and decay via the fi process
— T
ju. > e + 2 neutrinos,
Successive decays m -* fi
-* e in a cosmic-ray
exposure
v I
.. .
'
.
-i
particles were produced in meson-emission reactions, like the one in Figure 16-7, in
which high-energy nucleons emitted pions in collisions with atomic nuclei in the upper
atmosphere. The photographic plates also showed the decays of the unstable charged
pions into muons. The processes were interpreted as
+ v and /x + v,
77° —> y + y,
an electromagnetic decay process. Pion production was studied with the aid of
accelerated beams of protons and neutrons in such reactions as -
+
+ n + n + 77
( p + n + 77
p + p
-»
o
and " + P '
a + /' + 77°
\p+P+ 77
P + P + 77
,ir~ + p
77 + p —* 77 + p and 77 + p
have given indications of several resonant states in the pion-nucleon system. The cross
sectionshave a scale typical of the strong interactions, and the resonances stand out as
unique excitations of the interacting particles. Pion-nucleon collisions also lead to
other kinds of final states, revealing the existence of new types of particles. We return
to this development in another section.
Pions have spin and parity quantum numbers that represent intrinsic qualities of
the particles. We must turn to experiment to learn such properties, in the same spirit
as for the nuclear spin and parity p in the case of nuclei. Experiments sensitive to
i
these characteristics tell us that the pion is a spin-0 meson with odd parity. Such
particles are designated by 0~ quantum numbers and are called pseudoscalar mesons.
The arguments for this assignment are discussed in two of the following exercises.
Example
We can get a very rough estimate of the scale expected for a strong interaction
cross section by computing the area of a disk of radius r j, the range of the
nuclear force A typical choice for r gives the result
2
77r
2
= 77(1.5 X 10" I5 m) = 7.1 X 10- 30 m = 2
71mb.
164 Muons and Pions 833
In fact, the cross sections for the elastic scattering of protons by protons and
pions by protons vary with the collision energy and have values of order tens of
millibarns.
Example
The spin of the pion is known from a comparison of the cross sections for the two
inversely related processes
+ +
p + p -» 77 + d and 77 + d -> p + p.
The rate of each reaction is determined by the probability for producing the
respective final particles and is proportional to the number of final states
available at the given energy. In particular, the cross section for the first of the
two reactions contains a spin-multiplicity factor for the deuteron, given by
2i d + 1 =3 for i . = 1,
along with a similar factor (2sv + 1) pertaining to the unknown pion spin s„.
The cross sections are also proportional to the squared moduli of the complex
reaction amplitudes
+ 2 + 2
\x(ir d, pp)\ and \x(pp,* d)\ -
These quantities are identical if the processes are compared at the same values
of the CM energy. The ratio of cross sections therefore contains the factor
{2s n + 1 ) as the only unknown ingredient. This sort of analysis can be applied to
the two reactions to give the result s„ = 0.
Example
tt~+ d —» n + n
and from the fact that the overall parity cannot change in a process governed by
the strong interaction. The reaction of interest proceeds from an initial bound
state corresponding to a 77-mesic atom, in which the ir~ meson replaces the
electron in its atomic orbit around the deuteron. A strong ir~d interaction occurs
in the initial {= orbital state, so that all the initial angular momentum
quantum numbers are known quantities:
K= °> i
d
= !> and ^
= °-
The total angular momentum quantum number must therefore have the value
j = 1 in the initial state, and conservation of angular momentum implies j = 1
in the final state as well. The spins of the two final neutrons are constrained by
the Pauli principle, so that the only allowed spin state is the triplet with total
spin quantum number 5=1. (The s = alternative is ruled out because s =
834 Elementary Particles
function for the two identical fermions.) These arguments result in a unique set
of final angular momentum quantum numbers,
s = 1 , c" = 1 , and 7=1,
since no other value of ( satisfies the constraints. The corresponding parity is
odd, and so the overall parity of the ( = m~ d system must also be odd because
of parity conservation. We assemble the initial parity multiplicatively from the
three separate factors
Thus, we find the overall parity to be odd if we assign the pion an odd intrinsic
parity.
16-5 Neutrinos
The weak decays of the muon and pion call attention again to the neutrino and the
idea of lepton conservation. We have identified the neutrino v to be the neutral,
almost massless, spin- !
, fermion that occurs undetected along with the emitted
positron in nuclear (i
y
decay. We recall that the particle v is designated as a lepton.
The antilepton v is then distinguished from v according to the Dirac interpretation of
an antiparticle. We must now refine these bookkeeping procedures to make allowance
for other kinds of leptons and, especially, other kinds of neutrinos.
The lepton concept must be enlarged in order to accommodate the results of the
two-neutrino experiment of 1962. Let us divulge the conclusion before we describe the
experiment itself. We consider neutrino processes, which may be either reactions or
decays, and we observe the neutrino that appears in association with a muon to be
intrinsically different from the neutrino that accompanies an electron. The two varieties
7r^ix + + v
fi
md M + "«>
n — p +> e + ve .
Neutrinos have served as beam particles through the use of reactors since the 1950s
and through the use of accelerators since the 1960s. The neutrino-induced production
of leptons may yield either muons or electrons in accordance with the particular type
+
of incident neutrino. Thus, there are reactions that lead either to ju~ and ju ,
v^ + n p + fi and v^ + p -> n + /x ,
+
or to e~ and e ,
— p + +
vr + n > e~ and ve -\-
p —* n + e ,
The two-neutrino experiment was among the first to use an accelerator as a source
of high-energy neutrinos. A proton beam from the accelerator was directed onto a
nuclear target to produce fast forward pions, and the decays of the charged pions
provided the desired neutrinos. Meters of steel were placed in the beamline to absorb
muons and The resulting neutrino beam was then allowed
other unwanted particles.
to pass through an array of aluminum plates so that observations could be made of
the occasional interactions between the beam particles and the nuclear material in the
plates. These neutrino collisions were found to yield muons in every case. Equal
numbers of electrons and muons would have been seen if there had been no difference
between v„ and v. The observation of muons alone showed instead that the incident
neutrinos, originating in m decay, were identifiable as distinct muon-neutrinos.
The interpretation of this experiment suggests a classification of leptons into
subgroups, or leptomc generations, in which a distinction is introduced on the basis of an
enlarged set of conserved lepton quantum numbers. These new lepton numbers are
regarded as internal attributes that express the electron-lepton and muon-lepton
content of the various particles. Thus, the conserved lepton number L, introduced in
Section 15-5, turns into the pair of separately conserved lepton numbers L and
e
L^.
These quantum numbers are assigned to the different generations as follows:
while
M
v
and
V
v.,
have(L = f 0, L = +1 and -l).
M M
We see that the separate conservation laws for L and L^ are obeyed
e
in the /? decays
of the neutron and pion, as described above. The charged pions also exhibit the rare
electronic decay modes
+ '+
e + v, and w~ v.
as a further illustration. We note that the new scheme of conserved quantum numbers
+ -> +
immediately accounts for the nonoccurrence of the radiative decay ju. e + y. The
observed /? instability of the muon involves the emission of two neutrinos, whose
identities must be specified to conform to these ideas. If we write ju." decay as
+
T T
and nth (L, = 0, LM = 0, LT = + 1 and - l)
VT i>T
T
as the third generation of leptons. The extended principles of lepton conservation are
836 Elementary Particles
to cite only a few examples. All these remarks presume the existence of a distinct
tauon-neutrino. This conjecture has not been confirmed directly by experiment.
Several unanswered questions about neutrinos still stand out. The topics of greatest
concern are the mass of the neutrino, solar neutrinos, and neutrino oscillations. These
problems have important bearing on the theory of weak phenomena and on the
theory of astrophysical processes.
Until recent times the mass of the neutrino has not been accessible to decisive
measurement, so that values of the mass have been found only in terms of upper
bounds. We have mentioned a Dossible approach to this question in Section 15-4 in
the context of a current study of the endpoint in the /? spectrum of triton decay. The
quoted lower bound of 30 eV refers to the rest energy mv c2 for the electron-antineu-
trino emitted in the process
3
H -> 3
He + e~+ vt .
Neutrinos from stars like the Sun have their origin in the various reactions that take
place deep within the stellar interiors. Consequently, the detection of these neutrinos
on Earth offers a unique view of the processes of stellar energy generation. The flux of
solar neutrinos is currently being studied for this purpose with the aid of large
underground detectors. The mechanism used for the detection of neutrinos is the
t^-capture reaction
37 37
vt + C1 -^ Ar + e~.
Neutrinos are produced in the Sun by several different processes, including certain
branches of the proton fusion chain, and a small number are captured by the chlorine
nuclei in a large sample of the liquid C 2C1 4 , a dry-cleaning fluid. (Unfortunately, the
capture reaction has a high neutrino threshold energy, so that neutrinos from the main
proton cycle fall below the level of sensitivity of the detector.) The rate of neutrinos
37
captured per nucleus is found from the quantity of radioactive Ar present after an
exposure of the sample. A useful measure of this quantity is the solar neutrino unit
(symbol SNU), defined as
Theory and experiment are in disagreement over the value of the solar neutrino rate.
Models of solar energy generation predict a rate of approximately 6 SNU, while the
measured result is closer to 2 SNU. This factor-of-3 shortfall in the number of
detected neutrinos is an indication of something unusual in the behavior of the Sun, or
in the behavior of the neutrinos.
Oscillation in a beam of neutrinos is a phenomenon in which the lepton-number
composition of the particles varies with the length of the beam. This effect may occur
when a source emits electron-neutrinos, and some p/s transform into v 's as the beam
propagates along. The resulting neutrino beam may then be observed to have a
variable vt content along its path. A similar kind of behavior definitely takes place in
certain other neutral systems to be discussed later on. Neutrino oscillation provides a
plausible explanation for the solar neutrino problem. It is possible that the Sun
produces v t \ at the predicted rate and that an oscillation from vf to v occurs on the
way to the detector on Earth. (This transformation is most likely to occur inside
the volume of the Sun.) A shortage of detected neutrinos is then observed because the
37
arriving v^s cannot participate in the C1 capture reaction. A recent analysis of the
problem of neutrino conversion inside the Sun indicates values of the mass for ve and
v^ far below the lower bound for v found in the triton /?-decay experiment.
t
Neutrinos provide many means of access to information about the weak interaction.
Phenomena such as fi decays and reactor processes are able to probe the interaction at
low energy, and neutrino beams from accelerators are able to explore the properties of
the interaction at high energy. Experiments in the latter category have contributed to
many recent developments in the theory of the fundamental particles.
Conservation laws are related to the basic dynamical principles that govern a physical
system. We may identify a conserved quantity from principle if the dynamical theory
is known, or we may recognize the conserved nature of such a quantity from empirical
evidence if the theory is not fully established. In either case, the conservation law
supplies a scheme of organization, known as a symmetry, that reduces the complexity of
the given system. We use conserved quantities for this purpose in classical mechanics
when we let constants of the motion act as parameters to designate the orbits of
particles. We also use such quantities in quantum mechanics when we let quantum
numbers serve as labels to specify the states of the system. Many of our more familiar
conserved quantities are associated with spatial behavior. Conservation of angular
momentum is a case in point. The conservation law stems from rotational symmetry,
and the angular momentum quantum numbers identify the various quantum states.
Parity is another independent spatial property of a quantum system. Our aim is to
examine its meaning as a symmetry so that we can understand the circumstances in
which the associated conservation law is violated.
Let us first recall the problem of particle motion in one dimension with potential
energy V(x), where V behaves as an even function satisfying V( — x) = V(x). This
property implies the existence of energy eigenfunctions having symmetric and anti-
symmetric behavior under the reflection x -* —x. Thus, states of definite energy exist
with definite even and odd parity as a result of the reflection symmetry of V.
We then recall that the parity property in three dimensions pertains to space
inversion (
— x, —y, —z). This operation differs
in all three coordinates (x, y, z) -*
from where only one coordinate changes sign. Figure 16-9 illustrates
mirror reflection,
the distinction and shows how space inversion is represented as a mirror reflection
838 Elementary Particles
Figure 1 6-9
Transformation of the coordinate axes by space inversion and by mirror reflection. A reflection
in one axis, followed by a 180° rotation about the same axis, is equivalent to the space-inversion
operation in which all three axes are reflected.
y
G-
-y
Mirror
accompanied by a 180° rotation about the reflected axis. The central-force problem is
governed by a potential energy V(r), where V remains unchanged under the inversion
r —» — r. Consequently, stationary states exist in the form of angular momentum
eigenfunctions with orbital quantum number ( and with definite parity given by
f
{— \) . Again, the parity of the states originates in the inversion symmetry of V. Since
inversion is thesame as reflection accompanied by rotation, as in Figure 16-9, and
since rotational symmetry is already a property of V, it follows that the parity of the
states is associated directly with the mirror symmetry of the interaction expressed
by V.
that also occurs in nature as an equally operative physical process. To illustrate, let us
imagine taking a device made of several moving parts and using the image in a mirror
to assemble a reflected version of the original machinery. We could then compare the
two machines and argue that the original and the duplicate should function in
mirror-symmetric states of motion. In a similar spirit, let us consider two optically
active materials whose respective molecules rotate the plane of polarization of light in
opposite directions. It would be surprising if the two species of molecules could exist as
exact mirror images of each other and yet could rotate the light in opposite directions
by different amounts. These mirror-related phenomena have no distinguishable physi-
cal effects and therefore offer no operational way to tell right- and left-handed systems
apart. We note that any existing asymmetric physical behavior could be used as a
means of differentiating between right- and left-handedness. These models of familiar
macroscopic symmetry do not prepare us for the mirror asymmetry that exists among
subnuclear particles.
60
Co ^ 60 Ni + e~ + ve .
The cobalt source was placed in a magnetic field to orient the nuclear moments, and
the sample was kept at low temperature to inhibit the thermal disordering of the
60
aligned spins. The nuclide Co was an excellent choice because of its large nuclear
840 Elementary Particles
Figure 16-10
60
Co decay and its mirror image. Electrons
are emitted to the south in correlation with a
west-to-east spinning motion of the sample of
nuclear spins. The image reverses the spinning
motion, but not the electron direction, to give
an unrealized decay configuration.
oO oa
<2> <U-P
Mirror
+
spin (i p = 5 ) and long half-life (t 1/2 = 5.3 y). Particular advantage was taken of
60
the Co decay sequence (shown in Figure 15-18), in which /? decay occurred from the
60 60 60
Co ground state to an excited state of Ni, and y decay followed to the Ni ground
state. The fi and y radiation could be detected together in the experiment, and the
observed distribution of y rays from the oriented nuclei could be used to monitor the
polarization of the sample. It was found that when the spins were aligned predomi-
nantly in one direction, the electrons in the /?-decay process were emitted prefer-
entially in the opposite direction. This observation was interpreted as a violation of
reflection symmetry. Conservation of parity would have called for equal numbers of
electrons emitted parallel and antiparallel to the spin orientation (as was observed for
the y rays emitted by the same aligned sample).
We can understand the asymmetric emission of electrons by the polarized source as
a violation of mirror symmetry by examining the argument sketched in Figure 16-10.
The sample is shown with its spins pointing north, representing a spinning motion
from west to east, while the electrons are shown radiating to the south. In the mirror,
the image of the experiment shows the electrons emitted again to the south, but in
correlation with a spinning motion from east to west. The mirror image does not
correspond to an observed process of electron emission. Since the image process does
not occur in nature, it follows that the decay of the nucleus is not reflection symmetric.
Parity violation may be used to define right- and left-handedness, since the
symmetry between the two is broken in (i decay. Let us suppose that we are in
communication with an experimenter in another galaxy, and we wish to tell the
60
person how to make a left-hand screw. The instructions would be to align a Co
sample and observe the direction of the emitted electrons. The threads should then be
cut so that the screw advances in the electron direction as it turns in the sense of the
spinning nuclei. The violation of reflection symmetry is needed to convey this
distinction between right and left. It is also essential that the distant experimenter uses
60 60
Co and not anti- Co in order that parity nonconservation alone serves to distinguish
right from left.
Figure 16-11
s state capture
I52
+ Eu(*' = 0} -+
152
Sm*(; = \) + ve
15 2
Sm(; = +
I
0) y.
Hence, the net effect is the indicated final state consisting of ve and y along with
Sm(i = 0). A detection scheme for y rays is contrived to guarantee that this final
system has collinear momenta, as shown in the figure. The y rays are observed to have
unique left-handed polarization, and it is inferred that the neutrinos can exist in only
one of the two possible spin states normally allowed for a spin-^ particle. The
experiment draws the conclusion that the neutrino vt is a left-handed particle, having
its spin orientation opposite to its momentum. Such a configuration of spin and
momentum is intrinsically asymmetric under mirror reflection.
This experiment is construed as an observation of the left-handed nature of the
neutrino and, by inference, the right-handed nature of the antineutrino. We note in
passing that the properties of unique handedness and absolute masslessness are
intimately connected. To see the connection, we consider a massive particle whose spin
and momentum are parallel in some Lorentz frame. It is possible to transform to a
frame moving faster than the particle and to find that the momentum is reversed but
the spin is not. The original right-handed configuration of spin and momentum is thus
observed as left-handedness in the new frame. A zero-mass particle always has speed c
and cannot be transformed in this manner. Therefore, its spin and momentum
directions must maintain their parallel or antiparallel relationship in all Lorentz
frames. These remarks are not meant to imply that the neutrino must be massless. We
842 Elementary Particles
Figure 16-12
Spin
"*~"
fj — ~~ft
— *" *" 2 axis
Left-handed Left-handed.
argue instead that the neutrino mass may be very small and that a very fast frame is
then required to outrun the neutrino and reverse its observed handedness.
is a basic principle of weak interaction theory. The viola-
Parity nonconservation
tion of mirror symmetry occurs in distinct patterns, as evidenced by the handedness
properties of the neutrinos. These patterns of asymmetry are built into the current
theory of the fundamental particles. Thus, the asymmetry has the same respected
status in particle physics as any of the equally well-established symmetries and
conservation laws.
Example
The experiment in the figure shows that the muons are observed with their z
component of angular momentum given by m = — \. Angular momentum
conservation then implies mv = + - l
Let us recall the beginnings of weak interaction theory so that we can start to assemble
a picture of the fundamental weak interaction. The weak theory is rooted in an
analogy with quantum electrodynamics. Since the electromagnetic interaction can be
formulated in terms of the emission and absorption of photons, it is natural to suppose
that a similar idea also holds for the /3-decay interaction.
The original /3-decay theory for elementary particles was an outgrowth of the
formalism for nuclear /? decay. The first ideas were put forward in the Fermi theory of
1934, as discussed in Section 15-5. Fermi's picture drew a parallel between /? decay
and radiative decay by taking the emission of a pair of leptons in a weak nuclear
transition to be similar to the emission of a photon in an electromagnetic transition.
Parity nonconservation was a subsequent development that had to be built into this
primitive theory.
We can rely on Feynman diagrams to visualize the weak theory, as we have done
in the case of quantum electrodynamics in Section 16-2. In fact, such a representation
of nuclear /J decay has already been offered in Figure 15-16. This diagram from the
previous chapter is reproduced for our present purposes in Figure 16-13. We see that
16-7 The Weak Interaction 843
the interaction involves the coupling at very short range between a nucleon world-line
and an emitted lepton pair. These coupled elements of the theory are identified as
weak currents, by analogy with the electromagnetic current shown in Figure 16-5. The
neutron-to-proton world-line constitutes a weak nucleon current in which the interacting
particle undergoes a change of identity as the weak transition takes place. This feature
of the weak theory is in contrast with the identity-preserving and charge-preserving
properties of the electromagnetic current. The pair of emitted leptons ve e is repre-
sented in similar fashion by a weak lepton current, whose orientation is contrived to fit
the given diagram. We display these weak currents in Figure 16-13 as separate
elements of the interacting system.
The weak nucleon current of the original Fermi theory is a polar-vector quantity.
Its structure is the same as the electromagnetic current except for the operation that
changes the identity of the nucleon. An analogous weak nucleon current of axial-vec-
tor form is also needed to allow for parity nonconservation. To interpret the
interaction in Figure 16-13, it is understood that both polar-vector and axial-vector
contributions of opposite parity appear in the nucleon current and that the lepton
current incorporates the unique handedness of the neutrino. The resulting construction
is known as the universal Fermi interaction. This version of the weak theory is due to
^-leptons. We note that the nucleon current of Figure 16-13 is replaced by the ju-lepton
current so that the universal interaction describes the (3 decay of the muon. In all
expressed in terms of a universal coefficient called the Fermi constant GF . Its value is
given by
GF = 8.95 X KT^MeV m •
844 Elementary Particles
Figure 16-15
\
M
P
C
)
Example
The scale of the weak interaction is set by the Fermi constant G F a very small ,
GF =— I
he \ mwc
where g 2 /he is treated as a dimensionless factor and where the range h/m w c is
included to secure the proper dimensions. The mass of the weak quantum W i
2
is now known to be around 81 GeV/c . Let us use this result to compute the
range of the weak field:
he 197 MeV • fm
= 2.4 X 10~ 3 fm.
mw e
2
81 X 10
3
MeV
g
2
GF 8.95 X 10" 50 MeV m •
3
1
2
he rfirhc (2.4 X l(T 18 w) (l97 X 1(T 15 MeV m) • 13
2
We note that rw
'it
is very small, and we see that g /hc is analogous to the fine
structure constant
2
e 1
4:7Te hc 137
Thus, the two comparable quantities in the weak and electromagnetic theories
are found to differ by only a factor of 10. We must therefore explain the
846 Elementary Particles
16-8 Strangeness
We are about to describe a proliferation of particles, and we wish to prepare for this
outburst by introducing a preliminary classification scheme. The elementary particles
are conveniently organized into two mutually exclusive families called the Uptons and
the hadrons. Leptons have already been defined in Section 15-5 as particles that do not
engage in any strong interaction processes. The first part of Table 16-1 contains a
complete list of the known leptons, including the new members recently encountered
in this chapter. The strongly interacting particles comprise the much more numerous
family of hadrons. A partial list of some of the existing hadrons is given in the other
two parts of the table. These particles have properties that are more complicated than
the leptons since they participate in all the fundamental interactions. The table
indicates some of their characteristics in the form of several new quantum numbers. Our
goal is to interpret these quantities by uncovering the corresponding symmetries and
conservation laws.
We make a further subdivision of the hadrons according to fermion and boson
properties. The strongly interacting particles with half-integral and integral spins are
called baryons and mesons, respectively. Table 16-1 lists one particular group of
hadrons in each of these two categories. A conserved baryon number B is defined
accordingly:
77 +p -*
\
and 77 + p -» ( ,7° + m +p
+
77~ + p \77 + 77'+ n
16-8 Strangeness 847
"0
c
(4
o o o o o o >
-a CO a.
£
CO X X X X X X | x x :
CO O
(C CO "^
„- CO O
* Ol Tf & o _ <* X co
2 <~D
1-1
cm ra —. cq ,<-;
<u X
-£
C3
°
CM
J !/5 73 CN M CO
\ o
J> CO <T> > us r-- -
CO
5
U CO
CO
ai U3 (Ti (N 1^
CO -h CO CT> O)
iO
^<
—i
CM
<u en m
co
co i CO
t^,CT) CT> —> -^ —i —• CO CO <* m
(A
— ' -< O O O O — —
I I
^oo +1 +1
£
CM
V
i
J3
\c cr E
> \ c
to i^
o >•- P^N-I* O — O I -H -\'l -|w
— 1
>
d -t- in
o d -f -V
X
H +1 +1 1+
ft
— be
— C rt
n V \ t^~ •
.-:•
JS
3
V
-a
<L>
a
>.
s x
3 x 1
—1 CM I M O — < *^ —
J
ri
c
a.
1
eJ i
X c
h C
£
cfl
t>3 O CM CM C/J O o — — i
Q I I I I I I +1 +1
c X!
£
E 'a
^r Q
c
rt
18 CO
*' >
£ S8 <u — O O — O — o
-h
\
i '
u H 11
c
O' ©? t5
w -O C -a
'J :
C 0,
3 3
ex
~ &, 3- ** t- ~'
o
OJ
e/s 8
S
,i
•v.
J S
*
c
1o c i:
c
w
bo ra JJ
s_
-C ^ HWH I' 1
I'l S b fe
^ o
\ Jai
rt
E
CJ
PQ 1!
_i
2 + i)
c >-> C P
u.
u V u
c X
rd
V hH
X
—
Figure 16-16
A P r~ * m» « — «» ' nm * '
iimu '
at the lower energies. The new particles make their first appearance at the thresholds
for the associated production reactions
Our attention is drawn especially to the fact that the observed hyperons and kaons are
+
not seen singly in final states such as K° + n, or tt° + A, or tt + 2~, even though
these systems have thresholds at lower energies. Apparently, the new strange particles
are produced with some intrinsic property that prevents their occurrence until there is
enough energy to create these particles in association with each other. A new kind of
conservation law is in evidence here. The conserved quantity is the strangeness
quantum number S, and the assignment of a value of S to each hadron is such that
the total strangeness is conserved in S =
every hadronic reaction. Thus, if we take
for pions and nucleons, S = + 1 for K+
and S = — 1 for A, 2 and 2~, we and K , ,
see that conservation of strangeness holds in the observed ir~ p reactions. We also note
that the conservation law would be violated by the production of unassociated
hyperons or kaons. Other strange particles observed at the higher m~ p thresholds in
+
Figure 16-16 are the hyperon 2 and the kaons K~ and K°. It is clear that we must
assign S= — 1 to each of these particles if strangeness is to be conserved in the
indicated final states.
Our search for more strange particles can be continued along lines suggested in the
second portion of Figure 16-16. beam of K mesons be incident on protons
If we let a
in our liquid-hydrogen bubble chamber, we find thresholds as shown for the various
final states 7T° + A, 7r° + 2°,. ... All these systems have total strangeness S = — 1.
-
The new baryons Z° and E also appear, with strangeness S = — 2, in the reactions
16-8 Strangeness 849
at the higher energy thresholds in the figure. It is noteworthy that the new strange
+
baryons are \ particles like the nucleons and that the new kaons are mesons like
the pions. The
organization of the hadrons in Table 16-1 incorporates this observa-
tion. Wefind that the strangeness-conserving m~p and ~ p reactions, indicated by K
the thresholds in the figure, have cross sections in the millibarn range. We therefore
regard the reactions as strong interaction processes.
All strange particles are unstable. Their various modes of decay contain further
information about the validity of strangeness as a conserved quantity. Let us list the
dominant modes for the strange baryons and charged kaons as follows:
p + 77
A
K + 77°
£ + 77°
+
2°^A + y 2-->« + 77"
n + 77
and
+
77 +77° 77"+ 77°
+ +
77 + 77 + 77~ 77~ + 77~ + 77
h
77+77 +77
+
77° + e + V U
t
77 + e~+ V,
JT° + H +
+VU K+ M" + v..
(Neutral kaons decay in a special way to be discussed in Section 16-9.) The lifetimes
for all but the 2° decay are in the range 10"
10
to 10~ 8 s, long enough for the various
charged particles to form measurable tracks in a bubble chamber. Figure 16-17 shows
an example and decay, as recorded by a photograph taken in
of particle production
such a detector. The orders of magnitude of the lifetimes are characteristic of decays
that proceed via the weak interaction. The main observation to associate with this
remark is the fact that strangeness is not conserved in these decays. If we examine each
of the listed strange-particle decays (except the one for 2°), we find that strangeness
conservation is violated by exactly one unit in every case. The exceptional 2 decay
occurs with a much shorter lifetime and yields a photon in the final state. These
circumstances, taken together with the strangeness-conserving nature of the process,
tell us that 2° decay governed by the electromagnetic interaction.
is
Figure 16-17
Associated production and decay of strange particles in a liquid-hydrogen bubble chamber. The
production-decay sequence consists of the processes
w-+p->K° +A
'—> b + 77
P
+
77 + 77
16-8 Strangeness 851
strangeness but does not observe the conservation law. Instead, the strangeness
quantum number changes in a definite way, by a single unit, in every instance of
strange-particle weak decay. We have already seen the breakdown of a conservation
law due to the weak interaction in our discussion of parity. Here, another conserved
quantity of an internal nature is found to be violated in the same class of physical
processes. It is interesting to note that the strong interaction determines the existence
of thehadrons and obeys the conservation law, while the weak interaction causes their
decay and exhibits a breaking of the law.
Strange particles were first seen in cosmic-ray events by G. D. Rochester and C. C.
Butler in 1947, the year of the pion. Hyperons and kaons began to be produced in
accelerator reactions in the next decade, and associated production was interpreted
during this period. The idea that strange particles were produced in pairs was
originally proposed by A. Pais. Subsequently, the concept of strangeness was fully
developed in terms of a symmetry principle by Gell-Mann. The identification of this
new internal quantum number was the first step in the organization of a body of
symmetries governing all the hadrons.
Example
We can apply the methods of relativistic kinematics from Section 1-11 to study
the associated production of strange particles. mesons Let us consider ir~
incident on protons at rest and determine the threshold beam energy for the
K° + A final state. In the reaction tt~+ p -^> K° + A, we have the two
momentum four-vectors
and &>' =
7!
p i{m K + MA )c
for the initial tt p system in the lab frame and the final K°A system at rest in
the CM frame. Lorentz invariance relates & and &' by the equality
Pi-
K
/ }
=-{m K + Mh fc\
2
El =
7T
c
2
pl
l 7T
+ rn 77 c\
= 2EvM +(ml + M y p
c
2 2
(m, + MA -(ml + M
2 2
) )
E„ =
2M„
852 Elementary Particles
v = E„-
K„ P mx = L
2
(™a + ^a) ~K + M p )
c
TT ft
2Mp
mK + MA + mn + M
=
2Mp
p
{m AK +
v
MA — m
K m
'
— Mp} h )c
2
.
allowed to act on the wave function and produce a transformed state. Thus, a
particular physical process is turned into three other processes under the action of the
different symmetry operations, and each of the three is compared in turn to the one
given. The comparison asks specifically whether the transformed copy process is
physical, and whether the transition probability is the same as for the original process.
We recall that we have already pursued this line of inquiry with regard to the P
operation in Section 16-6. If the copy and the original are found to have equal
probability, then the interaction governing the process is said to be C-, P-, or
TT-invariant, depending on the transformation in question. We can also subject the
interacting system to any combination of the three operations. In this way we may
learn, for instance, that CP-invariance holds where C and P symmetries are violated,
and we may then argue that the two violations exactly compensate each other.
An important theorem applies to these considerations. It can be proved from very
general properties of relativistic quantum field theory that every interaction is
invariant under the composite transformation CPT. This invariance means that the
combined symmetry with respect to the three operations together is always respected,
16-9 Heulral K Decay and CP Symmetry 853
even in situations where any of the three factors exhibit violations. One of the main
consequences of CPT-invariance is the requirement that particle and antiparticle have
the same mass and the same lifetime. It is clear that any observed breakdown of this
symmetry would be cause for alarm, calling for a reevaluation of the principles of
quantum field theory.
Our principal concern is the behavior of the weak interaction with regard to C, P,
and T. We already know that P-invariance /3 decay: the mirror image
is violated in
of a certain decaying sample of aligned nuclei does not exist as an observed physical
system. We can see immediately that C-invariance is also violated by the inherent
left-handedness of the neutrino: the application of C to the neutrino yields a
left-handed antineutrino, a nonexistent particle. Note, however, that when CP is
this problem and consider the superposition of the mass-degenerate K° and K states.
1 1 _
K,= -^(K° - K°) and K = -^{K° +
2
K°), (16-6)
V2 V2
because these states have definite properties under the CP operation. If we accept this
assertion for the moment, we can perceive a subtle distinction between K and K .
K°= -i^{K + l
K2 ) and K° = - -^(K, - K2 ). (16-7)
to the transformations
Space inversion then introduces a minus sign for the 0" particles:
The CP properties of the states in Equations (16-6) follow from these operations:
CPK X
=K X
and CPK2 =-K 2
. (16-10)
CP-odd.
+
Next, we consider the 7r 7r final state that results from neutral kaon decay. This
system must have angular momentum zero and orbital quantum number { = 0. The
overall even parity of the state is determined from the orbital parity, which is even,
and the intrinsic parities of the two pions, which are odd. Thus, when we apply C and
+ +
P to the 77 77 state, we find that C interchanges 77 and it ~, and then P
interchanges them back, such that the sign of the wave function remains unaltered.
+
Consequently, the tt tt~ final state in neutral K decay is CP-even, and the same is
q
true for the tt 7t final state. If CP-invariance is a valid symmetry of the weak
interaction, then the K x
portion of Equations (16-7) must be responsible for the
observed 277 decays since the K 2
state is forbidden to have the CP-even 2tt mode.
Decay into three pions is allowed for K 2 ; however, the ()-value is much smaller for 377
than for 277, so that the probability should be correspondingly smaller for K 2
decay
than for K x
decay. A substantially longer lifetime should therefore be observed for K 2
relative to K
These arguments lead us to the conclusion that, although K° and K°
x
.
are the particles produced with definite strangeness in strong reactions, they are not
the particles observed in weak decays. Instead, K° and K mix to form two distinct
decay states K and K 2 whose lifetimes are quite different because of CP invariance.
x
,
+
/t7 + e~+ vf
~+ +
77 e + vt
+
77+77 77 + /A~ + ^
K. -» {
and K 2
-»
{
>77° + 77° 77 + [I + V
77 +77 +77
+
77 + 77~+ 77°
10 8
with lifetimes around 10 and 10 s, respectively. This interpretation of the
neutral kaon system remained intact until 1964.
The logic of K° — K° mixing
quantum mechanical principle of follows from the
superposition. We can pursue the arguments further and show how neutral kaons
participate in the process of regeneration, an extraordinary phenomenon in which a
member of a system is severed away and later grows back. The following discussion of
the process is illustrated in Figure 16-18. We suppose that 77" mesons are incident on
a slab of material and that the collisions in matter produce #°'s via the familiar
16-9 Heutral K Decay and CP Symmetry 855
Figure 16-18
Regeneration of short-lived neutral kaons. Reactions in slab A produce K°'s whose A',
component decays away, leaving a pure K 2 beam incident onjlab B. The K° and K n parts of
the 2
K
state have different strong reactions in the slab. The K° portions are removed in these
processes so that a beam of K° 's emerges. A regenerated K x
component is contained in the
emergent beam.
©-—
K 2= M1±M tf° = ^
associated-production reaction
+ p -» K° + A.
K2 state. These particles are directed at a second slab of material, located along the
beam-line at a distance such that the transit time between slabs is greater than the K x
K° + p
K° + p -> K° + p.
Thus, the strong reactions in the second slab remove most of the K° half from the
incident K2 state and leave mostly #°'s in the transmitted beam. Equations (16-7)
again tell us that this K° beam is an equal-parts mixture of K and K2 The x
.
remarkable conclusion is summarized by the observation that the K state dies out at x
one end of the beam-line and then grows back at the other through the intervention of
the strong interaction. The rebirth of K is seen by detecting again the rapid decay
x
into two pions, as shown in the figure. This process of regeneration of short-lived
neutral kaons has been observed in the laboratory. The steps in the analysis are
reminiscent of those followed in the Stern-Gerlach thought experiments of Section
8-7. Recall that we have applied superposition to spin states in the case of the
856 Elementary Particles
The mixing theory of Gell-Mann and Pais predicted that the CP-odd A 2
state
could not decay into two pions. An experiment to place improved bounds on this
and A 2 The decaying particles are therefore renamed Ks and KL (A-short and
.
A-long) and are called by those names in Table 16-1. The experimental results may
be interpreted by attributing the violation of CTMnvariance to the fact that the
physical decay states A v and K L are not CP-pure. The departure of Ks and K L from
A, and A is expressed in this view by altering the superposition of states to read
'.,
+ eK2 K +
A,=
A,
= 2
and KL = ,
2 eK.
2
. (16-11)
/i + M /T+l El
Kl „<>
{«<> +
is then presumed to be due to the small CP-even A, contribution in the expression for
+
7r~+ e + v
A ' 77' + e~+ vt
Example
Our discussion of neutral kaon mixing and CP-violation can be neatly cast in the
language of the Schrodinger equation. First, let the weak interaction be turned
off and consider the simple time-dependent wave functions for stable kaons at
rest:
K° = Ae-'
m ° c2,/h
and K° = Ae-'
m ^ i/h .
Note that each state is parametrized by the same mass m . The I dependence
satisfies the equations
d
ih ih d _ _
— -K°£
= m K° and — -K° = m
z ()
K .
c dt c dt
ih d K m "
K°]
c
2
dt K° m K°
Next, let the weak interaction be turned on, and introduce the mixing mecha-
nism by altering the form of the "mass matrix:"
m u
:l m
d K°~ ni a K
~ ih
2
~dt lK°, . V m [K°\
-
ih d K ms "
\Ks]
2
\ A
K L_ mL Kl\
c dt
1 -1
1
and
"l K x
e A',
v'l + l«
858 Elementary Particles
A',
K,
and m L are ultimately connectedto the quantities m, u, and v given in the mass
matrix. We leave the derivation of these relations to Problem 17 at the end of
the chapter. The given quantities are complex-valued, and so the derived
parameters must have the same property. We interpret the meaning of the
complex masses m s and m L by defining the following real and imaginary parts:
h
m s c = m so c l
L0 L
2t, 2^
The solutions for A^ s - and KL are then written as
/h c2 ' /k
A^ = A s e~ imsc2 '
= A s e~ ims <>
e~' /2Ts
and
2 /r° 2
l*s|
2
= \A s e-'
\
and \KL \
= \A L \
2
e~'^.
Thus, we see that m so and m L{) are the masses of the decaying particles Ks and
KL , and ts and tl are the corresponding lifetimes.
16-10 Isospin
The list of baryons and mesons in Table 16-1 includes the assignment of several
internal quantum numbers. These properties are associated with the symmetries and
conservation laws that govern the behavior of every hadron. have just examined We
how the strangeness S is treated as one such conserved quantity. Another internal
attribute with its own unique symmetry and conservation law is the isospin, denoted
for each hadron in the table by the pair of quantum numbers / and T,. We have
already introduced the isospin properties of the strongly interacting particles in
Section 14-12 in our discussion of the charge independence of the nuclear force. The
binding of protons and neutrons in the nucleus is only one of many manifestations of
the strong interaction. Our purpose now is to extend the domain of isospin symmetry
beyond the nucleons so that the concept embraces the whole family of hadrons.
Let us begin by organizing the tabulated baryons and mesons according to two
charts, as in Figure 16-19. The members of each group have spatial properties in
+
common, since the baryons are \ particles and the mesons are ~ particles. (These
16-10 Isospin 859
Figure 16-19
+
Mass levels of the eight \ baryons and the eight (T mesons listed in Table 16-1.
H
Y v
I
A
S = -1
n P i.o-
.<? = n
S =
5
-0 + S = ±l
A'" a: a A"
7T ^° TT
s = o
Mass -1
2
(Gev/C )
spin and parity assignments are deduced by observing the spatial distributions of
outgoing particles and decays of the various species.) The figure is a
in the reactions
+
mass-level diagram, in which the eight ~2 baryons and eight ~ mesons exhibit their
masses at four different levels and three different levels, respectively. The striking
feature of these displays is the fact that approximate mass degeneracies exist among
certain subsets of the particles. This pattern of degenerate multiplets subdivides the two
hadron systems into singlets, doublets, and triplets, each with a strangeness assignment
and a value for the mass. The charge and the slight deviation in mass appear to be the
only distinguishing characteristics among the members of each of the multiplets.
We have seen multiplets of states of different charge and nearly equal energy
before in the isobaric analogue levels of nuclei. Recall that these degenerate energy
states refer to nuclear isobars with the same i
p assignment and with different values of
Z and N for a given value of the mass number A. The existence of such related
nuclear states reflects the charge independence of the nuclear force and, more
generally, the charge independence of the strong interaction. We know that this
property of the nuclear force is expressed in terms of a symmetry with respect to
rotations in isospin space. Recall that the conserved isospin vector T behaves as a
quantized angular momentum, whereby T2 has eigenvalues given by t(t + 1), and Tz
has 2t + 1 discrete values for each integral or half-integral choice of the isospin
quantum number t. Rotational isospin symmetry then implies that the 2t + 1
ring the properties of angular momentum quantization directly to the isospin vector
,
T. We may also argue that slight deviations from degeneracy within an isospin
multiplet are attributable to the influence of the electromagnetic interaction. These
effects break isospin symmetry, just as the imposition of a magnetic field acts to
separate degenerate states in the central-force problem. Thus, the strong interaction is
indifferent to the isospin direction, while the electromagnetic interaction selects the z
direction in isospin space and breaks the symmetry by distinguishing states of different
T. and different charge.
We assign isospin quantum numbers to nuclei in accordance with the basic / = \
doublet identification of the nucleon, where the proton has T, = + L
,
(isospin up) and
the neutron has Tt = - ',
( isospin down). The z component of isospin for a nuclear
state is then determined by the proton and neutron numbers as in Equation (14-47):
Z- N
T. = .
A A
T=Z- - = - ,
2 e 2
01
Q. A
— = T + z
— (16-12)
e 2
and take t = 1 to denote a triplet of states with Tz = — 1, 0, and +1. All the
tabulated \ baryons and mesons are similarly organized according to the
following assignments of internal quantum numbers:
Tt - -1 1 •• 2
T, = (1
/'
= +2" T = +1
:
B = 1 5 = N doublet I = 2
i
n P
S = - 1 A singlet I
= A
S 1 2 triplet t
= 1
2" 2° 2+
S= -2 Z doublet I
= 1 "=" •=•0
2
T. = -1 T = -l
1
z 2
T. = Tz -+i Tz = +1
' +
B = S = m triplet I = 1 77 7T° 77
+
S= +1 K doublet t
= 1 K° K
S= -1 K doublet t
= 1
2
K~ K°
S= T) singlet t
= (1
1
The values of the charge Q/*, the baryon number B, the strangeness S, and the z
component of isospin Tz obey the relation
£> B+ S
- = Tz + (16-13)
e 2
foreach of the tabulated baryons and mesons. This relation among quantum numbers
is due
to Gell-Mann and K. Nishijima and is meant to be a general formula for all the
hadrons. Since B and S appear together in the equation, it is convenient to define a
new quantity called the hypercharge,
Y = B + S, (16-14)
- =T+ z
-. (16-15)
e 2
+
n +p -> 77° + d and p + p -> 77 + d.
states if isospin is conserved. We therefore expect the cross sections to have the ratio
o(n/> ->w°(rf) 1
o(pp -» ir
+
d) " 2'
because we see from Equations (14-48) and (14-49) that the np state has / = 1 in half
of composition, while the pp state has / = 1 exclusively. This prediction for the
its
+ + 77 + p
7r +p ~* 7r + p and 77 +p
77° + n
are especially useful as sources of information about the 77 N system. It is also possible
to examine many other combinations and nucleons. of initial and final states of pions
We learn from charge independence that only two independent complex amplitudes
are required to express all possible combinations. We can understand this property of
pions and nucleons by adding isospins and invoking isospin symmetry. The itN system
has isospins Tw and TN and so the total isospin is expressed as the vector sum
,
T = TW + T„. (16-16)
= and
1 +
22
I I -
2
indifferent about the orientation of the vector T in isospin space. Only the magnitude
of T matters, so that a separate independent amplitude need not be introduced for
every possible value of Tz We
. have just found T to have two allowed values of the
magnitude \jt(t + l) , one for t = \ and one for t = f . Consequently, only two
independent quantum mechanical amplitudes are needed to describe all 77./V —» 77N
processes. If we designate these quantities as X1/2 anc^ X3/2J we ^ nc^ mat tne
amplitudes for the elastic and charge-exchange scattering processes cited above are
16-10 Isospin 863
+
X(77 />,77>) =X 3 /2,
It is clear that isospin symmetry is a rather powerful idea since the number of different
We
do not need a derivation of the numerical coefficients in Equations (16-17) to
comprehend the basic meaning of the formulas. The first equation says that the m +
p
system can only be in a t = § state, an obvious fact since the z components of isospin
add up to give
7z =
1
-
1 + 1
2
= 1
2
+
for 77 and p. (The same argument applies to the 77 « system where
Tz = -1 - \ = -§.
Charge symmetry tells us that the sign of T. is immaterial and that the elastic
scattering amplitudes for m + p and 77 ~n satisfy ihe identity
+ +
x(v p,7r p) = x(tt 72,77 n) = x 3/2-
This predictionis verifiable and agrees with experiment.) We cannot draw such an
immediate conclusion about the ir~p and m°n states. Both of these systems have
T i:
Since 7^ = — ^ occurs in one of the four substates for / = | and also in one of the two
-
substates for t = ^, it is apparent that both tt /) and 77 °« correspond to superpositions
of the < = ^7 and / = \ states. The actual mixtures needed to form the two 777V
systems are well-defined and lead directly to the results quoted in Equations (16-17).
We pursue these remarks somewhat further in Section 16-11.
Example
Our discussions of the internal quantum numbers have been aimed primarily at
the conservation laws pertaining to the strong interaction. Let us consider some
weak decays now so that we can demonstrate a few violations of the laws. In A
decay, A — p + > 77", we have a transition of overall quantum numbers of the
form
S = - 1 and T. = and T =
S = 1 and T, S = and T = 1
864 Elementary Particles
Example
77 + p — K° + A, a
> t = f) reaction, even though m~ p also has a t = § part.
A + A -f*
it + 7j, a violation of isospin conservation.
#*+ p -f*
77
+ -t- 2+ , a violation of conservation of isospin and strangeness.
+ + +
77 + p —» A^ + 2 , a pure £ = | allowed reaction.
Other systems are considered in Problems 18 and 19 at the end of the chapter.
High-energy spectroscopy is concerned with the mass levels and quantum states of
baryons and mesons. These particle properties can be ascertained by investigating the
reactions of colliding hadrons. The products of a given reaction may exhibit new
hadronic states as resonances in the reaction cross section. We have seen resonances in
nuclear reactions, and we have interpreted the phenomena as excited nuclear states
representing unique short-lived configurations of the interacting nucleons. Hadron
resonances occur in a higher-energy range, as expected for the excitation of matter on
a smaller scale. The widths of resonant hadronic states are considerably larger than
their nuclear counterparts, and the lifetimes are correspondingly shorter, as ap-
propriate for the smaller size of the resonating systems. Typical widths and lifetimes
are of order 100 MeV and 10" 23 s, respectively. Thus, the hadronic excitations are in
the domain of the strong interaction, where the instabilities of the resonances are
many orders of magnitude stronger than the weak instabilities of the listed hadrons in
Table 16-1. The resonances are produced via the strong interaction and decay by the
same route. These systems exist with specific mass and quantum-number assignments
and are accorded hadron status on a par with the more familiar particles, despite their
extremely transitory nature.
Excited states of the nucleon are seen in the interactions of pions with protons.
Prominent resonances of the itN system appear as distinct bumps in the total cross
+
sections for the 77 /> and m~ p reactions
±
77 + p -* all possible final states.
1611 Baryon and Meson Resonances 865
and t = f .
If we denote these quantities aso |/2 and a 3/2 we find that the observed
,
+
o(tt p) = o 3/2 and a(ir-p) = %(o 3/2 + 2a 1/2 ). (16-18)
(Note that the formulas are just like the ones for the scattering amplitudes in
Equations (16-17). The results follow from a theorem that relates the total cross section
directly to the imaginary part of the amplitude for elastic scattering. These total cross
sections are also expressible in more familiar terms as sums of the squared moduli of
all the contributing amplitudes.) We can easily invert the relations to obtain a 1/2 and
a 3/2 :
§a(w p) - \a{m + p) = +
and o 3/2 a( ir p). (16-19)
+ =
o(tt p) = 200 mb and a{-n~ p) 67 mb.
+ =
0(77 p) 3a(77"/?)
directly from Equations (16-19). The elastic and charge-exchange scattering processes
can also be examined from the same viewpoint. If we neglect the t = \ amplitude
following ratios for the elastic and charge-
Xi /2 in Equations (16-17), we get the
2
Figure 16-21
Pion-nucleon total cross sections. The graphs on the left pertain to the processes ir
± + p -» all
possible final states, and the graphs on the right refer to itN interactions in the separated
isospin states / = i and t = \,
200 200-
100 -
14 16 18
/•:,
M (GeV)
9 9 i 1
The experimental results confirm this prediction and verify decisively the t = \
hypothesis for the resonant state.
Three other resonance bumps also appear with clearly established isospin quantum
numbers in Figure 16-21. The resonant states are found to have unique spins and
parities by analyzing the spatial distributions of particles scattered at the different
resonance energies. Furthermore, the bumps occur at specific values of the CM
energy, so that the resonances correspond to definite assignments of mass. Thus, a
given resonant state is defined by a mass level and a complete set of hadron quantum
numbers, including the usual spatial properties of spin and parity, and also the
internal attributes of isospin, baryon number, and strangeness. We recognize these
characteristics to be exactly the same as those possessed by the more familiar (and
more nearly stable) hadrons, and we therefore regard the resonances as full-fledged
hadronic particles. The four prominent resonances at energies below ECM = 2 GeV
are designated by the following symbols and specifications. The t = | states are called
+ +
A(l232,f ) and A(l950,^ ),
+
AT(l520,f )
and A^(l680,f ).
16- 1 1 Baryon and Meson Resonances 867
Figure 16-22
Ei Pi
The parenthetical information gives the mass in MeV/c 2 along with the spin and
parity. Each of these hadrons has baryon number B = 1 and strangeness S = 0,
because each occurs as an excitation in the pion-nucleon system. The isospin
assignments tell us that there should be four degenerate particles for t = with f,
T. !
2> ;, and
A", A , A + .and A-
N <mdN +
r
'
.
The indicated charges of these baryons are in accord with the Gell-Mann-Nishijima
formula. Equation (16-13). Numerous other baryon resonances with strangeness
quantum numbers S = and £ ¥= are also known.
Meson resonances also occur, in states whose existence could be detected if
meson-meson collisions could be observed. Unfortunately, such experiments cannot
be performed directly because stable meson targets do not exist. It is possible to see
these resonances indirectly by investigating those meson-nucleon reactions in which at
least two mesons appear in the final state. Figure 16-22 illustrates this sort of process
and suggests how a particular pair of final mesons may be analyzed in order to gain
(m uc 2 ) = (£, + E2 f - <r
2
(p, + p 2 )~ (16-20)
for the case of two particles. We note that the expression m v2 c' is Lorentz-invariant
and is equal to the total energy of the two mesons in their own CM frame. The mass
m x2 is a useful variable to employ in a plot of the distribution of the measured events.
868 Elementary Particles
Figure 16-23
Mass distributions of pion pairs showing the p meson resonance at 770 MeV/c 2 .
Events
Events
+
+ 77 + 77° +p
7T +p +
77 + 77 '
+ 77 + p.
Figure 16-23 shows two typical distributions in which a prominent peak appears at the
mass value 770 MeV/c 2 in the 77 + 77° system of the first reaction, and in both 77 + 77~
+ +
systems of the second reaction. No such phenomenon is seen, however, for the 77 77
combination in the second process. These results are consistent with a / = 1 isospin
assignment for the resonance, where the states with T, = —1,0, and 1 correspond to
+
the charged mesons p ,
p°, and p*. The p + state is observed in 77 77°, and the p°
+ + +
state is observed in 77 77 . The absence of an effect in 77 77 rules out the possibility
of a I = 2 assignment, and so, of the total isospins possible for two isospin- 1 pions,
t = 0, 1, and 2, only the t = 1 hypothesis agrees with experiment. Further analysis
shows the spin and parity of the resonance to be 1 ~, the quantum numbers of a vector
meson. The t = 1 p meson has strangeness S = and, of course, baryon number
B = 0. Other meson resonances are found in many different varieties with a profusion
of spatial and internal quantum numbers.
Vector meson resonances can also be seen in processes initiated by the collision of
beams of electrons and positrons. In these reactions electron-positron pair annihila-
tion takes place, producing electromagnetic energy, and systems of hadrons then
+
materialize out of this energy. The total cross section o(e~e ) includes the processes
+ —
e~+ e > all possible hadrons
Example
+
Let us examine the hadrons A(1232, f ) and p(770, 1~) and show that both
occur as c"= 1 resonant states in their respective interacting systems. The baryon
+
A++ is a ~
+
resonance of the m p system. Since and p are 0~ and -^ tt
+
+
particles, the parity of the 77 /> orbital state must be odd in order to conserve
parity. It follows that the orbital quantum number c* must be odd, and only
/= 1 is allowed by conservation of angular momentum. The meson p + is a 1 ~
+ =
resonance of the 77 7r° system. In this case, c* 1 is required immediately so
that the two 0~ particles conserve both angular momentum and parity. The
meson p° appears in 77 + 7r~ but is forbidden to occur in 77°77°. This conclusion
follows from Bose symmetry, which demands an exchange-symmetric wave
function for the two identical pions. An c"= 1 pair of 77° mesons is not allowed
because the state would have odd spatial symmetry and would therefore be
exchange antisymmetric.
Example
Two different kinematic variables are use^ to display the baryon resonances in
Figure 16-21. The beam energy (or beam momentum) is the natural choice for
the presentation of cross-section data, but the total CM energy is more conveni-
ent for interpreting the resonances as particles of definite mass. We can readily
pass from one variable to the other with the aid of techniques discussed in
Section 1-11. Consider m -p collisions where the beam pion (mass m) has
momentum p and energy E, and where the target proton (mass M ) is at rest.
&> =
i(E + Mc 2 )/c
Let ECM be the total CM energy so that 0> transforms into the four-vector
E 2M 2
E 2 + 2EMc 2 + M 2
c
4
2EMc 2 + (m 2 + M 2
)c
L
870 Elementary Particles
using the relativistic relation between p and E. The result takes the form
r2
'CM
= 2Mc 2 {K + mc 2 ) + (m 2 + M 2
)c* = 2Mc 2 K + (m + Aifc'
when we introduce the pion beam kinetic energy K=E— mc 2 This formula
.
enables us to relate the energy axes in the left and right halves of Figure 16-21.
16-12 Quarks
pendent symmetries associated with the separate conservation laws of isospin and
strangeness.
It should be acknowledged at once that the hundred or more existing hadrons are
far too numerous to be regarded as fundamental particles. This remark obviously
applies to the baryon and meson resonances, since these states occur as short-lived
composite systems of interacting hadrons. We must concede the same point regarding
the proton and neutron, because we know that even these more primitive particles are
structured entities. The evidence for nucleon structure begins with the fact that the
magnetic moments of p and n are not equal to the values expected for elementary
Dirac particles. The proton and neutron are also known to have an extended size and,
by inference, an internal substructure. We learn about these properties in electron-
scattering experiments with nucleon targets, similar to the studies discussed in Section
14-3 in the case of nuclei. Thus, it is clear that the nucleons are no more fundamental
than the various resonances and that all hadrons should be treated on an equal
footing. We have already espoused this democratic point of view in Section 16-11.
The two notions of higher symmetry and hadron substructure were drawn together
into a single conceptby Gell-Mann. In 1961, he organized the hadrons according to a
generalization of the concept of isospin by invoking a rotational symmetry in an
internal vector space of eight dimensions. This extension of the familiar three-dimen-
sional isospin space was designed to incorporate hypercharge among the eight
necessary internal degrees of freedom. Gell-Mann called his mathematical framework
the Eightfold Way. The proposal played the role of a periodic table for the
elementary scheme was used to classify known varieties and predict
particles, as the
new hadrons. In 1964, Gell-Mann showed that the mathematical aspects of his
higher-symmetry scheme could be encoded in the quantum numbers of a hypothesized
set of three fundamental particles, called quarks. He demonstrated that the existing
baryons could be constructed from simple combinations of three quarks, and that the
existingmesons could be assembled by combining quarks and antiquaries. The concept
was originally developed as a means of synthesizing the quantum numbers of the
observed hadrons in terms of more basic elements. In this sense the quarks were
contrived to implement a model of the higher internal symmetry. Soon, however, it
1612 Quarks 871
Murray Gell-Mann
became clear from experiment that quarks and antiquaries had real identities as
fundamental constituents in the substructure of hadrons.
Quarks and antiquaries are distinguished from conventional hadrons by the fact
that they are assumed to carry fractional charge and baryon number. The rules of the
quark model tell us from the outset that the postulated particles have values of Q/e
and B in one-third units. Three quarks, called u, d, and s, are introduced with the
following assignments of quantum numbers:
-1
We note that the entries in the last column are consistent with the Gell-
Mann-Nishijima formula, Q/e = Tz + Y/2. The corresponding antiquaries u, d, and
s have suitably conjugated quantum numbers as follows:
B = T, = 5 = Y= d/e =
We observe from the listings for Tz and S that the role of each quark is to supply a
singlenonzero quantum number in the synthesis of any chosen hadron. This use of
quarks as ingredients in a composition is called an assignment of flavors. Thus, the u
quark is u-flavored with isospin up (Tz = ^), the d quark is ^-flavored with isospin
down (T = — ^), and the s quark is ^-flavored with strangeness (S = - 1). These
z
872 Elementary Particles
Figure 16-24
V V
As
>
d :
1 i
1
2
/',
1 1
:." o^
flavor designations are conveniently plotted on graphs of the hypercharge versus the z
component of isospin, as in Figure 16-24. The resulting triplet and antitnplet patterns
+
77 = ml Y = S = V = 1
\/2
7T~ = — du 1) -1
+=
K in 1
1
K° = dl 1
1
K° = sd -1
K = Ml 1
_ I
9
TJ
= —f=-(uu + dd- 2ss) (1 o (16-21)
The set of all possible combinations of three quarks and three antiquarks also includes
a ninth independent structure:
(This singlet state is separate from the octet of pseudoscalar mesons listed in Table
16-1. Note that the tt , tj, and tj' systems are formed by superposing uu, dd, and ss
1612 Quarks 873
states.) It is implicit in each of these expressions that the spatial states of the
constituent particles must have total spin and orbital quantum numbers s = and
(= so that a vanishing total angular momentum is obtained, as required for a
spin-0 composite system. The odd parity of the 0" mesons is then due to the intrinsic-
even and odd parities of quarks and antiquaries. This property of opposite parities
holds true for all spin- \ Dirac particles and antiparticles.
Thethree-quark baryon structures are somewhat more difficult to assemble. A
convenient starting place is the t = \ A multiplet:
uuu,
k~=ddd. (16-23)
Note that the uuu system has Tz = \ as required, and note that the successors to this
state follow by applying the systematic isospin-lowering operation u -* d to each
quark. There are exactly ten different combinations of the three quarks u, d, and s:
Q~= sss,
is especially interesting because the proven existence of this hadron validates the
underlying higher-symmetry scheme. These three-quark expressions must be supple-
mented by spin and orbital configurations appropriate for composite states with spin
and parity | Positive parity is obtained if each quark is in an s state, and spin- 1 is
.
the result if the quarks are assembled with parallel spins. The procedure is illustrated
by the composition
A ++ = ««kT T T,
in which three .r-statequarks have parallel spins so that the z component of total
angular momentum corresponds uniquely to a ~ state.
Combinations of three quarks can also be deduced for the \ baryon octet of
Table 16-1 and Figure 16-20. We see at once that the flavor mixtures uud and udd
reproduce the internal quantum numbers of the proton and neutron. Since these same
+
forms appear, along with their permutations, in A and A it is necessary to take ,
states. If we take
1 1
ind
1 1
we find that p and n are orthogonal to A + and A ensuring the independence of the
,
states. Strange baryons in the octet may then be constructed by the systematic
+
substitution of an ^-flavored quark. Thus, we derive 2 from p by substituting s for d
to get
The ~ *
assignment of these baryons is achieved by assembling estate quarks in states
with two spins parallel and one spin antiparallel.
So far we have treated quarks only as bearers of flavor in a model of the states of
hadrons. We may wonder whether the constituents are just fictitious artifacts of the
model, or whether they really exist as observable particles. The latter question has to
be decided by experiment. If a free quark could be separated from others of its kind,
and if the quark could be bound in an atom, the distinctive one-third unit of charge
would be a readily detectable signature. Recent experiments conducted by W. M.
Fairbank have succeeded in isolating a fractional charge on a metal sphere; however,
no other similar studies have confirmed this observation. Millikan himself is supposed
to have seen, and discarded, a fractional charge on an oil drop as long ago as 1910.
Quarks bound in atoms could also manifest themselves in atomic emission spectra and
in mass spectrograms, but no such indications have ever been recorded. To date,
searches of all kinds have failed to offer convincing evidence, not only for the
occurrence of free quarks in bulk matter but also for the production of free quarks in
cosmic-ray events and in accelerator experiments.
None of these negative results is in conflict with the accepted belief that quarks
really do exist as bound constituents inside hadrons. Evidence for the validity of this
Figure 16-26
Figure 16-25 Construction of qq states by adding vectors in
Inelastic electron scattering as a probe of the (Y, T z )
plane.
baryon substructure. Y
Hadrons
ud
16-13 The Electromagnetic and Weak Interactions of Quarks 875
e~ +p -* e~+ hadrons,
in which the inelastic scattering of electrons is studied at very high energy. Figure
Example
If hadrons are composed of bound quarks it follows that the interactions of hadrons
are fundamentally expressed in terms of the interactions of quarks. All electromag-
netic hadron processes are thereby attributed to radiative quark transitions accompa-
nied by the emission of photons. Likewise, weak hadronic processes are due to weak
all
quark transitions in concert with the emission of weak quanta. The analogy between
the electromagnetic and weak interactions has been introduced with the aid of
Feynman diagrams in Section 16-7. This parallel can now be developed further using
diagrams based on quarks. In fact, the overall success of weak interaction theory in
the context of the quark model constitutes a large share of the evidence that supports
the reality of quarks.
Figure 16-27 shows two examples of the electromagnetic interaction of quarks. The
diagram for electron-proton elastic scattering shows the exchange of a virtual photon,
where the indicated coupling of the photon is taken to each of the proton's u and d
quarks in turn. The hyperon decay 2° —» A + y is represented as a radiative quark
transitionfrom one uds system to another, where the emission of a real photon can
take u into u, d into d, or s into s, each with its own amplitude. These transition
amplitudes are determined by the coupling between the photon and the electromag-
netic current of each quark.
Weak processes have already made their appearance in the Feynman diagrams of
Figure 16-15. Recall that the exchanged weak quanta carry charge and that the
corresponding hadronic transitions are charge-changing phenomena in all the cases
876 Elementary Particles
Figure 16-27
considered so far. The quark model offers a description in which the weak interaction
causes changes in flavor among the constituents of the interacting hadrons. We can
display these processes as weak transitions from one quark flavor to another, using
diagrams like the ones in Figure 16-28. Note that the hadron transitions n —» p and
p —> n proceed via the coupling of
—»
W and W +
to the weak charge-changing quark
currents d u and u —* d, while the remaining u and d quarks act as spectators. The
[3 decays of strange particles take place in similar fashion, as illustrated in Figure
16-29. In these instances, the quark flavor transition is associated with the
strangeness-changing charge-changing weak current s —* u. Strange particles also
decay to final states consisting only of hadrons. These nonleptonic modes have their
own quark Feynman diagrams. Again, Figure 16-29 describes such decays of the
strange hadrons in terms of the flavor transition s —* u.
Strangeness-changing decays are observed to be universally smaller in amplitude
than comparable strangeness-conserving decays. Processes suitable for comparison are
A p + e~ + vt n -* p + e~+ ve
and versus and
A' 77° + e~+ ve 77~—> 77° + e~+ V l
Figure 16-28
Weak quark transitions in the fi decay of the neutron and in the neutrino reaction v^ + p
+
-»
n + n.
16-13 The Electromagnetic and Weak Interactions ol Quarks 877
Figure 16-29
Quark Feynman diagrams representing the decays of strange particles. The upper three
diagrams show the /? decays A -* p + e "+ v and K -» ir~ + +
L t
e + vt or m+ + e + vr . The
lower three diagrams show the nonleptonic decays A -» p + m and A' v -» 77 + tt
'
N. Cabibbo has shown that these phenomena conform to a unified picture based on
Gell-Mann's higher-symmetry scheme. The ideas can be translated into the language
of quarks so that a direct comparison can be drawn between the weak quark currents
s —> u and d —>
Note that the strangeness change is AS = 1 for the one current and
u.
that the accompanying emission of W~ always selects the specific rotated combination
of s and d given by the expression
<^cos + s sin#.
The coupling of the weak quantum to the corresponding Cabibbo current is shown
schematically in the left half of Figure 16-30. The flavor structure of the current can
be expressed as
0.
with (16-25)
d cos + s sin 9
This prescription stipulates that all s -» u transitions are reduced in amplitude relative
Figure 16-30
Couplings of generalized weak quark currents. The first diagram contains the Cabibbo current,
and the second introduces the concept of charm.
A problem arises when we consider the possibility of the two apparently similar
decays
The first of these processes is observed, as noted above; however, the second decay is
not known to occur. We see that the transitions in the respective quark-antiquark
systems are
Both are strangeness-changing effects; however, the one is charge-changing while the
other is charge-conserving. The observed suppression in the second case can be
explained by invoking the following flavor-transition mechanism, put forward by S. L.
Glashow, J. Iliopoulos, and L. Maiani. Their proposal calls for the cancellation of a
pair of conspiring contributions to the strangeness-changing charge-conserving current
and also introduces a new quark flavor called charm. The mechanism employs two
rotated combinations of s and d:
We note that the second expression is orthogonal to the first and that the first
current takes place via the schematic procedure shown in Figure 16-31. The replace-
ment of s and d by the two rotated forms produces the indicated substitutions, in
which the s —> d transition appears with coefficient sin cos in the first contribution
and — cos 6 sin in the second. The argument goes on to hypothesize a new quark
flavor with Q/e = §, denoted by c. This new quark is conceived to be a partner for
the second of the two rotated combinations of ^ and d in a new charge-changing
1 61 3 The Electromagnetic and Weak Interactions of Quarks 879
Figure 16-31
d dcos0+ssm0 -d sm + s cos
current:
0.
with
d sin 6 + s cos 6
(16-26)
Thus, two related weak quark currents are defined by introducing the rotations of the
two flavors. We show these flavor transitions together in Figure 16-30.
A charm quantum number C is also defined along with the introduction of the
c-flavored quark. Table 16-2 quantum-number assignments for the quarks u,
lists the
d, s, and and includes a new column ior the values of C. Like isospin and
c
strangeness, charm refers to another quantity conserved by the strong interaction. The
entries in the table tell us that the Gell-Mann-Nishijima formula for the charge must
now be modified to include C:
— = T + -
Y
+ —C (16-27)
-
* 2 2
a
Table 16-2 Quantum Numbers of Quarks and Antiquarks
B 1 T. 5 Q/e
i 1 i
1 2
II (1 ',
2 2 3
d
i 1
_ i II
1 I
3 2 2 i 1
2
A
1
- i
I
3
3
1
(1
1 2
c 3 3
1
u _ i
_ 3
J
3 2 2
d _ 1
2
1
1
3
i
3 2
;
I
_ 1
2 i
3 i i
c
_ 1 1
-1 j
"Flavors are listed according to baryon number, isospin and z component, strangeness, hypercharge,
charm, and charge. Evidence exists for a ^-flavored quark with Q_/e = - j, and speculation abounds for a
^-flavored quark with Q/e = -.
880 Elementary Particles
The charm hypothesis implies that the c quark should manifest itself in the existence
of hadrons containing the new flavors c and c. These quantum numbers may occur in
given the generic name charmomum. The bound charm degree of freedom was expected
to become observable if enough energy could be added to dissociate a cc state into
separate c- and c-flavored hadrons. An intensive search for charm in other forms
immediately yielded other charmonium states and eventually led to new c-flavored
nonstrange mesons with compositions cu and cd, as well as ^-flavored strange mesons
of the cs variety. Charm spectroscopy flourished from the start and has continued to
yield a flood of information about systems containing the c quark.
New beyond charm have also come to light since 1977. Another vector
flavors
meson known as the T resonance was discovered in high-energy collisions as a narrow
pairs at E CM = 9.5 GeV. The circumstances of this
4
peak in the yield of fi jti
discovery were just like those associated with the J/\p meson. The T meson was
therefore interpreted as a bb bound system, in which a new quark b was introduced,
with Q/e = - \ to fit the analysis of the resonance and to suit the analogy with
,
charm. A whole family of similarly defined T states has subsequently been observed.
One of these recent discoveries was found at an energy above the threshold for decay
into pairs of b- and ^-flavored hadrons. Yet another quark t, with Q/e = j, has also
been postulated to act as a partner for b. Direct experimental evidence for the
existence of this new flavor is anticipated at higher energy.
High-energy electron-positron colliding beams are ideal for the production of such
uncharged vector mesons as J/\p and T. These resonances make their appearance in
processes of the form
+ -*
e~ 4- e hadrons
+
and are seen as conspicuous features of the total cross section a(e~e ).
Electron -positron storage rings have been constructed at several laboratories around
the world to serve as factories for c- and ^-flavored particles. Observations of cc and bb
systems are presented convincingly in the ratio of cross sections
+
a(e e )
R =-T^-+ -* ^TT> (
16 - 28 )
aye n e fi )
Figure 16-32
R 4
20 25
(GeV)
The t and b quarks are designed to contribute t and b flavors in the composition of
whole collections of new hadrons. Quantum numbers and conserved quantities have to
be defined accordingly, extending the lists given in Table 16-2. The new quantum
numbers associated with / and b have been given the names truth and beauty. These
two new species of flavor terminate a succession of three pairwise generations of
quarks,
and complete the parallel with the already-introduced three generations of leptons
participate in these processes through the electromagnetic and weak currents that
represent the flavor transitions of quarks. A similar picture based on currents has also
been presented earlier regarding the behavior of the leptons. The likenesses between
the treatments of the two interactions encourage the view that the two theories may
have a common origin in a single body of principles. The electromagnetic-to-weak
882 Elementary Particles
analogy has been entertained since the time of Fermi, and the possibility of a single
theory has evolved during the intervening decades.
Wehave also noted that the electromagnetic and weak interactions are very
and range. Electrodynamics describes a force of infinite
different as to their strength
range mediated by the exchange of zero-mass photons, while the weak force is known
to have an extremely short range so that the exchanged weak quanta are presumably
very massive. It would seem that th^ analogy between the two interactions should
break down over these rather substantive distinctions. Despite the enormous disparity
in strength and range, it has been shown that the two forces actually do represent
different manifestations of a single unified interaction. The electroweak theory that draws
together the electromagnetic and weak interactions is in agreement with all experi-
mental tests and is currently regarded as an established theory. This unification of
forces may be compared to the outcome of Maxwell's theory, in which the principles
of electricity and magnetism are united in a single formalism. New physical laws are
at work in the unified theory. These ideas have had a revolutionary influence on all
recent developments in particle physics.
Our presentation of the electroweak theory is divided into two parts. First, in this
section, we describe the structure of the theory and examine the observable conse-
quences in the interactions of quarks, leptons, and quanta. We continue to rely on
Feynman diagrams to illustrate our descriptive approach. Later, in Section 16-16, we
turn to the deeper questions of principle that underlie the whole successful formula-
tion.
theory in which the interactions of quarks and leptons are mediated by a unified
electroweak field having four charge-specific degrees of freedom. Four different
charge-bearing quanta are associated with this generalized field. The quanta
W\ W°, W , and B°
are required to be massless particles by the guiding principle of the theory, just as the
photon is required to be massless in the theory of the electromagnetic interaction. Two
of thesequanta become the expected and W~ W weak interaction. The
of the
remaining two are neutral and can therefore be mixed in various combined states. The
This relation has the form of a rotation defined by the weak mixing angle 8W a ,
This expression defines an independent neutral weak quantum and leads to effects not
anticipated by any previous discoveries in weak interaction phenomenology.
One of the two vital ingredients of the electroweak theory is the principle by which
the quanta are introduced originally as massless particles. The other is a mechanism
by which the weak quanta W +
, W~ , and Z acquire mass while the electromagnetic
quantum y retains its requisite zero-mass property. The guiding principle is known as
1614 The Electroweak Interaction 883
m if
cos0„,, (16-31)
,n z
a = — —-j(m w
77 (he)
c
2
an6w ) . (16-32)
These formulas involve the weak mixing angle, a quantity determined by the analysis
of a variety of weak processes. Equation (16-32) can be used to predict the mass of
W± in terms of the known quantities a and G F and , Equation (16-31) can then be
applied to predict a result for the mass of Z. Both of these masses are expected to be
quite large, of order 100 GeV/c 2 , in keeping with the very short range of the weak
force. Equation (16-32) is especially interesting in this regard because the relation
connects the strengths of the two unified interactions. The dependence on the large
mass mw demonstrates clearly how the extremely small size of GF is associated with
the extremely short range of the weak interaction.
The electroweak theory includes electrodynamics. Hence, all the predicted cou-
plings between the photon and the electromagnetic currents of the fundamental
charged particles have their usual form. The theory also describes all the usual
charge-changing weak transitions in terms of the anticipated couplings of W± to the
various charge-changing currents. These charged currents are shown in Figure 16-33.
Note that the quark flavor transitions involve the modified quarks a", s', and b' . These
Figure 16-33
Figure 16-34
Figure 16-35
Neutrino scattering processes involving contributions due to weak neutral currents. A charged-
current term also occurs in vr e —* ve e~, but not in v e~ —* v e~
vP + e — v„ + e
vu + P "*
"u
+ P
18-14 The Electroweak Interaction 885
The theory also predicts a new weak phenomenon brought about by the presence
of the neutral quantum Z. This fourth quantum in the theory couples to charge-conserv-
ing weak currents, also called weak neutral currents, as illustrated in Figure 16-34. An
entire body of new weak processes is introduced by this piece of the unified interac-
tion, with probabilities determined by the unambiguous predictions of the unified
theory. The couplings to the weak neutral currents are distinct from electromagnetic
couplings, which also involve charge-conserving currents, because of the parity-violat-
ing effects in the weak interaction. Examples of such neutral-current phenomena are
shown in Figure 16-35.
Several different ideas had to be brought together by several people to build this
intricate theory. Yang conceived the guiding principle, and Schwinger proposed the
unification concept, both in the 1950s. The fourfold structure of the unified electro-
weak was devised by Glashow in 1961. The mechanism by which the weak
field
quanta acquired their mass was incorporated in the theory by S. Weinberg in 1967,
and a very similar model was put forward a year later by A. Salam. Little attention
was given to these contributions at first, because the crucial renormalizability of the
theory was regarded as doubtful. Skepticism gave way to enthusiasm in 1971,
however, when a proof of this property was carried out by G. 't Hooft. The theory
received support from experiment in 1973, when neutral currents were discovered in
neutrino reactions of the type
\+P v + hadrons.
Further confirmation of a piece of the theory came in 1974 with the discovery of
charm in the J/4> resonance.
The keystones of the electroweak theory, the new quanta W± and Z, were not
discovered until 1983. An accelerator with especially high energy had to be con-
structed to create these massive particles, and the CERN proton-antiproton collider
was dedicated to that purpose. The essential innovation in the design of the collider
was a technique developed by S. van der Meer for accelerating and storing antipro-
tons at very high energy. Collisions were achieved between 270 GeV protons and 270
GeV antiprotons, and evidence of the weak quanta was found, in experiments
conducted by C. Rubbia and others. The basic production mechanism was interpreted
to be quark-antiquark annihilation, as illustrated in Figure 16-36. The detection of
Figure 16-36
e "e
Hadrons Hadrons
886 Elementary Particles
the particles, first W and then Z, was performed through the materialization of the
massive quanta into pairs of leptons. Quoted values for the two masses have been
given as
Example
The experimental value of sin~#,„ is around 0.23, and so W is about 29°. We can
use this result to make predictions for m w and m z . Let us rewrite Equation
(16-32) as
17 (X
m w c 2 s\r\6,., = 3 '
}/2G F/(hc)
and let us recall from Section 16-7 the expression for GF in terms of the proton
mass:
GF 1.03 X 10
5
o\2
m^^sind,,, = i/
1
—p=r-. —VI 37 —— Mx
5 p
2
i/2(1.03 x \0 )
37.2 GeV A 2
in
ir 78 GeV A'
sin 29°
and
78 GeV A''
90 GeVA 2
,
cos 29°
where the second prediction is based on Equation (16-31). Both results are in
agreement with the findings from CERN.
fermions, we must ask how fermion antisymmetry is obeyed in the formation of baryons.
These questions lead us to the concept of color and to the theory of the strong
interaction where color plays the main role. (In this context color is a fanciful name
for a new quantum number. The concept is unrelated to any attribute affecting our
visual senses.) The fundamental treatment of the strong interaction involves the
dynamics of color in the binding of quarks. Accordingly, the theory is called quantum
chromodynamics. Strong processes occur among the observable hadrons as outward
manifestations of this basic interaction of confined constituents.
Let us begin with the issue of fermion antisymmetry and illustrate the problem by
referring to the hadron A++ . Recall from Section 16-12 that one of the states of this
spin- 1 particle is described as
A + + = uuul T T,
where each u quark is in an s state with spin up. The description in its present form
violates the Pauli principle, because both the space and spin factors in the wave
function are symmetric under the exchange of quark variables. Color is introduced at
this point to contribute an additional quark degree of freedom. If we assign a different
color to each quark, we
see that the violation of the exclusion principle disappears
++
since no two of the quarks in A are in the same flavor-space-spin-color state. We
need three different values for the color quantum number to accomplish this trick. (In
fact, there are three colors of quarks in all, for a variety of reasons. One of the pieces
of pertinent information from experiment is discussed below.) The quark colors are
conventionally designated by means of the indices R, and B, using notation Y,
inspired by the three primary colors red, yellow, and blue. We assume that this same
threefold multiplicity is attached to every quark flavor, so that we have color triplets of
quarks (u R u Y u B ), (d R d y d B ), (sR s Y sB ), and so on.
, , , , , ,
We can now return to the A + + problem and construct an eigenfunction for three u
quarks. The desired expression must be exchange symmetric in space and spin, and
exchange antisymmetric in color. The Slater determinant of Section 9-5 can be
adapted to these properties by writing
whose flavor and color are explicit, and whose estate and spin-up specifications are
implicit in the quark labels (1), (2), and (3). We note that the expression remains
unchanged when the color indices are rotated through the cycle
because of the cyclic symmetry of the determinant. The color portion of the eigenfunc-
tion is called a singlet since its unique form is maintained under such rotations. In this
respect color acts like charge and combines in a color-neutral system to make a hadron.
Color-singlet states are assumed for all hadrons. This stipulation prevents the added
888 Elementary Particles
+
?7 = -j^{u R d R + u ydy + u B d B ), (16-34)
where the flavors u and d refer to J-state constituents and where the usual s = spin
eigenf unction (1/ y2)(T i ~~ IT) is implied. Note that the baryon state in Equation
(16-33) is antisymmetric under exchange of any pair of color indices and that the meson
state in Equation (16-34) issymmetric under the same operation. We obtain color-
singlet expressions for all known baryons and mesons by adhering throughout to
color-antisymmetric and color-symmetric constructions in the two situations.
Experimental evidence for three colors of quarks is found in electron-positron
annihilation into hadrons. We have already introduced the cross-section ratio R in
Equation (16-28) and Figure 16-32 to express the energy dependence of this process
+ +
relative to the energy dependence of e~e -* n~fi The quark model takes the .
+— + —
reaction e~e * hadrons through the intermediate step e~ e > qq and lets the qq
pairs materialize into the various hadronic final states. Figure 16-37 shows this
hadronization mechanism in a schematic derivation of R. The different quark-anti-
quark pairs uu,dd, ss,.. have successive thresholds in the variable ECM Each pair
. .
e + e
+ —> q + q and e~ + e
+ — > jx + /x
+
is rather insensitive to the difference in mass between q and ju. The energy dependence
for e e '—> qq therefore becomes the same as the energy dependence for e~ e + —> ju~/x + ,
,_ M sElxf^OI 2
in 2
*- ,."l. r
o(e e -> fi fi )
,;
x (/i
| H ,e e
,*
)\
-3 E- q
\ e
^
j
<'"»>
The amplitudes in the intermediate step correspond to the diagrammatic model shown
in the figure. This a succession of rising plateaus as each qq threshold is
result predicts
exceeded. Our expectations are illustrated in the lower part of Figure 16-37. The
actual behavior of R in Figure 16-32 shows good agreement with the stepwise shape of
this prediction. We note particularly that the factor of 3 arising from color is needed
to make the agreement possible.
If we compare Figures 16-32 and 16-37 more closely, we see that R exhibits sharp
deviations from Equation (16-35) wherever the successive qq contributions have their
thresholds. Thus, as ECM decreases, the T family of bb states appears at the onset of
16-15 Color and the Strong Interaction 889
Figure 16-37
Quark model for the ratio R in Figure 16-32. Quarks and muons couple to the virtual photon
with charges Q^ and e. Quarks contribute to the sum over q as indicated in the graph of R.
Hadrons
3^
Q
11
q - u, d, s, c, b 10 3
q - u, d, s, c 3
q = u, d, s
±
10 15 20 25 iO 35
Em (GeV)
the bb term in R, and then the cc charmonium states occur near the lower cc
threshold. We can estimate the masses of the constituent b and c quarks on these
grounds to be around 5 and 1.5 GeV/V 2
respectively. It is obvious that the succeeding
,
quarks s, d, and u are much lighter than b and c. It should also be noted that the
whole complex issue of quark masses is subject to interpretation, since the quarks are
not observed as free particles.
The original problem of fermion antisymmetry was resolved in the 1960s with the
introduction of the color concept by O. W. Greenberg, and by M.-Y. Han. and Y.
Nambu. Color dynamics was promoted a decade later as the basis for the theory of the
strong interaction by Weinberg and Gell-Mann, among others.
The strong interaction accounts for the binding of quarks and antiquaries to make
mesons and the binding of three quarks to make baryons. These bound systems of
color-bearing constituents are formed dynamically in color-singlet states, as noted
above. Quantum chromodynamics is the renormalizable quantum field theory that
promises to explain such phenomena. We devote the rest of the section to a purely
descriptive account of complex and compelling theory. The underlying principle
this
Figure 16-38
Quark and antiquark color transitions with the corresponding gluon couplings. The gluons
execute changes of color represented by pairs of color indices.
Y YR v u
Quark
V
YR
Antiquark
quark. These color-changing effects propagate from quark to quark by the transmis-
sion of the color-clwnging quanta that make up the mediating strong field. The quanta
are called gluons since they are devised to account for the binding of quarks. Figure
16-38 shows how the color quantum numbers behave when gluons are emitted in the
color transitions of quarks and antiquarks. Like the photon in quantum elec-
trodynamics, the gluons must occur in the theory as massless quanta because gauge
symmetry demands this property. Unlike the photon, however, the quanta of the
gluon field are not emitted from hadrons as observable particles because the gluons
carry color, and color is contrived to be permanently confined inside the hadrons.
Quantum chromodynamics and quantum electrodynamics, QCD and QED, are
analogous theories to the extent that their principles share gauge symmetry as a basic
concept. The gauge symmetries of the two theories are different, however, enabling the
gluons to have a certain very important feature not shared by the photon. Fundamen-
tal charged particles emit and absorb photons, and so the photon is said to be coupled
to the charge degree of freedom in QED. Since the photon carries no charge, there
exists no coupling of photons to photons. By analogy, gluons are coupled to the color
degree of freedom in QCD. The gluons must carry color since their couplings to
quarks are designed to cause quark color transitions. Consequently, gluons can
interact with gluons through their own intrinsic color quantum numbers. QCD
departs fundamentally from QED in this respect and leads to a remarkable property
called asymptotic freedom as a result. By virtue of the gluon-gluon interaction, the
coupling of color in QCD approaches zero strength at arbitrarily small distances or,
equivalently, at arbitrarily large momenta. This property has been established as a
rigorous consequence of the theory in investigations by D. J. Gross and F. Wilczek and
by H. D. Politzer.
The fact that the coupling of colors vanishes asymptotically at very short range
encourages the view that the same coupling may grow without bound for very long
range. This belief holds the key to the notion of color confinement, whereby free quarks
1615 Color and the Strong Interaction 891
Figure 16-39
QQQ
I QQQ
are never seen, and observable hadrons are always found with quarks bound in
color-singlet (or zero-color) states. Quantum chromodynamics is a very complicated
relativistic quantum field theory. A few rigorous conclusions have been extracted from
the theory so far, while confinement remains the central problem to be solved.
Approximations to QCD and models of quark confinement have been rather
These studies support the idea that the configurational energy of a quark
successful.
system increases as the quarks separate. We illustrate the situation with the aid of
Figure 16-39 by showing the an incident y ray on a bound qqq baryon. The
effect of
added energy excites the quark system, producing a tendency toward quark sep-
aration, but no amount of additional energy can suffice to achieve quark liberation.
Instead, the added energy produces qq pairs in the strong gluon field so that the
originalbaryon fragments into a meson-baryon final state.
The ideas of QCD are readily translated for application to the phenomenology of
heavy-quark systems. The large masses of the constituent quarks allow the use of the
nonrelativistic Schrodinger equation in models based on a suitable central potential
energy. Expressions of the form
V{ r ) = + a. t r (16-36)
r
892 Elementary Particles
Figure 16-40.
Potential-energy model for heavy quark-antiquark binding and mass levels of charmonium.
The P !
states of the cc system exhibit spin-orbit splitting.
<:,
3
(0
Mass 1 12
(GeV/c 2 ) Spin and parity
can be applied to analyze the states of the cc charmonium system and the bb T system.
This type of potential energy function is sketched in Figure 16-40. The first term in V
mimics Coulomb attraction, in keeping with the analogy between QCD and QED,
and the second term supplies a linear barrier to simulate the effect of confinement.
The familiar angular momentum techniques of atomic physics hold for cc and bb.
Consequently, the resulting states are identifiable by the familiar spectroscopic
2
notation '
l
L. for total spin s, orbital angular momentum ( and total angular
,
and j = 1. The excited state i//(3685) occurs with the same set of vector-meson
quantum numbers. These two charmonium states are distinguished by their radial
quantum numbers n = and n = 2. A spectrum showing most of the known cc
1 states
and radiative transitions is included in Figure 16-40. A similar approach can be taken
with the bb system to analyze the family of T states.
The strong interaction, as just described, bears little resemblance to the old theory
which protons and neutrons are bound by the exchange of
of the nuclear force, in
mesons. This original problem in hadron physics is evidently not a fundamental
strong-interaction problem, simply because the interacting hadrons are not fundamen-
tal particles. Each hadron owes chromodynamics of permanently
its existence to the
confined quarks. The hadrons themselves are color-neutral, and so the exchange of
gluons does not occur from one hadron to another unless there is some overlap of the
hadronic quark systems. Hence, the fundamental strong interaction can become
operative in the force of attraction between nucleons at short range, but the complex
mechanism involves the participation of many quarks. This situation may be likened
16- 1 6 Gauge Symmetries 893
Example
Equation (16-35) is easily applied to make predictions for the cross-section ratio
R. If we let the sum over q include uu, dd, and ss terms, we obtain
2
2
3E(^) -3[(t) + (-i)* + (-;f] = 2.
When the sum is extended to take in cc, and then bb, the results are
?(*)"-HS)'-?
and
*ffl"-H-;K-
These values of R are indicated on the right side of the graph in Figure 16-37.
We observe that each of the numbers would be reduced by one-third in the
absence of color.
Conservation laws and symmetry principles have appeared throughout the chapter
with the introduction of each new internal quantum number. Every case considered so
far has been an application of a global symmetry, where the relevant quantity obeys its
conservation law in a uniform manner over all space and for all time. To cite an
example, the conservation of isospin in strong reactions is attributed to a rotational
symmetry in which the rotations in the three-dimensional isospin space are indepen-
dent of locations in space and time. The conservation of charge in all reactions has
also been treated (so far) in terms of a similarly global symmetry principle. The
interactions of particles must be described in a way that accommodates all these
conserved quantities.
A more powerful concept emerges when a conserved quantity can be associated
with a local type of conservation law. Charge is a prime example of such a quantity.
A corresponding local symmetry then becomes operative and implements the conserva-
tion law in a manner that may vary from one space-time point to another. This
variability forces the system to include a mediating field whose response to the
symmetry operation is such as to compensate for the variation of the symmetry from
894 Elementary Particles
point to point. Thus, the required field enables the local symmetry to propagate
through the system and provide a mechanism for interaction. Gauge symmetry is
another name used to describe this behavior. The mediating field is referred to as a
gauge field, and the mediators There can be no limit
of the field are called gauge quanta.
to the space-time extent over which the symmetry may vary. Therefore, the
local
compensating property of the gauge field has to have infinite range, so that the gauge
quanta must be massless. A theory based on gauge symmetry is said to possess gauge
invanance, since observable properties of the system are not altered by the local
symmetry operation.
A gauge-invariant theory defines an interacting system. The interaction medium is
furnished by the gauge field, and the structure of the interaction is dictated by the
nature of the underlying gauge symmetry. The resulting relativistic quantum field
<9E
V X B — — —^
1
——
8 1 dp
V E= •
dt e dt
and
V •
( V X B) -
1
—V
c
• —
dE
at
= ju V •
J.
When we use
dp
—
dt
+ V'J = 0. (16-38)
Note that the result hinges on the presence of Maxwell's displacement current, given
by the dE/dt term added to Ampere's law. We recall that the prediction of
electromagnetic waves also follows because of this contribution.
The last two of Maxwell's equations involve no source terms and may therefore be
construed as kinematical relations among the various field components. We can use
vectorand scalar potentials to convey the effects of these formulas and thereby simplify
the whole system of coupled equations. A vector potential A is introduced by setting
B = V X A, (16-39)
E= -V4>- —
dA
dt
(16-40)
so that Equation (16-37c) is also secured. This second result follows with the aid of the
identity
v x (v<?>) o.
We obtain a reduced, but still coupled, set of differential equations for A and <£ when
we return to the first two of Maxwell's equations and insert Equations (16-39) and
(16-40).
The potentials are not uniquely defined by this procedure. In fact, this is the place
where gauge symmetry makes its first appearance. We observe that B is not changed
in Equation (16-39) if we replace A by
dA
$' = <}>-—. (16-42)
dt
The new quantity A(r, t) is introduced as an arbitrary scalar function of space and
time in the two expressions. These assertions hold because
V X A' = V X A
and
- V¥ - —
dA'
= - v<J> + V —
dA
- - - VA = - V* -
-
dA d
-
dA
operations have no effect on the observable fields E and B, and we conclude that
896 Elementary Particles
gauge-invariant theory. Let us focus on the nonrelativistic problem and start with the
Schrodinger equation for a free particle:
— 1
Zm\
-V
/ h
i
\
J
2
*=
8
a-77 *•
dt
(16-43)
The phase of the complex-valued wave function ^(r, ) is not a measurable quantity. /
We may therefore argue that the description of the particle cannot be affected if ^ is
replaced by a phase-altered wave function of the form
ia
¥'(r, = e *(r,t).
It is easy to see that ^' also obeys Equation (16-43), provided the phase a is a
constant. Let us suppose, however, that the phase is allowed to vary with r and t and
that the observable predictions of the theory are required to remain unchanged. In
this case the differential operators act on a(r, t) as well as ^, so that ^' no longer
satisfies the given free-particle equation. We express such a variable-phase alteration
of ^ as
prevent these arbitrary effects from becoming observable by introducing the electro-
magnetic potentials to serve as gauge fields. If A in Equation (16-44) is taken to be the
same arbitrary function as in Equations (16-41) and (16-42), the gauge transformation
of A and <$> exactly compensates for the arbitrary phase variation of ty. The resulting
formalism for ty, A, and <j> therefore constitutes a gauge-invariant theory.
We have to modify the free-particle equation to accomplish these ends. If the
replacements
h h d d
ii
-V -> -V - 0A and ih—
at
-» ih —
dt
- Q<p (16-45)
i(7 v -« A
)
*"(^" a*)*- (1M6)
We now wish to demonstrate how this approach gives a gauge-invariant procedure for
the determination of ty. The gauge-transformed version of the left side of Equation
16-16 Gauge Symmetries 897
,<W*(Q vA )^ + e
,-
ft A/«
-V - Q(A + VA) *
= ,'QA/A
(QVA). -v - QaW
+ ,«W» -V - Q(A + VA) (DA ¥
-v - qa] 9.
The analysis of the right side of the equation proceeds along similar lines:
^'QA/A^
a--a*r|*-.
/ 3A
3A \ <9
/ dA \
M'
Thus, the structure of Equation (16-46) is such that an arbitrary phase variation of ^
is reconciled by the corresponding gauge behavior of A and The phase-altering <J>.
factor therefore passes through all the operations in the equation, so that 4'' satisfies
rewritten:
V - QA ^ \
+ Q.^ = ih — *. (16-47)
2m\ i I at
The familiar features of the Schrodinger equation for an interacting particle appear in
this result, as the second term on the left evidently contains a potential energy of the
form
V=Q<$>.
charge of the particle be the usual Coulomb interaction. Note that the term
and V to
Q$ty in the equation represents a coupling in which the particle and the gauge field
interact through the coupling parameter (). The remarkable conclusion to draw from
898 Elementary Particles
this demonstration is that the requirement of gauge invariance dictates both the
existence and the form of the interaction. Gauge symmetry operates in exactly the
same way in conjunction with the Dirac equation to generate the electromagnetic
interaction for a relativistic spin-^ charged particle. Quantum electrodynamics is the
resulting gauge-invariant theory.
Quantum chromodynamics is obtained as the theory of the strong interaction
through a different application of gauge invariance. We again consider local phase
variations in the spirit of Equation (16-44), and we also let the gauge transformations
act on the colors of quarks to produce changes in those degrees of freedom. There are
eight distinct color alterations possible among the three quark colors. Consequently,
eight gauge fields are needed, with gauge behavior different from Equations (16-41)
and ( 16-42), in order to compensate for the arbitrary phase adjustments of the quarks.
The color-changing aspects of the quark transformations are fundamental to the
gauge symmetry of QCD. These features set this theory apart from the simpler gauge
symmetry of QED. The result is a vastly more complex structure for the description of
the strong interaction. Eight color-changing zero-mass gluon quanta are associated
with the eight gauge fields. These gluons couple to the colors of quarks, and also to the
colors carried by other gluons, so that the gauge-invariant theory describes
gluon -quark and also gluon-gluon interactions. The latter situation has no analogue
in QED. The gluon-gluon interaction is the source of the property of asymptotic
freedom for gluon couplings at short range and is a crucial ingredient in the
mysterious property of color confinement. The gluon-gluon interaction comes about
because the various transformations of the internal degrees of freedom are not
commutative operations. Gauge theories based on noncommuting local variations of
phase are named generically after Yang and R. L. Mills, the first investigators to
make such an extension of gauge symmetry.
While one type of Yang-Mills theory implements a particular gauge symmetry
among the colors of quarks, another type invokes a different set of gauge principles for
the behavior of quark flavors. We are led in the first case to the theory of the strong
interaction, as just discussed, and in the second case to the theory of electroweak
unification. This second application of noncommutative gauge symmetry pertains to
local phase variations in which the noncommuting transformations occur within the
doublets of quark flavors
u t
a" s' V
e T
(Recall that the quark entries a", s', and b' represent rotated combinations of d, s,
this remark by itemizing the four types of quark and lepton transitions as follows:
$ = -m 2c 2 <&, (16-48)
Gauge invariance requires that we again incorporate the gauge fields A and $ and
assume the gauge behavior given in Equations (16-41) and (16-42). The interaction of
the particle with A and is then described by the modified Klein-Gordon equation
<f>
when the replacements in Equations (16-45) are made. If we define the real and
imaginary parts of $ by writing
= $, + z0 2 ,
Figure 16-41
,2 _
= „22
/i + A<&*0
hi
2
(<D,,<I>,) = m2 + a(<I>
2
+ <I>
2
). (16-50)
This hypothesis injects an ingredient of nonlinear behavior that couples the de-
termination of <J>, and <b 2 through Equation (16-49). We observe that the expression
4>*<I> is independent of the phase of and is therefore left unchanged by local phase
variations. Consequently, w 2 ($,, <I>
2 ) is a gauge-invariant quantity so that the incor-
poration of ml in Equation (16-49) does not conflict with the governing principle of
gauge symmetry. Figure 16-41 shows two versions of the paraboloidal surface given by
m as a function of <!>, and <&.,. We are interested in the lowest energy state of the
system, and we suppose that this state occurs for the smallest possible value of the
mass m. If we take the parameter ju 2 to be positive, we find
and <&
2
lie anywhere on the circle
$2 + $.
2
— with jli
2
< 0. (16-51)
A
This conclusion determines only the modulus of $, while the phase of $ is left
Example
k X
V(x)
V
= -x 2 + -x 4 .
'
2 4
This expression includes the usual harmonic term with spring constant k and an
additional anharmonic contribution with positive parameter X. The system is in
stable equilibrium at x = 0, provided k is positive. In general, we find a
dV
= F(x) = - —
ax
= ~(kx + Xx 3 ) = -x(k + Xx 2 ).
+x ,
where x
Figure 16-42 shows the shape of V(x) in the two cases. We expect to find
symmetric stable minima on either side of x = for k < 0, because V{ x ) is
symmetric under the parity operation x -* -x. We break this symmetry sponta-
neously when we choose, say, x = +x to be the position of equilibrium for
902 Elementary Particles
Figure 16-42
V(x) V(x)
k>0
small oscillations of the particle. Note that the broken symmetry is discrete, and
not continuous, in this illustration.
Gauge theory has revolutionized our understanding of the elementary particles. The
general notion of local symmetry inspires a procedure that applies in different ways to
each of the fundamental forces. Quantum chromodynamics is generally accepted as
the theory of the strong interaction, and electroweak unification is established as the
proper framework for the electromagnetic and weak interactions. These two gauge
theories, taken together, comprise the so-called Standard Model for the behavior of
quarks and leptons. No experimental contradictions and no mathematical incon-
sistencies are known to be in conflict with this combined theory.
Despite its resounding success the Standard Model is not regarded as the ultimate
fundamental theory, for a variety of reasons. The electroweak portion of the model
does not meet all the qualifications of a truly unified theory, since the precise
unification of electromagnetic and weak interactions is not completely specified. We
can see this immediately by noting that the weak mixing angle 6W is left unpredicted.
In fact, the arbitrary separate treatment of the strong and electroweak theories
involves an excessive number of such free parameters. The presumption of a 1 : 3 ratio
between the charges of quarks and leptons is another arbitrary and, hence, unsatisfy-
ing feature of the overall theory. We should be able to eliminate these elements of
arbitrariness, and gain more predictive power, by turning to a higher level of
unification based on a greater degree of local symmetry. The higher symmetry would
make the unified theory simpler and would introduce constraints reducing the number
of free parameters. Grand unified theories are designed to accomplish these objectives.
The theories are so named because they embed the Standard Model in a single gauge
theory and thus serve to unify the strong, electromagnetic, and weak interactions. This
speculative notion has an obvious appeal as the next step to take toward an ultimate
theory.
The proposed schemes for grand unification differ as to their specific details. It is
not yet possible to say that any of these proposals has emerged as the obvious choice.
One of the leading candidates is the model developed in 1974 by Glashow and H. M.
Georgi. Their theory may not be correct in all its predictions; however, there is reason
16-17 Grand Unification 903
Figure 16-43
10
1 1Q 15 10 20
Energy scale (GeV)
to believe that the ideas contained in this simplest of models are at least pointing in
the right direction.
Scales of distance and scales of energy are decisive considerations in grand unified
theories. The unification concept presumes the existence of a regime of the variables
where the forces of interest obey the assumed symmetry In this domain the
exactly.
then become aware of the large differences between the two forces via the breaking of
2
gauge symmetry when we descend in energy toward values comparable to m w c .
We illustrate this state of affairs in Figure 16-43 by showing the behavior of the
various coupling strengths as functions of the energy scale. The curves labeled W and
B refer to the couplings of the gauge-field quanta W+ ,
IV ,W~, and B°. (Recall that
the electromagnetic and neutral -weak interactions are associated with mixings of W°
and B° to form the observed quanta y and Z.) Note that the curve falls with W
increasing energy scale, so that the couplings W
diminish in strength at short range.
This property of asymptotic freedom is expected for these couplings because of the
noncommutative nature of the W gauge transformations. It is known that the W and
904 Elementary Particles
15
B curves intersect at a very high energy, of order 10 GeV, indicated on the graph by
the quantity m x c 2 The
. behavior of the strong gluon interaction is shown, with the
label G, in the same figure. This coupling also enjoys the property of asymptotic
freedom and must also fall with increasing energy scale. Remarkably, the G curve
merges with the W
and B curves in the vicinity of the same high energy determined
by rn x c 2 This coalescence of the three independent couplings is ideally suited for a
.
forces of equal strength act between seemingly similar particles. A single curve
describes the strength of this unified interaction, as indicated by the high-energy
portion of the graph sketched in the figure.
Grand unified theories treat quarks and lcptons as fundamentally similar particles
in theregime where the higher gauge symmetry is exact. The well-known distinctions
between these particles are then supposed to evolve at the lower energy scales, where
spontaneous breakdown of the symmetry takes effect. The unifying interaction em-
braces the familiar gluon, photon, and weak gauge couplings to quarks and leptons,
corresponding to the usual lower gauge symmetries with their associated strong,
electromagnetic, and weak gauge fields. New symmetries are also present in the
unified theory, because the higher level of unification includes operations that
transform quarks into leptons. Each of these transformations involves a certain change in
color and flavor, and each of the possibilities calls for the introduction of a new kind of
color-changing flavor-changing gauge quantum. The additional quanta give rise to
unanticipated interaction phenomena with couplings prescribed by the particular
higher gauge symmetry chosen for grand unification.
The model of Georgi and Glashow defines gauge transformations within the
following five- and ten-fold collections of colors, flavors, and leptonic quantum
numbers:
(u)r,y,b
{d)R,Y,B
(
u )r,Y, B
and (16-53)
(d)/i,Y,H
V
+
e
These assignments group the quarks u and d together with the leptons e~ and ve The .
other two generations of quarks and leptons participate in the model in similar
fashion. When we analyze the unified interaction for all possible quark-to-quark and
lepton-to-lepton transformations, we reproduce the usual strong, electromagnetic, and
weak gauge couplings. Some of these familiar couplings are shown in Figures 16-33
and 16-38. In addition, we also generate new classes of quark- to- lepton and quark-to-anti-
quark transformations. Some of these so-called leptoquark and diquark transitions are
illustrated in Figure 16-44. Note that the transformations occur with the emission or
absorption of color- and flavor-bearing quanta, represented by the symbols X and Y.
These quanta have Q/e values equal to \ and \, respectively. Note also that the
indicated transitions violate the conservation laws for baryon number and lepton
number. The new gauge quanta acquire very large masses via the spontaneous
breakdown of gauge symmetry. Thus, the mass of X is selected in Figure 16-43 to
2
specify the energy scale m x c above which the higher gauge symmetry becomes exact
,
Figure 16-44
d Y u UR dR
Leptoquark transitions Diquark transitions
The embedding of the three familiar forces in a single unified interaction imposes
restrictions on several of the arbitrary features of the Standard Model. In particular,
the adoption of a single coupling strength for a single gauge symmetry fixes the
relative strengths for the couplings of the G, W, and B quanta. This determination of
couplings is made initially at energy scales beyond m x c The three curves in Figure
.
16-43 can then be followed below the unification scale to ascertain the strengths of the
couplings in the longer-range regime of current experiment. The same procedure can
be used to predict a value for the experimentally measurable weak mixing angle. In
the Georgi-Glashow model, the result is
2
sin 0„,= f (16-54)
at the unification scale. The value then evolves with the coupling strengths at longer
range to yield a prediction much closer to the experimental figure, sin 6W = 0.23.
and
where the factors of 3 are due to the three colors. Both of these equalities imply the
relation
In fact, the model goes on to explain why such sums of charges must equal zero for any
assignment of quark and lepton quantum numbers. We can use this property of the
model to argue that the charge relation is a necessary consequence of the higher level
of symmetry. One more element of arbitrariness in the Standard Model is thus
removed by the adoption of a unified interaction.
906 Elementary Particles
Figure 16-45
to the same collection of interconnected particle states. Similar constructions are found
in other models of grand unification. This provision leads us directly to the most
striking conclusion of the grand unified theories, the prediction of baryon- and
lepton-number violating nucleon decay. Figure 16-45 shows two of the possible mecha-
nisms for the proton decay mode
+
p -> 7T° + e .
We note that the diagrams contain the vertices from Figure 16-44, connected by the
exchange of the superheavy X and Y gauge quanta. A prediction can be made for
the proton lifetime t\, based on the specific choice of unification model. (Actually, the
prediction is somewhat blurred by uncertainties stemming from the parametrization of
the strong interaction and the treatment of the hadronic bound states.) The resulting
formula for 7\ depends sensitively on the unification mass mx . It is remarkable that
30
values for t. in excess of 10 years are obtainable for input selections of mx larger
14 2
than 10 GeV/c . A mean life in this range is amenable to experiment, and a
unification mass in the corresponding regime is in line with the scenario described in
Figure 16-43. This consistent set of circumstances is favorable for a decisive experi-
mental test of the ideas of grand unification.
The possible violation of baryon number is an intriguing development, particularly
in view of the comparable situation regarding the conservation of electric charge. We
have learned that charge conservation is associated with an exact gauge symmetry,
whose validity can be tested by setting very low experimental bounds on measure-
ments of the photon mass. Baryon number does not appear to have its origin in any
analogous unbroken gauge invariance, and so an exact conservation law would have
to correspond to an exact global symmetry. We are inclined to view this sort of
invariance as somewhat implausible, now that we have found a scheme, operative on
an extraordinarily small scale of distance, in which quarks and leptons look very much
alike. Baryon and lepton numbers cannot be regarded as sacred conserved quantities
evidence of proton decay. Since the lifetime of an unstable proton must be very long,
an enormous sample of matter is needed to collect detectable numbers of decay events
in practical intervals of time. To illustrate, let us assume 10 30 nucleons in a ton of
material, and let us take the lifetime to be r
p
= 10
30
y. Under these conditions we
may one observable proton decay in the sample per year. One of the ongoing
find
experiments is located in an abandoned salt mine near Lake Erie. A volume of pure
water containing 8000 metric tons is monitored by phototubes to detect the light
radiated by the expected products of proton decay. The apparatus is buried deep
underground to reduce the detection of events due to cosmic-ray muons entering the
sample. Unfortunately, neutrinos cannot be eliminated, and their interactions can
simulate the desired p -> ir°e + events. In fact, the only decay candidates seen so far
are attributable to this source of background. The experiment to date has set a lower
+
limit for the partial mean mode greater than 10 32 years. This result is
life in the TT°e
at a factor of 10 larger than any of the predictions based on the simplest
least
Georgi-Glashow model.
Grand unified models belong to a general class of spontaneously broken gauge
theories that predict the existence of magnetic monopoles. These objects have been found
mathematically among the solutions of the classical field equations of the theory. The
prediction gives an enormous estimate for the mass, of order 10 16 GeV/c 2 and ,
provides a unique relation between the magnetic pole strength and the electric charge
of the particle. Such objects seem to occur very seldom, if at all, in the real world.
Only a single candidate has been seen in the laboratory, in an experiment performed
by B.Cabrera in 1982. Since their incidence is so rare, it is incumbent on the theory
baryons and antibaryons only if we postulate the known excess as an initial condition
on the Big Bang. Instead, let us adopt a grand unified theory in which B is violated
and insist on equal numbers at the start. Grand unification then takes its course,
beginning with very brief initial time intervals and very large thermal energies beyond
the unification scale at m x c Matter in this epoch is supposed to consist of quarks and
.
leptons in equilibrium with all the quanta of the higher gauge symmetry. Spontaneous
symmetry breaking eventually sets in as time evolves and the universe cools. Viola-
tions of C and CP are also introduced as essential ingredients in this scenario. Let us
illustrate their role by considering the decays of the superheavy gauge quanta, even
though these processes are not the only (or the most important) contributors to the
asymmetry. The diquark decay modes of X and X can differ because of C- and
CP-violations, so that the rates of production from
are not the same. Consequently, we can have equal amounts of X and X in the
.
equilibrium period and still generate unequal numbers of quarks and antiquaries
thereafter. A small net excess of the one over the other would survive subsequent
processes of matter-antimatter annihilation and perhaps explain why our part of the
universe contains essentially no antibaryons.
Grand unification has many satisfying and promising aspects. Nevertheless, it is
conceivable that the strong, electromagnetic, and weak interactions cannot be unified
properly until the unification of forces takes account of the force of gravity. By the
same token, it is possible that a consistent quantum theory of gravity cannot exist in
from the other forces of nature. These areas of speculation belong to the next
isolation
more comprehensive stage in the problem of unification. The final solution of this
problem is called the Theory of Everything.
Example
The energy scale of grand unification is extraordinarily large, and the associated
scale of distance is extremely small:
hi 0.2 GeV fm •
15
= 2 X 10
16
fm.
10 GeV
M 2
—h
G = Mc 2 and R =
R Mc
The combination of conditions provides a criterion for the scale of the mass M
in terms of the gravitational constant G:
5
Mc he
GM --2
= Mc 2 M, 2 _
h ~G
This argument determines the rest energy of the so-called Planck mass:
Mc = 2
n N m 2 /kg 2
6.67 X 10~ •
9
1.96 X 10 J
19
1.22 X 10 GeV
1.60 X 10- 10 J/GeV
The unification scale mxc 2 falls several decades short of this value. It is
noteworthy, however, that the unification mass mx lies closer to the Planck mass
than to the weak mass m w .
Problems 909
Problems
1. Positively charged cosmic-ray particles approach the Earth from various directions and
are deflected in the Earth's magnetic field. Give a qualitative argument to explain why
more particles penetrate the field in the polar regions and why the particles tend to arrive
on Earth preferentially from the west.
A"
A = '(K'l 1 + 2
4 Mr
where K is the single-beam kinetic energy in the first machine, and K' is the total kinetic
energy in the second machine. Take the nonrelativistic limit of the formula, and compare
with the result found in Section 15-8.
Generalize the derivation in Problem 3 to the case of unequal colliding masses. Let K be
the beam kinetic energy for a particle of mass m incident on a target particle of mass A/,
Assign momenta and energies for the emission of a photon by a free electron according to
the diagram, and show that the conservation laws of momentum and energy cannot be
satisfied. The conclusion implies that the photon must be virtual in this case. How is it
possible that the photon is real for the emission processes in Figures 15-16 and 16-4?
(P. E) (P,e)
(Po. E )
.
6. Represent the g-factor for electron spin as in Figure 16-6, and draw all Feynman
diagrams of order a and a~.
g~'/ r
8. Calculate the minimum energy of photons incident on a fixed proton target for the
photoproduction of neutral pions in the reaction y + p — w° + > p.
9. Let neutrons be incident on a fixed proton target and calculate the threshold kinetic
energies for each of the pion-production reactions
In + n + tt*,
n + p -» I n +p + it ,
\p + p + n~
Table 16-1 gives sufficiently accurate values for the masses of the particles.
10. Calculate the momenta of the final particles in the charged pion decays
+ — n* + —
it > i> and tt* * e* + vr .
Assume that the pion is at rest and neglect the neutrino masses.
11. Consider the decay of it mesons in flight and obtain expressions for the maximum and
minimum energies of the emitted y rays in terms of the it velocity.
12. The reaction used to detect solar neutrinos is the endoergic capture process
37
vr + r,7
Cl -* Ar + «".
Compute the threshold energy for incident neutrinos using atomic mass data from
Appendix A. One source of solar neutrinos is the proton fusion reaction
'H +'H - H 2
+ *
+
+ vf .
Calculate the maximum energy of neutrinos produced for protons at rest, and determine
whether the chlorine detector is sensitive to these neutrinos.
13. Neutrinos and antineutrinos are intrinsically left- and right-handed, respectively, if the
particles have zero mass. Use this fact to prove that the w" decay mode tt -* vv cannot
14. Let 77 mesons be incident on protons at rest and derive a formula for the beam energy at
the threshold for the production of a final state of total mass M. Consider the reactions
equations
'
Mi
2
A K° m u K°
c dt .K° _ V 772 . .K°
and
ih d Ks ms '
Ks
7 2
~dt Ki. 77! • KL
Use the transformation between these sets of states to derive the relations
1 + e
for d + d —* a + y. Explain qualitatively why these two reactions are expected to proceed
at comparable rates.
19. The 277 system has nine different charge combinations 2 + + 2 + 7r°, 2 + ?r~,.... How 77-
,
many independent amplitudes for elastic and charge-exchange scattering are there in this
system, and how are the amplitudes charac,c:iisd° How many independent amplitudes
are needed to describe the production of 27;' states in K~ p collisions?
Determine the ±
20. ir p cross section ratios, expressed as
+ + 2 2 2
\x(v P,* p)\ --
Ix(t~>, *"/>) I
: \x(^°n,^'p)\ ,
under the assumption that the isospin amplitudes X\/> anc' X3/2 are ecl ua ^ ar) d under the
assumption that X3/2 vanishes.
±
21. Calculate the values of the pion beam kinetic energy in it p reactions for the observation
+ +
of the ttjV resonances A(1232, V\ A^(1520, f"), 7V(1680, | ), and A(1950, \ ).
22. Show that, for a system of two pions, the isospin / and the angular momentum f are
correlated so that / and ( must be either both odd or both even.
23. Refer to the three-quark formulas for the states A++ A+ A , , , and A ", and use the flavor
substitution u —» s to deduce expressions for all possible strange baryon states. Identify the
quantum numbers T, and Y for the resulting ten hadrons, and plot their locations on a
graph of Y versus Tz
.
24. Construct all nine qq systems vectorially, using the addition of vectors in the (Y, T. ) plane.
Show that the result contains states with quantum numbers in the antitriplet pattern, and
note that a sextet (six-fold) pattern of quantum numbers remains when the antitriplet is
removed.
25. Continue the vectorial construction begun in Problem 24, and generate all qqq systems by
applying the method of vector addition to the composition q{qq). Specifically, construct
the q(qq) states by adding the triplet of q vectors in the (F, Tz ) plane to the antitriplet
and sextet results obtained from the qq system. Show that the final 9 triplet-antitriplet
states form octet and singlet patterns and that the final 18 triplet-sextet states form octet
26. What combination of quarks is needed to construct the antibaryon Z ? Consider the
photoproduction of Z via the reaction
-
Y + p -* E + X,
and deduce the identity of the least-massive particles comprising the unspecified system X.
Calculate the corresponding threshold energy for photons incident on target protons at
rest.
28. Identify all quantum numbers for the c-flavored hadron systems udc, cu, cd, cu, Id, cs,
and cs.
29. Draw P'eynman diagrams to show the effects of weak neutral currents in the processes
30. Assume a value of 40 GeV/c" for the /-quark mass, and show how this undiscovered sixth
quark may contribute in the cross-section ratio R for electron- positron annihilation.
31. Gluons change the colors of quarks and antiquarks in the manner of Figure 16-38. Show
that eight different gluons are needed to describe all possible color transitions for three
Explain the notation, assuming a central potential energy like the one shown in Figure
16-40. The explanation should include sketches of the radial functions corresponding to
the four different T states.
33. Use Maxwell's equations to deduce the equations satisfied by the electromagnetic poten-
tials A and <£ in terms of the source densities J and p. Assume that A and <j> obey the
subsidiary (Lorentz) condition
V A+ • -— 1 d<>
= 0,
c at
34. Consider expectation values taken in the states of a nonrelativistic particle of charge Q,
and show that (p — (?A) and (E — Q<f>) are invariant under gauge transformations.
35. Let the classical one-dimensional motion of a particle of mass m be governed by the
potential energy V(x) = (k/2)x 2 + (X/4)x 4 and , consider the case k < 0. Deduce the
frequency of small oscillations about the position of stable equilibrium, chosen to be at
the point x = +*,, as described in Figure 16-42.
APPENDIX
TABLE
OF
NUCLEAR
PROPERTIES
isotopic abundance (%), and the radioactive half-life t 1/2 The last two items share
. the
last column in the table, depending on the stable or unstable character of the
particular nuclide. Unstable species art indicated by an asterisk next to the value of
A. Their half-lives are given in seconds (s), minutes (m), hours (h), days (d), and years
(y). Sources of information are Chart of the Nuclides, 13th edition (1984), and Atomic
Mass Evaluation, by A. H. Wapstra and K. Bos (1977).
A-!
' ~ *
' —i • 1 1 H ' 11'1 i i 1 ;
—
t- oi COo
o IT ~ —
b
^S~ CO
O CM
,
iO
—
— cm
~
r~- C
-i
I D
CO
- o
o w CJ '
— —
in
CM
CO
CM
I--
to
CD
~ -h
:c
~j
S*
o —
~, C 1 3 cc
— ^ CM CM -r en — —
c-. r. C i
-
o 5)
+ + + + + + + + + +
—
1 1
(
O "I-
\
IO — ~ co CM IO CO co in CO CM CO -o CO - iO d OI ~
cc
— ~o to o :c CO TjH CO CO IO -r CO ~.
cr.
CO
i
Oi "» IC
— — — T C cr. 1
— d
i.~
—
i - cc
— —
~,
oS o _; ~-i CM ro
CM
d
C
iC
e
CO i
- i
- 00 oS d »H
CM C I CM C I I i
CM ? i OI OI -i eo CO
— o
cr-
OI
<r> —i CM OI CO -r
OI
m
CM
to
CM
r-~co co en
CM CO
o -h
co
CM
co
CM CM CM C I CM CM CM CM
bo
loCO X 2 < r. (X
< cr, c — -i CO * in
N
>-
x
M
-r tO
O
cc s
\ C m n >
CO
3
o o,
CD
-a
CO
X 3 c >. CO i - CO
'"' cr,
q co o cr. m_ in C 1 'CC a, '
— C 1 cr,
""I
—
CD CO —
.
- d -. d c d oS i
c i
CO o _j ai d d CO CO d d i -
o
i
O O
t
I
CO "'I"' —i -I-' —i -I ci CM
in n Co a, c, co ^_ co o OI iO, to in CO O
o in OI -r cr.
o
~.
to C -r
1
C O C o CO co CO CO o IO
o
in -r r~ cr.
O o m o m -
I 1
1
•-, -, CO co co CD ^™ c:
to co to
CO <f to-
to C to
— to C CO -
—
— CO Tf o CO CO
O
CO. CD to
o
o o
i 1
o
I
^^ CO CD ~
CD CD
o o q o o p
,
— ,
o D
O
•^ - C 1 co
q
co -r 'D 1
q o
i d d
- d q
— q
oi
re
c i
CD
CO
CD
d d
CD
iri d
— i CM CO CD r- oo o O — ' CM CM CO ^ rf in co
V
ir c X J 03 U
O — C I CO m to i-
»-^
:
-
r >*
( -a >> ~ 3 —
\v o 3 r-^ r-^
3
IT)
-a
CI co
CO CM
to r- CM
I
r--
CM
3
— >- — c- CO
-
.'
CO i tO CO 3
9 i ~ co 3 CI o
o lT) CM — 3 iO CO to c o — d
3 : i
CO
3 co
3
r-^
01 CO Oi 3 IO CM = CO - i
CO
+ +
M '» CO ^1 OI CO 1
O o o
—
o o — m 3 c 3 3 3 3 3 3 3
Oi CM -» CO CO r^ CO tO 3' CO
:D
r-
i— in to
S
i
-r 3, 3'
OI 3 3. CM CO
3 r-~ 3
m
3 3 3
— 3 CO
CI 3
X "I
3
3
-r2 c
in •z ~ CO
co <o
3. CO 3 co
CO
CO
CO r-
m 3 3 3
r-
3, -
t-~
3 3 3 in
iO 3 r-»
-v
3 3 3 3 3
i
^ ~: 3 Ol
CO CO
3 CO CO
Oi Oi
CO
3 3 co CO CO Ol
3 3<
CM CM CM
Oi 3 3
CI
3 CI CI
3 3; 3 CI
3 CI
CI
3
3;
d —4 CM CO3 CO >* iO co oi I~-oi CM oi CO -*
'3 3 '3
CO 3 in CO 3 d
m 3 m m m m m m m m lO lO c '3 3 3 3- CO r-»
—
m i CM
in
co
in
-r
in
cO
in
3 m
>n in
to
in m O
3)
to m o
CO co
<o 3
CO Tt<
to to
in
3 3 m
3 to
to
to
en
to r^
o — r-~
E a o a re
o u
U g d N
< 3 m
CI
3 r»
Cl
CO
CM
3
CI
3
CO
-H
CO
N CI CM
e~ "3
X
3 > 3 co
CM
o <
3 3 3 X
\ — m 3
3 m
!
CM r- —-( CO 3 iO CO -f
§ T3 >3
3
-
^ m' 3 3. 3 ^ 3
3 3 3
I"- m" CO CO to" cm in 3 CO r-~ CO CO CO 3
CO r~» -1 co 3 3. 3 Oi 3 3 3 r-» 3
r-i A
+ +
•~ O O "1 = C^| CM CM ~">l W 3 CM H CM O "1^1 T^ "|C O O to H ''
3 — 3 —
3 lO 3 3 m 3 CO 3
— 3 3 co
CM ~o r - ~c t~» co CI
3 3 3
— 3
,
CO 3
t-~
3 3
1
1
- r^; in 3 r-»— CO
CO 3
-i 3. CI CO
3
-
f
i—i
3 3 3 3
3 30 3 CO CO 3 r-»
3
i^~. CO
3 3 iO -r" —
3 3
-
r- 3
CI 3 CO co in -f Cl CO ^^ CI in iO CI CI
iO in 3
r~-
3 3
3 33 3 3 3 3 m 3 3
t-»
t-» 3 3 3 3
3
3'
3 3 3 3
lO
3 iO iO
3 3 3 3 3 3 3
3 3 3 3: 3; 3.
_'
co
CO
eo
-r
CO
-r
co
in
CO
to
CO
3 CO 3
3; 3 CO
CO 3 3
CO co -r
3 CO
CO -*
-1^
-f
-r <o
-V 3
CO m' r^
3 3 3 3 3
3'-
IT
CM Tf 1- in to r~- r~~
3 <y> Oi
CO * _^
3
* 3
3 3 r^
iO.
3
* to CO
^^ "^ m
—
CO CO co CO CO CO CO Tf 'f rH tJ* ICO
cc
E u
O CO 3 < ^ h >
< 2 r~- CO 3
—
-
C'
CI
—1
CM
CI
CI
CO
CI
N
4.3
1 ' 1 1 i I .
i 1 —— , ' i i 1
'i i .
o —
X 5
*
H - • -
sz
•
w CO
— ~
s
— CO
— ~ CO cn _' -r CM tO co
— co
— *
l
zz tO •
IC s
I - -V en cn -f CO -r r^ CO
&8 — CM 1- ~ CM CM - _ CO c- in CM tC — CM 30 cc -+.
c — — -0
CN —
1
CO — CO -I
r~-
CM CM I!-, -r
-T-
"I J .
CM
Tt<
+ + + t
+ + + + + + + + + * + +
r
2 -
i
-
I
• !
~ -
tO 31 - ~.i
- iO|(N o o -|CM Mcm mlo o -. -!- ^~ — 1
CVJ -i-i o CCl| CM Cl| CM
c- = £ '3 s r- ~
C I CC
- ~ C i CM 'CO C 1 51
~.
-f
CO
in
-
in
t^ 51
tO in,
m
1 - cc-
m ~r i- CC
f
co
CO
-1
~r
cc
1
to -v
tO
^n
in
CM
CO
in co
-f CM
-v <* tO O
i
<* CO
51
O
in
cn t-- i- co
IC - CC
-o -s- I in -r -
in tc -
tc m -f m m
- in - —
in co -
CO m
s w 3
in -r -
— -f co "*
o —
co
~ c C
~ ~ ~ z s c 3
~ c
51 cc ~.
C cr
51 51 51
51 51 51 51 51 51 5) 51 51 51 51 5)
CM H * in r^ to
~ CO o _; co CM -+<
- -V in -
-
to r-' CO
- — CM CO CM -r
o o = O o O o O
i
CO tJ>
51 51
m
51
to co
51 51
i
51
-
51
51
—
O CM
O O"f i cn
o o
>~ in to co
o o o ooo _—i—*
r^ co oi cm co coin
— — <
o ->
C
2 * 2 2 Oh < U
to - CC
CM
-r
CO
-r -r -r
l
-r — 5l
'- 0 "0 -a
\ CO CO
n 51 i^~ to CO >- m i
- co
m -+• in 1 - i
in to CO to o cn cc CO m to CO
-~
3
- CI co
c s
I ". o
O 51 cn 51
C 1
O
in
51 "-; 1
in
- i
«• CI
i
-
-r 51 CM 51
CC -1
tc
3
re
re in
r^- r~-
II II
~ o o o
t
o o
t
«| cm eo| cm
+
co
+
O O
t
in| cm «| cm OOO
+ + +
-r OOO
CC' o r~ o co •V to
o
_^ to
co
O
a,
eo CO in
CO O
^^
cn
51
'CC
TO
i
to
-
5l
— to
-f
cn
CC
zz
in
CO
O I
co
s- -r"
in CO r^
C CC1
eo
51
51
in.
CM
in CO in
C I
cn -i -r in
— c tc /- — CI to I
s-
-
in X Is- c
l~
1
r n
'
i CM C i C / C
I
I
1
-
^-i ,— — 1
— c ^
c ~. cc ec 3 CC zz rc
51 51 51 51 51 51 ~, 51 51 51 51 51 51 51 51 51
51 51 51 51 C7 CT,
C7; cn , ;
r- -r -fc r- 51
:
CO O ^-]co in -r to m' i
- 51 i
- CC cri i — CO
~,
to - - i j i i - 1 - r". r^ I - CO CO cc CO cc :c :c cc co CC :c CO 51
ON*
r^ r^ r^
Tf
I
s-
in
r^ I
in
s-
co
1^-
o
CO
51
r— oo
i
CM "*• tD
CO CO CO
in
CO
1
cc
- to CO
00 CO 51
o CO
CO
51
CO
ON*
51 51 51
E ^3 •—
% r t2 r. > N
< C I CO -r m tc r^
CO
-J-Z 51
co
o
*
cn CO CO eo 80 CO
N
4-4
>.
>>
- o O
= £ >s X
\ co
-f <
Ol
CO
CO
«
Ol
>> CO
-
ID
i- r- CO CM
I
-r
- lO
co CO iT) o CM
so
so
co o
- CM
-1 r-- r-» iO ^^ lO CM
CI CM
I - CI
in
o
CM CM CM c CM in
CM CM
CO
CM
A
\ +O +O o o o
+ + + +
o o o
+
o
CO n- CO cc in — -. CM -h CI -r r^ Ol CM 1-
r-» .
—
1 i
-r CM v- c r~ cc r^ CM cc -1 ^^ ^~ -_ —- r~
m ~ ~. r»
c~» . 1
r~-
>~ CM ^-* CM
CC.
m r»
r-»
CO
o o ~ - r» ~ — CO
—
r:
.
CM
.— -V
— -f
i-H — C I
Ol
—1 CI CI
r-
CI Cl
-r
CI CI
to
CI CM
Ol
CM
Ol Ol ~; Ol 01 Ol Ol ~. Oi Ol Ol Ol Ol C7i CXj CC; Ol Oi Ol Ol Ol Ol
oi __ CD
~ ,-H CO in -r m to — co o CM in r-~ Ol co Ol — CM en
co -f co -f -f -r ~r -+- -r "* m in in iC in in in LlC m to CC CO
O O — * r- cm -h o o IN co ^
CM
^ ^ co
cm
^ * in
->f
m m CO
m co co
in in co
Ol
m co co co co
E a/ T3 s R XI -O >
U £u 2 a. cc W H u
< CD Ol o _ CI en -r- m to
N m ,n CO (O (O to tc to to
>
>• >•
^^ _
e\J
o o
CN r^ o
—
c
\
e~ > X X V
> -r
X
\ i^- ^n «* co r~
CO
r~
in
Ol in Tt< to -V C 1 Ol
~
—
1
-.
m
CO
en
CM
LJ
r-~
CO
o
'
Ol
—
b^ -r -f C I r-» CM CM oo in cm c
c
— CO
C
—
i
CI
cc
CI —
o
CO
r- i— -H
1^
— O)
Ol
CI co in Tf 1
'
I 1
+ + + + + + + + + + + + +
c + + + +
~ + i -i
-i
— ~ •r r- CI to
~ in en. lC to tO to -t- to
— ~
-T CO
CI r
Ol
lC --H 'O en
'CO
r-» :o :c r—
t~»
-r CO i^ L-n ^- en — -f
>~v
-f
71 CM co r
i
i CI CO
en
-+• CI
to
-r Ol r-C —» in
•V
in
C
r^
in 00
in
CI
in
t-H
---
CO
to
o — ^,
en in -r -
-r -r
O c -r
-f -t- r
-r
3
^^
:
— o o o c c
Ol Ol CT- Ol
C7.
Ol Ol
c
Ol Ol
c O
Ol Ol
o
Ol
_
Ol ,Ol
,o O
Ol J^
_
rn Ol Ol or. 01
S on o fM -t- in r- Ol to M co o — CI to i-n o 1
- r- CO
CI CI CM CI C) CI CI CI CI CO en en CO CO en en. co CO
c -h co in CO CO o r^ O) Ol — CM ' co r~ co r~ co CO Ol
CI CM CM CM CM CM CO CM CM CM CO CO CO CO CO CO CO CO CO
E c V
o CO H i— CO
< c — CJ en -r m
n
CC.
m
i -
m m
>
\ m in in in
4-5
— . 'i .i . < , i i 1i i i 1
' <' 1 i 1 «
cm
\ >>
"3
e" ^ £ -a
\ ~ -r 3 CO i * 31 00 CO
-V
— —
qs* CO
in
to
-v
CM co
in I s *
to
r^
Tt 1
— — "*
22 ,
3
|
6§ d id — I
- ~i CM co in 3 CO 31 si co d -r 21 CM in
CM rh CO tO CO CO CM o I
i— CM CJ 21 r^ 21 21 m A
o. + + . (
1
1
+ + + i
+ + + + + 1
-
3 o O
i
©
l
c c
i
-i 21 I - ~r -H m to to
m co <3
m in o I
s* CO to r^ ~, 22 in CO in
-r -r
-f
I*»
-f
3)
in
-T
31
co
to
31
- 31
r*.
.n i^-
CO
CM
co -V
CO £ -r
CO
in cm
co * to 22
3.
in
3
3 21
i
~V
o*
i
-t-
:c 02
m
•— c 2 i
CM "* -r to co CO CO 21 CO Tf -f in to o -f
3 in
~ '3
3 31 to to to '3 tO !0 to c '£ I
s * i^ r- t~* r^ s
I * cc CO
3
I
* *
31 3 CM — CO -r in to cr, C2 CM CO •* m '-0 31 3
~! 00 o 3. a.
i
— 3 en a
t**
cr, 31
cr,
•cr, C2 O o o o o C*«
3
CC'
o O
CM CM CM CM 21 21 CM CM 21
5 23
z S-i
Oh < H 32
< 3 >
I
s*
-
02 3,
* co
s CO
i
~i
CO
CO
CO
\
1 1
t*« I
>- >
o -
**
b
—
i
3
-—
.c >*
o m CM c 3 X CO s* 3 X
CO to 31 CO 31 31 CO -v i
-
to — C 1
en in CO to to
I
* m
l~ o
o
'3
CM
en
en
C 1 to
C
o
o
^ —I r CO
r**
31
CO CO
^^
i
-
- m'
CO
oi
31
*
—
to
7-1
CO
co CM
d r~
CO
-r
C I 1 C I
-. + + 1 + + i
+ ( l
+ + + + + + + +
o o o
1
o o o o
1 I
1
1
•
1 r*|M -IM -|M inloi r*|c 1 . N r*|M CO o 3 3 '"1 M iO| M
z, to
o,
C
a,
1 1
-V
-
31
to
CM
i—
CM
-f
31
-
CO
3
o
to I
s- 02
- 31 r^
, — ~.
I
-^
s* m o*
31 I
s
m
3
CM
CO to
in
in
3
-t-
© f m —
i
s* 21
en C C co CM co C CO I to to 31 -I 3, CO 31 t**
* O
o m m
I
^_^^
O o
I
3 CM CM e Tf o tO cc CO - CO co to s
I CO 3 -f CM
m m m
i
CO CO CO CO CO co CO CO cr CO -t -r -r -r -r tj<>n -v
a, 31 a. 3 31 31 31 cr, 3 31 31 31 31 31 cr, 31 31 31 3 31 3131
J -r
tO
ifj
to
•
to
o to
to
f
to
-
CO
to
d _!
r^
CN
I
s*
CO
I
s-
~r
i -
in to
i *
r^ 31
r^ r*-
d
CO
-I
CO
_
X'
co
CO CO
iri -t-
co
to
CO
* K * *
-r m
to
to
tO
to
to
r*-
tO
co
to
31
to
—
,
r**
CM
I
s*
CO
r^
tf
I
s*
in
i *,
to
i-
i
I
*.
s*
CO
r^ cc
C -h
CO
CM
CO
21
02
-T to
02 CO
in
TO
I
CO
s*
6 -
© X s X h £ &
< r*-
to
co 3,
tO
©* — -1 so -r
s*
ms*
N •-D i t-* r*. I** I I
/1-6
m > P» >
C en
~
-r
>•
— Xo CC
<— a
C r>
in
2
Cj
s X X X
o
T3
l~ X >- x >
C CO o :
*
— o > —
I
<f lO.
>. > > >.
q
— q- *
-r co "* (N q O
in
d
i i c\i c i
-1 o- CO -1
CO
zz
r»
CO CO
f-H
c c
- CM —i
in
CM VO
CM r-
co
o
in -f CO *! CO co CM
lO 1
—
+ + + + + + + 1
+ + 1 + 1
+ i-
t + i
'-
- 1 r^|CM O .n|cj m|«N -1- o o 1- - r M O O ,.- 1
- i-i~> -[ c
C i-i
-
r~ -| N C7)|C
CI CO CO Cl CM Ci o » CO in o -t- CO Cl CM CO
o
o c
1
o
i
CO CO CO C 1 CM CC -r c F>H Cl Cl c
r^
-v
i-
-r T— -r
CO
CC
cc
in
CO
in
cc
CC -r uC -r in in i.C to lO lO •~c r-» i-~ :c
q o o q o q o o q q o O q o C q q o q cc
co iX co 05 co d __ co -r CO i^ oS oici CO-r CO -
-f *
i
CC co CO
r--
CO
ci
cc. CC -f -r
•X<
~r *
Cl Cl Cl
-r
CI
-t-
CI
-r
CM
in
Cl
in
Cl
in
CM
m
Cl
in
~i
Cl CI CM C 1 Cl Cl C 1 CI
CO m co r^ — O -V -H co ^ CO r~ CM CO-* co r^-
CO co CO CO CO
CT-
co f -V -r T*l
Cl
•^ct^i
CM N ^
CM
i
C"!
"*
(N
1
^
CT>
in
CMCM
in in
CM (N
n
CM
in
CM
CM CM CM CM CI CM CM CM CM
E a 6
o D 2 £ 1 5 £ D W
< Cl CO -V m LO
',
r»
Ci
CO cc
cr
o
o
cc cr,
\ Oi cr, cr,
- "0 £ -C
r.
a. m -r CC'
t- -a
\ > co
— Cl —
Cfl
CM c -r
>. CO
t^
r- CO
>-H
q
m •+
>- CI CO
Cl c
CO ID co CO r-~
r^ c m * _ ~ _ X o m — CO r--
£ Cl
c
CC CO
CO
CO CM
CM —*
rt cc
i.C
CI o CM CM
cc
+ +
« - |c< o o o o i
o o >n| cm inl M O
_ _ ~ —
CI r- ^_i c
— to
CI
CO CO -f I-
o
CO
~ CC;
i.C Cl
CO
i.C cr,
-r
in
in
co
-r
-r
CC r^
-f CO
r-
cn m CO
r- r-»
co in
-r
CI
CO
I- nC -f —
c:
r-~ o
—
r-
— CM o co C 1
- CI co r-^ CM 1 — r-- -r CO CO in i
~~ , . -£ CO i.C cc
o — T— — ci C; CM — CO cc CO CO CO -r
CC co
CO CO o
co
cc
CT)
q C q
I
q q
1 .
cc ZZ — q o q cc q cc
30 oS CO d ci d ci — cc co JD CO i^ CO oS— ,
CM — CO
— * Cl CO CO co CO
3 o
-
—I
CI
~—
Cl
CM CM
CM CI
CI CM
CM CI
i Cl CI
CM Ci
CI
Cl CM
1
CI C 1 Cl CM Cl
i CM CI C I
C — CO O CM -i CO CO (£> CO r- co CD — CM — < CO
— CO
'
ffi
O — '
i
CM CM CM CM CM CM CM
CM CM CM
CM CM
CM CM
CM
CM
CO
CM
CO
CM
CO
CM
CO
CM
CM CM CM CM CM CM CM CM CM
U n
o ^ Jh <
m c co 01 c
co CO cc co CC ~.
N
A-7
BIBLIOGRAPHY
Some of the following references contain material at a higher level. A suitable adjustment of
Parallel Resources
Eisberg, Robert, and Robert Resnick. Quantum Physics of Atoms, Molecules, Solids, Nuclei, and
Particles. New York: Wiley, 1985.
Feynman, Richard P., Robert B. Leighton, and Matthew Sands. The Feynman Lectures on Physics,
Ford, Kenneth W. Classical and Modem Physics, Vol. 3. New York: Wiley, 1974.
McGetvcy, John D. Introduction to Modem Physics. New York: Academic Press, 1983.
Richtmyer, F. K., E. H. Kennard, and John N. Cooper. Introduction to Modem Physics. New York:
McGraw-Hill, 1969.
Tipler, Paul A. Modem Physics. New York: Worth, 1978.
Weidner, Richard T., and Robert L. Sells. Elementary Modem Physics. Boston: Allyn and Bacon,
1973.
Young, Hugh D. Fundamentals of Waves, Optics, and Modem Physics. New York: McGraw-Hill,
1976.
Mathematical Sources
Boas, Mary L. Mathematical Methods in the Physical Sciences. New York: Wiley, 1983.
Dwight, Herbert Bristol. Tables of Integrals and Other Mathematical Data. New York: Macmillan,
1961.
A-8
Bibliography A -9
Background Readings
Childs, Herbert. An American Genius: The Life of Ernest Orlando Lawrence. New York: Dutton,
1968.
Curie, Eve. Madame Curie: A Biography. Translated by Vincent Sheean. Garden City: Double-
day, 1937.
Fermi, Laura. Atoms in the Family: My Life with Enrico Fermi. Chicago: University of Chicago
Press, 1954.
Heisenberg, Werner. The Physical Principles of the Quantum Theory. Chicago: University of Chicago
Press, 1930.
Hermann, Armin. The Genesis of Quantum Theory (1899-1913). Cambridge, MA: MIT Press,
1971.
Jammer, Max. The Conceptual Development of Quantum Mechanics. New York: McGraw-Hill, 1966.
Pagels, Heinz R. The Cosmic Code. New York: Simon and Schuster, 1982.
Pagels, Heinz R. Perfect Symmetry: The Search for the Beginning of Time. New York: Simon and
Schuster, 1985.
': The Science and the Life of Albert Einstein. New York: Oxford
University Press, 1982.
Schwartz, Joseph, and Michael McGuinness. Einstein for Beginners. New York: Pantheon Books,
1979.
Swenson, Loyd S. The Ethereal Aether. Austin: University of Texas Press, 1972.
Thomson, George. The Electron. Oak Ridge: United States Atomic Energy Commission, 1972.
Classical Sources
Sears, Francis Weston. An Introduction to Thermodynamics, the Kinetic Theory of Gases, and Statistical
References on Relativity
Kacser, Claude. Introduction to the Special Theory of Relativity. Englewood Cliffs, NJ: Prentice-Hall,
1967.
Mermin, N. David. Space and Time in Special Relativity. New York: McGraw-Hill, 1968.
Taylor, Edwin F., and John Archibald Wheeler. Spacetime Physics. San Francisco: W. H.
Freeman, 1966.
Reif, F. Fundamentals of Statistical and Thermal Physics. New York: McGraw-Hill, 1965.
Bransden, B. H., and C. J. Joachain. Physics of Atoms and Molecules. New York: Longman, 1983.
Fano, U., and L. Fano. Physics of Atoms and Molecules: An Introduction to the Structure of Matter.
Herzberg, Gerhard. Molecular Spectra and Molecular Structure: I. Spectra of Diatomic Molecules. New
York: Van Nostrand Reinhold, 1950.
Karplus, Martin, and Richard N. Porter. Atoms and Molecules: An Introduction for Students of
Pauling, Linus, and E. Bright Wilson. Introduction to Quantum Mechanics. New York: McGraw-Hill,
1935.
White, Harvey Elliott, Introduction to Atomic Spectra. New York: McGraw-Hill, 1934.
Amoros, Jose Luis, Martin J. Buerger, and Marisa Canut de Amoros. The Laue Method. New
York: Academic Press, 1975.
Ashcroft, Neil W., and N. David Mermin. Solid State Physics. New York: Holt Rinehart and
Winston, 1976.
Blakemore, J.
S. Solid Stale Physics. Philadelphia: W. B. Saunders, 1969.
I >obbs, E. R., and G. O. Jones. "Theory and Properties of Solid Argon." Reports on Progress in
Physics 20:516-564(1957).
Feynman, R. P. "Application of Quantum Mechanics to Liquid Helium." In Progress in Low
'Temperature Physia, edited by C.J. Gorter, Vol. I. Amsterdam: North-Holland, 1955.
Kittel, Charles. Introduction to Solid Stale Physics. New York: Wiley, 1971.
London, Fritz. Superjluids, Vols. 1 and 2. New York: Dover, 1961, and Wiley, 1954.
McKclvey, John P. Solid-State and Semiconductor Physics. New York: Harper & Row, 1966.
Silvera, Isaac F., and Jook Walraven. "The Stabilization of Atomic Hydrogen." Scientific
Ziman, J. M. Principles of the Theory of Solids. Cambridge: The University Press, 1965.
Bibliography A- 1
References on Nuclei
Bethe, Hans A., and Philip Morrison. Elementary Nuclear Theory. New York: Wiley, 1956.
Blatt, John ML, and Victor F. Weisskopf. Theoretical Nuclear Physics. New York: Wiley, 1952.
Enge, Harald A. Introduction to Nuclear Physics. Reading, MA: Addison- Wesley, 1966.
Evans, Robley D. The Atomic Nucleus. New York: McGraw-Hill, 1955.
Mayer, Maria Goeppert, and J. Hans D. Jensen. Elementary Theory of Nuclear Shell Structure. New
York: Wiley, 1955.
Griffiths, David. Introduction to Elementary Particles. New York: Harper & Row, 1987.
Halzen, Francis, and Alan D. Martin. Quarks and Leplons. New York: Wiley, 1984.
Perkins, Donald H. Introduction to High Energy Physics. Reading, MA: Addison-Wesley, 1982.
Powell, C. F., P. H. Fowler, and D. H. Perkins. The Study of Elementary Particles by the Photographic
Method. New York: Pergamon Press, 1959.
Ryder, Lewis. Elementary Particles and Symmetries . New York: Gordon and Breach, 1975.
Cohen, E. Richard, and Barry N. Taylor. The 1986 Adjustment of the Fundamental Physical
Constants: A Report of the CODATA To.k Group on Fundamental Constants. Elmsford, NY:
Pergamon Press, 1987.
Herman, Frank, and Sherwood Skillman. Atomic Structure Calculations. Englewood Cliffs, NJ:
Prentice-Hall, 1963.
Lederer, C. Michael, and Virginia S. Shirley, eds. Table of Isotopes. New York: Wiley, 1978.
Moore, Charlotte E. Atomic Energy Levels, Vols. 1, 2, and 3. Washington: National Bureau of
Standards, 1949, 1952, and 1958.
Particle Data Group. "Review of Particle Properties." Physics Letters 170B:l-350 (1986).
Walker, F. William, Dudley G. Miller, and Frank Feiner. Chart of the Nuclides. San Jose: General
Electric Co., 1984.
Wapstra, A. H., and K. Bos. "The 1977 Atomic Mass Evaluation in Four Parts: Part I. Atomic
Mass Table." Atomic Data and Nuclear Data Tables 19:177-214 (1977).
Besancon, Robert M., ed. The Encyclopedia of Physics. New York: Van Nostrand Reinhold, 1985.
Gray, H. and Alan Isaacs, eds. A New Dictionary of Physics. London: Longman, 1975.
J.,
Wesley, 1981.
"
ANSWERS
Chapter 1 Chapter 2
d, + d, u
2. 1.72 X 10' 7
W, 3.82 X 10
26
W,
1 I V-'
A c
5770 K, 502 nm
3 m, in, tan V;AT) *
iz^ + n
;
3.
5. /(> X 10
5
m 10. 355 nm, 1053 nm,
r) _1
15. 5.93 X 10 eV, 5 X 10™ s
13. (20 m, 20 m/c)
and (10 m, 10 m/c)
18. 5 X 10
19
s ', 1 X 10
18
m -2 -s"
19. 5.80 X 10 6
m, 5.80 X 10"" m
21. 140 keV, 95°
16 1 mA, ,;; mA 25. 2(1 + m/M)mc 2
20 e + Je - m y 2
\/2
'
/l - ,/
2
/2c 2
'
26
E - /£ 2 - m y + 2e'
»/2b' o' /l - z/
2
/2c 2
7.0 GeV
1 + v'
2
/2c 2 '
1 + »' 2 /2c 2
v/2
Chapter 3
26. 1.13 X 10 "'
23
1. 6.6 X 10 moles" 1
2 2
11. 0.74 s"'
35. /(m, + w,,) + 2K m.,/c
l
12. 1.90 X 10
H
" m, 5.90
5.9C X 10
28
m" 3 ,
A-12
Answers A- 13
-27 -
16. 7.27 X 10 kg m/s, 4.35 m/s 27 E„ E„.\ \cc
17. £4 -» £ 3 ,
2m 2
-3a 2
2hm
28. 0, (-) ,0,(--) 2
£„ -» £ 4
(n > 6), 24 m a
£„ -» £2 (» ^ 5) 5a A
3. 134°, 78°
— (-1 +
* /
/l + 2EL2 / ib 2 l )
4. 110°, no
for E>
5. 3.97 X 1
' "'
W/nr,
1.73 X 10~ 6
N/C
2i>/?
— 2
(**
a<» a*
*)
7. 5 MeV
km
8. 98 MeV
9. 4£ , a /4
h- m m
I
2h J~2H
f..
2fiR
— r
2
(
—2 ) , sin —
2
(f>, no
„. , 3.67 X 10" lD
m
y m V i 10. 0.015 eV, 28 jam
kd
14. \\A\
2
cos-( — sin0) 11. 0, A
12. l/4w
17. 8 In 2 15. 0°,180°;
m 2 2
A(ff!,|a,| + w2 a 2|
6.
1
+ —
2m
1
»Vi(4 + i)
l
- 4(4 +
)»
i)|-|«ifl2i,
4
fl 77 k\m - m 2 t
\
\a a.,\ {
7. + 1
2 ma Chapter 7
9 kl
2
+ kf = l
1 2wx , _ I.
— Z - 2
-^(an--* -4 '*'''*
/t
10.
m, 4
]/a a
(2a )- ,/2
3
3 "X 2.
-9,^/A,
+
,
A - 14 Answers
Chapter 10
14.
h } h
\
22. fi H Bm }
R = 0.113 nm
23. 9.43 X 10" 7 eV; 22. 0.128 nm
±11.59, ±8.28, ±4.97, ±1.66 24. 1.0, 1.3
times 10"
H
eV for j = \\
Chapter 1
4. 96, 1,9
Chapter 9
11. 0.1 K, 6 X 10
5
K
2. 79.0 eV below
13. 6 X 10"
3. - 77.5 eV 21
17. 3.6km/s, 1.2 X 10 J/K
4. 6.s, 4/, 5d, Sp
18. 6.2 x 10" K, 3.3 X 10
a J/K
~'
3? r Chapter 12
, ,
11.
4 4 I 2
'• (3) 2 2 2;
12. 1.85 eV, 3.37 eV, 3.84 eV, 3.88 eV, a 'v^fl '3v^ a
1 54 eV, 4.53 eV, 4.54 eV, 4.54 eV (b) -6e, -4e, -3e, -ze
13. 6.3 /xeV, 2132.4 fieV 2. (a)6,6;(b)£
16. 4, 3 (three ways), 4. (a)fl
3
,a
3
/V^,4a 3 /V3;
2 (four ways), 1 (three ways), (b) -6c, - 12c, -8e
1 7 7 J '
P 8. 1.6 X 10 6 m/s, 31 nm
19.
!
F2 10. 5 X 10 " s
Chapter IS
10~ 6
20. 8 X
4. 0.730 g
Chapter 13
8 5.302 MeV, 4.781 MeV
9. 5.304 MeV, 4.602 MeV
1. -0.9 m/s
11. 1.711 MeV
2. 131 m/s
6
13. 0.709 MeV, 1.144 MeV, 0.122 MeV
7. 3 X 10
15. 1.312 MeV, 1.505 MeV, 0.483 MeV
8 (a) 3.6 jU-eV;
_3 19. 4.67 neV, 1.95 meV, 81.4 m/s
(b)4.0 X 10- 3
J/m !
,4.3 X 10 J/m3
21 -1.191 MeV, 4.43 MeV, 1.531 MeV
9. 15 nm
25. 13.11 MeV, 13.89 MeV, 15.39 MeV
13. 2.6 X 10~ 9 T
26. 11.85 MeV, 7.48 MeV
15. 47 mK
27 0.0235 eV, 2120 m/s
Chapter 14
28. 22.8 MW •
h
29. 195.9 MeV, 185.4 MeV
1. 51 MeV 30. 0.189 MeV
2. 6.76 fm, 1240 MeV 31 22.37 MeV, 8.683 MeV
4. 20.18
33, 1.943 MeV, 1.199 MeV, 7.551 MeV,
5. 1.007825 u, 2.014102 u,
7.297 MeV, 1.732 MeV, 4.966 MeV
15.994915 u
Chapter 16
6. 8.551 MeV, 8.790 MeV,
8.505 MeV, 7.868 MeV, 8. 144.7 MeV
8. 39.9632 u, 55.9359 u, 9. 292.6 MeV, 279.9 MeV, 287.0 MeV
119.903 u, 207.987 u 10. 29.8 MeV/c, 69.80 MeV/c
9. 0.367 MeV, 0.4498 MeV, 12. 0.813 MeV, < 0.420 MeV
0.5590 MeV 14. 172.4 MeV, 904 MeV
12. 6.738 MeV, 7.368 MeV, 17.2 MeV 19. three, two
20. 1:1:0 and 0:2:1
2MRi
v,
13
2 (tt)
'
21. 190 MeV, 612 MeV,
h 9tt
Space-travel cartoon, page 7. From Einstein for Beginners, by Joseph Schwartz, illustrated by-
Zeeman lines, page 376. From Introduction to Atomic Spectra, by Harvey Elliott White, published
by McGraw-Hill, 1934. Reproduced with permission of McGraw-Hill.
Dirac, page 421. Permission of AIP Niels Bohr Library.
Pauli, page 441. Courtesy of Philip Rosen.
Rotational spectrum, Figure 10-19, page 531. From Physics of Atoms and Molecules by B. H.
Bransden and C. J. Joachain, published by Longman, 1983. Reproduced with permission of
Longman Group Ltd.
A-16
Photo Credits A- 1
Vibrational -rotational spectrum, Figure 10-20, page 531. From Atoms and Molecules: An Introduc-
tion/or Students of Physical Chemistry by Martin Karplus and Richard N. Porter, published by
W. A. Benjamin, 1970. Reproduced with permission of Benjamin/Cummings.
Molecular electronic spectrum, Figure 10-21, page 532. Photograph by R. Colin. From Physics
X-ray diffraction pattern, Figure 12-14, page 586. From The Laue Method by Jose Luis Amoros,
Martin J. Buerger and Marisa Canut de Amoros, published by Academic Press, 1975.
Reproduced with permission of Academic Press.
Strange-particle processes, Figure 16-17, page 8">0 From Nuclei and Particles, by Emilio Segre,
Anderson, C. D., 822, 830 Dirac, P. A. M„ 420, 821 Hartree, D. R., 446, 451
Andronikashvili, E. L., 649 Drude, P., 587 Heisenberg, W. K., 182, 195, 219,
Aston, F. W.. 683 Duane, W„ 107 668, 729, 822
Au scr, P., 164 Dillons, P. 1... 561 Heitler, W„ 509
Avogadro, A., 73, 123 Dunoyer, I.., 400 Hermite, C, 249
Dyson, F.J., 823 Hertz, G. L„ 168
Back, E., 426 Hertz, H. R„ 2, 100
Balmcr, ]. )., 146 Einstein, A., 2, 7, 29 7', 99, 107, Hess, V. E, 816
Bardecn, J„ 615, 635, 650 124, 171, 182, 539, 559, 560 Higgs, P. W„ 899
Becquerel, A. H.. 741, 751 Hofstadter, R., 679
Bcthc, H. A.. 806 Fairbank, W. M„ 874 Hund, E, 488
Blackctt. M. S„ 818 P. Fermi, 465, 668, 767, 796. 8 12
E., Hunt, F. L„ 107
Bogoliubov, N. N'., 648 Feynman, n,. P., 648, 823, 843
Bohr, N. H. D., 122, 182,268,668, Fitch, V. L„ 856 Iliopoulos, J.. 878
790, 796 FitzGerald, G. F„ 6
Boltzmann, L. E., 1. 76, 538 Fock, V., 446, 451 Jensen, J. H. D., 720
Bom, M„ 219, 226. 502 Franck.J., 168 Joliot.J. F., 742
Bose, S N . 465 Fran/, R., 595 Joliot-Curie, E. 742
Bi.iKK. W. H., 1115, 584 Fraunhofer, |. von, 145 |osephson, B I)., 658
Bragg, W L„ 105, 584 Frisch, O. R , 796
Brattain, \V II .
i>1 5 Kamerlingh Onnes, H . 634, 638
Bun, (... 791 Gamow, G., 755, 770 Kirchhoff, G. R , 76. 93
Brillouin, 1, ., 602 Geiger, H. W„ 132, 1 H Klein. O., 827
Bioun. R . 123 Gell-Mann, M., 843. 851, 854, 861, Kusch, P., 400
Butler, C C. 851 870. 877, 889
Georgi, H M„ 902 Lagrange, |. L., 90
Cabibbo, N . 877 Gerlach, W„ 390 Lamb, W F ... 123
Cockcroft.J. 1).. 784 Glashow, S. L„ 878, 885, 899, 902 Larmor, 383 |
,
1-1
1
Meitncr, I. . 796 Rayleigh.J. W. Strutt, 85, 94. 165 't Hooft, 6„ 885, 899
Mendeleev, I). I.. 12 I, 156, 668 Reines, F., 762, 771 Ting, S. C. C., 880
Michelson, \ A . I Richardson, R. C, 635, 664 Tomonaga, S. I., 823
Millikan, R A . 101, 128, 874 Richter, B., 880 Townes, C. H., 174
Mills. R I Rochester, <; 1) , 851
Minkowski, II , 60 Roentgen, W. K , 103 Uhlenbeck. G. E., 393
Morley, 1 U . I Rubbia, C, 885 Urey, H. C, 667
\lo[ sc. P, M., 525 Russell, II. N„ 482
Moseley, H. C.J., II)'-'. 447, 668 Rutherford, E., 122, 132, 61*7,669, van dcr Meer, S., 885
Mossbauer, R. 1... 781 741, 746, 751, 784, 815 van dcr Waals.J. I).. 517
Rydberg.J. R , 147 Veksler, V. I, 817
Nambu, \ 889 .
Nishijima, K., 861 Salam, A . 885, 899 Weinberg, S„ 885, 889, 899
Saunders, F. A., 482 Weizsai kei , ( I v< in, 689
Ochsenfeld, R . 638 s< hmidt, I.. 725 Wheeler, J. A , 796
Oppenheimer, ) R 502 Schrieffer, |. R 635, 650
,
Wiedemann, (.'•
II, 595
Osheroff, I) I) ,
t, .
771 Atomic shell theory, 147, 466 834, 839, 842, 876
Absorption edge, 159 Atomic spei ti 3" decay, 759, 763
Accelerator, 078, 784, 810. 817 Atomic weight, 382 |T decay, 759. 763
7
Acceptor, 613, til ) Augei effect, 164. 777 (3 radiation, 667, 741, 752, 759
A< i, 170 Autoionizatic n. 165, 470 b flavor. 880
At tivation enei g) ,
Oos Avei age e • i g\ . 85, 93, 9 I. 556 Big Bang, 79. 836, 907
A. th u\. 746 Average value, 200 Binan fission. 796
Addition of angulai momenta, 405, Avogadro's number. 125, 178. 687, Binding energy, 447, 455, 512, 577,
418. 438, 483, 721, 755, 776, 749 652, 682, 689, 700
802 Axial vector, 838. 843 per nucleoli. 686, 797. 804
Aether, 2. 0, 8 Biot-Savart law, 410
Age of the Earth, 743, 750 Balmer— Rydberg formula, 147, 156, Blackbody, 75, 80
Alkali atom. 447, 455. 470 344 Blackbody radiation, 75, 93, 172,
Allowed region, 242. 252. 303 Balmer series, 146, 157, 370, 387, 538, 559
a decay, 740. 742. 751, 790 139 Bohr atom, 122. 148. 149
a particle, 132, 667, 735, 741. 751 Hand, 527, 502, 570, 596. 006 Bohr formula. 149, 155, 350, 368,
Ammonia molecule, 283, 5 I I degradation, 532, 537 476
Amplification ratio, 621, 6 13 gap, 576. 596. 606 Bohr hypothesis, 148, 152, 268
Amplitude, 270. 368, 102. 862, 805 head. 532, 537 Bohr model. 149, 163, 175. 201,
Analyzer, 403 Barn. 141, 713 344, 348. 360, 447
Angulai differential operator, 306, Barrier: Bohr orbit, 153, 162. 177, 201, 381,
314 rectangular, 273, 298, 756 447
Angular momentum, 135, 151, 300. single-step, 273, 298 Bohr radius, 153, 201. 348, 363
301. 307, 309, 320, 325. 375. Barrier penetration, 271, 750, 799 Boltzmann's constant, 85, 94
381, 393. 405. 181. 731 Baryon, 846, 858, 870, 887 Bond:
Angular momentum operator, 301). Baryon asymmetry, 907 chemical, 501
320, 330 Baryon number, 846. 861. 871, 879. covalent, 501, 508, 513, 576
Angular solution, 314 904 directed, 522
Anbarmonic oscillator, 901 BCS theory, 635, 050, 000 hydrogen, 583
Antiferromagnetism, 627, 630 Beam: ionic, 501, 511, 512
Antineutrino, 761 a particles, 132, 667. 669 Born-Oppenheimer approximation,
Antiparticle, 65, 111, 121. 821, 852 atoms, 390. 397, 400 502, 536, 622
Antiproton, 65. 820. 885 electrons, 678 Bose-Einstein condensation, 553,
Antiquark, 870 lieu \ ions, 077 643. 648, 653, 665, 666
Antisymmetric wave function, 238, kaons, 848, 855 Bose-Einstein statistics, 538, 540,
286, 465, 540 magnets, 390 552, 504, 573, 644, 047,666
Anlisymmetrization, 466, 471, 489 molecules, 400 Bose gas. 643, 666
1-3
1 1 1
Subject Index
Boson. 465, 540, 557, 559, 564, 573, Collision, 50, 53 678,686,690,696, 715,729,
643, 665, 678, 830, 846 Collision time, 590 752, 783, 800, 805, 821, 892,
symmetry, 465, 167. 869 Color, 880, 808, 00 1 897
Boundan condition, 82, 236. 253, Color confinement, 880, 898 Coupling constant, 824, 844
274, 280, 310, 315, 597, 622, Color singlet. 887 CP invariance, 852
707 Combined reaction, 805 CPT invariance, 852
Bounded motion, '25 Complementarity, 71, 188 < quark, 878, 889
Bound si.ttc. 152, 155, 254 Complex exponential function, 208, Creeping film, 637, 650
b quark, 880, 889 223, 227, 272, 309. 314 Critical field, 638, 654
Bracket! sei ies, 180 ( Complex numbei . 207 Critic al mass, 801
Bragg angle, 105, 186 ( :oni|)onent, 484, 192 Critical temperature, 639, 666
Bragg refle< tion, 105, 186 Compound nucleus, 098, 790, 796 Critical velocity, 637, 641, 648
Breedei reactor, 802 Compton effect, 107, 107, 550, 669, Cross section, 134, 140, 180,771
Bn ii \\ ignei formula, 701 , 813 736 absorption, 458, 712, 771
Bremsstrahlung, 106, 823 Compton wavelength, 110, 115, 154, differential, 141, 679, 789, 828
Brillouin /one. 602, 625, 632 185. 827 elastic, 790
Brownian motion, 123, 78 1 ( Condui Hon band, 008 electron-positron, 868, 880, 888
Bubble chamber, 817, 850 Conduction electron, 570, 571. (.08, |>ion iiik leon, 864
632 reaction, 789, 805, 865
Cabibbo angle, 87 ( Condui livity: Rutherford. 141,678, 789
Cabibbo current, 877 electrical, 570, 588 scattering, 140, 789, 828
Cabibbo rotation, 877, 884, 898 thermal, 576, 588, 626, 632, 635 Crystal, 105, 575
Carbon atom, 493 Configuration, 449, 483, 488 covalent, 622
Carboi -.814 ( onjugate variables, 176, 195 ionic , 631
Carbon dating, 744, 812 Conservation: Crystal plane, 105, 186
Cathode ray, 103, 126, 379, 741 angular momentum, 151, 300, Curie, 748
Central-field model, 440, 449, 463, 331, 371,412,418,483,493, Curie's law, 629
477, 481, 695, 711, 714 703, 755, 760, 776, 784, 833, Current-current coupling, 843
Central force, 135, 151, 300, 301, 837, 869 Current density, 611, 632, 655, 660
317, 323, 345, 351, 357,443, baryon number, 810, 90 1 Curvature of an eigenfunction, 253,
701, 705, 714, 731, 838, 859 charge, 677, 893, 906 255, 333, 720
Centrifugal potential energy, 302, .ha. in, 879 Curvaturc-to-valuc ratio, 253
329, 769 energy, 44, 50, 53, 685, 754, 760, Cyclic variable, 310
r flavor, 878 779, 784, 827 Cyclotron, 817
Chain reaction, 796 isospin, 728, 735, 784, 862, 865,
Chandrasekhar limit, 57 1 893 Davidson-Germer experiment, 185
Characterise wavelength, 25, 145 lepton number, 767, 830, 8:',',, 904 de Broglie hypothesis, 182
Charai teristit x ray, 100, 101 , 117, momentum, 44, 50, 53, 754, 779, de Broglie wavelength, 183, 186,
457 784 188, 192. 193, 199, 539,554,
Charge conjugation, 852 nil. leon numbei, 078, 7 12. 78 1 573, 648, 672
(harged urrent, 883 ( parity, 371, 755, 770, 770, 784, Debye temperature, 566, 570, 574,
Charge independence, 728, 781, 833, 837, 800 593, 639, 052
826, 850 probability, 220, 342 Decay, 712. 715, 817, 821, 864
( Charge numbei 008, 673
. sli.mgrness, 818, 853, 80:',, 870, i hannel, 791
Charge symmetry, 720, 863 870 constant, 740, 81
Charm, 878 Constrained motion, 220, 309 ( (live, 746
Charmonium, 880, 880, 892 ( Constraint, 80 law, 745
Chatl o| the \udiclcy 07:'., 743 ( Containment, 800 probability, 7 10
(.11, molecule, 521 Continuous symmeti v. 001 I, lie. 710
Classical energy relation, 223, 263, (Continuum state, 255, 27 sencs 7 13, 718, 812
28:',, 280, 302, 309 Coopet |).m , 650, 659 666 u ansition, 713
(lassie ,,1 limit, 150, 210, 395 Coordinates: Deexcitation, 209. 157, 476, 740. 797
< .lassie ,il radiation, 200. :'>77 lefthanded, 326 Degenerate Fermi system, 567, 587,
Classification scheme, 810, 846 i ighthanded, 326 632, 696
Closed shell, 117. 151, 15:'.. 170. 510, space nine. 1 I. 26, 33, 38, 15 Degenerate stales, 292. 321, 331.
71 1, 718, 722 spherical, 303. 320. 333 340, 357, 371, 385, 395, 408,
Closed subshell, 110, 177, 180, 188 three-dimensional Cartesian, 288, 118, 110, 100, 175, 18.3. 715.
Cloud chamber, 817, 822 304 731, 8.5.3, 859
CM frame, 57, 780, 700. 820, 8 0., three-dimensional polar, 300, 301, Delayed neutron, 801
805 327, 333 Density ol siaies. 557, 565, 568, 574,
Cockcroft-Walton generator, 78 1, Correspondence principle, 159, 240, 50 1, 1 1
Diatomic molecule, 242, 502, 508, 306, 310, 323, 331, 351, 115. stimulated, 170
524, 574 449, 714 \-ia\ . 157
Differential operator. 223. 263, 283, isospm, 732, 859 Emissive powei . 75
289, 305, 306, 320, 329, 34 1 spin, 394 I mitter, 620
443, 827, 896 Eigenvalue equation, 251, 257, 307, Endoergic reaction, 784
Diffraction, 185, 188, 191, 199,230 315,320,330 Endpoint, 764. 836
Diode, 615 Eightfold Way, 870 Energy band, 590
Dipole transition amplitude, 270, Einstein coefficients, 171 Energy cell, 86
368, 374, 768 Einstein solid, 539. 560, 574. 622 Energy gap. 596. 633, 638, 640, 65 I,
Dirac equation, 420, 821. 827, 898 Electrical discharge, 126 654
I>ii,h sea, 822 Electric dipole: Energy level, 148, 151, 105, 170,
Dirac theory, 420, 670, 821, 873 moment, 269, 368, 374, 511, 513 233, 237, 246. 268, 310. 324,
Direct energy, 506 radiation, 368. 775 357
Direct reaction, 792 selection rule, 270, 366, 371, 386, nuclear, 729, 742, 787, 794
Discrete angular momentum, 153, 399, 418, 441, 462, 475, 481, shell-model, 715
178, 183,310. 320 493 single-particle. 149, 10 1, 695, 715
Discrete energy, 94, 148, 237, 254, transition, 270, 368, 386, 475, x-ray, 462
310, 330 774 Energy level diagram, 148, 156, 238,
Discreteness, 73, 111 Electricquadrupole momem, 700. 246, 359, 409. 419, 450, 607,
Discrete orbit, 148, 153 702. 737 730, 742, 822
Discrete symmetry, 902 Electromagnetic current, 824, 843, Energy operator, 263
Disintegration chain, 674, 743, 748 875, 883 Energy release, 742, 754, 763, 765.
Disintegration energy, 754, 763 Electromagnetic energy density, 79 797, 804
Dispersion, 211, 562, 625 Electromagnetic field, 268, 368, 823, Ensemble, 538, 543, 572
Dissociation energy, 516, 527 826, 869, 894 Equal-spacing rule, 246
Distance of closest approach, 134, Electromagnetic interaction. 100, Equilibrium position, 212. 525
341 368, 732, 767, 773, 816,821, Equipartition of energy. 85, 93. 1 19
Distinguishable particles, 541, 572 823, 832, 839, 842, 849, 860, Equivalent electrons. 488
Distribution, 86 869,875, 881,894,898,902 Error integral, 249
angular, 133, 139, 324, 360,679 Electromagnetic modes, 79, 81, 95 Even function, 237, 247, 326
blackbody, 75, 93 average ener,^ , 79. 85, 94 Event, 11,26,38,42,60
Fermi, 551,680, 696 number, 79, Exact symmetry, 903
frequency, 75, 1 19 Electromagnetic spectrum, 104, 114 Exchange, 465, 54
gaussian, 215 Electromagnetic theory, 2. 3, 7, 11. antisymmetry, 465, 471, 488, 734,
227
intensity, 191, 882, 894 887
Maxwell-Boltzmann, 86. 91,1 19, Electromagnetic transition, 842, 875 symmetry, 465, 471, 488, 734, 869.
171. 549, 814 Electromagnetic waves, 2, 3, 7 888
Planck, 93. 172 standing, 79. 80. 95 Exchange force, 475
probability, 191. 193,209,227. Electron, 73, 126, 835 Exchange integral, 506
324, 335. 360, 449 charge, 128, 130 Excitation, 165, 269. 448, 457, 476
velo( ity, 814 mass, 128, 130 collision.il, 167, 457
wavelength, 77, 1 19 Electron affinity. 512, 535 radiative, 167. 269
Distribution function, 86, 91, 543 Electron capture, 764 Excitation energy, 167, 672, 729. 792
Donor, 612. 633 Electron diffraction, 185, 679 Excitation spectrum. 636, 642, 005
Doping. 608, 633 Electron-electron repulsion, 142, Excited slate, 155, 773
Doppier effect, 22, 67. 781 475, 481 Exclusion principle, 446, 449. 456,
Double slit, 189, 191. 227 residual, 482 464, 467, 074. 690, 095, 711.
Double-well oscillator, 533 Electronic energy, 530 714, 822, 887
Effective potential energy. 302. 329, Electron spin, 375. 393, 394, 405, energy, 263
714 410. 420. 425. 438, 446 function of r, 363
Eigenfunction, 233, 251, 320, 445 Electron spin resonance, 399 momentum, 261. 263
angular momentum, 320, 331 Electroweak interaction, 881. 898, position, 261 , 363
energy. 233. 265, 290. 307. 331 902 potential energy, 261. 365
isospin, 734 Electroweak quantum. 8X2, 885. 898
momentum, 272 Electroweak theory, 882, 899, 902 Faraday's law, 662, 894
simultaneous, 310, 320. 331 elm experiment, 126, 179, 379, 682 Fermi, 672
spin, 471, 734 electron, 743, 760 energy, 567. 608, 616, 632, 652,
908 842,84 -
37! 881.
Fluctuation, 124 ty, 571
- ence, 107. 741, 770 Grotrian diagram, 4 42.170. 178, elec iron— phonon, 051
Flux quantization, ;
479 electroweale 38 398 902
Forbidden region, .' 12 i Ground slate. 155. 237, 215. 249, gluon-gluon -
Fractional baryon number, 871 Half-life, 674 7 4',. 71", 7 meson-meson, 867
Fractional charge, 871, 874, 905 Harmonii oscillator. 176, 202. 240. noncentral. 4 13. 481
ni 796 270. 524, 901 nudeon—nucleon, 670. 699. 709,
Franck-Hertz experiment, 108. |x| Heat capai il
Gamow—Teller selection rule. 770 H; molecule. 501, 502. 534, 590 Interference. 189, 221, 227
Gamow-Teller transition, 770 Hole. 457. 462, 191,61 Internal conversion. 165. 777
Gas discharge tutx-. 126, 1 15 Hole current. 615 Internal quantum number, 7 .
Gauge symmetry, -
Hydrogen atom. 49. 34 1 4 space-time, 39. 61
904 Hydrogen burning, 807 ume. 9. 12. 20. 30
Gauge theory, 894, 898, 902 Hyperchargi Intrinsic pantv. 832. 834. 839. 873
Gauge transformation, 895. 896, 912 Hvperfine structure. 431. 700 Invariant ma-
Gaussian wave packet, 215. 218 IGperon. 846 lm ersion frequency. 283
Gell-Mann— Nishijima formula, 861, Hvperon decay, 8 19. 876 Inversion svmmetrv. 837
Ionization. 155. 167
Generation: Identical particles - 19, 569 energv. 155. 453, 633. 712
lepton. 835, 881, 904 Identical-particle svmmem. 285. level. 458
quark. 881. 904 440. 4 Hi, 464 potential, 512, 535
(k-neration current. 017 Impact parameter. 135 Isobar. 674. 691. 729, 743, 759. 859
Subject Index 1-7
Isobaric analogue states. 72 Lepton number. 767. 835. 90 1 Mass level diagrai
Isomer. 776 Leptoquark transition. 904 Mass matrix, 857
Isospin. 728, 858, £ Lifetime. 17. 746. 779. 791. 832. Mass numbei
Isospin multiplet: 849 Mass spei trometi
doublet. 7 12 LiF molecule, 512, 520. 535 Matrix mechanics, 195. 219
singlet. 73:5. 859 Light-emitting diode (i.EDi. 619 Mattel wave, 182, 185, 188, I'' I,
Isotope. 154. 673. 682 Liquid He. 539. 574. 634. 664 Maxwell-Boltzmann statistic
Isotope effect. 154. 639 Liquid helium, 517. 553. 634. 635 542. 546. 554. 500. Y,
Isotopic abundance. 676, 683 Localization, 111. 205. 228. 260, 204. Maxwell's equations. 2, i
J/<!>
meson, 880 London force. 517 Measurement, 194. 195
Junction: Longitudinal wave. 624 Meissner effect. 638, 655, 662, 666
Josephson. 658 L orbit. 162 Meson. 540. 326.
p-n. 615 Lorentz contraction. 6, 15 388
p-n-p, 619 Lorentz force. 51. 378. 629 Meson exchange, 826
Lorentz frame, 30, 33. 38, 55. 60. Meson field. 826
Kaon, 839. 848 113 Meson theois. 82l
long-lived, 854. 856 Lorentz invariance. 39, 61, 851 867 Metal, 575. 587. 0ob
neutral. 852. 911 869 Metastable state. 172. 497. 776
short-li\ed. 854. 856 Lorentz transformation. 26. 29, 33, Michelson interferometer, 4, 66
Kaon decay, 849, 852 38, 45, 51, 56, 61. 113 Michelson-Morlev experiment. 4.
Lepton, 767. 830, 834, 843, 846, Massless particle, 48, 111, 841, 882. absorption, 712, 797
Neutron excess, 798 parent, 743. 752. 759 Periodicity, 175, 319, 315
Neutron numbei ' .
Nuclide, 973 Periodic table, 123. 163, 376, 440,
Neutron-proton mass difference, 446, 449, 673
670, 706, 729 Observable, 299, 333 Permittivity, 633
Neutron separation enei Observer, 8, 11, 29, 38 Persistent current, 637, 638, 640
711, 737 Occupation numbei, 87, 449, 5 1 1. Perturbing interaction, 269-
Neutron yield, 799 919. 718 Pfund series, 180
Newtonian mechanics, I. 7, 27, 11. Octet, 891, 91 I Phase, 205,207, 227, 659, 896
baryon, 891. 879 transition. 575, 646, 664
Noble gas, 1 17, 155, 511, 517 meson, 891, 879 Phase space, 175
Node nt an eigenfunction, 237, 255, Odd lime lion, 237, 247, 326 Phase velocity, 206
152 Oil-drop experiment, 128, 179.871 Phonon, 539, 540, 560, 576, 593,
Noncentral force, 4 13, 1*1 Old quantum theory, 74, 178, 447. 621, 640, 648
Noncommuting transformations, 488 exchange, 651
One-bod) problem, 159.391 l"> '.
Photoelectric effect, 99, 116, 155,
Nonconservation: One-dimensional box, 231. 259, 287 167. 458, 539, 559
baryon number, 904 One-electron atom, 149. ill, ',18, Photon, 74, 100, 107, 111, 170, 190,
lepton numbei 90 . 1 357, 379, 395, 495 539, 559, 823, 826, 882
parit) Optically active electron, 147, 179, absorption, 101, 148, 167, 170,
Noi iii.il density, 648 481, 487 824
Normal fluid, 635, 648 Orbital: emission, 148. 167, 170, 824,875
Normalization, 228, 230, 239, '2 19. antibonding, 597 exchange, 824, 875
110,316.334,34! atomic , 359 scattering, 109, 116, 121, 194
168 bonding, 507 Photovoltage, 618
Normal modes, 529 hybrid, 522 Photovoltaic 618
cell,
Nuclear binding energy, 685, 689, molec ulai, 598, 535 Physical state, 226,260
711 Orbital angulai momentum, 394, Pickup reaction, 794
Nuclear decay, 674, 7 in. 7 12. Mr, Pion, 817, 830
745 Orbital magnetii moment, 389, 390 parity, 832, 833
Nik leai densit) . 682 Orbital parity, 834, 838, 854 spin, 832, 833
Nuclear disintegration, 668, 674, Orbital state, 359, 406 Pion decay, 831,834
740. 742 Orbital symmetry, 598 Pion production, 832, 846
Nik leai iv itation, 072, 773, 779, Orbit equation, 137, 179 Planck mass, 908
783 Ordei parametei .
',",9 Planck quantum hypothesis, 74, 94,
Nik leai lone 668, 670, 1.71. 699 Organization ol panicles, 819, 849. 99. 193, 2 19, 271
711. 728, 74 1. 755, 799, 826, 858, 879 Planck's constant, 94, 192. 197. 152.
•'12 Orthogonality, 238, 256, 295. 337. 179. 196
i, mm-. 1)71. 989. 827 343, 354, 498 Plane k's formula, 94, 1 77
Nik leai isomei ism, 771) Oilbobi liuni 9, 'I Plasma, 806
Nuclear magnetii moment, 392, 132, Overlap integral, 599, 519. 599 Plutonium. 802
1,79. 701, 722 ( )vei lapping bands, 997 Polarizability, 518. 535
Nik leai mass. 682 Polarization, 377, 388, 562, 939
Nik leai model, 668, 672, 979, 689, Paii annihilation, 1 15, 898 Polarizer, 103
991,. 705, 714, 721), 799, 792, Pairing, 664, 977, 690, 722 Polai vector, 838, 843
791, Paii ing energy , 690 Polyatomic molecule, 521. 526
Nucleai radiation, 719, 711, 77:5 Paiiproduc Hon, I II, 822 Population inversion, 172. 497
Nik leai radius, 132, 142. 971. 678, Parahelium, 9,9 Positron, 114. 422, 822
! 799, 713, 777, 800 Paramagnetism, ',27. 9:',:', Pi isitronium, 1 15. 154
Nik leai reaction, 977, 735, 740, 78.3. Parity, 238. 2 17. 273. 325. 330, 371. Poynting vector, 189
789, 894 I'll, Mil, 722, 755, 779, 775, Precession, 383, 398, 413, 428
Nudeai lead,, i, 781, 799. 81)1 784. 837. 852 ['recessional state, 386, 398
Nmleat shell theory, 977. 991. 711, Partic le in a box, 234, 26 Pressure, 555. 599
714, 729, 791 Particle wave- duality, 182. 188 Probabilistic theory, 192. 219. 538
Nik leai spin, 432, 979, 799, 721. Partition luni tion, 91, 549. 572. 609, Probability, 220, 228, 3,34, 360, 402,
755, 799, 779 1,28 494, 538, 572. 749. 772
Nik leai structure, 1)97, 978, 789 Paschen-Bai k effec i, 126 Probability current density, 229, 277,
Nucle.u transformation, 98",, 712, I'asc ben sei les. 147. 1 57 294. 342. 659
749. 713. 749, 797, 78:5 Pauli paramagnetism, 629 Probability density, 228, 233. 235.
Nucleon, 979. 686 689 695, 732, Pauli pressure , 569 260, 283, 311, 333, 558, 573
797, 826, 846, 899, 864 Pauli principle. 166, 9,9. 17 1 Probability interpretation, 192. 193,
Nucleon— nucleon interaction, 972. 599. 5 12. 5 1',. ",-,7, 51,8. 576, 205,219,226,260,283,289,
989, 999, 705, 728, 829, 892 ,
,2 69 i
703 73 324, 333, 360, 464
Nucleon number, 673, 767, 846 833, 887 Propagation of waves, 205
Nucleon spin, 979 l> branch, 529 Proper length, 15, 31. 37
.I,, leosynthesis, 897 Penetration depth, 296, 657, 999 Proper lime, 13, 31, 38, 42
Nucleus, 132, 539, ",79. 997, 7 19 Periodic boundary condition. 597. Proton, 95, 128. 540, 570, 670. 732.
daughter, 743, 752, 759 622 875
Subject Index 1-9
Quantum field theory, 822 821 Random event, 1"! 193 22 Root-mean-square (rms) deviation,
844 85 1, 882 889 894 746. 789 124. 262
Quantum interference, 227. 5 Randomness. Rotational energy, 324, 536
Quantum number. 153, 176 236 Range. 752 Rotational motion, 308, 501, 522.
245. 255. 287. 320, 376. 394. Range—depth relation, 708
4 15. 832, 8 6, 879 -energy cm ve, 752 Rotational symmetry, 323. \.~
angulai momentum, 310, 315, Range-mass relation. 827, 844 882 357. 386, 731 - 7 :
'
Rate equations. I 893
4H6. 721 "" 333, 841 Rayleigh scattering, 165 Roton. 642, "is 666
azimuthal, 310, 315, 35* RayleigrTs criterion, 197 Russell-Saunders coupling. 482
charm. 879 R branch. 529 Rutherford atom. 122. 132. 145
Reaction amplitud Rutherford scattering. 132. 134. 152.
( /'. 854 Reaction cross section. 789. 792. 805. 17'
433 Reaction kinematics. 53, 784, 820. Rydberg energy, 153. 347. 34H
h\ percharge. 861
internal, 7 - Reaction probability 789 Saturation. 101. 686
isospin, 732 Real photon Saturation current. 61s
magiK Reciprocal lattice. 586. 602, 632 Scalar potential. 895
molecular. 508 Recoil. 158, 779 Scale - '
principal. 351, 357. 449. 715 Recursion relation. 248, 271. 296. Bragg. 105. [86, 584, 602
radial node. 331. 352. 449. 714 355. 373 charge-exchange, 828
single-electron, 357. 395, 408, 445, Reduced mass. 151. 301. 345. 381. Compton, 107. 116. 120
Sc attering (Continued) Solar spectral irradiance, 78 triplet, 472, 487, 489. 510. 734.
inelastic, 875
167, 783. Solid, 575. 576 769
neutron— proton, 828 molecular, 583 Spin-statistics theorem, 467
nucleon— nucleon, 703. 729, 828 two-dimensional, 579 Splitting, 375
photon, 109, 194 Solid angle, 139, 333, 789 beam, 391, 399, 401
pion-nucleon, 832, 862, 864 Solid helium, 560, 635 energy level. 385. 398, 415. 430.
Raman, 107 Sound velocity, 563, 566, 574, 627 434. 718
Rayleigh, 105 Space inversion, 326. 837, 852 hyperfine. 434
Rutherford, 132, 134, 152, 179. Space quantization, 391 inverted. 718
Space time, I, 26, 33, 38, 60 I line. 375. 377. 418. 430. 685
Thomson. 108, 111, 1 It) Space-time diagram, 40, 768, 823 spin-orbit, 415, 416. 461, 479, 718
x-ray, 108 Spec id, heat, 539, 500, 573, 588. Zeeman, 385, 425, 446
Si altering angle, 108, 1 33, 135 010 Spontaneous fission. 54. 799
Si hmidl hue. 72") Spec Hal absorptance, 75, 1 18 Spontaneous symmetry breaking,
Schrodingei equation, 219, 220, 231, Spectral band, 501, 523. 527 883. 899. 901. 904
283, 289. 303, 309. 312, 857, Spectral emissivity, 75, 118 Square well. 257, 296. 706. 715. 738
896 Spectral energy density, 79, 85. 93. i quark, 871. 889
time-independent, 233. 251, 284 171 SQl ID, 663
Schrodingei theory, 226, 231, 268, Spectra] line, 145, 270, 344, 307 Stability, 074. 092
357, 37"). 393, 120, 142, 117. Spectral radiant emittance, 75, I 18 island ol. 677
821. 891, 896 Spectral reflectance, 75 valley of, 075, 693, 759
Sc reening, 163, 1 1 1. 150 Spectroscopy notation, 359, 108, Standard Model. 902
Set oihI sound, 636, 650 111, 401, 474, 480, 484 Standing wave, 79, 80, 589. 601
Selection rule, 270 6,374, Spectrum, 75 Stationary state, 148. 149.220,231,
386, 399. 118, 111, 162, 17"). absoi ption, 145 251. 264. 268. 275, 290. 307.
181, Vi',, 71.9, 774 a-pai Ik le, 758 309, 323, 327, 330,334,345,
Self-consistent theory, Hi), 150 167 atomii , 115 357, 360, 39",
Semiclassical theory, 269, 368 I) 1,501,523 >' Si.hisik al met h.uiK s, 86, i
-
Semii ondui toi , 575, 606, 608 (3-ray, 700, 830 Stefan Boltzmann constant, 76. 98
doped, 998, 912 OUkhodv. 7o 'i I Stefan-Boltzmann law, 76. 98. 119
extrinsic, 008. 012 continuous, 75, 106, 145, 101. 760 Stellai abet ration, 35
intrinsic .
008 discrete, 106, 1 15. 101. 758 Stellai energy, 806
n type, 01 1 (le< tromagnetic , 101, I 14 Steradian, 139
// type, 01 1 ele< tronii .
">27 Stern—Gerlach experiment, 390, 394,
Semiconducioi devi, 1 615 emission, 145, 366 855
Semiempirical m.iss formula, 089, energy , 638, 12 Stirling's formula, 89, 91, 548, 551
712, 759, 800 frequent y, 75 Stokes' law, 124, 128
Separation constant, 232, 313 7-iay. 773 Stopping potential, 101
Separation ol variables, 231, 313, mass, 683 Strangeness, 846, 853. 861, 870
143, 11'. molecular, 167. 501. 527 Strangeness-i hanging decay 849,
Scncs limit, 147 optical, 447, 453, 171) 803, 876
1 flavor, 871 proton, 787, 813 Strange partii le, 8 18
Shear, 570 rotational, 527, 537 decay, 849, 876
Shell, 359, 367, 147, 1 19 rotational vibrational, 527 Sn ipping reaction. 793
inner, 102. 118, 457 solai, 78, 145 Strong interaction, 816, 821, 826,
outer, 102. 4 18. 476 wavelength, 77, 97 832. 839. 846, 849. 858, 864
Shell closure, 447, 453, 077, 711, \ ray, 106, 161, 117, 157 870. S80, 898, 902
718, 722 Speed ol light, 2, 8, 26, 34 Sturm— Liouville theory, 256
Shift: Spherical harmonic, 315, 320, 326, Subshell, 449, 721
blue, 24, 781 345. 115 ordering, 45
Doppler, 22. 781 Spin, 375, 393. 394, 405. 146, 164, Substructure, 8 It), 870
energy level, 385, 389, 4 14. 429 732, 700 Supen ondui toi , 634, 638, 65 I
Sulfate tension, 802 momentum-energy, 15,63, 113 I i). ertaint) prim iple, 182, 195
Susceptibility, 628, 63:5 phase. 896 285. 311, 321. 321. 465, 5
Symmetric wave function, 238, 286, velocity, 31 ".lo. 669, 779. 827
465, 540 space-time, 29, 38, 45, 61 I Ingerade wave fun< lion, 50 1
Symmetrization, 466, 471, 489 rransistoi , 575, 615, 633 Unification, 816, 821, 882, 902
Symmetry, 238. 285, 292, 323, 325, Transition, 148. 166, 170. 268 Unification mass. 904
386, 728, 816, 837. 846. 852, allowed, 769 Unification scale. 903
858. 87(1, 883. 893, 902 charge-changing, 844, 875, 883 Unified theory, 882, 902
Symmetry breaking, 386, 732. 860 charge-conserving, 878, 885 i unci sal In mi mtci.u lion * 1 i
Symmetry energy, 690, 698 color-changing, 890, 898, 912 u quark, 871. 889
Synchrocyclotron, 819 diquark, 904 Uranium, 797
Synchronized docks. 11, 13,26 electric dipole, 270. 368, 386, 170,
Synchrotron, 8 19 774
Vacancy, 157. 191, 593
electromagnetic, 842, 875
\'.i< iiuiii. 822
Tachyon, 70 Fermi, 769
Valence, 123
Tauon, 835 flavor, 876, 881, 898
hand, 1)08
Tanon decay, 836 forbidden, 769
electron, 155, 177. 510, 608
Tauon-neutrino, 836 Gamow—Teller, 770
van del VVaals c ixsl.il, 622
Tau-theta puzzle, 839 hole, 462. 499
van dei VVaals interaction, 501, 5 12,
Temperature, 538 isomeric, 776
517, 893
Tensor force, 703 laser, 172
Vectoi meson, 869
Term, 484, 487, 488, 192 leptoquark, 904
Vectoi potential, 823, 895
Tetrahedral structure, 579 nuclear, 742. 746. 752. 759. 768,
Velocity addition, 3 1, 35, 56, 58,
Texture, 664 773
786
tflavor, 880 quai k-to-antiquark, 904
Velocity spat e, 92
Thermal neutron. 587. 796, 814 quark-to-lepton, 904
Vertex. 82 1
Lorentz, 25. 29. 33, 38. 45, 51,61, position, 195, 262, 206 absorption, 458
113 spin, 394 diffraction, 101, 186
1-12 Subject Index
o
en CT eo i
^: s
<*>
z i
r; 8 X
00 i en 3 3
VJ en Vl >o •— eo •—
Eff
u
OD
«
CO s O s
2: (H
S * 00 a
0J
m C s 0J
to <* OQ _ 5 ft)
00 en 00 o N) &•
V
*> en en eo
1 '
-t. R. a.
> s r
- 2 r- S
b
a o m & O 5 ^i 1 eo 1 °
* -J eo ro
>£> 10
JO
5 H
=r
«
>-»
O 5 i 2 rsi
s U)
o
tT>
en
oo
* 5 o ro
ro
M
w
§ £ a-
8 oj s
i
j
h
0)
8
X <
lO en » o 5
9 c Q.
X 3 to en " eu
ro
eo
W
C £ Z S
"
* 5 S
— 8 * 5 *
3 -n
ro o ro
o
3 3
§ ^ *
"D • i jo . -1
" o
a
—
Hxj
M >x>
eo
do, * 5 en eo
I 3 ro
o
a H
sr
« »
5'
§
3. g.
8
t
"0
c Ifi
li
8 co
^ l\>
8 O
vi
o>
2 JO
£
•••
a>
ro
CT)
>
CO
3 O
« > 5 m » r
o
3
c s
4
J ^o S
eo
* o vi
1 =o
" g ° - w
8 I en en
O
N K
" 9 o«
o 5 * 7° «
S =:
2
3 .
I
* o ip
<Tl
8 °-o.
*»
I
00
ro
00 H
.
• nc
1.1 8
"
OD
3T s
H x >
C
s > C W
CT on I V, 3S
ro
oj v 1 5
-T- ft)
vi en lO
w
tl
D r
o 12. M O
< <*
*L
n
3
q
f
s -»
—
i£>
oo
*>
8
><
en
CT)
S TO oo
o oo
!"
eo w
_
en
c
sr
UU
-1
U
» rr
•»(/>
£
I
O CT. <n £» (o ^>
w
tf>
5 o>
sr
B ^D vi tl 13 o a o
-^ eg
<
- -n
« 3
• m In
S
* 5"
*
0J
X > CD
C73
r» a oo ^. eo
ET 8 oo •— ID »— eo en
3
ft
8
*
21 o * 3c
s 3 -0 CO
s
O s
8 ^f o O
. CT oo s en eo
<<*
c a — lO ro O ro ^ fT>
3 n
S
2£ o 8 u< V,
Bi
CO
3 cr 'n
s
>
eo
eo
TJ
i
Z
INJ o 83 *— eo en VI
5
— • CO CO o
8 © m
00 8
ft
<n
ro
eo 8
*>. ro •t» CT> 00
S > 2 CD o 1 T|
00 8 m ! lO s ^
en eo en VI UJ
s? m H u>
X X
^
%
> g z 8 X
<D
(n s eo t — _^ s ft)
.- 2 »
0^ A <n 00 i_- ro
INTRODUCTION TO THE ^fMrp"sCIE^
" iiiiitt««| D 6
,
047160531X 01 * '
PHYS340