Group Theory in Quantum Mechanics: An Introduction to Its Present Usage
Ebook, 815 pages, 13 hours


About this ebook

Geared toward research students in physics and chemistry, this text introduces the three main uses of group theory in quantum mechanics: (1) to label energy levels and the corresponding eigenstates; (2) to discuss qualitatively the splitting of energy levels, starting from an approximate Hamiltonian and adding correction terms; and (3) to aid in the evaluation of matrix elements of all kinds.
"The theme," states author Volker Heine, "is to show how all this is achieved by considering the symmetry properties of the Hamiltonian and the way in which these symmetries are reflected in the wave functions." Early chapters cover symmetry transformations, the quantum theory of a free atom, and the representations of finite groups. Subsequent chapters address the structure and vibrations of molecules, solid state physics, nuclear physics, and relativistic quantum mechanics.
A previous course in quantum theory is necessary, but the relevant matrix algebra appears in an appendix. A series of examples of varying levels of difficulty follows each chapter. They include simple drills related to preceding material as well as extensions of theory and further applications. The text is enhanced with 46 illustrations and 12 helpful appendixes.
Language: English
Release date: Feb 21, 2013
ISBN: 9780486174334

    Book preview

    Group Theory in Quantum Mechanics - Volker Heine

    PREFACE

    The object of this book is to introduce the three main uses of group theory in quantum mechanics, which are: firstly, to label energy levels and the corresponding eigenstates; secondly, to discuss qualitatively the splitting of energy levels as one starts from an approximate Hamiltonian and adds correction terms; and thirdly, to aid in the evaluation of matrix elements of all kinds, and in particular to provide general selection rules for the non-zero ones. The theme is to show how all this is achieved by considering the symmetry properties of the Hamiltonian and the way in which these symmetries are reflected in the wave functions. In Chapter I the necessary mathematical concepts are introduced in as elementary and illustrative a manner as possible, with the proofs of some of the fundamental theorems being relegated to an appendix. The three uses of group theory above are illustrated in detail in Chapter II by a fairly quick run through the theory of atomic energy levels and transitions. This topic is particularly suitable for illustrative purposes, because most of the results are familiar from the usual vector model of the atom but are derived here in a rigorous and precise way. Also most of it, e.g. the introduction of spin functions and the exclusion principle, is fundamental to all the later more advanced topics. Chapter III is a repository for the theory of group characters, the crystallographic point-groups and minor points required in some of the later applications. Thus, after selected readings from Chapter III according to his field of interest, the reader is ready to jump immediately to any of the applications of the theory covered in later chapters, namely: further topics in the theory of atomic energy levels (Chapter IV), the electronic structure and vibrations of molecules (Chapter V), solid state physics (Chapter VI), nuclear physics (Chapter VII), and relativistic quantum mechanics (Chapter VIII).

    The level of the text is that of a course for research students in physics and chemistry, such as is now offered in many Universities. A previous course in quantum theory, based on a text such as Schiff Quantum Mechanics, is assumed, but the matrix algebra required is included as an appendix. In selecting the material for the applications in various branches of physics and chemistry in Chapters IV to VIII, I have restricted myself as far as possible to topics satisfying three criteria: (i) the topics should be simple applications that illustrate basic principles, rather than complicated examples designed to overawe the reader with the power of group theory; (ii) the material should be intrinsically interesting and of the sort that is suitable for inclusion in a general course of advanced quantum mechanics; and (iii) topics must not involve too much specialized background knowledge of particular branches of physics. The view adopted throughout is that group theory is not just a specialized tool for solving a few of the more difficult and intricate problems in quantum theory. In advanced quantum mechanics practically all general statements that can be made about a complicated system depend on its symmetry properties, and the use of group representations is just a systematic, unified way of thinking about and exploiting these symmetries. For this reason I have not hesitated to include simple results for which one could easily produce ad hoc proofs from first principles: indeed, it must always remain true that the use of group theory could be circumvented by detailed algebraic considerations on almost all occasions. However, the author is convinced that the essential ideas of group theory are sufficiently simple to make the time spent on acquiring this way of thinking well worth while.

    A series of examples is appended to each section. Some of these are simple drill in the concepts introduced in the section; others, particularly in later chapters, indicate extensions of the theory and further applications. Those marked with an asterisk are more difficult or require additional reading, and are often suitable as topics for review essays (alias term papers).

    With the three criteria for selection mentioned above, it has of course been quite impossible to do real justice to any of the applications to various branches of physics and chemistry that are touched on in Chapters IV to VIII. This appears to me unavoidable because of the amount of background knowledge required for many applications. It merely highlights the fact that in each of these specialized subjects there is a need for a monograph which uses group theory from the beginning as naturally and as freely as the Schrödinger equation itself. In this field the chemists have already led the way,¹ and the author hopes that the present book may hasten the day when the same applies in physics by providing a convenient basic reference text.

    It is a pleasure to acknowledge my indebtedness to Professor B. L. Van der Waerden whose elegant book first inspired my interest in this subject. Also I am very grateful to Dr. S. F. Boys, Dr. G. Chew, Dr. R. Karplus, Dr. M. A. Ruderman, Dr. M. Tinkham and Mr. D. Twose, who have either patiently helped me to understand aspects of their special subject, or have read parts of the manuscript and made helpful comments. I am indebted to Mrs. M. Rogers and Mrs. M. Miller for undertaking the typing of the manuscript, and to Mr. J. G. Collins and Mr. D. A. Goodings who have generously helped with the correction of proofs. Dr. E. R. Cohen has kindly allowed the reproduction of his tables of Wigner coefficients, and D. Van Nostrand Co. similarly a figure.

    Cambridge, England.

    V. HEINE

    NOTATION

    Note: e is taken as the charge on the proton: all angular momentum operators such as L = (Lx, Ly, Lz) [...] (except in § 18), whereas the quantum numbers L, ML, etc., are of course pure numbers.

    Chapter I

    SYMMETRY TRANSFORMATIONS IN QUANTUM MECHANICS

    1. The Uses of Symmetry Properties

    Although this book has been titled Introduction to the Present Use of Group Theory in Quantum Mechanics in accordance with customary usage, a rather more descriptive title would have been The Consequences of Symmetry in Quantum Mechanics. The fact that these symmetry properties form what mathematicians have termed groups is really incidental from a physicist’s point of view, though it is vital to the mathematical form of the theory. It is in fact the symmetries of quantum mechanical systems that we shall be interested in.

    The following three simple examples illustrate in a preliminary way what is meant by symmetry properties and what their main consequences are.

    (i) It can be shown that the wave functions ψ(r1, r2) (without spin) of a helium atom are of two types, symmetric and anti-symmetric, according to whether

    ψ(r1, r2) = ψ(r2, r1) or ψ(r1, r2) = −ψ(r2, r1),

    where r1 and r2 are the position vectors of the two electrons (Schiff 1955, p. 234). The corresponding states of the atom are also referred to as symmetric and anti-symmetric. Thus the eigenfunctions turn out to have well defined symmetry properties which can therefore be used in classifying and distinguishing all the different eigenstates.

    (ii) There are three 2p wave functions for a hydrogen atom,

    $$\psi_1 = x f(r), \qquad \psi_2 = y f(r), \qquad \psi_3 = z f(r), \tag{1.1}$$

    where f (r) is a particular function of r = |r| only (Schiff 1955, p. 85). Now in a free atom there are no special directions and we can choose and label the x-, y- and z-axes as we please, so that the three functions (1.1) must all correspond to the same energy level. If, however, we apply a magnetic field in some particular direction, the argument no longer holds, so that we may expect the energy level to be split into several different levels, up to three in number. In this kind of way the symmetry properties of the eigenfunctions can determine the degeneracy of an energy level, and how such a degenerate level may split as a result of some additional perturbation.

    (iii) The probability that the outer electron of a sodium atom jumps from the state ψ1 to the state ψ2 with the emission of radiation polarized in the x-direction is proportional to the square of

    $$M = \iiint \psi_2^{*}\, x\, \psi_1 \, dx\, dy\, dz \tag{1.2}$$

    (Schiff 1955, p. 253). If the two states are the 4s and 3s ones, ψ1 and ψ2 are functions of r only. To calculate M in this case, we make the change of variable x′ = −x in (1.2) and obtain M = −M, i.e. M(4s, 3s) = 0. This transition probability is therefore determined purely by symmetry. The situation is rather different when the transition probability is not zero. Suppose ψ1 and ψ2 are the 4px and 3s wave functions x ƒ1(r) and ƒ2(r). Then (1.2) becomes

    $$M = \iiint f_2^{*}(r)\, x^{2} f_1(r) \, dx\, dy\, dz. \tag{1.3}$$

    By making the change of variable x′ = y, y′ = x, the x² in (1.3) can be replaced by y² or similarly by z². Thus by addition

    $$M = \tfrac{1}{3} \iiint f_2^{*}(r)\, (x^{2} + y^{2} + z^{2})\, f_1(r) \, dx\, dy\, dz = \tfrac{1}{3} \iiint f_2^{*}(r)\, r^{2} f_1(r) \, dx\, dy\, dz. \tag{1.4}$$

    Similarly the probabilities for all possible transitions from any 4p state to the 3s state or vice versa, with the emission or absorption of radiation, polarized circularly or linearly in any direction, can be reduced to the integral occurring in (1.4), the simple numerical factor in front being determined purely by the particular direction and 4p state chosen. Symmetry properties thus establish the relative magnitudes of several matrix elements of the form (1.2), their absolute values being then determined by the value of one integral. This type of argument explains why the intensities of the various components of a composite spectral line are often observed to bear simple ratios to one another.
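
    The two symmetry arguments of example (iii) can be checked numerically. The following is only a sketch under stated assumptions: the product of radial functions is replaced by a hypothetical Gaussian exp(−r²), which is not the sodium wave function, and the integrals are taken over a large cube by a simple midpoint rule. The function name `symmetry_integrals` is introduced here for illustration. The integral with a factor x should vanish, and the integral with x² should equal one third of the integral with r².

```python
import math

def symmetry_integrals(half_width=6.0, n=40):
    """Midpoint-rule integrals over the cube [-half_width, half_width]^3.

    The radial weight exp(-r^2) is a hypothetical stand-in for f2(r) * f1(r).
    """
    h = 2.0 * half_width / n
    pts = [-half_width + (k + 0.5) * h for k in range(n)]
    sums = {"x": 0.0, "xx": 0.0, "yy": 0.0, "zz": 0.0, "rr": 0.0}
    for x in pts:
        for y in pts:
            for z in pts:
                r2 = x * x + y * y + z * z
                g = math.exp(-r2)          # depends on r only
                sums["x"] += x * g         # odd in x: should cancel
                sums["xx"] += x * x * g
                sums["yy"] += y * y * g
                sums["zz"] += z * z * g
                sums["rr"] += r2 * g
    return {key: value * h ** 3 for key, value in sums.items()}

result = symmetry_integrals()
m_s_to_s = result["x"]                     # analogue of M(4s, 3s)
i_xx, i_yy, i_zz, i_rr = (result[k] for k in ("xx", "yy", "zz", "rr"))

print("integral with x   :", m_s_to_s)
print("integral with x^2 :", i_xx)
print("one third of r^2  :", i_rr / 3.0)
```

    The grid is symmetric about the origin, so the cancellation that gives M = −M in the text appears here as pairs of grid points contributing equal and opposite amounts.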

    These examples serve to illustrate what is generally true, namely that symmetry properties allow us to classify and label the eigenstates of a quantum mechanical system. They enable us to discuss qualitatively what splittings we may expect in a degenerate energy level under some perturbation. They help in calculating transition probabilities and other matrix elements, and, in particular, in setting up selection rules stating when these quantities are zero. In the following sections we shall develop these kinds of symmetry argument in a systematic fashion, and shall see how they can be used for the above three purposes in situations that are less elementary than the examples given above.

    The real importance of symmetry arguments in such situations lies in the fact that for systems of interest the Schrödinger equation is usually too complicated to be solved analytically or even numerically without making gross approximations. For instance, for an atom with n electrons the equation contains 4n variables (including spin) which are not separable. However, the symmetry properties of the equation may be relatively simple, so that symmetry arguments can easily be applied to the problem. Another important point about symmetry arguments is that they are based on the symmetry of the Schrödinger equation itself, so that they do not involve approximations, in particular those used to obtain approximate eigenfunctions of the equation. In fact the beauty of the method lies in the fact that, for instance, an n electron problem can often be treated as simply and as rigorously as a one electron problem. At the present time the most spectacular illustrations of these two aspects of symmetry arguments occur in nuclear and fundamental particle physics. The shell-model theory of the energy levels of nuclei has been developed, with selection rules for various transitions, etc., all without an exact knowledge of the interaction between two nucleons. Similarly it is possible to discuss tentatively the relationships between the various fundamental particles and give selection rules for transitions between them, which are based purely on symmetry ideas, such as spin, charge conjugation, isotopic spin and parity, without the slightest understanding of the field equations describing the interactions of all these particles.

    2. Expressing Symmetry Operations Mathematically

    Many of the symmetry properties that we shall be concerned with involve rotations, so that we shall start by considering how a physical operation such as rotating a system is expressed mathematically.

    Consider a body with a point P on it which has co-ordinates (x, y, z). If we rotate the body clockwise by an angle α (Fig. 1), i.e. we rotate by − α about the z-axis in the conventional sense, the point P moves to the position P′(X, Y, Z), where

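    $$X = x\cos\alpha + y\sin\alpha, \qquad Y = -x\sin\alpha + y\cos\alpha, \qquad Z = z, \tag{2.1}$$

    i.e.

    $$x = X\cos\alpha - Y\sin\alpha, \qquad y = X\sin\alpha + Y\cos\alpha, \qquad z = Z. \tag{2.2}$$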

    FIG. 1. Rotation of a point P to P′.

    Here and elsewhere the x-, y- and z-axes are chosen to form a right-handed set. However, instead of rotating P, we can also consider the body and P as fixed, and refer all co-ordinates to a new pair of axes OX and OY which make an angle + α with Ox and Oy (Fig. 2).

    FIG. 2. Rotation of axes.

    We have analogously to (2.1)

    $$X = x\cos\alpha + y\sin\alpha, \qquad Y = -x\sin\alpha + y\cos\alpha, \qquad Z = z,$$

    so that the co-ordinates (X, Y, Z) of P referred to the new axes are related to (x, y, z) again by (2.2).

    Thus the single transformation (2.2) can represent either the change in the co-ordinates of a point when we rotate a body by an angle −α, or the change in the co-ordinates of a fixed point when we rotate the co-ordinate axes by an angle +α. The close relationship between these two operations is directly evident from the similarity between Figs. 1 and 2. The two different points of view also arise when considering the symmetry properties of a physical system. Consider for instance a perfectly round plate without any markings on it: we say it is symmetrical about a vertical axis through its centre, say the z-axis. We can express this more precisely by saying that if we rotate the plate about its axis, we cannot tell that we have rotated it because it is completely round with no markings on it. On the other hand we could also say that for a fixed position of the plate, the various physical properties such as moments of inertia associated with the x- and y-axes must be the same, no matter in what directions these axes are chosen. In this example the first approach is perhaps more natural, but when discussing the symmetry of the Schrödinger equation for a physical system we shall adopt the second point of view. Anticipating a little, we shall be considering a given equation and the forms it takes when expressed in terms of different variables like x, y, z and X, Y, Z which correspond to using different co-ordinate axes. There are two reasons for this choice. Firstly, the Schrödinger equation is a mathematical relation and not like a plate so that we cannot rotate it in quite the same sense, though we could, of course, write down the equation for the rotated physical system. Expressing an equation in terms of different sets of co-ordinates is a more familiar concept. Secondly, we shall be considering some transformations of co-ordinates that have no simple physical analogue. 
    For instance, we can carry out a rotational transformation of spin co-ordinates without altering the position vectors ri of the electrons in an atom, but what does it mean physically to rotate an atom in spin space while holding it fixed in ordinary space? Nevertheless the transformations of co-ordinates which we shall apply to the Schrödinger equation will usually be suggested by and linked with the physical symmetry of the system in an obvious way.

    When discussing linear transformations of co-ordinates, it is convenient to refer to them by a single symbol such as T. For instance, we shall call the transformation (2.2) the transformation R, or because it corresponds to a rotation, the rotation R. If it is necessary to be specific about the angle of rotation, we shall call (2.2) the rotation R(α, z) of +α about the z-axis because this sign corresponds to the change-of-axes point of view which we are adopting. We have already discussed in connection with Fig. 2 the effect of applying a transformation such as R on the co-ordinates of a point, and we shall now make the following preliminary definition of what it means to apply R(α, z) to a function of x, y, z. In § 5, reasons will appear for replacing this definition by a slightly enlarged concept. Applying the transformation R(α, z) (2.2) to a function ƒ(x, y, z) means to substitute the expressions (2.2) for x, y, z in the function and thus express ƒ in terms of X, Y, Z. This results in a function of X, Y, Z which in general displays a different functional form from ƒ(x, y, z). For instance applying R(α, z) to the function (x y, we obtain

    (2.3)

    which is a different function of X, Y, Z. Similarly we can apply a transformation to each side of an equation. For instance the equation

    (2.4)

    becomes²

    (2.5)

    which is still a correct equation as can easily be verified.
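
    The preliminary definition above, substituting (2.2) for x, y, z, can be written out as a short program. This is a minimal sketch, assuming that (2.2) reads x = X cos α − Y sin α, y = X sin α + Y cos α, z = Z, which agrees with the expression (X cos α − Y sin α) ƒ(R) quoted in § 3. The names `old_from_new`, `new_from_old` and `apply_transformation` are introduced here for illustration only.

```python
import math

def old_from_new(alpha, X, Y, Z):
    """Assumed form of (2.2): old co-ordinates (x, y, z) in terms of (X, Y, Z)."""
    c, s = math.cos(alpha), math.sin(alpha)
    return (X * c - Y * s, X * s + Y * c, Z)

def new_from_old(alpha, x, y, z):
    """Assumed form of (2.1): (X, Y, Z) in terms of (x, y, z)."""
    c, s = math.cos(alpha), math.sin(alpha)
    return (x * c + y * s, -x * s + y * c, z)

def apply_transformation(alpha, f):
    """Substitute (2.2) into f(x, y, z), giving a function of X, Y, Z."""
    def transformed(X, Y, Z):
        return f(*old_from_new(alpha, X, Y, Z))
    return transformed

alpha = math.radians(30.0)
X, Y, Z = 0.7, -1.2, 0.4

# f = x changes its functional form: it becomes X cos(alpha) - Y sin(alpha).
F_x = apply_transformation(alpha, lambda x, y, z: x)
# f = x^2 + y^2 + z^2 keeps its form: it becomes X^2 + Y^2 + Z^2.
F_r2 = apply_transformation(alpha, lambda x, y, z: x * x + y * y + z * z)

# (2.1) and (2.2) are inverse to one another.
round_trip = new_from_old(alpha, *old_from_new(alpha, X, Y, Z))

print(F_x(X, Y, Z), X * math.cos(alpha) - Y * math.sin(alpha))
print(F_r2(X, Y, Z), X * X + Y * Y + Z * Z)
print(round_trip)
```

    Representing the transformed function as a closure mirrors the definition in the text: nothing is rotated, the same function is simply evaluated through a different set of variables.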

    PROBLEMS

    2.1 Apply the transformation R(α, z) (equation (2.2)) to each of the following functions: (a) exp x; (b) (x + iy)²; (c) x² + y² + z²; (d) x ƒ(r), y ƒ(r), z ƒ(r).

    2.2 Write down the linear transformation that corresponds to a rotation of α about the y-axis, and apply it to each of the functions of problem 2.1.

    2.3 The Schrödinger equation for a simple harmonic oscillator of frequency ω is

    $$-\frac{\hbar^{2}}{2m}\frac{d^{2}\psi}{dx^{2}} + \tfrac{1}{2} m\omega^{2} x^{2}\psi = E\psi,$$

    where ψ(x) is an eigenfunction belonging to the energy value E. By operating on the equation with the transformation x = −X, show that ψ(−x) is also an eigenfunction belonging to the same energy level and so are ψ(x) + ψ(−x) and ψ(x) − ψ(−x).

    3. Symmetry Transformations of the Hamiltonian

    We shall now apply linear transformations like R(α, z) (2.2) to the time-independent Schrödinger equation

    $$\mathcal{H}\psi = E\psi, \tag{3.1}$$

    where ℋ is the Hamiltonian operator and E the energy value belonging to the eigenfunction ψ.

    The Hamiltonian for an atom with n electrons, considering the nucleus as fixed and omitting spin dependent terms, is (Schiff 1955, p. 284)

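    $$\mathcal{H} = \sum_{i=1}^{n}\left(-\frac{\hbar^{2}}{2m}\nabla_i^{2} - \frac{n e^{2}}{r_i}\right) + \sum_{i<j}\frac{e^{2}}{r_{ij}}, \tag{3.2}$$

    where m is the mass of an electron, e the charge on a proton, and

    $$\nabla_i^{2} = \frac{\partial^{2}}{\partial x_i^{2}} + \frac{\partial^{2}}{\partial y_i^{2}} + \frac{\partial^{2}}{\partial z_i^{2}}, \qquad r_i = |\mathbf{r}_i|, \qquad r_{ij} = |\mathbf{r}_i - \mathbf{r}_j|. \tag{3.3}$$

    If we apply the transformation R(α, z) (2.2) to the co-ordinates (xi, yi, zi) of each of the n electrons, we have

    $$r_i = (x_i^{2} + y_i^{2} + z_i^{2})^{1/2} = \left[(X_i\cos\alpha - Y_i\sin\alpha)^{2} + (X_i\sin\alpha + Y_i\cos\alpha)^{2} + Z_i^{2}\right]^{1/2} = (X_i^{2} + Y_i^{2} + Z_i^{2})^{1/2} = R_i. \tag{3.4a}$$

    Similarly

    $$r_{ij} = R_{ij}, \tag{3.4b}$$

    and it can easily be shown that ³

    $$\frac{\partial^{2}}{\partial x_i^{2}} + \frac{\partial^{2}}{\partial y_i^{2}} + \frac{\partial^{2}}{\partial z_i^{2}} = \frac{\partial^{2}}{\partial X_i^{2}} + \frac{\partial^{2}}{\partial Y_i^{2}} + \frac{\partial^{2}}{\partial Z_i^{2}}. \tag{3.5}$$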

    Thus substituting these relations into (3.2), we see that the Hamiltonian has precisely the same form when expressed in terms of the (Xi, Yi, Zi) co-ordinates as in terms of the (xi, yi, zi) co-ordinates, i.e.

    $$\mathcal{H}(x_i, y_i, z_i) = \mathcal{H}(X_i, Y_i, Z_i). \tag{3.6}$$

    This is expressed by saying that the transformation R(α, z) leaves the Hamiltonian (3.2) unchanged, or R(α, z) leaves ℋ invariant, or ℋ is invariant under R(α, z), or R(α, z) is a symmetry transformation of ℋ. A symmetry transformation of a Hamiltonian is defined as a linear transformation of co-ordinates which leaves that Hamiltonian invariant in the sense of equation (3.6).

    The reason for applying linear transformations like R(α, z) (2.2) to a Hamiltonian now becomes a little clearer. We have seen that R(α, z) leaves the Hamiltonian (3.2) invariant. However, R(α, z) applied to the eigenfunctions of the Hamiltonian does not in general leave them invariant. Consider for instance the 2p wave functions for a hydrogen atom (example (ii) of § 1). R(α, z) applied to x ƒ(r) gives (X cos α − Y sin α) ƒ(R) which has a different functional form. In particular for α = 90° we obtain −Y ƒ(R) so that R(α, z) has changed one eigenfunction into another. More generally consider a Schrödinger equation

    $$\mathcal{H}(x_i, y_i, z_i)\,\psi_1(x_i, y_i, z_i) = E\,\psi_1(x_i, y_i, z_i). \tag{3.7}$$

    Applying any symmetry transformation T we obtain

    $$\mathcal{H}(X_i, Y_i, Z_i)\,\psi_2(X_i, Y_i, Z_i) = E\,\psi_2(X_i, Y_i, Z_i), \tag{3.8}$$

    where ψ2 in general has a different functional form from ψ1. Thus ψ2(Xi, Yi, Zi) is an eigenfunction of ℋ(Xi, Yi, Zi). Since ℋ(Xi, Yi, Zi) and ℋ(xi, yi, zi) have the same form, we can also say from (3.8) that ψ2(xi, yi, zi) is an eigenfunction of ℋ(xi, yi, zi) and belongs to the same eigenvalue E as ψ1. An alternative method of wording this argument is to say that since (3.8) is a differential equation in terms of the variables Xi, Yi, Zi, we can replace Xi, Yi, Zi by xi, yi, zi or any other set of symbols throughout without upsetting the validity of the equation. Thus (3.8) becomes

    $$\mathcal{H}(x_i, y_i, z_i)\,\psi_2(x_i, y_i, z_i) = E\,\psi_2(x_i, y_i, z_i), \tag{3.9}$$

    which is just our previous conclusion expressed in symbols. Thus we see that the symmetry transformations of a Hamiltonian can be used to relate the different eigenfunctions of one energy level to one another and hence to label them and to discuss the degree of degeneracy of the energy level. Before we can pursue this further (§ 6), we must discuss in greater detail the symmetry transformations of Hamiltonians (§§ 3 and 4) and their effect on wave functions (§ 5).
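
    The argument from (3.7) to (3.9) can be tried out on a case where the eigenfunctions are known in closed form. The sketch below is an illustration under assumptions: instead of the atomic Hamiltonian (3.2) it uses a hypothetical stand-in, the isotropic three-dimensional oscillator ℋ = −½∇² + ½r² in units with ħ = m = ω = 1, which is also invariant under R(α, z), and it takes (2.2) as x = X cos α − Y sin α, y = X sin α + Y cos α, z = Z. For this stand-in, ψ1 = x exp(−r²/2) belongs to E = 5/2.

```python
import math

def old_from_new(alpha, X, Y, Z):
    """Assumed form of (2.2): (x, y, z) in terms of (X, Y, Z)."""
    c, s = math.cos(alpha), math.sin(alpha)
    return (X * c - Y * s, X * s + Y * c, Z)

def hamiltonian_applied(psi, p, h=1e-3):
    """(H psi)(p) for the stand-in H = -1/2 Laplacian + 1/2 r^2.

    The Laplacian is estimated by central differences.
    """
    lap = 0.0
    for k in range(3):
        up, dn = list(p), list(p)
        up[k] += h
        dn[k] -= h
        lap += (psi(*up) - 2.0 * psi(*p) + psi(*dn)) / (h * h)
    r2 = sum(c * c for c in p)
    return -0.5 * lap + 0.5 * r2 * psi(*p)

def psi1(x, y, z):
    return x * math.exp(-0.5 * (x * x + y * y + z * z))

alpha = math.radians(90.0)

def psi2(X, Y, Z):
    # psi1 with (2.2) substituted; for alpha = 90 degrees this is -Y exp(-R^2 / 2).
    return psi1(*old_from_new(alpha, X, Y, Z))

point = (0.4, -0.7, 0.5)
energy1 = hamiltonian_applied(psi1, point) / psi1(*point)
energy2 = hamiltonian_applied(psi2, point) / psi2(*point)

print("E from psi1:", energy1)
print("E from psi2:", energy2)
```

    Both ratios come out close to 5/2, although ψ2 is a different function from ψ1, which is the content of (3.9) for this example.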

    The Hamiltonian (3.2) has two other types of symmetry transformation besides the rotation R. The transformation

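    $$(x_1, y_1, z_1) = (X_2, Y_2, Z_2), \qquad (x_2, y_2, z_2) = (X_1, Y_1, Z_1), \qquad (x_i, y_i, z_i) = (X_i, Y_i, Z_i),\ i = 3 \text{ to } n, \tag{3.10}$$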

    is called the interchange or permutation of the co-ordinates 1 and 2, and is a symmetry transformation of (3.2) as is obvious by inspection. Similarly any permutation of the co-ordinates xi, yi, zi, i = 1 to n, is a symmetry transformation. The other symmetry transformation is the inversion transformation Π

    $$(x_i, y_i, z_i) = (-X_i, -Y_i, -Z_i), \qquad i = 1 \text{ to } n. \tag{3.11}$$

    This can be combined with the rotations. An ordinary rotation such as (2.2) is called a proper rotation, and the combination of a proper rotation with the inversion Π is called an improper rotation. As a particular example of an improper rotation, we have ΠR(180°, x) which is just the reflection mx in the mirror plane x = 0, i.e.

    $$(x_i, y_i, z_i) = (-X_i, Y_i, Z_i), \qquad i = 1 \text{ to } n. \tag{3.12}$$
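
    The statements about the interchange, the inversion and the reflection can be checked with 3 × 3 matrices. This is a sketch under assumptions: the potential energy is taken as the usual Coulomb form, −n/ri summed over electrons plus 1/rij summed over pairs, in units with e = 1, the three electron positions are arbitrary, and only the potential part of the Hamiltonian is tested. All function names are introduced here for illustration.

```python
import math

def matmul(a, b):
    """Product of two 3x3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def apply(matrix, point):
    """Apply a 3x3 matrix to a point (x, y, z)."""
    return tuple(sum(matrix[i][k] * point[k] for k in range(3)) for i in range(3))

def rotation_x(alpha):
    """Proper rotation by alpha about the x-axis."""
    c, s = math.cos(alpha), math.sin(alpha)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]

INVERSION = [[-1.0, 0.0, 0.0], [0.0, -1.0, 0.0], [0.0, 0.0, -1.0]]
REFLECTION_X = [[-1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

def potential(positions):
    """Assumed Coulomb potential energy of n electrons about a nucleus of charge n (e = 1)."""
    n = len(positions)
    origin = (0.0, 0.0, 0.0)
    total = 0.0
    for i, p in enumerate(positions):
        total -= n / math.dist(p, origin)      # attraction to the nucleus
        for q in positions[i + 1:]:
            total += 1.0 / math.dist(p, q)     # repulsion between electrons
    return total

electrons = [(0.3, -0.5, 0.8), (-0.6, 0.2, 0.1), (0.9, 0.4, -0.7)]

# Inversion combined with a 180 degree rotation about x is the reflection in x = 0.
improper = matmul(INVERSION, rotation_x(math.pi))

v_original = potential(electrons)
v_interchanged = potential([electrons[1], electrons[0], electrons[2]])
v_inverted = potential([apply(INVERSION, p) for p in electrons])
v_reflected = potential([apply(REFLECTION_X, p) for p in electrons])

print(improper)
print(v_original, v_interchanged, v_inverted, v_reflected)
```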

    It can easily be verified that all improper rotations, as well as proper ones, leave the Hamiltonian (3.2) invariant. However, there are many simple and important transformations that are not symmetry transformations of (3.2), for instance the transformation to cylindrical polar co-ordinates

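    $$x_i = R_i\cos\phi_i, \qquad y_i = R_i\sin\phi_i, \qquad z_i = Z_i. \tag{3.13}$$

    This transformation is in any case not a linear one because it involves products of Ri with cos φi and sin φi. Also ∇i² becomes

    $$\nabla_i^{2} = \frac{\partial^{2}}{\partial R_i^{2}} + \frac{1}{R_i}\frac{\partial}{\partial R_i} + \frac{1}{R_i^{2}}\frac{\partial^{2}}{\partial \phi_i^{2}} + \frac{\partial^{2}}{\partial Z_i^{2}}, \tag{3.14}$$

    which is not identical in form with (3.3), so that (3.13) is not a symmetry transformation. Of course we may wish to express the Hamiltonian (3.2) in terms of cylindrical polar co-ordinates for some problem, but in the future we shall refer to such a transformation as a change to polar co-ordinates, so as to avoid confusion with symmetry transformations which we will be considering so much that it will be convenient to refer to the latter simply as transformations.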

    We must now indicate briefly what the symmetry transformations are for the Hamiltonians of physical systems besides free atoms and ions which we have been considering so far. An atom has complete spherical symmetry, i.e. it is invariant to any rotation about any axis (cf. problem 3.7), so that it has a higher degree of symmetry than molecules and crystal lattices which are usually only invariant to certain rotations about certain axes (cf. problems 3.4 and 3.5). Thus the latter have some of the symmetry transformations of the atom, but not any radically new ones except for the translational symmetry of a crystal lattice. We have therefore already mentioned in connection with (3.2) almost all the types of symmetry transformation which we shall discuss.

    To sum up, the form of a Hamiltonian remains unchanged by certain linear transformations which are called symmetry transformations of the Hamiltonian. Symmetry transformations in general change the eigenfunctions of one energy level into one another.

    PROBLEMS

    3.1 Show that the following co-ordinate changes are not symmetry transformations of the Hamiltonian (3.2).

    (a) (xi, yi, zi) = (2Xi, 2Yi, 2Zi), i = 1 to n.

    (b) (x1, y1, z1) = (−X1, −Y1, −Z1), (xi, yi, zi) = (Xi, Yi, Zi), i = 2 to n.

    (c) xi = exp Xi, yi = exp Yi, zi = exp Zi, i = 1 to n.

    (d) x1, y1, z1 given in terms of X1, Y1, Z1 by equation (2.2), (xi, yi, zi) = (Xi, Yi, Zi), i = 2 to n.

    (e) xi = Ri cos φi, yi = Ri sin φi, zi = Zi, i = 1 to n.

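    3.2 Express the Hamiltonian (3.2) in terms of spherical polar co-ordinates r, θ, φ, where

    x = r sin θ cos φ, y = r sin θ sin φ, z = r cos θ,

    i.e. show that ∇² takes the form (Schiff 1955, p. 69)

    $$\nabla^{2} = \frac{1}{r^{2}}\frac{\partial}{\partial r}\left(r^{2}\frac{\partial}{\partial r}\right) + \frac{1}{r^{2}\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{r^{2}\sin^{2}\theta}\frac{\partial^{2}}{\partial\phi^{2}},$$

    and express also the rotation (2.2) and the other symmetry transformations mentioned in § 3 in polar co-ordinates. Hence, verify that they again leave the Hamiltonian (3.2) invariant.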

    3.3 Write down the Hamiltonian without spin dependent terms for an ion of nuclear charge Z with n (not equal to Z) electrons, and show that it has the same symmetry transformations as the Hamiltonian (3.2). Do the same for the one-electron Hartree equation (Schiff 1955, p. 284) for the single valence electron of a sodium atom.

    3.4 Write down the Hamiltonian without spin dependent terms for the two electrons in a hydrogen molecule, considering the two protons as fixed at the points ± (0, 0, a). Show that it is (a) invariant under any rotation about the z-axis, but only to 180° rotations about the x- or y-axes, (b) invariant under reflections in the plane through the origin perpendicular to the z-axis and in any plane containing the z-axis, (c) invariant under the inversion and the reflection in the z-axis

    (xi, yi, zi) = (−Xi, −Yi, Zi), i = 1, 2,

    and (d) invariant under the interchange of co-ordinates 1 and 2.

    3.5 In problem 3.4, assume that one of the protons has been replaced by a deuteron, and suppose that the deuteron has a slightly different charge from that of the proton. What effect does this have on the symmetry properties of the Hamiltonian? Although in reality the deuteron and proton have the same charge, they do have different masses and magnetic moments, and this would affect the symmetry of the problem in a similar way to the fictitious difference in charge if the interaction with the nuclear moments were included in the Hamiltonian.

    3.6 Repeat the discussion of problem 3.4 in terms of spherical polar co-ordinates and in terms of cylindrical polar co-ordinates (equation (3.13)). Which set of co-ordinates do you think is most convenient for this problem?

    3.7 A rotation about the origin can be defined mathematically as a linear transformation of co-ordinates that leaves invariant the distance of an arbitrary point (x, y, z) from the origin. Using this definition, show that the Hamiltonian (3.2) is invariant under any rotation about any axis. Show that the definition includes the improper as well as the proper rotations (Margenau and Murphy 1943, p. 310).

    3.8 Show that an improper rotation of 180° about any axis is the same as a reflection in the plane through the origin perpendicular to that axis.

    3.9 Write down the Hamiltonian for a hydrogen atom in small uniform electric and magnetic fields parallel to the z-axis (Schiff 1955, pp. 158, 292) omitting spin dependent terms and considering the nucleus as a fixed charge. Also assume that the 2p eigenfunctions

    $$\psi_1 = x f(r), \qquad \psi_2 = y f(r), \qquad \psi_3 = z f(r),$$

    where ƒ(r) is given by Schiff (1955, p. 85), are still eigenfunctions to a first approximation in the presence of the fields. Prove (a) the 2p level is three-fold degenerate in the absence of the external fields, (b) in the presence of the electric field only, ψ1 and ψ2 are degenerate with one another but need not be degenerate with ψ3, (c) in the presence of the magnetic field only, symmetry arguments do not require any of the functions ψ1, ψ2 and ψ3 to have the same energy so that the 2p level may be split into three levels. Hint: in each of the cases (a), (b) and (c) test whether the reflection in the plane y = 0 and the rotation of 90° about the y-axis are symmetry transformations. If they are, use them to apply the argument of equations (3.7), (3.8), (3.9) to each of the functions ψ1, ψ2, ψ3. Also consider rotations about other axes and other reflections to ensure as far as possible that no degeneracy required by symmetry has been missed.

    4. Groups of Symmetry Transformations

    In this section we shall illustrate and define what is meant by a group in the mathematical sense of the word, and shall show what relevance this concept has to the symmetry transformations of Hamiltonians.

    Example of a group

    Let us first consider the symmetry properties of a particular physical object, namely an equilateral triangle cut out of a piece of cardboard having the same finish on both sides and lying on the table with its vertices at the points 1, 2 and 3 and its centre at the origin of co-ordinates (Fig. 3). Ok, Ol, Om are three other axes perpendicular to the sides, Ok being identical with the negative y-axis. A rotation A of 120° about the z-axis moves the vertex that was at the point 1 to the point 2, etc., and we shall call this an equivalent position of the triangle since it is indistinguishable from the original position. It can easily be seen that the following rotations all leave the triangle in equivalent positions and that there are no other proper rotations that do this.

    FIG. 3. Axes for an equilateral triangle.

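    E = the identity, i.e. no rotation at all,
    A = rotation of 120° about the z-axis,
    B = rotation of 120° about the z-axis in the opposite direction to A,
    K, L, M = rotations of 180° about the axes Ok, Ol, Om respectively. (4.1)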

    If we apply two rotations successively, for instance first A and then K, this moves the top vertex from position 1 first to position 2 and then to 3, the vertex at the position 2 to 3 and then to 2, the vertex at 3 to 1 and then to 1. Thus the combined operation A followed by K is identical with the single rotation L. Similarly K followed by A is the same as M, and it can easily be verified that combining any pair of the rotations (4.1) in either order gives another rotation which is also one of the ones listed in (4.1). If the rotation F applied First followed by S applied Second is equivalent to the single Combined rotation C, we write

    $$C = SF, \tag{4.2}$$

    where it is customary to write the S before the F in analogy with differential operators. For instance

    $$x^{2}\,\frac{d}{dx}\,f(x) \tag{4.3}$$

    means first differentiating ƒ (x) and then multiplying the result by x². This is clearly not the same as

    $$\frac{d}{dx}\,x^{2} f(x), \tag{4.4}$$

    and similarly when combining rotations it is important to follow the convention of (4.2). We have already seen that

    $$KA = L, \qquad AK = M, \tag{4.5}$$

    and similarly it is possible to write down a whole multiplication table (Table 1) where the rotation in the top row is applied first and the rotation in the left column second. There is an important feature of Table 1, namely that for every rotation P, there is also a rotation P⁻¹, say, which undoes the effect of P, and that P also undoes the effect of P⁻¹, i.e.

    PP⁻¹ = P⁻¹P = E.    (4.6)
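
    The ordering convention of (4.2) and the inverse property (4.6) can be checked by simple bookkeeping. The Python sketch below is an illustration added here, not part of the original text. It records each rotation by the position to which it carries the vertex found at each of the points 1, 2, 3, taking A from the description above (1 to 2, 2 to 3, 3 to 1) and K as the rotation that leaves position 1 alone and exchanges 2 and 3. The dictionary representation and the function names `combine` and `inverse` are assumptions made for the example.

```python
# Each rotation is stored as {position: new position of the vertex found there}.
E = {1: 1, 2: 2, 3: 3}   # identity
A = {1: 2, 2: 3, 3: 1}   # 120 degrees about the z-axis
K = {1: 1, 2: 3, 3: 2}   # 180 degrees about Ok


def combine(second, first):
    """Return the single operation equal to `first` followed by `second`.

    The argument order mirrors the written product SF of (4.2).
    """
    return {position: second[first[position]] for position in first}


def inverse(rotation):
    """Return the operation that undoes `rotation`."""
    return {new: old for old, new in rotation.items()}


B = combine(A, A)   # 240 degrees about the z-axis
L = combine(K, A)   # A first, then K
M = combine(A, K)   # K first, then A

print("KA =", L)                                 # leaves position 2 fixed
print("AK =", M)                                 # leaves position 3 fixed
print("KA equals AK:", L == M)                   # the order matters
print("inverse of A is B:", inverse(A) == B)
print("K is its own inverse:", inverse(K) == K)
```

    Writing the second operation as the first argument keeps the code in step with the written product, so that `combine(K, A)` reads as KA.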

    In fact in every case P and P⁻¹ are just two rotations by the same angle about the same axis but in opposite directions. When the angle is 180° this of course makes P and P⁻¹ identical. It can also be verified from the multiplication table that the triple products P(QR) and (PQ)R are always the same, so that they can be written unambiguously as PQR. Alternatively this follows directly from the physical nature of rotations as can easily be shown. These properties suffice to establish that the rotations E, A, B, K, L, M (4.1) are the elements of a group.

    TABLE 1. Multiplication Table for the Group 32
    (top row: rotation applied first; left column: rotation applied second)

        | E  A  B  K  L  M
      --+------------------
      E | E  A  B  K  L  M
      A | A  B  E  M  K  L
      B | B  E  A  L  M  K
      K | K  L  M  E  A  B
      L | L  M  K  B  E  A
      M | M  K  L  A  B  E
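
    The multiplication table and the two checks just described (existence of inverses, equality of the triple products) can be reproduced mechanically. The following Python sketch is an added illustration under stated assumptions: each rotation is represented by the permutation of the positions 1, 2, 3 that it produces, with L and M taken to be the rotations leaving positions 2 and 3 fixed respectively, as implied by the products KA = L and AK = M. The names `ROTATIONS`, `combine` and `table` are hypothetical.

```python
from itertools import product

# perm[p - 1] is the position reached by the vertex that starts at position p.
ROTATIONS = {
    "E": (1, 2, 3),
    "A": (2, 3, 1),
    "B": (3, 1, 2),
    "K": (1, 3, 2),
    "L": (3, 2, 1),
    "M": (2, 1, 3),
}
NAME_OF = {perm: name for name, perm in ROTATIONS.items()}


def combine(second, first):
    """Permutation for `first` followed by `second` (written `second first`)."""
    return tuple(second[image - 1] for image in first)


# table[(S, F)] names the product SF: column F is applied first, row S second.
table = {
    (s, f): NAME_OF[combine(ROTATIONS[s], ROTATIONS[f])]
    for s, f in product(ROTATIONS, repeat=2)
}

print("    " + " ".join(ROTATIONS))
for s in ROTATIONS:
    print(f"{s} | " + " ".join(table[(s, f)] for f in ROTATIONS))

# Every element has a two-sided inverse, as in (4.6).
has_inverses = all(
    any(table[(p, q)] == "E" and table[(q, p)] == "E" for q in ROTATIONS)
    for p in ROTATIONS
)

# P(QR) and (PQ)R agree for all 216 triples.
is_associative = all(
    table[(p, table[(q, r)])] == table[(table[(p, q)], r)]
    for p, q, r in product(ROTATIONS, repeat=3)
)

print("inverses:", has_inverses, "associative:", is_associative)
```

    The lookup in `NAME_OF` would raise a `KeyError` if some product fell outside the six rotations, so building the table without error is itself a check that the set is closed.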

    Definition of a group. A group is a collection of elements A, B, C, D, . . . which have the properties (a) to (e) below. The elements in the simplest cases may be numbers. They may also be any other quantities such as matrices, physical operations like rotations, or mathematical operations such as making a linear transformation of co-ordinates.

    (a) It must be possible to combine any pair of elements F and S in a definite way to form a combination C which we shall write

    C = SF,    (4.7)

    where as before F is the first element, S the second element and C the combination, if the order of F and S is important. In our example with the elements (4.1), the law of combination was "first apply rotation F and then S". With other groups the law of combination may be matrix multiplication or addition. If for two elements PQ = QP then P and Q are said to commute, and if this is so for every pair of elements then the law of combination is commutative and the group is Abelian.

    (b) The combination C = SF of any pair of elements F and S must also be an element of the group. Thus a multiplication table among the group elements can always be set up like Table 1.

    (c) One of the group elements, E say, must have the properties of a unit element, namely

    EP = PE = P    (4.8)

    for every element P. For instance omitting all reference to E would make it impossible to set up a complete multiplication table for the other rotations of (4.1) (cf. Table 1). This is related to the next property.

    (d) Every element P of the group must have an inverse P⁻¹ which also belongs to the group, with the property

    PP⁻¹ = P⁻¹P = E.    (4.9)

    (e) The triple product PQR must be uniquely defined, i.e.

    P(QR) = (PQ)R.    (4.10)

    This is true for all the kinds of elements and laws of combination that we shall wish to deal with, but there are examples where it does not hold, e.g. division of numbers: 24 ÷ (6 ÷ 2) = 8, whereas (24 ÷ 6) ÷ 2 = 2.
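
    For a finite collection of elements, properties (a) to (e) can be tested by exhaustion. The Python sketch below is an added illustration rather than anything from the original text. The function name `is_group` and the two trial sets (the integers 0 to 5 under addition modulo 6, and 1 to 5 under multiplication modulo 6) are assumptions chosen for the example. The last lines repeat the division counter-example to (e) with exact fractions.

```python
from fractions import Fraction
from itertools import product


def is_group(elements, op):
    """Check properties (a) to (e) for a finite set; op(s, f) plays the role of SF."""
    elements = list(elements)

    # (a), (b): every combination is defined and lies in the set.
    if any(op(s, f) not in elements for s, f in product(elements, repeat=2)):
        return False

    # (c): some element E satisfies EP = PE = P for every P.
    units = [e for e in elements
             if all(op(e, p) == p and op(p, e) == p for p in elements)]
    if not units:
        return False
    unit = units[0]

    # (d): every P has an inverse Q with PQ = QP = E.
    for p in elements:
        if not any(op(p, q) == unit and op(q, p) == unit for q in elements):
            return False

    # (e): P(QR) = (PQ)R for every triple.
    return all(op(p, op(q, r)) == op(op(p, q), r)
               for p, q, r in product(elements, repeat=3))


print(is_group(range(6), lambda s, f: (s + f) % 6))      # addition modulo 6
print(is_group(range(1, 6), lambda s, f: (s * f) % 6))   # 2 * 3 = 0 leaves the set

# Division is a definite law of combination but violates (e).
left = Fraction(24) / (Fraction(6) / Fraction(2))
right = (Fraction(24) / Fraction(6)) / Fraction(2)
print(left, right)
```

    The first set passes all five tests and the second fails already at (b). The two division results differ, which is the failure of (e) noted above.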

    Two simple examples of groups are all positive rational numbers with the law of combination being multiplication, and all positive and negative integers including zero with the law of combination being addition. In the latter case it is interesting that zero plays the role of the unit element E. The permutations of n objects, i.e. the operations of rearranging them and not their different arrangements in a row, say, form the permutation group, also known as the symmetric group Sn. The proper rotations by all possible angles about a fixed axis form the axial rotation group. This is clearly Abelian. The full rotation group (Chapter II) consists of all proper rotations about all axes through a point, and this becomes the full rotation and reflection group when all improper rotations are included. There are thirty-two groups of particular interest, each formed from a finite number of particular rotations about a point, which are known as point-groups (§ 16). These clearly do not include all possible finite groups of rotations because, for instance, the rotations by 360r/n degrees about a fixed axis, where r = 1 to n, form a group of n elements for every positive integer n. An example of a point-group is the group (4.1), which is called 32 (pronounced three two, not thirty-two) in the international notation, to denote that it includes some two-fold axes (rotations by 180°) perpendicular to a three-fold axis (120°, 240°). All the proper and improper rotations that move a cube to an equivalent position form the full cubic group m3m. In the older Schoenflies notation these two point-groups are called D3 and Oh. All square matrices of a given order and with non-zero determinant form a group, the law of combination being matrix multiplication. So do all unitary matrices (appendix A) of given order, and likewise all unitary matrices of given order and with determinant +1, as can easily be verified. Finally linear transformations of co-ordinates can form groups as we shall now see.
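
    The remark about rotations by 360r/n degrees can be made concrete. The following Python sketch is an added illustration, with the choice n = 5 and the function names `axial_rotations` and `combine` being assumptions. Rotations about one fixed axis are combined by adding their angles, which are kept as exact fractions and reduced modulo 360.

```python
from fractions import Fraction
from itertools import product


def axial_rotations(n):
    """Angles 360 r / n degrees for r = 1 to n, reduced to the range [0, 360)."""
    return [Fraction(360 * r, n) % 360 for r in range(1, n + 1)]


def combine(second, first):
    """Two rotations about the same axis combine by adding their angles."""
    return (second + first) % 360


angles = axial_rotations(5)

closed = all(combine(s, f) in angles for s, f in product(angles, repeat=2))
abelian = all(combine(s, f) == combine(f, s) for s, f in product(angles, repeat=2))
unit = angles[-1]   # r = n gives 360 degrees, i.e. no rotation at all

print(len(angles), closed, abelian, unit == 0)
```

    Because addition of angles does not depend on the order, every such group is Abelian, in contrast to the group 32, where KA and AK differ.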

    The group of symmetry transformations of a Hamiltonian.

    Consider three protons fixed at the points

    (4.11)

    forming an equilateral triangle about the origin (Fig. 3). The Hamiltonian for one electron moving in the field of the three protons is

    H = −(ħ²/2m)∇² − e²/|r − r1| − e²/|r − r2| − e²/|r − r3|.    (4.12)

    This system is not one of physical importance but its symmetry is closely related to that of an ozone molecule⁴ or that of an ion situated between three water molecules in the hydrated crystal of a salt, to which the following discussion can easily be extended (cf. problems 4.5 and 4.6). The physical system of three protons has the same rotational symmetry as the equilateral triangle already discussed, which suggests that the linear transformations E′, A′, B′, K′, L′, M′ corresponding to the rotations E, A, B, K, L, M (4.1) may be symmetry transformations of the Hamiltonian (4.12). These transformations can easily be found from (2.2) and simple extensions of the argument of § 2. For instance A′ is obtained from (2.2) by putting α = −120° in accordance with § 2 because A (4.1) is a physical rotation of +120°. We obtain:

    (4.13)

    It can easily be verified that all these transformations are indeed symmetry transformations of (4.12). The ∇² remains invariant under each as in § 3, and applying for instance A′ to the other terms we obtain

    whence (4.12) is invariant under A′.

    The result that the transformation A′ of (4.13) is a symmetry transformation of (4.12) is actually no accident and can be proved as follows using the corresponding rotation A without ever writing down the form of A′ or substituting in (4.12). Let the potential in (4.12) due to the protons be

    V(x, y, z) = −e²/|r − r1| − e²/|r − r2| − e²/|r − r3|,

    and let P(x, y, z) be any point. Consider the physical operation A of rotating the point P and the three protons but not the co-ordinate axes. The system of protons is moved into an equivalent position, one proton from r1 to r2, one from r3 to r1 and one from r2 to r3, and P is moved to the position (X, Y, Z). During this rotation the potential at P has remained constant because it depends only on the distances of P from the three protons, and these distances have not changed because P and the protons have been rotated as a rigid whole. Thus V(x, y, z) due to the initial charge distribution is equal to the potential at (X, Y, Z) due to the final charge distribution. Since, however, A has moved the system of protons into an equivalent position, the initial and final charge distributions and potentials are identical, so that the potential at (X, Y, Z) due to the final charge distribution is V(X, Y, Z), i.e. we have

    V(x, y, z) = V(X, Y, Z),    (4.14)

    where according to Fig. 1 of § 2, x, y, z and X, Y, Z are related by equations (2.2) with α = − 120°. It only remains to view (4.14) and (2.2) in terms of a change A′ of co-ordinates, rather than in terms of physical rotations. This only involves a change in the interpretation of (4.14) and (2.2) and does not destroy their validity as correct mathematical relations. We therefore obtain that V(r) is invariant under the co-ordinate transformation A′. This argument applies similarly to all the transformations (4.13), and indeed to any similar situation (cf. problem 4.8).
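
    The statement that the potential takes the same value at a point and at its rotated image can be checked numerically. The Python sketch below is an added illustration under several assumptions that are not taken from the text: the protons are placed at unit distance from the origin with proton 1 on the positive y-axis, units are chosen so that e = 1, and a positive angle means a counterclockwise rotation seen from the positive z-axis. The names `PROTONS`, `potential`, `rotate_about_z` and `unchanged_by` are hypothetical.

```python
import math

# Assumed geometry: protons at unit distance from the origin in the xy-plane,
# with proton 1 on the positive y-axis and the others 120 degrees apart.
PROTONS = [
    (math.cos(math.radians(90 + 120 * k)), math.sin(math.radians(90 + 120 * k)), 0.0)
    for k in range(3)
]


def potential(point):
    """Attractive Coulomb potential of the three protons, in units with e = 1."""
    return -sum(1.0 / math.dist(point, proton) for proton in PROTONS)


def rotate_about_z(point, degrees):
    """Rotate a point about the z-axis, counterclockwise seen from +z."""
    c, s = math.cos(math.radians(degrees)), math.sin(math.radians(degrees))
    x, y, z = point
    return (c * x - s * y, s * x + c * y, z)


def unchanged_by(degrees, point):
    """True if the potential at `point` equals that at the rotated point."""
    return math.isclose(potential(point),
                        potential(rotate_about_z(point, degrees)),
                        rel_tol=1e-9)


sample = (0.3, 0.7, 0.2)
print(unchanged_by(120, sample))   # a symmetry rotation of the triangle
print(unchanged_by(240, sample))   # likewise
print(unchanged_by(90, sample))    # not a symmetry rotation
```

    Only the rotations that carry the set of proton positions into itself leave the potential unchanged at an arbitrary point. A rotation by 90° does not, and the check fails for it.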

    We can also verify from (4.13) or prove by the above type of argument that the transformation A′ followed by K′ is the same as the single transformation L′. For in detail this simply means that first expressing a function f (x, y, z) in terms of X, Y, Z using A′ (4.13) and then in terms of ξ, η, ζ using K′

    (X, Y, Z) = (−ξ, η,−ζ),

    gives the same result as expressing it directly in terms of ξ, η, ζ using the combined transformation

    namely the transformation L′. In symbols K′A′ = L′. Similarly, any of the transformations (4.13) can be combined, and their multiplication table is exactly the same as Table 1 for the point-group 32 of rotations, as can be verified most easily by matrix multiplication. It is also easy to show that the transformations have all the other properties (a) to (e) above required for them to form a group, which we shall call the point-group 32 of transformations.
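
    A verification by matrix multiplication can be sketched as follows. This Python example is an added illustration and rests on assumptions: it uses the matrices of the physical rotations acting on column vectors rather than the co-ordinate substitutions (4.13), whose explicit forms depend on (2.2), and it places the two-fold axes in the xy-plane at 90°, 210° and 330° from the x-axis. The only point of contact with the text is that the matrix assumed for K agrees with (X, Y, Z) = (−ξ, η, −ζ). All function and variable names are hypothetical.

```python
import math


def rot_z(degrees):
    """Matrix of a rotation about the z-axis."""
    c, s = math.cos(math.radians(degrees)), math.sin(math.radians(degrees))
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def half_turn(degrees):
    """Rotation by 180 degrees about an axis in the xy-plane: 2 n n^T - 1."""
    n = (math.cos(math.radians(degrees)), math.sin(math.radians(degrees)), 0.0)
    return [[2 * n[i] * n[j] - (1.0 if i == j else 0.0) for j in range(3)]
            for i in range(3)]


def matmul(second, first):
    """Matrix product `second first`: `first` acts on the vector first."""
    return [[sum(second[i][k] * first[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def same(m1, m2, tol=1e-9):
    return all(abs(m1[i][j] - m2[i][j]) < tol for i in range(3) for j in range(3))


MATRICES = {
    "E": rot_z(0), "A": rot_z(120), "B": rot_z(240),
    "K": half_turn(90), "L": half_turn(210), "M": half_turn(330),
}


def name_of(matrix):
    """Find which of the six matrices equals `matrix`, within rounding error."""
    for name, candidate in MATRICES.items():
        if same(matrix, candidate):
            return name
    raise ValueError("product is not one of the six matrices")


table = {(s, f): name_of(matmul(MATRICES[s], MATRICES[f]))
         for s in MATRICES for f in MATRICES}

print(table[("K", "A")], table[("A", "K")])
```

    Every product is recognised as one of the six matrices, so the set is closed, and the entries for KA and AK come out as L and M, in agreement with (4.5).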

    THEOREM. We shall now generalize this result and prove that the symmetry transformations of a Hamiltonian always form a group.

    Suppose the Hamiltonian is invariant under each of two symmetry transformations F and S. We shall first show that the combined transformation SF (F first, S second) is also a symmetry transformation. Let the co-ordinates x1, y1, z1, x2, y2 . . . zn of the Hamiltonian be written for convenience q1, q2, . . . q3n, and let F be the transformation

    (4.15)

    when written in terms of the summation convention (appendix A), and S be the transformation

    (4.16)

    Now the transformation SF means to substitute first for the qi in terms of the Qi using (4.15) and then to substitute for the Qi further in terms of some new variables vi where

    (4.17)

    Since F and S are both symmetry transformations,

    (4.18)

    so that the composite transformation SF from the qi direct to the vi is also a symmetry transformation. Thus the symmetry transformations satisfy the group requirements (a) and (b) above. We can indeed write down the transformation SF explicitly by eliminating the Qi from (4.15) and (4.17), i.e. SF is

    (4.19)
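
    The step from two symmetry transformations to their composite can be illustrated numerically. The Python sketch below is an added example under stated assumptions and does not reproduce the book's equations: a function `h` of two co-ordinates stands in for the Hamiltonian, each transformation is taken to express the old co-ordinates linearly through the new ones, and the composite is formed by eliminating the intermediate variables, which here amounts to a matrix product. The particular function and matrices are chosen only because they are easy to check.

```python
import math


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def substitute(matrix, new_coords):
    """Old co-ordinates expressed through the new ones: old_i = matrix_ij new_j."""
    return [sum(m * v for m, v in zip(row, new_coords)) for row in matrix]


def h(q):
    """Stand-in for a Hamiltonian: r^2 + r^3 cos(3 theta) in plane polar terms."""
    x, y = q
    return x * x + y * y + x ** 3 - 3 * x * y * y


c, s = math.cos(2 * math.pi / 3), math.sin(2 * math.pi / 3)
F = [[c, -s], [s, c]]            # rotation by 120 degrees
S = [[1.0, 0.0], [0.0, -1.0]]    # reflection y -> -y
composite = matmul(F, S)         # q = F Q and Q = S v give q = (F S) v


def is_symmetry(matrix, points):
    """True if h keeps its form, i.e. h(matrix v) = h(v) at every test point."""
    return all(
        math.isclose(h(substitute(matrix, v)), h(v), rel_tol=1e-9, abs_tol=1e-9)
        for v in points
    )


POINTS = [(0.3, 0.7), (-1.2, 0.4), (2.0, -1.5)]
print(is_symmetry(F, POINTS), is_symmetry(S, POINTS), is_symmetry(composite, POINTS))
```

    Agreement at a few sample points is of course not a proof. It only illustrates the general argument, which rests on the chain of equalities in (4.18).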

    Further we always have the identity transformation

    qi = Qi    (4.20)

    having the property (4.8) of the unit element E, which verifies (c). As regards (d), if we substitute for the qi in the initial Hamiltonian in terms of the Qi using F, we can recover the initial form by solving (4.15) for the Qi in terms of the qi and substituting back. But this is just applying the transformation F⁻¹

    (4.21)

    which undoes the effect of F, and this is
