Introduction To Classical Field Theory
Introduction To Classical Field Theory
Introduction To Classical Field Theory
DigitalCommons@USU
8-2019
Part of the Applied Mathematics Commons, Cosmology, Relativity, and Gravity Commons, Elementary
Particles and Fields and String Theory Commons, and the Geometry and Topology Commons
Recommended Citation
Torre, Charles G., "Introduction to Classical Field Theory" (2019). All Complete Monographs. 3.
https://digitalcommons.usu.edu/lib_mono/3
C. G. Torre
Department of Physics
Utah State University
Version 1.2
June 2019
2
About this document
This is a quick and informal introduction to the basic ideas and mathematical
methods of classical relativistic field theory. Scalar fields, spinor fields, gauge
fields, and gravitational fields are treated. The material is based upon lecture
notes for a course I teach from time to time at Utah State University on
Classical Field Theory.
This version, 1.2, is roughly the same as version 1.1. The update includes:
c 2016, 2019 C. G. Torre
3
4
Contents
Preface 9
2 Klein-Gordon field 17
2.1 The Klein-Gordon equation . . . . . . . . . . . . . . . . . . . 17
2.2 Solving the KG equation . . . . . . . . . . . . . . . . . . . . . 18
2.3 A small digression: one particle wave functions . . . . . . . . . 20
2.4 Variational principle . . . . . . . . . . . . . . . . . . . . . . . 21
2.5 Getting used to δ. Functional derivatives. . . . . . . . . . . . . 26
2.6 The Lagrangian . . . . . . . . . . . . . . . . . . . . . . . . . . 27
2.7 The Euler-Lagrange equations . . . . . . . . . . . . . . . . . . 28
2.8 Jet Space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
2.9 Miscellaneous generalizations . . . . . . . . . . . . . . . . . . 33
2.9.1 External “sources” . . . . . . . . . . . . . . . . . . . . 33
2.9.2 Self-interacting field . . . . . . . . . . . . . . . . . . . 34
2.9.3 KG in arbitrary coordinates . . . . . . . . . . . . . . . 35
2.9.4 KG on any spacetime . . . . . . . . . . . . . . . . . . . 39
2.10 PROBLEMS . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
5
6 CONTENTS
11 Goodbye 233
11.1 Suggestions for Further Reading . . . . . . . . . . . . . . . . . 233
Preface
This document was created to support a course in classical field theory which
gets taught from time to time here at Utah State University. In this course,
hopefully, you acquire information and skills that can be used in a variety
of places in theoretical physics, principally in quantum field theory, particle
physics, electromagnetic theory, fluid mechanics and general relativity. As
you may know, it is usually in courses on such subjects that the techniques,
tools, and results we shall develop here are introduced – if only in bits and
pieces as needed. As explained below, it seems better to give a unified,
systematic development of the tools of classical field theory in one place. If
you want to be a theoretical/mathematical physicist you must see this stuff
at least once. If you are just interested in getting a deeper look at some
fundamental/foundational physics and applied mathematics ideas, this is a
good place to do it.
The traditional physics curriculum supports a number of classical1 field
theories. In particular, there is (i) the “Newtonian theory of gravity”, based
upon the Poisson equation for the gravitational potential and Newton’s laws,
and (ii) electromagnetic theory, based upon Maxwell’s equations and the
Lorentz force law. Both of these field theories appear in introductory physics
courses as well as in upper level courses. Einstein provided us with another
important classical field theory – a relativistic gravitational theory – via
his general theory of relativity. This subject takes some investment in geo-
metrical technology to adequately explain. It does, however, also typically
get a course of its own. These courses (Newtonian gravity, electrodynam-
ics, general relativity) are traditionally used to cover a lot of the concepts
1
Here, and in all that follows, the term “classical” is to mean “not quantum”, e.g., as
in “the classical limit”. Sometimes people use “classical” to also mean non-relativistic; we
shall definitely not being doing that here. Indeed, every field theory we shall consider is
“relativistic”.
9
10 CONTENTS
and methodology of classical field theory. The other field theories that are
important (e.g., Dirac, Yang-Mills, Klein-Gordon) typically arise, physically
speaking, not as classical field theories but as quantum field theories, and it
is usually in a course in quantum field theory that these other field theories
are described. So, in a typical physics curriculum, it is through such courses
that a student normally gets exposed to the tools and results of classical field
theory. This book reflects an alternative approach to learning classical field
theory, which I will now try to justify.
The traditional organization of material just described, while natural in
some ways, overlooks the fact that field theories have many fundamental fea-
tures in common – features which are most easily understood in the classical
limit – and one can get a really good feel for what is going on in “the big pic-
ture” by exploring these features in a general, systematic way. Indeed, many
of the basic structures appearing in classical field theory (Lagrangians, field
equations, symmetries, conservation laws, gauge transformations, etc.) are
of interest in their own right, and one should try to master them in general,
not just in the context of a particular example (e.g., electromagnetism), and
not just in passing as one studies the quantum field. Moreover, once one has
mastered a sufficiently large set of such field theoretic tools, one is in a good
position to discern how this or that field theory differs from its cousins in
some key structural way. This serves to highlight physical and mathematical
ingredients that make each theory special.
From a somewhat more pragmatic point of view, let me point out that
most quantum field theory texts rely quite heavily upon one’s facility with the
techniques of classical field theory. This is, in part, because many quantum
mechanical structures have analogs in a classical approximation to the theory.
By understanding the “lay of the land” in the classical theory through a
course such as this one, one gets a lot of insight into the associated quantum
field theories. It is hard enough to learn quantum field theory without having
to also assimilate at the same time concepts that are already present in the
much simpler setting of classical field theory. So, if you are hoping to learn
quantum field theory some day, this class should help out quite a bit.
A final motivation for the creation and teaching of this course is to support
the research activities of a number of faculty and students here at Utah State
University. The geometric underpinnings of classical field theory feature in a
wide variety of research projects here. If you want to find out what is going
on in this research – or even participate – you need to speak the language.
I have provided a number of problems you can use to facilitate your
CONTENTS 11
learning the material. They are presented first within the text in order to
amplify the text and to give you a contextual hint about what is needed
to solve the problem. At the end of each chapter the problems which have
appeared are summarized for your convenience.
I would like to thank the numerous students who have endured the rough
set of notes from which this document originated and who contributed nu-
merous corrections. I also would like to thank Joseph Romano for his (unfor-
tunately rather lengthy) list of corrections. Finally, I would like to acknowl-
edge Ian Anderson for his influence on my geometric point of view regarding
Lagrangians and differential equations.
12 CONTENTS
Chapter 1
13
14 CHAPTER 1. WHAT IS A CLASSICAL FIELD THEORY?
For example, if the medium is the air surrounding you, u could represent
its compression/rarefaction relative to some standard value. For small dis-
placements, the medium can often be well-modeled by supposing that the
displacement field u satisfies the wave equation
1
u,tt − ∇2 u = 0, (1.1)
c2
where the comma notation indicates partial derivatives, ∇2 is the Laplacian,
and c is a parameter representing the speed of sound in the medium. We say
that u is the field variable and that the wave equation is the field equation.
∇ · E = 4πρ, (1.5)
∇ · B = 0, (1.6)
1 ∂E 4π
∇×B− = j, (1.7)
c ∂t c
1 ∂B
∇×E+ = 0, (1.8)
c ∂t
where ρ is the charge density and j is the electric current density.
As we shall see later, the electromagnetic field is also fruitfully described
using potentials. They are defined via
1 ∂A
E = −∇φ − , B = ∇ × A. (1.9)
c ∂t
In each of these examples, the field variable(s) are u, φ, (E, B) – or
(φ, A), and they are determined by PDEs. The “infinite number of degrees
of freedom” idea is that, roughly speaking, the general solution to the field
equations (wave, Poisson, or Maxwell) involves arbitrary functions. Thus
the space of solutions to the PDEs – physically, the set of field configurations
permitted by the laws of physics – is infinite dimensional.
1
We are using Gaussian units for the electromagnetic field.
16 CHAPTER 1. WHAT IS A CLASSICAL FIELD THEORY?
Chapter 2
Klein-Gordon field
The simplest relativistic classical field is the Klein-Gordon field. It and its
various generalizations are used throughout theoretical physics.
17
18 CHAPTER 2. KLEIN-GORDON FIELD
you will see in this text.) You can see that the Klein-Gordon (KG) equation
is just a simple generalization of the wave equation, which it reduces to when
m = 0. In quantum field theory, quantum states of the Klein-Gordon field
can be characterized in terms of “particles” with rest mass m and no other
structure (e.g., no spin, no electric charge, etc.) So the Klein-Gordon field
is physically (and mathematically, too) the simplest of the relativistic fields
that one can study.
If you like, you can view the Klein-Gordon equation as a “toy model” for
the Maxwell equations which describe the electromagnetic field. The quan-
tum electromagnetic field is characterized by “photons” which have vanishing
rest mass, no electric charge, but they do carry intrinsic “spin”. Coherent
states of the quantum electromagnetic field contain many, many photons and
are well approximated using “classical” electromagnetic fields satisfying the
Maxwell equations. Likewise, you can imagine that coherent states involv-
ing many “Klein-Gordon particles” (sometimes called scalar mesons) are well
described by a classical scalar field satisfying the Klein-Gordon equation.
The KG equation originally arose in an attempt to give a relativistic
generalization of the Schrödinger equation. The idea was to let ϕ be the
complex-valued wave function describing a spinless particle of mass m. But
this idea didn’t quite work out as expected (see below for a hint as to what
goes wrong). Later, when it was realized that a more viable way to do
quantum theory in a relativistic setting was via quantum field theory, the KG
equation came back as a field equation for a quantum field whose classical
limit is the KG equation above. The role of the KG equation as a sort of
relativistic Schrödinger equation does survive the quantum field theoretic
picture, however. The story is too long to go into in this course, but we will
give a hint as to what this means in a moment.
Let us pause and notice something familiar here. Granted a little Fourier
analysis, the KG equation is, via (2.5), really just an infinite collection of
uncoupled “harmonic oscillator equations” for the real and imaginary parts
of ϕ̂k (t) with “natural frequency” ωk . Thus we can see quite explicitly how
the KG field is akin to a dynamical system with an infinite number of degrees
of freedom. Indeed, if we label the degrees of freedom with the Fourier wave
vector k then each degree of freedom is a harmonic oscillator. It is this
interpretation which is used to make the (non-interacting) quantum Klein-
Gordon field: each harmonic oscillator is given the usual quantum mechanical
treatment.
20 CHAPTER 2. KLEIN-GORDON FIELD
and 3/2 Z
− 1
ϕ := d3 k a∗k e−ik·r+iωk t (2.11)
2π R3
∂ϕ+ √
i = −∇2 + m2 ϕ+ , (2.12)
∂t
where the square root operator is defined via Fourier analysis to be3
3/2 Z
√
1
−∇2 + m2 ϕ+ := d3 k ωk ak eik·r−iωk t . (2.13)
2π R3
the positive frequency solutions are sometimes called the “one-particle wave
functions”.
In a quantum field theoretic treatment, the (normalizable) positive fre-
quency solutions represent the wave functions of KG particles. What about
the negative frequency solutions? There are difficulties in using the negative
frequency solutions to describe KG particles. For example, you can easily
check that they satisfy a Schrödinger equation with a negative kinetic energy,
which is unphysical. Moreover, the relativistic inner product with respect to
which one can normalize the positive frequency solutions leads to a negative
norm for the negative frequency solutions. This means that the negative
frequency part of the solution to the KG equation cannot be used to describe
the quantum mechanics of a single particle. In quantum field theory (as
opposed to quantum mechanics), the negative frequency solutions are inter-
preted in terms of the possibility for destruction of particles. Quantum field
theory, you see, allows for creation and destruction of particles. But now we
are going too far afield. . .
The region R can be anything you like at this point, but typically we assume
We restrict attention to fields such that S[ϕ] exists. For example, we can
assume that ϕ is always a smooth function of compact spatial support. The
value of the integral, of course, depends upon which function ϕ we choose,
so the formula (2.15) assigns a real number to each function ϕ. We say that
S = S[ϕ] is a functional of ϕ. We will use this functional to obtain the KG
equation by a variational principle. When a functional can be used in this
manner it is called an action functional for the field theory.
The variational principle goes as follows. Consider any family of func-
tions, labeled by a parameter λ, which includes some given function ϕ0 at
λ = 0. We say we have a one-parameter family of fields, ϕλ . As a random
example, we might have
2 +y 2 +z 2 )
ϕλ = cos(λ(t + x)) e−(x . (2.17)
This is equivalent to the vanishing of the rate of change of f along any curve
through (x0 , y0 , z0 ). To see this, define the parametric form of a curve ~x(λ)
passing through ~x(0) = (x0 , y0 , z0 ) via
The tangent vector T (λ) to the curve at the point ~x(λ) has Cartesian com-
ponents
T~ (λ) = (x0 (λ), y 0 (λ), z 0 (λ)), (2.22)
and the rate of change of a function f = f (x, y, z) along the curve at the
point ~x(λ) is given by
~
T (λ) · ∇f
= x0 (λ)∂x f (~x(λ)) + y 0 (λ)∂y f (~x(λ)) + z 0 (λ)∂z f (~x(λ)). (2.23)
~
x=~
x(λ)
It should be apparent from this equation that the vanishing of the rate of
change of f along any curve passing through the point ~x0 is equivalent to the
vanishing of the gradient of f at ~x0 , which is the same as the vanishing of all
the first derivatives of f at ~x0 . It is this interpretation of “critical point” in
terms of vanishing rate of change along any curve that we generalize to the
infinite-dimensional space of fields.
We shall show that the critical points of the functional S[ϕ] correspond
to functions on spacetime which solve the KG equation. To this end, let
us consider a curve that passes through a putative critical point ϕ at, say,
λ = 0.5 This is easy to arrange. For example, let ϕ̂(λ) be any curve in the
space of fields. Define ϕλ via
We call δS the first variation of the action; its vanishing is the condition for
a critical point. We can compute δS explicitly by applying (2.25) to S[ϕλ ];
we find Z
d4 x ϕ,t δϕ,t − ∇ϕ · ∇δϕ − m2 ϕδϕ ,
δS = (2.26)
R
5
We now drop the distracting subscript 0.
24 CHAPTER 2. KLEIN-GORDON FIELD
The critical point condition means that δS = 0 for all variations of ϕ, and
we want to see what that implies about the critical point ϕ.
To this end we observe that δϕ is a completely arbitrary function (aside
from regularity and boundary conditions to be discussed below). To see this,
let ψ be any function you like and consider the curve
ϕλ = ϕ + λψ, (2.28)
so that
δϕ = ψ. (2.29)
To make use of the requirement that δS = 0 must hold for arbitrary δϕ, we
integrate by parts via the divergence theorem in δS:
Z
4 2 2
δS = d x − ϕ,tt + ∇ ϕ − m ϕ δϕ
R
hZ it2 Z t2 Z
3
+ d x ϕ,t δϕ − dt d2 A n · ∇ϕδϕ
R3 t1 t1 r→∞
(2.30)
Here the last two terms are the boundary contributions from ∂R. For con-
creteness, I have assumed that
R = [t1 , t2 ] × R3 . (2.31)
The last integral is over the “sphere at infinity” in R3 , with n being the
outward unit normal to that sphere.
If you need a little help seeing where (2.30) came from, the key is to write
ϕ,t δϕ,t − ∇ϕ · ∇δϕ = ∂t (ϕ,t δϕ) − ∇ · (∇ϕδϕ) − ϕ,tt δϕ + (∇2 ϕ)δϕ. (2.32)
The first term’s time integral is easy to perform, and the second term’s spatial
integral can be evaluated using the divergence theorem.
To continue with our analysis of (2.30), we make two assumptions re-
garding the boundary conditions to be placed on our various fields. First, we
2.4. VARIATIONAL PRINCIPLE 25
note that ϕ, and ϕλ , and hence δϕ, must vanish at spatial infinity (r → ∞
at fixed t) in order for the action integral to converge. Further, we assume
that ϕ and δϕ vanish as r → ∞ fast enough so that in δS the boundary
integral over the sphere at infinity vanishes. One way to do this systemat-
ically is to assume that all fields have “compact support” in space, that is,
at each time t they all vanish outside of some bounded region in R3 . Other
asymptotic conditions are possible, but since the area element (dA) in the
integral over the sphere grows like r2 the integrand should fall off faster than
1/r2 as r → ∞ for this boundary term to vanish.
Secondly, we hold fixed the initial and final values of the fields – a step
which should be familiar to you from the variational formulation of classical
mechanics. To this end we fix two functions
φ1 , φ2 : R3 → R (2.33)
and we assume that at t1 and t2 , for any allowed ϕ (not just the critical
point),
ϕ|t1 = φ1 , ϕ|t2 = φ2 . (2.34)
The functions φ1 and φ2 are fixed but arbitrary, subject to the asymptotic
conditions as r → ∞. Now, for our one parameter family of fields we also
demand
ϕλ |t1 = φ1 , ϕλ |t2 = φ2 , (2.35)
which forces
δϕ|t1 = 0 = δϕ|t2 . (2.36)
This forces the vanishing of the first term in the boundary contribution to
δS in (2.30).
With these boundary conditions, we see that the assumption that ϕ is a
(smooth) critical point implies that
Z
d4 x −ϕ,tt + ∇2 ϕ − m2 ϕ δϕ
0= (2.37)
R
for any function δϕ subject to (2.36) and the asymptotic conditions just
described. Now, it is a standard theorem in calculus that this implies
− ϕ,tt + ∇2 ϕ − m2 ϕ = 0 (2.38)
This, then, is the variational principle for the KG equation. The critical
points of the KG action, subject to the two types of boundary conditions we
described (asymptotic conditions at spatial infinity and initial/final boundary
conditions), are the solutions of the KG equation.
We have seen that, with our choice of boundary conditions, the KG action
is differentiable and that
δS
= −ϕ,tt + ∇2 ϕ − m2 ϕ. (2.45)
δϕ
In general, the idea of variational principles is to encode the field equations
into an action S[ϕ] so that they arise as the equations
δS
= 0. (2.46)
δϕ
where
1 2
ϕ,t − (∇ϕ)2 − m2 ϕ2
L= (2.50)
2
is the Lagrangian density. The Lagrangian density for the KG field depends
at a point (t, x, y, z) on the values of the field ϕ and its first derivatives at
(t, x, y, z) We say that L is a local function of the field. Theories like the
KG theory which admit an action which is a spacetime integral of a local
Lagrangian density are called local field theories.
28 CHAPTER 2. KLEIN-GORDON FIELD
Finally, notice that the Lagrangian for the KG theory can be viewed as
having the same structure as that for a finite dimensional dynamical system
in non-relativistic Newtonian mechanics, namely, L = T − U , where T can
be viewed as a kinetic energy for the field,
Z
1
T = d3 x ϕ2,t (2.51)
R3 2
and U plays the role of potential energy:
Z
1
d3 x (∇ϕ)2 + m2 ϕ2 .
U := (2.52)
R3 2
Evidently, we can view 21 ϕ2,t as the kinetic energy density, and view (∇ϕ)2 +
m2 ϕ2 as the potential energy density.
We now give a slightly more general way to think about this last com-
putation, which is very handy for certain purposes. This point of view is
developed more formally in the next section.
First, we view the formula giving the definition of the Lagrangian as a
function of 9 variables
L = L(x, ϕ, ϕα ), (2.56)
where now, formally, ϕ and ϕα are just a set of 5 variables upon which the
Lagrangian density depends.6 (The KG Lagrangian density does not actually
depend upon xα except through the field, so in this example L = L(ϕ, ϕα ),
but it is useful to allow for this possibility in the future.) This 9 dimensional
space is called the first jet space for the scalar field. In this point of view
the field ϕ does not depend upon xα and neither does ϕα . The fields are
recovered as follows. For each function f (x) there is a field obtained as a
graph in the 5-dimensional space of (xα , ϕ), specified by ϕ = f (x). Similarly,
in this setting we do not view ϕα as the derivatives of ϕ; given a function
f (x) we can extend the graph into the 9-dimensional space (xα , ϕ, ϕα ) via
(ϕ = f (x), ϕα = ∂α f (x)). We can keep going like this. For example, we
could work on a space parametrized by (xα , ϕ, ϕα , ϕαβ ), where ϕαβ = ϕβα
parametrizes the values of the second derivatives. This space is the second
jet space; it has dimension 19 (exercise)! Given a field ϕ = f (x) we have a
graph in this 19 dimensional space given by (xα , f (x), ∂α f (x), ∂α ∂β f (x)).
Next, for any formula F (x, ϕ, ϕα ) built from the coordinates, the fields,
and the first derivatives of the fields introduce the total derivative
∂F ∂F ∂F
Dα F (x, ϕ, ϕα ) = + ϕα + ϕαβ . (2.57)
∂xα ∂ϕ ∂ϕβ
The total derivative just implements in this new setting the calculation of
spacetime derivatives of F via the chain rule. In particular, if we imagine
substituting a specific field, ϕ = f (x) into the formula F , then F becomes a
function F of x only:
the derivative of F:
Dα F (x, ϕ, ϕα ) = ∂α F(x). (2.59)
ϕ=f (x)
where
∂L
Vα = δϕ. (2.62)
∂ϕα
We define the Euler-Lagrange derivative of (or Euler-Lagrange expression for)
the Lagrangian density via
∂L ∂L
E(L) := − Dα . (2.63)
∂ϕ ∂ϕα
Evidently, with our boundary conditions the functional derivative of the KG
action is the same as the EL derivative of the Lagrangian density (evaluated
on a field (ϕ = ϕ(x), ϕα = ∂α ϕ(x), ϕαβ = ∂α ∂β ϕ(x))). We have
δS
= E(L) = −ϕ,tt + ∇2 ϕ − m2 ϕ, (2.64)
δϕ
ϕ=ϕ(x)
L = Dα W α , (2.66)
where
W α = W α (ϕ). (2.67)
Show that
E(L) ≡ 0. (2.68)
is given by
∂F ∂F ∂F ∂kF
Dα F = + ϕ,α + ϕ,αβ + · · · + ϕ,αα1 ···αk (2.73)
∂xα ∂ϕ ∂ϕ,β ∂ϕ,α1 ···αk
Here we use the comma notation for (would-be) derivatives in conformation
with standard notation in physics. Notice that the total derivative of a
function on the J k is actually a function on J k+1 .
From the total derivative formula, it follows that divergences have trivial
Euler-Lagrange derivatives
E(Dα V α ) = 0. (2.74)
This reflects the fact that the Euler-Lagrange derivative corresponds to the
functional derivative of the action integral in the case that the action func-
tional is differentiable. In particular, the Euler-Lagrange derivative ignores
all terms on the boundary of the domain of integration of the action integral.
To make contact between jet space and the usual calculus of variations
one evaluates jet space formulas on a specific function ϕ = ϕ(x), via
∂ϕ(x) ∂ 2 ϕ(x)
ϕ = ϕ(x), ϕ,α = , ϕ,αβ = , .... (2.75)
∂xα ∂xα ∂xβ
In this way formulas defined as function on jet space become formulas involv-
ing only the spacetime. A good framework for doing all this is to view jet
space as a fiber bundle over spacetime. A particular KG field defines a cross
section of that fiber bundle which can be used to pull back various structures
to the base space.
1
L = (ϕ2,t − (∇ϕ)2 − m2 ϕ2 ) − jϕ. (2.77)
2
The Euler-Lagrange (EL) equations are then
0 = E(L) = ( − m2 )ϕ − j. (2.78)
The slickest way to solve this KG equation with sources is via Green’s func-
tions.
This is a “toy model” of the Maxwell equations for the electromagnetic
field in the presence of sources (electric charges and currents). Note that we
have here an instance of a Lagrangian which, viewed as a function on jet
space, depends upon the spacetime point (t, x, y, z) via j = j(t, x, y, z). In
quantum field theory, the presence of a source will lead to particle produc-
tion/annihilation via a transfer of energy-momentum (to be defined soon)
between the field and the source.
34 CHAPTER 2. KLEIN-GORDON FIELD
1 1
V (ϕ) = − a2 ϕ2 + b2 ϕ4 . (2.81)
2 4
g(V~ , W
~ ) = gαβ V α W β = −V t W t + V x W x + V y W y + V z W z . (2.82)
where the components of the inverse metric in the inertial Cartesian coordi-
nates happens to be the same as the components of the metric:
−1 0 0 0
0 1 0 0
g αβ =
0 0 1 0 .
(2.85)
0 0 0 1
The matrices gαβ and g αβ are symmetric and they are each other’s inverse:
Although I won’t prove it here, it is not too hard to show that, after a
coordinate transformation, the EL equations of the transformed Lagrangian
density are the transformed EL equations. (This is not obvious, but must be
proved!) Consequently, the easiest way to find the transformed field equation
is to transform the Lagrangian and then compute the field equations.
Let me take a moment to spell out how the metric behaves under a
change of coordinates. Call the old coordinates xα = (t, x, y, z). Call the
new coordinates x̂α . Of course we will have an invertible transformation
between the two coordinate systems. With the usual abuse of notation we
will write
xα = xα (x̂), x̂α = x̂α (x). (2.87)
The metric in the new coordinates has components
∂xγ ∂xδ
ĝαβ (x̂) = gγδ (x(x̂)). (2.88)
∂ x̂α ∂ x̂β
Note that, while the original metric components formed a diagonal array
of constants, the new metric components will, in general, form some 4 × 4
symmetric array of functions. One should now compute the inverse metric
components,
∂ x̂γ ∂ x̂δ γδ
ĝ αβ (x̂) = g (x(x̂)). (2.89)
∂xα ∂xβ
Equivalently, one can compute ĝ αβ (x̂) by finding the matrix inverse to ĝαβ (x̂).
2.9. MISCELLANEOUS GENERALIZATIONS 37
1p
− det(g) g −1 (dϕ, dϕ) + m2 ϕ2
L=−
2
1p
− det(g) g αβ ∂α ϕ∂β ϕ + m2 ϕ2 .
=− (2.90)
2
There are two ingredients to this formula which should be explained. First,
recall that if ϕ is a function on spacetime then dϕ – the differential of ϕ –
is a 1-form with components ϕ,α . We have made a scalar from dϕ using the
inner product defined
p by the inverse metric. Second, we have introduced an
overall factor of − det(g). You can easily check that this factor is unity
when inertial Cartesian coordinates are used. Under a change of coordinates
the determinant changes as
∂x 2
xα → x̂α , det(ĝ) = det( ) det(g). (2.91)
∂ x̂
This compensates the change in the coordinate volume element in the action
integral:
4
p ∂ x̂ 4 ∂x p p
d x̃ − det(ĝ) = det( ) d x det( ) − det(g) = d4 x − det(g),
∂x ∂ x̂
(2.92)
so that the same formula (2.90) can be used in any coordinates provided
you use the metric appropriate to that coordinate system. Notice that while
the Lagrangian does not explicitly depend upon the coordinates when using
an inertial Cartesian coordinate system it may depend upon the coordinates
in general. Indeed, just switching (x, y, z) to spherical polar coordinates will
introduce explicit coordinate dependence in the Lagrangian, as you can easily
verify.
It is now a straightforward exercise to show that the EL equations of the
KG Lagrangian (2.90) take the form:
p p
∂α − det(g)g αβ ∂β ϕ(x) − m2 − det(g)ϕ(x) = 0. (2.93)
You can also check that, under a change of coordinates xα → x̂α the trans-
formed Euler-Lagrange equations can be computed using (2.93) provided the
metric ĝαβ appropriate to the x̂α coordinate system is used.
38 CHAPTER 2. KLEIN-GORDON FIELD
At this point I want to try to nip some possible confusion in the bud.
While we have a geometric prescription for computing the KG Lagrangian
in any coordinates, it is not a good idea to think that there is but one KG
Lagrangian for all coordinate systems. Strictly speaking, different coordi-
nate systems will, in general, lead to different Lagrangians. This comment
is supposed to be completely analogous to the previously mentioned fact
that, while we can compute “the KG equation” in any coordinate system,
each coordinate system leads, in general, to a different PDE. Likewise, we
have different functions L(x, ϕ, ∂ϕ) in different coordinates. For example, in
Cartesian coordinates L is in fact independent of xα , which need not be true
in other coordinates, e.g., spherical polar coordinates.
Coordinates are convenient for computations, but they are more or less
arbitrary, so it should be possible – and is usually advantageous – to have
a formulation of the KG theory which is manifestly coordinate-free. Let me
just sketch this so you can get a flavor of how it goes.
We consider R4 equipped with a flat metric g of Lorentz signature.8 We
introduce a scalar field ϕ : R4 → R and a parameter m. Let (g) be the
volume 4-form defined by the metric. We define the KG Lagrangian as a
4-form via
1
L = − g −1 (dϕ, dϕ) + m2 ϕ2 (g).
(2.94)
2
The Lagrangian is to be viewed as a function on the jet space J 1 of the KG
field. The metric must be specified to construct this function. Our discussion
concerning the fact that different coordinates imply different Lagrangians
(and EL equations) can be stated in coordinate free language as follows.
Consider a diffeomorphism
f : R4 → R4 . (2.95)
The diffeomorphism defines a new metric ĝ by pull-back:
ĝ = f ∗ g. (2.96)
8
This means that the eigenvalues of the metric component matrix have the signs (-
+++). If this is true in one coordinate system it will be true in any coordinate system.
2.9. MISCELLANEOUS GENERALIZATIONS 39
1 −1
ĝ (dϕ, dϕ) + m2 ϕ2 (ĝ).
L̂ = − (2.97)
2
This metric is flat and can equally well be used to build the KG theory. The
EL equations arising from L̂ or L are the KG equations defined using ĝ or g,
respectively. The relation between the solution spaces of these two equations
is that there is a bijection between the spaces of smooth solutions to each
equation. The bijection between solutions ϕ to the EL equations of L and
the solutions ϕ̂ to the EL equations of L̂ is simply
ϕ̂ = f ∗ ϕ. (2.98)
ϕ → f ∗ ϕ, (2.101)
the Lagrangian should not change. Of course, in order for the Lagrangian to
stay unchanged (say, as a function on jet space) one must also redefine the
metric by the diffeomorphism,
g → f ∗ g. (2.102)
But, as we already agreed, the Lagrangian changes when you use a different
metric. The point is that the metric is not one of the fields in the KG field
theory and you have no business treating it as such. Now, if the metric itself
is treated as a field (not some background structure), subject to variation,
EL equations, etc., then the Lagrangians we have written are generally co-
variant. Of course, we no longer are studying the KG theory, but something
much more complex, e.g., there are now 11 coupled non-linear field equations
instead of 1 linear field equation. We will return to this issue again when we
discuss what is meant by a symmetry.
2.10. PROBLEMS 41
2.10 PROBLEMS
1. Verify (2.4)–(2.9).
L = Dα W α , (2.103)
where
W α = W α (ϕ). (2.104)
Show that
E(L) ≡ 0. (2.105)
4. Obtain a formula for the vector field V α appearing in the boundary term
in the Euler-Lagrange identity (2.70).
Dα j α = 0, when ( − m2 )ϕ = 0. (3.2)
43
44 CHAPTER 3. SYMMETRIES AND CONSERVATION LAWS
∂ϕ(x) ∂ k ϕ(x)
j α (x) := j α (x, ϕ(x), ,..., ) (3.4)
∂x ∂xk
such that
∂ α
j (x) = 0. (3.5)
∂xα
You can easily see in inertial Cartesian coordinates that our definition of
a conserved current j α = (j 0 , j 1 , j 2 , j 3 ) simply says that the field equations
imply a continuity equation for the density ρ ≡ j 0 (a function on spacetime)
and the current density ~j = (j 1 j 2 , j 3 ) (a time dependent vector field on space)
associated with any solution ϕ(x) of the field equations:
∂ρ
+ ∇ · ~j = 0, (3.6)
∂t
where
∂ϕ(x) ∂ k ϕ(x)
ρ(x) = j 0 (x, ϕ(x), ,..., ), (3.7)
∂x ∂xk
and
∂ϕ(x) ∂ k ϕ(x)
(~j(x))i = j i (x, ϕ(x), ,..., ), i = 1, 2, 3. (3.8)
∂x ∂xk
The utility of the continuity equation is as follows. Define the total charge
contained in the region V of space at a given time t to be
Z
QV (t) = d3 x ρ(t, ~x). (3.9)
V
3.2. CONSERVATION OF ENERGY 45
Note that the total charge is a functional of the field, that is, its value depends
upon which field you choose. The integral over V of the continuity equation
implies that Z
d ~j · dS .
QV (t) = − (3.10)
dt ∂V
Keep in mind that this relation is meant to be valid when the field is a
solution to the field equation.
We call the right hand side of (3.10) the net flux into V . We say the charge QV
is conserved since we can account for its time rate of change purely in terms
of the flux into or out of the region V . In this sense there is no “creation”
or “destruction” of the charge, although the charge can move from place to
place.
With suitable boundary conditions, one can choose V such that charge
cannot enter or leave the region and so the total charge is constant in time.
In this case we speak of a constant of the motion. For example, we have
seen that a reasonable set of boundary conditions to put on the KG field
(motivated, say, by the variational principle) is to assume that the KG field
vanishes at spatial infinity. Let us then consider the region V to be all of
space, that is, V = R3 . If the fields vanish at spatial infinity fast enough,1
the flux will vanish asymptotically and we will have
dQV
= 0. (3.11)
dt
and
∂i j i = −(∇ϕ,t ) · (∇ϕ) − ϕ,t ∇2 ϕ. (3.15)
All together, we get
∂α j α = ϕ,t ϕ,tt − ∇2 ϕ + m2 ϕ
L = T − U, (3.18)
EV = T + U (3.21)
2
Here I have moved away from the total derivative notation back to the physicist’s
notation in which one imagines evaluating all the relevant formulas on a given field ϕ =
ϕ(x).
3.3. CONSERVATION OF MOMENTUM 47
PROBLEM: Verify that the currents (3.23), (3.24) are conserved. (If you
like, you can just fix a value for i, say, i = 1 and check that j1α is conserved.)
have the dimensions of momentum (if one takes account of the various di-
mensionful constants that we have set to unity). The name can also be
understood from the fact that the each of the three charge densities ρ(i)
corresponds to a component of the current densities for the energy conser-
vation law. Roughly speaking, you can think of this quantity as getting the
name “momentum” since it defines the “flow of energy”. In a little while we
will get an even better explanation from Noether’s theorem. Finally, recall
that the total momentum of a system is represented as a vector in R3 . The
components of this vector in the case of a KG field are the P(i) .
with
gαβ = diag(−1, 1, 1, 1). (3.29)
The formula (3.26) or (3.27) given for the energy-momentum tensor is in
fact correct on any spacetime. Note that the energy-momentum tensor is
symmetric:
Tαβ = Tβα . (3.30)
3.5. CONSERVATION OF ANGULAR MOMENTUM 49
If desired, one can view the formula for T as defining a collection of functions
on jet space representing a formula for a tensor field on spacetime. More
0
1
precisely, we can view T as a mapping from J into the 2 tensor fields on
spacetime.
Using inertial coordinates xα = (t, ~x) on flat spacetime you can check
that the conserved energy current has components given by
α
jenergy = −Ttα ≡ −g αβ Ttβ . (3.31)
In particular the energy density is T tt . Likewise, the components of the
conserved momentum currents are given by
α
jmomentum = −Tiα ≡ −g αβ Tiβ , i = 1, 2, 3, (3.32)
so that, in particular, the momentum density in the direction labeled i is
given by −T ti . The conservation of energy and momentum are encoded in
the important identity:
g βγ ∂γ Tαβ = ϕ,α ( − m2 )ϕ, (3.33)
where I remind you that we have defined
ϕ = g αβ ϕ,αβ . (3.34)
This relation shows that when evaluated on a function satisfying the KG
equations the resulting energy-momentum tensor field on spacetime has van-
ishing divergence.
Although we are not emphasizing relativistic considerations in our dis-
cussions, it is perhaps worth mentioning that there is no absolute distinction
between energy and momentum. A change of reference frame will mix up
these quantities. One therefore usually speaks of the “conservation of energy-
momentum”, or the “conservation of four-momentum”, represented by the
currents j(α) , α = 0, 1, 2, 3 with components given by
β
j(α) = −Tαβ = −g βγ Tαγ . (3.35)
Note that
M α(µ)(ν) = −M α(ν)(µ) , (3.37)
which is why there are only 6 independent currents.
PROBLEM: Show that the six currents (3.36) are conserved. (Hint: Don’t
panic! This is actually the easiest one to prove so far, since you can use
ϕλ=0 = ϕ. (3.42)
You are familiar with such “curves in field space” from our discussion of
the variational calculus. We also assume that the transformation defines a
unique curve through each point in the space of fields. We view this as a
transformation of any field ϕ, that varies continuously with the parameter
λ, and such that λ = 0 is the identity transformation. As a simple example,
we could have a transformation
PROBLEM: Consider the real scalar field with the double-well self-interaction
potential (2.81). Show that ϕ → ϕ̂ = −ϕ is a variational symmetry. Con-
sider the 3 constant solutions to the field equation (which you found in the
problem just after (2.81)) and check that this symmetry maps these solutions
to solutions.
which justifies the use of the same notation, I think. Still it is important
to see how these two notions of δϕ are the same and are different. A field
variation in a variational principle involves studying curves in field space
passing through a specific point (a critical point) so that, for each curve, δϕ
is a single function on spacetime. An infinitesimal transformation δϕ will be
a spacetime function which will depend upon the field ϕ being transformed,
and it is this dependence which is the principal object of study. From a more
geometric point of view, field variations in the calculus of variations repre-
sent tangent vectors at a single point in the space of fields. An infinitesimal
transformation is a vector field on the space of fields – a continuous assign-
ment of a vector to each point in field space. Just as one can restrict a vector
field to a given point and get a vector there, one can restrict an infinitesimal
transformation to a particular field and get a particular field variation there.
An example is in order. For the scaling transformation
ϕλ = eλ ϕ, (3.50)
we get
δϕ = ϕ, (3.51)
which shows quite clearly that δϕ is built from ϕ so that, while it is a function
on spacetime, this function varies from point to point in the space of fields.
Likewise for time translations:
δϕ = ϕ,t . (3.53)
3
Contrast this with idea of a critical point. The infinitesimal variational symmetry is
a specific family of field variations that does not change the Lagrangian for any choice of
the field. The critical point is a particular field such that the action does not change for
any field variation.
3.8. DIVERGENCE SYMMETRIES 55
Time translation,
δϕ = ϕ,t , (3.61)
defines a divergence symmetry of the KG Lagrangian. Let’s see how this
works. We begin by writing the Lagrangian as
1 αβ
g ϕ,α ϕ,β + m2 ϕ2 ,
L=− (3.62)
2
where g αβ = diag(−1, 1, 1, 1). We then have4
= ∂t L
= ∂α (δtα L) , (3.63)
δL = E(L)δϕ + Dα V α , (3.67)
4
Here the derivatives are equivalent to total derivatives.
3.9. A FIRST LOOK AT NOETHER’S THEOREM 57
where
∂L ∂L
E(L) = − Dα , (3.68)
∂ϕ ∂ϕ,α
and5
∂L
Vα = δϕ. (3.69)
∂ϕ,α
This identity holds for any field variation. By hypothesis, our field varia-
tion δϕ is some expression built from x, ϕ, and its derivatives that has the
property that δL = 0. Consequently, for the infinitesimal symmetry trans-
formation δϕ we have the relation
0 = E(L)δϕ + Dα V α ⇐⇒ Dα V α = −E(L)δϕ. (3.70)
This is exactly the type of identity that defines a conserved current V α since
it says that the divergence of V α will vanish if V α is built from a KG field ϕ
that satisfies the EL-equation (the KG equation). Note that the specific form
of V α as a function of ϕ (and its derivatives) depends upon the specific form
∂L
of the Lagrangian via ∂ϕ ,α
and on the specific form of the transformation via
δϕ.
More generally, suppose that the infinitesimal transformation δϕ defines
a divergence symmetry, that is, there exists a vector field W α built from ϕ
such that
δL = Dα W α . (3.71)
We still get a conservation law since our variational identity now becomes
Dα W α = E(L)δϕ + Dα V α , (3.72)
which implies
Dα (V α − W α ) = −E(L)δϕ, (3.73)
α α
so that the conserved current is now V − W .
To summarize, if δϕ(x, ϕ, δϕ, . . . ) is a divergence symmetry of L(x, ϕ, ∂ϕ),
δL = Dα W α , (3.74)
then there is a conserved current given by
∂L
jα = δϕ − W α . (3.75)
∂ϕ,α
This is a version of “Noether’s first theorem”.
5
There is an ambiguity in the definition of V α here which we shall ignore for now to keep
things simple. We will confront it when we study conservation laws in electromagnetism.
58 CHAPTER 3. SYMMETRIES AND CONSERVATION LAWS
With
∂L
δϕ = ϕ,t , W α = δtα L, = −g αβ ϕ,β , (3.77)
∂ϕ,α
we can apply the results of the previous section to obtain a conserved current:
which is our expression of the conserved energy current in terms of the energy-
momentum tensor.
It is worth pointing out that the existence of the time translation sym-
metry, and hence conservation of energy, is solely due to the fact that the
KG Lagrangian has no explicit t dependence; no other structural features of
the Lagrangian play a role. To see this, consider any Lagrangian whatsoever
satisfying
∂
L(x, ϕ, ∂ϕ, . . .) = 0. (3.80)
∂t
Here I should emphasize that this partial derivative is really a partial deriva-
tive – it only applies to the explicit coordinate dependence of the Lagrangian.
So, for example,
∂t (tϕ) = ϕ. (3.81)
From the identity
∂L ∂L ∂L
Dt L = + ϕ,t + ϕ,tα (3.82)
∂t ∂ϕ ∂ϕ,α
3.11. SPACE TRANSLATION SYMMETRY AND CONSERVATION OF MOMENTUM59
we have
∂L ∂L
ϕ,t + ϕ,tα = Dt L, (3.83)
∂ϕ ∂ϕ,α
which can be interpreted as saying the time translation δϕ = ϕ,t yields a
divergence symmetry (3.76), leading to conservation of energy. One says
that the conserved current for energy is the Noether current associated to
time translational symmetry.
Infinitesimally, we have
δϕ = n̂ · ∇ϕ = ni ϕ,i . (3.85)
where
W α = (0, ni L), (3.87)
and D is the total derivative.
As before, it is not hard to see that this result is a sole consequence of
the fact that the Lagrangian has no dependence on the spatial coordinates.
In particular,
∂
ni i L(x, ϕ, ∂ϕ, . . .) = 0, (3.88)
∂x
60 CHAPTER 3. SYMMETRIES AND CONSERVATION LAWS
so that
∂L ∂L
ni ϕ,i + ni ϕ,ij = ni Di L (3.89)
∂ϕ ∂ϕ,j
Thus we have the conservation law
j α = (ρ, j i ), (3.90)
with
ρ = ϕ,t n̂ · ∇ϕ, (3.91)
and
1
j i = −ϕ,i n̂ · ∇ϕ + ni (∇ϕ)2 − ϕ2,t + m2 ϕ2 .
(3.92)
2
Since the direction n̂ is arbitrary, it is easy to see that we really have three in-
dependent conservation laws corresponding to 3 linearly independent choices
for n̂. These three conservation laws correspond to the conservation laws for
momentum that we had before. The relation between ρ and j i here and ρ(i)
and ~j(i) there is given by
You can see that the translational symmetry in the spatial direction defined
by n̂ leads to a conservation law for the component of momentum along
n̂. Thus the three conserved momentum currents are the Noether currents
associated with spatial translation symmetry.
δL = Dα (aα L) (3.95)
3.13. ANGULAR MOMENTUM REVISITED 61
xα −→ Sβα xβ , (3.100)
We have then
Sγα Sδβ gαβ = gγδ . (3.102)
Consider a 1-parameter family of such transformations, S(λ), so that
α
α α
∂Sβ
Sβ (0) = δβ , =: ωβα (3.103)
∂λ λ=0
Using these transformations in (3.102) and differentiating with respect to λ
we obtain
ωγα gαδ + ωδβ gγβ = 0. (3.104)
This is the infinitesimal version of (3.102). Defining
Differentiate both sides with respect to λ and set λ = 0 to find the infinites-
imal transformation:
δϕ = (ωβα xβ )ϕ,α , (3.108)
with an antisymmetric ωαβ as above. It is now a short computation to check
that, for the KG Lagrangian,
δL = Dα ωβα xβ L .
(3.109)
(Here I am still using the jet space notation with the total derivative D.)
The resulting Noether current is given by
where M α(γ)(δ) are the conserved currents associated with relativistic angular
momentum.
where
Sβα Sδγ ηαγ = ηβδ . (3.113)
These symmetries are, naturally enough, called spacetime symmetries since
they involve transformations in spacetime. These symmetry transformations
have a nice geometric interpretation which goes as follows.
Given a spacetime (M, g) we can consider the group of diffeomorphisms,
which are smooth mappings of M to itself with smooth inverses. Given a
diffeomorphism
f : M → M, (3.114)
there is associated to the metric g a new metric f ∗ g via the pull-back. In
coordinates xα on M the diffeomorphism f is given as
xα → f α (x), (3.115)
and the pullback metric has components related to the components of g via
∂f γ ∂f δ
(f ∗ g)αβ (x) = gγδ (f (x)). (3.116)
∂xα ∂xβ
We say that f is an isometry if
f ∗ g = g. (3.117)
64 CHAPTER 3. SYMMETRIES AND CONSERVATION LAWS
LX g = 0. (3.119)
It is not too hard to verify that the spacetime translations and the Lorentz
translations define isometries of the Minkowski metric
ϕ : M → C, (3.122)
so that there are actually two real-valued functions in this theory. The La-
grangian for the charged KG field is
PROBLEM: Show that this Lagrangian is the sum of the Lagrangians for
two (real-valued) KG fields ϕ1 and ϕ2 with m1 = m2 and with the identifi-
cation
1
ϕ = √ (ϕ1 + iϕ2 ). (3.124)
2
66 CHAPTER 3. SYMMETRIES AND CONSERVATION LAWS
From this problem you can surmise that the field equations for the charged
KG field consist of two identical KG equations for the real and imaginary
parts of ϕ. In terms of the complex-valued function ϕ you can check that the
field equations – computed as Euler-Lagrange equations or via the critical
points of the action – are simply
Eϕ (L) = ( − m2 )ϕ∗ = 0, (3.125)
Eϕ∗ (L) = ( − m2 )ϕ = 0. (3.126)
Note that one can do field-theoretic computations such as deriving these
Euler-Lagrange equations either using the real functions ϕ1 and ϕ2 or using
the familiar trick of using “complex coordinates” on the space of fields, that
is, treating ϕ and ϕ∗ as independent variables. In any case, one has doubled
the size of the field space. As we shall see, the new “degrees of freedom”
that have been introduced allow for a notion of conserved electric charge.
Additionally, in the corresponding quantum field theory they also allow for
the introduction of distinct “anti-particles”.
It is easy to see that the Lagrangian (3.123) for the complex KG field
admits the continuous symmetry
ϕλ = eiλ ϕ, ϕ∗λ = e−iλ ϕ∗ . (3.127)
This continuous variational symmetry is given various names. Sometimes it
is called a “phase transformation” for obvious reasons. Because the set of
unitary linear transformations of the vector space of complex numbers, de-
noted U (1), is precisely the multiplicative group of phases eiλ , sometimes the
symmetry transformation (3.127) is called a “rigid U (1) transformation”, or
a “global U (1) transformation”, or just a “U (1) transformation”. For vari-
ous reasons related to Noether’s second theorem (as we shall see), sometimes
this transformation is called a “gauge transformation of the first kind”. You
will also find various mixtures of these terms in the literature. Whatever the
name, you can see that the transformation is simply a rotation in the vector
space of values of the fields ϕ1 and ϕ2 which were defined in the last problem.
The Lagrangian is rotationally invariant in field space, hence the symmetry.
It is straightforward to compute the conserved current associated with
the U (1) symmetry, using Noether’s (first) theorem. The only novel feature
here is that we have more than one field. I will therefore give the gory details.
The infinitesimal transformation is given by
δϕ = iϕ, δϕ∗ = −iϕ∗ (3.128)
3.16. THE CHARGED KG FIELD AND ITS INTERNAL SYMMETRY67
U † = U −1 , (3.139)
det U = 1, (3.140)
ϕ : M → C2 , (3.144)
so we now have two charged KG fields or, equivalently, four real KG fields.
You can think of ϕ as a 2-component column vector whose entries are complex
6
This way of defining SU (2) in terms of a representation provides the “defining repre-
sentation”.
7
The elements of SU (2) can be parametrized by a unit vector and an angle, just as
are elements of the rotation group SO(3). This is related to the existence of the spinor
representation of the group of rotations.
70 CHAPTER 3. SYMMETRIES AND CONSERVATION LAWS
U (0) = I. (3.145)
We define
ϕλ = U (λ)ϕ. (3.146)
The infinitesimal form of this transformation is
δϕ = iτ ϕ, (3.147)
Note that
δϕ† = −iϕ† τ † = −iϕ† τ. (3.149)
By the way, you can see that τ is traceless and Hermitian by considering
our formula for U (θ, n) above, or by simply noting that U (λ) satisfies
τ = ai σ i , (3.151)
δL = Dα W α , (3.163)
for some W α locally constructed from x, ϕa , ϕa,α , etc. Then the following is
a conserved current:
j α = η α (F ) − W α . (3.164)
Noether’s theorem, as it is conventionally stated – more or less as above,
shows that symmetries of the Lagrangian beget conservation laws. But the
scope of this theorem is actually significantly larger. It is possible to prove a
sort of converse to the result shown above to the effect that to each conserva-
tion laws for a system of Euler-Lagrange equations there is a corresponding
symmetry of the Lagrangian. It is even possible to prove theorems that estab-
lish a one-to-one correspondence between conservation laws and symmetries
of the Lagrangian for a wide class of field theories (including, of course, the
KG field and its variants that have been discussed up until now). There is
even more than this! But it is time to move on. . . .
8
There is an ambiguity in the definition of η α here which we shall ignore for now to keep
things simple. We will confront it when we study conservation laws in electromagnetism.
3.20. “TRIVIAL” CONSERVATION LAWS 73
j α = ( − m2 )∂ α ϕ. (3.165)
It is, of course, easy to check that this current is conserved. It is even easier
to check that this current is completely uninteresting since it vanishes for any
solution of the field equations. The triviality of such conservation laws also
can be seen by constructing the conserved charge in a region by integrating
j 0 over a volume. Of course, when you try to substitute a solution of the
equations of motion into j 0 so as to perform the integral you get zero. Thus
you end up with the trivial statement that zero is conserved.
The second kind of “trivial” conservation law arises as follows. Suppose
we create an antisymmetric, (20 ) tensor field locally from the fields and their
derivatives:
S αβ = −S βα . (3.166)
For example, in KG theory we could use
S αβ = k α ϕ, β − k β ϕ, α , (3.167)
where k α = k α (x) is any vector field on spacetime. Now make a current via
j α = Dβ S αβ . (3.168)
Dα j α = Dα Dβ S αβ = Dβ Dα S αβ = −Dβ Dα S βα = −Dα j α =⇒ Dα j α = 0.
(3.169)
These sorts of conservation laws are “trivial” because they do not really
reflect properties of the field equations but rather simple derivative identities
analogous to the fact that the divergence of the curl is zero, or that the curl
of the gradient is zero. Indeed, the current above is divergence free for any
function ϕ(x) – not necessarily satisfying any field equations.
74 CHAPTER 3. SYMMETRIES AND CONSERVATION LAWS
Here, in order to perform the integral, it is understood that the current has
been evaluated on some field configuration, e.g., ϕ = ϕ(x). Keep this in
mind as we proceed. For a trivial conservation law arising as the divergence
of an antisymmetric tensor we can integrate by parts, i.e., use the divergence
theorem, to express QV as an area integral over the boundary B of V :
Z
QV = d2 A ni S 0i . (3.171)
B
where I used (1) the divergence theorem, and (2) a straightforward applica-
tion of Stokes theorem in conjunction with the fact that ∂S = ∂∂V = ∅ to
get Z
d2 A ni S ij ,j = 0. (3.174)
B
where
1
V i = ijk Sjk . (3.176)
2
We have seen there are two kinds of conservation laws that are in some
sense trivial. We can of course combine these two kinds of triviality. So, for
example, the current
is trivial.
We can summarize our discussion with a formal definition. We say that
a conservation law j α is trivial if there exists a skew-symmetric tensor field
S αβ – locally constructed from the fields and their derivatives – such that
Next, let me mention that a nice way to think about conservation laws –
trivial or non-trivial – is in terms of differential forms. On our four dimen-
sional spacetime the vector field j α can be converted to a 1-form ω = ωα dxα
using the metric:
ωα = gαβ j β . (3.179)
This 1-form can be converted to a 3-form ∗ω using the Hodge dual
(∗ω)αβγ = g µδ αβγδ ωµ . (3.180)
If j is divergence free modulo the field equations, this is equivalent to ∗ω
being closed modulo the field equations:
d(∗ω) = 0, modulo the field equations. (3.181)
Keep in mind that ∗ω is really a 3-form locally constructed from the field
and its derivatives, that is, it is a 3-form-valued function on the jet space
for the theory. The exterior derivative in (3.181) is a total derivative. As
you know, an exact 3-form is of the form dβ for some 2-form β. If there is a
2-form β locally constructed from the fields such that,
∗ ω = dβ modulo the field equations (3.182)
then clearly ∗ω is closed modulo the field equations. This is just the differen-
tial form version of our trivial conservation law. Indeed, the anti-symmetric
tensor field that is the “potential” for the conserved current is given by
1
S αβ = αβγδ βγδ . (3.183)
2
Let me mention and dispose of a common point of confusion concerning
trivial conservation laws. This point of confusion is why I felt compelled to
occasionally stick in the phrase “locally constructed from the field” in the
discussion above. For simplicity, I will use the flat metric and Cartesian
coordinates on the spacetime manifold M = R4 in what follows. To expose
the potential point of confusion, let me remind you of the following standard
result from tensor analysis. Let V α be a vector field on Minkowski space,
expressed in the usual inertial Cartesian coordinates. V α is not to be viewed
as locally constructed from the field, except in the trivial sense that it does
not depend upon the fields at all, only the spacetime point, V α = V α (x). If
V α is divergence free,
∂α V α = 0, (3.184)
3.20. “TRIVIAL” CONSERVATION LAWS 77
V α = ∂β S αβ . (3.185)
This is just the dual statement to the well-known fact that all closed 3-forms
(indeed, all closed forms of degree higher than 0) on R4 are exact i.e., the De
Rham cohomology of R4 is trivial. This result might tempt you to conclude
that all conservation laws are trivial! Unlike the case in real life, you should
not give in to temptation here. There are two reasons. First, a conservation
law should not be viewed as just a single divergence-free vector field on the
spacetime manifold M . A conservation law is a formula which assigns a
divergence-free vector field to each solution to the field equations. Each field
configuration will, in principle, define a different conserved current. Second,
as we have been saying, each of the (infinite number of) divergence-free vector
fields is locally constructed from the fields, i.e., are functions on jet space
(rather than just x space). Put differently, the conserved current at a point
x depend upon the values of the fields and their derivatives at the point x.
The correct notion of triviality is that a conserved current j α is trivial if for
each field configuration it is (modulo the field equations) always a divergence
of a skew tensor field S αβ that is itself locally constructed from the fields. If
we take a conservation law and evaluate it on a particular solution to the
field equations, then we end up with a divergence-free vector field on M (or
a closed 3-form on M , if you prefer). If M = R4 we can certainly write this
vector field as the divergence of an antisymmetric tensor on M (or as the
exterior derivative of a 2-form on M ). But the point is that for non-trivial
conservation laws there is no way to construct all the antisymmetric tensors
(2-forms) for all possible field configurations using a local formula in terms of
the fields and their derivatives. So, while conservation laws are in many ways
like de Rham cohomology (closed modulo exact forms on M ), they actually
represent a rather different kind of cohomology.9 . Sometimes this kind of
cohomology is called “local cohomology” or “Euler-Lagrange cohomology”.
Finally, as I have mentioned without explanation here and there via foot-
notes, there is some ambiguity in the definition of the vector field which
appears in the divergence term in the variation of the Lagrangian density
(see e.g., V in (3.67)). In light of our definition of trivial conservation laws,
I think you can easily see what that ambiguity is. Namely, the variational
identity only determines this vector field up to addition of the divergence of
9
One says the conservation laws are “horizontally” closed forms on the jet space.
78 CHAPTER 3. SYMMETRIES AND CONSERVATION LAWS
a skew tensor (locally constructed from the fields and field variations). You
can easily check that this ambiguity affects Noether’s theorem in that the
addition to this vector field of the divergence of a skew tensor changes the
conserved Noether current by the addition of a trivial conservation law.
3.21 PROBLEMS
1. Derive (3.10) from the continuity equation.
2. Verify that the currents (3.23), (3.24) are conserved. (If you like, you can
just fix a value for i, say, i = 1 and check that j1α is conserved.)
3. Show that the six currents (3.36) are conserved. (Hint: Don’t panic! This
is actually the easiest one to prove so far, since you can use
ϕλ = ϕ + λ
10. Show that the Lagrangian (3.123) is the sum of the Lagrangians for two
(real-valued) KG fields ϕ1 and ϕ2 with m1 = m2 and with the identification
1
ϕ = √ (ϕ1 + iϕ2 ).
2
Dα j α = 0,
j α = ig αβ (ϕ†,β τ ϕ − ϕ† τ ϕ,β ).
14. Let S be a two dimensional surface in Euclidean space with unit normal
~n and boundary curve C with tangent d~l. Show that
Z Z
2 ij
d S ni S ,j = V~ · d~l, (3.186)
S C
where
1
V i = ijk Sjk . (3.187)
2
80 CHAPTER 3. SYMMETRIES AND CONSERVATION LAWS
Chapter 4
81
82 CHAPTER 4. THE HAMILTONIAN FORMULATION
The manifold (Γ, Ω) is called the phase space or state space for the mechanical
system and the 2-form is called the symplectic form. The dimensionality of Γ
is 2n, where the integer n is the number of degrees of freedom of the system.
The second ingredient in a Hamiltonian system is a function, H : Γ → R,
called the Hamiltonian. The Hamiltonian defines how states of the system
evolve in time. Assigning coordinates z i to Γ, the dynamical evolution of the
mechanical system is a curve z i = z i (t) in the phase space defined by the
ordinary differential equations:
i ij ∂H
ż (t) = Ω , (4.3)
j
∂z
z=z(t)
where Ωij are the components of the inverse symplectic form. The ODEs
(4.3) are known as Hamilton’s equations of motion. More generally, given any
function G : Γ → R there is an associated foliation of Γ by curves z i = z i (s)
defined by
i ij ∂G
ż (s) = Ω . (4.4)
j
∂z
z=z(s)
You might not have been exposed to this geometric form of Hamilton’s
equations. A more familiar textbook presentation makes use of the Darboux
theorem. This theorem asserts that there exist coordinates z i = (q a , pb ),
a, b = 1, 2, . . . , n, such that any closed, non-degenerate 2-form such as Ω can
be put into a standard form:
The coordinates (q, p) are called canonical coordinates (or “canonical co-
ordinates and momenta”). The existence of such coordinates is a direct
consequence of the non-degeneracy of Ω and the condition (4.2). There are
infinitely many canonical coordinate systems. As a nice exercise you should
determine the components Ωij of the symplectic form in canonical coordi-
nates. In canonical coordinates the Hamilton equations take the familiar
textbook form
∂H ∂H
q̇ a = , ṗa = − a . (4.6)
∂pa ∂q
d
δL = Ea δq a + θ, (4.7)
dt
where Ea is the Euler-Lagrange formula,
∂L d ∂L
Ea = a
− , (4.8)
∂q dt ∂ q̇ a
and θ represents the boundary term which arises via integration by parts.
The quantity θ depends upon q a , q̇ a and linearly upon δq a :
∂L a
θ= δq . (4.9)
∂ q̇ a
∂ 2L
det 6= 0, (4.10)
∂ q̇ a ∂ q̇ b
the EL equations are a quasi-linear system of second order ODEs and we can
expect that the solution space is a 2n-dimensional manifold parametrized by,
e.g., the initial conditions. The points of Γ are thus labeled by solutions
q a (t). Tangent vectors to Γ at a point labeled by the solutions q a (t) are
then variations δq a (t) which are solutions to the linear system of equations
obtained by linearizing the EL equations about the solution q a (t).
PROBLEM: Given L = L(q, q̇, t), give a formula for the EL equations
linearized about a solution q a = q a (t).
84 CHAPTER 4. THE HAMILTONIAN FORMULATION
Ω = dΘ. (4.12)
In formulas, this prescription says that the value of the symplectic 2-form on
a pair of tangent vectors δ1 q a (t) and δ2 q a (t) at the point of Γ specified by
q a (t) is given by
∂ 2L b a a b ∂ 2L
Ω(δ1 q, δ2 q) = (δ1 q δ 2 q − δ 1 q δ2 q ) + (δ1 q̇ b δ2 q a − δ1 q a δ2 q̇ b ).
∂ q̇ a ∂q b ∂ q̇ a ∂ q̇ b
(4.13)
This formula appears to depend upon the time at which the solutions and
the linearized solutions are evaluated. But it can be shown that Ω does not
depend upon t by virtue of the EL equations satisfied by q a (t) and their
linearization satisfied by δq a (t).
PROBLEM: Show that the symplectic form given in (4.13) does not depend
upon t. (Hint: Differentiate (4.13) with respect to t and then use the EL
equations and their linearization.)
∂L
pa = , (4.15)
∂ q̇ a
then
Θ = pa dq a , Ω = dpa ∧ dq a . (4.16)
So, if we make a change of variables (q, q̇) −→ (q, p) on Γ via (4.15) then
(q a , pa ) are canonical coordinates and momenta for Γ. The Hamiltonian
which generates time evolution is then given by the canonical energy formula:
H = pa q̇ a − L, (4.17)
1
The non-degeneracy condition (4.10) ensures the local existence of an inverse.
86 CHAPTER 4. THE HAMILTONIAN FORMULATION
To get this result I used the divergence theorem and the requirement that
the solutions and their linearization vanish asymptotically.
Up to this point the phase space has been defined implicitly inasmuch as
we have not given an explicit parametrization of the set of solutions to the
field equations. There are various ways to parametrize the space of solutions
depending upon how much analytic control you have over this space. Let
us use the most traditional parametrization, which relies upon the existence
of a well-posed initial value problem: for every pair of functions on R3 ,
88 CHAPTER 4. THE HAMILTONIAN FORMULATION
(φ(~x), π(~x)) there is uniquely determined a solution ϕ(x) of the field equations
such that at a given initial time t = t0 :
Hopefully you recognize the pattern familiar from particle mechanics where
Θ = pi dq i and Ω = dpi ∧dq i . Indeed, if one views the field as just a mechanical
system with an infinite number of degrees of freedom labeled by the spatial
point ~x, then one can view the integrations over ~x as the generalizations of the
various sums over degrees of freedom which occur in particle mechanics. For
this reason people often call φ(~x) the “coordinate” and π(~x) the “momentum”
for the scalar field. Although we shall not worry too much about precisely
what function spaces φ and π live in, it is useful to note that both of these
functions must vanish at spatial infinity if the symplectic structure is to be
defined.
The formula (4.33) makes it easy to check that the symplectic form is
non-degenerate as it should be. Can you construct a proof?
Integrating by parts in the term with the gradients, and using the fact that
φ vanishes at infinity to eliminate the boundary terms, we get
Z
d3 x π δπ + −∇2 φ + m2 φ + V 0 δφ .
δH = (4.36)
R3
This means
δH
= π(~x), (4.37)
δπ(~x)
and
δH
= −∇2 φ(~x) + m2 φ(~x) + V 0 (φ(~x)). (4.38)
δφ(~x)
Hamilton’s equations define curves in Γ, that is, one parameter families
(φ(~x, t), π(~x, t)) according to
∂φ δH
=
∂t δπ
∂π δH
=− (4.39)
∂t δφ,
which yields
∂φ(~x, t)
= π(~x, t) (4.40)
∂t
∂π(~x, t)
= ∇2 φ(~x, t) − m2 φ(~x, t) − V 0 (~x, t). (4.41)
∂t
90 CHAPTER 4. THE HAMILTONIAN FORMULATION
You can easily verify as an exercise that these equations are equivalent to the
original field equation (4.23) once we make the correspondence ϕ(x) = φ(~x, t).
Finally, using standard techniques of classical mechanics it is possible to
deduce the Hamiltonian from the Lagrangian. Recall that the KG Lagrangian
is (in a particular inertial reference frame (t, ~x))
Z Z
3 3 1 2 2 2 2
L= d xL = dx ϕ − (∇ϕ) − m ϕ − V (ϕ) . (4.42)
R3 R3 2 ,t
At any fixed time t, view φ ≡ ϕ(t, ~x) and φ̇ ≡ ϕ,t (t, ~x) as independent fields
on R3 . Define the canonical momentum as the functional derivative of the
Lagrangian with respect to the velocity:
δL
π= = φ̇. (4.43)
δ φ̇
The Lagrangian now assumes the canonical form:
Z Z
3 3 1 2 1 1 2 2
L= d x π φ̇ − dx π + ∇φ · ∇φ + m φ + V (φ) , (4.44)
R3 R3 2 2 2
so that the Hamiltonian is given by the familiar Legendre transformation:
Z Z
3 3 1 2 1 1 2 2
H= d x π φ̇ − L = dx π + ∇φ · ∇φ + m φ + V (φ) .
R3 R3 2 2 2
(4.45)
∂ ∂
Ω = dpa ∧ dq a , Ω−1 = ∧ , (4.47)
∂q a ∂pa
we have
∂A ∂B ∂A ∂B
[A, B] = − . (4.48)
∂q ∂pa ∂pa ∂q a
a
[q a , pb ] = δba . (4.49)
it is bilinear,
[A, cB] = c[A, B], [A, B + C] = [A, B] + [A, C], c = const.; (4.51)
This endows the vector space of functions on phase space with the structure
of a Lie algebra called the Poisson algebra of functions. Finally, the Hamilton
equations can be written
∂φ(~x, t) ∂π(~x, t)
= [φ(~x), H], = [π(~x), H]. (4.56)
∂t ∂t
PROBLEM: Verify (4.55) follows from (4.54) and that (4.56) agrees with
(4.40), (4.41).
δH = [H, G] = 0. (4.60)
The field momentum is a constant of the motion for solutions of the scalar
field equation (4.23). We showed this earlier in the special case of the Klein-
Gordon field (V (ϕ) = 0), but it is not hard to see that this result generalizes
to the case of equation (4.23), as it must because the Lagrangian density still
has the spatial translation symmetry.
In terms of our parametrization of the phase space using initial data, the
field momentum along ~v is given by
Z
P (~v ) = d3 x πφ,i v i , (4.63)
R3
Let us compute the Poisson bracket of the field momentum with the Hamilto-
nian. There are a few ways to organize this computation, but let us emphasize
the role of P as an infinitesimal generator of translations. Our strategy is
to use (4.35) in conjunction with the infinitesimal change in (φ, π) under the
canonical transformation generated by P (~v ). We have (try it!)
and
δπ(~x) ≡ [π(~x), P (~v )] = v i π,i ≡ ~v · ∇π. (4.65)
94 CHAPTER 4. THE HAMILTONIAN FORMULATION
4.5 PROBLEMS
2. Given L = L(q, q̇, t), give a formula for the EL equations linearized about
a solution q a = q a (t).
3. Show that the symplectic form given in (4.13) does not depend upon t.
(Hint: Differentiate (4.13) with respect to t and then use the EL equations
and their linearization.)
4.5. PROBLEMS 95
8. Verify (4.55) follows from (4.54) and that (4.56) agrees with (4.40), (4.41).
9. Show that P (~v ) in (4.62) is a constant of the motion for solutions of the
scalar field equation (4.23).
~ = 0,
∇·B (5.2)
~
~ − ∂ E = 4π~j,
∇×B (5.3)
∂t
~
∂B
~+
∇×E = 0. (5.4)
∂t
97
98 CHAPTER 5. ELECTROMAGNETIC FIELD THEORY
PROBLEM: Show that the scalar and vector potentials, when assembled
into the 4-potential
Aµ = (−φ, Ai ), i = 1, 2, 3, (5.12)
are related to the electromagnetic tensor Fµν by
Fµν = ∂µ Aν − ∂ν Aµ . (5.13)
Show that this formula for Fµν solves the homogeneous Maxwell equations
Fαβ,γ + Fβγ,α + Fγα,β = 0.
5.2. ELECTROMAGNETIC LAGRANGIAN 99
∂xβ
A0α (x0 ) = α0
Aβ (x(x0 )). (5.15)
∂x
In any case, A is called the “Maxwell field”, the “electromagnetic field”,
the “electromagnetic potential”, the “gauge field”, the “4-vector potential”,
the “U(1) connection”, and some other names as well, along with various
mixtures of these.
As always, having specified the geometric nature of the field, the field
theory is defined by giving a Lagrangian. To define the Lagrangian we in-
troduce the field strength tensor F , also known as the “Faraday tensor”, or
as the “curvature” of the gauge field A. We write
The field strength is in fact a two-form (an anti-symmetric (02 ) tensor field):
F = dA. (5.19)
We have
1
δL = − F αβ δFαβ
2
1
= − F αβ (δAβ,α − δAα,β )
2
= −F αβ δAβ,α
= F αβ ,α δAβ + Dα −F αβ δAβ .
(5.25)
E β (L) = F αβ ,α , (5.26)
F αβ ,α = 0. (5.27)
There are some equivalent expressions of the field equations that are worth
knowing about. First of all, we have that
We write this using the wave operator (which acts component-wise on the
1-form A) and the operator
via
Aβ − (div A),β = 0. (5.31)
You can see that this is a modified wave equation.
A more sophisticated expression of the field equations, which is manifestly
valid on any spacetime, uses the technology of differential forms. Recall that
on a spacetime one has the Hodge dual, which identifies the space of p-forms
with the space of n − p forms. This mapping is denoted by
α → ∗α. (5.32)
102 CHAPTER 5. ELECTROMAGNETIC FIELD THEORY
A0α = Aα + ∂α Λ. (5.46)
Show that
A0α,β − A0β,α = Aα,β − Aβ,α . (5.47)
Show that, in terms of the scalar and vector potentials, this gauge transfor-
mation is equivalent to
φ → φ0 = φ − ∂t Λ, ~→A
A ~0 = A
~ + ∇Λ. (5.48)
A → A0 (5.49)
we have
F → F. (5.50)
Evidently, the Lagrangian – which contains A only through F – is invari-
ant under this transformation of A. We say that the Lagrangian is gauge
invariant.
The gauge transformations constitute a very large set of variational sym-
metries. Up to boundary conditions, one can use any function Λ to define
a new set of potentials. Mathematically, the gauge transformations form an
infinite-dimensional Abelian group.
Insofar as classical electrodynamics can be formulated in terms of the field
strength tensor, the gauge transformation symmetry has no physical content
in the sense that one always identifies physical situations described by gauge-
equivalent Maxwell fields. Thus the Maxwell fields A provide a redundant
description of the physics. On the other hand, while the potential does not
have direct classical physical significance, it does have a physical role to play:
the need to use the potential A can be understood from the desire to have a
local variational principle – which is crucial for quantum theory. Indeed, in
the quantum context the potential plays a more important role.
5.3. GAUGE SYMMETRY 105
The gauge symmetry is responsible for the fact that the Maxwell equa-
tions for the potential
A − d(div A) = −4πj (5.51)
are not hyperbolic. (For comparison, the KG equation is hyperbolic.) Indeed,
hyperbolic equations will have a Cauchy problem with unique solutions for
given initial data. It is clear that, because the function Λ is arbitrary, one
can never have unique solutions to the field equations for A associated to
given Cauchy data. To see this, let A be any solution for prescribed Cauchy
data on a hypersurface t = const. Let A0 be any other solution obtained by
a gauge transformation:
A0 = A + dΛ. (5.52)
It is easy to see that A0 also solves the field equations. This follows from a
number of points of view. For example, the field equations are conditions
on the field strength F , which is invariant under the gauge transformation.
Alternatively, the field equations are invariant under the field equations be-
cause the Lagrangian is. Finally, you can check directly that dΛ solves the
source-free field equations:
[( − d div)dΛ]α = ∂ β ∂β (∂α Λ) − ∂α (∂ β ∂β Λ) = 0. (5.53)
Since Λ is an arbitrary smooth function, we can choose the first two deriva-
tives of Λ to vanish on the initial hypersurface so that A0 and A are distinct
solutions with the same initial data.
To uniquely determine the potential A from Cauchy data and the Maxwell
equations one has to add additional conditions on the potential beyond the
field equations. This is possible since one can adjust the form of A via gauge
transformations. It is not too hard to show that one can gauge transform
any given potential into one which satisfies the Lorenz gauge condition:
∂ α Aα = 0. (5.54)
To see this, take any potential, say, Ã and gauge transform it to a potential
A = Ã + dΛ such that the Lorenz gauge holds; this means
∂ α ∂α Λ = −∂ α Ãα . (5.55)
Viewing the right-hand side of this equation as given, we see that to find
such a gauge transformation amounts to solving the wave equation with a
given source, which can always be done.
106 CHAPTER 5. ELECTROMAGNETIC FIELD THEORY
In the Lorenz gauge the Maxwell equations are just the usual, hyper-
bolic wave equation for each inertial-Cartesian component of the 4-vector
potential,
Aα = −4πjα (5.56)
A0 = A + dΛs , (5.57)
Λ0 = 0. (5.58)
Infinitesimally, we have2
δA = dσ, (5.59)
where
∂Λs
σ= . (5.60)
∂s s=0
It is easy to see that the function σ can be chosen arbitrarily just as we had for
field variations in the usual calculus of variations analysis. The Lagrangian
is invariant under the gauge transformation; therefore it is invariant under
2
Notice that the infinitesimal gauge transformation has the same form as a finite gauge
transformation. This is due to the fact that the gauge transformation is an affine (as
opposed to non-linear) transformation.
5.4. NOETHER’S SECOND THEOREM IN ELECTROMAGNETIC THEORY107
its infinitesimal version. Let us check this explicitly. For any variation we
have
1
δL = − F αβ δFαβ , (5.61)
2
and under a variation defined by an infinitesimal gauge transformation
so that δL = 0.
Now, for any variation the first variational identity is
δL = E β δAβ + Dα −F αβ δAβ ,
(5.63)
where
E β = F αβ ,α . (5.64)
For a variation induced by an infinitesimal gauge transformation we therefore
must get
0 = E β ∂β σ + Dα −F αβ ∂β σ ,
(5.65)
which is valid for any function σ. Now we take account of the fact that the
function σ is arbitrary. We rearrange the derivatives of σ to get them inside
a divergence:
0 = −Dβ E β σ + Dα −F αβ ∂β σ + F αβ ,β σ
(5.66)
This must hold for any function σ; we can use the fundamental theorem of
variational calculus to conclude that the Euler-Lagrange equations satisfy
the differential identity
Dβ E β = 0, (5.68)
which you proved directly in a previous homework problem. Note that this
says the Euler-Lagrange expression is divergence-free, and that this holds
108 CHAPTER 5. ELECTROMAGNETIC FIELD THEORY
whether or not the field equations are satisfied – it is an identity arising due
to the gauge symmetry of the Lagrangian.
Compare our results above to Noether’s first theorem. We have seen that
the gauge symmetry – being a continuous variational symmetry – leads to a
divergence-free vector field, as it must by Noether’s first theorem. But we
now have a new ingredient: the gauge symmetry is built from an arbitrary
function of all the independent variables xα so that the gauge transformation
can be localized to an arbitrary location in spacetime. This leads to the vector
field being divergence-free identically, independent of the field equations.
Indeed, the divergence relation is an identity satisfied by the field equations.
All this is an example of Noether’s second theorem, and the resulting identity
is sometimes called the “Noether identity” associated to the gauge symmetry.
1√
Lj = − −gF αβ Fαβ + 4πj α Aα (5.69)
4
Show that this Lagrangian is gauge invariant (up to a divergence) if and only
if the spacetime vector field j α is chosen to be divergence-free. What is the
Noether identity in this case?
δL = Ea (L)δϕa + Dα η α , (5.70)
Now imagine integrating by parts each term in the linear operator [D(Λ)]a so
that all derivatives of Λ are removed. The boundary terms that arise vanish
with our boundary conditions on ΛA . This process defines the formal adjoint
D∗ of the linear differential operator D:
Z Z
a
Ea [D(Λ)] = ΛA [D∗ (E)]A . (5.76)
R R
You can easily check out this argument via our Maxwell example. The
gauge transformation is defined by the exterior derivative on functions:
[D(Λ)]α = ∂α Λ. (5.79)
W α = 0. (5.81)
Dα E α = 0 (5.84)
where
−1 0 0 0
0 1 0 0
ηαβ =
0
. (5.87)
0 1 0
0 0 0 1
To every electromagnetic potential Aα we have a 10 parameter family of po-
tentials obtained by letting the Poincaré group act via the pull-back operation
on 1-forms:
A0α (x) = Mαβ Aβ (M · x + a). (5.88)
You can interpret the transformed potential (modulo gauge transformations)
as describing the electromagnetic field in the transformed reference frame.
Because the Lagrangian is the same in all reference frames, these trans-
formations define a 10 parameter family of (divergence) symmetries of the
Lagrangian and corresponding conservation laws. The Lorentz symmetry is
responsible for conservation of relativistic angular momentum, which we shall
address a little later. The energy-momentum tensor arises via translational
symmetry. Let us begin by focusing on the spacetime translation symmetry.
Consider a 1-parameter family of translations, say,
aα = λbα . (5.89)
We have then4
δAα = bβ Aα,β . (5.90)
3
I assume that we are considering the source-free Maxwell theory; sources will, in
general, destroy translational symmetry.
4
It is worth noting that this formula is not gauge-invariant; it really only defines the
change in the fields due to a translation modulo a gauge transformation. We will address
this issue soon.
112 CHAPTER 5. ELECTROMAGNETIC FIELD THEORY
δL = F αβ ,α δAβ + Dα −F αβ δAβ ,
(5.93)
Dα j α = 0, (5.95)
Since this conservation law exists for each constant vector bα , we can sum-
marize these conservation laws using the canonical energy-momentum tensor
α αβ 1 α µν
Tγ = F Aβ,γ − δγ F Fµν , (5.96)
4
which satisfies
Dα Tβα = 0, (5.97)
modulo the field equations.
There is one glaring defect in the structure of the canonical energy-
momentum tensor: it is not gauge invariant. Indeed, under a gauge trans-
formation
A −→ A + dΛ (5.98)
we have
Tβα −→ Tβα + F αµ ∂µ ∂β Λ. (5.99)
In order to see what to do about this, we need to use some of the flexibility
we have in defining conserved currents. This is our next task.
5.7. IMPROVED MAXWELL ENERGY-MOMENTUM TENSOR 113
δL = E(L) + Dα η α , (5.110)
along with
η α = −F αβ δAβ , and δAβ = bγ Aβ,γ =⇒ η α = −bγ F αβ Aβ,γ . (5.112)
The lack of gauge invariance snuck into the calculation via the formula for
the change in A under an infinitesimal translation, δAβ = bγ Aβ,γ . We de-
stroyed gauge invariance with this formula since its right hand side is not
gauge invariant. A gauge invariant formula for the change of A under an
infinitesimal translation can be gotten by accompanying the translation with
a gauge transformation:
δAβ = bγ Aβ,γ − Dβ (bγ Aγ ) = bγ Fγβ . (5.113)
With this improved symmetry transformation we get the improved energy-
momentum tensor from Noether’s theorem. Notice also that this additional
gauge transformation is a symmetry and corresponds to the additional trivial
conservation law needed to improve the energy-momentum tensor in the first
place.
where the integral takes place at some chosen value for x0 ≡ t and we used
the spatial divergence theorem along with boundary conditions at spatial
infinity to get the second equality. The 1-form Θ which is normally used to
construct the symplectic 2-form is then defined by
Z Z
3 β0
Θ(δA) = d x F δAβ = d3 x F i0 δAi , (5.118)
R3 R3
=− d3 x Λ ∂i (∂t δ1 Ai − ∂ i δ1 At ), (5.121)
R3
5.8. THE HAMILTONIAN FORMULATION OF ELECTROMAGNETISM.117
where integration by parts and the divergence theorem were used to get the
last equality. The boundary term “at infinity” vanishes since Λ has compact
support. Next, recall that the field variations δA represent tangent vectors to
the space of solutions of the Maxwell equations (5.115) and so are solutions
to the linearized equations. Because the Maxwell equations are linear, their
linearization is mathematically the same:
∂ β (∂α δAβ − ∂β δAα ) = 0. (5.122)
Setting α = 0 we get
∂ i (∂t δAi − ∂i δAt ) = 0. (5.123)
This means that (5.121) vanishes and we conclude that the “pure gauge”
tangent vectors (5.120) are degeneracy directions for the 2-form Ω.
Degenerate 2-forms are often called “pre-symplectic” because there is
a canonical procedure for extracting a unique symplectic form on a smaller
space from a pre-symplectic form. We will not develop this elegant geometric
result here. Instead, we will proceed in a useful if more roundabout route
by examining the Hamiltonian formulation of the theory that arises when we
parametrize the space of solutions to the field equations (5.115) with initial
data.
Viewing the spacetime fields as Qi (t, ~x), P i (t, ~x), φ(t, ~x), the Lagrangian can
be viewed as a functional of (Qi , P j , φ) and can be written in the Hamiltonian
“pq̇ − H” form (exercise):
Z
3 i 1 i 1 ij i
L[Q, P, φ] = d x P Q̇i − (Pi P + Fij F ) + P ∂i φ
R3 2 2
Z
3 i 1 i 1 ij i
= d x P Q̇i − (Pi P + Fij F ) + φ∂i P (5.138)
R3 2 2
To get the second equality I integrated by parts and used the divergence
theorem on the last term. At each time t, we will assume that (Qi , P i )
120 CHAPTER 5. ELECTROMAGNETIC FIELD THEORY
∂i P i = 0, (5.139)
δL d δL δL d δL δL
i
− = 0, − = 0, = 0, (5.140)
δP dt δ Ṗ i δQi dt δ Q̇i δφ
The first two terms are what you might expect: they define the energy of the
electromagnetic field (once you recognize that 14 Fij F ij = 12 B 2 is the magnetic
energy density). You can see that the first term (the electric energy) is akin to
the kinetic energy of a particle, while the second term (the magnetic energy)
is akin to the potential energy of a particle. The term we want to focus on
is the third term – what’s that doing there? Well, first of all note that this
term does not affect the value of the Hamiltonian provided the canonical
variables satisfy the constraint (5.139). Secondly, let us consider Hamilton’s
equations:
δH
Q̇i = = Pi − ∂i φ, (5.142)
δP i
δH
Ṗ i = − = ∂i F ij . (5.143)
δQi
From (5.142), which is secretly the relation between the electric field and
the potentials, you can see that the last term in the Hamiltonian is precisely
5.9. PROBLEMS 121
show that Z
G=− d3 x Λ(~x) ∂i P i (5.145)
R3
5.9 PROBLEMS
~ B)
1. Maxwell’s equations for the electric and magnetic field (E, ~ associated
to charge density and current density (ρ, ~j) are given by
~ = 4πρ,
∇·E
~ = 0,
∇·B
~
~ − 1 ∂ E = 4π~j,
∇×B
c ∂t
~
∇×E~ + ∂ B = 0.
∂t
122 CHAPTER 5. ELECTROMAGNETIC FIELD THEORY
~
~ = −∇φ − 1 ∂ A ,
E ~ =∇×A
B ~
c ∂t
satisfy (5.2) and (5.4).
j α = (ρ, j i ), i = 1, 2, 3.
where indices are raised and lowered with the usual Minkowski metric.
4. Show that the scalar and vector potentials, when assembled into the 4-
potential
Aµ = (−φ, Ai ), i = 1, 2, 3, (5.147)
are related to the electromagnetic tensor Fµν by
Fµν = ∂µ Aν − ∂ν Aµ . (5.148)
5.9. PROBLEMS 123
Show that this formula for Fµν solves the homogeneous Maxwell equations
Fαβ,γ + Fβγ,α + Fγα,β = 0.
j α = (ρ, ~j).
Use the results from the preceding problem to show that the Maxwell equa-
tions with sources have no solution unless the vector field representing the
sources is divergence-free:
∂α j α = 0.
Show that this condition is in fact the usual continuity equations representing
conservation of electric charge.
A0α = Aα + ∂α Λ.
124 CHAPTER 5. ELECTROMAGNETIC FIELD THEORY
Show that
A0α,β − A0β,α = Aα,β − Aβ,α .
Show that, in terms of the scalar and vector potentials, this gauge transfor-
mation is equivalent to
φ → φ0 = φ − ∂t Λ, ~→A
A ~0 = A
~ + ∇Λ.
10. Consider the electromagnetic field coupled to sources with the Lagrangian
density
1√
Lj = − −gF αβ Fαβ + 4πj α Aα
4
Show that this Lagrangian is gauge invariant (up to a divergence) if and only
if the spacetime vector field j α is chosen to be divergence-free. What is the
Noether identity in this case?
11. Verify equations (5.90)–(5.95).
12. Show that the 2-form (5.119) does not depend upon the time at which
it is evaluated.
13. Show that if the constraint (5.132) holds at one time and the canonical
variables evolve in time according to (5.130), (5.131) then (5.132) will hold
at any other time. (Hint: Consider the time derivative of (5.132).)
14. Show that the EL equations defined by L = L[Q, P, φ],
δL d δL δL d δL δL
− = 0, − = 0, = 0, (5.149)
δP i dt δ Ṗ i δQi dt δ Q̇i δφ
are equivalent to the Maxwell equations.
15. Using the Poisson brackets
Z
3 δM δN δM δN
[M, N ] = dx − ,
R3 δQi (~x) δP i (~x) δP i (~x) δQi (~x)
show that Z
G=− d3 x Λ(~x) ∂i P i
R3
is the generating function for gauge transformations
δQi = [Qi , G] = ∂i Λ, δP i = [P i , G] = 0.
Chapter 6
Scalar Electrodynamics
Let us have an introductory look at the field theory called scalar electrody-
namics, in which one considers a coupled system of Maxwell and charged KG
fields. There are an infinite number of ways one could try to couple these
fields. There is essentially only one physically interesting way, and this is the
one we shall be exploring. Mathematically, too, this particular coupling has
many interesting features which we shall explore. To understand the motiva-
tion for the postulated form of scalar electrodynamics, it is easiest to proceed
via Lagrangians. For simplicity we will restrict attention to flat spacetime in
inertial Cartesian coordinates, but our treatment is easily generalized to an
arbitrary spacetime in a coordinate-free way.
1
Lj = − F αβ Fαβ + 4πj α Aα . (6.1)
4
Incidentally, given the explicit appearance of Aα , one might worry about
the gauge symmetry of this Lagrangian. But it is easily seen that the gauge
transformation is a divergence symmetry of this Lagrangian. Indeed, under
125
126 CHAPTER 6. SCALAR ELECTRODYNAMICS
j α Aα −→j α (Aα + ∂α Λ)
= j α Aα + Dα (Λj α ), (6.2)
∂α j α (x) = 0. (6.3)
The idea now is that we don’t want to specify the sources in advance,
we want the theory to tell us how they behave. In other words, we want to
include the sources as part of the dynamical variables of our theory. In all
known instances the correct way to do this always follows the same pattern:
the gauge fields affect the “motion” of the sources, and the sources affect
the form of the gauge field. Here we will use the electromagnetic field as the
gauge field and the charged (U (1) symmetric) KG field as the source. The
reasoning for this latter choice goes as follows.
Since this is a course in field theory, we are going to use fields to model
things like electrically charged matter, so we insist upon a model for the
charged sources built from a classical field. So, we need a classical field
theory that admits a conserved current that we can interpret as an electric
4-current. The KG field admitted 10 conserved currents corresponding to
conserved energy, momentum and angular momentum. But we know that
the electromagnetic field is not driven by such quantities, so we need another
kind of current. To find such a current we turn to the charged KG field. In
the absence of any other interactions, this field admits the conserved current
The simplest thing to try is to build a theory in which this is the current
that drives the electromagnetic field. This is the correct idea, but the most
naive attempt to implement this strategy falls short of perfection. To see
this, imagine a Lagrangian of the form
1
Lwrong = − F αβ Fαβ − (∂ α ϕ∗ ∂α ϕ + m2 |ϕ|2 ) − iAα g αβ ϕ∗ ϕ,β − ϕϕ∗,β . (6.5)
4
This Lagrangian was obtained by simply taking (6.1), substituting (6.4) for
the current, and adding the KG Lagrangian for the scalar field. The idea is
that the EL equations for A will give the Maxwell equations with the KG
6.2. MINIMAL COUPLING: THE GAUGE COVARIANT DERIVATIVE127
current as the source. The EL equations for the scalar field will now involve
A, but that is ok since we expect the presence of the electromagnetic field to
affect the sources. But here is one big problem with this Lagrangian: it is
no longer gauge invariant! Recall that the gauge invariance of the Maxwell
Lagrangian with prescribed sources made use of the fact that the current was
divergence-free. But now the current is divergence-free, not identically, but
only when the field equations hold. The key to escaping this difficulty is to
let the KG field participate in the gauge symmetry. This forces us to modify
the Lagrangian as we shall now discuss.
and
∂α ϕ∗ → Dα ϕ∗ := (∂α + iqAα )ϕ∗ , (6.7)
Here q is a parameter reflecting the coupling strength between the charged
field ϕ and the gauge field. It is a coupling constant. In a more correct
quantum field theory description q is the bare electric charge of a particle
excitation of the quantum field ϕ. The effect of an electromagnetic field
described by Aα upon the KG field is then described by the Lagrangian
This Lagrangian yields field equations which involve the wave operator mod-
ified by terms built from the electromagnetic potential. These additional
terms represent the effect of the electromagnetic field on the charged scalar
field.
The Lagrangian (6.8) still admits the U (1) phase symmetry of the charged
KG theory, but because this Lagrangian depends explicitly upon A it will
not be gauge invariant unless we include a corresponding transformation of
ϕ. We therefore extend the gauge transformation to be:
Aα −→ Aα + ∂α Λ, (6.9)
Dα ϕ −→ eiqΛ Dα ϕ, (6.11)
Dα ϕ∗ −→ e−iqΛ Dα ϕ∗ . (6.12)
For this reason Dα is sometimes called the gauge covariant derivative. There
is a nice geometric interpretation of this covariant derivative, which we shall
discuss later. For now, because of this “covariance” property of D we have
that the Lagrangian (6.8) is gauge invariant.
The Lagrangian for scalar electrodynamics is taken to be
1
LSED = − F αβ Fαβ − Dα ϕ∗ Dα ϕ − m2 |ϕ|2 . (6.13)
4
We now discuss some important structural features of this Lagrangian.
If we expand the gauge covariant derivatives we see that
1
LSED = − F αβ Fαβ −∂ α ϕ∗ ∂α ϕ−m2 |ϕ|2 +iqAα ϕ∗ ∂ α ϕ − ϕ∂ α ϕ∗ + iqAα |ϕ|2 .
4
(6.14)
This Lagrangian is the sum of the electromagnetic Lagrangian, the free
charged KG Lagrangian, and a j · A “interaction term”. The vector field
contracted with Aα is almost the conserved current (6.4), except for the last
term involving the square of the gauge field which is needed for invariance
under the gauge transformation (6.10) and for the current to be conserved
when the new form of the field equations are satisfied. The EL equations for
the Maxwell field are of the desired form:
∂β F αβ = −4πJ α , (6.15)
6.3. GLOBAL AND LOCAL SYMMETRIES 129
where the current is defined using the covariant derivative instead of the
ordinary derivative:
iq ∗ α
Jα = − (ϕ D ϕ − ϕDα ϕ∗ ) . (6.16)
4π
As you will verify in the problem below, this current can be derived from
Noether’s first theorem applied to the U (1) phase symmetry of the La-
grangian (6.13). Thus we have solved the gauge invariance problem and ob-
tained a consistent version of the Maxwell equations with conserved sources
using the minimal coupling prescription.
PROBLEM: Verify that (6.16) is the Noether current coming from the U (1)
symmetry of the Lagrangian and that it is indeed conserved when the field
equations for ϕ hold.
One more interesting feature to ponder: the charged current (6.16) serv-
ing as the source for the Maxwell equations is built from the KG field and the
Maxwell field. Physically this means that one cannot say the charge “exists”
only in the KG field. In an interacting system the division between source
fields and fields mediating interactions is somewhat artificial. This is physi-
cally reasonable, if perhaps a little unsettling. Mathematically, this feature
stems from the demand of gauge invariance. Just like the vector potential,
the KG field is no longer uniquely defined - it is subject to a gauge trans-
formation as well! In the presence of interaction, the computation of the
electric charge involves a gauge invariant combination of the KG and electro-
magnetic field. To compute, say, the electric charge contained in a volume V
one should take a solution (A, ϕ) of the coupled Maxwell-KG equations and
substitute it into
Z
1
d3 x iq ϕ∗ D0 ϕ − ϕD0 ϕ∗ .
QV = (6.17)
4π V
This charge is conserved and gauge invariant.
Aµ −→ Aµ + ∂µ α, (6.23)
and
Dα ϕ∗ := (∂α − iqAα )ϕ∗ , (6.25)
which satisfies
Dµ (e−iqα(x) ϕ) = e−iqα(x) Dϕ. (6.26)
Then with the Lagrangian modified via
∂µ ϕ → Dµ ϕ, (6.27)
so that
LKG = −Dα ϕ∗ Dα ϕ − m2 |ϕ|2 , (6.28)
we get the local U (1) symmetry, as shown previously. Thus the minimal
coupling rule that we invented earlier can be seen as a way of turning the
global U (1) symmetry into a local U (1) gauge symmetry. One also obtains
the satisfying mental picture that the electromagnetic interaction of charges
is the principal manifestation of this local phase symmetry in nature. Thus
the electromagnetic interaction is introduced via the principle of local gauge
invariance.
The electromagnetic interaction of charges is described mathematically
by the ∂α → Dα prescription described above. But the story is not complete
132 CHAPTER 6. SCALAR ELECTRODYNAMICS
1
LEM = − F µν Fµν , (6.30)
4
admits the local U (1) symmetry and can be added to the locally invariant
KG Lagrangian to get the final Lagrangian for the theory
In this way we have an interacting theory designed by local U (1) gauge sym-
metry. The parameter q, which appears via the gauge covariant derivative,
is a “coupling constant” and characterizes the strength with which the elec-
tromagnetic field couples to the charged aspect of the KG field. In the limit
in which q → 0 the theory becomes a decoupled juxtaposition of the non-
interacting (or “free”) charged KG field theory and the non-interacting (free)
Maxwell field theory. In principle, the parameter q is determined by suitable
experiments.
Scalar electrodynamics still admits the global U (1) symmetry, with α =
const.
ϕ → e−iqα ϕ, ϕ∗ → eiqα ϕ∗ , (6.32)
Aµ → Aµ , (6.33)
6.3. GLOBAL AND LOCAL SYMMETRIES 133
where
iq ∗ α
Jα = − (ϕ D ϕ − ϕDα ϕ∗ ) . (6.37)
4π
Evidently, J is divergence-free when the scalar field equations of motion hold.
J is the conserved Noether current J corresponding to the electric charge
carried by the scalar field. This is the current that serves as source for the
Maxwell field. The presence of the gauge field renders the Noether current
suitably “gauge invariant”, that is, insensitive to the local U (1) transforma-
tion. It also reflects the fact that the equations of motion for ϕ, which must
be satisfied in order for the current to be conserved, depend upon the Maxwell
field as is appropriate since the electromagnetic field affects the motion of its
charged sources.
By construction, the theory of scalar electrodynamics admits the local
U (1) gauge symmetry. With α(x) being any function, the symmetry is
Aµ → Aµ + ∂µ α(x). (6.39)
There is a corresponding Noether identity (Noether’s second theorem, re-
member?). To compute it we consider an infinitesimal gauge transformation:
where Eϕ and Eϕ∗ are the scalar field EL expressions and E µ is the gauge
field EL expression. If we integrate this relation over a compact region and
134 CHAPTER 6. SCALAR ELECTRODYNAMICS
choose α(x) to vanish at the boundary of this region, then the divergence
term vanishes. We can integrate by parts in the third term to get the Noether
identity
Dµ E µ + iq(ϕEϕ − ϕ∗ Eϕ∗ ) = 0. (6.43)
Notice that this identity does not follow if the gauge transformation includes
a global part. The terms involving the EL expressions for the KG field are
the same as arise in the identity (6.36). Thus the Noether identity (6.43) can
also be written as
Dα Dβ F αβ = 0. (6.44)
Thanks to this Noether identity we can obtain – again! – the conservation
law of electric charge. We have the electromagnetic field equation
E β ≡ Dα F αβ − 4πJ β = 0. (6.45)
just electromagnetic data on the surface bounding that region. Indeed, the
conserved electric charge in a 3-dimensional spacelike region V at some time
x0 = const. is given by
Z Z Z
0 1 i0 1 ~
QV = dV J = dV F,i = dS n̂ · E, (6.49)
V 4π V 4π S
This generalizes the purely vectorial version of the Stokes’ and divergence the-
orems you learned in multi-variable calculus in Euclidean space to manifolds
of any dimension. As I have mentioned before, we can view the electromag-
netic tensor as a 2-form F via
1
F = Fαβ dxα ⊗ dxβ = Fαβ dxα ∧ dxβ . (6.51)
2
Using the Levi-Civita tensor αβγδ , we can construct the Hodge dual ?F ,
defined by
1
? F = (?F )αβ dxα ∧ dxβ , (6.52)
2
where
1
(?F )αβ = αβ γδ Fγδ . (6.53)
2
In terms of F and ?F the Maxwell equations read
dF = 0, d ? F = 4πJ , (6.54)
1
J = αβγδ J δ dxα ∧ dxβ ∧ dxγ . (6.55)
3!
136 CHAPTER 6. SCALAR ELECTRODYNAMICS
The relation between these 2 quantities can be obtained using Stokes theo-
rem: Z Z Z Z
0
χ −χ= ω− ω= ω= dω, (6.61)
S0 S S 0 −S V
0
where ∂V = S − S. In particular, if ω is a closed p-form, that is, dω = 0,
then χ = χ0 and the integral χ is independent of the choice of the space S in
the sense that χ is unchanged by any continuous deformation of S.
1
To say that S is “closed” means that it has no boundary, ∂S = ∅, e.g., a 2-sphere.
6.5. SCALAR ELECTRODYNAMICS AND FIBER BUNDLES 137
if you don’t mind, I will just give a quick and dirty summary of the salient
points. Those who are not ready for this material can just skip it. A more
complete – indeed, a more correct – treatment can be found in many advanced
texts. Of course, the problem is that to use these advanced texts takes
a considerable investment in acquiring prerequisites. The idea of our brief
discussion is to provide a first introductory step in that direction. A technical
point for those who have some background in this stuff: for simplicity in what
follows we shall not emphasize the role of the gauge field as a connection on
a principal bundle, but rather its role as defining a connection on associated
vector bundles.
Recall that our charged KG field can be viewed as a mapping
ϕ : M → C. (6.62)
π: E → M (6.63)
where
π −1 (x) = C, x ∈ M. (6.64)
M is called the base space. The space π −1 (x) ≈ C is the fiber over x. Since
C is a vector space, this type of fiber bundle is called a vector bundle. For
us, M = R4 and it can be shown that for a contractible base space such as
Rn there is always a (non-unique) diffeomorphism that makes possible the
global identification:
E ≈ M × C. (6.65)
For a general base space M , such an identification will only be valid locally.
Next, recall that a cross section of E (often just called a “section”) is a
map
σ: M → E (6.66)
satisfying
π ◦ σ = idM . (6.67)
Using coordinates (x, z) adapted to (6.65), we can identify a KG field ϕ(x)
with the cross section
σ(x) = (x, ϕ(x)). (6.68)
Thus, given the identification E ≈ M × C, we see that the bundle point
of view just describes the geometric setting of our theory: complex valued
6.5. SCALAR ELECTRODYNAMICS AND FIBER BUNDLES 139
functions on R4 . For the purposes of this discussion the most interesting issue
is that this identification is far from unique. Let us use coordinates (xα , z)
for E, where xα ∈ R4 and z ∈ C. Each set of such coordinates provides an
identification of E with M × C. Since we use a fixed (flat) metric on M ,
one can restrict attention to inertial Cartesian coordinates on M , in which
case one can only redefine xα by a Poincaré transformation. What is more
interesting for us in this discussion is the freedom to redefine the way that
the complex numbers are “glued” to each spacetime event.
Recall that to build the Lagrangian for the charged KG field we also had
to pick a scalar product on the vector space C; of course we just used the
standard one
hz, wi = z ∗ w. (6.69)
We can therefore restrict attention to linear changes of our coordinates on
C which preserve this scalar product. This leads to the allowed changes of
fiber coordinates being just the phase transformations
z → eiα z, α ∈ R. (6.70)
We can make this change of coordinates on C for each fiber so that on π −1 (x)
we make the transformation
z → eiα(x) z. (6.71)
There is no intrinsic way to compare points on different fibers, and this fact
reflects itself in the freedom to redefine our labeling of those points in a way
that can vary from fiber to fiber. We have seen this already; the change of
fiber coordinates z → eiα(x) z corresponds to the gauge transformation of the
charged KG field:
ϕ(x) → eiα(x) ϕ(x). (6.72)
When building a field theory of the charged KG field we need to take
derivatives. Now, to take a derivative means to compare the value of ϕ at
two neighboring points on M . From our fiber bundle point of view, this
means comparing points on two different fibers. Because this comparison is
not defined a priori, there is no natural way to differentiate a section of a
fiber bundle. This is why, as we saw, the ordinary derivative of the KG field
does not transform homogeneously under a gauge transformation. Thus, for
example, to say that a KG field is a constant, ∂α ϕ = 0, is not an intrinsic
statement since a change in the bundle coordinates will negate it.
140 CHAPTER 6. SCALAR ELECTRODYNAMICS
To continue the analogy with differential geometry a bit further, you see that
the field ϕ is playing the role of a vector, with its vector aspect being the
fact that it takes values in the vector space C and transforms homogeneously
under the change of fiber coordinates, that is, the gauge transformation. The
complex conjugate can be viewed as living in the dual space to C, so that it is
a “covector”. Quantities like the Lagrangian density, or the conserved electric
current are “scalars” from this point of view – they are gauge invariant. In
particular, the current
iq ∗ µ
Jµ = − (ϕ D ϕ − ϕDµ ϕ∗ ) (6.76)
4π
is divergence free with respect to the ordinary derivative, which is the correct
covariant derivative on “scalars”.
Do we really need all this fancy mathematics? Perhaps not. But, since all
the apparatus of gauge symmetry, covariant derivatives, etc., which show up
repeatedly in field theory, arises so naturally from this geometric structure,
it is clear that this is the right way to be thinking about gauge theories.
Moreover, there are certain results that would, I think, be very hard to come
by without using the fiber bundle point of view. I have in mind certain
important topological structures that can arise via global effects in classical
and quantum field theory. These topological structures are, via the physics
literature, appearing in the guise of “monopoles” and “instantons”. Such
structures would play a very nice role in a second semester for this course, if
there were one.
6.6 PROBLEMS
3. Verify that (6.16) is the Noether current coming from the U (1) symmetry
of the Lagrangian and that it is indeed conserved when the field equations
for ϕ hold.
Spontaneous symmetry
breaking
We now will take a quick look at some of the classical field theoretic underpin-
nings of “spontaneous symmetry breaking” (SSB) in quantum field theory.
Quite generally, SSB can be a very useful way of thinking about phase tran-
sitions in physics. In particle physics, SSB is used, in collaboration with the
“Higgs mechanism”, to give masses to gauge bosons (and other elementary
particles) without destroying gauge invariance.
143
144 CHAPTER 7. SPONTANEOUS SYMMETRY BREAKING
ϕ̃ = ϕ + const. (7.2)
∂ α ∂α ϕ = 0 (7.3)
If the Lagrangian and its field equations represent the “laws”, then the
solutions of the field equations are the “states” of the field that are allowed by
the laws. The function ϕ(x) is an allowed state of the field when it solves the
field equations. A symmetry of a given “state”, ϕ0 (x) say, is then defined
to be a transformation of the fields, ϕ → ϕ̃[ϕ], which preserves the given
solution
ϕ̃[ϕ0 (x)] = ϕ0 (x). (7.4)
Since symmetry transformations form a group, such solutions to the field
equations are sometimes called “group-invariant solutions”.
Let us consider an elementary example of group-invariant solutions. Con-
sider the KG field with mass m. Use inertial Cartesian coordinates. We have
seen that the spatial translations, xi → xi + const., i = 1, 2, 3, form a group
of symmetries of the theory. Functions which are invariant under the group
7.1. SYMMETRY OF LAWS VERSUS SYMMETRY OF STATES 145
1 d 2 df
(r ) = 0. (7.8)
r2 dr dr
1
The quotient space M/K is the set of orbits of K in M . Equivalently it is the set of
equivalence classes of points in M where two points are equivalent if they can be related
by an element of the transformation group K.
2
Some boundary conditions have to be imposed at r = 0, but we will not worry about
that here.
146 CHAPTER 7. SPONTANEOUS SYMMETRY BREAKING
This is the principal reason one usually makes a “symmetry ansatz” for
solutions to field equations which involves fields invariant under a subgroup
K of the symmetry group G of the equations. It is not illegal to make other
kinds of ansatzes, of course, but most will lead to inconsistent equations or
equations with trivial solutions.
Having said all this, I should point out that just because you ask for
group invariant solutions according to the above scheme it doesn’t mean you
will find any! There are two reasons for this. First of all, it may be that
there are no (non-trivial) fields invariant with respect to the symmetry group
you are trying to impose on the state. For example, consider the symmetry
group ϕ → ϕ + const. we mentioned earlier for the massless KG equation.
You can easily see that there are no functions which are invariant under
that transformation group. Secondly, the reduced differential equation may
have no (or only trivial) solutions, indicating that no (interesting) solutions
exist with that symmetry. Finally, I should mention that not all states have
symmetry - indeed the generic states are completely asymmetric. States
with symmetry are special, physically simpler states than what you expect
generically.
To summarize, field theories may have two types of symmetry. There
may be a group G of symmetries of its laws – the symmetry group of the
Lagrangian (and field equations). There can be symmetries of states, that
is, there may be a transformation group (usually a subgroup of G) which
preserves certain solutions to the field equations.
You can easily check that the solutions given by ϕ = 0, ± ab are critical points
of this energy functional.
spare you most of the details of the computations, but you might try to
fill them in as a nice exercise.) In an inertial reference frame with coordi-
nates xα = (t, xi ), the conserved energy for this non-linear field is easily seen
(from Noether’s theorem) to be
Z
3 1 2 1 ∗i 1 2 2 1 2 4
E= dx |ϕ,t | + ϕ,i ϕ, − a |ϕ| + b |ϕ| . (7.20)
2 2 2 4
You can easily check that the solutions given by |ϕ| = 0, a/b are critical
points of this energy functional. As before, the maximally symmetric state
ϕ = 0 is unstable. The circle’s worth of states (7.18) are quasi-stable in the
following sense. Any displacement in field space yields a non-negative change
in energy. To see this, write
ϕ = ρeiΘ , (7.21)
where ρ and Θ are spacetime functions. The energy takes the form
Z
3 1 2 1 i 1 2 2 i 1 2 2 1 2 4
E= dx ρ + ρ,i ρ, + ρ (Θ,t + Θ,i Θ, ) − a ρ + b ρ . (7.22)
2 ,t 2 2 2 4
The critical points of interest lie at ρ = ab , Θ = const. Expanding the energy
in displacements (δρ, δΘ) from equilibrium yields
1 a4
Z
3 1 2 1 i 1 a 2 2 i 2 2
E=− 2+ d x δρ + δρ,i δρ, + (δΘ,t + δΘ,i δΘ, ) + a δρ
4b 2 ,t 2 2 b
(7.23)
Evidently, all displacements except δρ = 0, δΘ = const. increase the energy.
The displacements δρ = 0, δΘ = const. do not change the energy, as you
might have guessed, since they correspond to displacements along the circular
locus of minima of the potential energy function. The states (7.18) are the
lowest energy states – the ground states. Thus the lowest energy is infinitely
degenerate – the set of ground states (7.18) is topologically a circle. That
these stable states form a continuous family and have less symmetry than the
unstable state will have some physical ramifications which we will unravel
after we take a little detour.
ϕ + a2 ϕ − b2 ϕ3 = 0. (7.25)
PROBLEM: Using (7.24) expand the action functional for (7.25) (see (7.9))
to quadratic order in δϕ. Show that this approximate action, viewed as
an action functional for the displacement field δϕ, has (7.26) as its Euler-
Lagrange field equation.
Evidently, the linearized equation (7.26) is a linear PDE for the displace-
ment field δϕ. In general, this linear PDE has variable coefficients due to the
presence of ϕ0 . But if the given solution ϕ0 is a constant in spacetime, the
linearized PDE is mathematically identical to a Klein-Gordon equation for
δϕ with mass given by (−a2 +3b2 ϕ20 ). The mass at the minima, ϕ0 = ±a/b, is
2a2 . The mass at the maximum, ϕ0 = 0, is −a2 . The negative mass-squared
is a symptom of the instability of this state of the field.
152 CHAPTER 7. SPONTANEOUS SYMMETRY BREAKING
From the way the linearized equation is derived, you can easily see that
any displacement field δϕ constructed as an infinitesimal symmetry of the
field equations will automatically satisfy the linearized equations when the
fields being used to build δϕ satisfy the field equations. Indeed, this fact
is the defining property of an infinitesimal symmetry of the field equations.
Here is a simple example.
Because of (1), the mass terms comes solely from the expansion of the
potential in (2). Because of the U (1) symmetry of the potential, through
each point in the set of field values there will be a curve (with tangent vec-
tor given by the infinitesimal symmetry) along which the potential will not
change. Because the symmetry is broken, this curve connects all the ground
states of the theory. Taylor expansion about the ground state in this sym-
metry direction can yield only vanishing contributions because the potential
has vanishing derivatives in that direction. Thus the broken symmetry direc-
tion(s) defines direction(s) in field space which correspond to massless fields
in an expansion about equilibrium. This is the essence of the (classical limit
of the) Goldstone theorem: to each broken continuous symmetry generator
there is a massless field.
1 1 1 1 1
L = − F αβ Fαβ − ∂α ρ∂ α ρ − ρ2 (∂α Θ + qAα )(∂ α Θ + qAα ) + a2 ρ2 − b2 ρ4 .
4 2 2 2 4
(7.36)
The Poincaré invariant ground state(s) can be determined as follows. As
we have observed, a Poincaré invariant function ϕ is necessarily a constant.
Likewise, it is too not hard to see that the only Poincaré invariant (co)vector
is the the zero (co)vector Aα = 0. Consequently, the Poincaré invariant
7.4. THE ABELIAN HIGGS MODEL 155
Ignoring terms of cubic and higher order in the displacements (δA, δρ, δΘ)
we then get
1 α β β α 1 aq 2
L ≈ − (∂α Bβ − ∂β Bα )(∂ B − ∂ B ) − Bα B α
4 2 b
1 1
− ∂α δρ ∂ α δρ − a2 δρ2
2 2
(7.40)
PROBLEM: Starting from (7.34) derive the results (7.36) and (7.40).
As you can see, excitations of ρ around the ground state are those of a
scalar field with mass a, as before. To understand the rest of the Lagrangian
(7.40) we need to understand the Proca Lagrangian:
1 1
LP roca = − (∂α Bβ − ∂β Bα )(∂ α B β − ∂ β B α ) − κ2 Bα B α . (7.41)
4 2
For κ = 0 this is just the usual electromagnetic Lagrangian. Otherwise...
( − κ2 )Bα = 0, ∂ α Bα = 0. (7.42)
156 CHAPTER 7. SPONTANEOUS SYMMETRY BREAKING
7.5 PROBLEMS
4. Using (7.24) expand the action functional to quadratic order in δϕ. Show
that this approximate action, viewed as an action functional for the displace-
ment field δϕ, has (7.26) as its Euler-Lagrange field equation.
( − κ2 )Bα = 0, ∂ α Bα = 0. (7.43)
158 CHAPTER 7. SPONTANEOUS SYMMETRY BREAKING
Chapter 8
Recall that spin is an “internal degree of freedom”, that is, an internal bit
of structure for the particle excitations of a quantum field whose classical
approximation we have been studying. The spin gets its name since it con-
tributes to the angular momentum conservation law. In quantum field the-
ory, the KG equation is an equation useful for describing relativistic particles
with spin 0; the Maxwell and Proca equations are equations used to describe
particles with spin 1. One of the many great achievements of Dirac was to
devise a differential equation that is perfectly suited for studying the quan-
tum theory of relativistic particles with “spin 21 ”. This is the Dirac equation.
Originally, this equation was obtained from trying to find a relativistic analog
of the Schrödinger equation. The idea is that one wants a PDE that is first
order in time, unlike the KG or the 4-potential form of Maxwell equations.
Here we shall simply define the Dirac equation without trying to give details
regarding its historical derivation. Then we will explore some of the simple
field theoretic properties associated with the equation.
ψ : M → C4 . (8.2)
159
160 CHAPTER 8. THE DIRAC FIELD
γµ : C4 → C4 , µ = 0, 1, 2, 3 (8.3)
γµ γν + γν γµ = 2ηµν I. (8.4)
We can define
iI 0
γ0 = , (8.6)
0 −iI
and
0 −iσj
γj = , j = 1, 2, 3. (8.7)
iσj 0
Here we are using a 2 × 2 block matrix notation.
The γ-matrices are often called, well, just the “gamma matrices”. They
are also called the Dirac matrices. The Dirac matrices satisfy
and
γ0† = −γ0−1 , γi† = γi−1 (8.9)
The Dirac matrices are not uniquely determined by their anti-commutation
relations. It is possible to find other, equivalent representations. How-
ever, Pauli showed that if two sets of matrices γµ and γµ0 satisfy the anti-
commutation relations and the Hermiticity relations of the gamma matrices
as shown above, then there is a unitary transformation U on C4 such that
γµ0 = U −1 γµ U. (8.10)
8.1. THE DIRAC EQUATION 161
γ µ = η µν γν . (8.12)
γ µ ∂µ ψ = −mψ, (8.17)
162 CHAPTER 8. THE DIRAC FIELD
0 = (γ ν ∂ν + mI)(γ µ ∂µ ψ + mψ)
= ( − m2 )ψ, (8.18)
≡ I, m2 ≡ m2 I. (8.19)
Note: That each component of the Dirac field satisfies the KG equation is
necessary for ψ to satisfy the Dirac equation, but it is not sufficient.
The preceding computation allows us to view the Dirac operator as a
sort of “square root” of the KG operator. This is one way to understand the
utility of the Clifford algebra.
I have no choice but to assign you the following.
PROBLEM: Verify that the Dirac γ matrices shown in (8.6), (8.7) satisfy
unitary transformations in the quantum theory. The reason why this is im-
portant is that irreducible projective unitary representations of the Poincaré
group are identified with elementary particles in quantum field theory. The
Dirac equation, in particular, is used to describe elementary spin 1/2 par-
ticles like the electron and quarks. These fermionic representations in the
quantum theory can be seen already at the level of the classical field theory,
and this is what we want to describe here. But first it will be instructive to
spend a little time systematically thinking about the Poincaré group, its Lie
algebra, and its representations.
To begin, we’ll review the definition of the Poincaré group. Recall that
the Poincaré group can be viewed as the transformation group x → x0 which
leaves the flat spacetime metric η invariant:
Elements of the Poincaré group are thus labeled by pairs (L, a).1 The group
composition law for two elements g1 = (L1 , a1 ), g2 = (L2 , a2 ) is defined by
(g2 g1 · x)α = Lα2β (Lβ1γ xγ + aβ1 ) + aβ2 = (Lα2β Lβ1γ )xγ + Lα2β aβ1 + aβ2 , (8.24)
so that
g2 g1 = (L2 L1 , L2 · a1 + a2 ). (8.25)
As mentioned earlier, in quantum theory it is the “projective unitary
representations” of symmetry groups which are relevant. It can be shown
that all irreducible unitary, projective representations of the Poincaré group
are characterized by two parameters, the spin and the mass. Nature only
seems to take advantage of the representations in which the mass is real and
1
As an exercise: In terms of this characterization of group elements, what is the identity
and inverse transformation?
164 CHAPTER 8. THE DIRAC FIELD
positive and the spin is a non-negative integer or half integer.2 The spin of
the representation is largely controlled by the geometrical type of the field.
The principal role of the field equations is then to pick out the mass of the
representation and ensure its irreducibility. So far we have considered a scalar
field and a 1-form, corresponding to spin-0 and spin-1. The Dirac field is a
new type of geometric object: a spinor field, corresponding to spin- 12 .
To better understand this last statement, we now want to see how the
Poincaré group is represented as a transformation group of the classical fields.
A field representation of the Poincaré group G of transformations assigns to
each group element g ∈ G a linear transformation Ψg of the fields,
ϕ → ϕ0 = Ψg (ϕ), (8.26)
To understand this point of view, consider our previously studied fields. The
Klein-Gordon field was defined as a function or scalar field, ϕ : R4 → R, so
that we have the representation of the Poincaré group on the space of KG
fields given by:
ϕ(x) → ϕ0 (x) = ϕ(L · x + a). (8.28)
It is easy to check that the space of KG fields, equipped with this trans-
formation rule provides a field representation of the Poincaré group. This
representation is called the spin-0 representation. The electromagnetic field
was defined in terms of a 1-form A,
A : R4 → T ∗ M. (8.29)
A = Aα dxα (8.30)
via
Aα (x) → A0α (x) = Lβα Aβ (L · x + a). (8.31)
This transformation law leads to the spin-1 representation of the Poincaré
group.
2
The other allowed representations have imaginary mass, or vanishing mass and con-
tinuous spin. Both mathematical possibilities appear to have unphysical properties.
8.3. THE SPINOR REPRESENTATION 165
PROBLEM: Verify that the transformation laws (8.28) and (8.31) define
representations of the Poincaré group.
aα (s) = s bα . (8.33)
such that
Lαβ (s) = exp(sω α β ) ≈ δβα + s ω α β + · · · . (8.35)
Thus ω and b label the infinitesimal generators of the 1-parameter family of
Poincaré transformations.
The formal definition of the Lie algebra goes as follows.3 The underlying
vector space is the set of pairs h = (ω, b) with addition given by
h1 + h2 = (ω1 + ω2 , b1 + b2 ). (8.36)
[h1 , h2 ] = h3 , (8.38)
where
γ β γ β
(ω3 )βα = [ω1 , ω2 ]βα = ω1α ω2γ − ω2α ω2γ , (8.39)
α β α β
bα3 = ω1β b2 − ω2β b1 + bα1 − bα2 . (8.40)
1
Sµν = [γµ , γν ]. (8.41)
4
4
The transformation of the value of a scalar field is just the identity transformation.
8.3. THE SPINOR REPRESENTATION 167
PROBLEM: Using
γµ γν + γν γµ = 2ηµν I, (8.44)
show that
1 1
[S(ω), S(χ)] = ω µν χαβ [Sµν , Sαβ ] = [ω, χ]αβ Sαβ = S([ω, χ]). (8.45)
4 2
This shows that the linear transformations S(ω) on the space C4 represent
the Lie algebra of the Lorentz group. This representation is called the (in-
finitesimal) spinor representation of the (infinitesimal) Lorentz group. The
Lorentz transformation L(ω) defined by the matrix ω α β is represented on
vectors in C4 – “spinors” – by the matrix exponential R(ω):
R(ω) = exp {S(ω)} . (8.46)
The fact that S(ω) represents the Lie algebra ensures that the exponential
will (projectively) represent the Lorentz group. As with scalar and vector
fields, the translational part of the Poincaré group is represented trivially on
the values of the spinor.
We have considered how C4 – the values of taken by a Dirac field – can
provide a representation of the Lie algebra of the Poincaré group. We now
extend this representation to the connected component of the identity of the
Poincaré group by adding in the Lorentz and translational transformations
to the argument of the field and exponentiating. We specify a Poincaré
transformation5 by using a skew tensor ω, and a constant vector b. We
define the transformation ψ → ψ 0 by
ψ 0 (x) = eS(ω) ψ(eS(ω) · x + b). (8.47)
5
We are restricting to the component connected to the identity transformation this way.
For simplicity I will suppress the interesting discussion of how to represent the remaining
transformations (time reversal and spatial reflection).
168 CHAPTER 8. THE DIRAC FIELD
δ2 δ1 ψ − δ1 δ2 ψ = δ3 ψ, (8.49)
where
ω3β α = [ω2 , ω1 ]β α , α β
bα3 = ω1β α β
b2 − ω2β b1 + bα1 − bα2 . (8.50)
Comparing with (8.39) and (8.40) we see that the infinitesimal transforma-
tions (8.48) do indeed represent the Lie algebra of the Poincaré group.
As I have mentioned in the preceding discussion, the “spinor represen-
tation” of the Poincaré group we have been describing is not quite a true
representation of the group, but rather a projective representation. (As we
just saw, we do have a true representation of the Lie algebra.) To see why
I say this, consider a Poincaré transformation consisting of a rotation by 2π
about, say, the z-axis. You can easily check that this is generated by an
infinitesimal transformation with bα = 0 and
which is just showing that a rotation by 2π is the same as the identity trans-
formation. (Check this calculation as a nice exercise.) The corresponding
transformation ψ → ψ 0 of the spinor field is determined by
1
S(ω) = ω αβ [γα , γβ ] = π[γ1 , γ2 ] = iπ diag(1, 1, −1, −1). (8.54)
4
8.3. THE SPINOR REPRESENTATION 169
So that
eS(ω) = −I, (8.55)
and
ψ 0 (x) = −ψ(x). (8.56)
ψ̄ = iψ † γ 0 , (8.58)
so the EL equations coming from varying ψ̄, namely Eψ̄ = 0, yields the Dirac
equation for ψ The EL equations coming from varying ψ are determined by
8.6 Energy
It is instructive to have a look at the conserved energy for the Dirac field,
both to illustrate previous technology and to motivate the use of Grassmann-
valued fields.
From (8.64) the time translation symmetry is a divergence symmetry with
δL = Dµ (δ0µ L). (8.70)
According to Noether’s theorem, then, we have the conserved current
1
j µ = − [ψ̄γ µ ψ,0 − ψ̄,0 γ µ ψ] + δ0µ L. (8.71)
2
Next we take note of the following.
PROBLEM: Check that the current (8.72) is conserved when the Dirac
equation holds.
Let us write
a(t)
b(t)
ψ=
c(t) .
(8.76)
d(t)
Since γ0 = diag(i, i, −i, −i) we have that γ 0 = diag(−i, −i, i, i) so that the
Dirac equation reduces to the decoupled system
−iȧ + ma = 0
−iḃ + mb = 0
iċ + mc = 0
id˙ + md = 0. (8.77)
This energy is clearly not bounded from below. Physically this is a disaster
since it means that one can extract an infinite amount of energy from the
Dirac field by coupling it to other dynamical systems. Note that, while we
are free to consider redefining the energy density by a change of sign, this
won’t help because ρ is not bounded from above either.
The fix for this difficulty is hard to explain completely without appealing
to quantum field theory, which is where the Dirac field has its physical util-
ity. Still, it is worth it to briefly sketch how it goes. In quantum field theory
one, loosely speaking, replaces the set-up where fields take values in a com-
mutative algebra (e.g., real numbers, complex numbers, etc. ) with a set-up
where fields take values in an operator algebra on a Hilbert space.6 Because
6
More precisely: fields are viewed as “unbounded, self-adjoint operator-valued distri-
butions on spacetime”.
174 CHAPTER 8. THE DIRAC FIELD
In the “classical limit” that we have been exploring, this operator algebra
aspect of the problem is suppressed for the most part. But a little bit remains.
For bosonic fields (integer spin) the classical limit has the fields taking values
in a commutative algebra, but for fermionic fields, e.g., spin 1/2 fields like
the Dirac field one can show that the “classical limit” of the operator algebra
leads to fields which anti-commute. This leads one to formulate a classical
Dirac field theory as a theory of Grassmann-valued spinors.
A finite-dimensional Grassmann algebra A is a vector space V with a
basis χα , α = 1, 2, . . . , n equipped with a product such that
χα χβ = −χβ χα . (8.81)
The algebra is, as usual, built up by sums and products of the χα . Every
element Ω ∈ A is of the form
where ω0 can be identified with the field (real or complex) of the vector
space, the coefficients ω α1 ··· are totally antisymmetric. The terms of odd
degree in the χα are “anti-commuting” while the terms of even degree are
“commuting”. If the vector space is complex, then there is a notion of com-
plex conjugation in which
(αβ)∗ = β ∗ α∗ , α, β ∈ A. (8.83)
α∗ = α, β ∗ = β, (8.84)
Dirac fields are then considered as maps from spacetime into V . Of course,
one cannot directly measure a “Grassmann number”, but the “commuting”
part of the algebra can be interpreted in terms of real numbers, which is how
one interprets observable aspects of the classical field theory.
Let us return to our simple example involving the energy density asso-
ciated to spatially homogeneous solutions to the Dirac equation. Given the
anti-commuting nature of the values of the Dirac field, it follows that the
solution to the field equations we wrote down above has a, b, c, d now being
interpreted as anti-commuting Grassmann numbers. It then becomes an is-
sue as to what order to put the factors in the definition of various quantities,
say, the energy density. To answer this question it is best to appeal to the
underlying quantum field theory, from which it turns out that the correct
version of (8.79) is
ρ = (a∗ a + b∗ b − cc∗ − dd∗ ) = (a∗ a + b∗ b + c∗ c + d∗ d). (8.86)
It turns out that in the classical limit of the quantum theory each of the
quantities a∗ a, b∗ b, c∗ c, d∗ d correspond to positive real numbers whence the
energy density is now positive.
j α = iψγ α ψ. (8.89)
Verify that this vector field is divergence-free when the Dirac equation is
satisfied.
This current can be interpreted as the electric 4-current of the field in
the absence of interaction with an electromagnetic field. To introduce that
interaction we “gauge” the global U (1) symmetry to make it “local”. We do
this by introducing the electromagnetic potential A = Aα dxα and defining a
“covariant derivative”,
Dµ ψ = ∂µ ψ − iqAµ ψ. (8.90)
1 µ 1
ψ̄γ Dµ ψ − (Dµ ψ̄)γ µ ψ − mψ̄ψ − Fαβ F αβ .
L=− (8.93)
2 4
PROBLEM: Calculate the field equations for ψ and A from this Lagrangian
density.
8.8. PROBLEMS 177
8.8 PROBLEMS
2. Verify that the transformation laws (8.28) and (8.31) define representa-
tions of the Poincaré group.
3. Using
γµ γν + γν γµ = 2ηµν I, (8.95)
show that
1 µν αβ 1
ω χ [Sµν , Sαβ ] = [ω, χ]αβ Sαβ = S([ω, χ]). (8.96)
4 2
5. Show that the equations Eψ = 0 (see (8.62)) are equivalent to the Dirac
equation for ψ.
6. Prove (8.69).
7. Show that the Dirac Lagrangian vanishes when the field equations hold.
8. Check that the current (8.72) is conserved when the Dirac equation holds.
j α = iψγ α ψ. (8.97)
Verify that this vector field is divergence-free when the Dirac equation is
satisfied.
10. Calculate the field equations for ψ and A from the Lagrangian density
(8.93).
178 CHAPTER 8. THE DIRAC FIELD
Chapter 9
179
180 CHAPTER 9. NON-ABELIAN GAUGE THEORY
The non-Abelian gauge theory substitutes a Lie group1 for the group
U (1) that featured in electrodynamics. Although when among friends one
often uses the terms “non-Abelian gauge theory” and “Yang-Mills theory”
interchangeably, one properly distinguishes the Yang-Mills theory as the spe-
cialization of non-Abelian gauge theory to the gauge group built from SU (2),
as Yang and Mills originally did. We shall try to make this distinction as
well.
Non-Abelian gauge theory is a theory of interactions among matter just
as electrodynamics is. Following Weyl, we were able to view electromagnetic
interactions between electrically charged matter as a manifestation of a “lo-
cal U (1) symmetry” of matter. Recall that the charged KG field acquired
its conserved charge by virtue of the U (1) phase symmetry. This symme-
try acted “globally” in the sense that the symmetry transformation shifted
the phase of the complex KG field by the same amount throughout space-
time. The coupling of the charged KG field to the EM field can be viewed
as corresponding to the requirement that this phase change could be made
independently (albeit smoothly) at each spacetime event. Making the sim-
plest “minimal” generalization of the charged KG Lagrangian to incorporate
this “local” gauge symmetry involved introducing the Maxwell field to define
a connection, i.e., a gauge covariant derivative. This gives the correct La-
grangian for the charged KG field in the presence of an electromagnetic field.
The dynamics of the EM field itself could then be incorporated by adjoining
to this Lagrangian the simplest gauge invariant Lagrangian for the Maxwell
field. This leads to the theory of scalar electrodynamics.
We have already discussed another kind of “charged KG” theory in which
one has not a single conserved current but three of them corresponding to
a global SU (2) symmetry group. What happens if we try to make that
symmetry local? This is one way to “invent” the YM theory.
where
n = (n1 , n2 , n3 ), (n1 )2 + (n2 )2 + (n3 )2 = 1, (9.6)
and
0 1 0 −i 1 0
σ1 = σ2 = , σ3 = . (9.7)
1 0 i 0 0 −1
are the Pauli matrices. Note that there are three free parameters in this
group, corresponding to θ and the two free parameters defining ni .
The group SU (2), as represented on C2 , acts on the fields in the obvious
way:
ϕ(x) → U ϕ(x). (9.8)
One parameter subgroups U (λ) are defined by any curve in the 3-d parameter
space associated with θ and n. As an example, set n = (1, 0, 0) and θ = λ
to get
cos λ i sin λ
U (λ) = . (9.9)
i sin λ cos λ
The infinitesimal form of the SU (2) transformation is
δϕ = τ ϕ, (9.10)
Note that
δϕ† = −ϕ† τ. (9.12)
We can write
τ = −iaj σj , (9.13)
where ai ∈ R3 . Thus infinitesimal transformations can be identified with a
three-dimensional vector space with a basis
δj ϕ = iσj ϕ. (9.14)
i
ek = − σk , τ = ak ek , ak ∈ R3 . (9.15)
2
It is easy to check that in this basis the structure constants of su(2) are given
by
[ei , ej ] = ij k ek , (9.16)
where ijk is the three-dimensional Levi-Civita symbol and indices are raised
and lowered using the Kronecker delta.
PROBLEM: Verify that the ek in (9.15) satisfy the su(2) Lie algebra (9.16)
as advertised.
These currents “carry” the SU (2) charge possessed by the scalar fields.
PROBLEM: (a) show that the Lagrangian (9.17) is invariant with respect
to the transformation (9.8); (b) compute its Euler-Lagrange equations; (c)
derive the conserved currents (9.19) from Noether’s theorem; (d) verify that
the currents (9.19) are divergence-free when the Euler-Lagrange equations
are satisfied.
PROBLEM: Show that if any one of the currents (9.19) are conserved, then
the other 2 are automatically conserved. (Hint: Consider the behavior of the
currents under SU (2) transformations of the fields.)
U : C2 → C2 , U † = U −1 , det U = 1, (9.21)
becomes a “local” symmetry:
Aµ = Aiµ ei , (9.25)
9.2. LOCAL SU (2) SYMMETRY 185
where ei are the basis for su(2) defined in (9.15). Thus, if you like, you can
think of the gauge field as 4 × 3 = 12 real fields Akµ (x) labeled according to
their spacetime (index µ = 0, 1, 2, 3) and “internal” su(2) structure (index
k = 1, 2, 3).
The gauge covariant derivative of ϕ is now defined as2
Dµ ϕ = ∂µ ϕ + Aµ ϕ. (9.26)
A fixed, given gauge field Aµ just defines another way to compare phases of
fields at different points. Nothing is gained symmetry-wise by its introduc-
tion. Indeed, under a gauge transformation
= U (x) (∂µ + Aµ ) ϕ
= U (x)Dµ ϕ. (9.29)
Granted this set-up, we can then define the Lagrangian for ϕ using the “min-
imal coupling prescription
∂µ → Dµ , (9.30)
so that we have
L = −η αβ hDα ϕ, Dβ ϕi − m2 hϕ, ϕi. (9.31)
This Lagrangian is no more gauge invariant than our original Lagrangian
(9.17), which it includes as the special case A = 0, but it has the virtue
of allowing for any choice of connection between C2 at neighboring points.
Physically, this Lagrangian defines the dynamics of the fields ϕ under the
influence of the “force” due to any given gauge field A.
I think you can imagine that we could play the preceding game whenever
one is given (1) some fields ϕ : M → V , where V is some vector space, and
(2) given a representation of SU (2) on this vector space, (3) given a globally
2
For simplicity we absorb a coupling constant into the definition of A.
186 CHAPTER 9. NON-ABELIAN GAUGE THEORY
We can interpret this last result as follows. The Lie algebra su(2), like
any Lie algebra, is a vector space. You can easily check that the set of 2 × 2
anti-Hermitian trace-free matrices is a real three-dimensional vector space
using the usual notion of addition of matrices and multiplication of matrices
by scalars. This three dimensional vector space provides a representation
of the group SU (2) called the adjoint representation. This representation is
defined by
τ → U τ U −1 . (9.38)
9.3. INFINITESIMAL GAUGE TRANSFORMATIONS 187
You can check that this transformation provides an isomorphism of the al-
gebra onto itself:
[U τ U −1 , U τ 0 U −1 ] = U [τ, τ 0 ]U −1 . (9.39)
Anyway, at any given point, ξ ≡ δU is a field taking values in this vector
space and we have a representation of SU (2) acting on this vector space, so
we can let SU (2) act on ξ via
ξ → U ξU −1 . (9.40)
This linear representation can be made into an explicit matrix representation
on the matrix elements of ξ (viewed as entries in a column vector) but we
will not need to do this explicitly. The infinitesimal adjoint action of SU (2)
on the vector space su(2) is via the commutator. To see this, simply consider
a 1-parameter family of SU (2) transformations Uλ with
dUλ
Uλ=0 = I, = τ, (9.41)
dλ λ=0
where τ is trace-free and anti-Hermitian. We have the infinitesimal adjoint
action of SU (2) on su(2) given by
δξ = [τ, ξ] (9.42)
PROBLEM: Show that the adjoint representation (9.40) has the infinites-
imal form (9.42).
The point of these observations is to show that we can now extend our
definition of the covariant derivative to ξ via the adjoint representation. We
can define a covariant derivative of ξ as we did for ϕ by replacing the repre-
sentation of infinitesimal SU (2) on ϕ,
δϕ = τ ϕ, (9.43)
with its action on ξ:
δξ = [τ, ξ]. (9.44)
We define
Dµ ξ = ∂µ ξ + [Aµ , ξ, ]. (9.45)
It then follows that we can interpret the infinitesimal gauge transformation
of the gauge field in terms of the gauge covariant derivative of ξ:
δAµ = −Dµ ξ. (9.46)
188 CHAPTER 9. NON-ABELIAN GAUGE THEORY
We can associate a group element g[γ] ∈ SU (2) to each point on this curve
by, in effect, exponentiating this infinitesimal transformation. More precisely,
we define a group transformation at each point along the curve by solving
the differential equation
d
g(s) = −γ̇ µ (s)Aµ (γ(s))g(s), (9.49)
ds
subject to the initial condition
g(0) = I. (9.50)
We say that ϕ has been parallelly propagated from x1 to x2 along the curve
γ if
ϕ(x2 ) = g(1)ϕ(x1 ). (9.51)
The idea is that, given the curve, the group transformation g(1) is used to
define the relationship between the spaces C2 at x1 and x2 . Equivalently, we
can define the parallel propagation of ϕ along the curve γ to be defined by
solving the equation
d
ϕ(γ(s)) = −γ̇ µ (s)Aµ (γ(s))ϕ(γ(s)). (9.52)
ds
9.4. GEOMETRICAL INTERPRETATION: PARALLEL PROPAGATION189
You can see how the covariant derivative determines the rate of change of ϕ
along the curve.
Let me emphasize that the parallel transport of a field (relative to a given
connection) from one point to another is path-dependent, i.e., depends upon
the choice of curve connecting the points. This is because the form of the
ODE shown above will depend upon γ. We will say a little more about this
below.
It is possible to give a formal series solution to the differential equation
of parallel propagation. Indeed, if you have studied quantum mechanics, you
will perhaps note the similarity of the equation
d
g(s) = −γ̇ µ (s)Aµ (γ(s))g(s). (9.54)
ds
to the Schrödinger equation for the time evolution operator U (t) of a time-
dependent Hamiltonian H(t):
∂U (t)
i~ = H(t)U (t). (9.55)
∂t
In quantum mechanics one solves the Schrödinger equation using the “time-
ordered exponential”; here we can use the analogous quantity, the “path-
ordered exponential”:
∞
X Z 1 Z sn Z s2
n
g(s) = (−1) dsn dsn−1 · · · ds1 γ̇ µ (sn )Aµ (γ(sn )) · · · γ̇ µ (s1 )Aµ (γ(s1 ))
n=0 0 0 0
Z 1
≡ P exp − ds γ̇ µ Aµ .
0
(9.56)
190 CHAPTER 9. NON-ABELIAN GAUGE THEORY
Dµ ϕ = 0, (9.57)
such fields are called covariantly constant, or just “constant”, or just “par-
allel”. Covariantly constant fields have the property that their value at any
point is parallel with its value at any other point – relative to the definition of
“parallel” provided by the connection A and independently of any choice of
curve connecting the points. As we shall see below, the existence of parallel
fields requires a special choice of connection.
It is instructive to point out that everything we have done for SU (2) gauge
theory could be also done with the group U (1) in SED. Now the connections
can be viewed as i× ordinary, real-valued 1-forms ωµ , which commute. In
this case the path-ordered exponential becomes the ordinary exponential:
Z 1 Z 1
µ µ
P exp − ds γ̇ Aµ = exp −i ds γ̇ ωµ . (9.58)
0 0
Finally, let us consider the behavior of the curvature under a gauge trans-
formation.
Thus while the gauge field transforms inhomogeneously under a gauge trans-
formation, the curvature transforms homogeneously. The curvature is trans-
forming according to the adjoint representation of SU (2) on su(2) in which,
with
τ ∈ su(2), U ∈ SU (2), (9.73)
we have
τ → U τ U −1 . (9.74)
Infinitesimally, if we have a gauge transformation defined by
The curvature of the YM field differs from the curvature of the Maxwell
field in a few key ways. First, the YM curvature is really a trio of 2-forms,
while the Maxwell curvature is a single 2-form. Second, the YM curvature is
a non-linear function of the the gauge field in contrast to the linear relation
F = dA arising in Maxwell theory. Finally, the YM curvature transforms
homogeneously under a gauge transformation, while the Maxwell curvature
is gauge invariant. You can now interpret this last result as coming from the
fact that the adjoint representation of an Abelian group like U (1) is trivial
(exercise).
9.6. LAGRANGIAN FOR THE YM FIELD 193
If we view A as fixed, i.e., given, then it is not a field variable but instead
provides explicit functions of xα which appear in Lϕ = Lϕ (x, ϕ, ∂ϕ). Gauge-
related connections A will give the same Lagrangian if we transform ϕ as
well, but this has no immediate field theoretic consequence, e.g., it doesn’t
provide a symmetry for Noether’s theorem. The Lagrangian in this case is
viewed as describing the dynamics of KG fields interacting with a prescribed
YM field. The full, gauge invariant theory demands that A also be one of
the dynamical fields. Following the examples of the electrically charged KG
field and Dirac field, we need to adjoin to this Lagrangian a term providing
dynamics for the YM field.4 This is our next consideration.
We can build a Lagrangian for the YM field A by a simple generaliza-
tion of the Maxwell Lagrangian. Our guiding principle is gauge invariance.
Now, unlike the Maxwell curvature, the YM curvature is not gauge invariant,
rather, it is gauge “covariant”, transforming homogeneously under a gauge
transformation via the adjoint representation of SU (2) on su(2). However, it
is easy to see that the trace of a product of su(2) elements is invariant under
this action of SU (2) on su(2):
tr (U τ1 U −1 )(U τ2 U −1 ) = tr {τ1 τ2 } .
(9.78)
we get
1
LY M = − δij F µν i Fµνj
. (9.82)
4
So, the YM Lagrangian is arising really as a sum of 3 Maxwell-type La-
grangians. However there is a very significant difference between the Maxwell
and YM Lagrangians: the Maxwell Lagrangian is quadratic in the gauge field
while the YM Lagrangian includes terms cubic and quartic in the gauge field.
This means that, unlike the source-free Maxwell equations, the source-free
YM equations will be non-linear. Physically this means that the YM fields
are “self-interacting”.
Dµ F αβ = ∂µ F αβ + [Aµ , F αβ ], (9.90)
which is appropriate given the fact that (like δA) F transforms according to
the adjoint representation of SU (2).
From this computation we see that the EL derivative of LY M is
where the Latin (su(2) component) indices are raised and lowered with δij .
The source-free YM equations are then
Dµ F µν = 0. (9.92)
Of course, this compact geometric notation hides a lot of stuff. The PDEs
for the gauge field A are, more explicitly,
ϕ → U ϕ, A → U AU −1 . (9.101)
But this kind of symmetry is not that meaningful (in general) since there is no
gauge invariant way of demanding that U be constant except via the covariant
derivative, and this would force the curvature to vanish. The availability
of “global” or “rigid” gauge symmetries is a special feature permitted by
Abelian gauge groups. It does not immediately generalize to YM theory.
However, we can still consider Noether’s second theorem as applied to
the gauge symmetry
To begin with, the source-free theory described by LY M also has the gauge
symmetry, and this implies the identity (which you can easily check directly)
Dν Dµ F µν = 0. (9.103)
Dν Dµ F µν = 0. (9.104)
(Note that this is not quite as trivial as in the U (1) case since covariant
derivatives appear.)
For the YM field coupled to its source described by Lϕ , the differential iden-
tity is
{Dν [Dµ F µν − j ν ]}i + 2RehE(Lϕ ), ei ϕi = 0. (9.105)
We conclude then that the scalar current (9.100) satisfies a covariant conser-
vation equation when the field equations hold:
A = A1 e1 , A1 = αµ dxµ . (9.107)
Dφ = 0. (9.108)
PROBLEM: Suppose that the gauge field A is such that there exists a
covariantly constant field φ = φi ei (e.g., as in the previous problem),
Dφ = 0. (9.109)
Show that J µ := φi jiµ (see (9.100)) defines a bona fide conserved current.
ϕ : M → V, (9.110)
9.10. NON-ABELIAN GAUGE THEORY IN GENERAL 199
Dµ ϕ = ∂µ ϕ + Aµ ϕ. (9.111)
so that
Fµν = ∂µ Aν − ∂ν Aµ + [Aµ , Aν ]. (9.113)
The curvature is a g-valued 2-form. The behavior of the gauge field and
curvature under a gauge transformations is the same as when the gauge
group was SU (2):
α: V × V → R (9.116)
α(v, w) = 0 ∀ w =⇒ v = 0. (9.118)
Next, let
β: g × g → R (9.119)
be a non-degenerate bi-linear form on the Lie algebra, invariant with respect
to the adjoint representation of G on g. If U ∈ G the adjoint representation,
AdU : g → g, is given by
AdU · τ = U τ U −1 , τ ∈ g. (9.120)
200 CHAPTER 9. NON-ABELIAN GAUGE THEORY
we have
β(v, w) = βij v i wj . (9.128)
The adjoint-invariance of β implies that
c c
Cka βcb + Ckb βac = 0. (9.129)
c c
Along with the usual anti-symmetry of the structure constants, Cab = −Cba ,
this implies
c c
βca Cbd = βc[a Cbd] . (9.130)
The next ingredient we will need to build the Lagrangian is the 3-dimensional
permutation symbol.6 This object is denoted η αβγ , α, β, γ = 1, 2, 3, and is de-
fined to be (i) totally anti-symmetric (changes sign under any permutation
of its components), and (ii) have components consisting of 0, ±1 where, in
any coordinate system (with the same orientation),
η 123 = 1. (9.131)
important point to make in this context is that nowhere did we need to use
a metric tensor to define the permutation symbol; such an object is always
available as a rank-n tensor density on any n-dimensional manifold.
PROBLEM: Show that one can likewise define a permutation symbol ηαβγ ,
which is a covariant tensor density of weight plus one. (Do not introduce a
metric to lower the indices!)
To get the last equality I used (9.130). The curvature formula (9.113) ex-
pressed in the basis (9.125) is
i
Fµν = ∂µ Aiν − ∂ν Aiµ + Cjk
i
Ajµ Akν . (9.138)
We thus get
j
Erµ (L) = 2η µβγ βjr Fβγ . (9.139)
9.11. CHERN-SIMONS THEORY 203
we can conclude
Erµ (L) = 0 ⇐⇒ i
Fµν = 0. (9.141)
The EL equations for the Chern-Simons Lagrangian say that the gauge field
(or connection) is flat!
I will conclude this brief introduction to Chern-Simons theory by exposing
the somewhat some remarkable symmetries of the Lagrangian. To begin, it
is reasonable to inquire whether there is a gauge symmetry. At a first glance,
existence of such a symmetry would appear doubtful since the Lagrangian is
not built in an invariant way from the curvature. Still, it is instructive to
look closer. To investigate this, use (9.138) to write the Lagrangian as
αβγ i j 1 j i k l
L = η βij Aα Fβγ − Ckl Aα Aβ Aγ
3
αβγ 1
=η β(Aα , Fβγ ) − β(Aα , [Aβ , Aγ ])
3
αβγ 2
=η β(Aα , Fβγ ) − β(Aα , Aβ Aγ ) . (9.142)
3
Define
Ãα = U Aα U −1 − ∂α U U −1 , F̃βγ = U Fβγ U −1 . (9.143)
Use the gauge transformation formulas (9.114) and (9.115) and the invariance
conditions (9.121) and (9.130) to obtain
(9.144)
and
+ 3β(U −1 ∂α U U −1 ∂β U, Aγ ) − 3β(U −1 ∂α U, Aβ Aγ )
(9.145)
204 CHAPTER 9. NON-ABELIAN GAUGE THEORY
We now have
2
L̃ ≡ η αβγ β(Ãα , F̃βγ ) − β(Ãα , Ãβ Ãγ )
3
2 αβγ
= L + η β(∂α U U −1 , ∂β U U −1 ∂γ U U −1 )
3
αβγ
β(U −1 ∂α U, ∂β Aγ ) + β(U −1 ∂α U U −1 ∂β U, Aγ ) .
− 2η (9.146)
U −1 ∂α U U −1 = −∂α U −1 , (9.147)
so that
2
L̃ = L + η αβγ β(∂α U U −1 , ∂β U U −1 ∂γ U U −1 ) + ∂α η αβγ β(2U −1 ∂βU, Aγ ) .
3
(9.148)
The last term is a divergence; what about the second term? This term is
interesting. If gauge transformation can be obtained by iterating infinitesimal
transformations, that is, if it can be written as
You can check that in this case the term of interest in (9.148) can also be
written as a divergence. Thus infinitesimal gauge transformations will be
divergence symmetries of the Lagrangian. Noether’s second theorem then
applies implying the existence of differential identities satisfied by the Euler-
Lagrange equations. In this case these are equivalent to the “Bianchi identi-
ties”:
D[α Fβγ] = 0. (9.151)
It can happen that there are gauge transformations not obtained by it-
erating infinitesimal transformations as in (9.149). To explain this in detail
will take us too far afield, so let me just provide you with an example.
PROBLEM: Consider a Chern-Simons theory where M = S 3 and G =
SU (2), and βij = −2δij . Using the Euler angle coordinates xµ = (γ, β, α),
(0 < α < 2π, 0 < β < 2π, 0 < γ < π) for S 3 , define U : M → G by
cos(γ/2) e−i(α−β) sin(γ/2)
i(α+β)
e
U= (9.153)
−ei(α−β) sin(γ/2) e−i(α+β) cos(γ/2)
Show that Z
η αβγ β(∂α U U −1 , ∂β U U −1 ∂γ U U −1 ) = 48π 2 . (9.154)
M
This means that the integrand can’t be written as a divergence since in such
a case the integral would have to vanish by the divergence theorem. Thus for
these “large” gauge transformations the Lagrangian is not gauge invariant.7
Finally, let me point out an even more remarkable symmetry enjoyed
by the Chern-Simons Lagrangian. Consider a diffeomorphism of M . This
is a smooth 1-1 mapping of M onto itself whose inverse is also smooth.
Diffeomorphisms are the “symmetries” of manifolds in the sense that one
considers two manifolds related by a diffeomorphism to be the same abstract
manifold. The diffeomorphisms form a group which is the symmetry group of
“generally covariant” theories like Einstein’s general theory of relativity, as
we shall discuss later. I want to show you that, unlike the usual Yang-Mills
type of Lagrangian, the Chern-Simons Lagrangian is generally covariant in
the sense that it admits the diffeomorphism symmetry. Fori brevity, I will just
sketch the proof of invariance with respect to infinitesimal transformations.
Given a coordinate system xµ on M , a diffeomorphism φ : M → M looks
like a change of coordinates
xµ → x̃µ = φµ (x), (9.155)
where φµ will be smooth with a smooth inverse in the domain of the coordi-
nates. An infinitesimal diffeomorphism comes from a 1-parameter family of
diffeomorphisms φs in the usual way:
φµs (x) = xµ + sV µ (x) + O(s2 ). (9.156)
7
Interestingly and importantly, with a proper normalization the action integral changes
by an additive integer under such gauge transformations. This allows gauge invariance in
the associated quantum field theory.
206 CHAPTER 9. NON-ABELIAN GAUGE THEORY
Here the V µ define – and are defined by – the 1-parameter family of transfor-
mations. Geometrically, V α is a vector field on M . So, for any infinitesimal
diffeomorphism defined by a vector field V α on M , define the infinitesimal
transformation of the gauge field as8
9.12 PROBLEMS
1. Verify that the ek in (9.15) satisfy the su(2) Lie algebra (9.16) as adver-
tised.
8
For those who know about these things, this is the “gauge covariant Lie derivative”
of A.
9.12. PROBLEMS 207
2. (a) Show that the Lagrangian (9.17) is invariant with respect to the
transformation (9.8); (b) compute its Euler-Lagrange equations; (c) derive
the conserved currents (9.19) from Noether’s theorem; (d) verify that the
currents (9.19) are divergence-free when the Euler-Lagrange equations are
satisfied.
3. Show that if any one of the currents (9.19) are conserved, then the other
2 are automatically conserved. (Hint: Consider the behavior of the currents
under SU (2) transformations of the fields.)
4. Show that the adjoint representation (9.40) has the infinitesimal form
(9.42).
Show that
Fµν → U (x)Fµν U −1 (x). (9.162)
Dν Dµ F µν = 0. (9.163)
(Note that this is not quite as trivial as in the U (1) case since covariant
derivatives appear.)
A = A1 e1 , A1 = αµ dxµ . (9.164)
208 CHAPTER 9. NON-ABELIAN GAUGE THEORY
12. Suppose that the gauge field A is such that there exists a covariantly
constant field φ = φi ei ,
Dφ = 0. (9.166)
Show that J µ := φi jiµ (see (9.100)) defines a bona fide conserved current.
14. Show that one can define a permutation symbol ηαβγ , which is a covariant
tensor density of weight plus one. (Do not introduce a metric to lower the
indices!)
15. Show by direct computation just using the definition (9.113) and the
definition of covariant derivative that
D[α Fβγ] = 0. (9.168)
17. Apply Noether’s second theorem to the infinitesimal gauge and diffeo-
morphism symmetries of the Chern-Simons Lagrangian. What are the cor-
responding differential identities satisfied by the Euler-Lagrange expression
(9.139)?
Chapter 10
209
210 CHAPTER 10. GRAVITATIONAL FIELD THEORY
Q = diag(1 , 2 , . . . , n ), (10.4)
line element of the metric represents the invariant spacetime interval and
determines the infinitesimal proper time elapsed along nearby timelike sepa-
rated events and the infinitesimal proper distance between nearby spacelike
separated events.
The metric defines a notion of squared-“length” of vectors and a notion of
“angles” between vectors. Consequently, one can use the metric to define the
difference between two vectors (and other kinds of tensors) at neighboring
spacetime points. Using the metric to define parallelism on the spacetime,
one obtains a derivative operator (or “covariant derivative”) ∇ which is de-
fined as follows. The derivative maps functions to 1-forms using the exterior
derivative:
∇µ φ = ∂µ φ. (10.5)
1
The derivative maps vector fields to tensor fields of type 1 :
∇α v β = ∂α v β + Γβγα v γ , (10.6)
where the Christoffel symbols are given by
1
Γβγα = g βσ (∂γ gασ + ∂α gγσ − ∂σ gαγ ). (10.7)
2
The derivative maps 1-forms to tensor fields of type 02 :
∇α wβ = ∂α wβ − Γγβα wγ . (10.8)
The derivative is extended to all other tensor fields as a derivation (lin-
ear, Leibniz product rule) on the algebra of tensor fields. In particular, the
Christoffel symbols are defined so that the metric determines parallelism; we
have:
∇α gβγ = ∂α gβγ − Γσβα gσγ − Γσγα gβσ = 0. (10.9)
dq µ dq ν
gµν = κ, (10.11)
ds ds
where κ is a given constant; κ > 0 for spacelike curves, κ < 0 for timelike
curves and κ = 0 for lightlike curves.
µ dq ν
PROBLEM: Show that gµν dqds ds
is a constant of motion for (10.10).
2 2
Cαβγδ = Rαβγδ − (gα[γ Rδ]β − gβ[γ Rδ]α ) + Rgα[γ gδ]β .
n−2 (n − 1)(n − 2)
(10.18)
The curvature tensor is completely determined by the Weyl tensor, Ricci
tensor, and Ricci scalar through the following formula:
2 2
Rαβγδ = Cαβγδ + gα[γ Rδ]β − gβ[γ Rδ]α − Rgα[γ gδ]β .
n−2 (n − 1)(n − 2)
(10.19)
10.2. THE GEODESIC HYPOTHESIS 213
dq µ (0)
µ
q (0) = xµ0 , = vµ. (10.21)
ds
Here is a useful mathematical result with an important physical interpre-
tation. Pick a spacetime event, i.e., a point p ∈ M . Events q sufficiently close
to p can be labeled using geodesics as follows. Find a geodesic starting at p
which passes through q. This geodesic will be unique if q is in a sufficiently
small neighborhood of p. Let uµ be the components of the tangent vector at
p which, along with p, provides the initial data for the geodesic which passes
through q. If the geodesic in question is timelike, then normalize uµ to have
length gαβ (p)uα uβ = −1. If the geodesic is spacelike, normalize uµ to have
length +1. If the geodesic is null, normalization is not an issue. The geodesic
in question will pass through q when the affine parameter s takes some value,
say, s0 . Assign to q the coordinates xα = s0 uα . It can be shown that this
construction defines a coordinate chart called geodesic normal coordinates.
This chart has the property that
These relations will not in general hold away from the origin p of the nor-
mal coordinates. Physically, the geodesics correspond to the local reference
frame of a freely falling observer at p. For a sufficiently small spacetime
214 CHAPTER 10. GRAVITATIONAL FIELD THEORY
region around p the observer sees spacetime geometry in accord with special
relativity insofar as (10.22) is a good approximation for any measurements
made. This is the mathematical manifestation of the equivalence principal
of Einstein. Sometimes (10.22) is characterized by saying that spacetime
is “locally flat”, but you should be warned that this slogan is misleading –
spacetime curvature can never be made to disappear in any reference frame.4
Let us see what in fact is the physical meaning of spacetime curvature.
Nearby geodesics will converge/diverge from each other according to the
curvature there. This “tidal acceleration” is the principal effect of gravita-
tion and is controlled by the geodesic deviation equation, which is defined as
follows. Consider a 1-parameter (λ) family of geodesics, xµ = q µ (s, λ). This
means that, for each λ there is a geodesic obeying
d2 q µ (s, λ) α β
µ dq (s, λ) dq (s, λ)
+ Γ αβ = 0, (10.23)
ds2 ds ds
dq µ (s, λ) dq ν (s, λ)
gµν = κ, (10.24)
ds ds
The deviation vector field
∂q µ (s, λ)
rµ (s, λ) = (10.25)
∂λ
defines the displacement of the geodesic labeled by λ + dλ relative to the
geodesic labeled by λ. The relative velocity v µ (s, λ) of the geodesic labeled
by λ + dλ relative to the geodesic labeled by λ is the directional derivative
of the deviation vector in the direction of the geodesic labeled by λ:
∂rµ ∂q σ
v µ (s, λ) = + Γµνσ rν . (10.26)
∂s ∂s
The relative acceleration aµ (s, λ) of the neighboring geodesic labeled by λ+dλ
relative to the geodesic labeled by λ is the directional derivative of the relative
velocity in the direction of the geodesic labeled by λ:
∂v µ ∂q σ
aµ (s, λ) = + Γµνσ v ν . (10.27)
∂s ∂s
4
Roughly speaking, at a given point the zeroth and first derivative content of the metric
can be adjusted more or less arbitrarily by a choice of coordinates, but (some of the) second
derivative information is immutable.
10.3. THE PRINCIPLE OF GENERAL COVARIANCE 215
This is the precise sense in which spacetime curvature controls the relative
acceleration of geodesics. Note that while one can erect a freely falling refer-
ence frame such that (10.22) holds at some event, it is not possible to make
the curvature vanish at a point by a choice of reference frame. Thus the
geodesic deviation (10.28) is an immutable feature of the gravitational field.
One can say that gravity is geodesic deviation.
ϕ ≡ g αβ ∇α ∇β ϕ = 0. (10.29)
This definition can be used to compute the wave equation in any coordi-
nate system, but the form of the equation will, in general, be different in
different coordinate systems because the metric and Christoffel symbols –
which must be specified in order to define the equation for ϕ – are different
in different coordinate systems. For example, suppose that the spacetime is
Minkowski spacetime (so the curvature tensor of g vanishes). Then one can
216 CHAPTER 10. GRAVITATIONAL FIELD THEORY
∗ ∂φµ ∂φν
(φ g)αβ (x) = g̃µν (φ(x)). (10.32)
∂xα ∂xβ
More generally, given any tensor field we can relate its components φ∗ Tβ...
α...
(x)
in terms of xµ and its components T̃β...
α...
(y) in terms of y µ :
−1α
∗ α... ∂φν ∂φ µ...
(φ T )β... (x) = β
... . . . T̃ν... (φ(x)). (10.33)
∂x ∂y µ y=φ(x)
system and produces the curvature tensor R(g) in that same coordinate sys-
tem. Given a diffeomorphism φ – an equivalent presentation of the manifold
– the metric changes to φ∗ g. Applying (10.13) we get the important result:
What this formula says in English is: If you apply the curvature formula to
the transformed metric, the result is the same as applying the formula to
the untransformed metric and then transforming the result. In this sense the
curvature formula is “the same” in all coordinate systems or, more elegantly,
is defined from the metric only using the underlying manifold structure. We
say that the curvature tensor is “naturally” defined in terms of the met-
ric.7 More generally, any tensor field obtained from the curvature using the
metric and covariant derivatives with the metric-compatible connection will
be “naturally” constructed from the metric. We will call such tensor fields
natural.
The principle of general covariance is the requirement that the field equa-
tions for spacetime are naturally constructed from the metric (and any other
matter fields which may be present). Just considering spacetime (with no
matter), let the field equations be of the form G = 0, where G = G(g) is
a tensor constructed naturally from the metric g. Supposed g0 is solution
to the field equations, G(g0 ) = 0. Then, thanks to the property (10.34), we
have
G(φ∗ g0 ) = φ∗ G(g0 ) = 0. (10.35)
This shows that generally covariant field theories on a manifold M have a
very large group of symmetries – the diffeomorphism group of M .
which is constructed naturally from the metric. Since we aim to vary the
metric to get the field equations, we must take care to identify all places
where the metric is used.
A Lagrangian density is meant to be the integrand in an action integral.
For an integral over a manifold of dimension n, the integrand is most properly
viewed as a differential n-form — a completely antisymmetric tensor of type
0
n
. Given a coordinate chart, xα , α = 0, 1, 2, 3, the integral over some
region B in that chart of a function f (x) is
Z Z
0 1 2 3
f (x) dx ∧ dx ∧ dx ∧ dx ≡ f (x) dx0 dx1 dx2 dx3 (10.36)
B B
According to the rules for integrating forms, the integral of the volume
form over a region B is the volume V (B) of B defined by the metric. In
coordinates xa
Z Z p
= |g| dx0 dx1 dx2 dx3 = V (B). (10.39)
B B
10.4. THE EINSTEIN-HILBERT ACTION 219
so that
δg αβ = −g αγ g βδ δgγδ . (10.43)
Henceforth we must remember to make this exception in the usual tensor
notation for raising and lowering indices with the metric. Next, we recall a
couple of results from linear algebra. Let A be a non-singular square matrix.
We have the identity
ln(det(A)) = tr(ln(A)). (10.44)
Now let A(λ) be a non-singular square matrix depending upon a parameter
λ. It follows from (10.44) that
d −1 d
det(A(λ)) = det(A(λ))tr A (λ) A(λ) . (10.45)
dλ dλ
δg = gg αβ δgαβ . (10.46)
After the volume form, the next simplest natural Lagrangian density is
the Einstein-Hilbert Lagrangian:
LEH = (const.)R, (10.48)
where R is the scalar curvature. Einstein’s theory – including the cosmolog-
ical constant Λ – arises from a Lagrangian density which is a combination of
the Einstein-Hilbert Lagrangian density and the volume form:
1
L= (R − 2Λ), (10.49)
2κ
where κ is given in terms of Newton’s constant G and the speed of light c as
8πG
κ=
c4
To compute the Euler-Lagrange expression for L we will need to know
how to compute the variation of the scalar curvature. Here are the results
we will need.8
1
δΓγαβ = g γδ (∇α δgβδ + ∇β δgαδ − ∇δ δgαβ ) , (10.50)
2
δRαβγ δ = ∇β δΓδαγ − ∇α δΓδβγ , (10.51)
δRαγ = δRαβγ β = ∇β δΓβαγ − ∇α δΓββγ . (10.52)
Finally,
δR = δg αγ Rαγ + g αγ δRαγ = −Rαγ δgαγ + ∇σ g αγ δΓσαγ − g σγ δΓββγ (10.53)
where
1
Gαβ = Rαβ − Rg αβ (10.55)
2
is the celebrated Einstein tensor, and the vector field featuring in the diver-
gence is
Θσ = g αγ δΓσαγ − g σγ δΓββγ . (10.56)
The Einstein action for the gravitational field is given by
Z
1
Sgrav [g] = (R − 2Λ). (10.57)
M 2κ
Gαβ + Λg αβ = 0. (10.59)
These are usually called the “vacuum Einstein equations” (with a cosmolog-
ical constant). Using the definition of the Einstein tensor, and contracting
this equation with gαβ (“taking the trace”), we get (in 4-dimensions)
R = 4Λ. (10.60)
The vacuum equations are 10 coupled non-linear PDEs for the 10 compo-
nents of the metric, in any given coordinate system. Despite their complexity,
many solutions are known. The majority of the known solutions have Λ = 0,
but here is a pretty famous solution which includes Λ. It is called the “Kottler
222 CHAPTER 10. GRAVITATIONAL FIELD THEORY
where
(k · )βγδ = k α αβγδ , (10.71)
and
k α = Gαβ + Λg αβ vβ
(10.72)
The last equality in (10.70) follows from the divergence theorem, which can
be expressed as: Z Z
α
(∇α k ) = k · . (10.73)
M ∂M
If the manifold M has no boundary, then the boundary term is absent. The
boundary could be “at infinity”, which can be the limit as r → ∞ of a sphere
of radius r. We assume the diffeomorphism is the identity transformation on
the boundary, which means the vector field v α vanishes there (at a fast enough
rate, in the sphere at infinity case). Either way, the boundary term is absent
and we have the important identity:
Z Z
1 αβ αβ 1
∇α Gαβ vβ ,
δSgrav = ∇α G + Λg vβ = (10.74)
κ M κ M
were we used ∇α gβγ = 0.
Now we finish the argument. We know a priori the action is invariant
under diffeomorphisms. This means that for infinitesimal diffeomorphisms
(which are trivial on the boundary of M )
δSgrav = 0. (10.75)
On the other hand, we have the identity (10.74), which must vanish for any
choice of the vector field v α thanks to (10.75). Evidently, using the usual
calculus of variations reasoning we must have the identity
∇α Gαβ = 0. (10.76)
This is the contracted Bianchi identity; it can be derived from the Bianchi
identity (10.15).
∇α E αβ = 0. (10.77)
Finally, you might re-read §5.5. Then you will see that the discussion above
is just an instance of Noether’s second theorem.
∇α ∇α ϕ − m2 ϕ = 0 (10.79)
You can think of the appearance of the metric in this equation as bringing
into play the effect of the gravitational field on the “motion” of the scalar
field.
There are now EL equations for the metric and for the scalar field. The EL
equation for the scalar field is (10.79), given above. The EL equations for
the metric take the form
1 1
{−Gαγ + Λg αγ } + T αγ = 0, (10.82)
2κ 2
or
Gαβ + Λg αβ = κT αβ , (10.83)
where
1 1
T αγ = ∇α ϕ∇γ ϕ − g αγ ∇β ϕ∇β ϕ − g αγ m2 ϕ2 (10.84)
2 2
10.8. THE CONTRACTED BIANCHI IDENTITY REVISITED 227
is the energy-momentum tensor of the scalar field. The equations (10.83) are
the celebrated Einstein field equations.
PROBLEM: With
Z
1 1 αβ 2 2
S[g, ϕ] = (R − 2Λ) − g ∇α ϕ∇β ϕ + m ϕ (10.85)
M 2κ 2
compute the energy-momentum tensor (10.84) of the scalar field via
2 δS
T αβ = p . (10.86)
|g| δgαβ
Note that here the energy-momentum tensor is not defined via Noether’s
theorem but instead via the coupling of the scalar field to gravity. States of
the combined system of matter (modeled as a scalar field) interacting with
gravity satisfy the coupled system of 11 non-linear PDEs given by (10.79)
and (10.83), known as the Einstein-Klein-Gordon (or Einstein-scalar) system
of equations.
PROBLEM: Show that the following metric and scalar field, given in the
coordinate chart (t, r, θ, φ) define a solution to the Einstein-scalar field equa-
tions for m = 0.
2
g = −r2 dt ⊗ dt + dr ⊗ dr + r2 (dθ ⊗ dθ + sin2 θ dφ ⊗ dφ), (10.87)
1 − 32 Λr2
1
ϕ = ± √ t + const. (10.88)
κ
You can think of this as a necessary condition for a metric and scalar field to
satisfy the Einstein-scalar field system. Let me show you how this condition
on the energy-momentum tensor is guaranteed by the KG equation via the
diffeomorphism invariance of the action for the metric and scalar field.13
The analysis is very much like the one we did for the vacuum theory.
Return to the Lagrangian (10.81) and form the action integral over some
region M :
Z
1 1 αβ 2 2
S[g, ϕ] = (R − 2Λ) − g ∇α ϕ∇β ϕ + m ϕ . (10.90)
M 2κ 2
This identity must hold for all v α of compact support. Integrating by parts
via the divergence theorem, with the boundary terms vanishing because v α
13
You should compare this discussion with its analog in scalar electrodynamics in §6.3.
10.8. THE CONTRACTED BIANCHI IDENTITY REVISITED 229
∇α Tβα − ∇α ∇α ϕ − m2 ϕ ∇β ϕ = 0, .
(10.98)
From this identity it is easy to see that any solution (g, ϕ) to the KG
equation (10.79) will obey the compatibility condition (10.89). Recall that
(10.89) was a necessary condition implied by the Einstein field equations
(10.83) and the contracted Bianchi identity. In this way the coupled Einstein-
scalar field equations are compatible. But it is even more interesting to note
that any solution (g, ϕ) to the Einstein field equations (10.83) alone, provided
∇ϕ 6= 0, will automatically satisfy the KG equation (10.79)! The logic goes
as follows. If (g, ϕ) define a solution to (10.83) then by the Bianchi identity
the energy-momentum tensor built from the solution (g, ϕ) has vanishing
covariant divergence. But then the identity (10.98) and the assumption that
ϕ is not constant means that (g, ϕ) must satisfy (10.79). So, if we opt to
work on the space of non-constant functions ϕ only, which is reasonable
since constant functions cannot satisfy the KG equation with m 6= 0, then
the equations of motion for matter are already contained in the Einstein field
equations!
The divergence condition ∇a T ab = 0 is closely related to – but not quite
the same as – conservation of energy and momentum. You will recall that
in flat spacetime conservation of energy-momentum corresponds to the fact
that the energy-momentum tensor is divergence free, which is interpreted as
providing a divergence-free collection of currents. Here we do not have an
230 CHAPTER 10. GRAVITATIONAL FIELD THEORY
For this reason one cannot use the divergence theorem to convert (10.89) into
a true conservation law.14 Physically, this state of affairs reflects the fact that
the matter field ϕ can exchange energy and momentum with the gravitational
field; since the energy momentum tensor pertains only to the matter field
it need not be conserved by itself. In a freely falling reference frame the
effects of gravity disappear to some extent and one might expect at least
an approximate conservation of energy-momentum of matter. This idea can
be made mathematically precise by working in geodesic normal coordinates
(defined earlier). At the origin of such coordinates the Christoffel symbols
vanish and the coordinate basis vectors can be used to define divergence
free currents at the origin. So, in an infinitesimally small region around the
origin of such coordinate system – physically, in a suitably small freely falling
reference frame – one can define an approximate set of conservation laws for
the energy-momentum (and angular momentum) of matter. But because
gravity can carry energy and momentum this conservation law cannot be
extended to any finite region. The crux of the matter is that there is no
useful way to define gravitational energy-momentum densities – there is no
suitable energy-momentum current for gravity. Indeed, if you had such a
current, you ought to be able to make it vanish at the origin of normal
coordinates, whence it can’t be a purely geometrical quantity or it should
vanish in any coordinates. In fact, it is possible to prove that any vector field
locally constructed from the metric and its derivatives to any order which
is divergence-free when the Einstein equations hold is a trivial conservation
law in the sense of §3.20.15
The upshot of these considerations is that gravitational energy, momen-
tum, and angular momentum will not occur via the usual mechanism of
conserved currents. Thanks to the equivalence principle, gravitational en-
ergy, momentum, and angular momentum must be non-local in character.
You may recall that an analogous situation arose for the sources of the Yang-
14
There are actually two reasons for this. First of all, as just mentioned, the ordinary
divergence is not zero! And, even if it were, there is no useful definition of the integral of a
vector (namely, the divergence of T ) over a volume in the absence of additional structures,
such as preferred vector fields with which to take components.
15
I. Anderson and C. Torre, Phys. Rev. Lett. 70, 3525 (1993).
10.9. PROBLEMS 231
Mills field. Despite our inability to localize gravitational energy and momen-
tum, it is possible to define a notion of conserved total energy, momentum,
and angular momentum for gravitational systems which are suitably isolated
from their surroundings. So, for example, it is possible to compute these
quantities for a star or galaxy if we ignore the rest of the universe. Based on
our preceding discussion you will not be surprised to hear that such quanti-
ties are not constructed by volume integrals of densities. Perhaps a future
version of this text will explain how all that works.
10.9 PROBLEMS
9. With
Z
1 1 αβ 2 2
S[g, ϕ] = (R − 2Λ) − g ∇α ϕ∇β ϕ + m ϕ (10.101)
M 2κ 2
232 CHAPTER 10. GRAVITATIONAL FIELD THEORY
2 δS
T αβ = p . (10.102)
|g| δgαβ
10. Show that the following metric and scalar field, given in the coordinate
chart (t, r, θ, φ) define a solution to the Einstein-scalar field equations for
m = 0.
2
g = −r2 dt ⊗ dt + 2 dr ⊗ dr + r2 (dθ ⊗ dθ + sin2 θdφ ⊗ dφ), (10.103)
1− 3
Λr2
1
ϕ = ± √ t + const. (10.104)
κ
11. Repeat the analysis of §10.7 and §10.8 for the case where the matter field
is the electromagnetic field defined by
1
LEM = − g αγ g βδ Fαβ Fγδ , (10.105)
4
where
Fαβ = ∇α Aβ − ∇β Aα = ∂α Aβ − ∂β Aα . (10.106)
∇α T αβ = ∇β ϕ ∇α ∇α ϕ − m2 ϕ .
(10.107)
Chapter 11
Goodbye
If you have stayed with me to the end of this exhilarating mess, I congratulate
you. If you feel like there is a lot more you would like to learn about classical
field theory, then I agree with you. I have only given a simple introduction
to some of the possible topics. Slowly but surely I hope to add more to this
text, correct errors, add problems, etc. so check back every so often to see
if the version has been updated. Meanwhile, the next section has a list of
resources which you might want to look at, depending upon your interests.
233
234 CHAPTER 11. GOODBYE
Classical Field Theory and the Stress-Energy Tensor, S. Swanson, Morgan &
Claypool Publishers