Quantum Gravity Lectures
Quantum Gravity Lectures
Quantum Gravity Lectures
Thomas Hartman
Cornell University
Abstract These are the lecture notes for a one-semester graduate course on black
holes and quantum gravity. We start with black hole thermodynamics, Rindler space,
Hawking radiation, Euclidean path integrals, and conserved quantities in General Rel-
ativity. Next, we rediscover the AdS/CFT correspondence by scattering fields off near-
extremal black holes. The final third of the course is on AdS/CFT, including correlation
functions, black hole thermodynamics, and entanglement entropy. The emphasis is on
semiclassical gravity, so topics like string theory, D-branes, and super-Yang Mills are
discussed only very briefly.
Prereqs This course is aimed at graduate students who have taken 1-2 semesters of
general relativity (including: classical black holes, Penrose diagrams, and the Einstein
action) and 1-2 semesters of quantum field theory (including: Feynman diagrams, path
integrals, and gauge symmetry.) No previous knowledge of quantum gravity or string
theory is necessary.
1
Contents
1.3 Homework . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
4.2 Wavefunctions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
2
4.6 Vacuum correlation functions . . . . . . . . . . . . . . . . . . . . . . . 46
7.3 Action . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
8.3 Energy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
9 Symmetries of AdS3 97
3
9.1 Exercise: Metric of AdS3 . . . . . . . . . . . . . . . . . . . . . . . . . . 97
4
15.1 Vacuum correlation functions in CFT . . . . . . . . . . . . . . . . . . . 138
5
19.2 Lorentz invariance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 178
6
25 Thermodynamics of 2d CFT 214
7
1 The problem of quantum gravity
These are lectures on quantum gravity. To start, we better understand clearly what
problem we are trying to solve when we say ‘quantum gravity.’ At low energies, the
classical action is
√
Z
1
S= −g (R − 2Λ + Lmatter ) . (1.1)
16πGN
Why not just quantize this action? The answer of course is that it is not renormalizable.
This does not mean it is useless to understand quantum gravity, it just means we have
to be careful about when it is reliable and when it isn’t. In this first lecture we will
consider gravity as a low-energy effective field theory,∗ see when it breaks down, and
make some general observations about what we should expect or not expect from the
UV completion.
1. Write down the most general possible action consistent with the symmetries;
5. Trust your answer only if the neglected terms in the derivative expansion are
much smaller than the terms you kept.
This works for renormalizable or non-renormalizable theories. Let’s follow the steps
for gravity. Our starting assumption is that nature has a graviton — a massless spin-2
field. This theory can be consistent only if it is diffeomorphism invariant.
∗
Donaghue gr-qc/9512024.
8
Counting metric degrees of freedom
This can be argued various ways∗ ; we’ll just count degrees of freedom. In 4D, a massless
particle has two degrees of freedom (2 helicities). Similarly, the metric gµν has†
1
2
D(D + 1) − D − D = 21 D(D − 3) (1.3)
degrees of freedom.
Note that in D = 3, the metric has no (local) dof. It turns out that it does have some
nonlocal dof; this will be useful later in the course.
√
Z
1
−g −2Λ + R + c1 R2 + c2 Rµν Rµν + c3 Rµνρσ Rµνρσ + · · ·
S= (1.4)
16πGN
So the general theory is the Einstein-Hilbert term plus higher curvature corrections.
We have ignored the matter terms Lmatter and matter-curvature couplings, like φR.
∗
See Weinberg QFT V1, section 5.9, and the discussion of the Weinberg-Witten theorem below,
and Weinberg Phys. Rev. 135, B1049 (1964).
†
In more detail: a 4x4 symmetric matrix has 10 independent components. In 4D we have 4
0
functions worth of diffeomorphisms, xµ → xµ (xµ ). And g̈0µ cannot appear in a 2nd order diff-invariant
equation of motion, so these components are non-dynamical. For more details, see the discussion of
gravitational waves in any introductory GR textbook, which should show that in transverse-traceless
gauge the linearized Einstein equation have two independent solutions (the ‘+’ and ‘×’ polarizations).
‡
See Weinberg QFT V1, section 2.5.
9
Step 3: Coefficients = scale of new physics
Coefficients should be fixed by dimensional analysis, up to O(1) factors. This doesn’t
work for the cosmological constant: experiment (ie the fact the universe is not Planck-
sized) indicates that Λ is unnaturally small. This is the cosmological constant problem.
We will just sweep this under the rug, take this fine-tuning as an experimental fact,
and proceed to higher order.
For these purposes let’s take the coordinates to have dimensions of length, so the metric
is dimensionless, and R has mass dimension 2. The action should be dimensionless
(since ~ = 1). Looking at the Einstein-Hilbert term, that means [GN ] = 2 − D, so in
terms of the Planck scale,
1
≡ (MP )D−2 . (1.5)
GN
In D > 2, this term is not renormalizable. This means that the theory is strongly
coupled at the Planck scale. If we try to compute scattering amplitudes using Feynman
diagrams, we would find non-sensical, non-unitary answers for E & MP . The rules
of effective field theory tell us that we must include the R2 terms, with coefficients
c1,2,3 ∼ 1/MP2 . Higher curvature terms should also be included, suppressed by more
powers of MP . More generally, the rule is that these coefficients should be suppressed
by the scale of new physics, which we will call Ms . New physics must appear at or
below the Planck to save unitarity, so Ms < MP , but it’s possible that Ms MP . So
to allow for this possibility, we set
1
c1,2,3 ∼ . (1.6)
Ms2
1
Ms ∼ , c1,2,3 ∼ α0 (1.7)
`s
where `s is the string length and α0 = `2s is the string tension. In this context the R2
and higher curvature terms in the action are called ‘stringy corrections.’
10
flat space) with the action (1.4), we set
1
gµν = ηµν + hµν (1.8)
MP
where the first terms come from expanding the Einstein action, and the other terms
come from the higher curvature corrections.∗ In curved space, the higher curvature
terms would also contribute to the terms like h∂h∂h since R ∼ const + ∂ 2 h + · · · .
√ √
= GN GN + crosses + GN (1.10)
+ + + ···
√
This is an expansion in the coupling constant GN = 1/MP ; but this is dimensionful, so
it must really be an expansion in E/MP . That is, each diagram contributes something
2
∗ Ms
The term written here comes from a term in the action MP R2 , where we pick off the terms
√ 1
−g ∼ 1 + δg, R ∼ ∂ 2 δg, then rescale δg = MP h.
11
of order
1+number of loops
E2
. (1.11)
MP2
So the strong coupling scale, where loop diagrams are the same size as tree diagrams,
is
Estrong ∼ MP . (1.12)
Below the strong coupling scale, this is a perfectly good quantum theory. We can use
it to make reliable predictions about graviton-graviton scattering, including calculable
loop corrections.
If there are two different scales Ms and MP with Ms MP the situation is slightly
more complicated. The Einstein term is strongly coupled at the Planck scale, but
looking at (1.9), the higher curvature terms become strongly coupled at a lower scale
somewhere between Ms and MP ,
for some x ∈ (0, 1). (This is a known number that you can find by examining all the
diagrams.) It is important, however, that interactions in the higher curvature terms
still come with powers of 1/MP , so even in this case Estrong contains some factor of MP
(ie, x > 0): the theory is still weakly coupled at the scale of new physics, E ∼ Ms .
1 1 1
−2
= 2− 2 (1.15)
q2 + Ms q 4 q q + Ms2
12
The 2nd term looks like a massive field with mass Ms , but the wrong sign. It is a new,
non-unitary degree of freedom, or ‘ghost’, in the classical theory. Although the only
field is still the metric, it makes sense that we’ve added a degree of freedom because
we need more than 2 functions worth of initial data to solve the 4th order equation of
motion (1.14).
The ‘ghost’ should not bother us, because it appears at the scale Ms . This is the scale
of new physics where we should not trust our effective field theory anyway. And at
energies E Ms , the ghost has no effect on classical gravity. To see this, let’s compute
the classical potential between two massive objects. The first term in (1.15) gives the
Newtonian 1/r potential. The second term looks like a massive Yukawa force, so the
classical potential is
1 e−rMs
V (r) = −GN m1 m2 − . (1.16)
r r
This tiny for distances r > 1/Ms .
1 1 1 a 4 q2 1
+ + ··· ∼ − + q log + ···
q 2 Ms2 q 2 MP2 Λ2 q 2
(1.17)
1 1 1
−2
= 2 − 2 + ··· , (1.18)
q2 + Ms q 4 q Ms
and the log term is the loop diagram (including external legs!). a is an order 1 number
that can been calculated from this diagram, and we’ve dropped some terms to simplify
the discussion (see Donaghue for details). To calculate the attractive potential between
13
two stationary masses, we set the frequency to zero q 0 = 0 and go to position space,∗
Z
1 1 1 1 1 1
3
d ~q 2
+ 2 − 2 log ~q 2 + · · · ei~q·~x ∼ + 2 δ(r) + 2 3 (1.19)
~q Ms MP r Ms MP r
This Fourier transform is just done by dimensional analysis (we’ve dropped numerical
coefficients). The first term is the classical Newtonian potential, the second term is
the classical higher-curvature correction, and the last term is a quantum correction.
The delta function does not matter at separated points; it is UV physics and does
not affect the potential. It came from the same physics as the Yukawa term e−rMs in
our discussion above— the difference is that the Yukawa term is the exact classical
contribution whereas the delta function comes from expanding out the propagator in
a derivative expansion.
The last term in (1.19) is a reliable prediction of quantum gravity, with small but
non-zero effects at low energies.
Does Ms = MP ?
So does Ms = MP , or is there a new scale Ms MP ? This is basically the question
of whether the new UV physics that fixes the problems of quantum gravity is weakly
coupled (Ms MP ) or strongly coupled (no new scale). Both options are possible,
and both are realized in different corners of string theory.
If we ask the analogous question about other effective field theories that exist in nature,
then sometimes the new physics is strongly coupled (for example, QCD as the UV
completion of the pion Lagrangian) and sometimes it’s weakly coupled (for example,
electroweak theory as the UV completion of Fermi’s theory of beta decay).
Breakdown
As argued above, the effective field theory breaks down at (or below) MP . It is conceiv-
able that this is just a problem with perturbation theory, and that the theory makes
sense non-perturbatively, for example by doing the path integral on a computer. The
problem is that the theory is UV-divergent and must be renormalized; this means we
do not have the option of just plugging in a particular action, say just the Einstein term
∗
See for example Peskin and Schroeder section 4.7.
14
R√
−gR, and using this to define a quantum theory. We must include the full series of
higher curvature terms. Each comes with a coupling constant, so we have an infinite
number of tunable parameters and lose predictive power. The only way this theory
can make predictions is if these infinite number of running couplings flow under RG to
a UV fixed point with a finite number of parameters. This idea is called ‘asymptotic
safety,’ and although it’s a logical possibility there is little evidence for it. It is not
how other effective fields theories in nature (eg pions) have been UV completed.
In an ordinary quantum field theory without gravity, in flat spacetime, there two types
of physical observables that we most often talk about are correlation functions of
gauge-invariant operators hO1 (x1 ) · · · On (xn )i, and S-matrix elements. The correlators
are obviously gauge-independent. S-matrix elements are also physical, even though
electrons are not gauge invariant. The reason is that the states used to define the
S-matrix have particles at infinity, and gauge transformations acting at infinity are
true symmetries. They take one physical state to a different physical state — unlike
local gauge transformations, which map a physical state to a different description of
the same physical state.
In gravity, local diffeomorphisms are gauge symmetries. They are redundancies. This
means that local correlation functions like hO1 (x1 ) · · · On (xn )i are not gauge invariant,
15
and so they are not physical observables.∗ On the other hand, diffeomorphisms that
reach infinity (like, say, a global translation) are physical symmetries — taking states
in the Hilbert space to different states in the Hilbert space — so we get a physical
observable by taking the insertion points to infinity. This defines the S-matrix, so it is
sometimes said that ‘The S-matrix is the only observable in quantum gravity.’
This is not quite true, since there are also non-local physical observables. For example,
suppose we send in an observer from infinity, along a worldline xµ0 (τ ), with τ the
proper time along the path. Although the coordinate value xµ0 (τ ) depends on the
coordinate system, it unambiguously labels a physical point on the manifold; that is,
0
xµ0 (τ1 ) labels the same physical point as xµ (τ1 ) in some other coordinates. Therefore
hO1 (x0 (τ1 )) · · · On (x0 (τn ))i should be a physical prediction of the theory, which answers
the physical question ‘If I follow the path xµ0 (τ ), carrying an O-meter, what do I
measure?’. So apparently, to construct diff-invariant physical observables, we need to
tie them to infinity. Although this sounds like a straightforward fix, it is actually a
radical departure from ordinary, local quantum field theory.
Based on this analogy, we might guess that the UV completion of gravity is an ordinary,
D = 4 quantum field theory with no graviton, and that the graviton is a emergent
degree of freedom at low energies. This is wrong. The graviton may be an emergent
degree of freedom, but it cannot come from an ordinary D = 4 quantum field theory
in the UV. The reason is the Weinberg-Witten theorem:†
∗
This is true in the effective field theory of gravity too, not just in the UV. However in perturbation
theory it is not a problem. In perturbation theory, coordinates xµ label points on the fixed background
manifold, which are meaningful. It is only when we allow the geometry to fluctuate wildly that this
really becomes a problem.
†
See the original paper for the proof, it is short and clear. The basic idea is to first argue that,
since states carry energy, hp|T 00 (0)|pi 6= 0. However, if these are single-particle states for a particle
with helicity ±2, then there is just now way for hp0 |Tµν |pi to transform properly under rotations unless
it vanishes.
16
A 4D Lorentz-invariant QFT with a conserved, gauge-invariant stress tensor Tµν cannot
have massless particles with spin > 1.
This does not rule out general relativity itself because Tµν is not gauge invariant in GR
(or, equivalent, the physical part of Tµν is not really a Lorentz tensor). It does rule
out a composite graviton: If gravity emerged from an ordinary QFT, then in the UV
there is no diffeomorphism symmetry, the stress tensor is gauge invariant, so there can
be no graviton in the spectrum.
Emergent spacetime
So the theory of the graviton is sick in the UV, but if we stick to ordinary QFT we
cannot eliminate the graviton in the UV. This leaves two possibilities. One is that
the graviton appears in the UV theory, along with other degrees of freedom which
cure the problems seen in effective field theory. The other is that the graviton is an
emergent degree of freedom, but the UV theory is not an ordinary 4D QFT. These
are not mutually exclusive, and in fact both of these possibilities are realized in string
theory (simultaneously!).
In this course we will focus on the second possibility. We will discuss models where
not only the graviton, but spacetime itself is emergent. The fundamental degrees of
freedom of the theory do not live in the same spacetime as the final theory, or in
some cases do not live in any spacetime at all. Spacetime is an approximate, collective
description of these underlying degrees of freedom, and makes sense only the infrared.
The graviton is emergent, but evades Weinberg-Witten because the way it emerges is
outside the usual framework of QFT.
A look ahead
There are many ways to approach this subject. In this course we will take a route that
begins and ends with black holes. Unlike other EFTs (eg the pion Lagrangian), the
Einstein action contains an enormous amount of information about the UV completion
— infrared hints about the ultraviolet. Much of this information is encoded in the
thermodynamics of black holes, so that is our starting point, and will be the basis of
the first half of the course. As it turns out, black holes also lead to emergent spacetime
and the AdS/CFT correspondence, which are the topics of the second half of the course.
17
1.3 Homework
Review the chapter on black holes in Carroll’s textbook (or online lecture notes) on
General Relativity.
18
2 The Laws of Black Hole Thermodynamics
In classical GR, black holes obey ‘laws’ that look analogous to the laws of thermody-
namics. These are classical laws that follow from the Eintsein equations. Eventually,
we will see that in quantum gravity, this is not just an analogy: these laws are the
ordinary laws of thermodynamics, governing the microsopic UV degrees of freedom
that make up black holes.
∆E = Q (2.1)
where Q is the heat transferred to the system.∗ For quasistatic (reversible) changes
from one equilibrium state to a nearby equilibrium state, δQ = T dS so the 1st law is
T dS = dE . (2.2)
Often we will turn on a potential of some kind. For example, in the presence of an
ordinary electric potential Φ, the 1st law becomes
T dS = dE − ΦdQ (2.3)
where Q is the total electric charge. If we also turn on an angular potential, then the
1st law is
T dS = dE − ΩdJ − ΦdQ . (2.4)
The second law of thermodynamics is the statement that in any physical process,
entropy cannot decrease:
∆S ≥ 0 . (2.5)
These laws can of course be derived (more or less) from statistical mechanics. In the
microscopic statistical theory, the laws of thermodynamics are not exact, but are an
∗
Often the rhs is written Q + W where −W is the work done by the system. We’ll set W = 0.
19
extremely good approximation in a system with many degrees of freedom.
√
Z
1
S= d4 x −g (R − Fµν F µν ) (2.6)
16π
2 δS matter
1 1
Tµν =− √ µν
= − gµν Fαβ F αβ + Fµγ Fν γ . (2.9)
−g δg 4π 4
dr2
ds2 = −f (r)dt2 + + r2 dΩ22 (2.10)
f (r)
with
2M Q2
f (r) = 1 − + 2 , (2.11)
r r
and an electromagnetic field
Q Q
Aµ dxµ = − dt, so Frt = . (2.12)
r r2
This component of the field strength is the electric field in the radial direction, so this
is exactly the gauge field corresponding to a point source of charge Q at r = 0.
∗
See Carroll Chapter 6 for background material.The factors of 2 in that chapter are confusing; see
appendix E of Wald for a consistent set of conventions similar to the ones we use here.
20
This is a static, spherically symmetric, charged black hole. There is nothing on the rhs
of the Maxwell equation (2.8), so the charge is carried by the black hole itself; there are
no charged particles anywhere. The parameter Q in the solution is the electric charge;
this can be verified by the Gauss law,
Z Z
1 1 2
Qelectric = ?F = r dΩ2 Frt = Q . (2.13)
4π ∂Σ 4π
This integral is over the boundary of a fixed-time slice Σ, ie a surface of constant t and
constant r 1.
We will always consider the case M > Q > 0. If |Q| > M , then r+ < 0, so the
curvature singularity is not hidden behind a horizon. This is called a naked singularity,
and there are two reasons we will ignore it: First, there is a great deal of evidence for
the cosmic censorship conjecture, which says that reasonable initial states never lead
to the creation of naked singularities.∗ Second, if there were a naked singularity, then
physics outside the black hole depends on the UV (since the naked singularity can spit
out visible very heavy particles), and we should not trust our effective theory anyway.
∗
There are also interesting violations of this conjecture in some situations, but in mild ways [Pre-
torious et al.]
21
2.3 The 1st law
Now we will check that this black hole obeys an equation analogous to (2.3), if we
define an ‘entropy’ proportional to the area of the black hole horizon:
1
S≡ × Area of horizon . (2.15)
4~GN
We’ve temporarily restored the units in order to see that this is the area of the horizon
√
in units of the Planck length `P = ~GN . (Now we’ll again set GN = ~ = 1.) For now
this is just a definition but we will see later that there is a deep connection to actual
entropy. The horizon has metric ds2 = r+
2
dΩ22 , so the area is simply
p
2
A = 4πr+ = 4π(M + M 2 − Q2 )2 . (2.16)
T dS = dM − ΦdQ (2.18)
with
p
r+ − r− M 2 − Q2 Q Q
T ≡ 2
= p , Φ= = p . (2.19)
4πr+ 2π(M + M 2 − Q2 )2 r+ M + M 2 − Q2
M , the mass of the black hole, is the total energy of this spacetime, so this makes
sense. Φ also has a natural interpretation:
But, we have no good reason yet to call T the ‘temperature’ or S the ‘entropy’ (this
will come later).
22
The 1st law relates two nearby equilibrium configurations. There are two ways we
can think about it: (i) as a mathematical relation on the space of solutions to the
equations, or (ii) dynamically, as what happens to the entropy if you throw some
energy and charge into the black hole.
κ
T = , (2.21)
2π
which is defined physically as the acceleration due to gravity near the horizon (which
goes to infinity) times the redshift factor (which goes to zero). If you stand far away
from the black hole holding a fishing pole, and dangle an object on your fishing line
near so it hovers near the horizon, then you will measure the tension in your fishing
line to be κMobject . It can be shown that κ is constant everywhere on the horizon
of a stationary black hole. This is analogous to the ‘0th law of thermodynamics’: in
equilibrium, temperature is constant.
T ∝~. (2.22)
dr2
2 2 2 2
ds = ` −(r − 8M )dt + 2 + r2 dφ2 , (2.23)
r − 8M
(a) Compute the area of the black hole horizon to find the entropy.
23
(b) Vary the entropy, and compare to the 1st law T dS = dM to find the temperature
of the black hole.
(c) Put all the factors of GN and ~ back into your formulas for S and T . S should
be dimensionless and T should have units of energy (since by T we always mean
T ≡ kB Tthermodynamic ). Does T have any dependence on GN ?
∆(r) ρ2 1
ds2 = − 2
(dt−a sin2
θdφ)2
+ dr2 +ρ2 dθ2 + 2 sin2 θ(adt−(r2 +a2 )dφ)2 , (2.24)
ρ ∆(r) ρ
where
∆(r) = r2 + a2 − 2M r , ρ2 = r2 + a2 cos2 θ , (2.25)
and −M < a < M . This describes a rotating black hole with mass M and angular
momentum
J = aM . (2.26)
(b) The first law of thermodynamics, in a situation with an angular potential Ω, takes
the form
T dS = dM − ΩdJ . (2.28)
Use this to find the temperature and angular potential of the Kerr black hole in terms
of M, a. (Hint: The angular potential can also be defined as the angular velocity of
the horizon: Ω = − ggtφ
tt
|r=r+ .)
24
2.4 The 2nd law
The second law of thermodynamics says that entropy cannot decrease: ∆S ≥ 0. This
law does not require a quasistatic process; it is true in any physical process, including
those that go far from equilibrium. (For example, if gas is confined to half a box, and
we remove the partition.)
Hawking proved, directly from the Einstein equation, that in any physical process
the area of the event horizon can never decrease. This parallels the second law of
thermodynamics! This is a very surprising feature of these complicated nonlinear
PDEs. We will not give the general proof; see Wald’s textbook.
25
q Q. This will change the mass and charge of the black hole,
M → M + , Q→Q+q . (2.29)
Although initially there will be some fluctuations in the spacetime and ripples on the
horizon from the particle that just passed through, these will quickly decay so that
we have once again the Reissner-Nordstrom solution, now with the new energy and
charge. Therefore, in this process the area of the black hole horizon changes according
to the 1st law (2.18),
1
δS = ( − Φq) . (2.30)
T
(a) The infalling particle follows a trajectory xµ (τ ) where τ is proper time. Its 4-
momentum is
dxµ
pµ = . (2.31)
dτ
In a spacetime with a time-translation Killing vector ζ (t) , the energy of a charged
particle
= −(p + qA) · ζ (t) . (2.32)
This is conserved along the path of the particle (which is not a geodesic, since it feels an
electromagnetic force). For a charged particle on the Reissner-Nordstrom black hole,
find in terms of f (r) and the components of pµ .
(b) Assume Q > 0. For one sign of q, the energy can be negative. Which sign?
If we drop a negative-energy particle into a black hole, the mass of the black hole
decreases. Therefore it is possible to extract energy using this process. For uncharged
but rotating black holes, a similar procedure can be used to extract energy in what
is called the Penrose process. Particles far from the black hole cannot have negative
energy, so negative-energy orbits are always confined to a region near the horizon. This
region is called the ergosphere.
(c) Although we can decrease the energy of the charged black hole, we cannot decrease
the entropy. To show this, we need to find the minimal energy of an orbit crossing the
horizon. Assume the particle enters the horizon along a purely radial orbit,∗ pθ = pφ =
∗
This assumption is not necessary. In the general case, the particle can add angular momentum to
the black hole, so we need to consider the charged, rotating Kerr-Newman spacetime. This is treated
26
0. The proper time along the orbit is
dr2
dτ 2 = −ds2 = f (r)dt2 − . (2.33)
f (r)
(d) is conserved along the orbit, so you can evaluate it where the particle crosses the
horizon, r = r+ . Show that the minimal value of is
(e) Reversible processes are those in which ∆S = 0. How would you reversibly drop
a charged particle into a Reissner-Nordstrom black hole? (i.e., what charge would it
have and how would you drop it?)
In a general theory of gravity including curvature corrections, the formula for the
entropy also receives corrections,
Area
S= + higher curvature corrections . (2.35)
4
The more general formula is called the ‘Wald entropy’. We will postpone the general
discussion of the Wald entropy until later; for now suffice it to say that the 1st law still
holds. The 2nd law, however, does not. There are known counterexamples involving
black hole collisions.∗ To my knowledge this is not fully understood. A likely explana-
in detail in [MTW section 33].
∗
See arXiv: [hep-th/9305016], [0705.1518], [1011.4988].
27
tion is that this signals a breakdown of the effective field theory — i.e., that when these
violations occur we must include higher corrections or corrections from new physics in
the UV.
We have seen that the classical Einstein equations lead to laws of black hole mechanics
that are analogous to the laws of thermodynamics. In quantum gravity, it is not just
an analogy.
Temperature
What we called ‘T ’ is a true temperature: black holes radiate as blackbodies with
temperature T . This is Hawking radiation. It does not rely on quantizing gravity itself
— it is a feature of quantum field theory in curved space, which will be derived in the
next couple lectures.
If you throw a cup of hot tea into a black hole, then this entropy seems to vanish. This is
puzzling, because if we didn’t know about black hole entropy, we might conclude that
the ordinary 2nd law (applied to the tea) had been violated by destroying entropy.
However, the generalized second law guarantees that in this process the area of the
horizon will increase, and this will (at least) make up for lost entropy of the tea.
Counting microstates
Finally, we know that in quantum mechanics, entropy is supposed to count the states
28
of a system:
S(E) = log (# states with energy E) . (2.37)
29
3 Rindler Space and Hawking Radiation
The next couple of lectures are on Hawking radiation. There are many good refer-
ences to learn this subject, for example: Carroll’s GR book Chapter 9; Townsend
gr-qc/9707012; Jacobson gr-qc/0308048. Therefore in these notes I will go quickly
through some of the standard material, but slower through the material that is hard
to find elsewhere. I strongly recommend reading Carroll’s chapter too.
Hawking radiation is a feature of QFT in curved spacetime. It does not require that
we quantize gravity – it just requires that we quantize the perturbative fields on the
black hole background. In fact we can see very similar physics in flat spacetime.
Looking at (3.2), we see x2 − t2 = R2 > 0, so the Rindler coordinates only cover the
patch of Minkowski space with
This is the ‘right wedge’, which covers one quarter of the Penrose diagram.
30
Higher dimensional Rindler space is
so we can map this to a patch of R1,D−1 by the same coordinate change. The other
coordinates just come along for the ride.
Black holes have an approximate Rindler region near the horizon. For example, start
with the Schwarzschild solution
dr2 2M
ds2 = −f (r)dt2 + + r2 dΩ22 , f (r) = 1 − . (3.6)
f (r) r
r = 2M (1 + 2 ), so f (r) ≈ 2 (3.7)
∗
Reminder: The notation ζ = ∂η means, in components, ζ µ ∂µ = ∂η , i.e., ζ = η̂.
31
and expand the metric at small ,
The (t, ) piece of this metric is Rindler space (we can rescale t and to make it look
exactly like (3.1)).
Although 2d Rindler is a solution of the Einstein equations, the metric written in (3.8)
(excluding the dots) is R1,1 × S 2 . This is not a solution of the Einstein equations. It
is only an approximate solution for small .
Now we will give a ‘trick’ to derive the Hawking temperature. The trick is to argue that
that, in the black hole metric, the time coordinate must be periodic in the imaginary
direction, and this imaginary periodicity implies that the black hole has a temperature.
Actually, this trick is completely correct, and we will justify it later from the path
integral, but don’t expect this subsection to be very convincing yet!
First we want to argue that QFT at finite temperature is periodic in imaginary time,
with periodicity
t ∼ t + iβ, β = 1/T . (3.9)
We will return this in detail later, but for now one way to see it is by looking at the
thermal Green’s function∗
1
Gβ (τ, x) ≡ − Tr ρthermal TE [O(τ, x)O(0, 0)] = − Tr e−βH TE [O(τ, x)O(0, 0)] , (3.10)
Z
where τ = it is Euclidean time, and TE means Euclidean-time ordering (i.e., put the
∗
This definition holds for −β < τ < β. A good reference for the many types of thermal Green’s
functions is Fetter and Walecka, Quantum Theory of Many-particle Systems, 1971, in particular chap-
ters 7 and 9. The function Gβ defined here is equal to the Euclidean Green’s function on a cylinder
that we will discuss later (up to normalization).
32
larger value of τ on the left). This is periodic in imaginary time,∗
1
Gβ (τ, x) = − Tr e−βH O(τ, x)O(0, 0) (3.11)
Z
1
= − Tr O(0, 0)e−βH O(τ, x) (3.12)
Z
1
= − Tr e−βH O(β, 0)O(τ, x) (3.13)
Z
= Gβ (τ − β, x) (3.14)
Now returning to black holes, Rindler space (3.1) is related to polar coordinates, dR2 +
R2 dφ2 with η = iφ. Polar coordinates on R2 are singular at the origin unless φ is a
periodic variable, φ ∼ φ + 2π. Therefore η is periodic in the imaginary direction,
η ∼ η + 2πi . (3.15)
Going back through all the coordinate transformations relating the near horizon black
hole to Rindler space, this implies that the Schwarzschild coordinate t has an imaginary
periodicity
t ∼ t + iβ , β ≡ 8πM . (3.16)
1 1
T = = . (3.17)
β 8πM
This agrees with the Hawking temperature derived from the first law, (2.19) (setting
Q = 0). As mentioned above, this derivation probably is not very convincing yet, but
it is often the easiest way to calculate T given a black hole metric.
33
Exercise: Kerr periodicity
Difficulty level: moderate – a few pages
Field theory at finite temperature and and angular potential is periodic in imaginary
time, but with an extra shift in the angular direction:
(a) Derive (3.18) by an argument similar to (3.11). The density matrix for QFT at
finite temperature and angular potential is ρ = e−β(H−ΩJ) where J is the angular
momentum.
(b) Starting from the Kerr metric (11.22), we will follow steps similar to section 3.2
to relate the near horizon to Rindler space. This will be easiest if you write ∆(r) =
(r − r+ )(r − r− ) and work in terms of r± instead of plugging in all the M ’s and a’s.
Also, you can safely ignore that θ-direction by setting θ = π/2 (you should come back
at the end of the problem and convince yourself that this was reasonable).
(c) To find the correct periodicity in this situation, define the corotating angular coor-
dinate φ̃ = bφ − ct (this coordinate rotates with the horizon). Now the usual Rindler
argument implies that t has an imaginary identification with φ̃ held fixed. Translate
this identification back into t, φ coordinates and compare to (3.18) to read off β and
Ω. Check that your answers agree with the ones you derived from the first law in the
exercise around (11.22).
34
3.4 Unruh radiation
No unique vacuum
Thus the Minkowski vacuum is not the same as the Rindler vacuum. This is a general
feature of quantum field theory in curved space (although in this example spacetime
is flat!). In general, there is no such thing as the vacuum state, only the vacuum state
according to some particular observer. The reason for this is the following. In QFT,
we expand quantum fields in energy modes,
X
φ̂ = aω,k eiωt−ikx + a†ω,k e−iωt+ikx . (3.20)
ω>0,k
The ‘vacuum’ is defined as the state annihilated by the negative energy modes:
aω |0i = 0 , (3.21)
The ambiguity comes from the fact that energy is observer dependent. The energy is
the expectation value of the Hamiltonian; and the Hamiltonian is the operator that
generates time evolution
i
[H, O] = ∂t O . (3.22)
~
Therefore the Hamiltonian depends on a choice of time t. In GR, we are free to call
35
Figure 1: Different choices of time in Minkowski vs Rindler space.
dτ = R0 dη . (3.24)
Therefore R0 is the redshift factor, and the temperature actually observed (say by a
thermometer) is
1 1 1 a
Tthermometer = √ = = . (3.25)
gηη 2π 2πR0 2π
Where in this argument did we actually decide which state we are in? There are
excitations of Rindler/Minkowski space – say, a herd of elephants running by – where
an observer will certainly not measure a uniform heat bath of thermal radiation! The
answer is that by applying the periodicity trick, we have actually selected one particular
36
very special state, (3.23). What we still need to show is that this state is in fact the
Minkowski vacuum.
• Does a Minkowski observer see that the Rindler observer is detecting particles?
Yes, the Minkowski observer can see that the Rindler observer’s particle detector
is clicking. How does that make any sense if the Minkowski observer doesn’t see
the particles being absorbed? The Minkowski observer actually sees the Rindler
observer’s detector emit a particle when it clicks. This can be worked out in
detail, but the easy way to see this is that the absorption of a Rindler mode
changes the quantum state of the fields; the only way to change the vacuum
state is to excite something.
• Doesn’t this violate conservation of energy? No, the Rindler observer is uniformly
accelerating. So this observer must be carrying a rocket booster. From the point
of view of a Minkowski observer, the rocket is providing the energy that excites
the Rindler observer’s thermometer, and causes Minkowski-particle emission.
Back to Minkowski
One very explicit method to show that the thermal Rindler state is in fact the Minkowski
vacuum is to compare the Rindler modes to the Minkowski modes, and check that im-
posing aM
w
inkowski
|0i = 0 leads to the state (3.23) in the Rindler half-space. This is
already nicely written in Carroll’s GR book and many other places so I will not bother
∗
See http://www.scholarpedia.org/article/Unruh effect for further discussion and references.
37
writing it here. Read the calculation there. But don’t be fooled – that method gives
the impression that the Rindler temperature has something to do with the modes of a
free field in Minkowski space. This is not the case. The Rindler temperature is fixed by
symmetries, and holds even for strongly interacting field theories, for example QCD.
The general argument will be given in detail below, using Euclidean path integrals.
We have shown that Unruh observers see a heat bath set by the periodicity in imaginary
time. We have also seen that the near-horizon region of a black hole is Rindler space.
Putting these two facts together we get Hawking radiation: black holes radiate like a
blackbody at temperature T . Some comments are in order:
1. This section has argued intuitively for Hawking radiation but don’t be disturbed
if you find the argument unconvincing. There are two ways to give a more
explicit and more convincing derivation. One is to match in modes to out modes
and calculate Bogoliubov coefficients; I recommend you read this calculation in
Carroll’s book. The other method is using Euclidean path integrals to put the
imaginary-time trick on a solid footing. We will follow this latter method in the
next section.
2. This is the same T that appeared in the 1st law of thermodynamics. Historically,
the 1st and 2nd law were discovered before Hawking radiation. Since T and S
show up together in the 1st law, it was only possible to fix each of them up to
a proportionality factor. This ambiguity was removed when Hawking discovered
(to everyone’s surprise) that black holes actually radiate.
38
3. In this derivation of Hawking radiation we have chosen a particular state by
applying the imaginary-time periodicity trick. In the Unruh discussion, this trick
selected the Minkowski vacuum. Another way to say it is that the imaginary-
time trick picks a state in which the stress tensor is regular on the past and
future Rindler horizon. Therefore, by applying this trick to black holes, we have
selected a particular quantum state which is regular on the past and future event
horizon. This state is called the Hartle-Hawking vacuum. Physically, it should be
interpreted as a black hole in thermodynamic equilibrium with its surroundings.
Others commonly discussed are:
(a) The Boulware vacuum is a state with no radiation. In Rindler space, it
corresponds to the Rindler vacuum, ρRindler = |0ih0|. This state is singular on
the past and future Rindler/event horizons, so is not usually physically relevant.
(b) The Unruh vacuum is a state in which the black hole radiates at temperature
T , but the surroundings have zero temperature. This is the state of an astro-
physical black hole formed by gravitational collapse. It is regular on the future
event horizon, but singular on the past even horizon (which is OK because black
holes formed by collapse do not have a past event horizon). If we put reflecting
boundary conditions far from the black hole to confine the radiation in a box, or
if we work in asymptotically anti-de Sitter space, then the Unruh state eventually
equilibrates so at late times is identical to the Hartle-Hawking state.
4. If you stand far from a black hole, you will actually not quite see a blackbody. A
Rindler observer sees an exact blackbody spectrum, as does an observer hovering
near a black hole horizon. But far from the black hole, the spectrum is modified
by a ‘greybody factor’ which accounts for absorption and re-emission of radiation
by the intervening geometry. Far from a black hole, the occupation number of a
mode with frequency ω is
1
hnω i = × σabs (ω) . (3.26)
eβω−1
The first term is the blackbody formula and the second term is the greybody
factor. The greybody factor is equal to the absorption cross-section of a mode
with frequency ω hitting the black hole, since transmission into the black hole is
equal to transmission out of the black hole.
39
5. In Rindler space, an observer on a geodesic (i.e., Minkowski observer) falls
through the Rindler horizon, and this observer does not see the Unruh radia-
tion. Similarly, you might expect that freely falling observers jumping into a
black hole will not see Hawking radiation. This is almost correct, as long as the
infalling observer is near the horizon in the approximately-Rindler region, but
not entirely — the potential barrier between the horizon and r = ∞ causes some
of the radiation to bounce back into the black hole, and this can be visible to an
infalling observer.
40
4 Path integrals, states, and operators in QFT
To put our derivation of Hawking radiation on a solid footing, and for other applications
to gravity later on, we will now take a slight detour to explain the relationship between
path integrals and states in quantum field theory. (This is material not normally
covered in detail in QFT courses or books; it is assumed that the reader is already
familiar with path integrals at the level of Peskin and Schröder.)
Path integrals define transition amplitudes. A Euclidean path integral defines a tran-
sition amplitude under evolution by e−βH :
Z φ(τ =β)=φ2
−βH
hφ2 |e |φ1 i = Dφ e−SE [φ] . (4.1)
φ(τ =0)=φ1
This involves a split into space and (Euclidean) time; φ1,2 is a boundary condition that
specifies data at a fixed time. Exactly what this path integral means depends on the
topology of space. If space is a plane (or line in 2d), then we depict this by
meaning it’s a Euclidean path integral over an infinite strip Rd−1 ×interval, with the
boundary conditions shown and the interval has length β.
41
If space is a sphere (or circle in 2d), then the appropriate path integral is
4.2 Wavefunctions
The transition amplitude defines the wavefunction, in the Schroedinger picture. For
example the wavefunction for the state
is the overlap
Ψ[φ2 ] ≡ hφ2 |Ψi . (4.5)
42
is the path integral
Z φ(τ =β)=??
|Ψi = Dφ eS[φ] = (4.7)
φ(τ =0)=φ1
This is a formal object where the data on the top cut is left unspecified. It is a
functional |Ψi that turns field data hφ2 | into complex numbers hφ2 |Ψi.
More generally, any path integral with an open cut Σ defines a quantum state on Σ.
For example, this Euclidean path integral in a 2D QFT defines some particular state
on a circle, Σ = S 1 :
|Xi = (4.8)
43
We could also insert some operators into this path integral to get a different state:
|X 0 i = (4.10)
This means a Euclidedan path integral weighted by O1 (x1 )O2 (x2 )e−SE [φ] , instead of
just the usual e−SE [φ] .
So far we have discussed Euclidean path integrals. But states are states: they are
defined on a spatial surface and do not care about Lorentzian vs Euclidean. The state
|Xi, defined above by a Euclidean path integral, is a state in the Hilbert space of the
Lorentzian theory. It is defined at a particular Lorentzian time, call it t = 0. It can be
evolved forward in Lorentzian time by acting with e−iHt , or equivalently by performing
the Lorentzian path integral:
44
Since |Xi ≡ |X(0)i was defined by a Euclidean path integral, the state |X(t)i is a path
integral that is part Euclidean, part Lorentzian:
|X(t)i = (4.12)
Again, this equation should be read as a formal definition of the state that tells you
what path integral to perform to compute transition amplitudes:
Then by evolving over a long Euclidean time we can project onto the lowest energy
state,
e−τ H |Y i ≈ e−τ E0 y0 |0i . (τ → ∞) (4.15)
It follows that we can define the (unnormalized) ground state by doing a path integral
that extends all the way to infinity in one direction. For example the ground state on
45
the line is produced by the Euclidean path integral
|0iline = (4.16)
This means a path integral on the semi-infinite plane, with an open cut at the edge. The
ground state on a circle is produced by the path integral on a semi-infinite Euclidean
cylinder,
|0icircle = (4.17)
Path integrals with cuts can be glued together to make transition amplitudes. For
example, for a theory on a line, the vacuum-to-vacuum amplitude is
Z
h0|0i = Dφe−SE [φ] = (4.18)
The lower half-plane produces |0i, the upper half-plane produces h0|, and glueing them
together along the cuts at τ = 0 produces the transition amplitude. One way to see
that we should glue is to insert the identity:
X
h0|0i = h0|φ1 ihφ1 |0i . (4.19)
φ1
46
The first term is a path integral on the upper half plane; the second term is a path
integral on the lower half plane; and summing over all possible boundary conditions
φ1 in the middle just says that fields should be continuous across τ = 0 and therefore
glues the half-planes together.
Expectation values of local operators are computed by similar path integrals, but with
extra operator insertions. For example, correlation functions are expectation values in
the vacuum state. In Euclidean signature these are computed by the path integral
hO1 (x1 )O2 (x2 )i ≡ h0|O2 (x2 )O1 (x1 )|0i (4.20)
= (4.21)
This picture means the path integral of O1 (x1 )O2 (x2 )e−SE [φ] over fields on Rd . (The
ordering of operators does not matter on the lhs of (4.20), but is important on the rhs;
more in this below.)
hO1 (t1 , ~x1 )O2 (t2 , ~x2 ) · · · i = h0| eiHt1 O1 (0, ~x1 )e−iHt1 eiHt2 O2 (0, ~x2 )e−iHt2 |0i (4.22)
47
is computed by the following path integral:
(4.23)
This path integral starts at t = −i∞ on the left; evolves to t = 0 to prepare the
vacuum state; evolves in Lorentzian time to t = t2 , where O2 is inserted; then evolves
to t1 where O1 is inserted; then evolves backwards in Lorentzian time to t = 0; then
evolves to t = +i∞ for the vacuum ‘bra’. Again, this picture means you should do the
path integral Z
Dφ O1 (t1 , ~x1 )O2 (t2 , ~x2 )eiS[φ] (4.24)
where we integrate over all fields φ defined on the mixed-signature manifold in the
picture. The Lorentzian action appears in this expression; when you integrate over the
Euclidean part of the manifold, the fact that t is imaginary will automatically change
this into e−SE [φ] .
We rarely need to think about doing folded path integrals like (4.23). Instead, we
do one of two equivalent things: (1) We compute the Euclidean path integral with
arbitrary values of the insertion points, then analytically continue to Lorentzian time,
48
or (2) We use an i prescription to compute the Lorentzian path integral. Actually the
usual i prescription is just a deformation of the integration contour (that is, integration
contour in field space) shown in figure (4.23), and computes exactly the same quantity.
So if you’re ever wondered what you were doing with that i, the answer is the figure
in (4.23)!
One way to see this is to note that correlators computed by the path integral
Z
Dφ O1 (x1 )O2 (x2 ) · · · e−SE (4.26)
are just statistical averages, so they commute just like observables in stat-mech. Put
differently, the reason that fields don’t commute in Lorentzian signature is because
the correlator is not an analytic function of the coordinates. It has branch cuts when
O2 hits the light-cone of O1 , and requires an i prescription to define the function.
Different choices of i prescription give different types of correlation functions, and we
denote these different choices by writing the fields in a different order. In Euclidean
signature, correlators are analytic, there are not branch cuts, and there are no i’s, so
we don’t have to worry about how fields are ordered.
However, when we cut the path integral to translate to operator language, the field
operators don’t commute, even in Euclidean signature. They are ‘time’-ordered ac-
cording to whatever slicing we choose for the path integral. So if states are defined
on constant-Euclidean-time slices, the path integral translates into an operator expres-
sion with fields ordered according to their Euclidean time. If states are defined on
constant-r slices (as we often do in conformal field theory), then the corresponding
operator expression has radially-ordered fields.
49
4.7 Density matrices
A density matrix is an operator; it takes a bra and a ket, and produces a complex
number. Thus any path integral with two open cuts defines a density matrix (un-
normalized). For example, the density matrix ρ = e−βH , for a theory on a circle, is
formally the doubly-cut Euclidean path integral
ρ ≡ e−βH = . (4.27)
This is just a picture representing the statement that matrix elements hφ2 |ρ|φ1 i are
computed by the path integral with boundary conditions φ1,2 on the cuts.
The density matrix ρ = e−βH is the density matrix in a thermal ensemble at tempera-
ture T = 1/β. The thermal partition function is
X
= (4.31)
φ1
50
In the last line, by summing over φ1 we are really just imposing periodic boundary
conditions on the cylinder. This glues together the two ends of the cylinder, producing
a torus. So the thermal partition function for a 2D theory on a circle is equal to a path
integral on a torus:
The trace ‘glues together’ parts of the Euclidean manifold that computes ρ.
hO1 (t = 0, ~x1 )O2 (t = 0, ~x2 ) · · · iβ ≡ Tr e−βH O1 (0, ~x1 )O2 (0, ~x2 ) · · · (4.34)
By the same logic, this is computed by a path integral on a cylinder Rd−1 × S 1 (if space
is a plane) or on S d−1 × S 1 (if space is a sphere).
51
To compute different-time Lorentzian correlators at finite temperature, the easist method
is usually to compute the Euclidean correlators first, as functions of arbitrary insertion
points on the Euclidean cylinder, then analyatically continue.
(a) Draw a picture of the path integral on the cylinder that computes (4.35).
(b) Translate your picture into operator language. Compare to (3.10). (Don’t worry
about the overall sign, this is a convention.)
(c) Check that the 2pt function written in (4.35) indeed has the periodicity of a thermal
correlator (see discussion around (3.10)).
(d) Analytically continue to find the finite-temperature 2pt function at real (Lorentzian)
times hO(t1 , x1 )O(t2 , x2 )i, where t is Lorentzian time. Don’t worry about which Lorentzian
ordering you are computing, just pick one. (The most obvious continuation will com-
pute the time-ordered Lorentzian correlator.)
(e) Fix t1 = x1 = 0. Draw a picture of the complex-t2 plane showing the singularities
of (4.35). When you analytically continued in part (d), you implicitly chose a contour
in this plane to define the analytic continuation. Check that if (t2 , x2 ) lies inside the
future light-cone of (t1 , x1 ), then the analytic continuation is ambiguous, due to one of
the poles in the complex-t2 plane. This ambiguity is why timelike separated fields in
Lorentzian signature do not commute.
52
53
5 Path integral approach to Hawking radiation
We will use the Euclidean path integral to justify the claim in (3.23) that the Minkowski
vacuum corresponds to the Rindler state ρRindler = e−2πHη . Consider a 2d QFT on a
line, and let the state of the full system by the Minkowski vacuum,
ρ = |0ih0| . (5.1)
As argued above, this state is prepared by a path integral on a half-plane, cut on the
line t = 0. Let us divide the line into x > 0 (region A) and x < 0 (region B). The
reduced density matrix in region A is
ρA ≡ trB ρ . (5.2)
This has the nice property that all observables restricted to region A (or to the Rindler
wedge that is the causal evolution of region A) can be computed from ρA alone:
= (5.5)
P
The upper half of this diagram corresponds to the transition amplitude φ̃ hφ̃, φ2 |0i
and the lower half to the transition amplitude h0|φ1 , φ̃i. The trace sums over fields in
54
the left Rindler wedge, which glues together these slits in the path integral, so in fact
Now comes the key observation: we can re-slice this path integral by going to polar
coordinates dR2 + R2 dφ2 , and calling φ ‘time’. Let HRindler be the operator that
generates φ-evolution. That is,
1
[H, O] = ∂φ O (5.7)
~
for any operator O. Then we can translate this same path integral back into op-
erator language in a different way. That is, the path integral in (5.6) is equal to
hφ2 |e−2πHRindler |φ1 i. Therefore
ρA = e−2πHRindler . (5.8)
This looks just like a thermal state at temperature 1/2π, but it is thermal with respect
to the rotation generator. When we go back to Minkowski space φ → iη, this becomes
the boost generator corresponding to the causal development of the Rindler wedge.
Therefore HRindler is exactly what we called Hη above.
This is a complete path-integral derivation of the statement that the Minkowski vacuum
leads to a thermal state in Rindler space. As mentioned above, this can also be shown
by explicit comparison of modes, but the path integral derivation can be more useful
for intuition. Another big advantage is that in the path integral derivation, we did not
assume anywhere that the matter fields were free, or even necessarily weakly coupled—
it is completely general.
Modular Hamiltonian
The Hamiltonian that appears in the relation ρRindler = e−2πHRindler is a special case
of a modular Hamiltonian. A modular Hamiltonian is simply defined as the log of a
density matrix (up to normalization). It is very useful for characterizing entanglement,
55
both in quantum gravity and in condensed matter physics.
So far, we have answered the question “What is the quantum state of fields on Rindler
space?” The complete answer is equation (5.8), and does not require any mention of
“particles” (which only make sense at weak coupling), or any particular observer.
However to gain a more concrete intuition for the physics it is very useful to think in
terms of particles. So in this subsection we will apply to result (5.8) to free (or weakly
interacting) fields, and discuss what an accelerating observer capable of detecting these
particles would actually experience.
A massless free field in 2D Rindler space (in Lorentz signature) obeys the wave equation
Φ = ∇µ ∇µ Φ = 0 . (5.9)
Since η is our ‘time’ coordinate, we take the ansatz Φ = e−iωη f (R), and find the
solution
Φ = e−iωη+ik log R , ω2 = k2 , ω>0. (5.10)
As usual in QFT, we expand the field operator in terms of creation and annihilation
operators, Z
Φ̂(η, R) = dk bk Φk + b†k Φ∗k (5.11)
The creation operators b† create positive-energy modes φk . The b’s annihilate positive-
energy modes. The Rindler vacuum state is defined by
It is clear that this is not the Minkowski vacuum state: Minkowski modes are expanded
in Minkowski plane waves, and Minkowski creation and annihilation operators a†k , ak
are not the same as the Rindler ones. The fact that Rindler space has a different choice
of ‘time’ means it has a different choice of ‘energy’ and therefore a different notion of
56
‘particle’ and ’vacuum’:
How does this relate to our more abstract path integral discussion above? ω in the
mode-expansion (5.11) is exactly the eigenvalue of the Rindler Hamiltonian HRindler .
That is, the 1-Rindler-particle state,
satisfies∗
HRindler |kiR = ω|ki . (5.15)
In the Minkowski vacuum, the quantum state of these fields is ρRindler = e−2πHRindler .
We can use this to calculate observables. For example, what is the occupation number
of a mode with Rindler energy ω = |k|? The calculation is identical to the usual
blackbody calculation:
1
hnk i = Tr e−2πHRindler b†k bk , Z ≡ Tr e−2πHRindler (5.16)
Z
The number operator b†k bk counts the number of quanta in the mode, so it ranges from
n = 0 . . . ∞, so
! !
X X
hnk i = ne−2πn|k| / e−2πn|k|
n≥0 n≥0
1
= . (5.17)
e2π|k| − 1
57
that measures energy in, say, joules, does not measure the energy ω. In fact, ω is
dimensionless, since the Rindler time coordinate is dimensionless, so this wouldn’t
even make sense.∗ What an observer actually calls ‘energy’ is the quantity conjugate
to the observer’s proper time. That is, the observer will consider a mode ∼ e−iωobs τobs
to have energy ωobs , in joules or similar energy units. The proper time of a uniformly
accelerating observer with acceleration a (and therefore Rindler position Robs = 1/a)
is
1
dτobs = Robs dη = dη , (5.18)
a
so the observer will see a mode e−iωη to have energy
ωobs = aω . (5.19)
a
Tobs = . (5.20)
2π
However, a temporarily accelerating observer will also see Unruh radiation. It does
not quite make sense to talk about a ‘temperature’ in this case because the observer’s
thermometer will not reach exact equilibrium in any finite time. When the observer
starts accelerating, there will be some transient effects, and then the observer will feel
thermal radiation; the thermometer will start to heat up, asymptotically approaching
temperature a/2π; and when the observer stops accelerating the thermometer will again
experience some transient effects, then radiate and cool back down to zero temperature.
So, as long as the acceleration lasts a long time compared to the equilibration timescale
tequil ∼ 1/T ∼ R0 , the Unruh temperature is still meaningful in this situation. On the
other hand, for short bursts of acceleration, our analysis does not apply. Instead we
would need to solve a time-dependent problem. This can be done using Feynman
∗
ie ds2 = dR2 − R2 dη 2 , so all the dimensions are carried by R; η is like an angular coordinate.
58
diagrams that describe emission/absorption of particles from an arbitrary worldline.
(There are many wrong papers on this topic. A correct, clear, and short paper that
also has a nice derivation of the Unruh effect from Green’s functions is: “Transient
phenomena in the Unruh effect,” Bauerle and Koning.)
What does physics look like in the Rindler vacuum, |0iR ? To an accelerating observer,
it would look ordinary: this observer would detect no particles. A geodesic observer,
however, would detect observers, since this observer must notice that fields are not in
the Minkowski vacuum. As long as the geodesic observer is in the Rindler wedge, this
just looks like some particular excited state. However, timelike geodesics cannot stay
in the Rindler wedge forever — eventually they go through the Rindler horizon. The
Rindler vacuum state is singular at the horizon. That is, the energy density measured
by a geodesic observer diverges at the Rindler horizon. There is no ’beyond’ the horizon
in this state.
This makes sense. In the Rindler vacuum, there are no correlation between fields in
the left and right Rindler wedges:∗
If there are no correlations, who’s to say that these wedges are actually ‘next to each
other’ ? In a sense they are not. Thus in the vacuum state, the Rindler wedge does not
extend beyond the horizon.
The key to obtaining a finite energy density on the Rindler horizon is to have a lot
of entanglement between the left and right Rindler wedges. In the exercise below you
will show explicitly how, in the Minkowski vacuum, the left and right Rindler wedges
are maximally entangled, much like the two spins in Bell’s thought experiment.† Any
state with smooth horizon must be highly entangled across the horizon.
∗
In this equation |0iR means the product vacuum where each Rindler wedge is in its vacuum.
†
http://en.wikipedia.org/wiki/Bell’s theorem
59
Exercise: Entanglement warm-up
Difficulty level: easy
Consider a quantum system consisting of two particles A and B, each with two states
|0i and |1i (which you can think of as spin-up and spin-down). Suppose the full system
is in the maximally entangled pure state
(This is sometimes called a Bell pair). Find the reduced density matrix ρA for particle
A. You will find a mixed state. Compute the entanglement entropy of this mixed state,
defined as
SA = − trA ρA log ρA . (5.23)
|Ψi ∈ H̃ ⊗ H (5.24)
such that
ρ = trH̃ |ΨihΨ| . (5.25)
This is called purifying the mixed state. In this problem you will show that Minkowski
space is a purification of Rindler space.
The Minkowski Hilbert space∗ factorizes† into two copies of the Rindler Hilbert space,
HM = H̃R ⊗ HR , (5.26)
which are the Hilbert spaces associated to the left Rindler x < 0 and right Rindler
∗
By ‘Minkowski Hilbert space’ we really mean Hilbert space of the theory on an infinite plane,
since Hilbert spaces are defined by the space on which a theory lives, not the spacetime. Similarly by
‘Rindler Hilbert space’ we mean the Hilbert space of the theory quantized on a half-plane.
†
This is not quite true due to UV divergences, but this doesn’t matter for this problem
60
x > 0.∗ In terms of the field data, this just means that a field in Minkowski space at
t = 0, φM (x), can instead be written as the pair (φ̃R , φR ) where φ̃R is a field on the
left Rindler half-plane, and φR is a field on the right Rindler half-plane.
(a) The Minkowski groundstate |0i is formally a functional that turns field data at
t = 0 into complex numbers. That is, the ground state wavefunction is
The subscript M means ‘Minkowski’, ie a state on the full space. Write down the path
integral that computes this wavefunction, and draw the corresponding picture along
the lines of the path integral pictures above.
(b) Now re-slice this same path integral using the Rindler Hamiltonian HRindler , which
generates Euclidean rotations ∂φ . That is, write an operator expression of the form
(c) We want to show that the Minkowski state is the same as the doubled Rindler state
X
|T F DiR⊗R ≡ e−βEn /2 |niR |ni∗R (5.29)
n
where this is a sum over Rindler energy eigenstates, En is the Rindler energy, β = 2π is
the Rindler temperature, and ∗ means CPT conjugate. This is called the thermofield
double state.
To demonstrate this, check that the matrix elements of the state defined in (5.29) agree
with the ones you wrote above,
To do you this you will need to note that the mapping from Minkowski states to Rindler
∗
Clearly HR = H̃R but the tilde will be useful to keep track of things.
61
⊗ Rindler states is
|φ̃R , φR iM → |φ̃i∗R |φR iR , (5.31)
where the conjugation is needed because time runs ‘backward’ in the left Rindler wedge.
What you have just shown is that the full Minkowski vacuum can be reinterpreted as
the thermofield double in two copies of Rindler space.
(d) Finally, check that tracing over the left Rindler Hilbert B space produces a thermal
state in the right Rindler Hilbert space A,
X
ρA ≡ trB |T F DihT F D| = e−2πHRindler = e−2πEn |niR R hn| (5.32)
n
Euclidean plain, positioned along the negative τ -axis. Then the thermofield double
state is produced by evolving this by β/4 to the left and β/4 to the right, producing a
state on the τ = 0 line.
w
z=− (5.33)
w−`
62
maps the half-line in the z-coordinate to the interval x ∈ (0, `), where w = x + itE . In
the z coordinate, evolution of the half-line is generated by the rotational vector field ζ.
(a) Write ζ in the z coordinate,∗ then do the coordinate change (5.33) to write it in
the x, tE coordinates.
(b) The modular Hamiltonian, which generates the time evolution of the interval, is
then Z
HA = dx Ttµ ζ µ |tE =0 , (5.34)
A
where Tµν is the usual stress tensor. Write the integrand explicitly in terms of Ttt , x,
and `.
ρA = e−2πHA . (5.35)
(d) Continue tE → it, and sketch the vector field ζ in 2d Minkowski space (x, t).
We have seen there is no unique vacuum state in quantum field theory. The same is
true on a black hole background. A natural state to consider, which is analogous to
the vacuum state we defined in Minkowski space, is a state prepared by a path integral
on the analytically continued Euclidean spacetime,
dr2
ds2 = (1 − 2M/r)dτ 2 + + r2 dΩ22 . (5.36)
1 − 2M/r
with the imaginary-time identification τ ∼ τ + β. This spacetime only has r > 0, there
is no interior. The t = 0 slice of the Lorentzian spacetime is the τ = 0 slice of the
Euclidean spacetime, see figure 2.
∗
i.e., define z = z1 + iz2 and write ζ in terms of the two real coordinates z1 and z2 .
63
Figure 2: Schwarzschild spacetime. The Euclidean path integral produces a pure,
highly entangled state on the two-sided Lorentzian spacetime. The quantum state on
the right half of the Penrose diagram, where we live, is therefore mixed. This reduced
state is the Hartle-Hawking thermal state.
Sending τ → τ + β/2 takes us to the other side of the Penrose diagram in the maximal
analytic extension of Schwarzschild. This can be shown in detail using Kruskal coor-
dinates. A simpler way to see this is to go to Rindler coordinates near the horizon.
By changing these Rindler coordinates to Minkowski-like coordinates good near the
horizon, we can continue through the horizon to the other side of the Penrose diagram.
So, just like in Rindler space, we get to the other side of the horizon by going half way
around the Euclidean circle.
This path integral prepares an entangled state on M̃ × M , the product of the left and
right Minkowski spaces. Just as in Rindler space, the reduced density matrix on our
spacetime M will be a mixed state,
This is not the only state we could consider. See note 3 for a discussion of other
possibilities. Hawking showed that a black hole formed by collapse will end up in the
‘Unruh state’, which is a state where the black hole radiates into a cold outside region.
Greybody factors
The Hartle-Hawking vacuum (5.37) is time-independent. This means that, in each
64
mode, the flux of outgoing Hawking radiation is equal to the flux of ingoing radiation.
A mode φk outside the black hole does not necessarily fall in; it is absorbed with
probability given by the absorption cross-section σabs (k). Therefore, the only way a
black hole can be in thermal equilibrium with a bath at temperature T is if the Hawking
emission measured at infinite is actually
1
hnk i = σabs (k) . (5.38)
eβw − 1
The extra factor is the ‘greybody factor’. We will probably calculate some greybody
factors later.
Aside: Cosmology
If the early universe is described by inflation, then it is the story of a slowly evolving
de Sitter spacetime. De Sitter spacetime is the Lorentzian continuation of a sphere.
That is, the metric of Euclidean de Sitter is just
The state of quantum fields during inflation is responsible for present-day observables
including the primordial temperature fluctuations in the CMB, observed by experi-
ments like COBE, WMAP, and Planck. Since there is no unique vacuum, we must
pick a state of the quantum fields in de Sitter. For various reasons,∗ we usually assume
this state is the so-called ‘Euclidean vacuum’, also called the ‘Bunch-Davies vacuum’
∗
Here are some reasons: (1) This state respects the symmetries of de Sitter; (2) At short distances,
this vacuum is the one in which comoving observers see no particles (ie it coincides locally with the
Minkowski vacuum); (3) at late times, due to the cosmological expansion, any state will dilute into
this state.
65
or various other things. This state is prepared by a Euclidean path integral on the
hemisphere, cut along the equator. Therefore this quantum state, unlike the Hartle-
Hawking state, has quite possibly already been observed experimentally.
(b) How heavy, in solar masses, would a black hole need to be for its lifetime to be the
age of the universe t ∼ 13 billion years?
(If such black holes exist, we might be able to observe the final moments of decay,
when a large burst of energy is released in Hawking radiation. Unfortunately there
is no particularly good reason to think they should exist, since black holes formed by
stellar collapse must have Minitial & a fewMsun .)
(c) What is the typical energy (in eV) of a particle emitted from a solar mass black
hole via Hawking radiation?
Exercise: Superradiance
Difficulty: a few lines
Rotating (Kerr) black holes are labeled by mass M and angular momentum J, or
equivalently by a temperature T and angular potential Ω. The spacetime is rotationally
invariant and stationary, so modes of a scalar field can be written φ ∼ e−iωt+imφ Sn (r, θ),
where n labels the solutions of given ω, m.∗ The Hawking decay rate of a rotating black
hole is
1
Γω,m,n = σabs (ω, m, n) (5.40)
eβ(ω−mΩ) −1
(a) Take the zero-temperature limit of (5.40). (Hint: ω > 0 and m is any integer. The
∗
Actually, the wave equation fully separates, so in fact S(r, θ) = R(r)F (θ). This is surprising and
nontrivial, since the background has only two Killing vectors. Similarly, the geodesic equation on Kerr
has an ‘extra’ conserved quantity.
66
answer should not be trivial.)
(b) For this decay rate to make any sense, what can you conclude about σabs ?
∗
More accurately, a close cousin of superradiance involving magnetic fields. The details of how this
works are still largely unknown.
67
6 The Gravitational Path Integral
In ordinary QFT, to do a path integral we first fix the spacetime manifold M , then
integrate over fields defined on M . We did the same thing in our discussion of Hawking
radiation. In quantum gravity, however, we must integrate over the geometry itself.
We are only allowed to specify the boundary conditions on the geometry as r → ∞,
just like for other fields. The gravitational path integral (in Euclidean signature) is
√
Z Z
−SE [g,φ] 1
Z= DgDφ e , SE [g] = − g (R + · · · ) + boundary terms ,
16πGN
(6.1)
where φ denotes all the matter fields.
The meaning of this path integral depends on the boundary conditions, as usual. In
analogy to the QFT case, we define the thermal partition function Z(β) as the path
integral on a Euclidean manifold with the boundary condition that Euclidean time is
a circle of proper size β,
Of course we cannot actually do the path integral. In fact, we don’t even really know
how to define it.∗ The best we can do is to approximate it by expanding around a
classical saddlepoint, i.e., a solution of the classical equations of motion:
The leading term, in which ḡ, φ̄ is a solution of the classical equations of motion, is the
semiclassical approximation to the path integral. This solution must of course obey
the correct boundary condition. The next term is the 1-loop term and is O(G0N ), and
the dots indicate higher-loop contributions.
We already know a solution with the correct boundary conditions: the Euclidean
∗
The situation in gravity is even worse than in ordinary QFTs, since the Euclidean action is not
bounded below.
68
Schwarzschild black hole. This is a classical saddlepoint with a Euclidean time cir-
cle of size β. Therefore, to leading approximation, the thermal free energy is the
Euclidean on-shell action:
log Z(β) ≈ −SE [ḡ] , (6.4)
with ḡ the Schwarzschild metric. (We have dropped φ̄ because no matter fields are
non-zero in the Schwarzschild background.)
This partition function can be used in all of the same ways as an ordinary thermody-
namic partition function. For example, recall that log Z = S − βE, so the entropy and
energy are
S = (1 − β∂β ) log Z(β), E = −∂β log Z . (6.5)
We will see that these agree with the area law and the black hole mass.
A similar discussion applies with an angular potential and electric potential, but we
will stick to the Schwarzschild black hole to keep things simple.
We will now do this explicitly, in Einstein gravity (i.e., no higher curvature corrections)
with zero cosmological constant. It is not as simple as computing R (which vanishes
69
for Schwarzschild!) and integrating over spacetime, since there are boundary terms to
worry about and infinities to regulate.∗ Although this is entirely classical, the procedure
to regulate divergences involves counterterms much like those in QFT; in fact we will
see later there is a direct link between these two apparently different divergences.
1
Z
√ 1
Z √
SE [g] = − gR − hK . (6.6)
16π M 8π ∂M
Here hij is the induced metric on the boundary ∂M , and the extrinsic curvature of
∂M is
Kij ≡ 12 Ln hij = ∇(i nj) , K = hij Kij , (6.7)
√
Z Z Z
δ gR ∼ (eom)δg + [A(g, ∂g)δg + B(g, ∂g)∂δg] , (6.9)
M M ∂M
where ‘eom’ essentially means the Einstein tensor‡ and the boundary terms come from
integrating by parts. On a classical solution, the bulk term vanishes. If we impose
boundary conditions that fix the metric at r = r0 , then δg|∂M = 0, so the first boundary
term vanishes, but the boundary term involving ∂δg does not. The Gibbons-Hawking-
York term fixes this problem. It is chosen so that the variation of the full action (6.6)
∗
This subsection follows Hawking’s chapter in General Relativity, an Einstein Centenary Survey,
Hawking and Ellis eds.
†
In the simple case that the boundary is, say, at fixed r, the induced metric hij = gij where i runs
over the transverse directions. That is all we will need. But more generally, the projector onto ∂M is
(as you can see by noting nµ hµν = 0), and then you must define intrinsic coordinate xi on ∂M .
‡ √
(eom)δg ∝ gGµν δgµν
70
has the form Z Z √
δSE [g] = (eom)δg + 1
2
hT µν δgµν . (6.10)
M ∂M
We will return to this ‘stress tensor’ later, but for now the important thing is just that
the boundary term has been chosen to eliminate ∂δg. Thus δSE [ḡ] = 0 for variations
satisfying the boundary condition and ḡ satisfying the equations of motion.
dr2
2 2M
ds = 1 − dτ 2 + 2M
+ r2 dΩ22 . (6.11)
r 1− r
What was the horizon r = 2M in Lorentzian signature is now just the origin of a polar
coordinate system, with angular coordinate τ identified as required for regularity at
the origin,
τ ∼ τ + 8πM . (6.12)
Euclidean black holes are completely smooth solutions; they do not have an interior or
a singularity.
Now we want to evaluate the action. The bulk term vanishes, since the vacuum Einstein
equations set R = 0. The boundary term, evaluated on the surface r = r0 , is
Z √
hK = β(8πr0 − 12πM ) . (6.13)
∂M
1
Z
√ 1
Z √ 1
Z √
SE [g] = − gR − hK + hK0 , (6.14)
16π M 8π ∂M 8π ∂M
71
flat spacetime. This is very similar to what we do in quantum field theory, but this
calculation is entirely classical. (We will see later that in anti-de Sitter space, there is
a direct connection between the two ideas). Note that the counterterm depends only
on data intrinsic to the boundary surface – it is not allowed to depend on ∂n h.
This gives∗ Z √
hK0 = β(8πr0 − 8πM + O(1/r0 )) . (6.17)
∂M
This eliminates the divergence (and changes the finite term!), giving our final answer
βM
SE = = 4πM 2 (6.18)
2
Thus the thermal partition function, or leading approximation to the path integral, is
β2
2
Z(β) = exp −4πM = exp − . (6.19)
16π
From this we can rederive the entropy and energy using standard thermodynamics,
72
always works and agrees with the area law. Roughly, the reason it is proportional to
area is that we can think of the equation (1 − β∂β ) log Z as calculating the change
in the classical action produced by changing the imaginary-time identification. If you
smoothly deform a solution, then δSE = 0 by the equations of motion; but if you
R
introduce a defect, this contributes δSE = def ect (something). Going through the
details, you can derive Area/4.∗ This is also the easiest way to derive Wald’s formula,
which includes the corrections to the entropy from higher curvature terms in the action.
1
Z
√ 1
Z √ a
Z √
SE [g] = − g(R − 2Λ) − hK + h. (6.21)
16π M 8π ∂M 8π ∂M
Choose a to cancel the divergence; the remaining finite expression is the correct SE .
73
74
7 Thermodynamics of de Sitter space
Inflation is the idea that the very early universe, at t . 10−32 s, is approximately de
Sitter spacetime. We will now take a detour to apply our methods to de Sitter space,
since in this situation they produce observable effects via the imprint of primordial
density perturbations on the CMB.
de Sitter is the maximally symmetric solution of the Einstein equations with a positive
cosmological constant. The metric is∗
These coordinates, which cover all of de Sitter space, describe a sphere S D−1 that is
very large at T → −∞, contracts to a minimum radius ` at T = 0, then expands as
T → ∞. The length scale is set by the cosmological constant, Λ ∼ 1/`2 .
To draw the Penrose diagram, we need to make the range of T finite. This can be done
by defining
tan(η/2) = tanh(T /2`) . (7.2)
`2
ds2 = −dη 2 + dΩ2D−1 .
2
(7.3)
cos η
For the Penrose diagram (which captures the causal structure but ignores distances)
we can ignore the overall factor, so this gives the diagram in figure 3. Note that de
Sitter space has an initial and final conformal boundary. (Although the diagram also
appears to have left and right boundaries, these are not really boundaries – at each
value of η space is a sphere, so those lines are just the north and south poles of the
sphere S D−1 .)
Vacuum
As usual, there is no unique vacuum. However we can follow our usual prescription,
∗
I will call these global coordinates, but in the context of cosmology this is usually called the ‘closed
slicing.’
75
Figure 3: Penrose diagram of de Sitter space.
as we did for Minkowski and Schwarzschild, and define a vacuum which is prepared by
a path integral on the Euclidean continuation of de Sitter. Euclidean de Sitter space
is just a sphere: send T → i`θ in (7.1), and you see that θ ∈ [−π/2, π/2] is the polar
coordinate on a sphere S D of radius `. It is shifted by π/2 from the usual definition,
so the surface T = θ = 0 is the equator of the S D . This equator is the minimal-size
spatial section of de Sitter space, an S D−1 of radius `. This is the T = 0 surface at the
middle of the Penrose diagram.
The Euclidean vacuum ∗ is the state prepared by a path integral on the hemisphere.
This path integral prepares a quantum state on the equator, which can then be evolved
in Lorentzian time. Here is a cartoon for this path integral:
(7.4)
Roughly speaking, ‘most’ states will eventually end up close to the Euclidean vacuum,
since the de Sitter expansion dilutes any excitations.
This vacuum state has in a sense been observed experimentally, since it leaves an
imprint in the CMB. I say ‘in a sense’ because it is not a very fine-grained or direct
test of the details of de Sitter space, but it is nonetheless the only good explanation
∗
Also called the ‘Bunch-Davies vacuum’ and various other things.
76
we have for those fluctuations.
Now we will calculate the Gaussian fluctuations in de Sitter that are seen in the CMB,
using path integral methods.∗ We consider a free massless scalar field, in the ‘conformal
coordinates’ (7.3) in D = 4. (It turns out that scalar fluctuations of the metric reduce
to this problem.) The action is
√
Z
I∼ −g∇µ φ∇µ φ . (7.5)
M
We want to compute the wavefunction for this scalar field, in the Euclidean vacuum.
This wavefunction is defined to be the transition amplitude
This is a path integral over field configurations on the mixed spacetime (7.4). The
past boundary condition is regularity on the Euclidean sphere; the future boundary
condition is φ(Ω, t0 ) = φ0 (Ω).
In the WKB approximation the wavefunction is simply the on-shell action of a classical
solution satisfying these boundary conditions,
In fact, since this is a free field, this expression is exact. The on-shell action is a
pure boundary term, since we can integrate by parts in (7.5) then use the equations of
motion. Thus Z √
Ion−shell (t0 ) ∼ hφ∂n φ (7.8)
Σ(t0 )
where Σ(t0 ) is the spatial slice at time t0 , ∂n is the derivative normal to this slice, and
h is the spatial metric.
At late times, the S D−1 spatial sphere in (7.3) is very large. When we look back at the
∗
This is based on Maldacena astro-ph/0210603.
77
CMB, we are looking at a tiny patch of this sphere. Therefore at late times we can
think of the spatial sphere as being essentially flat. That is, we can only see modes with
wavelength much smaller than `. This simplifies the calculation of the wavefunction
because at late times we can approximate the metric by
`2
ds2 = (−dη 2 + d~x2 ) . (7.9)
η2
This is enough to compute the wavefunction, since (7.8) is a pure boundary term at
t = t0 (η = η0 ).
~
For a plane wave φ0 = hk (η)eik~x , the wave equation at late times (i.e., using the metric
(7.9)) is
2
φ = 0 ⇒ ∂η2 hk (η) − ∂η hk (η) + ~k 2 hk (η) = 0 . (7.10)
η
In order to find the wavefunction, we need to find the classical solution satisfying the
boundary conditions stated above. This is
(1 − ikη)eikη
φ = φ0k . (7.11)
(1 − ikη0 )eikη0
This satisfies all of our criteria: it solves the wave equation, and it is equal to φ0k at
time η = η0 . The last criterion was regularity on the Euclidean sphere – this is what
picks the solution going as e+ikη , rather than the other solution ∼ e−ikη . To see this,
note the pole of the Euclidean sphere is t = i`π/2, which corresponds to
Only the e+ikη solution is regular in this limit, so we’ve picked the correct solution,
corresponding to the Euclidean vacuum state. (This is very similar to the path-integral
calculation of the groundstate for the harmonic oscillator.) Although φ is a real field,
(7.11) is complex. This is fine: even though the path integral is over real field configu-
rations, the stationary phase approximation can pick out a complex saddlepoint. It is
just a trick to compute the path integral.
78
Plugging (7.11) into (7.8),
d3 k `2 0 d3 k `2 k 2
Z Z
iI = i φ ∂η φ0k |η=η0 ∼ i − k + · · · φ0−k φ0k
3
(7.13)
(2π)3 2η02 −k (2π)3 2 η0
This gives the wavefunction (exactly in a free theory) via (7.7). Knowing the wavefunc-
R
tion, we can calculate correlators using hφ2 i ∼ Dφφ2 |Ψ(φ)|2 . In detail, accounting
for the normalization of the wavefunction:
R 3
d p 2 3
R
Dφ φk φ−k exp − (2π) 3 ` p φ φ
p −p
hφk φ−k i = R 3
d p 2 3
R
Dφ exp − (2π) 3 ` p φ φ
p −p
1 −3
= (2π)3
k , (7.14)
2`2
2 R 2
where we used the Gaussian integral dzz 2 e−az / dze−az = 1
R
2a
(and we’ve dropped
an overall momentum-conserving delta function).
The power law in (7.14) is what is meant by the statement that ‘inflation predicts a
scale-invariant spectrum of primordial scalar perturbations.’
A single inertial observer travels on a geodesic which we might as well call the north
pole of S D−1 . Thus the worldline of an observer is the solid line on the right side of the
Penrose diagram. It is clear from the diagram that this observer is in causal contact
with only a subregion of de Sitter. This is because the universe is expanding very
rapidly, so you cannot communicate with someone beyond a certain critical distance:
the cosmological horizon. This is labeled ‘horizon’ in figure 3. Like Rindler space, the
position of the horizon depends on the observer.
The static patch is the region of de Sitter in causal contact with an observer sitting
at the north pole. This is the analogue of the Rindler patch. The coordinates on the
static patch are
dr2
ds2 = −(1 − r2 /`2 )dt2 + dr2 + r2 dΩ2D−2 . (7.15)
1 − r2 /`2
79
This looks like a black hole except, the static patch where an observer lives is the
inside, r < `, with a cosmological horizon at r = `.
Temperature
In the Euclidean vacuum, an observer in the static patch will see a temperature.∗ This
is the same reason we discussed above for Rindler space and then for the Schwarzschild
spacetime in the Hartle-Hawking state. The Euclidean path integral prepares a state
entangled between the left and right static patches, and when we trace over the hidden
region, the resulting density matrix is thermal. To find the temperature we can apply
the imaginary time periodicity trick. Starting from the static patch coordinates, define
r = `(1 − 2 ) and expand in to find:
2 2
ds2 ≈ 2`2 (d2 − dt ) + · · · (7.16)
`2
This looks like Rindler, so the Euclidean continuation is regular only if t ∼ t + 2πi`.
Therefore the de Sitter temperature is
1
TdS = . (7.17)
2π`
This is the temperature that you will read on your thermometer if you are an inertial
observer in de Sitter. (Due to dark energy we are now at the start of a new de Sitter
epoch. The present-day de Sitter temperature is the Hubble scale, T ∼ 10−33 eV .)
Entropy
The area of the cosmological horizon is
So in D = 4, S =area/4 gives
SD=4 = π`2 . (7.19)
∗
See Gibbons and Hawking, “Cosmological event horizons, thermodynamics, and particle creation,”
1977.
80
7.3 Action
The Euclidean action of dSD is the action of S D . This is finite and there are no
boundary terms to worry about on a sphere, so it’s straightforward to calculate. In
D = 4, the Euclidean action is∗
`4
√
Z
1 12 6
IE = − g(R − 2Λ) = − 2
− 2 V ol(S4 ) = −π`2 . (7.20)
16π SD 16π ` `
S = log Z . (7.21)
Here’s why: Recall the thermodynamic identity log Z = −βF = −βE + S. The energy
in GR is a pure boundary term (see next lecture!), so a compact space has E = 0. Thus
thermodynamics predicts log Z = S, and that’s exactly what we found in de Sitter.†
de Sitter is mysterious
We will not say much more about de Sitter space in this course. A big reason for this
is that we don’t have any UV-complete theory of gravity in de Sitter, like we do in
anti-de Sitter. We also have no clear answer to the question ‘What is the de Sitter
entropy?’ (Does it count the microstates of something?) Since we live in de Sitter, this
seems like a very important question.
81
Compute the entropy and the on-shell action of D-dimensional de Sitter space, and
verify the relation S = log Z.
82
8 Symmetries and the Hamiltonian
[References: The original paper is very nice and still worth reading, especially sections
1-3: “The Dynamics of General Relativity” by Arnowitt, Deser, Misner (ADM), 1962
(but available on arXiv at gr-qc/0405109). See also appendix E of Wald’s textbook,
and for full detail see Poisson’s Relativist’s Toolkit chapter 4.]
Time plays a special role in the canonical formulation of quantum mechanics, and in
the Hamiltonian approach to classical mechanics, since it is the independent variable.
In GR, time t is just an arbitrary parameter, and the dynamics are reparameterization-
invariant under t → t0 (t), since this is just a special case of diffeomorphisms. To see
how this fits into Hamiltonian mechanics we first consider a simple analog in quantum
mechanics.
Suppose we have a system with a single degree of freedom q(t) with conjugate momen-
tum p, and action Z
I= dtL. (8.1)
dp ∂H dq ∂H
=− , = . (8.3)
dt ∂q dt ∂p
83
The independent variable t is special. It labels the dynamics but does not participate
as a degree of freedom. In GR, time is just an arbitrary label – it is not special, and
the theory is invariant under time reparameterizations. To mimic this in our simple
system with 1 dof, we will introduce a fake time-reparameterizations symmetry. To
do this we label the dynamics by an arbitrary parameter τ , and introduce a physical
‘clock’ variable T , treated as a dynamical degree of freedom. So instead consider the
system of variables and conjugate momenta
where prime = d/dτ . Here N (τ ) is a Lagrange multiplier, which enforces the ‘constraint
equation’
Π + H(p, q) = 0 . (8.6)
The action (8.5) is reparameterization invariant under τ̄ = τ̄ (τ ), since after all τ is just
a label that we invented. The Hamiltonian of the enlarged system is simply
To reverse the procedure, i.e., to go from the parameterized action back to the ordinary
action with 1 dof, we plug in the constraint
Z
0
I = dτ [pq 0 − H(p, q)T 0 ] (8.8)
84
and then rewrite the dynamics in terms of the clock variable:
Z
0
I = dT [pq̇ − H(p, q)] (8.9)
where dot = d/dT . So we see that T is just the original physical time t.
∂
T0 = N (Π + H) (8.10)
∂T
But both T 0 and N are unspecified by the dynamics. For example we are free to pick
the ‘gauge condition’ T = τ , which corresponds to some particular choice of N (τ ).
where hij are the space components of the metric, and πij are their canonical conjugates.
The full spacetime metric is parameterized as
p
N = −1/g tt is called the ‘lapse’ and N i = N 2 g ti is the ‘shift’. These are Lagrange
multipliers, just like N in our discussion above. They are not fixed by the dynamics,
but a choice of parameterization. In other words, any geometry can be sliced into
‘time’ and ‘space’ in a such a way that N and N i can be set to any functions you like.
They are called the lapse and shift because they correspond to our choice of how our
coordinates on a time-slice of fixed t = t0 are related to the coordinates on a time-slice
of fixed t = t0 + δt. The flow vector, which tells you the arrow of time from one slice
85
to the next, is∗
ζ µ = N uµ + N µ . (8.13)
√ √
Z Z
1 4 1
I= d x −gR − d3 x −γ(K − K0 ) (8.14)
16π M 8π ∂M
where K0 is the subtraction term (the extrinsic curvature of the boundary embedded
in flat spacetime). Recall that the on-shell variation is
√
Z
δIon−shell = 1
2
d3 x −γT ij δgij , (8.15)
∂M
1
T ij = K ij − γ ij K − background subtraction .
(8.16)
8π
After quite a bit of work† , the full off-shell action (8.14) can be written
√
Z h i Z
4 ij i
I= d x π ḣij − N H − N Hi − d3 x σuµ Tµν ζ µ , (8.17)
M ∂M
√
Z Z
3 i
H[ζ] = d x (N H + N Hi ) + d2 x σuµ Tµν ζ ν , (8.18)
Σ ∂Σ
Now to explain all these terms: H and Hi are called the Hamiltonian and momentum
constraints, which are essentially the G00 and G0i components of the Einstein equations
(see Wald for explicit formulae). These components of the equations of motion involve
only 1st time derivatives. They are called ‘constraints’ because if we think of GR as
an initial value problem – specify initial data, then evolve in time according to the
∗
Here N µ = hµa N a , where hµν = gµν + uµ uν , i.e., hµa is the projector onto a spatial slice.
†
See Brown and York, “Quasilocal energy and conserved charges derived from the gravitational
action, ” 1993, and also Poisson’s Relativist’s Toolkit, Chapter 4.
‡
The bulk term is called the ‘ADM Hamiltonian’. As far as I know, the boundary terms were first
derived by Brown and York, and by Hawking and Horowitz.
86
dynamical equations – these are constraints on the allowed initial data hij , ḣij at t = 0.
This is in contrast to other, dynamical equations of motion, which tell you how that
data evolves in time.∗ Finally uµ is the timelike unit normal on the boundary, with
u2 = −1.
• The bulk term vanishes on-shell due to the constraint equations. The boundary
term does not vanish in general. This is related to the fact that diffeomorphisms
acting on the boundary are ‘real’ dynamics, whereas diffeomorphisms away from
the boundary are just redundancies.
• We have written the Hamiltonian as a functional of the lapse and shift, since
the dynamics leaves ζ unspecified. This corresponds to a choice of time evolu-
tion. That is, the Dirac bracket† of the Hamiltonian with any function X of the
canonical variables is
{H[ζ], X} = Lζ X . (8.19)
If we choose, for example, ζ µ = (1, 0), then this Hamiltonian generates time
evolution in the t-direction.
• The on-shell Hamiltonian looks just like the Hamiltonian of a 3-dimensional the-
ory living on the boundary with a 3-dimensional stress tensor Tµν . We will see
that at least in AdS this is actually literally the case.
87
identify the canonical coordinates and conjugate momenta, and rewrite it like we did
for gravity in (8.17). Identify the Lagrange multiplier(s) and constraint(s).
In gravity, we gave a physical interpretation of the lapse and shift Lagrange multipliers
as a choice of foliation of spacetime. What is the analogous interpretation of At in
electrodynamics? (It might be useful to couple to a matter field to answer this.)
Reference: Appendix E of Wald. But write your answers in terms of the vector
~ and B.
potential, not E ~
8.3 Energy
√
Z
E ≡ H[ζ(t) ]|on−shell = d2 x σui Tit . (8.20)
∂Σ
This is the usual (covariantized) expression for the energy in terms of the stress tensor.∗
αj
∇i T ij = −nα Tmatter , (8.21)
where n is the spacelike unit normal to the boundary. Therefore, if we impose the
boundary condition that matter fields go to zero fast enough as r → ∞, then the
∗
This equation agrees with other definitions of energy you may have seen, like the Komar formula,
whenever those definitions apply.
†
i.e., the ‘constraints’ in a radial slicing of the spacetime, which contain only first order r-
derivatives.
88
boundary stress tensor is conserved,
∇i T ij = 0 . (8.22)
If in addition
∇(i ζj) = 0 as r→∞ (8.23)
√ √ √
Z Z Z
0 2 i 2 i
d3 x −γ∇i T ij ζj = 0 .
E(Σ) − E(Σ ) = d x σu Tit − d x σu Tit =
∂Σ ∂Σ0
(8.24)
The equation (8.23) is the Killing equation, so the conclusion is that energy is conserved
as long as (i) matter fields fall off fast enough near infinity, and (ii) ζ = ∂t is an
asymptotic Killing vector.
Other asymptotic Killing vectors will similarly lead to conserved quantities. For ex-
ample, if ζ = ∂φ satisfies (8.23), then we can define the conserved charge
√
Z
J= d2 x σui Tiφ . (8.25)
∂Σ
This is in fact the angular momentum, and agrees with all the usual formulae for
computing the angular momentum of a spacetime.
We could also define boost charges, and get the full Poincare group. This requires some
modifications, since in this discussion ζi was a vector within the fixed ∂M , whereas
89
boosts act on ∂M . The results are similar.
We have seen that the bulk Hamiltonian vanishes, but there are boundary terms that
compute conserved charges. Now I will try to explain physically what is going on here.
This is important, so I’ll rephrase: General relativity is invariant under local diffeo-
morphisms. These are like gauge symmetries: fake symmetries, redundancies, that do
not change the physics and are just a convenient human invention to describe massless
particles. However it is not invariant under diffeomorphisms that reach the boundary.
The coordinates as r → ∞ are actually important and meaningful, like the coordinates
in a non-gravitational theory. A time reparameterization with compact support, i.e.,
t → t0 (t, x) such that t0 → t as r → ∞, is a local diff and involves no physics. A global
time shift t → t + 1 acts at infinity and is true time evolution.
The bulk terms in the Hamiltonian, i.e., the constraints, correspond to local diffs, and
the boundary terms correspond to diffs that reach the boundary. That is why the bulk
term vanishes on shell and the boundary term does not.
Certain diffs that reach infinity are actual symmetries. By ‘actual’ symmetries, I
mean symmetries that act on the space of states in the theory: they take one state
to a distinct but related state with similar properties, as opposed to gauge symmetries
which physically do nothing.
90
Asymptotic symmetries in U (1) gauge theory
The precise version of all these statements is the formalism of asymptotic symmetries.
The definition of the asymptotic symmetry group is the group of symmetry transfor-
mations modded out by trivial symmetries,
symmetries
ASG = . (8.28)
trivial symmetries
The definition of a ‘trivial symmetry’ is one whose associated conserved charge vanishes.
where the second term indicates the usual phase rotation on charged matter. These
are gauge symmetries. A local gauge symmetry, ie a transformation for which Λ(x) has
compact support, does not have any conserved charge associated to it. In fact despite
this infinite number of symmetries we know electromagnetism has only one conserved
quantity, the total charge
Z Z
3 0
Q∼ d xJmatter ∼ d2 x Ftr . (8.30)
Σ ∂Σ
This is the conserved charge associated to the global U (1) rotation – it exists and is
conserved even in the un-gauged theory. Thus the global rotation is physical, while
local phase rotations are just redundancies.
The definition (8.28) of the asypmtotic symmetry group is the group of all transfor-
mations, mod gauge transformations with zero associated charge. Therefore in electro-
magnetism,
ASG = U (1)global . (8.31)
91
for which ∇(i ζj) → 0 at infinity. In asymptotically flat spacetimes, ie spacetimes
approaching Minkowski space fast enough as r → ∞, these are simply the Killing
vectors of Minkowski space. Thus the asymptotic symmetry group of asymptotically
flat spacetimes is the Poincare group.∗
This notion is important, because general spacetimes have no isometries, and there-
fore no local conserved charges. (For example, there is no ‘energy’ conserved along
the geodesic of a probe particle.) Asymptotic symmetries allow us to define global
conserved quantities in this situation.
[Pµ , Pν ] = 0 (8.32)
1
[Mµν , Pρ ] = ηµρ Pν − ηνρ Pµ (8.33)
i
1
[Mµν , Mρσ ] = ηµρ Mνσ − ηµσ Mνρ − ηνρ Mµσ + ηνσ Mµρ . (8.34)
i
If we just label the generators as V A for A = 1 . . . 10, then this is just a Lie algebra
i[V A , V B ] = f AB
CV
C
(8.35)
V A ↔ ζ (A)µ , A = 1 . . . 10 . (8.36)
For example P µ ↔ ∂µ , Mtx ↔ t∂x + x∂t , etc. The Killing vectors obey the same
algebra, under the Lie bracket:
[ζ A , ζ B ]µLB ≡ ζ Aν ∂ν ζ Bµ − ζ Bν ∂ν ζ Aµ = f AB
Cζ
Cµ
. (8.37)
92
Recall that conserved charges generate the action of the diffeomorphism under Dirac
brackets. That is, the charge
QA = H[ζ A ] (8.38)
generates
{QA , X}DB = Lζ A X . (8.39)
For this to be consistent with the algebra, the charges themselves must obey the same
algebra:
{QA , QB }DB = f ABC QC . (8.40)
In other words,
{H[ζ], H[χ]}DB = H [ζ, χ]LB + constant , (8.41)
where we have allowed a constant ‘central charge’ term in the algebra of charges, since
this would still be consistent with the action of the generators on X (and actually does
appear in important examples).
Sometimes the ASG leads to surprises. A famous example is in anti-de Sitter space.
The isometry group of AdSD is SO(D − 1, 2). So a natural guess is that the asymptotic
symmetry group of asymptotically-AdS spacetimes is also SO(D − 1, 2). This is true
for D > 3 but wrong in D = 3, as shown by Brown and Henneaux. We will talk about
this more later.
The linearized solution of GR that carries both energy and angular momentum is
4j sin2 θ
2 2M 2 2M
ds = − 1 − dt + 1 + (dr2 + r2 dΩ2 ) − dtdφ . (8.42)
r r r
This is, for example, the metric far away from a Kerr black hole, or a rotating planet.
We will compute the energy and angular momentum using the on-shell Hamiltonian
(8.18). Here it is again, after enforcing the constraints:
√
Z
H[ζ] = d2 x σui Tij ζ j . (8.43)
∂Σ
93
The energy is associated to ζ = ∂t and the angular momentum to ζ = ∂φ .
Kinematics
We want to compute T ij . This is a tensor living on ∂M , which is the surface r = r0 .
To define tensors on ∂M , we first compute the unit normal to the ∂M ,
r
2M
nµ dxµ = 1+ dr . (8.44)
r
The full metric can be split into the normal and tangential parts as
γνµ projects onto the boundary, since nµ γνµ = 0. The components γiµ for µ = t, r, θ, φ and
i = t, θ, φ can be used to turn spacetime tensors into boundary tensors, and vice-versa:
Vi ≡ γiµ Vµ . (8.46)
M
uµ dxµ = (−1 + + O(1/r2 ))dt . (8.48)
r
(This could be used to define the induced metric from hµν = gµν + uµ uν and corre-
sponding projector but we won’t need these to compute the charges.) Projecting the
timelike normal onto the boundary doesn’t change anything, we still have
M
ui dxi = (−1 + + O(1/r02 ))dt , (8.49)
r0
Finally, we need the volume element of the boundary at fixed time. The induced metric
94
on ∂Σ is
2M
σAB dxA dxB = r02 (1 + )(dθ2 + sin2 θdφ2 ) , (8.50)
r0
with volume element
√
2 2M
σ = r0 1 + sin θ . (8.51)
r0
Stress tensor
The extrinsic curvature of ∂M is
As a boundary tensor,
Kij = hµi hνj Kµν . (8.53)
The trace of K is the same whether we use Kµν or Kij (check this!). It is
2 3M
K=− − 2 + O(r0−3 ) . (8.54)
r0 r0
Now we compute the stress tensor from its definition (ignoring the background sub-
traction for now), Tij = Kij − γij K. (I’ve rescaled it by 8π to unclutter notation, but
will put the 8π back in the Hamiltonian below.) It has components
2 8M 5j sin2 θ
Ttt = − + 2 , Ttφ = − , Tθθ = r0 + M, Tφφ = sin2 θ(r0 + M ) (8.55)
r0 r0 r02
plus higher order terms O(M 2 /r02 ). (In this equation we are still ignoring the back-
ground subtraction, we will deal with that below.)
Energy
The energy is the on-shell Hamiltonian for ζ = ∂t . Putting it all together, we have so
far for the energy Z
1
Eunsub = 2r0 sin θ = −r0 , (8.56)
8π ∂M
where ‘unsub’ means we have not dealt with the background subtraction yet.
95
To do the background subtraction, we repeat the whole calculation on the flat spacetime
2M 2M
ds2sub =− 1− 2
dt + 1 + (dr2 + r2 dΩ2 ) . (8.57)
r0 r0
Going through all the steps again, the subtraction term is Esub = −r0 − M . Therefore
the final answer is
E=M , (8.58)
as expected.
Angular momentum
The angular momentum is the on-shell Hamiltonian for ζ = −∂φ .† There is no back-
ground subtraction necessary. We find
Z
1
J= 3j dθdφ sin3 θ = j . (8.59)
8π ∂Σ
∗
We could include the angular momentum term, but we can shift the time coordinate to make
it O(1/r02 ) and it does not contribute. Put differently, we really only need to embed ∂Σ into flat
spacetime, not all of ∂M , so this is only important for the energy and we can ignore the angular
momentum.
†
The minus sign here is the standard convention. It is related to the fact that a mode e−iωt+imφ
carries energy E = +ω and angular momentum J = +m.
96
9 Symmetries of AdS3
This section consists entirely of exercises. If you are not doing the exercises, then read
through them anyway, since this material will be used later in the course. The main
goal of this section is derive the famous result of Brown and Henneaux on the central
charge of AdS3 . This was done in the 80s, using slightly different techniques from what
we’ll use here, and later came to play an important role in AdS/CFT, as we’ll see later.
XA X A = −`2 (9.1)
To find intrinsic coordinates on AdS3 , we just need to solve (9.1). One way to solve
this equation is by
Answer:
ds2 = `2 (− cosh2 ρ dt2 + dρ2 + sinh2 ρ dφ2 ) . (9.4)
These are global coordinates on AdS3 . Although on the hyperboloid (9.1) we can see
from (9.3) that t is a periodic coordinate, when we say ‘AdS3 ’ we will always mean
97
the space in which t is ‘unwrapped’, t ∈ (−∞, ∞) (the universal covering space of the
hyperboloid).
AdS3 inherits the isometries of the embedding space that preserve the hyperboloid.
(For the same reason that the isometries of S 2 are inherited from rotations in R3 .) The
group of rotations+boosts in a 4d geometry with signature (+, +, −, −) is SO(2, 2), so
we expect this to be the isometry group of AdS3 . In this problem we’ll confirm this.
V = X 1 ∂X 0 + X 0 ∂X 1 (9.5)
in the embedding space (9.2). This preserves the hyperboloid, since under X A →
X A − V A , the lhs side of (9.1) is unchanged to linear order (check this).
Write V as an isometry of AdS3 , in the coordinates (9.4). To do this, first define the
projection tensor
∂X A
PµA = (9.6)
∂xµ
where xµ are the coordinates of AdS3 . This can be used to convert the 4-vector V A
into a tensor living on the hyperboloid,
χµ = PµA VA . (9.7)
This same procedure can be used to find all of the Killing vectors of AdS, but I will
98
spare you the trouble. The answer, in a convenient basis, is
1
tanh(ρ)e−i(t+φ) ∂t + coth(ρ)e−i(t+φ) ∂φ + ie−i(t+φ) ∂ρ
ζ−1 = 2
(9.8)
1
ζ0 = (∂
+ ∂φ )
2 t
Note that the subscripts here are just labels, not spacetime indices.
That is, the Killing vectors obey this algebra under Lie brackets, with an additional i,
for example
i{ζ1 , ζ−1 }LB = 2ζ0 , etc. (9.10)
The barred zetas in (9.8) commute with the unbarred zetas, and form another SL(2, R)
algebra. Therefore the isometries of AdS3 form the algebra
The subscripts mean ‘left’ and ‘right’, since the ζ’s involve only the ‘left-moving’ com-
bination t + φ and the ζ̄’s involve the ‘right-moving’ combination t − φ.
Note that as a Lie algebra, of SO(2, 2) = SL(2, R) × SL(2, R). This is a special feature
of AdS3 . In general the AdSD isometry group is SO(D − 1, 2), which does not split
into two factors.
99
9.3 Exercise: Conserved charges
and expand the metric (9.4) at large r. Show that to leading order
dr2
2
ds = ` 2
− r2 dt+ dt− . (9.13)
r2
These are called Poincaré coordinates, and in fact this metric is an exact solution of
Einstein’s equation – it covers a subregion of AdS3 called the Poincaré patch.
dr2
2
ds = ` 2
2
− r2 dt+ dt− + h++ (dt+ )2 + h−− (dt− )2 + 2h+− dt+ dt− , (9.14)
r
where the h’s are arbitrary functions of t+ and t− but independent of r. We will
compute the boundary stress tensor (Brown-York tensor) and use it to define the
energy and other conserved charges in AdS3 .
The boundary stress tensor is defined as the variation of the on-shell action
2 δSon−shell
T ij ≡ √ (9.15)
−γ δγij
√ √ √
Z Z Z
1 1 a
S[g] = −g(R − 2Λ) + −γK + −γ . (9.16)
16π M 8π ∂M 8π ∂M
For the bulk term and Gibbons-Hawking term, we can use our formulae from flat space
given in previous lectures. The last term is a counterterm, which takes the place of the
∗
Specifically, the subleading components of the metric must have a certain fall-off at large r. These
conditions are basically chosen so that the Hamiltonian can be defined.
100
‘background subtraction’ we did in flat space. This leads to
1 ij
T ij = K − Kγ ij + ãγ ij .
(9.17)
8π
Choose the counterterm coefficient ã so that Tij is finite as the cutoff surface r0 → ∞.
Compute T++ , T−− , and T+− to first order in the perturbation hij in the limit r0 → ∞.
where ui is the timelike normal to a fixed-t slice, and ζ = ∂t . (The overall 1/` is a
convention, necessary due to the fact we are using dimensionless coordinates.)
dr2
2 2 2 2
ds = ` −(r − 8M )dt + 2 + r2 dφ2 . (9.19)
r − 8M
To use your results of the previous problem in this calculation you must first change
coordinates to put it in the form (9.14). In particular you will need to redefine r0 = r0 (r)
to eliminate the perturbation to grr .
(e) Compute the energy of global AdS, by keeping the subleading terms in the coordi-
nate transformation (9.12) and plugging them into your formula for the stress tensor.
(Hint: the answer is negative. That’s OK, this is just a choice of zero for energy.)
Comment: We’ve focused on the energy, but we could compute conserved charges
corresponding to all the other Killing vectors in exactly the same way.
101
10 Interlude: Preview of the AdS/CFT correspon-
dence
The rest of this course is, roughly speaking, on the AdS/CFT correspondence, also
known as ‘holography’ or ‘gauge/gravity duality’ or various permutations of these
words. AdS/CFT was conjectured by Maldacena in a famous paper in 1997. A full
understanding of Maldacena’s motivations and results, and the huge body of work to
follow, requires some string theory, but AdS/CFT itself is independent of string theory
and we will not follow this route. Instead we will ‘discover’ AdS/CFT by throwing
stuff at black holes. In fact, this parallels the historical discovery of AdS/CFT in
1996-1997, though we will obviously take a shorter path. Our starting point will be a
black-hole-like solution in 6 dimensions, which might seem umotivated, so the purpose
of this interlude is to describe where we are headed, so you know we are doing this for
a good reason.
Anti-de Sitter space is the maximally symmetric solution of the Einstein equations
with negative cosmological constant. We worked out the metric of AdS3 in global and
Poincaré coordinates in the previous section. For general dimension AdSd+1 , the metric
in global coordinates is
To find the Penrose diagram, we can extract a factor of cosh2 ρ and then define a new
coordinate by
dρ
dσ = ⇒ σ = 2 tan−1 tanh(ρ/2) . (10.2)
cosh ρ
As ρ runs from 0 to ∞, ρ runs from 0 to π/2. Each value of t, σ is a sphere S d−1 . There-
fore the Penrose diagram looks like a solid cylinder, where ρ is the radial coordinate of
the cylinder, and t, Ω are the coordinates on the surface of the cylinder.
Unlike flat space, the conformal boundary (usually just called ‘the boundary’) of AdS
is timelike. From the Penrose diagram, we can see that massless particles reach the
102
boundary in finite time t. (Massive particles cannot reach the boundary; they feel an
e2ρ potential if they try to head to large ρ.)
`2
2
ds = 2 dz 2 − dt2 + d~x2 ,
(10.3)
z
A conformal field theory (CFT) is a QFT with a particular spacetime symmetry, con-
formal invariance. Conformal invariance is a symmetry under local scale transfor-
mations. We will discuss this in detail later. For now I will just mention that one
consequence of conformal symmetry is that correlation functions behave nicely under
coordinate rescalings x → λx. Correlation functions of primary operators (which are
lowest weight states of a conformal representation) obey
hO1 (x1 )O2 (x2 ) · · · On (xn )i = λ∆1 +∆2 +···+∆n hO1 (λx1 ) · · · On (λxn )i (10.4)
where ∆i is called the scaling dimension of the operator Oi . This (together with
rotation and translation invariance) implies for 2pt functions
1
hO(x)O(y)i ∝ . (10.5)
|x − y|2∆
The simplest example of a CFT is a free massless scalar field, where for example in
4d hφ(x)φ(y)i = (x − y)−2 . A massive free field is not conformal, since m shows up
in correlation functions and spoils the simple power behavior. This is generally true –
CFTs do not have any dimensionful parameters, so there can be no mass terms in the
Lagrangian. However the converse is not true, since there are theories with no mass
terms in the classical theory are not necessarily conformal. For example in massless
QCD, scale symmetry is broken in the the quantum theory so the theory acquires a
103
dimensionful parameter via dimensional transmutation.
There are also very nontrivial interacting conformal field theories. We will discuss a
couple of examples later.
It is often useful to think of the CFT as ‘living at the conformal boundary’ of AdS.
Indeed, the CFT lives in a spacetime parameterized by x = (t, ~x), whereas gravity
fields are functions of x and the radial coordinate ρ. And when we discuss correlation
functions of local operators we will see that a CFT point x corresponds to a point
on the conformal boundary of AdS. But it is not quite accurate to say that the CFT
lives on the boundary, for two reasons. First, we should not think about having both
theories at once; we either do CFT or we have an AdS spacetime, never both at the
same time. Second, the CFT is dual to the entire gravity theory, so in a sense it lives
everywhere.
The two theories are commonly referred to as ‘the bulk’ (i.e., the gravity theory) and
‘the boundary’ (ie the CFT).
In this course we will mostly restrict our attention to two types of observables in
AdS/CFT: thermodynamic quantities and correlation functions.
∗
Some people might obect to the word ‘any’ here. To be safe, we could say ‘any theory that we
know how to define in the UV and acts like ordinary gravity+QFT in the IR.’
104
Thermodynamics
The mapping between thermodynamic quantities on the two sides of the duality is
simply that they should be equal, for example the thermal partition functions obey
For the exact relation (10.6) we must in principle include all the quantum corrections
to this semiclassical formula.
Correlation functions
The goal of the next couple lectures is to derive the dictionary that relates CFT
correlators to a gravity calculation. We will give the exact prescription later, but
here is the general idea. Each field φi (ρ, x) in the gravitational theory there is a
corresponding operator Oi (x) in the CFT.∗ The mass of φ determines the dimension
of O. CFT correlation functions can be computed on the gravity side by computing a
gravity correlator of φ, with the points inserted at the boundary:
hO1 (x1 ) · · · On (xn )icf t ↔ “ lim ”hφ1 (ρ, x1 ) · · · φn (ρ, xn )igravity (10.8)
ρ→∞
The limit is in quotes because actually we need to rescale by some divergent factors
that we’ll come to later.
105
detailed microscopic definitions. For example: Type IIB string theory on AdS5 × S 5 is
dual to N = 4 supersymmetric Yang-Mills in 4d.
There are other microscopic examples — deformations of this one, and different versions
in different dimensions, with different types of dual CFTs. All of them (as far as I
know) come from brane constructions in string theory. This is often called the ‘top
down’ approach to AdS/CFT.
Another approach is to simply assume that we have a CFT with some low-dimension
primaries with a particular pattern, and perhaps with some assumptions about the
symmetries and conserved charges of the theory. This is more in the spirit of effective
field theory and is often called ‘bottom up.’ In many cases we can also include informa-
tion about the UV completion of the CFT (i.e., very high dimension operators) in this
approach so it actually goes beyond effective field theory, but without every specifying
the actual Lagrangian of the CFT.
Both approaches are important. Often calculations that can be done in one approach
are impossible in the other, or calculations first done microscopically turn out to have
more general and possibly more intuitive explanations via effective field theory.
106
11 AdS from Near Horizon Limits
Anti-de Sitter space appears in the near horizon region of extremal black holes. In this
section we will describe how the near-horizon limit of extremal Reissner-Nordstrom is
AdS2 × S 2 . Since the case d = 1 (i.e., AdS2 ) is a special case of AdS/CFT that we
would like to mostly avoid, we then discuss the 6d black string. This solution has a
near-horizon AdS3 which will serve as our main example for AdS/CFT.
dr2 2M Q2
ds2 = −f (r)dt2 + + r2 dΩ22 , f (r) = 1 − + 2 . (11.1)
f (r) r r
Recall that this solution is restricted to M > Q (we assume Q > 0) by cosmic censor-
ship. In general, the near horizon geometry is approximately Rindler ×S 2 , as discussed
above. But something special happens in the extremal limit,
M =Q. (11.2)
In this limit the horizon is a double zero, f (r) = (1 − Q/r)2 . So the inner and outer
horizons coincide, r+ = r− = Q, and the Hawking temperature (2.19) is zero.
To take the near horizon limit of the extremal black hole, we define
where λ is an arbitrary parameter. Plugging this into the extremal metric and taking
the limit λ → 0 with z, T, θ, φ held fixed gives the spacetime
Q2
ds2 = 2 2
+ Q2 dΩ22 .
2
−dT + dz (11.4)
z
The metric (11.4) is AdS2 × S 2 . This is the near horizon region of the original black
hole, since if λ → 0 with z held fixed, r → Q. Recall that this spacetime is supported
107
by some nontrivial electric field. Applying the same procedure to the field strength
gives a constant electric field in the near horizon region.∗
λ has disappeared entirely from the solution. This means that (11.4) (together with the
constant electric field) is actually by itself a solution to the Einstein-Maxwell equations.
This did not happen for the non-extremal case: there, the spacetime was only approxi-
mately Rindler near the horizon, and Rindler ×S 2 does not solve the Einstein-Maxwell
equations.
Global coordinates
The near-horizon metric we found in (11.4) is AdS2 in Poincaré coordinates. This
covers only a patch of the full AdS spacetime. Similarly, we considered only one patch
of the Reissner-Nordstrom spacetime. The full global AdS2 includes many Poincaré
patches, and each patch gives the near-horizon region of a different patch of the global
Reissner-Nordstrom. This is illustrated in the Penrose diagrams in figure 4.
108
Figure 4: Penrose diagrams for extremal Reissner-Nordstrom and AdS2 . The AdS2 Pen-
rose diagram is a ‘zoomed in’ version of the RN diagram which includes only the near
horizon region. Unlike higher-dimensional AdS, AdS2 has two conformal boundaries,
which are the blue lines on the left and right. In the RN diagram, the coordinates are
degenerate near the horizon so these boundaries are drawn as dashed blue lines slightly
away from r = r+ . The black hole horizon in RN (red) is the same as the Poincaré
horizon in AdS2 . The Poincaré patch of AdS2 is shaded green in both diagrams.
109
prefactor in front of AdS2 blow up near the boundary, after a conformal rescaling the
sphere just drops out. So points on the conformal boundary are labeled only by T , not
by T, θ, φ.
dt
E ≡ −pt = f (r) . (11.6)
dτ
For a particle in the near-horizon region, this is strictly zero as λ → 0. So from the
point of view of an observer at infinity, everything in the near-horizon region is infinitely
redshifted. Similarly, a wave in the near-horizon region ∝ e−iωnear T has zero frequency
as measured from infinity, since
Unfortunately AdS2 is the runt of the AdS/CFT litter. It is very interesting in its
own right but quite different from other dimensions (since CFT1 does not really make
sense) so not suitable for our purposes. We will focus on AdS3 instead, which appears
in the near horizon limit of a 6d black string (among other things). In the exercises
you will treat the other most popular example of AdS/CFT which involves AdS5 .
The 6d black string is similar to a black hole, but with horizon topology S 3 × S 1 . The
6d metric is∗
r02
2 −1/2 2 2 2
ds = (f1 f5 ) −dt + dφ + 2 (cosh σdt + sinh σdφ) (11.8)
r
2
1/2 dr 2 2
+ (f1 f5 ) + r dΩ3 .
1 − r02 /r2
∗
My favorite references on this solution and the discussion to follow are Kiritsis’s textbook, section
12.7, and MAGOO hep-th/9905111. Note that I will not distinguish between the Einstein-frame
metric and string-frame metric in this discussion; they differ by just a constant in the near-horizon
region which can be absorbed into the definition of `.
110
where
r12 r52
f1 = 1 + , f5 = 1 + , (11.9)
r2 r2
and φ ∼ φ + 2πR is compact. I will not write the other fields but there are some
nontrivial scalars and gauge fields that can be found in the references. This solution
carries two charges, called Q1 and Q5 , related to r1 and r5 . (In string theory these count
the number of D1 branes and D5 branes.) It also carries a momentum proportional to
r0 sinh σ along the φ direction, as you can guess from the boost term cosh σdt+sinh σdφ.
Finally r0 is the position of the horizon and measures the deviation from extremality.
−1
To see this, note the surface gravity is proportional to ∂r grr |r=r0 .
This is called the D1-D5-P black string. (Often in the literature you will find that it
is dimensionally reduced to 5d along the φ direction, so it becomes a 3-charge black
hole.)
First we consider the case with zero momentum. The extremal D1-D5 with P = 0 is
obtained by setting r0 = 0,
ds2 = (f1 f5 )−1/2 (−dt2 + dφ2 ) + (f1 f5 )1/2 (dr2 + r2 dΩ23 ) . (11.10)
`2 = r1 r5 (11.11)
scale
r → λ`r , t → t`/λ , φ → φ`/λ , (11.12)
and send λ → 0. This has the effect of just dropping the 1 in fi = 1 + ri2 /r2 , so
dr2
ds2near =`2 2 2 2
+ r (−dt + dφ ) + `2 dΩ23 . (11.13)
r2
111
This is the geometry AdS3 × S 3 , where the curvature radii of AdS and of the sphere
are equal.
Near-extremal D1-D5-P
Now let us take a different near-horizon limit of (11.8) where we simultaneously scale
r → 0 as we scale the black string towards extremality, r0 → 0. In this limit, r0 cosh σ
stays finite, so this is an extremal limit with finite momentum.
dr2
2
ds2near =`2 2 2
−r dt + 2 2 2 2
+ r dφ + r0 (cosh σdt + sinh σdφ) + `2 dΩ23 . (11.15)
r − r02
The term in brackets is in fact a 3d black hole, called the BTZ black hole. To see this
in more standard BTZ coordinates, define the parameters
ds2near 2 dw2 2
w + w − 2
2
= −h(w)dt + + w dφ + 2
dt + dΩ23 , (11.18)
` h(w) w
where
(w2 − w+
2
)(w2 − w−
2
)
h(w) = . (11.19)
w2
The w, t, φ part of this metric is a 3d black hole carrying mass and angular momentum,
with horizons at w± . Setting w− = 0 and w+ = 8M gives the J = 0 BTZ that was
used in some examples earlier in the course.
112
Exercise: AdS5 as near horizon limit
Difficulty: easy
Consider the 10D metric
where
r34
f =1+ . (11.21)
r4
r3 a constant parameter and ~x a coordinate on 4d space R4 . This metric is an extremal
black brane. (A black brane is like a black hole, but the horizon is a plane instead of a
sphere. In string theory, this solution is the geometry corresponding to a stack of Q3
D3 branes, where Q3 is a conserved charge of this solution, related to r3 .)
∆(r) ρ2 1
ds2 = − 2
(dt−a sin 2
θdφ)2
+ dr2 +ρ2 dθ2 + 2 sin2 θ(adt−(r2 +a2 )dφ)2 , (11.22)
ρ ∆(r) ρ
where
∆(r) = r2 + a2 − 2M r , ρ2 = r2 + a2 cos2 θ , (11.23)
and −M < a < M . This describes a rotating black hole with mass M and angular
momentum
J = aM . (11.24)
Find the value of M (as a function of J) where this black hole is extremal. Then find
the near horizon geometry of the extremal Kerr.
113
12 Absorption Cross Sections of the D1-D5-P
We will now throw a scalar field at a near-extremal D1-D5-P black string. In the
process we will rediscover AdS/CFT. The metric is (11.8), where we now assume
r0 r1 , r5 . (12.1)
This leads to a low Hawking temperature TH . (We also assume cosh σ, r1 /r5 ∼ O(1).)
Our goal is to calculate the absorption cross-section of a scalar field with low energy,
ωr5 1 . (12.2)
We will assume the scalar χ has zero momentum around the φ direction and on the
3-sphere. It is convenient to define
1 r0 eσ 1 r0 e−σ
TL = , TR = , (12.3)
2π r1 r5 2π r1 r5
which will turn out to be left and right moving temperatures in the dual CFT. These
are related to the Hawking temperature by
2 1 1
= + . (12.4)
TH TL TR
The wave equation χ = 0 for a scalar field of the form χ = e−iωt R(r) in the metric
(11.8) is
h d 3d 2
hr +ω f R=0 (12.5)
r3 dr dr
where
r2 r52 r02 sinh2 σ r02
f= 1 + 12 1+ 2 1+ , h=1− . (12.6)
r r r2 r2
This is now basically a 1d quantum mechanics problem. To get some intuition for this
scattering process, define
1
χ(r) = p ψ(r) . (12.7)
r(r2 − r02 )
114
In these variables, the wave equation is
d2
− 2 + V (r) χ = 0 (12.8)
dr
where V (r) is easy to find and plot, but annoying to write down. It looks like a well
near the horizon r = r0 , falls off at infinity, and has a lump somewhere in between.
This looks just like an ordinary Schrodinger equation, so we are just scattering through
a potential.
To compute the absorption cross-section, we need to solve the wave equation and com-
pare the coefficients of the incoming, transmitted, and reflected waves. The strategy
is to solve the equation approximately in the ‘near’ and ‘far’ regions, and match these
solutions together somewhere in the middle. The near and far regions are defined by
far: r r0 (12.9)
near: r r1,5 , r 1/ω (12.10)
The general solution of the wave equation in the far region is a linear combination of
Bessel functions, r
−3/2 πωr
Rf ar = r [AJ1 (ωr) + BY1 (ωr)] (12.11)
2
The general solution in the near region is
h i
Rnear = Ãh−i(a+b)/2 + B̃h+i(a+b)/2 2 F1 (−ia, −ib, 1 − ia − ib, h) (12.12)
with
ω ω
a= , b= . (12.13)
4πTR 4πTL
The boundary condition is that the wave is purely ingoing at the horizon r = r0 . This
sets B̃ = 0. Then we expand both Rnear and Rf ar in the matching region:
Γ(1 − ia − ib)
Rnear ≈ Ã + O(r02 /r2 ) (12.14)
Γ(1 − ia)Γ(1 − ib)
√
π
Rf ar ≈ A √ ω 3/2 + B-terms (12.15)
2 2
115
We have not written the B terms because they are messy, but we will use conservation
of flux to fix B later. Matching the terms in (12.14) gives
r
πω 3 A Γ(1 − ia − ib)
= Ã . (12.16)
2 2 Γ(1 − ia)Γ(1 − ib)
The Wronskian of the 2nd order wave equation is interpreted as the conserved flux,
1 3 ∗ dF
F≡ hr R ∂r R − cc , =0. (12.17)
2i dr
We would like to compare the incoming flux at infinity to the transmitted flux entering
the horizon. The far solution, expanded near infinity, is
Greybody factors
This is the greybody factor that appears in Hawking emission, up to a factor. The
factor is required since the relation between spherical waves that we considered and
plane waves is
e−iωr
e−iωz = K Y000 + · · · (12.22)
r3/2
p
where Y000 is the s-wave spherical harmonic on S 3 . The constant is K = 4π/ω 3 .
Therefore the absorption cross section for a plane wave is
116
This is the greybody factor.
(b) Now calculate the Hawking temperature of the BTZ black hole that appears in the
near horizon, (11.18) (again with σ = 0, so w− = 0).
Note that when we took the near-horizon limit of the near-extremal string, we sent
r0 → 0. So any finite temperature of the BTZ is actually zero temperature as viewed
from asymptotically flat infinity. There is an infinite redshift between the near horizon
region and infinity.
χ = m2 χ . (12.24)
In this problem we will derive the absorption cross section of a low-energy massive
scalar on the near-extremal black string.
(a) Derive the full radial wave equation from (12.24), in the black string geometry
(11.8) (but with σ = 0).
(b)Find the ‘near-region’ wave equation by starting with your answer to part (a) and
assuming r r1,5 and rω 1.
(c) Show that your near-region wave equation is identical to the massive wave equation
on the BTZ ×S 3 geometry,
dr̃2
ds2near 2
=` 2
−(r̃ − r02 )dt̃2 + 2 + r̃2 dφ̃2 + dΩ23 . (12.25)
r̃ − r02
117
(Where the tilded coordinates are proportional to the original coordinates.)
(c) Find the ingoing solution of the wave equation in the near region. Do not bother
with the far-region solutions, since these are messy and nothing interesting happens in
the far region.
Hint: Mathematica cannot solve this wave equation without some coaxing. To simplify
it, first change variables so h = 1 − r02 /r2 is your independent variable. Then define
√
R(h) = (1 − h)a hb ψ(h), with a = 21 (1 + 1 + `2 m2 ) and b = −iω`2 /2r0 . Then Math-
ematica can solve it. This is also the best method to solve it by hand (i.e., first strip
out the singular points, then reduce the result to a standard hypergeometric equation).
(d) Show that in the matching region r0 r r1,5 , the field behaves as
(e) If we think of the S term as the source, or ingoing term, and F as the response, or
outgoing term,† argue that the absorption cross section (of the near region) is propor-
tional to the imaginary part of the ratio,
F
Pabs ≡ Im (12.28)
S
and compute Pabs . (Including the far region too would just contribute some overall
uninteresting factors.)
∆
Pabs = k sinh(2πn)|Γ( + in)|4 (12.29)
2
∗
I’ve only written d so that the result is true in higher-dimensional AdSd+1 /CFTd . In this problem
always set d = 2.
†
The words ‘ingoing’ and ‘outgoing’ are not quite accurate here since these are power-law solutions,
not traveling waves, in the matching region. But they have a similar interpretation.
118
where k = k(r0 , ∆) is a simple constant you should find, and n ≡ `2 ω/(2r0 ).
F
GR = . (12.30)
S
This measures the response of the field to adding a source. (The relation (12.28) is a
version of the optical theorem for this Green’s function.)
Find GR in the high-frequency limit ω/TH 1. (This is the correlator in the extremal
limit, where temperature goes to zero.) You should find a power law. This power-law
behavior at short distances is the hallmark of a conformal field theory.∗
where ∆ is the scaling dimension of the operator. Take the 2d Fourier transform,
Z
+ +iω −
G(ωL , ωR ) ∼ dx+ dx− eiωL x Rx
(x+ x− )−∆ . (12.32)
Don’t worry about the coefficient; we only care about the power law, so you can do this
Fourier transform by dimensional analysis. Check that for ωL = ωR = ω, your answer
agrees with part (f). Therefore, the quantity ∆ that we introduced in the process of
the solving the wave equation is equal to a CFT scaling dimension.
119
The quasinormal modes of BTZ are modes which are ingoing at the horizon and have
S = 0 in (12.26).
(a) Find the spectrum of of quasinormal modes ωn for a massless scalar in BTZ.
(b) If you did the previous exercise, then use your solution of the near-region wave
equation to find the spectrum of quasinormal modes ωn for a massive scalar in BTZ.
120
13 Absorption cross section from the dual CFT
Now we will reproduce the absorption cross section (12.21) using holography. This re-
quires introducing some elements of conformal field theory. We will be more systematic
about CFT and about the AdS/CFT correspondence later, for now we are just going
to work this example in full detail as an illustration.
2d Conformal Transformations
A conformal transformation is a coordinate transformation that leaves the metric un-
changed, up to an overall rescaling:
First we want to find what type of coordinate changes have this special property. To
this end, consider an arbitrary coordinate change z = f (w, w̄), z̄ = f¯(w, w̄) where f¯ is
the complex conjugate of f . The metric in (w, w̄) coordinates is
¯
∂ f¯
2 ∂f ∂f ∂f
ds = dw + dw̄ dw + dw̄ . (13.4)
∂w ∂ w̄ ∂w ∂ w̄
121
For this to have the form (13.3) we must impose
∂f ∂ f¯ ∂f ∂ f¯
= =0. (13.5)
∂w ∂w ∂ w̄ ∂ w̄
The w coordinate labels a cylinder, since if we take w → w + 2πR we get back to where
we started. That is, w is identified,
w ∼ w + 2πR . (13.8)
This circle is a circle of constant magnitude on the z plane. (Draw the pictures for
yourself.) If we split w into real coordinates,
w = σ1 + iσ2 , (13.9)
122
Classical CFT
At the classical level, a QFT has conformal symmetry if the action is invariant under
conformal transformations. For example consider the action of a free massless scalar
Z
S= ¯
d2 z∂φ∂φ (13.10)
Perform the infinitesimal coordinate change z → w(z) and it is easy to check that
dw
the Jacobian in the measure cancels the factors that show up from ∂φ = ∂ φ.
dz w
On
the other hand the free massive scalar is not conformally invariant. This illustrates a
general feature of conformal field theories: they do not have any dimensionful param-
eters. Dimensionful parameters set a scale and therefore are not compatible with the
scale transformation z → λz, which is part of the conformal group (in any number of
dimensions).
Quantum CFT
Classical conformal invariance does not necessarily imply quantum conformal invari-
ance. This is familiar from QCD (setting all quark masses to zero) — this theory
is classically scale invariant, but to define the quantum theory we must introduce a
regulator, and this leads to the dimensionful QCD scale ΛQCD with important physical
consequence (like confinement), so QCD is certainly not scale-invariant or conformally
invariant at the quantum level. From now on when we say ‘CFT’ we mean at the
quantum level.
Primary operators
The local operators of a CFT must transform covariantly. Primary operators ∗ trans-
form with a simple rescaling,
−h −h̄
0 dw dw̄
O (w, w̄) = O(z, z̄) (13.12)
dz dz̄
∗
These are often called primary fields. The names are interchangeable. But remember that in
CFT, a ‘field’ is not necessarily a fundamental field that appears in the Lagrangian and is integrated
over in the functional integral. For example in the free massless scalar, ∂φ is a primary field.
123
where (h, h̄) are called the conformal weights. Another common notation is
∆ = h + h̄, s = h − h̄ , (13.13)
where ∆ is the scaling dimension and s is the helicity. ∆ is the weight under a constant
rescaling (x1 , x2 ) → (λx1 , λx2 ), ie under
the operator transforms with a factor of λ−∆ . s is the helicity because it is the weight
under a rotation (x1 , x2 ) → (x1 − λx2 , x2 + λx1 ), ie under
the operator transforms with a factor of λ−s . The absolute value |s| = |h − h̄| is the
spin of the operator. (This is just the usual definition of spin, so for example in free
field theory it corresponds to the number of Lorentz indices on a field.
Descendant operators are operators that you get from primaries by acting with confor-
mal transformations. For example, ∂O(z, z̄) is a descendant of O(z, z̄). The transfor-
mation law for descendants is more complicated than (13.12) but is completely fixed
by symmetry.
All local operators in a CFT are either primary or descendant. This ensures that
correlation functions transform covariantly under the conformal group. For example,
the 2-point function on the plane must have the form
C12
hO1 (z1 , z̄1 )O2 (z2 , z̄2 )i = (13.16)
(z1 − z2 )2h (z̄1 − z̄2 )2h̄
where
h = h1 = h2 , h̄ = h̄1 = h̄2 . (13.17)
C12 is a constant, related to the normalization of the field. The two-point function
vanishes if the conformal weights of the two fields are different.
124
The path-integral definition of hO1 (z1 , z̄1 )O2 (z2 , z̄2 )i in (13.16) is (up to normalization)
Z
hO1 (z1 , z̄1 )O2 (z2 , z̄2 )i = DΦO1 (z1 , z̄1 )O2 (z2 , z̄2 )e−S[Φ] (13.18)
where Φ stands for the fundamental fields of the theory.∗ Recall from our discussion
of Euclidean path integrals that the path integral on a half-plane prepares the vacuum
state. Therefore in operator langauge,
hO1 (z1 , z̄1 )O2 (z2 , z̄2 )i = line h0|O1 (z1 , z̄1 )O2 (z2 , z̄2 )|0iline , (13.19)
where |0iline is the vacuum state of the theory on an infinite line (which you can think
of as the Im z = 0 axis).
Remember from our discussion of Euclidean path integrals that QFT at finite temper-
ature in Lorentzian signature is related to Euclidean QFT on a cylinder, with periodic
imaginary time. Now we will see this relation very explicitly in CFT.
Mapping to the cylinder via w = iR log z, and applying the transformation law (13.12)
to (13.16), we can easily find the cylinder correlation function
R−2h R−2h̄
hOcyl (w1 , w̄1 )Ocyl (w2 , w̄2 )i ∼ 2h . (13.20)
w1 −w2 w̄1 −w̄2 2h̄
sin 2R sin 2R
(The ‘cyl’ subscript is usually dropped, so functions of w are just assumed to be cylinder
operators.)
125
(b) Derive (13.20), including the missing coefficient.
Note two things about this correlator: First, it is invariant under the cylinder period-
icity w1 ∼ w1 + 2πR.∗ Second, it has the same short-distance singularity as the plane
correlator (13.20), i.e.,
C12
hOcyl (w1 , w̄1 )Ocyl (w2 , w̄2 )i = as w1 → w2 . (13.21)
(w1 − w2 )2h (w̄1 − w̄2 )2h̄
This is always true in QFT: the short-distance behavior is fixed by vacuum correlation
functions. (In fact these two conditions fix the function (13.20) uniquely, assuming
some behavior at infinity, so we do not even strictly need the exponential mapping to
derive (13.20).)
From the Lorentzian point of view, the cylinder correlator (13.20) can be interpreted
different ways. To go to Lorentzian signature, write w = σ1 + iσ2 where σ1,2 are real
coordinates. If we think of σ2 as ‘time’, then the Wick rotation to Lorentzian signature
is σ2 = it. In this case the circle coordinate σ1 remains a circle in Lorentzian signature,
so this Wick rotation gives the Lorentzian theory on the Lorentzian cylinder S 1 × Time.
This Wick rotation has nothing to do with finite temperature.
To get the finite temperature theory, we instead Wick-rotate by setting σ1 = it. Thus
(Note that in Lorentzian signature, w and w̄ are no longer complex conjugates.) This
means that the the theory is periodic in imaginary time t ∼ t + 2πiR. Comparing to
the finite-temperature periodicity t ∼ t + iβ with β = T −1 , we see that our Euclidean
CFT is related to a finite-temperature CFT at temperature
β = T −1 = 2πR . (13.23)
∗
There are some subtleties with branch cuts making this statement that we’ll ignore for now, and
it relies on the fact that (−1)2(h−h̄) = 1 since operators must have integer or half-integer spin.
126
From (13.20), this means the finite-temperature Lorentzian correlator in CFT is∗
We now return to the derivation of the absorption cross-section (12.21). Recall that we
scattered a low-energy quantum from the near-extremal black string. The near horizon
region relevant to this calculation was a BTZ black hole in AdS3 (times S 3 ). We will
set TL = TR = TH for simplicity, which corresponds to setting the parameter σ = 0
in the black string metric. From the point of view of the near-horizon, this sets the
angular momentum of the BTZ black hole to zero.
This boundary was the matching location in our gravity calculation, ie some value of
r in the range r0 r r1,5 .
127
theory of quantum gravity. The only known microscopic CFTs are the ones coming
from string theory, since that is our only candidate theory of quantum gravity, but in
principle there could be other CFTs corresponding to other UV completions of gravity.
In the string theory examples, where the microscopic definition of the CFT is known,
it is also possible to match the coefficient in the absorption calculation and it comes
out correctly.
In this expression O is a CFT operator and χ(r = 0) — the value of the bulk field
at the fictitious membrane where the CFT lives — is treated as a classical source.
We will assume that the space direction in the CFT is unwrapped, so we call it x ∈
(−∞, ∞) (previously called φ), though the S 1 version can also be done with some extra
assumptions about the CFT. We also assume the source couples weakly to the CFT so
that the interaction term (13.26) can be treated perturbatively.
Absorption rate
When we computed the absorption cross section, we assume
The total absorption rate at temperature β is computed by summing this over final
states, and averaging over initial states with a thermal ensemble,
X Z
−βEi
Γabs ∼ e dt1 dx1 dt2 dx2 e−iω(t1 −t2 ) hi|O(t2 , x2 )|f ihf |O(t1 , x1 )|ii (13.29)
i,f
128
The sum over |f i is just the identity, so up to an overall factor of volume (which you
can think of as the momentum-conserving delta function δ(0)),
Z X
Γabs ∼ dtdxe−iωt e−βEi hi|O(t, x)O(0, 0)|ii . (13.30)
i
This thermal correlator was calculated in (13.24). To take the Fourier transform, use
the integral
2h
(2πT )2h−1 ±ω/2T
Z
−iωy h πT ω 2
dye (−1) = e Γ(h + i ) . (13.32)
sinh [πT (y ± i)] Γ(2h) 2πT
First take the Fourier transform assuming indepdendent left and right momenta
and then set ωL = ωR = ω. The absorption rate is given by the difference of absorption
and emission. These correspond to two different i prescriptions (Exercise: why?). So
finally
(2πT )2(h+h̄)−2 ω
Γ(h + i ω )Γ(h̄ + i ω )2
∼ 2 sinh (13.35)
Γ(2h)Γ(2h̄) 2T 4πT 4πT
h = h̄ = 1 (13.36)
and use the identity |Γ(1 + ix)|2 = πx/ sinh(πx). Why should the weight be (13.36)?
For now, we just pick them so the answer works out. In general the weights depend
on the mass and spin of the bulk field, and (13.36) is the correct choice for a massless
129
bulk field. We will treat this more systematically below.
13.4 Decoupling
In the gravity calculation, we assumed near-extremal but not exactly extremal. This
retained some coupling between the near-horizon degrees of freedom, and the fields
in the asymptotically flat far region. Similarly, in CFT, we assume a weak coupling
between gravity fields and CFT fields.
If we take TH → 0, the far region and near regions decouple. This is Maldacena’s
decoupling limit. In this limit we can completely drop the asymptotically flat part of
the calculation, and we are left with the (3d version of the) AdS/CFT correspondence:
CFTs are UV-complete, so this duality defines not only low-energy effective gravity,
but a UV-complete theory of gravity on AdS3 × S 3 .
From now on, we will just forget about the ‘far region.’ It is not needed, and is rarely
used in modern AdS/CFT.
130
14 The Statement of AdS/CFT
Choose coordinates
`2
ds2 = (dz 2 + dx2 ) (14.1)
z2
on Euclidean AdSd+1 , where x is a coordinate on Rd . The boundary is at z = 0.
This is called the GKPW dictionary.∗ The index i runs over all the light fields in
the bulk effective field theory, and correspondingly over all the low-dimension local
operators in CFT.
where the mass of the bulk scalar is related to the scaling dimension of the CFT
operator by r
2 d d2
m = ∆(d − ∆) , ∆= + + m2 `2 . (14.4)
2 4
∗
After hep-th/9802109 by Gubser, Klebanov and Polyakov and hep-th/9802150 by Witten. I highly
recommend reading Witten’s paper.
131
We will see below that (14.3) is the leading solution of the wave equation for a bulk
scalar of mass m.
Similar statements apply to all bulk fields, including the metric, though the boundary
condition and formula for the dimension is slightly modified for fields with spin. The
boundary conditions on the metric involve a choice of topology as well as the actual
metric, which is why we’ve indicated explicitly that Zgrav depends on the boundary
manifold ∂M .
δn
hO1 (x1 ) · · · On (xn )iCF T ∼ Zcf t [φ0 ] i . (14.5)
δφ10 (x1 ) · · · δφn0 (xn ) φ0 =0
The mapping
Each light field in gravity corresponds to a local operator in CFT. The spin of the
bulk field is equal to the spin of the CFT operator; the mass of the bulk field fixes the
scaling dimension of the CFT operator. Here are some examples:
Scalar: A bulk scalar field χ(z, x) is dual to a scalar operator in CFT. The boundary
value of χ acts as a source in CFT. This is exactly the relationship we used in our
derivation of the absorption cross section of the black string.
Graviton: Every theory of gravity has a massless spin-2 particle, the graviton gµν . This
is dual the stress tensor Tµν in CFT. This makes sense since every CFT has a stress
tensor. The fact that the graviton is massless corresponds to the fact that the CFT
stress tensor is conserved. It also fixes the scaling dimension to ∆T = d. We will see
this in more detail later.
Vector: If our theory of gravity has a spin-1 vector field Aµ , then the dual CFT has a
spin-1 operator Jµ . If Aµ is massless, then ∆J = d − 1 and Jµ is a conserved current.
Otherwise, ∆J > d − 1 and the current is not conserved.
132
This illustrated a general and important feature of AdS/CFT: gauge symmetries in the
bulk correspond to global symmetries in the CFT.
This is UV complete.
Note that CFTs are UV complete. Therefore (14.2) is a non-perturbative formulation of
a UV complete theory of quantum gravity. Shockingly, it is a definition of gravity from
a QFT without gravity. This is very powerful because we understand QFT relatively
well.
√
Z
1
g R + Lmatter + `4s R4 + · · ·
SIIB ∼ (14.6)
GN
The stringy states have masses of order 1/`2s , so at energies below 1/`2s it is just an
ordinary effective field theory like we discussed at the beginning of the course.
∗
There are no R2 corrections allowed with this amount of supersymmetry, but there are similar
examples with non-zero `2s R2 terms.
133
The CFT side
N = 4 Super-Yang-Mills is a highly supersymmetric gauge theory in 4d. Its matter
context is fixed uniquely by supersymmetry. It is just an SU (N ) gauge field plus all
the matter fields required by supersymmetry, which include matrix-valued scalar fields
transforming the adjoint representation of SU (N ) (unlike the fundamental representa-
tions we usually encounter in, say, QCD).
The gauge theory has two dimensionless parameters, N (ie the size of SU (N )) and the
Yang-Mills coupling constant gY M . Define the combination
λ = gY2 M N . (14.7)
This is called the ‘t Hooft coupling. It turns out that gauge theory at large N is most
naturally organized as an expansion in λ and 1/N , rather than gY M and 1/N . This
is roughly because there are N fields running in loops, which changes the expansion
parameter from gY2 M to λ.
The mapping
The mapping from string theory parameters to CFT parameters is
4
`AdS
λ∼ (14.8)
`string
and d−1
`d−1
`AdS
AdS
∼ ∼ N2 . (14.9)
GN `P
(with known coefficients). We will see where this particular scaling comes from below in
more generality. For now we just want to note that this is a strong/weak duality: when
one side is easy, the other is (usually) hard. For example to have semiclassical Einstein
gravity, both loops and higher curvature corrections must be suppressed on the gravity
side. This means N 1 and λ 1 so the CFT is very strongly coupled. On the other
hand if we consider a weakly coupled CFT, then `s `AdS so stringy/higher curvature
corrections are not suppressed on the gravity side and this presumably behaves nothing
like ordinary gravity. (This is related to so-called ‘higher spin gravity’ or ‘Vasiliev
gravity’.)
134
14.3 General requirements
Returning to AdS/CFT in general, we can make some similar observations about when
it produces a nice semiclassical theory of gravity. This requires as least two things:
1. Strongly coupled CFT. If the CFT is weakly coupled, then there are too many
operators. For example, a free scalar field ψ leads to conserved currents of every
integer spin:∗
ψ∂µ ψ, ψ∂µ ∂ν ψ, ψ∂µ ∂ν ∂ρ ψ, etc. (14.10)
On the gravity side, this would require lots of massless or very light high-spin
states. This is something we expect in string theory at high enough energies but
not in our low energy effective field theory.
So we must require that the CFT has a sparse spectrum of low-dimension op-
erators. This is sometimes called a large ‘gap’ in the spectrum, meaning a gap
between the low-energy fields and the stringy stuff. This can only happen at
strong coupling, although there can also be strongly coupled theories with no
gap which therefore do not have nice gravity duals.
135
cient:
hTµν (x)Tαβ (y)i = c × (known function of x, y) . (14.11)
Clear there is a tension between requirements (1) and (2). We want lots of degrees
of freedom, and lots of states at high energies, but very few states at low energies.
Roughly speaking, you can think of this as the requirement that the CFT is confining:
it has lots of states at high energies, but very few at low energies where quarks are
confined. Later we will see a very direct link between black hole thermodynamics and
confinement.
Many years ago Bekenstein conjectured that the maximum entropy you can fit into a
region of space is equal to the entropy of the corresponding black hole:
area
Smax = . (14.12)
4GN
This is called the Bekenstein bound. The argument is simple. If you have lots of stuff
in a region and Sstuf f > Sblackhole , then you can throw in some more stuff and form a
black hole. In doing so, the entropy of the system decreases! Therefore the second law
requires a bound like (14.12).†
This bound inspired ’t Hooft (in ’93) and later Susskind (in ’94) to argue that a theory
∗
But not an entirely satisfactory one. For example, it can increase under RG flow.
†
In the last few years this bound has been understood much better using entanglement entropy.
See for example 1404.5635 and references therein.
136
of quantum gravity must secretly live in fewer dimensions than our observed spacetime.
This principle is realized concretely by AdS/CFT.
137
15 Correlation Functions in AdS/CFT
As a first application of (14.2), we will use the gravity side to derive the correlators of
a conformal field theory. First we’ll start with a purely QFT discussion of correlators
in a theory with conformal invariance, then reproduce these results from gravity.
By the way, you’ve already seen one example of CFT correlators compute from gravity:
the absorption cross section calculation. That was related to a CFT correlator at finite
temperature. In this section are deriving correlators in the vacuum state, i.e., empty
AdS.
This will be a brief introduction to CFT. For details, see: Polchinski’s String Theory
book; Kiritsis’s String Theory book; or the big yellow CFT book by Di Francesco et
al.
Pµ = −i∂µ (15.1)
Lµν = −i(xµ ∂ν − xν ∂µ )
D = −ixµ ∂µ
Kµ = −i(2xµ xν ∂ν − x2 ∂µ )
The the first two lines are translations, rotations, and boosts; these generate the
Poincare group (which is 10-dimensional in d = 4). The 3rd line is the dilatation,
or scale generator, since under xµ → xµ + iDµ , the coordinate is just rescaled,
xµ → xµ (1 + ). The last line is called the special conformal transformation.
One way to derive (15.1) is to find the conformal Killing vectors of Minkowski space.
These are defined to be vectors V µ obeying
138
where f is any function. This is the infinitessimal version of the definition of a conformal
symmetry, which maps ds2 → eΩ(x) ds2 .
The dilatation eigenvalue ∆ is called the scaling dimension of O. The second condition
is like a highest weight condition. We can build the full representation by acting on
O(x) with the conformal generators, so for example ∂m uO(x) is a descendant operator.
The finite version of of the D commutator says that under a rescaling x → λx, we have
O(x) → λ∆ O(λx). More generally, primaries obey
0 µ −∆/d
0 0
∂x
O (x ) = det ν
O(x) . (15.5)
∂x
hO1 (λx1 ) · · · On (λxn )i = λ−∆1 −∆2 −···−∆n hO1 (x1 ) · · · On (xn )i . (15.6)
139
Similarly, the only 3-point function allowed by conformal invariance is
c123
hO1 (x1 )O2 (x2 )O3 (x3 )i = , (15.8)
|x12 |∆1 +∆2 −∆3 |x 23 | 2 +∆3 −∆1 |x
∆
31 |
∆3 +∆1 −∆2
where
xij ≡ xi − xj . (15.9)
The number cijk , called an OPE coefficient (for operator product expansion), is a
real physical prediction of the theory, since we’ve already fixed normalizations via the
2-point function.
In fact, the set of scaling dimensions ∆i and the OPE coefficients cijk are all the data
of a CFT. This is because higher correlators can be computed, at least in principle, by
sewing together 3-point functions and summing over intermediate states.
The 4-point function is not completely fixed by conformal symmetry, but it is highly
constrained. With equal external weights ∆1,2,3,4 = ∆, the most general form of the
4-point correlator is
hO(x1 )O(x2 )O(x3 )O(x4 )i = |x12 |−2∆ |x34 |−2∆ F (u, v) (15.10)
140
where Sgrav is the on-shell action for gravity subject to the boundary condition
This material is explained very clearly in many places, so I will not repeat it here. I
recommend reading Witten’s original paper on the subject, where ‘Witten diagrams’
were introduced [hep-th/9802150]. In class, I followed, almost exactly Kiritsis’s String
Theory in a Nutshell sections 13.8.1 and 13.8.2. Read those sections before continuing!
So far we only used the classical theory on the gravity side. (Though on the CFT side,
this is a strongly coupled QFT calculation which is not at all classical!) What happens
when we include loop corrections in the gravity? The gravitational loop expansion is
organized into powers of GN . The classical term is ∼ 1/GN . If we compute Witten
diagrams with loops, then we find an expansion in GN .
On the CFT side, this is an expansion in 1/Ndof , since recall the dictionary `d−1 /GN ∼
Ndof .
This implies something very special about CFTs with a semiclassical holographic dual:
These CFTs, although strongly coupled, have a meaningful expansion in 1/Ndof . Defin-
ing Ndof = N 2 (since this notation holds in SU (N ) gauge theory), this can be restated
as the fact that connected correlation functions are suppressed. That is, if we normalize
our operators by setting
hOOi ∼ 1 (15.15)
(with the appropriate factors of x suppressed), then the 3-point function is suppressed,
1
hOOOi ∼ (15.16)
N
141
and higher-point functions are dominated by their connected piece:
The explicit theories we know of with this sort of behavior are large-N gauge theories.
These have been studied for a long time, starting with a beautiful paper by ‘t Hooft
in the 70s where he showed that the Feynman diagrams of SU (N ) gauge theory in the
large-N limit naturally reorganize themselves into something that looks roughly like
a string theory. I will not cover this, but I highly recommend you read about it in
section 13.1 of Kiritsis, or the big AdS/CFT review [hep-th/9905111].
Another consequence of the weak coupling constant GN on the gravity side is that
gravity has an approximate Fock space. That is, if we have a weakly coupled scalar
field on the gravity side, then we can construct 1-particle states, 2-particles states, etc,
by acting with creation operators. On the CFT side, this means for example that if we
have a primary O1 of dimension ∆1 , and another primary O2 of dimension ∆2 , then
there is a third primary O1+2 of dimension
This is very special; it does not happen in general CFT, where states are just a some
strongly coupled mess and there is no way to ‘add’ some stuff to other stuff without
getting large corrections to the conserved charges from the strong interactions.
Following the gauge theory language, the operators dual to single bulk fields are called
‘single-trace operators’, and the operators like O1+2 are called ‘multi-trace opera-
tors’ and usually just denoted by the product O1 O2 (or more complicated things like
O1 n ∂µ1 ···µ` O2 ).
In words, (15.18) says that in a CFTs with a semiclassical holographic dual, low di-
mension operators have ‘small anomalous dimensions.’ I’ve restricted this statement
to low-dimension operators because these are the operators dual to bulk fields; high
dimension operators are dual to non-perturbative stuff like black hole microstates.
142
16 Black hole thermodynamics in AdS5
Now we return to black holes, and some of the techniques introduced in the beginning
of the course.
The basic starting point is that thermal states in CFT are dual to black holes in
quantum gravity. In fact, this is a special case of the dictionary (14.2), where we
impose boundary conditions appropriate for thermal field theory. That is,
Here Σd−1 is space. We will mostly set Σd−1 = S`3 , a 3-sphere of size `. And Sβ1 is
a circle of size β. As we saw earlier in the course, the Euclidean path integral on
Σd−1 × Sβ1 defines the finite-temperature state on Σd−1 .
The meaning of the notation in (16.1) is that we calculate the gravity partition with
boundary condition φ0 on bulk fields, and boundary condition M on the bulk manifold.
Explicitly, fields obey the usual fall-off φ ∼ r−d+∆ φ0 (x) as r → ∞, and the metric itself
obeys the boundary condition
r2 2 `2 2
ds2 → dt + dr + r2 dΩ23 , tE ∼ tE + β . (16.3)
`2 E r2
Our goal is to compute the free energy at temperature β. For this we can turn off all
fields besides the metric, so φ0 = 0, and we just have the relation
We will compute the rhs in gravity, and interpret it in CFT. The result will exhibit
rich behavior, including phase transitions as a function of temperature. This will turn
out to be related to confinement/deconfinement in the CFT.
143
16.1 Gravitational Free Energy
To compute Zgrav [β], in principle, we should compute the quantum gravity path integral
subject to the boundary condition (16.3). That’s impossible, but in the semiclassical
limit we can evaluate it approximately by expanding around classical solutions of the
equations of motion. We need to find all of the classical solutions that obey this
boundary condition, and evaluate their on-shell actions using
√
Z
1 5 12
IE = − d x g R+ 2 . (16.5)
16πGN `
If there are several solutions, then the semiclassical approximation to the path integral
is
(1) (2)
Zgrav (β) ≈ e−IE + e−IE + · · · (16.6)
Each saddlpoint also comes with an infinite series of perturbative (loop) corrections
but we won’t worry about those, we will just evaluate the classical contributions.
There are three classical solutions (in pure gravity) obey the thermal boundary condi-
tion (16.3): small black holes, large black holes, and thermal AdS.
16.1.1 Schwarzschild-AdS
The Euclidean black hole satisfying the boundary condition (16.3) is called Schwarzschild-
AdS, with metric
dr2 r2 µ
ds2 = f dt2E + + r2 dΩ23 , f =1+ 2
− 2 , (16.7)
f ` r
µ is a constant that will be related to the mass. The explicit metric on the unit 3-sphere
is
dΩ23 = dψ 2 + sin2 ψ(dθ2 + sin2 θdφ2 ) , (16.9)
144
where ψ and θ run from 0 to π and φ ∈ [0, 2π). The horizon is the outermost solution
of the equation f (r+ ) = 0, which gives
r !
2 `2 4µ
r+ = −1 + 1+ 2 . (16.10)
2 `
2π`2 r+
β= 2 , (16.11)
2r+ + `2
ie " r #
π`2 2β 2
r+ = 1± 1− 2 2 . (16.12)
2β π `
`π
βmax = √ . (16.13)
2
Second, for any given temperature β, there are two different black holes, corresponding
to the sign choice in (16.12). Call these the ‘small’ (minus sign) and ‘’large’ (plus sign)
black holes. The turnover is at
√
r∗ = `/ 2 (16.14)
so each β allows a small black hole with r+ < r∗ and a large black hole with r+ > r∗ .
We are working with thermal boundary conditions that fix the temperature β. So in
field theory language, we are working in the canonical ensemble. Therefore we should
sum over the allowed solutions; the thermodynamics will be determined by whichever
has the lower free energy. We will see below that the large black hole always has lower
free energy than the small black hole (but there is also a third solution, so the large
black hole is not always dominate).
The behavior (16.12) is quite different from flat spacetime. In flat spacetime, larger
black holes always have smaller temperature; this means ordinary Schwarzschild has
145
negative specific heat and so is thermodynamically unstable. (It cannot be held in
equilibrium with a bath, because it will absorb radiation from the bath and get colder
the more radiation it absorbs!)
On the other hand in AdS, according to (16.12), large black holes have positive specific
heat. If you make them bigger (higher energy), they get hotter. Small black holes
have negative specific heat, and very small black holes r+ ` don’t care about the
cosmological constant at all so are just like the flat spacetime Schwarzschild solution.
On-shell action
The free energy F = − β1 log Z is computed using the on-shell Einstein action. We must
be careful about the boundary terms. The full action we will use is
√ √ √
Z Z Z
1 5 12 1 4
IE = − d x g(R + 2 ) + d x γK + d4 x γLct [γ] .
16πGN ` 8πGN r=1/ r=1/
(16.15)
The first term is the usual Einstein term. The second term is the Gibbons-Hawking
boundary term. The last term is a counterterm; it can be any function of the intrinsic
boundary geometry γµν and will be picked to cancel divergences. Note that we are
cutting off the spacetime at r = 1/, we will take → 0 at the end.
Bulk term
The Einstein equation in empty spacetime implies R = −20/`2 . Thus the bulk term
in (16.15) is
√
Z
1
Ibulk = d5 x g (16.16)
2πGN `2 r<1/
Z β Z 1/ Z
1 3
= dtE drr dΩ3 (16.17)
2πGN `2 0 r+
πβ 1 4
= − r+ . (16.18)
4GN `2 4
Boundary terms
To compute the Gibbons-Hawking-York boundary term we just plug into the definition
146
of extrinsic curvature, and eventually find
1 3`2 µ`2
πβ
IGHY =− − . (16.19)
GN `2 4 + 42 2
So far our answer Ibulk + IGHY is divergent as → 0. To fix this we need to add a
boundary counterterm which is a functional of γ. It turns out the only choice that
makes everything finite is
`2
√
Z
3 4
Ict = d x γ 1 + R[γ] (16.20)
8πGN ` r=1/ 12
Total
Plugging into the counterterm action, evaluating it, and adding everything up we find
(as → 0)
π2β 3`4
2 2 4
IE = Ibulk + IGHY + Ict = r ` − r+ + . (16.21)
8GN `2 + 4
This is finite, by design. This should be viewed as a function of temperature IE = IE (β),
so we should plug in for r+ (β) using (16.12). If we pick the plus sign we get the action
of the large black hole, and if we pick the minus sign we get the action of the small
black hole. You can easily check that
small large
IE (r+ (β)) ≥ IE (r+ (β)) . (16.22)
Thus the dominant solution, with lower free energy, is the larger black hole, at any β.
3π 2 `2
E = −∂β log Z = µ+ . (16.23)
8GN 4
The first term is the relation between mass and µ. The second term is a little surprising,
since it is independent of the mass! It can be interpreted as a Casimir energy induced
147
by putting the theory on S 3 . Had we chosen boundary conditions R3 × Sβ1 , this term
would be zero.
π 2 r+
3
S = (1 − β∂β ) log Z = (16.24)
2GN
which you can check agrees with the area law S = area/4GN .
We’re not done: there is a third solution with thermal boundary conditions (16.3). It
is called Euclidean thermal AdS and the metric is simply
r2 dr2
2
ds = 1+ 2 dt2E + 2 2
2 + r dΩ3 , (16.25)
` 1 + r`2
This is just the metric of empty Euclidean AdS, except that we have identified the
Euclidean time circle. Note that β is not related to any parameter in the metric itself,
since there is no horizon r+ in this metric. Therefore β is not fixed by any regularity
condition, it is just a free parameter in this solution.
Lorentzian interpretation
We are discussing Euclidean manifolds, so let’s pause to comment on the Lorentzian
interpretation of all this. As discussed earlier in the course, path integrals of quantum
fields on a Euclidean manifold with tE ∼ tE +β prepare the fields in a thermal state. For
both of the Euclidean manifolds discussed here — the Euclidean black hole and thermal
AdS — the path integral on the Euclidean manifold prepares a thermal state on the
Lorentzian manifold. That is, the Euclidean path integral on Euclidean Schwarzschild-
AdS prepares the Hartle-Hawking thermal state for fields on the the Lorentzian black
hole. The Euclidean path integral on Euclidean thermal AdS prepares fields in a
thermal state on ordinary Lorentzian AdS.
148
So in other words: In Lorentzian signature, thermal AdS is exactly the same classical
solution as empty AdS, but the state of the perturbative fields is different — they are
thermally populated, but their energy is small O(~) and does not backreact on the
geometry itself.
In thermal AdS, tE is a circle, but the circle does not contract anywhere. There is no
origin. So in this geometry the coordinates (r, tE ) make a cylinder rather than a disk.
Action
The calculation of the on-shell action is similar to what we did for the black hole.
Skipping the details, the answer in the end is just the Casimir term:
π2β 3`4
(th)
IE = . (16.27)
8GN `2 4
This is the free energy, which we can use to calculate the energy and entropy.
We found three Euclidean geometries obeying the thermal boundary condition (16.3).
They are the small black hole, large black hole, and thermal AdS. The free energy of
the large black hole is always smaller than that of the small black, so in understanding
the phases, we can forget about the small black hole – it never dominates the canonical
ensemble.
This leaves the large black hole and thermal AdS with actions
π2β 3`4
(bh) 2 2 4
IE (β) = r ` − r+ + (16.28)
8GN `2 + 4
2
4
(th) π β 3`
IE (β) = 2
,
8GN ` 4
149
(bh)
where in IE we choose the larger root for r+ (β).
(bh) (th)
Zgrav (β) ≈ exp(−IE ) + exp(−IE ) + . . . . (16.29)
Each of the exponents is very large, since they are order 1/GN . Therefore the sum is
exponentially dominated by whichever term is bigger:
(bh) (th)
log Zgrav (β) ≈ max −IE , −IE . (16.30)
There is a sharp (1st order) phase transition where the two solutions exchange dom-
(th) (bh)
inance, ie at IE = IE . Comparing the two actions, the critical temperature, and
corresponding black hole radius, for this phase transition is
2π` crit
βcrit = , r+ =`. (16.31)
3
The low-temperature phase is thermal AdS; the high temperature phase is the black
hole. This phase transition is called the Hawking-Page transition and was discovered
well before AdS/CFT. The story is qualitatively the same in any number of dimensions,
AdSd+1 (with a few differences in AdS3 ).
Entropy
The entropy of thermal AdS is zero. We can see this either by noting there is no
horizon, or computing (1 − β∂β ) log Z = 0. Actually, it is not exactly zero, since we
have only computed the semiclassical term. There are quantum corrections, and the
true entropy is O(G0N ) from the one-loop contribution (i.e., determinant of gravitons
matter fields in AdS).
Thus full thermal entropy S(β), accounting for the phase transition, is O(G0N ) at low
temperatures and then suddenly jumps to a very large number O(1/GN ) at βcrit . In
the microcanonical ensemble, where we view this as a function of energy S(E), the
entropy is related to the density of states
150
The Hawking-Page transition indicates that theories with a semiclassical gravity de-
scription must have a small number of states at low energy, but an enormous number
of states at high energy, with a sharp transition.
We have been computing the free energy at temperature β for the theory on the space
S`3 , a 3-sphere of size `. In fact since we are in conformal field theory, only the ratio
`/β is meaningful, as this is the only dimensionful parameter. In other words the only
parameter is `T . Going to high temperatures is therefore the same as going to large `.
If we are interested in the theory on R3 we can take ` → ∞. This is the same as taking
the temperature T → ∞. In this limit, from (16.28), the free energy becomes (with
IE = βF )
`2
F ≈− `3 π 6 T 4 , (16.33)
GN
i.e., 3
`
F ∼− V T4 (16.34)
`P
where V is the volume of the system. Up to the prefactor, we could have guessed
this answer from dimensional analysis. In a conformal field theory on Rd−1 the only
dimensionful scale is the temperature, and F must be proportional to volume, so
conformal invariance implies F ∼ V T d .
Note that the theory on the plane has only one phase: the black hole phase. There
is no Hawking-Page transition on R3 . It is essentially always in the high temperature
phase.
Any CFT with a semiclassical holographic dual must share the same thermodynamics,
summarized by (16.30). What does this mean about the CFT? The microcanonical
entropy tells us about the spectrum: we must have an enormous number of degrees of
freedom to reproduce the high-energy density of states. However we must have a small
151
number of states at low energies. This sounds like confinement! In a confining SU (N )
gauge theory, in the confining phase, the physical states are color singlet hadrons, and
the free energy is F = O(1). In a deconfined phase, the states are gluons, and the free
energy is F ∼ O(N 2 ). This agrees with a our results above (after subtracting off the
contribution from the Casimir energy, which is a temperature-independent contribution
to F and does not affect the entropy). The black hole phase is like the deconfined phase,
and the confined phase is like thermal AdS.
This analogy comes with some caveats, so let’s compare and contrast QCD with
a holographic theory like N = 4 Super-Yang-Mills. In QCD, there is a confine-
ment/deconfinement phase transition in infinite volume, ie for the theory on R3 . It is
confining at low temperatures and deconfined at high temperatures. According to our
gravity results, N = 4 Super-Yang-Mills (at strong coupling) does not have a confining
phase on R3 . CFT’s on R3 cannot have phase transitions, because the temperature
can always be rescaled (unless there is some other parameter turned on, like a chemical
potential). So in fact N = 4 SYM is not a confining gauge theory in the same sense
as QCD.
152
transition:
(hW i =
6 0 actually breaks a symmetry: the center of the gauge group, ZN ∈ SU (N ).)
A rough explanation is that you can think of the temporal Wilson loop as a free quark.
If a free quark has finite energy, then you get hW i =
6 0, but if the free quark has infinite
energy then hW i = 0.
To compute a Wilson loop in AdS/CFT from the gravity side, the rule is to find a
string worldsheet ending on the Wilson line and extending into the bulk. This classical
string diagram computes the leading contribution to the Wilson loop at large N .
In the Euclidean black hole, since (tE , r) make a disk, it is easy to find a string world-
sheet ending on this Wilson line. In thermal AdS, however, since the thermal circle is
not contractible — ie (tE , r) make a cylinder — you cannot find such a string world-
sheet, and the Wilson line vanishes.
Thus the deconfined phase is the phase with a contractible thermal circle in the dual
geometry, and the confined phase has a non-contractible thermal circle.
So can we calculate the free energy of N = 4 SYM, and compare to (16.30)? Unfor-
tunately, no. The gravity calculation is dual to N = 4 SYM at very strong ‘t Hooft
coupling, λ ≡ gY2 M N → ∞. The free energy is not protected by supersymmetry, and
it is unknown how to do this calculate in gauge theory at strong coupling.
But we can do the CFT calculation at weak coupling. The free energy of a weakly
coupled QFT is just a 1-loop calculation, ie a determinant for each of the fields. This
calculation has been done. It agrees qualitatively, but not quantitatively, with the
gravity calculation. For example, after translating all the parameters of CFT to gravity
153
parameters, the free energy of free N = 4 SYM on R3 is
4
Ff ree = Fgravity , (16.37)
3
where Fgravity is given in (16.33). This famous factor of 4/3 is not a contradiction. It
just means that the free energy at strong coupling is different from the free energy at
weak coupling.
In principle, or perhaps on a lattice, the free energy is some function of the coupling,
π2 2
F = −f (λ) N V T4 . (16.38)
6
3 45 ζ(3)
gravity: f (λ) = + + ··· as λ → ∞ (16.39)
4 32 λ3/2
and
3
CFT: f (λ) = 1 − λ + ··· as λ → 0 (16.40)
2π 2
Evidently the corrections are heading the right direction, but the full function f (λ) is
unknown.
dr2
2 2 2
ds = ` (r − 8M )dt2E + 2 + r2 dφ2 . (16.41)
r − 8M
154
In a previous exercise, you computed the on-shell Euclidean action of this black hole.
The answer, including all boundary terms and counterterms, is
(bh) π2c
SE (β) = − (16.42)
3β
where
3`
c= . (16.43)
2GN
1. Like in AdS5 , there is also a thermal AdS solution with the same boundary
condition.∗ We will use a trick to compute its action. The trick is to note
1
that (16.41) is a solid torus with boundary S2π` × Sβ1 . (The subscript is the
circumference of the S 1 ). The thermal circle Sβ1 is ‘filled in’ to make the solid
torus.
1
Thermal AdS3 is a solid torus where instead the other circle S2π` is ‘filled in’ to
make solid torus. Argue that this implies
(th) (bh) 4π 2 `2 cβ
SE (β) = SE ( )=− . (16.44)
β 12
2. Sketch a plot of the free energies F (bh) and F (th) . Find the critical temperature
βcrit of the Hawking-Page phase transition, and write log Z(β) as a piecewise
function.
5. Use part (4) in part (3) to find the entropy in the microcanonical ensemble,
S(E). (Be careful about what ranges of E your formulas apply to; in particular
you cannot find S(E) for all E by this method.)
∗
Unlike AdS5 , there is only one black hole with temperature β.
†
i.e., not continuously connected to the identity
155
6. Interpret your results in terms of the density of states in a 2d CFT dual to 3d
gravity.
156
17 Eternal Black Holes and Entanglement
References: This section is based mostly on Maldacena hep-th/0106112; see also the
relevant section of Harlow’s review lectures, 1409.1231.
An eternal black hole is the black hole with the full, two-sided Penrose diagram. It
has a past singularity, a future singularity, and two asymptotic regions:
(17.1)
This is to be distinguished from a black hole that forms from gravitational collapse,
which has no past singularity and no second asymptotic region on the ‘left’ of the
Penrose diagram. Although we often use the maximally extended Penrose diagram to
discuss all sorts of black holes, it is only in the eternal black hole that we should really
take the left side of the Penrose diagram seriously.
Consider any QFT, with Hamiltonian H and complete set of eigenstate |ni,
157
The thermofield double formalism is a trick to treat the thermal, mixed state ρ = e−βH
as a pure state in a bigger system. First we double the degrees of freedom, i.e., we
consider a new QFT which is two copies of the original QFT. If the theory is defined
by a Lagrangian, then for every field φ in the original QFT, there are two fields φ1 (x1 )
and φ2 (x2 ) in the doubled QFT. These two fields live in different spacetimes x1 and
x2 , and are not coupled in the Lagrangian at all. The states of the doubled QFT are
1 X
|T F Di = p e−βEn /2 |ni1 |ni2 . (17.4)
Z(β) n
This is a particular pure state in the doubled system. The density matrix of the doubled
QFT in this state is
ρtotal = |T F DihT F D| . (17.5)
ρ1 = tr2 ρtotal
!
X X
= 2 hm| e−βEn /2 |ni1 |ni2 2 hn0 | 2 hn0 |e−βEn0 /2 |mi2
m n,n0
X
= e−βEn |ni1 1 hn|
n
−βH1
= e (17.6)
Therefore, if we restrict our attention to system 1, this pure state in the doubled system
is indistinguishable from a thermal state. For example, if O1 is made of local operators
acting on system 1, O1 = φ1 (x1 )χ1 (y1 ) · · · , then
1
hT F D|O1 |T F Di = Tr H1 e−βH1 O1 . (17.7)
Z(β)
This procedure is called purifying the thermal state. In fact, any mixed state can be
purified by adding enough auxiliary states and tracing them out.
Although systems 1 and 2 are not coupled in the Lagrangian of the doubled system,
158
they are correlated because we are in this particular entangled state. For example, if
O1 is built from operators acting on system 1 and O2 is built from operators acting on
system 2, then
hT F D|O1 O2 |T F Di (17.8)
can be non-zero.
The Hamiltonian
The choice of Hamiltonian acting on the doubled system is up to us. Two convenient
choices are
Htot = H1 − H2 and H̃tot = H1 + H2 . (17.9)
For our purposes, we will just use Htot , but H̃tot is also useful in other contexts. Under
Htot , the TFD state is time-independent, since the phases cancel:
X
|T F D(t)i ≡ e−iHtot |T F Di = e−βEn /2 e−i(H1 −H2 ) |ni1 |ni2 = |T F Di . (17.10)
n
The statement
Maldacena’s proposal is that the eternal black hole depicted in (17.1) is dual to two
copies of the CFT, in the thermofield double state |T F Di. Each asymptotic boundary
of AdS is a copy of the original dual CFT. So, for example, to compute correlation
functions like
hT F D|φ1 (x1 )χ2 (x2 )|T F Di (17.11)
we would use Witten diagrams with χ inserted on the left boundary, and φ inserted on
the right boundary. Note that the local bulk fields are not doubled: there is just one
bulk field Φ dual to the boundary operators φ1 and φ2 , but this makes sense because
we have to specify double the boundary conditions for Φ. The boundary condition for
Φ on the left acts like a source for φ2 , and the boundary condition for Φ on the right
acts like a source for φ1 .
159
The Hamiltonian
The Hamiltonian Htot in (17.9) has a natural bulk interpretation. It is dual to the
bulk Hamiltonian that generates time evolution along the isometry ∂t , where t is the
usual Schwarzschild coordinate. Recall (or look back at a textbook on the Kruskal
coordinate change) that the Schwarzschild t coordinate runs ‘backwards’ on the left
side of the Penrose diagram. That is, all of the spatial slices drawn in this figure are
equivalent under the ∂t isometry:
(17.12)
Derivation
To justify the claim that the eternal black hole is dual to the TFD state, we will apply
the AdS/CFT dictionary (14.2), in the form
(Here M is the bulk manifold, and the meaning of the lhs is the gravity path integral
with boundary condition ∂M = Σ.)
First, the CFT: The Euclidean path integral that prepares the TFD state is a path
integral on an interval of length β/2, times a circle:
160
Pictorially,
(17.15)
This path integral has two open cuts (red), at the ends of the interval. We interpret the
left cut as defining a state in system 2, and the right cut as defining a state in system
1. That is, this picture should be interpreted as a rule for computing the transition
amplitude with field data ϕ1 and ϕ2 specified at the ends of the interval. To confirm
that this path integral really prepares the TFD state, all we need to do is check that
it computes the correct transition amplitudes. The path integral with these boundary
conditions is∗
These are precisely the matrix elements of the state |T F Di defined in (17.4). So, as
claimed, this is the Euclidean path integral that prepares |T F Di.
Now that we’ve produced this state from a Euclidean path integral on the manifold
Σ, we can apply (17.13). We must find a Euclidean gravity solution with conformal
boundary condition ∂M = Intervalβ/2 × S d−1 . In fact, half of the Euclidean black hole
has precisely this boundary condition. That is, we consider the Euclidean Scharzschild-
AdS solution and restrict to tE ∈ [0, β/2] instead of the full range tE ∈ [0, β]. The
(tE , r) portion of this Eulidean spacetime makes a half-disk; the boundary of the half-
disk is Intervalβ/2 × S d−1 . The half-disk is cut down the middle; this cut is interpreted
as the time=0 surface of the Lorentzian spacetime. Pictorially, the bulk spacetime has
a Euclidean piece that prepares the state, then a Lorentzian piece describing the time
∗
The tildes indicate the conjugate state.
161
evolution in Minkowski signature:
(17.19)
The red blobs in this picture denote the S d−1 ’s at the end of the interval on the
boundary, the red circles in (17.15).
17.3 ER=EPR
Let’s describe this result in words. The left side of the Penrose diagram is dual to
CF T2 , and the right side is dual to CF T1 . The Einstein-Rosen bridge connecting
to the two sides, the black hole interior itself, are somehow ‘created’ by entangling
CF T1 with CF T2 . In fact this is a precise statement: In CFT language, correlators
between the two CFTs like hT F D|O1 O2 |T F Di are non-zero only because we are in
the entangled state |T F Di. After all, the two CFTs are not coupled. In the bulk,
these correlations are nonzero because we can draw Witten diagrams going through
the interior. For very massive fields or high energies, the left-right correlation functions
can be approximated by geodesics that pass through the black hole interior. Without
a wormhole connecting the two sides, there would be no such correlations.
This idea and its generalizations have recently been given the slogan ‘ER=EPR’ (by
Maldacena and Susskind): Einstein-Rosen bridges are equivalent to entanglement (as
discussed by Einstein-Podolsky-Rosen). This slogan is only entirely precise and well
defined in the semiclassical limit, describing the eternal black hole and similar space-
times, but the idea is that some more general construction should make sense in the
very quantum, non-geometrical limit.
162
17.4 Comments in information loss in AdS/CFT
Hawking’s information loss paradox relied on a black hole that forms from collapse,
then evaporates. In AdS, this only happens for small black holes. These black holes
are not in thermal equilibrium, and are difficult to address precisely using AdS/CFT.
Of course, the CFT is always unitary, so if we believe AdS/CFT (or use AdS/CFT to
define a theory of quantum gravity) then obviously this evaporation process, however
it is described in CFT, must be unitary. This strongly suggests that unitarity should
be preserved, and locality or some other tenet of effective field theory must be violated.
However it is not very satisfying, since it does not answer the question of what went
wrong with Hawking’s calculation. Presumably the answer is that local effective field
theory is not quite right in non-perturbative quantum gravity, but we do not really
understand how to characterize this breakdown. This is a very important open question
in current research.
|T F Di → |T]
F Di = (1 + O2 )|T F Di . (17.20)
hT]
F D|O1 |T]
F Di (17.22)
in the perturbed state. To first order in the perturbation, this is the two-sided corre-
lation function
hO1 i ∼ G12 ≡ hT F D|O1 O2 |T F Di . (17.23)
163
Now we can produce a contradiction by waiting a very long time, so this correlation
function decays. On the gravity side, if we hold O2 at a particular time and send O1
to very late times, then the geodesic distance between these two points grows linearly
with time, forever. Therefore the correlation function must decay as
Ggravity
12 ∼ e−const×t/β (17.24)
for t β. This decays exponentially to zero. At very late times, it therefore becomes
exactly thermal, with arbitrarily small corrections.
This contradicts unitarity of the CFT. In the CFT, any perturbation of the thermal
state should stay forever a perturbation of the thermal state: it will of course become
scrambled and appear to thermalize, but it should never forget the initial perturbation
completely, so it should never become arbitrarily close to the thermal state. In fact
the corrections to the thermal state should be suppressed by the entropy, but finite:
GCF
12
T
∼ e−const×S (17.25)
for t β. In summary, at very late times, gravity ‘forget’ the initial perturbation, but
a unitary CFT does not:
Ggravity
12 Gunitary
12 . (17.26)
However is this paradox resolved? The answer is that we have neglected non-perturbative
contributions of the gravity side of order e−1/GN ∼ e−S . For example, there is another
saddlepoint (the thermal AdS saddle) and fluctuations around this saddle will also
contribution to the two-sided correlation function at this order.
Although this tells us where the gravity derivation went wrong, it does not tell us
exactly how to recover the lost information in quantum gravity, i.e., without referring
to the dual CFT. Presumably this would require treating the full non-perturbative
string theory, which is currently not possible.
164
17.6 Entropy in the thermofield double
Our next topic will be entanglement. It will be a while before we get back to gravity,
so as a brief preview, let’s consider the interpretation of entropy in the thermofield
double formulation of the black hole. The state |T F Di is a pure state; pure states
have no ordinary entropy, i.e.,
has entropy
Stot = − tr ρtotal log ρtotal = 0 . (17.28)
However, if we trace out half the system, we know this gives a thermal state, so it
should have some entropy. The reduced density matrix of system 1 is
We started in a pure state, with no entropy. Where did the entropy of system 1
come from? The answer is entanglement. Thermal entropy is just one special case of
entanglement entropy.
165
18 Introduction to Entanglement Entropy
The next few lectures are on entanglement entropy in quantum mechanics, in quantum
field theory, and finally in quantum gravity. Here’s a brief preview: Entanglement
entropy is a measure of how quantum information is stored in a quantum state. With
some care, it can be defined in quantum field theory, and although it is difficult to
calculate, it can be used to gain insight into fundamental questions like the nature of
the renormalization group. In holographic systems, entanglement entropy is encoded
in geometric features of the bulk geometry.
We will start at the beginning with discrete quantum systems and work our way up to
quantum gravity.
A bipartite system is a system with Hilbert space equal to the direct product of two
factors,
HAB = HA ⊗ HB . (18.1)
Starting with a general (pure or mixed) state of the full system ρ, the reduced density
matrix of a subsystem is defined by the partial trace,
ρA = trB ρ (18.2)
and the entanglement entropy is the von Neumann entropy of the reduced density
matrix,
SA ≡ − tr ρA log ρA . (18.3)
166
spanned by
|00i, |01i, |10i, |11i , (18.4)
where the first bit refers to A and the second bit to B, i.e., we use the shorthand
1
|ψi = √ (|00i + |11i) , (18.6)
2
ρA = trB ρ
1
= B h0| (|00i + |11i) (h11| + h00|) |0iB
2
1
+ B h1| (|00i + |11i) (h11| + h00|) |1iB
2
1
= |0iA A h0| + |1iA A h1|
2
∝ 12x2 . (18.7)
The last line says ρA is proportional to the identity matrix of a 2-state system. In this
case we say ρA is maximally mixed, and the initial state |ψi is maximally entangled.
SA = − tr ρA log ρA
1 1
= −2 × log
4 4
= log 2 . (18.8)
167
words:
Rephrased slightly:
Given a state ρA with entanglement entropy SA , the quantity eSA is the minimal number
of auxiliary states that we would need to entangle with A in order to obtain ρA from a
pure state of the enlarged system.
Schmidt decomposition
A very useful tool is the following theorem, called the Schmidt decomposition: Suppose
we have a system AB in a pure state |ψi. Then there exist orthonormal states |iiA of
A and |iiB of B such that
X
|ψi = λi |iiA |ĩiB , (18.9)
i
The number of terms in the sum is (at most) the dimension of the smaller Hilbert space
HA or HB .
If A is small and B is big, this is intuitive. It says we can pick a basis for |iiA , and each
of these states will be correlated with a particular state of system B. The thermofield
double is an obvious example.
Complement subsystems
An immediate consequence of the Schmidt decomposition is that a pure state of system
168
AB has
SA = SB (pure states) . (18.11)
To see this, write the reduced density matrices in the Schmidt basis,
X X
ρA = λ2i |iiA A hi| , ρB = λ2i |iiB B hi| . (18.12)
i i
Both density matrices have eigenvalues λ2i so they have the same entropy. (18.11) does
not hold for mixed states of AB.
Entanglement entropy can be defined whenever the Hilbert space splits into two factors.
A very important example is when we define A as a subregion of space.
where si = 0 or 1 (meaning ‘up’ or ‘down’), and the c’s are complex numbers.
169
√
and ∼ 1/ 2k for a complete basis of states |iiA . In fact this is a theorem, see Harlow’s
lectures for the exact statement.
In other words, most states in the Hilbert space of the full system have entanglement
scaling with volume.
However, often we are interested in the groundstate. Ground states of a local Hamilto-
nian are very non-generic, and the corresponding entanglement entropies obey special
scaling laws. Usually, if the system is gapped (i.e., correlations die off exponentially),
the ground state must obey the area law :
Thus groundstates occupy a tiny, special corner of the Hilbert space. This is a corner
with especially low ‘complexity.’ Intuitively speaking, a large degree of entnaglement
is what makes quantum information exponentially more powerful than classical infor-
mation; so states with lower entanglement entropy are less complex. More specifically,
this actually means that you can encode a groundstate wavefunction with far fewer
parameters than the 2N complex numbers appearing in (18.13).
DMRG
In 1+1d, the area law becomes simply
SA ∼ const (18.16)
independent of the system size. This special feature is responsible for a hugely impor-
tant technique in quantum condensed matter called the density matrix renormalization
group (DMRG). This technique is used to efficiently compute groundstate wavefunc-
170
tions of 1+1d systems using a computer. This would not be possible for general states,
since (we think) classical computers require exponential time to simulate quantum sys-
tems. But (18.16) means that, in a precise sense, groundstates of gapped 1d systems
are no more complex than classical systems.
SA ∼ log LA (18.17)
where LA is the size of region A. This is bigger than the area law, but still much lower
than the volume-scaling of a random state.
Relative entropy
Much of the recent progress in QFT based on entanglement comes from a few inequal-
ities obeyed by entanglement entropy. Define the relative entropy
S(ρ||σ) ≥ 0 (18.19)
171
where pi is non-negative and vi is a basis vector. This is necessary for quantum me-
chanics to have a sensible probabilistic interpretation and is closely related to unitarity.
Triangle inequality
Positivity or relative entropy implies the triangle inequality,
Mutual information
Define the mutual information,
Roughly, I(A, B) measures the amount of information that A has about B (or vice-
versa, since it is symmetric).
In a pure state of AB, the only correlations between A and B come from entanglement,
so in this case I(A, B) measures entanglement between A and B. However, in a mixed
state, I(A, B) also gets classical contributions. For example in a 2-qubit system, it is
easy to check that the classical mixed state
172
Strong subadditivity
So far we have discussed partitioning a system into two pieces A and B, but we can
partition further and find new inequalities. The strong subadditivity inequality (SSA
for short) applied to a tripartite system HABC = HA ⊗ HB ⊗ HC , is
Although this inequality seems obvious — clearly A has more information about BC
than about B alone — and is ‘just’ a feature of positive matrices, it is surprisingly
difficult to prove. See Nielsen and Chuang for a totally unenlightening derivation.
Sometimes it is useful to express (18.25) in different notation, where A and B are two
overlapping subsystems, which are not independent:
Now, what is the maximal amount of mutual information for a quantum (pure or
173
mixed) state of 2 qubits? Write an example of a state with this maximal amount of
mutual information. (Quantum states with more mutual information than is possible
in any classical state are sometimes called supercorrelated.)
Hint: Purify the tripartite system that appears in strong subadditivity by adding a
4th system, D, with ABCD in a pure state. This is always possible.
174
19 Entanglement Entropy in Quantum Field The-
ory
(19.1)
This picture is at a fixed time. Region A is drawn as a circle, but for now it could be
any shape. (It could also be disjoint, but we will assume it is connected unless specified
otherwise.)
HAB 6= HA ⊗ HB . (19.2)
There are two things to worry about: first, in gauge theories, you cannot really localize
states. The gauge constraint is applied to the full system, so by looking at any sub-
region, you cannot decide whether it is a physical state obeying the constraint. This
issue (which also appears in ordinary quantum mechanical gauge systems) has been
addressed in some nice papers just in the last year or so, and we will ignore it entirely.
It turns out to not affect the discussion that follows very much.
175
by introducing the ‘lattice scale’ U V . With a finite cutoff, the Hilbert space of a
finite region is finite-dimensional, and most of the results of the previous section — in
particular, positivity of relative entropy and strong subadditivity — apply to QFT. In
the end we usually want to regulate the divergences somehow, but the leftover finite
pieces do not immediately obey the same properties, so we need to be careful about
tracking cutoff dependence throughout the problem.
The divergent terms in SA come from UV physics. In the UV, any finite energy state
is the same as the vacuum state. Therefore to discuss the structure of the divergent
terms we can restrict to ρ = |0ih0|, the vacuum state of the full system.
UV divergences
The divergent terms depend on the theory and on the shape of region A. In a local
QFT, we expect the divergent piece to be a local integral over the entangling surface
∂A,
(div)
Z √
SA ∼ dd−2 σ h F [Kab , hhab ] , (19.3)
∂A
Let’s organize (19.3) as an expansion in powers of Kab . Since Kab ∼ 1/LA , this is an
expansion in powers of the size LA . What sort of terms can appear? In a pure state,
(div) (div)
SA = SB , an in particular SA = SB . The extrinsic curvature is K ∼ ∇n with n
the unit normal; this flips sign if we consider region A vs its complement, region B.
(div) (div)
Therefore SA = SB implies that only even powers of Kab are allowed:
(div)
SA ∼ a1 Ld−2
A + a2 Ld−4
A + ··· , (19.4)
176
The leading term in (19.4) is a UV divergence proportional to Area(A). This makes
sense: UV modes entangled across ∂A give a divergent contribution, and the number
of these modes is proportional to the area.
The difference between even and odd comes from the fact that the “1/0U V ” term that
would appear in even dimensions actually turns into a log divergence (just as it does
in Feynman diagrams). The powers of (−1) are inserted by convention.
In the vacuum state, the bi and S̃ depend on the theory, but not on LA or U V .
177
Area vs volume terms
The leading UV divergence is always proportional to Area(A), in any state. In the
vacuum we do not expect any extensive contribution to S̃, but in a random excited
state, we expect
S̃ ∼ Volume(A) . (19.7)
This is for the same reason that we argued for volume scaling in a random state of a
lattice system. In a highly excited random state, the IR modes that contribute to S̃
should all be highly entangled with the outside, and the number of such modes scales
with volume.
c LA
SA = log . (19.8)
3 U V
Here c is the central charge of the CFT (which, remember, roughly speaking counts the
degrees of freedom). This agrees with the general formula in even spacetime dimensions
(19.6), with S̃ = 3c .
If we instead consider a highly excited state, then we can’t do the calculation in general,
but in cases where it can be done the result in a typical state scales as S̃ ∼ cLA .
In a Lorentz-invariant QFT, the density matrix of a spatial region A must contain all
of the same information as the density matrix of a spatial region A0 that shares the
178
same causal diamond. That is, for this setup:
(19.9)
we must have
SA = SA0 . (19.10)
ρA0 = U † ρA U . (19.11)
179
20 Entanglement Entropy and the Renormalization
Group
Entanglement entropy is very difficult to actually calculate in QFT. There are only a
few cases where it can be done. So what is it good for? One answer is the relation
to quantum gravity, which we’ll get to later. Another answer is that entanglement
entropy has led to deep insights into the structure of QFT. It is a tool that is almost
orthogonal to the usual tools of QFT, and can be used to prove general facts about
QFT that, so far, cannot be proved using any other method. The most important
example is on the irreversibility of the renormalization group in d = 3. We’ll now take
a brief detour to describe this result and the relevance of entanglement, as pioneered
by Casini and Huerta. We restrict to Lorentz-invariant QFTs.
(20.1)
Starting with CF T 1 in the UV, we deform by a relevant operator and flow down to
CF T 2 or, depending on the deformation, perhaps CF T 3 in the IR. These CFTs might
be free, or trivial, as in QCD, which is an RG flow between a free theory in the UV
and a gapped (empty) theory in the IR. The IR fixed points may also have relevant
perturbations, so we can continue the process and flow to new theories. Two natural
questions are:
∗
Strictly speaking, it connects scale-invariant theories. It is widely suspected that scale invariant
QFTs are necessarily conformal, but this is proven only in 2d and in 4d under certain assumptions.
180
1. Which CFTs can flow to which other CFTs? For example, can the green flow in
the figure exist, connecting CF T 3 to CF T 4 ? Or should it flow from CF T 4 to
CF T 3 instead?
2. Can there by closed cycles, connecting the IR back up to the UV like the red
dotted flow in the figure?
The RG involves integrating out degrees of freedom, so it would be very strange to find
closed cycles! We expect that each time we do into the IR, we reduced the number of
degrees of freedom. To make this intuition precise has been a longstanding problem in
quantum field theory.
where ctherm is a dimensionless number that we might guess counts degrees of freedom.
However, ctherm does not necessarily decrease along RG flows. An example is the flow
from the interacting critical point of N bosons in d = 3, to the Goldstone phase with
N − 1 free bosons.
d=2: c-theorem
This case is the easiest and has been understood since the 80s, when Zamolodchikov
181
proved that the correct quantity to consider is the central charge c. Zamolodchikov’s
c-theorem states
cU V ≥ cIR (20.3)
c plays many roles in a 2d CFT: It appears in the Virasoro algebra, in the trace
anomaly, in the stress-tensor correlation functions, in the Casimir energy on a circle,
in the thermodynamic free energy, and in the groundstate entanglement entropy. In
higher dimensions, these different quantities can have different constants associated to
them, so it is not obvious how to generalize (20.3) to higher dimensions. The picture
that has emerged in the last few years (conjectured in even dimensions long ago by
Cardy) is that the correct quantity to consider is the partition function on S d . Exactly
how this works depends on the dimension.
d=3: F -theorem
The correct measure of degrees of freedom in a 3d CFT is
It can be shown that this is equal to the finite term in the entanglement entropy of
a spherical region. That is, let A be a ball of radius LA . In the vacuum state the
quantity appearing in (19.5) obeys
S̃ = F . (20.5)
FU V ≥ FIR . (20.6)
This was proved by Casini and Huerta using entanglement methods, described below.
d=4: a-theorem
In even dimensions, the partition function on S d has a log divergence due from the
182
conformal anomaly. The coefficient of this log divergence is called a:
R
log ZS 4 ∼ a log . (20.7)
U V
The same number appears in the entanglement entropy of a spherical region. In the
notation of (19.6),
S̃ ∝ a . (20.8)
(20.10)
We have also labeled the regions X, Y, Z. All of these are spacelike regions. Comparing
causal diamonds, Lorentz invariance, as discussed in section 19.2, implies
SA = SX∪Y , SB = SY ∪Z (20.11)
and
SA∪B = SX∪Y ∪Z , SA∩B = SY . (20.12)
183
Now, strong subadditivity implies
In the vacuum state, the entanglement entropy can depend only on the proper length
of the region. Thus SSA becomes
√
2S( rR) ≥ S(R) + S(r) . (20.16)
or equivalently
C 0 (r) ≤ 0, C(r) = rS 0 (r) . (20.18)
(20.18) is the main technical result: the function C(r) is monotonic as a function of
interval size. Now for the interpretation. First, suppose our QFT is scale invariant. In
this case, from (19.8), the entanglement entropy is
c r
Scf t (r) = log . (20.19)
3 U V
184
given by the central charge,
cU V cIR
C(r → 0) = , C(r → ∞) = . (20.21)
3 3
Note that nowhere in this proof have we used the concept of a quantum field!!! We
used only locality, Lorentz invariant, quantum mechanics, and unitarity (in the guise
of the SSA inequality).
Casini and Huerta’s proof of the F theorem d = 3 is quite similar. In this case, there
is no other known way to prove that RG flows are irreversible – standard field theory
methods in even dimension rely on the conformal anomaly, which does not exist in odd
dimensions.
FCF T = S̃ . (20.26)
To use SSA, we use a more clever version of the boosted-interval setup. Two boosted
185
balls, won’t work, because the union of the causal domain of two boosted balls is not
the causal domain of any ball. Instead we must arrange an infinite number of boosted
regions. Projected onto a single time slice, they look like this:∗
(20.27)
FU V ≥ FIR . (20.28)
∗
Figure taken from Casini and Huerta 1202.5650.
186
21 Holographic Entanglement Entropy
The entanglement entropy in this case is given by the holographic entanglement entropy
formula:
area(γA )
SA = , (21.1)
4GN
where γA is a codimension-2, spacelike extremal surface in the dual geometry, anchored
to the AdS boundary such that
∂γA = ∂A . (21.2)
An extremal surface is a surface of extremal area. This looks roughly as follows, with
z the radial direction in AdS:
(21.3)
γA lives in a particular spacelike slice, so that is what is drawn here with the orthogonal
(time) direction suppressed.
Two additional comments: First, in (21.1), we are only allowed to include extremal
surfaces γA which are homologous (continuously deformable) to region A. Second, if
187
there are multiple extremal surfaces satisfying the homology condition, then the rule
is to apply (21.1) to the one which has minimal area.
Static case
In a static geometry there is a natural t coordinate, and symmetry implies that γA will
always lie within a fixed-t slice. An extremal surface in a fixed-t slice is the same as
a ‘minimal area surface’ inside this slice, so in this case the HEE formula reduces to
finding a minimal-area surface in a d − 1-dimensional space geometry.
Extremal surfaces are minimal-area with respect to deformations inside a fixed-t slice,
but maximal-area with respect to deformations in the t direction (since we can always
reduce the area of a surface by making it ‘more null’). The same is true of spacelike
geodesics, which extremize the length of a curve in spacetime, rather than minimizing
or maximizing it.
LA LA
x ∈ [− , ]. (21.4)
2 2
188
The dual geometry is empty AdS3 , with metric
`2
ds2 = 2
(−dt2 + dx2 + dz 2 ) . (21.5)
z
`2
ds2 = (dx2 + dz 2 ) , (21.6)
z2
LA LA
P1 = (z1 , x1 ) = (0, − ) and P2 = (z2 , x2 ) = (0, ). (21.7)
2 2
However, a geodesic that reaches the boundary like this will have infinite length, since
R dz
z
= ∞. This is the gravity dual of the statement that entanglement entropy in QFT
is UV divergent. To regulate the divergence, we follow the same procedure we used to
regulate the on-shell action, or holographic correlation functions: cut off the spacetime
at z = U V .
(21.8)
189
The factor of 2 is because we the geodesic goes out, and comes back, and we will only
integrate z ∈ [, zmax ] once. Treating (21.9) as a 1d “action”, it is easy to show that
the geodesic is a semicircle,
LA LA
x= cos λ, z= sin λ, λ∈( ,π − ). (21.10)
2 2 LA LA
Plugging this back into (21.9) and doing the integral gives
LA
Length = 2LA log . (21.11)
U V
The map between gravity parameters and CFT parameters in AdS3 /CFT2 is
3`
c= , (21.13)
2GN
This agrees perfectly with our general discussion of the structure of entanglement
entropy in QFT in even spacetime dimensions, (19.6), including the UV divergence.
The prefactor also agrees exactly with the known result in 2d CFT, (19.8).
Exercise: 2d HEE
Fill in all the missing steps — i.e., solve for the geodesic and do the length integral
— in the derivation of (21.14). Don’t forget to use conserved quantities to efficiently
solve the geodesic equation.
190
Exercise: Strips in d dimensions
Compute the holographic entanglement entropy of an infinite strip of width LA in a
d-dimensional CFT in the vacuum state. That is, with CFT coordinates (t, x, ~y ), region
A is the region x ∈ [−LA /2, LA /2], t = 0, ~y =anything.
The proof of the strong subadditivity inequality in quantum mechanics is rather tech-
nical and tricky. The holographic proof, in static states, is easy! The statement of SSA
for a tripartite system is
SABC + SB ≤ SAB + SBC . (21.15)
(21.16)
We’ve picked a color scheme to reorganize the inequality a bit, so now it says
The fact that the curves are minimal area immediately implies
so SSA follows.
This argument has also been extended to the time-dependent case. It is much trickier,
since the extremal surfaces need not all lie in the same spatial slice.
191
21.4 Some comments about HEE
It is tempting to say γ is the empty set, but this would not satisfy the homology
condition. A spatial slice of the black hole spacetime looks like
(21.20)
This is not simply connected, and the ‘empty set’ curve is not deformable to region A.
So, in fact, we must choose γA to be the horizon itself (which is extremal). Thus
area(horizon)
SA = (21.21)
4GN
But why is this ‘entanglement entropy’ ? Actually, it might not be. More accurately, the
HEE formula computes the von Neumann entropy of the reduced density matrix, SA =
− tr ρA log ρA . This von Neumann entropy may or may not come from entanglement —
we can’t tell the difference without knowing the full system. In the black hole spacetime,
the ordinary thermal entropy is the von Neumann entropy of the thermal state ρ =
e−βH , so the HEE formula applied to the full space gives the thermal entropy. You can
also think of this as actual entanglement entropy coming from the entanglement of the
CFT with the thermal double.
192
Some words
Entanglement entropy is a measure of how quantum information is spatially organized
in a quantum state. In a general QFT, it is extremely complicated, and we do not
expect any tractable simple formula. The fact that it simplifies, and becomes geometric,
in holographic CFTs is a deep fact about strongly coupled systems. It means that the
organization of quantum information approaches a sort of simplified, universal limit at
strong coupling. How this happens and exactly how it is related to emergent geometry
is an unsolved, and presumably very important, problem in current research.
193
22 Holographic entanglement at finite temperature
In this section we will discuss the HEE computation in a black hole spacetime. For
explicitness, we will talk about the BTZ black hole in AdS3 , but all of the results hold
qualitatively in higher dimensions, too.
dr2
ds2 = 2
+ r2 dφ2 . (22.1)
r − 8M
A: φ ∈ (0, R) . (22.3)
So the figure is
(22.4)
There are many geodesics connecting the endpoints of region A. In fact there are an
infinite number, labeled by the integer number of times that the geodesic winds the
black hole. If R 2π, then there is one that is obviously the shortest, which does not
wind the black hole. This is the one drawn in the figure. The length of this geodesic
is infinite, but if we impose a cutoff at r = 1/U V , the resulting entropy is
(0)
(0) length(γA )
SA =
4GN
c 1
= log sinh(πRT ) , (22.5)
3 πT U V
194
with c = 3`/2GN .
For 0 < R 2π, this is the final answer. For R > 2π, the same formula (22.5)
computes the length of a geodesic that winds (possibly multiple times) around the
horizon. The winding geodesics do not satisfy the homology condition, i.e., they cannot
be continuously deformed to A. But we must also consider disjoint geodesics. For
example, the horizon itself is a geodesic H. This can be added to to the wrapped
geodesic γ (1) . The union
γ (1) ∪ H (22.6)
(1)
is homologous to region A, if we view the orientation of H as opposite that of γA .
If region A is large, we must choose the dominant (minimal area) surface. We are
choosing between the red (wrapping) and blue (disjoint) surfaces in this figure:
(22.7)
The length of the blue curve gets a contribution from the short part, and a contribution
from the horizon:
c 1 c
SAblue = log sinh(π(2π − R)T ) + 2π 2 T (22.9)
3 πT U V 3
The first term is the answer above, applied to AC . The second term is the thermal
entropy, which we know is area(horizon)/4.
195
The two contributions exchange dominance at some point R∗ > 2π. At this point
there is a sharp transition in the behavior of the entanglement entropy. The plot as a
function of system size is something like the solid line in this figure:
(22.11)
Pure vs mixed
Clearly with a black hole, SA 6= SAC due to the homology condition. In fact we found
(for R > R∗ )
SA = SAC + Sthermal . (22.12)
Since Sthermal is the von Neumann entropy of the full space, this can be written
SA = SB + SAB (22.13)
Thus the entanglement entropy of the black hole saturates the Araki-Lieb triangle
inequality,
SAB ≥ |SA − SB | . (22.14)
196
22.1 Planar limit
The infinite-volume limit of the CFT is a limit of the black hole where the horizon
becomes planar. We can take this limit for BTZ by assuming
T R 1, R 2π . (22.15)
c
SA ≈ πT R + subleading (22.16)
3
R
SA ≈ Sthermal (β) × . (22.17)
2π
This makes sense: in the thermodynamic limit, the state is very mixed up, and the
subsystem itself just looks thermal at temperature β.
Geometrically, the reason behind (22.17) is that, in this limit, the extremal surface
‘hugs’ the black hole horizon for most of its length:
(22.18)
The contribution from the horizon is proportional to the horizon area, i.e., to the
thermal entropy.
We have discussed BTZ, but the same feature generalizes to planar horizons in higher
dimensions: for LA β, the extremal surface hugs the black hole horizon, giving a
volume-law contribution to the finite-temperature entanglement entropy.
197
23 The Stress Tensor in 2d CFT
In the last few lectures, we will go into more depth on the AdS3 /CFT2 correspondence.
First we need to cover some more ground in the basics of 2d CFT.
198
The conformal generators make an algebra
and similarly for the barred generators. This is called the ‘Witt algebra’ (or centerless
Virasoro algebra).
It is called the global subalgebra because these are the only ζn0 s that are non-singular
on the Riemann sphere. To see this, first look near z ∼ 0. Clearly the vector field
ζn = −z n+1 ∂z is regular only for n ≥ −1. Now, do the coordinate change w = 1/z,
and look near w ∼ 0. The vector field becomes
which is regular only for n ≤ 1. So the generators in (23.8) are the only ones regular
at both poles of the Riemann sphere.
Translation invariance implies the action S is invariant under xµ → xµ +µ , for constant
µ . The classical stress tensor is defined by applying the Noether to this symmetry.
Promoting µ to an arbitrary function of xµ and varying the action must give something
proportional to ∂,
√
Z
δS = −2 d2 z gT µν ∂µ n . (23.10)
4π δS
Tµν = − √ . (23.11)
g δg µν
∗
Note the extra −2π compared to our when we computed gravitational stress tensors a while back.
This is just a convention but will be important to remember when we compare the two.
199
It’s conserved
The Noether procedure guarantees that
∇µ T µν = 0 . (23.12)
In a flat background (23.1), in complex coordinates the two components of this equation
are
¯ zz = 0 ,
∂T ∂Tz̄z̄ = 0 . (23.13)
∂ ≡ ∂z , ∂¯ = ∂z̄ . (23.14)
And traceless
So far we have used only translation invariance. In a theory that is classically scale
invariant, we can also conclude that the trace of the stress tensor vanishes. The sym-
metry in this case is the infinitessimal rescaling
xµ → xµ + λxµ . (23.16)
Scale invariance implies that this integral vanishes; conformal invariance, or local scale
invariance, means we can make λ → λ(z, z̄) so Tµµ = 0. In complex coordinates,
Tzz̄ = 0 . (23.19)
200
This is the classical stress tensor. Even if it is traceless, the quantum stress tensor might
have a non-zero trace, for two different reasons. First, the UV regulator introduces a
scale, and may introduce a trace. In fact, in a renormalizable theory,
X
Tµµ (x) = βgi Oi (x) , (23.20)
i
where Oi are the relevant operators of the theory, gi are the corresponding couplings,
and βg are their beta functions. So, for example, in massless QCD, although the
classical stress tensor is traceless, the quantum stress tensor has a contribution from
the non-zero QCD beta function. In a CFT, all the beta functions are exactly zero, so
the equation
Tµµ = 0 (23.21)
is true as an operator statement. The phrase “as an operator statement” means the
equation is true inside correlation functions, up to delta-functions where Tµµ hits other
operator insertions (we’ll see some of these delta functions below).
The second origin of a non-zero trace is a quantum anomaly. This happens even in
CFT, if we place the theory on a curved background. This is called the Weyl anomaly
and is important but we probably won’t have time to cover it.
J z̄ = T (z) (23.22)
is the current associated to z → z+const. What are the Noether currents associated
to the general conformal transformation z → z + (z)? These are simply
This is sort of obvious; to reproduce it from the Noether procedure, you can promote
(z) → (z, z̄) and apply the usual Noether procedure.
201
As expected, the current is conserved,
¯ )=0.
∂µ J µ = ∂(T (23.24)
Suppose we have a general symmetry φ0 = φ + δφ. The fact that this is a symmetry
means the action and the path integral measure are invariant,
The first line is just renaming a dummy variable; the second line defines the current,
J µ , which may have contributions from both the classical action and the measure; and
the third line expands to linear order. It follows that
Z
h J µ ∂µ i = 0 (23.29)
This is the quantum version of the Noether procedure. The same exact steps, starting
instead with DφO1 (x1 ) · · · On (xn )e−S[φ] can be used to show that
R
The restriction to xi 6= y is necessary for the derivation to work. Just set the support of
202
to a small circle around y that does not include any of the other operator insertions,
and repeat the steps above.
If there are insertions that collide with ∂µ J µ , we have to be more careful. Suppose x1 =
y. Then when we do the transformation inside the path integral, the transformation
also affects this operator insertion,
O1 → O1 + δO1 . (23.33)
So now,
Z Z Z
−S[φ]
DφO1 (x1 ) · · · On (xn )e = Dφ(1 − J µ ∂µ )(O1 + δO1 )O2 · · · On e−S[φ]
(23.34)
and the conservation law is modified to (restoring some dropped constants)
Z
1
− ∂µ hJ µ (y)O1 (x1 ) · · · On (xn )i = hδO1 (x1 )O2 (x2 ) · · · On (xn )i, (23.35)
2π D(y)
where D(y) is a disk enclosing y, where we’ve chosen to be non-zero. Allowing for
any of the operators to collide with the current, (23.35) becomes
X
∂µ hJ µ (y)O1 (x1 ) · · · On (xn )i = δ(y − xi )hO1 · · · δOi · · · On i . (23.36)
i
As a residue
In two dimensions, using complex coordinates we can write (23.36) in a nice way. By
Stokes, we have Z I
µ
∂µ J ∼ (Jz dz − Jz̄ dz̄) (23.37)
D(y) y
203
and I
i
dzJ(z)O(x) = −resz∼x J(z)O(x) . (23.38)
2π x
Therefore, the Ward identity (23.36) can be written as the operator equation
Therefore the Ward identity for conformal transformations, allowing for both holomor-
phic and anti-holomorphic transformations, is
The sum is over all local operators in the theory. This is called the operator product
expansion (OPE). The OPE coefficients f12j (x − y) depend on the theory, but they
are restricted by conformal invariance, so that in fact the fijk ’s of primary operators
determine the fijk ’s of descendants as well.
The Ward identity, in the form (23.41), means that the singular terms in the O(x)T (y)
OPE are related to the conformal transformations of O.
204
Primaries
Recall that primary operators transform as
−h −h̄
dz 0 dz̄ 0
0 0 0
O (z , z̄ ) = O(z, z̄) (23.43)
dz dz̄
where (h, h̄) are called the conformal weights of O. The corresponding infinitessimal
transformation is
The symbol ‘∼’ means that we only written the singular terms in the OPE; there is
also an infinite series of contributions regular at z = w. To check this gives the correct
residue, expand
Upshot
The lesson is that singular terms in the T φ OPE contain exactly the same information
as the transformations of φ under conformal symmetry. The conformal algebra is one
and the same as the data in the singular part of the OPE.
Now we will examine the transformation of the stress tensor under conformal symme-
try, or equivalently, the T T OPE. Much of the discussion in this section is easiest to
understand first using an example like a free scalar field. This example is worked in
205
every reference so I will not repeat here but encourage the reader to jump to the free
scalar section of Polchinski, Kiritsis, or Tong’s notes before continuing.
The stress tensor is not a primary, so we cannot plug O = T into (23.46). But, it does
behave similar to a primary under rescalings, with (h, h̄) = (2, 0):
This is basically dimensional analysis; the energy should have mass dimension 1, and
R
E ∼ T . Similarly, T (z̄) has scaling weights (h, h̄) = (0, 2). Requiring both sides of
the OPE to have the same scaling weight, it must have the form
set X1 = 0. The OPE is always invariant under permutations (in Euclidean signature).
One way to see this is that equal-time commutators in Lorentzian signature must
vanish by causality, and these equal-time commutators correspond to permutations of
the Euclidean correlator. This property is sometimes called locality or causality.
To fix the other two terms, we use the Ward identity for the scale symmetry (23.48).
The Noether current for scale symmetry is, from (23.23)
Then using the Ward identity (23.41) gives our final answer for the singular terms in
the OPE:
c/2 2T (w) ∂T (w)
T (z)T (w) ∼ 4
+ 2
+ . (23.52)
(z − w) (z − w) z−w
∗
I will not explain why, but this can be found in the references, in the section on the state-operator
correspondence and conformal reps. This also explains why c must be a number, not a field.
206
The constant c is called the central charge. For a free boson, it turns out to be c = 1,
and for a free fermion c = 1/2. (see references).
c 000
δT = −T 0 − 20 T − . (23.53)
12
The first two terms are the usual transformations of a primary. The last term is an
‘anomolous’ term coming from the central charge.
where
f 000 3 (f 00 )2
{f (z), z} ≡ 0 − (23.55)
f 2 (f 0 )2
is called the Schwarzian derivative. The easiest way to derive (23.54) is to check that
it agrees infinitessimally with (23.53), and that it composes correctly under multiple
transformations.
c
[Lm , Ln ] = (m − n)[Lm − Ln ] + (m3 − m)δm,−n . (23.56)
12
This is where the name ‘central charge’ comes from. It is another way of writing
(23.52): the T T OPE, the Schwarzian derivative, and the Virasoro algebra all contain
the same information.
207
23.6 Casimir Energy on the Circle
c is related to the Casimir energy of the theory on a circle. The mapping from the
plane to the cylinder of radius L is
z = e2πw/L . (23.57)
The cylinder coordinate is identified w ∼ w + iL, since this takes us around a circle on
the z plane. Using the finite transformation law (23.54) gives
2
2π c
Tcyl (w) = z 2 Tplane (z) − , (23.58)
L 24z 2
where now we’re using ‘plane’ and ‘cyl’ to distinguish the stress tensor before and after
the transformation.
Let’s calculate the expectation value. On the plane, scale invariance sets all 1-point
functions to zero:
hTplane (z)i = 0 . (23.59)
This is because the only scale-invariant function of z with weight 2 is 1/z 2 , but this
would not be translation invariant. Now using (23.58),
2
c 2π
hTcyl (w)i = − . (23.60)
24 L
This is a Casimir energy, in the usual sense: we started with a theory on a line (i.e.,
space is a line), then imposed periodic boundary conditions with period L, and found
energy ∼ 1/L. The size of the Casimir energy is fixed by the central charge. This is
our first indication that c might be a good way to the measure the degrees of freedom
of a CFT.
To compute the value of the energy explicitly, choose real coordinates w = τ + iφ,
where φ ∼ φ + L. The energy is defined in the usual way by integrating the stress
208
tensor over a fixed-time slice:
1 L
Ecyl = ∈ dxhTτcyl
τ (τ = 0, x)i (23.61)
2π 0
Z L
1
= cyl
dxhTww + Tw̄cyl
w̄ i (23.62)
2π 0
In the second line, we just did the change of coordinates w = τ + iφ, w̄ = τ − iφ, and
used tracelessness of the stress tensor, Tww̄ = 0. Evaluating this in vacuum gives the
Casimir energy
vac c 2π
Ecyl =− . (23.63)
12 L
In a general state, (23.61) gives
c
Ecyl = ∆ − (23.64)
12
where I I
1
∆=h dzzT (z) + dz̄ z̄ T̄ (z̄) i . (23.65)
2πi
Recall that J = zT (z) is the Noether current for scale transformations. Therefore, the
rhs is the expectation value of the scale operator. In an eigenstate, ∆ is the scaling
dimension of that state. So (23.64) says that the energy on the cylinder is the scaling
dimension on the plane, shifted by the central term. This makes sense, since time
translations on the cylinder correspond to scale transformations on the plane.
209
24 The stress tensor in 3d gravity
Now we will compare the stress tensor of 3d gravity to our results of the previous
section. Consider the asymptotically-AdS3 metric
`2 2 r 2
2
ds = 2 dr + 2 dzdz̄ + hzz dz 2 + hz̄z̄ dz̄ 2 + 2hzz̄ dzdz̄ . (24.1)
r `
We have not written every possible term in the perturbation hµν , but it turns out that
other terms can be removed by a diff. Also, we will keep only the leading term in hµν
at large r, i.e., near the boundary, so we can assume that hµν is independent of r.
The Einstein equations imply that the perturbation is traceless and conserved:
hzz̄ = 0 (24.2)
¯ zz = 0 .
∂hz̄z̄ = ∂h (24.3)
The stress tensor of this geometry was computed in an exercise from one of the early
lectures. Let’s briefly review how this works. The Brown-York stress tensor is∗
on−shell
4π δSEinstein
Tij ≡ − √ (24.4)
g δg ij
1 1
= − Kij − Kgij − gij . (24.5)
4 `
The first two terms came from varying the Einstein action plus the Gibbons-Hawking
boundary term. The last term comes from the counterterm, with the coefficient set
in order to make the answer finite as r → ∞. Plugging the metric (24.1) into (24.5),
using (24.2) and doing a lot of work, eventually
1 1
Tzz = − hzz , Tz̄z̄ = − hz̄z̄ . (24.6)
4` 4`
∗
We’ve changed conventions by a factor of 2π compared to some earlier lectures. This is just a
choice, made to agree with our convention for the CFT stress tensor.
210
Thus the Brown-York metric is traceless, conserved, and therefore holomorphic/anti-
holomorphic just like in CFT.
What vector fields ζ preserve the form of the metric (24.1)? The answer is
`4 00
z → z + (z) − ¯ (z̄) (24.8)
2r2
`4
z̄ → z̄ + ¯(z̄) − 2 00 (z)
2r
r 0 r
r → r − (z) − ¯0 (z̄)
2 2
for arbitrary functions (z) and ¯(z̄). Near the boundary, these act on z, z̄ just like
conformal transformations. The extra ∂r piece acts as a rescaling.
Thus transformations of AdS3 that preserve the asymptotics of the metric coincide
with 2d conformal transformations!
Let’s set ¯ = 0 and focus on the holomorphic conformal transformations. Under (24.8),
the metric transforms as
`2 000
2 2 0 0
ds → ds + −2hzz − hzz + dz 2 (24.9)
2
Thus the dz 2 piece of the metric, which we interpreted as the gravitational stress tensor
up to a factor of −1/4`, transforms as
`
δ T = −∂T − 2T ∂ − 000 . (24.10)
8
This is exactly the transformation law in 2d CFT derived in (23.53). Comparing the
211
coefficient of the anomalous term, we see
3`
c= (24.11)
2GN
This is called the Brown-Henneaux central charge, after Brown and Henneaux who com-
puted it way back in 1987 – long before AdS/CFT, and even well before the discovery
of the BTZ black hole or the Brown-York stress tensor. They used a different method,
based on conserved charges, which directly produces the Virasoro algebra (23.56) as
the asymptotic symmetry group. As far as I know, they did not recognize the relation
to the 2d conformal group.
r2 dr2
ds2 = −( 2
− 8M )dt2
+ 2 2
+ r2 dφ2 (24.12)
` r /` − 8M
In an exercise in a previous lecture you computed the energy of this solution, and found
E = M.
Global AdS can also be written in the form (24.12), by choosing M = − 8` . Therefore
the gravitational energy of the groundstate on the cylinder is
` c
Evac = − = − , (24.13)
8 12
This is equal to the Casimir energy of a 2d CFT (23.63). It was actually guaranteed
to agree once we found the transformation law (24.10) agrees with CFT, because the
212
finite version of this infinitessimal transformation must be the Schwarzian derivative,
on the gravity side just as it was in CFT.
213
25 Thermodynamics of 2d CFT
In this lecture we will discuss the thermodynamics of 2d CFT, including the torus
partition function and the famous Cardy formula for CFT entropy. Later we’ll come
back to holography, but this note is entirely in CFT.
This is a sum over states in some Hilbert space; that Hilbert space depends on our
choice of the space in which the theory lives, i.e., our choice of boundary conditions
on the fields. If we choose space to be a circle, then
c
X X
Z= e−βEcyl = e−β(∆− 12 ) , (25.2)
states states
where ∆ is the scaling dimension of the state. The second equality comes from (23.64),
and assumes Lcyl = 2π.
As usual, the trace (25.1) is equal to a path integral in periodic imaginary time, with
period β. Since space is also periodic, this is a path integral on a torus, with a ‘space’
identification and a ‘thermal’ identification:
w ∼ w + L ∼ w + iβ . (25.3)
Our aim is to compute this path integral at high temperature. First we discuss general
properties of CFT on a torus in the next couple subsections.
Consider a 2d QFT (not necessarily conformal yet) on a Euclidean torus. The most
general torus is specified by two lattice vectors ~v1 , ~v2 on the (tE , φ) plane, meaning that
214
we identify all points related by
1
In the complex coordinate z = 2π
(φ + itE ), the torus is specified by a complex number
1
τ= (β + iθ) . (25.6)
2π
The path integral on this torus will be denoted Z(τ, τ̄ ). τ is called the modulus, or
complex structure modulus of the torus.
Converting to trace
The path integral on this torus can be converted to operator language by declaring
that φ is ‘space’ and tE is ‘time.’ Then we take states on a constant-tE surface and
evolve them along the vector β∂tE + iθ∂φ . Thus the path integral can be rewritten as
−βH+iθJ
Z(τ, τ̄ ) = Tr H(0,2π) e (25.8)
where J is the angular momentum (which, by convention, generates motion along −∂φ ,
hence the sign flip). In the subscript, we have noted explicitly what Hilbert space
to trace over: it is the Hilbert space of states defined on the spatial slice where φ ∈
[0, 2π] with tE held fixed. That is, the fields obey the boundary condition X(tE , φ) =
X(tE , φ + 2π). This is not standard notation; usually people just write ‘ Tr ’, with
H(0, 2π) implicit.
215
Large conformal transformations
We already know how conformal transformations act on the plane, as holomorphic/anti-
holomorphic coordinate changes. Now we want to examine conformal symmetry on the
torus. All of the usual conformal transformations z → z +(z) are still symmetries, but
there are also new, ‘large’ conformal transformations, which cannot be continuously
connected to the identity.
The S transformation
One way to see this is to re-slice the torus path integral in a different way. We’ll start
with an intuitive explanation in the simplest case θ = 0, then come back to the general
story below. The torus is Euclidean, so we are free to switch the roles of tE and φ
when we construct the trace by declaring tE is ‘space’ and φ is ‘time’. Then following
the usual logic, we find
−2πJ
Z(τ, τ̄ ) = Tr H(β,0) e . (25.9)
Therefore we have written the same path integral in two different ways (25.8) and
(25.9),
−βH −2πJ
Z(τ, τ̄ ) = Tr H(0,2π) e = Tr H(β,0) e . (25.10)
Here the rotation by 90◦ sends J → H, and the rescaling inserts the factor of 2π
β
.
It follows that
4π 2
−βH
Z(β) = Tr e =Z . (25.12)
β
This is called the S transformation. It came from a large conformal transformation
that swapped space and Euclidean time, and it relates the thermodynamics at high
temperatures to the thermodynamics at low temperatures.
216
25.2 SL(2, Z) transformations
z ∼z+1∼z+τ . (25.13)
w ∼w+1∼w+τ +1 . (25.14)
This is precisely the same torus, since the lattice m + nτ with m, n ∈ Z is unchanged.
From the point of view of the z coordinate, the w coordinate system ‘winds’ around
the torus:
F IGU RE : T wotori, windingcoords (25.15)
T : τ →τ +1 . (25.16)
The winding means that this coordinate transformation cannot be continuously de-
formed to the identity, i.e., there is no infinitessimal version.
This is because we haven’t actually done anything, we’ve just rewritten the same torus
path integral in a different coordinate system. We can also check explicitly from the
trace formula:
Z(τ + 1, τ̄ + 1) = Tr e−βH+i(θ+2π)J . (25.18)
For a theory with only bosons, this is equal to Z(τ, τ̄ ) since angular momentum is
integer quantized. In a theory with fermions, we have to be more careful about imposing
boundary conditions on the fermions, but in the end it still works.
217
There are in fact an infinite number of ways to slice the torus. The general choice of
lattice vectors v10 , v20 that generate the same lattice is
v10 a b v1
= , (25.19)
v20 c d v2
where
a, b, c, d ∈ Z and ad − bc = 1 . (25.20)
a b
The matrix is therefore an element of the group SL(2, Z)— i.e., , 2x2
c d
matrices with integer elements and unit determinant. This re-slicing maps
aτ + b
τ→ . (25.21)
cτ + d
SL(2, Z)
is generated
by the S and T transformations. The S transformation is the
0 1
matrix , which acts as
−1 0
S: τ → −1/τ . (25.22)
Unlike T , the other SL(2, Z) transformations are not symmetries of a general QFT.
This is because the new space circle, which defines the Hilbert space, is not the same
as the old space circle. It is only in a scale-invariant theory that we can rescale these
two circles and see that they have the same Hilbert space. Thus in conformal field
theory, SL(2, Z) is a symmetry:
aτ + b aτ̄ + b
Z(τ, τ̄ ) = Z , . (25.23)
cτ + d cτ̄ + d
218
25.3 Thermodynamics at high temperature
(We will restrict to the case θ = 0, but angular potential can be included at the expense
of slightly more complicated formulas.) First let’s calculate the partition function at
very low temperature:
X
Z(β) = e−βE ≈ e−βEvac (β → ∞) . (25.24)
states
This is simply the statement that at very low temperature, the vacuum state dominates
c
the canonical ensemble. Above we found the Casimir energy Evac = − 12 , so
βc
Z(β) ≈ e 12 (β → ∞) . (25.25)
so we can repeat the derivation at low temperature using the S-transformed partition
function. Replacing β → 4π 2 /β in (25.25) gives
π2 c
Z(β) ≈ e 3β (β → 0) . (25.27)
π2
V
F = − cT 2 , (25.28)
3 2π
where we’ve written the formula for a general ‘volume’, which we’ve been assuming
is 2π. This is a remarkable formula. The scaling with the temperature in the free
energy is fixed by dimensional analysis in a CFT, but the coefficient is not. The high-
temperature free energy dominated by very heavy states, and is usually impossible to
calculate in a strongly interacting QFT. But in this case, modular invariance relates it
to the Casimir energy of the vacuum state.
There are two immediate consequences. First, this confirms once again that c should
be interpreted as a measure of degrees of freedom. Second, it fixes the asymptotic
219
density of states. To see this, write the result as
Z
2 c/(3β)
X
−βE
e ≈ dEρ(E)e−βE ≈ eπ (β → 0) . (25.29)
states
The first sum is over all states of the theory; the integral is over energies, with ρ(E)
the density of states.
The rhs is very singular as β → 0. Each individual term on the lhs is regular at β = 0,
so the singularity can only come from the infinite sum. The strength of the divergence
must be related to the growth of ρ(E) as E → ∞.
To make this more quantitative, we can use standard thermodynamic formulae. The
thermodynamic entropy and energy are
2π 2 c
S = (1 − β∂β ) log Z = (25.30)
3β
2
π c
E(β) = −∂β log Z = 2 . (25.31)
3β
As usual in stat mech, this formula should give the density of states via
As a check, let’s plug this into the partition function and see that it reproduces the
expected singularity as β → 0:
Z
Z(β) ≈ dEρ(E)e−βE (25.34)
Z
≈ dE exp (S(E) − βE) (25.35)
≈ exp (S(E∗ ) − βE∗ ) (25.36)
220
where E∗ is the saddlepoint value, defined by
S 0 (E∗ ) − β = 0 . (25.37)
This saddlepoint is just the usual thermodynamic value of the energy. Using (25.32),
finding the saddlepoint, and plugging in, we find
π2c
Z(β) ≈ exp (25.38)
3β
as claimed. (This is not a surprise; the whole point of thermodynamics and Legendre
transforms is to solve this saddlepoint equation for you.)
The entropy formula (25.32) is often called the Cardy formula. It applies to any CFT
as E → ∞.
r2 dr2
2 2
ds = − − 8M dt + 2 2 + r2 dφ2 (26.1)
`2 r /` − 8M
221
with φ ∼ φ + 2π, has energy E = M and entropy
area
S =
4
1
= 2πρ+
4
π√
= 8M `
2 r
c
= 2π E. (26.2)
3
In the last line we used the Brown-Henneaux central charge, c = 3`/2 (with GN = 1).
This exactly matches the Cardy formula, (25.32).∗ This match was found by Stro-
minger in 1997. What is important about this result is that the Cardy formula counts
microstates in statistical mechanics. They are microstates in the dual CFT, but by
holographic duality, they must be microstates of quantum gravity in AdS3 as well.
We’ve just counted them without actually enumerating them, but a great deal of
progress has been made enumerating them as well.
26.2 Strominger-Vafa
Historically, the first black hole microstate counting was a string theory calculation by
Strominger and Vafa. It gave the same answer as (26.2). This calculation was impor-
tant because it laid to rest any final doubts about whether black hole thermodynamics
is really thermodynamics (i.e., coming from stat mech) or just a mysterious analogy.
In quantum gravity, black hole entropy counts microstates:
where ρQG is the density of states in quantum gravity. This is an extraordinary window
from low-energy physics into the theory of quantum gravity well above the Planck scale.
∗
Actually the Cardy formula was for E → ∞, whereas the black hole formula is for E > 0. It is
possible to get the match at all energies but requires further input from string theory, or some further
assumptions about the spectrum of the CFT.
222