Winitzki - Heidelberg Lectures on Advanced General Relativity 2007
description
Transcript of Winitzki - Heidelberg Lectures on Advanced General Relativity 2007
Lecture 1, 2007-10-17. General informa-
tion
Advanced General Relativity and Cosmology
Instructor: Dr. Sergei Winitzki
Office: Philosophenweg 19, Room 014
Office hours: Thursdays 14:00-16:00
Home page: http://www.tphys.uni-heidelberg.de/~winitzki/AGRC/
Lectures: Wednesday and Friday, 11:15-13:00,
Philosophenweg 19, Seminarraum
Material shown in green was not presented due
to lack of time or for other reasons
1
Course outline
Problem of the year: inflation and dark energy
Tensor calculus explained without indices
Cosmological evolution in f(R) gravity
Birrell-Davies demystified: Basics of quantum
field theory in curved spacetime
Classical and quantum theory of cosmological
perturbations
Geometry of null surfaces, causal structure,
chronology protection, time machines in GR
Thermodynamics of horizons, entropy of black
holes, "holographic principle"
Pseudotensors demystified: Energy in asymp-
totically flat spaces; EMT of gravitational waves
Hawking-Ellis demystified: Classical singularity
theorems
2
Problem of the year
Cosmological observations yield:
- primordial inflation
- late-time inflation
Early work on inflation:
A. Starobinsky, Phys. Lett. B91, p.99, 1980
de Sitter stage in cosmological evolution
Mechanism: modification of gravity due to
quantum backreaction, for example:
S[g] =
∫
d4x√−g
(
R
16πG+ αR2
)
Inflationary spacetime: de Sitter with flat spa-
tial sections, Hubble rate H:
gµνdxµdxν = dt2 − e2Ht
(
dx2 + dy2 + dz2)
3
Problem of the year
To explain dark energy and primordial inflation
within a single model of modified gravity, and
to remain in agreement with other tests of GR.
A model of modified gravity is f(R) gravity:
S =1
16πG
∫
d4x√−g (R+ f(R)) + Smatter
Recent proposals: f(R) = c1(
1 + c2R−n)−1
How to describe consequences of f(R) gravity?
- Need equations of motion for g
- Solutions for Solar system tests
- Homogeneous cosmological solutions: have
inflation and/or dark energy?
- Primordial cosmological perturbations
There are > 50 papers on aspects of f(R)
gravity in 2006-2007
4
Deriving the EOM for f(R) gravity
Note: f(R) is nonlinear ⇒ EOM are 4th order.
Equations for gµν are complicated
Use the method of Lagrange multipliers! Then
we can reduce the Lagrangian to one that is
linear in R and then EOM will be 2nd order
Analogy with mechanics:
S =
∫
dt
[
1
2x2 + f(x, x)
]
– nonlinear in x
EOM would be 2nd order if S were linear in x
Trick: Introduce new variable y and Lagrange
multiplier λ that enforces y = x:
S =
∫
dt
[
1
2x2 + f(x, y) + λ (x− y)
]
EOM for y is f,y(x, y)− λ = 0.
Since f,yy 6= 0, we can solve for y through x, λ:
y = Y (x, λ). Also replace λx→ −λx. Finally:
S =
∫
dt
[
1
2x2 − λx+ f(x, Y (x, λ))− λY (x, λ)
]
Obtained a first-order Lagrangian with extra
d.o.f. λ and potential energy V (x, λ) = f − λf ′5
Whether one can/cannot substitute into
Lagrangians
Suppose we have L(x, x, y, y, ...); we can change
variables x, y → A(x, y), B(x, y), ...
We cannot change variables to new variables
involving derivatives, x→ A(y, y, ...)
Example: L = x2, substitute x = y + y, get
wrong EOM: y(4) − y = 0
Suppose we have L(x, x, y, y, ...) and one of the
EOM is x = f(y, y, ...) ⇒ We can eliminate x.
Suppose we have constraints: L+λF . We can
substitute F into L as long as L+λF remains.
This works even if F involves derivatives!
But: we cannot substitute EOM into L if
EOM is x = f(x, x, ...) or x = f(y, z) (change
the number of relevant initial conditions). Also
cannot substitute arbitrary relationships, e.g. spe-
cific solutions x = f(y, y, const), unless they
are enforced by constraints.
6
Practice problems:
1. Show that L = xf(x, x) can be reduced to
1st order by adding a total derivative ddtM(x, x).
2. Reduce L = f(x, x, x,...x ) to 1st order by
introducing appropriate new variables.
3. Reduce L =...x f(x, x, x) to 1st order (first
reduce to 2nd order, i.e. remove...x ).
Incorrect substitution of EOM into Lagrangian
Example: L = 12x
2 + 12xy
2
Equations of motion:
x =1
2y2
d
dt(xy) = 0
Solve y = C1x−1, substitute into EOM for x:
x =C2
1
2x
But: what if we substitute y = C1x−1 into L?
L =1
2x2 +
C21
2
1
xEquation of motion for x now is wrong:
x = − C21
2x2
7
Want to derive EOM for f(R) gravity!
Apply this technique to f(R) gravity:
S[g] =1
16πG
∫
d4x√−g (r+ f(r) + λ (R− r))
EOM for r is 1+ f ′(r)− λ = 0. Solve r = r(λ)
and substitute back:
S[g] =1
16πG
∫
d4x√−g
(
f − rf ′+ λR)
Now apply a trick: conformal transformation:
g → g ≡ e2Ωg
Choose Ω such that λR√−g becomes R
√−gplus some terms — need a formula for R!
Result: GR with scalar field Ω and metric g
“Einstein frame” vs. “Jordan frame” – these are
different variables, not reference frames
Note: coupling to matter is through g, not
through g — matter feels Ω and g
8
Tensor calculus without indices
Geometric view: manifolds, topology
Geometric operations on vector and tensor fields
Lie derivative, metric, curvature
Index-free notation: instead of
uνuµ;ν = 0 ⇒ uν∂ν
(
gαβuαuβ
)
= 0
write
∇uu = 0 ⇒ ∇ug(u,u) = 2g(∇uu,u) = 0
Practical tools for calculations without indices
Vielbein formalism; curvature and Einstein eqs.
9
Quantum field theory in curved spacetime
Key applications in cosmology:
* Prediction of the primordial fluctuations
* Dynamics of chaotic inflation
Key applications in black hole physics:
* Hawking radiation
* Black hole entropy
* Thermodynamics of horizons
Many calculations do not require “hard” QFT!
10
Causality and horizons
In GR, matter determines the causal structure
Time machines? Chronology protection?
Horizons are boundaries of causal regions
Horizons have thermodynamical properties!
Black holes have temperature and entropy
“Holographic principle”: a bound on entropy
due to quantum effects and nonlinear coupling
to gravity (not due to quantum gravity)
Can study BHs without quantum gravity!
11
Energy and conservation laws in GR
Ill-defined local energy density; “pseudotensors”
Energy in asymptotically flat spaces
Energy of matter + gravity is conserved!
Energy-momentum tensor of gravitational waves
12
Classical singularity theorems
GR predicts singularity in many cases
Limit of applicability of classical GR!
Proofs of singularity theorems have 3 parts:
* Geodesic focusing theorem
* Energy conditions – guarantee focusing
* Topological argument ad absurdum
Applications:
* Gravitational collapse of stars
* Past cosmological singularity (Big Bang)
* Future cosmological singularity (Big Crunch)
13
Lecture 2, 2007-10-19. Manifolds
Gravity in GR curved geometry, not a force
Observations: Small scales R4, large scales –?
Manifold M is a generalized “curved space”
Visualize manifolds as embedded surfaces
Examples: sphere x2 + y2 + z2 = R2.
Torus:(√
x2 + y2 −A)2
+ z2 = R2, A > R
0
z
A
Rρ
14
Local coordinates
Locally like Rn means local coordinate system
which is a map U → Rn, where U ⊂ M is a
patch; all of M must be covered by patches
Local coordinates on torus: 0 < (φ , θ) < 2π,
x = ρ cosφ, y = ρ sinφ, z = R cos θ,
ρ = A+R sin θ
Another choice of coordinates: ρ = ab−sin θ,
z = c cos θb−sin θ, for appropriately chosen a, b, c. The
coordinate change map Rn → R
n is smooth
Intrinsic description: no embedding needed!
Coordinates are scalar functions on M
Functions on manifold = functions on patches
= functions on subsets of Rn
Intrinsic definition of smoothness: smooth func-
tions on subsets of Rn + smooth coordinate
change maps
15
Tangent spaces
Vectors in physics mean velocities / directions
Visualize a vector as velocity of a point moving
within M. Path γ(t) given in local coord’s
In embedding picture: vector is in tangent
plane
E.g.: hypersurface f(x) = 0, normal vector
nj = ∂xjf(x0), tangent plane is n · (x− x0) = 0
Tangent plane→tangent space TpM at point p
Intrinsic picture: velocity means change; tan-
gent vector v = derivative operator on scalar
functions f : M → R; this derivative is with
respect to the “time” of the moving point
Dγf =d
dtf(γ(t)) =
∑
j
∂f
∂xj
∣
∣
∣
∣
x=γ(t0)
dγj
dt
∣
∣
∣
∣
∣
t=t0
The derivative operator is v ≡ ∑
jdγj
dt∂∂xj
. Write
v f instead of Dγf .
Tangent vectors v form a vector space; basis
tangent vectors are ∂xj ≡ ∂j.16
Calculations with tangent vectors
Chain rule: v [f(g)] = f ′(g)v gConsequence: components of v in a coordi-
nate system xµ are computed as v xµ, and
then v =∑
µ (v xµ) ∂xµ.
Practice problems: 1. Find components of
x∂x + y∂y in coordinates a = x+ y, b = 3xy2. In 2D, find tangent vector γ to the curve
γ(t) = x = cos t, y = 2sin t in coord’s x, y3. Given u = ∂x+x∂y, find coord’s X,Y such
that u = ∂Y .
Other properties: v (fg) = fv g + gv f ,linearity v (f + g) = v f + v g – as a 1st
derivative. All 1st derivatives are equivalent to
tangent vectors.
Tangent vectors as short curve segments:
γf ≈ f(γ(t0 + σ)− γ(t0))σ
, f(p′)−f(p) ≈ (σγ)f
Local vector field = choice of tangent vector
v|p ∈ TpM at each point p ∈ U ⊂MFlow of vector field; show that v t = 1
17
Commutator of vector fields
Consider [u,v] f = u (v f)− v (u f)This is a 1st derivative operator, thus [u,v] is
a new tangent vector
Example: [x∂x, ∂x + ∂y] has no 2nd derivatives
Proof: show that [u,v] (fg) = ... [Derivation]
Practice problem: compute commutators [a,b],
[a, c], [b, c] for
a = x∂x + y∂y + z∂z, b = x∂y − y∂x, c = ∂z
Geometric view: [a,b] is the arrow p′ → p here:
a
b
p′
p0
p
p1
p2
Approximately f(p)− f(p′) ≈ (δt)2 [a,b] f |p018
Lecture 3, 2007-10-24. Connecting vector
fields
Vector field c is connecting for v if [c,v] = 0
(these vector fields commute)
Connecting vector c 6= v points across v and
along lines of equal “time”, for some choice of
“time” across the flow
s=0
τ = 3
τ = 2
τ = 1
v
c
Note: coordinate vector fields ∂xµ always com-
mute (no discrepancy when following coord. lines)
Question: Given a vector field v, can we find
coord’s xµ s.t. v = ∂x1? (Yes. Need a basis
of connecting vector fields for v. But [c,v] = 0
is an ODE for c along flow of v.)
19
Practice problems: 1. A vector v is tangent
to a surface f(p) = const iff v f = 0. If u,v
are tangent to the surface, show that [u,v] is
also tangent to the same surface.
2. A flow is given explicitly by the equations
x, y, z 7→ x+ τy, y, z − τxCompute the vector field v that generates this
flow. Find two examples of connecting vector
fields for v.
Differential forms
1-form is a linear function chosen in each tan-
gent space (element of cotangent space T ∗pM)
Example: v 7→ v f for fixed f ; this 1-form is
denoted df , so (df) v ≡ v f . The 1-form df
is called the gradient of f
Interpretation: (df)v is the change of f over
a short curve segment represented by v
Coordinate 1-forms: dx, dy, dz are gradients
of coordinate functions x, y, z
(dx) ∂x = 1, (dx) ∂y = 0, etc.
Practice problems: 1. For ω = d(x2y3) and
v = y2∂x, compute ω v
2. Let ω a ≡ y (a x) y2 − 2a y; express ω
through dx, dy
3. Find all 1-forms ω = adx + bdy such that
ω (∂x + x∂y) = 0
4. Find all vectors v = a∂x + b∂y such that
(xdy+ ydx) v = 0
Tensor fields = tensor product of vector fields
and 1-forms
20
Calculus of n-forms
Antisymmetric tensor product of 1-forms:
(ψ ∧ ω) (a,b) = ψ(a)ω(b)−ψ(b)ω(a)
Exterior differential:
d (λω) = (dλ) ∧ ω+ λ ∧ dω
d(ψ ∧ ω) = (dψ) ∧ ω+ (−)|ψ|ψ ∧ dω
d (dω) = 0
Examples: d (xdx) = 0; d(xdy−ydx) = 2dx∧dyGeneral formulas: (θ is 1-form, ω is n-form)
(dθ) (x,y) = x θ(y)− y θ(x)− θ ([x,y])
(dω) (v1, ...,vn+1) =n+1∑
s=1
(−)s−1 vs ω(v1, ...vs...,vn+1)
+∑
1≤r<s≤n+1
(−)r+s−1ω(
[vr,vs] ,v1, ...vrvs...,vn+1
)
Insertion operator: (ιxω)(a,b, ...) = ω(x, a,b, ...)
Integration of ω over a domain U : ∫U ωStokes theorem:
∫
U dω =∫
∂U ω
Practice problem: show that dθ(x,y) does
not depend on derivatives of x,y
21
Frobenius theorem for 1-forms
1-form ω is exact iff ω = df ; then ω(v) = 0
means v is tangent to the surface f = const
Given a 1-form ω, how to find whether ω(v) =
0 is surface-forming?
Example: ω = xdy is surface-forming, but not
xdy − ydx+ dz (it’s a spiral-shaped field!)
Theorem: ω is surface-forming iff ω ∧ dω = 0
Proof: 1. If ω is surface-forming, then ∃f : ∀v,
(ω(v) = 0) ⇔ (v f = 0). Choosing a basis
df, θ1, θ2, ... in T ∗pM we find ω = λdf (else
∃v : v f 6= 0, ω(v) = 0) ⇒ ω ∧ dω = 0.
2. From ω ∧ dω = 0, need to show that
flows of ∀v : ω(v) = 0 lie within a single sur-
face. Commutator of u,v should remain within
the surface! Need to show ∀u,v : ω(u) = 0,
ω(v) = 0 ⇒ ω([u,v]) = 0. Choose a vector
t such that ω(t) = 1 (“transverse”). Com-
pute: 0 = (ω ∧ dω)(t,u,v) = ω(t)dω (u,v) =
dω(u,v) = u ω(v)− v ω(u)− ω([u,v]), thus
ω([u,v]) = 0.
Practice problem*: If F ∧ ω = 0, where F is
n-form and ω is 1-form, show that ∃ θ : F =
θ ∧ ω where θ is a suitable (n− 1)-form.
22
Lie derivative
Need derivative of vectors in direction v but...
Cannot add tangent vectors at different points!
Solution: transport vectors by the flow of v
v
a pb
p′a′
b′
Lva = limδt→0
T−1v
[
a(p′)]− a(p)
δt
LvX measures how much X differs from TvX(i.e. X transported by the flow of v)
Properties that follow from this picture:
* Lvf = v f on scalar functions f* Lva = [v, a] on vectors a
(Note: connecting vector is exactly transported.)
* Lv (X + λY ) = LvX + λLvY + (Lvλ)Y* Lv (ω a) = (Lvω) a+ω Lva on 1-forms ωNote: LvX depends on the flow of v, not only
on the value of v at a point!
Lie derivative of tensors: Lv(X⊗Y ) = (LvX)⊗Y +X ⊗LvY ; same for wedge product ω ∧ ψ
23
Lecture 4, 2007-10-26. Calculations with
Lie derivatives
LvX depends on derivatives of v as well as on
derivatives of X:
Lλva = λLva− (a λ)v 6= λLva
Lie derivative of 1-form:
(Lvω) u = v (ω u)− ω [v,u]
Compute L∂x (dy):(
L∂xdy)
v = L∂x (v y) −dy [∂x,v] = v (dy ∂x) = 0.
Lie derivative of a bilinear form: (LvA)(x,y) =
v [A(x,y)]−A([v,x] ,y)− A(x, [v,y])
Cartan homotopy formula: Lxω = ιxdω+ dιxω
Practice problems: 1. Compute L∂xdx; Lx∂x (x∂y);
Lx∂y (xydx); Lx∂y+xy∂x
(
2z2dz)
; Lx∂x (xdx ∧ dy).
2. Let A(a,b) ≡ (a x) (b y) z−(a y) (b z)x;compute Lx∂yA. (A is a 2nd rank tensor)
3. Let T (a)b = (b x) ya− (a y)xb; compute
L∂xT . (T is a 3rd rank tensor)
4. Show that Lvdλ = dLvλ for scalar λ
5. Show that Lλvω = λLvω+ (ω v)dλ
24
Important property of Lie derivative
Statement: If L∂1T = 0 for a tensor T =∑
α,β,... Tαβ...dxα⊗dxβ⊗... then the components
Tαβ... do not depend on x1
Proof: use Leibnitz rule and L∂1dxα = 0 and
L∂1∂α = 0
Interpretation: if LvT = 0 then components of
T are constant along the flow of v
Which components? – In a basis of connecting
vectors for v
The property LvT = 0 does not depend on
other basis vectors, only on v!
Note: x-component of a tensor depends on the
function x(p) and on the vector field ∂x. But
∂x depends not only on the function x(p), but
also on the choice of every other coordinate.
25
Metric
Need to compute distances along paths
Visualize using the embedding picture:
z
a
y x
b
Distance between a and b along M is
D(a,b) ≈√
(ax − bx)2 + (ay − by)2 + (az − bz)2
For infinitesimal lengths along x(τ), y(τ), z(τ):
δL = δτ
√
(
dx
dτ
)2
+
(
dy
dτ
)2
+
(
dz
dτ
)2
Use intrinsic coordinates (u, v): δL =
= δτ√
(x,uu+ x,vv)2 + ... = δτ
√
Au2 + 2Buv+ Cv2
Need to know only A,B,C to compute length!
The length in larger space gives the induced
metric. Intrinsically, metric is a quadratic form
describing the length of infinitesimal curve seg-
ments γδτ : δL ≈√
g(γδτ, γδτ)
Note: in GR the metric has signature +−−−26
Applications of metric structure
Example: if g(γ, γ) = Au2 + 2Buv + Cv2 incoords. (u, v), theng = Adu⊗du+B (du⊗ dv+ dv ⊗ du)+Cdv⊗dvShorthand notation: g = Adu2 + 2B dudv +C dv2
Practice problem: compute the metric forthe surface z = x2 + y2 in coords. x, y
* Metric maps vectors into 1-forms and back
v 7→ gv : (gv) x ≡ g(v,x)ω 7→ g−1ω : g(
(
g−1ω)
,x) ≡ ω x
* Orthonormal frame: ea; g(ea, eb) = ηab* Dual basis of 1-forms θa; θa eb = δab
Levi-Civita tensor ε ≡ θ0 ∧ θ1 ∧ θ2 ∧ θ3– independent of basis, up to orientationǫ(a,b, c,d) = the 4-volume of a parallelepiped
Components in a coordinate basis: gµν ≡ g(∂µ, ∂ν)All 4-forms are proportional to ε. Formula:
ε = |det gµν|1/2 dx0 ∧ dx1 ∧ dx2 ∧ dx3
Proof: volume spanned by ∂µ equals ε(∂0, ...)and also equals |det g(∂µ, ∂ν)|1/2(introduce transition matrix ∂α = M
βαeβ, then
the volume spanned by ∂µ equals det M)
27
Affine connection (covariant derivative)
Want to have directional derivative of vectors:
∇vx that depends only on the value v|p (Lie
derivative Lvx contains derivatives of v at p)
Problem: need to subtract vectors in different
tangent spaces; need a transport operator T :
∇vx = limσ→0
Tp′→p x|p′ − x|pσ
Motivation: consider surface embedded in flat
space; “roll” the tangent plane along the vector
v from p′ to p
Tx = x + λn
Tp′→p
n
x|ppp′
x|p′
Note: n-dimensional “rolling” is 2-dim. rota-
tion in the v,n plane – to 1st order this is
equivalent to adding a multiple of n to vectors
in larger space; to 2nd order – a multiple of v
28
Properties of the induced connection
Denote by g and by ∂ the metric and the aff. con-
nection in large space, and by g the induced
metric on the surface
Since Tx = x + λn ⇒ ∇vx = ∂vx + λ(v,x)n
Since ∇vx ∈ TpM, we have ∇vx = Proj(n)∂vx
(orthogonal projection w.r.t. g onto TpM)
Proj(n)x ≡ x− ng(n,x)
g(n,n)
Need a formula for ∇vx without embedding
Note properties: ∂uv − ∂vu = [u,v]; ∂ug = 0
Define Tors (a,b) ≡ ∇ab − ∇ba − [a,b]; ver-
ify that Tors (a,b) is dependent of derivatives;
then Tors (a,b) = Proj(n)˜Tors (a,b) = 0
Verify ∇vg = 0 for tangent vectors v:
(∇vg) (a,b) = v g(a,b)− g(∇va,b)− g(a,∇vb)
= v g(a,b)− g(∂va,b)− g(a, ∂vb) = 0
∇vx has the properties of 1st order derivative.
Verify: ∇vx is independent of derivatives of v:
∇λvx = Proj(n)∂λvx = λProj(n)∂λvx = λ∇vx
→ Express ∇vx through the intrinsic metric g!
29
Levi-Cività (LC) connection
Properties: the connection is torsion-free:
[x,y] = ∇xy −∇yx
Compatible with the metric:
∇xg(a,b) = g(∇xa,b) + g(a,∇xb)
Will now derive an explicit formula for ∇yx.
Define derivative tensor B(x)(a,b) ≡ g(∇ax,b).
Consider first the antisymmetric part of B(x):
B(x)(a,b)−B(x)(b, a) = g(∇ax,b)−g(∇bx, a) =
∇ag(x,b) − ∇bg(x,b) − g(x, [a,b]) = (dgx) (a,b).
Symmetric part: B(x)(a,b)+B(x)(b, a) = g(∇xa−Lxa,b)+g(a,∇xb−Lxb) = ∇xg(a,b)−g(Lxa,b)−g(a,Lxb) = (Lxg) (a,b).
Hence
g(∇ax,b) =1
2(d (gx) + Lxg) (a,b)
Fully explicit formula (Koszul): 2g(∇ax,b) =
a g(x,b)−b g(x,a)− g(x, [a,b])+x g(a,b)−g([x, a] ,b)− g(a, [x,b])
30
Geodesic vector fields and curves
A vector field v is geodesic if ∇vv = 0.
A curve γ(τ) is geodesic with affine parameter
τ if ∇vv = 0, where v = γ is the velocity.
Can change τ → Aτ +B if A,B = const
Normalization property: ∇vg(v,v) = 0, so we
can choose τ such that g(v,v) = 0,±1
Null geodesic: g(n,n) = 0. If observed in a ref-
erence frame with 4-velocity u, the frequency
of the photon is ν = g(u,n).
31
Killing vectors
Motivation: geometry does not change in the
direction of a symmetry
Example: g = dt2 − e2N(t)(
dx2 + dy2 + dz2)
does not change in directions x, y, zGeneralize: components of g are constant along
the flow of v
A vector field v is a Killing vector if Lvg = 0.
Example: g = f(r)dt2 − g(r)dr2 − r2dΩ2 has
Killing vectors ∂t, ∂φ (and others!)
Formula: (Lxg) (a,b) = g(∇ax,b)+ g(a,∇bx)
⇒ Killing equation:
k : g(∇ak,b) + g(a,∇bk) = 0 for ∀a,bNote: for a Killing vector k, the symmetric part
of B(k) vanishes ⇒ B(k) = 12dgk is a 2-form
Conformal Killing vector: Lkg = 2λg for some
scalar λ. ← Geometry is conformally rescaled
under the flow of k.
Example: FRW universe, g = dt2−a2(t)γ. Ex-
pect conformal Killing vector k = f(t)∂t with
some f(t). Calculation: assume a = at∂t+ax∂xsuch that ∂tat,x = 0, then (Lkg) (a,b) =
−f∂t(
a2)
γ(a,b)+2f ′atbt. Thus we must have
2f ′ = 2a′a f , hence f(t) = a(t) · const.
32
Redshift factor and gravitational potential
Metric g is stationary if there exists a timelike
Killing vector k : g(k,k) > 0, Lkg = 0.
Note: metric g is static if g(k,k) > 0, Lkg = 0,
and k is everywhere orthogonal to a surface.
Stationary observers move with 4-velocity k√g(k,k)
Consider a photon along null geodesic γ(τ); lo-
cally observed frequency (energy) is ν = g(k,γ)√g(k,k)
.
Statement: g(k, γ) = const along geodesic γ.
Proof: Let v = γ, then ∇vg(k,v) = g(∇vk,v) =12 (dgk + Lkg) (v,v) = 0.
Hence ν = ν0 [g(k,k)]−1/2. Define redshift z =√
g(k,k).
Photons are redshifted by factor z
33
Lecture 5, 2007-10-31. Geodesics extrem-
ize proper length
Statement: Geodesic curves γ(τ) extremize
proper length∫
√
g(γ, γ)dτ
Proof (for timelike curves): Deform γ(τ) by
transverse vector field t; obtain connecting field
v; compute: Lt∫
√
g(v,v)dτ =∫ g(∇tv,v)√
g(v,v)dτ =
∫ g(∇vt,v)√g(v,v)
dτ =∫ ∇v (...) dτ−∫ g(t,∇v
v√g(v,v)
)dτ .
Hence, ∇vv√g(v,v)
= 0. Reparameterize τ →∫ τ√
g(v,v)dτ to achieve g(γ, γ) = 1.
Note: for spacelike curves need√
−g(v,v); for
null curves need∫
Ng(v,v)dτ .
Universal variational principle:
δ∫
[
g(γ, γ)
N+KN
]
dτ = 0 where N ≡ N(τ)
EOM: g(γ, γ) = KN2; ∇v
(
N−1v)
= 0
For null geodesics: choose K = 0 (but N 6= 0!)
34
Killing vectors II
Compute:
(Lλkg)(a, a) = 2g(∇aλk, a) = 2 (a λ) g(k, a)+(Lkg)(a, a), so Lλkg = λLkg+dλ⊗gk+gk⊗dλor (Lλkg)αβ = λ (Lkg)αβ + λ,αk,β + λ,βk,α
Lk (λg) = (k λ) g+ λLkg
Example: FRW metric g = dt2 − a2dx2, let us
determine conformal Killing vector k = f∂t:Lf∂tg = fL∂t
(
dt2 − a2dx2)
+df⊗g∂t+g∂t⊗df =
−2faa dx2 + 2fdt⊗ dtIf f = a then Lf∂tg = 2ag.
For timelike conformal Killing vectors: redshift
z =√
g(k,k) still meaningful since ∇γg(γ,k) =12 (dgk + Lkg) (γ, γ) = 0 for null geodesics γ
Note: (Lkg) (k,k) = 2g(∇kk,k) = 2λg(k,k),
so λ =g(∇kk,k)g(k,k)
= ∇k ln z
Also compute g(∇kk,x) = 2λg(k,x)−g(∇xk,k) =
2λg(k,x)− 12∇xg(k,k), thus
∇kk = −zg−1dz+ 2λk
Practice problem: Can one always choose a
vector field v such that all covariant derivatives
vanish at a point? (∀a : ∇av = 0)
35
Redshift and gravitational potential
Consider a stationary spacetime with a Killing
vector k; redshift z =√
g(k,k)
Redshift is time-independent: ∇kz = 0 (since
∇k ln z = λ = 0 for a Killing vector)
Is k geodesic? No: ∇kk = −zg−1dz 6= 0 unless
z = const.
Stationary observers move along u = 1zk.
Acceleration of stationary observers, a = ∇uu =
−g−1dΦ where Φ is the gravitational potential.
Proof: with u = z−1k we have ∇uu = 1z∇k
kz =
1z2∇kk = −1
z g−1dz = −g−1dΦ where Φ ≡ ln z.
Example 1: Schwarzschild spacetime, k = ∂t,
g =
(
1− 2M
r
)
dt2 −(
1− 2M
r
)−1
dr2 − r2dΩ2
z =(
1− 2Mr
)1/2; Φ = 1
2 ln(
1− 2Mr
)
≈ −Mr
Example 2: FRW spacetime, k = a(t)∂t,
g = dt2 − a2dx2, Lkg = 2λg
z = a(t), λ = a(t); u = ∂t; but ∇uu = 0:
2g(∇uu,x) = (dgu + Lug) (u,x) = 0
36
Force at a distance. Surface gravity
Consider stationary spacetime; Lkg = 0
Energy transfer and conservation: zE = zg(u, γ)
is conserved in local processes
Stationary observers feel acceleration in trans-
verse direction, a = −g−1d ln z
Transferring rope tension: 1) If rope moves by
∆r and transfers energy ∆E, then F∆r = ∆E.
2) If energy ∆E is sent back by a particle, then
z∆E = const. Hence zF = const
Force needed locally to held a unit mass sta-
tionary: F0 = g−1d ln z
Force at infinity (z = 1): F∞ = zF0 = g−1dz;
|F∞| =√
g−1 (dz,dz)
Example: Schwarzschild spacetime; k = ∂t;
z =√
1− 2Mr ; dz = M
r2dr
√
1−2Mr
; g−1dz = Mr2
√
1− 2Mr ∂r;
|F∞| = Mr2
. This is still finite at horizon, r =
2M , |F∞| = 14M ≡ κ - surface gravity
Note: at horizon k is null so the formula for
∇kk = −zg−1dz does not apply.
37
Change of LC connection under conformal
transformation
Consider g = e2Ωg. What is the relationship
between ∇ and ∇?
Compute: 2g(∇•a, •) = dˆga + Lag = de2Ωga +
Lae2Ωg = e2Ω(dga+Lag)+2(dΩ)∧ˆga+2(a Ω) g,
hence g(∇xa,y) = g(∇xa,y) + (x Ω) g(a,y)−(y Ω) g(a,x) + (a Ω) g(x,y) or, without y,
∇xa = ∇xa+(x Ω) a+(a Ω)x−g(a,x)g−1 (dΩ)
Practice exercises: 1. If k is a Killing vector
for g, show that k is a conformal Killing vector
for e2Ωg, for any Ω.
2. If γ(τ) is a null geodesic for metric g, show
that a reparameterization τ → τ is possible
such that γ(τ) is a null geodesic for g = e2Ωg.
3. Let k be a Killing vector for g and γ(τ) be
a geodesic. Show that γ is transported by the
flow of k to another geodesic. (If [v,k] = 0
then Lk∇vv = 0.)
4. In a stationary spacetime, n photons are
sent each second from point a to point b. How
many photons per second arrive at b? (Express
the answer through the redshift function z.)
38
Lecture 6, 2007-11-02. Energy extraction
from horizons
Consider a stationary spacetime, Lkg = 0, with
a Killing horizon g(k,k) = 0
Lower a mass m very slowly (adiabatically) on
a strong rope from an initial point p0 to point
p1 near horizon. How much energy can be
extracted at point p0?
Force needed to hold m at p1 is F1 = mg−1 d ln z|p1
Due to redshift, force at p0 is F0 = −z1z0F1 =
−mz0 g−1 dz|p1
To describe movement, introduce vector field
s = −g−1dz√
g−1(dz,dz). After a shift ∆L along s,
energy gained at p0 is ∆E = g(F0, s∆L) =
−(
mz0
s∆L)
z. Shifting slowly from p0 to p1
yields ∆E = − ∫ p1p0mz0
dz = z0−z1z0
m
Move to horizon, z1 ≈ 0 so ∆E ≈ m, extracted
the entire energy (mc2)!
39
Energy extraction: finite rope strength
If rope has finite max. tension Fmax, then move
only up to zmin 6= 0: zminFmax ≈ z0mκ, hence
∆Emax ≈(
1− mκFmax
)
m < m.
Optimum choice of mass: mopt = 12Fmaxκ , then
extract only 50% of mc2
Note: the formula for ∆E works only for m .
mopt because of the near-horizon approxima-
tion g−1(dz,dz) ≈ κ2
Note: ∆E is independent of z0, but energy is
redshifted, so more “absolute” energy is gained
if p0 is further from the horizon
Practice problem: For the metric g = f(r)dt2−dr2
f(r)− r2
(
dθ2 + sin2 θ dφ2)
, assume f(r0) = 0,
f ′(r0) 6= 0, so that r = r0 is a horizon. Com-
pute the surface gravity κ for that horizon.
40
Expansion (divergence) of a vector field
The rate of change of volume under flow of t:
Volume spanned by t, a,b, c is ε(t, a,b, c)
Choose a,b, c connecting for t:
tε(t, a,b, c) = Ltε(t, a,b, c) = (Ltε)(t, a,b, c)Due to antisymmetry, must have Ltε = fε
where f ≡ div t is the divergence or expan-
sion of the vector field t
A better formula for computing div t:
Choose orthonormal frame ea, g(ea, eb) =
ηab; then
div t =∑
aηaag(ea,∇eat)
=∑
aηaaB(t)(ea, ea) ≡ TrxB(t)(x,x)
Proof: Consider the dual basis θa and com-
pute Ltε ≡ Lt
(
θ0 ∧ θ1 ∧ ...)
=(
Ltθ0)
∧θ1∧ ...+... Now Ltθ
0 =∑
a θaιea
(
Ltθ0)
, but only the
term with θ0 survives. So Ltε = (ιe0Ltθ0 +
...)ε. Simplify one term:(
Ltθ0)
e0 = −θ0 [t, e0] = −η00g(e0, [t, e0]) = η00g(e0,∇e0t); we
used g(e0,∇te0) = 12∇tg(e0, e0) = 0.
41
Calculations without indices
From index to index-free notation:Introduce appropriate vector & tensor objectsContract each free index with arbitrary vectorSubtract terms coming from nested derivativesTraces: lower both indices, introduce gαβ, thenreplace gαβAαβγ...x
γ... by TraA(a, a,x, ...)Substitute a⊗ a→ g−1 into A(a⊗ a,x, ...)Tricks to simplify calculations:assume zero 1st derivatives of dummy vectorsuse properties of d, L, ∇, Tr where it helps
Example: uαuβuγwγ;αβ = Xλαλβv
αvβ
Introduce vectors u,v,w and a transformation-valued function X : X(x,y)z→ Xµ
αβγxαyβzγ
Rewrite Xλαλβ = gκλXκαλβ; introduce tensor
X(a,x,y, z) := g(a, X(x,y)z)Result: g(u,∇u∇uw−∇∇uuw) = Trag(a, X(v, a)v)
Properties of the trace operation:Linear: Tra∇xF = ∇xTraF ; same for Lx, ιx, dTrag(a, a) = 4; Trag(a,x)F(a,y, ...) = F(x,y, ...);Trag(a,∇ab) = divb; Tra∇a∇aX = X.
Example: Px = x− 2ng(n,x); Tr P =?Pαα = gαβPαβ; Pαβx
αyβ = g(x, Py); substitutexαyβ → gαβ and introduce dummy vector a:Trag(a, Pa) = Trag(a, a− 2ng(n,a))= 4− 2Trag(a,n)g(a,n) = 4− 2g(n,n).
42
Curvature tensor
Define R(a,b)c = ∇a∇bc−∇b∇ac−∇[a,b]c
Verify that R(a,b)c is independent of deriva-
tives of a,b, c
Thus: R is a transformation-valued 2-form.
“Covariant” tensor: R(a,b, c,d) = g(R(a,b)c,d)
Properties: exchange a ↔ b, c ↔ d, ab ↔ cd,
Bianchi identities (derive from Jacobi identity)
Example: Show that R(a,b,x,x) = 0, then set
x = c + d and derive c↔ d antisymmetry.
Start with ∇ab−∇ba− [a,b] g(x,x) = 0;
assume ∇a,bx = [a,b] = 0; get 0 = [∇a,∇b] g(x,x)
= 2g([∇a,∇b]x,x) = 2R(a,b,x,x).
Ricci tensor: Ric (a,b) = TrxR(a,x,b,x) =
Ric (b, a)
Ricci scalar: R = TrxRic(x,x)
Einstein equation: Ric−12Rg = −8πGT , where
T is energy-momentum tensor of matter,
T =2√−g
δ
δg−1Smatter
43
Lecture 7, 2007-11-07. FRW spacetime
Metric g = dt2 − a2(t)(
dx2 + dy2 + dz2)
= dt⊗ dt− a2(t)h; h is the flat 3-dim. metric
Need the Einstein tensor, Ric− 12Rg
Compute covariant derivatives:
g(∇u∂t,v) = 12
(
L∂tg)
(u,v) = −aah(u,v), so
∇∂t∂t = 0 and ∇∂x∂t = aa∂x = ∇∂t∂x.
g∂x = −a2dx; so g(∇u∂x,v) = 12 (dg∂x)(u,v) =
(−aadt ∧ dx) (u,v); so g(∇∂y∂x, •) = 0 and
g(∇∂x∂x, ∂t) = aa, so ∇∂x∂x = aa∂t. (∂y, ∂z...)
Need Ricv ≡ R(∂t,v)∂t − 1a2
∑
s=∂x,∂y,∂z
R(s,v)s.
For v = ∂t need R(s, ∂t)s = ∇s∇∂ts−∇∂t∇ss =
a2∂t − (aa)· ∂t = −aa∂t, so Ric∂t = 3aa∂tFor v = ∂x we have Ric∂x = R(∂t, ∂x)∂t −1a2R(∂y, ∂x)∂y− 1
a2R (∂z, ∂x) ∂z = 1
a2(aa+2a2)∂x.
Therefore Ricv =
(
aa + 2a2
a2
)
v+2
(
aa −
a2
a2
)
g(v, ∂t)∂t
Ric(u,v) =
(
aa + 2a2
a2
)
g(u,v)+2
(
aa −
a2
a2
)
g(u, ∂t)g(v, ∂t
Ricci scalar: R = Trvg(Ricv,v) = 6
(
aa + a2
a2
)
Einstein tensor:
Ric− 1
2Rg = −
(
2a
a+a2
a2
)
g+2
(
a
a− a
2
a2
)
dt⊗dt
44
Scalar field cosmology
Lagrangian for scalar field:
S[φ] =∫
d4x√−g
(
1
2g−1 (dφ,dφ)− V (φ)
)
Energy-momentum tensor:
T ≡ 2√−gδS
δg−1= dφ⊗dφ−
[
1
2g−1 (dφ,dφ)− V (φ)
]
g
Cosmological solution: φ = φ(t); dφ = φdt;
T = φ2dt⊗ dt−(
φ2
2− V
)
g
Einstein equations for FRW spacetime:(
2a
a+a2
a2
)
= −8πG
(
φ2
2− V
)
2
(
a
a− a
2
a2
)
= −8πGφ2
Hence: Friedmann equation
(
a
a
)2
=8πG
3
(
φ2
2+ V
)
≡ 8πG
3ρ(φ, φ)
EOM for φ: φ+ 3Hφ+ V ′(φ) = 0, H ≡ aa
Expanding universe: a > 0, then H =√
8πG3 ρ(φ, φ),
then assume φ ≡ v > 0, so v = v(φ), getdv(φ)dφ = f(φ, v) ← autonomous dynamics
45
Attractors and the slow-roll approximation
Consider the auxiliary variable u(φ) defined bydφdt ≡ −u(φ)
d√V
dφ ≡ −u(φ)F(φ). Then we have
φ =(
u′F + uF ′)
uF ; the EOM for φ becomes
2 + uu′F√V
+ u2 F′
√V
= u
√
√
√
√24πG
(
1 +1
2u2F
2
V
)
If V (φ) satisfies the slow-roll conditions,
Fu√V≪ 1,
F ′u2√V≪ 1,⇒ V ′√
24πG≪ V,
V ′′
24πG≪ V,
the EOM becomes u ≈ u0 ≡ 1√6πG
= const
— the slow-roll approximation is φ ≈ −(√V)′
√6πG
Note: need to verify u20F
2 ≪ V , u20F′ ≪√V .
Generically, there exists a single solution with
u0(φ) → const at φ → ∞. This u0(φ) is ap-
proximated by the slow-roll formula. This is
an attractor under the slow-roll assumptions:
u(φ) = u0(φ) + δu,
d
dφδu = C(u, φ)δu, C > 0
Further approximations: use perturbative ex-
pansion, u = u0(φ)+δu(φ); or substitute u0(φ)
into the l.h.s. of the equation, u(φ) =F(u,u′,φ)√
24πG
46
Lecture 8, 2007-11-09. Conformal trans-
formation of curvature
How do Ric and R change under g → g = e2Ωg?
LC connection: ∇xy = ∇xy + Γ(x,y) where
Γ(x,y) ≡ g(w,y)x + g(w,x)y − g(x,y)w,
where w ≡ g−1dΩ, g(w,x) ≡ x Ω
Ric(a, c) = Trbg(ˆR(a,b)c,b) = Trbg(
ˆR(a,b)c,b)
For calculation of ˆR(a,b)c, assume that ∇a,∇b
of a,b, c all vanish: ˆR(a,b)c = ∇a∇bc−∇b∇ac =
∇a (∇bc + Γ(b, c))+Γ(a,∇bc+Γ(b, c)−[...b↔ a...]
= R(a,b)c+∇aΓ(b, c)+Γ(a,Γ(b, c))−[...b↔ a...]
Need to express ∇xw; note that g(∇•w, •) =12 (dgw + Lwg) but dgw = ddΩ = 0; define a
symmetric bilinear form HΩ(x,y) ≡ g(∇xw,y) ≡ιy∇xdΩ. (In the index notation, this is Ω;αβ)
∇aΓ(b, c) = ∇a (g(w,b)c + g(w, c)b− g(b, c)w)
= HΩ(a,b)c+HΩ(a, c)b−g(b, c)∇aw; antisym-
metrize [b↔ a] and take g(...,b): HΩ(a, c)g(b,b)−HΩ(b, c)g(a,b)−g(b, c)HΩ(a,b)+g(a, c)HΩ(b,b).
Now take trace:
Trb(1st term) = (N − 2)HΩ(a, c) + g(a, c)Ω
47
Conformal transformation of Ricci tensor
2nd term: let’s write 〈xy〉 instead of g(x,y),
then Γ(a,Γ(b, c)) = 〈wa〉Γ(b, c) + 〈wΓ(b, c)〉 a−〈aΓ(b, c)〉w = 〈wa〉 (〈wc〉b + 〈wb〉 c− 〈bc〉w)+
(〈wb〉 〈wc〉+〈wc〉 〈wb〉−〈ww〉 〈bc〉)a−(〈ab〉 〈wc〉+〈ac〉 〈wb〉−〈aw〉 〈bc〉)w. Antisymmetrizing [b↔ a],
only terms 5,6,8 remain: Γ(a,Γ(b, c))−Γ(b,Γ(a, c))
= 〈wc〉 (〈wb〉 a−〈wa〉b)+〈ww〉 (−〈bc〉 a+〈ac〉b)+
(−〈ac〉 〈wb〉+〈bc〉 〈aw〉)w. Compute g(b, ...) =
〈wc〉 (〈wb〉 〈ab〉−〈wa〉 〈bb〉)+〈ww〉 (−〈bc〉 〈ab〉+〈ac〉 〈bb〉)+(−〈ac〉 〈wb〉+〈bc〉 〈aw〉) 〈wb〉. Trace:
Trb(2nd term) = (N − 2) (〈ac〉 〈ww〉 − 〈aw〉 〈cw〉)Finally, Ric = Ric + (N − 2) (HΩ + 〈ww〉 g −dΩ⊗ dΩ) + (Ω) g
Ricci scalar: R = TrxRic(x,x) = e−2ΩTrxRic(x,x)⇒
R = e−2Ω [R+ 2(N − 1)Ω + (N − 1) (N − 2) 〈ww〉]Note that 〈ww〉 ≡ g(w,w) = g−1(dΩ,dΩ).
Einstein tensor: Ric−12Rg = Ric−1
2Rg+(N − 2) [HΩ−dΩ⊗ dΩ−
(
Ω + 12 (N − 3) 〈ww〉
)
g ]
Einstein tensor does not change in 2D!
A conformal transformation can make R = 0
in 2D - any 2D space is conformally flat.
48
Equations of motion for f(R) gravity
After introducing Lagrange multiplier λ:
S[g] + Sm =∫
d4x√
−g(
λR+ U(λ)
16πG+ Lm
)
,
U(λ) ≡ f(r)− rf ′(r)∣
∣
∣
r=r(λ), λ = 1 + f ′(r)
Now conformal transformation: g ≡ e2Ωg; note:√−g = e4Ω√−g and R = e−2Ω(
R+ 6Ω + 6 |dΩ|2)
Note: “” is with respect to g now!
Choose Ω such that λR√−g equals R
√−g plus
some terms:√−g
(
λR+U(λ)16πG + Lm
)
= e4Ω√−g(
116πG
[
λe−2Ω(
R+ 6Ω + 6 |dΩ|2)
+ U(λ)]
+ Lm
)
.
Set λ = e−2Ω and omit Ω (total divergence):
S =∫
d4x√−g
(
R+ 6 |dΩ|2 + e4ΩU(e−2Ω)
16πG+ e4ΩLm
)
Standard action for Einstein gravity (but with
weird matter coupling)← “Einstein frame”, i.e. the
variables g,Ω with 2nd order EOM
The variables g are “Jordan frame” (have 4th
order EOM)
49
Lecture 9, 2007-11-14. Cosmology with
f(R) gravity
Define canonical scalar field: φ ≡ Ω√
34πG ≡
Ωs
Assume f(R) = 12αR
2; neglect Lm; then λ =
1+αR, U(λ) = −(λ−1)2
2α , so (in Einstein frame)
S =∫
d4x√−g
(
R
16πG+
1
2|dφ|2 − V (φ)
)
,
V (φ) =1
32πGα(exp [2sφ]− 1)2 =
(
e2sφ − 1)2
24s2α.
Trick: Write EOM for φ through x(φ) ≡ EkinEpot
,
so dφdt ≡ −
√
2x(φ)V (φ) (assumed φ < 0, α > 0!)
dx
dφ+V ′
V(1 + x)− 6s
√
x (1 + x) = 0
Set V (φ) ≈ V0e2nsφ (we will have n = 2), then
x′ = 2s√x+ 1
(
3√x− n
√x+ 1
)
. Assume x →x0 at φ → ∞: x0 = n2
9−n2; φ ≈ −√
2x0V (φ); so
φ(t) ≈ − 1ns ln
(t−t0)n2√
12α(
9−n2)
Stable attractor : x = x0 + δx, then δx′ ≈ Cδx,C = s
n
(
9− n2)
> 0 as long as n < 3.
50
Do we have inflation?
Note: slow-roll conditions are not valid,
V ′
V
1√48πG
=2
36≪ 1!
Compute
H(t) ≈ s√
2 (1 + x0)V (φ) = 3n−2 (t− t0)−1
a(t) = exp
∫
H(t)dt = (t− t0)3n−2
Conformal transformation:
g = e2Ωg =const
(t− t0)2/n(
dt2 − a2(t)dx2)
Define physical time: want g = dt2− a2(t)dx2;
so define t = (t− t0)1−1/n, then
a(t) = (t− t0)3n−2−n−1
= t3−n
n(n−1)
With n = 2 we have radiation-dominated era,
a ∝ t1/2
Note: inflation will be with φ < 0 and φ > 0!
(Slow roll approximation is then valid)
Practice problem: Determine (approximately)
the potential V (φ) for f(R) gravity with f(R) =
R− α2R
2+βR−n, n > 0, in the regimes of large
R and small R
51
Conditions for viable f(R) theory
If f ′(R) < 0 then wrong sign of the action in
Einstein frame:∫ −R
16πG, so graviton is a ghost
In Einstein frame, have scalar field with
V (φ) =1
16πG
rf ′ − ff ′2
∣
∣
∣
∣
∣
f ′+1=exp(−2sφ)
If f ′′(R) < 0 at Solar system scales then
M2φ
M2Pl
≡ 16πGd2V (φ)
dφ2
=1
3f ′′+Rf ′ − 4 (R+ f)
(1 + f ′)2≈ 1
3f ′′< 0,
which gives tachyonic instability (small-scale)
At high R we must have f(R)≪ R at observed
scales (MφL≫ 1); f ′′ → 0
Hence f(R) must be negative, monotonically
growing in R
Practice exercise: Derive the formula for M2φ .
52
Lecture 10, 2007-11-16. Matter-dominated
era
The full action in Einstein frame is
S =
∫
d4x√−g
[
R
16πG+|dφ|2
2− V (φ) + Lme
4sφ
]
where s ≡√
4πG3 , V = 1
24αs2
(
e2sφ − 1)2
Mock-up Lagrangian for matter:
Lm(ψ, ψ) = pm, ρm = ψLm,ψ − Lm,
and we assume pm = wmρm (nondynamical
matter with wm = const) in Jordan frame
Also assume homogeneous φ = φ(t), ψ = ψ(t)
Evolution equations for φ, ψ:
d
dt
[
φ√−g
]
= −√−gV,φ + 4s√−ge4sφpm
d
dt
[
e4sφLm,ψ
√−g]
= −√−ge4sφLm,ψ
With√−g = a3(t) we get
ψLm,ψψ + ψLm,ψψ + 3HLm,ψ + 4sφLm,ψ − Lm,ψ = 0;
d
dtρm =
(
ψLm,ψψ + ψLm,ψψ − Lm,ψ
)
ψ
53
Finally, using pm = wmρm, we get
d
dtρm = −
(
3H + 4sφ)
ρm (1 + wm) ;
φ = −3Hφ− V,φ + 4se4sφwmρm
Friedmann equation (Einstein frame):
a2
a2≡ H2(t) =
8πG
3
(
ρφ + e4sφρm)
= 2s2[
φ2
2+ V (φ) + e4sφρm
]
Obtained a closed system for φ, ρm, H
How to analyze attractor behavior?
1. Use energy conservation equation
d
dtρφ = −3H
(
ρφ + pφ)
+ 4se4sφφwmpm
2. Use the energy quotient
q(t) ≡ e4sφρm
e4sφρm + ρφ, 0 ≤ q ≤ 1
3. Assume that q(t) → const and wφ → const
on the attractor
4. Derive closed equations for q(t), φ(t)
dq
dt= −3Hq
[
(
wm − wφ)
(1− q) +4s
3wm
φ
H
]
(Complete analysis of dynamics is in Amendola
et al., gr-qc/0612180 — using Jordan frame)
54
Attractor for dust-dominated era (wm = 0)
Integrate the equations for ρm, ρφ:
ρm = ρ(0)m a−3(1+wm)e−4s(1+wmφ)
ρφ = ρ(0)φ a−3(1+wφ)
where we assumed pφ = wφρφ with wφ ≈ const
Now assume wm = 0, derive equation for q(t):
q = 3Hq (1− q)wφFixed point if wφ → 0 or q → 1 (not if q → 0)
If q → 1 then need wφ > 0 so that q > 0.
Since wφ ≥ 0 then at late times ρm(t)≫ ρφ(t)
or at most ρm(t) ≈ C0ρφ(t), hence
H = s√
2
√
ρφ + ρme4sφ ≈ C1a
−3/2
and then a(t) ∝ t2/3, H(t) ≈ 23t−1 in any case.
So ρφ = ρ(0)φ t−2(1+wφ) and φ = C2t
−1−wφ.If wφ > 0 then φ → const at late times, but
this contradicts EOM for φ with V,φ 6= 0 (or
else we must require a fine-tuned value of φ)
55
Dust tracker solution (wφ = 0)
Consider wφ → 0, q → q0 6= 1 (the value q0 is
determined by the EOM for φ) then
H2 =
(
2
3t−1
)2
= 2s2(
ρφ + e4sφρm)
,
φ(t) =
√
2 (1− q0)3s
ln t, esφ = t
√2(1−q0)
3 ≡ tν, ν > 0
Transforming back to Jordan frame:
g = t2ν(
dt2 − a2dx2)
≡ dτ2 − a2(τ)dx2
τ = t1+ν, a(τ) = τ
(
23+ν
)
/(1+ν) 6= τ2/3
To have a dust-dominated era with a(τ) ∝ τ2/3we need ν ≪ 1
(This is a constraint on V (φ) after q0 is deter-
mined)
(See Amendola et al. for precise conditions)
56
Lecture 11, 2007-11-21. Introduction to
perturbation theory for gravity
What is the metric resulting from a small per-
turbation in the matter source?
• Compute Ric(x,y) for g = g+ h to 1st order
in h (treating h as a small perturbation)
Change in the LC connection is a tensor,
Γ(x,y) ≡ ∇xy −∇xy = Γ(y,x)
g(Γ(x,y), z) = g(∇xy, z)−g(∇xy, z)−h(∇xy, z) =12
(
dhy + Lyh)
(x, z)−h(∇xy, z) = 12 (∇xh) (y, z)−
12 (∇zh) (x,y) + 1
2
(∇yh)
(x, z).
Since we are computing to 1st order, we may
set g(Γ, ·) ≈ g(Γ, ·) and disregard ΓΓ.
Assuming ∇x,y x,y, z = 0, write the Riemann
tensor ˆR for g through R for g:
ˆR(x,y)z = ∇x∇yz− ∇y∇xz ≈ R(x,y)z
+∇xΓ(y, z)−∇yΓ(x, z)
57
Compute the Ricci tensor: to 1st order,
Ric(x, z) = TryR(x,y, z,y) ≈ Tryg( ˆR(x,y)z,y)
Covariant Riemann tensor: to first order,
R(x,y, z,y) ≈ R(x,y, z,y)+g(∇xΓ(y, z)−∇yΓ(x, z),y)
= R(x,y, z,y)+∇xg(Γ(y, z),y)−∇yg(Γ(x, z),y)
Change in the Ricci tensor: Ric(x, z) ≈ Ric(x, z)+
∇xTryg(Γ(y, z),y)−Try∇yg(Γ(x, z),y)= Ric(x, z)+12∇xTry(∇yh(y, z) +∇zh(y,y)−∇yh(y, z))
−12Try∇y(∇xh(y, z) + ∇zh(x,y) − ∇yh(x, z))=
Ric(x, z)+12∇x∇zTrh+1
2h(x, z)−12∇x (divh) (z)−
12∇z (divh) (x), where (divh) (x) ≡ Try(∇yh)(y,x).
Practice problem: Show that div (λg) = dλ
By taking trace of the Einstein equation, find
R = 8πGTrT , hence Ric = −8πG(
T − 12gTrT
)
Assume that a background spacetime (Ric, T)
is given and we have a small perturbation δT
of the EMT. We need to solve the equation
1
2∇x∇zTrh+
1
2h(x, z)− 1
2∇x (divh) (z)
−1
2∇z (divh) (x) = −8πG(δT − 1
2gTrδT)
to first order in δT . Complicated!
58
The 3+1/SVT decomposition
Questions: 1. What is the metric gµν gener-ated by a small δTµν in flat spacetime?2. What is the metric g generated by a smallδTµν superimposed onto the cosmological Tµνin FRW spacetime (g = dt2 − a2dx2)?
To solve the difficult tensor equation, use tricks:1. Compute time components separately fromspatial components; h =
h00, h0j, hij
2. Use scalar-vector-tensor (SVT) decompo-sition for h and T
Helmholz decomposition: ω = dλ+ν, divν = 0(we define divν ≡ div (g −1ν) for 1-forms)Determine λ: divω = div dλ = ∆λ, then λ =1∆divω if the BVP (λ → 0 at |x| → ∞) has aunique solution (true in Euclidean signature!)
Motivation for the 3+1 split:- background will depend on t via a(t), can useFourier transform only in 3D but not in time- cannot use SVT decomposition with Lorentzianmetric signature (no unique solution to BVP)!
Motivation for the SVT decomposition:- equations for S,V,T sectors are decoupled- different physical sources contribute to S,V,Tin δT (usually have only scalar sources)
59
Plan:
1. Perform 3+1/SVT decomposition
2. Compute Ric in flat background
3. Resolve gauge issues using 3+1/SVT
4. Solve Einstein equation, determine h
5. dt2 − a2dx2 = a2(η)(
dη2 − dx2)
; make con-
formal transformation to FRW background with
fixed a(t) to compute Ric for perturbed FRW
3+1/SVT decomposition of vectors
Decompose 4-vectors as u = Ae0 + v; e0 ≡ ∂t;g(e0,v) = 0; u ≡
A, vj
. Further, decompose
v as gradient + divergence-free, vj = B,j+Cj,
Cj,j = 0. In 3-dim. Fourier space: v(k) =
iBk+C, k ·C = 0. So B := 1iv·kk·k, C := v− iBk.
In real space: B = 1∆divv; C = v − g−1dB.
Note:(
1∆f
)
(x) = − ∫ f(x,y,z)d3x4π|x−x| , where x ∈ R3
Remarks on the SVT decomposition:
1. The decomposition is nonlocal in space
2. Only defined when divv → 0 sufficiently
quickly at spatial infinity (at least ∝ |x|−2)
3. Defined also without Fourier transform!
4. Useful only because perturbation equations
are algebraic in Fourier space
60
3+1/SVT decomposition of (0,2) tensors
Consider a symmetric tensor Tij in 3-dimensional
Euclidean space with metric δij
Algebraic decomposition: trace + traceless part:
Tij = Aδij + T(1)ij , T
(1)aa = 0, A := 1
3Taa
Fourier decomposition: project along tensors
made out of k
Component parallel to kikj − 13k
2δij:
T(1)ij =
(
kikj − 13k
2δij)
B + T(2)ij , T
(2)ij kikj = 0,
T(2)aa = 0, B := 3
2
kikjT(1)ij
k4
Component parallel to ki & orthogonal to kj:
T(2)ij = i
(
kiCj + kjCi)
+T(3)ij , T
(3)ij kj = 0, T
(3)aa =
0, Caka = 0, Cj := 1i
kaT(2)aj
k2; relabel Dij ≡ T (3)
ij
Final decomposition in real space: Tij = Aδij−(
∂i∂j − 13δij∆
)
B + Ci,j + Cj,i +Dij
Remarks:
1. Tij must decay at least as |x|−2 at infinity
2. Terms proportional to δij can be gathered
because we cannot have B,ij = Aδij (check!)
61
Properties of SVT decomposition
SVT decomposition can be applied to arbitrary
(Euclidean) tensors. It is useful because:
1. Tensor equations split into simpler eqs.
2. Eqs. for S/V/T components decouple
3. Often get algebraic eqs. in Fourier space
Theorem 1: If Aδij+B,ij+Ci,j+Cj,i+Dij = 0
and A,B,Ci, Dij → 0 at least as |x|−2 at infinity,
then each of A,B,Ci, Dij = 0 everywhere.
(Not true in spaces with Lorentzian signature!)
Theorem 2: EOM for perturbations of a ten-
sor X in the presence of (several) background
tensors will decouple after using a complete
decomposition of X with respect to projections
on the background tensors and SVT. (In our
case, the background tensors are k and δij)
Lemma: If a linear function of k is invariant
under 3D rotations, it is equal to zero.
62
Lecture 12, 2007-11-23. Why 4D/SVT
does not work; gauge issues
Try avoiding 3+1 decomposition (only SVT):
hµν = Agµν +B;µν + Cµ;ν + Cν;µ +Dµν
where divC = 0, divD = 0, TrD = 0. (Index-
free notation is more bulky now!)
Gauge transformation: h→ h+δh, where δh ≡Lug; so δh(x,y) = g(∇xu,y) + g(∇yu,x); in
index notation: δhµν = uµ;ν+uν;µ; decompose
uµ = b,µ+cµ with div c ≡ cµ;µ = 0; then δhµν =
2b;µν + cµ;ν + cν;µ. This removes B, C terms!
Compute: Trh = 4A; divh = dA; so δRicµν =
A;µν+ 12gµνA+ 1
2Dµν; Einstein equation will
set A = ..., Dµν = ... Note: Dµν has 5 d.o.f.
It appears that we have 5 d.o.f. in Dµν, but
this is wrong: the gauge has not been fixed
(have 3 free solutions of u = 0, divu = 0).
But these “gauge” solutions are not explicit.
• 3+1/SVT will make the d.o.f. explicit. Also
we will avoid having to solve u = 0 in FRW
background!
63
Perturbations in flat spacetime
Use ordinary derivative instead of ∇Use 3+1/SVT decomposition for h (notation
from V. F. Mukhanov, Physical Foundations
of Cosmology, chapter 7):
h00 = 2Φ; h0j = B,j + Sj; Sj,j = 0
hij = 2Ψδij + 2E,ij + Fi,j + Fj,i + tij
Gauge transformations: hµν → hµν+δhµν where
δhµν = uµ,ν + uν,µ. Apply 3+1/SVT to uµ:uµ →
a; b,j + cj
, cj,j = 0. Now compute:
δh00 = 2a,0; δh0j = a0,j + b,0j + cj,0;
δhij = 2b,ij + ci,j + cj,i
By the SVT theorem 1, we have
δΦ = a, δB = a+ b, δSj = cj,
δΨ = 0, δE = b, δFj = cj, δtij = 0
Hence we can set E = B = 0, Fj = 0 (this is
called longitudinal or Newtonian gauge)
Remarks: 1. Scalar parts of u contribute to
scalar parts of h, vector parts of u to vector
parts of h (cf. SVT theorem 2)
2. The source Tµν will, in general, change
under a gauge transformation as Tµν → Tµν +
LuTµν, but in flat space Tµν = 0. Will have to
take care with FRW background!
64
Einstein equations in almost flat space
Compute Ricµν to first order in Newtonian gauge:
h00 = 2Φ; h0j = Sj; Sj,j = 0
hij = 2Ψδij + tij, taa = 0, tij,j = 0
Need to compute
Ricµν =(Trh),µν + hµν − ∂µ (divh)ν − ∂ν (divh)µ
2Trh = 2Φ− 6Ψ
(divh)0 = gµνh0µ,ν = h00,0 − h0j,j = 2Φ
(divh)j = hj0,0 − hji,i = Sj − 2Ψ,j
Hence (in the SVT decomposition):
Ric00 = −3Ψ−∆Φ, Ric0j = −2Ψ,j −1
2∆Sj
Ricij = (Φ−Ψ),ij +1
2tij + δijΨ− 1
2
(
Sj,i + Si,j)
R = Ric00 −Ricjj = −6Ψ− 2∆Φ + 4∆Ψ
Need to decompose the EMT in 3+1/SVT:
T0j = α,j + βj; βj,j = 0
Tij = µδij + λ,ij + σi,j + σj,i + qij
σj,j = 0, qjj = 0, qij
,j = 0
65
Explicit solutions
Explicit expressions for 3+1/SVT components:
α =1
∆T0i,i λ =
1
∆
[
3
2
1
∆Tik,ik −
1
2Tii
]
µ =1
2
[
Tii −1
∆Tik,ik
]
σj =1
∆
[
Tji,i −(
1
∆Tik,ik
)
,j
]
Einstein equations in 3+1 decomposition are
−2∆Ψ = −8πGT00
−2Ψ,j −1
2∆Sj = −8πG
(
α,j + βj)
(Φ−Ψ),ij +1
2tij −
1
2
(
Sj,i + Si,j)
−[
2Ψ + ∆(Φ−Ψ)]
δij = −8πG(µδij + λ,ij+ σi,j + σj,i + qij)
Explicit solution using SVT theorem 1:
Ψ =1
∆4πGT00, Φ−Ψ = −8πGλ,
Sj =1
∆16πGβj, tij =
1
16πGqij
Remarks: 1. Other equations are consequencesof Tij
,j = 0, do not need to solve them!2. Φ,Ψ, Sj are constrained and not free d.o.f.3. A free wave solution can be added to tij(this gives 2 vacuum d.o.f.)4. For Tµν = (ρ+ p)uµuν−pgµν we have α = 0,λ = 0, µ = p, σj = βj = 0, qij = 0, T00 = ρ+ p
66
Lecture 13, 2007-11-28. Perturbations of
gravity and matter in FRW spacetime
(see Mukhanov 2005, chapters 7 and 8)
Perturbations due to a single, ideal comoving
fluid (if one matter component dominates and
if vector perturbations are absent, e.g. scalar
field/dust/radiation but no cosmic strings):
Tµν = (ρ+ p)uµuν − pgµνIn flat spacetime it follows that Φ = Ψ and
Sj = 0, so the perturbed metric is simply
gµν =
(
1 + 2Φ 00 −δij + 2Φδij + tij
)
, uµ =
1000
To pass to FRW spacetime, note that any
FRW spacetime is conformally flat:
g = dt2 − a2(t)dx2 = a2(t)[
dt2 − dx2]
Note: t is the conformal time, t is the proper
time, t =∫ dta(t)
Example: de Sitter, a(t) = eHt, t = −H−1e−Ht,a(t) = |Ht|−1
Note that −∞ < t < 0 for de Sitter; t → 0 is
“late times”
In matter-dominated FRW, a(t) ∝ tp where
usually 0 < p < 1 and so 0 < t <∞67
Ricci tensor for perturbed metric in FRW
Ansatz for metric perturbations: g = a2g, so
gµν = a2(t)
(
1 + 2Φ 00 −δij (1− 2Φ) + tij
)
Now need to compute the Einstein tensor of g.
Use linear approximation in Φ, tij. Note: a(t)
is fixed and has no perturbations!
Use the conformal transformation formula with
g = e2Ωg, Ω ≡ ln a(t):
Ein = Ein + 2HΩ − 2dΩ⊗ dΩ− (2Ω + |w|2)gwhere w = g−1dΩ. Need the Hessian: HΩ(x,y) =
(∇.∇.Ω)(x,y) = (∇xdΩ) (y) = g(∇xw,y).
Compute: dΩ = (∂t ln a)dt, denote b(t) ≡∂t ln a = ∂ta; then dΩ = bdt; w ≈ (1− 2Φ) b∂t;
|w|2 ≈ (1− 2Φ) b2 to first order in Φ.
Covariant derivative ∇ is computed w.r.t. g
and contains perturbations:
HΩ(x,y) = g(∇xw,y) = b (1− 2Φ) g(∇x∂t,y)+
g(∂t,y)∇xb (1− 2Φ)
g(∇.∂t, ·) = 12
(
dg∂t + L∂tg)
= dΦ ∧ dt+ 12g
So HΩ(x,y) = −[(x Φ) (y t)+(y Φ) (x t)]b+b2g(x,y)+(yt)(xt)b; and Ω = TrxHΩ(x,x) =
−4bΦ + (1− 2Φ) b because Trxh(x,x) = −4Φ
68
Use 3+1/SVT decomposition to computeHΩ and the Einstein tensor:
dΩ = bdt; HΩ00 = b − bΦ; HΩ0j = −bΦ,j;HΩ ij = bΦδij + 1
2btij; w0 = (1− 2Φ) b; |w|2 =
(1− 2Φ) b2
use Ein = Ein+2HΩ−2dΩ⊗dΩ−(2Ω+|w|2)gEin00 ≡ Ein (∂t, ∂t) = −2∆Φ− 3b2 + 6bΦ
Ein0j ≡ Ein(
∂t, ∂xj)
= −2(
Φ + bΦ)
,j
Einij = 12tij+ btij−
(
b2 + 2b)
tij+[(
b2 + 2b)
−2Φ− 4
(
b2 + 2b)
Φ− 6bΦ]δij
Einstein equations:
Ein(0) + Ein(1) = −8πG(
T (0) + T (1))
Background Friedmann equ.
b2 =8πG
3T
(0)00 ; b2 + 2b = −8πGµ(0)
Perturbations in 00, 0j, ij:
∆Φ− 3bΦ = 4πGT(1)00
Φ + bΦ = 4πGα(1)
tij + 2btij − 2(
b2 + 2b)
tij = 4πGq(1)ij
In usual scenarios q(1)ij = 0, so only scalar per-
turbations are sourced
69
Lecture 14, 2007-11-30. Perturbations of
gravity and scalar field in FRW spacetime
Cosmology with scalar field:
S[g, f ] =
∫
d4x√−g
[
R
16πG+
1
2|dφ|2 − V (φ)
]
Perturbations in scalar field: φ = f0(t)+f(t,x)
Metric perturbations (only scalar):
g = a2(t)
(
1 + 2Φ 00 − (1− 2Φ) δij
)
Energy-momentum tensor of φ:
Tµν = φ,µφ,ν − gµνp, p ≡ 12 |dφ|
2 − V (φ)
p =1− 2Φ
a2
(
12f
20 + f0f
)
− V (f0)− fV ′(f0)
T00 = f20 + 2f0f − a2 (1 + 2Φ) p
=(
12f
20 + a2V0
)
+(
f0f + a2V ′0f + 2a2V0Φ)
T0j = f0f,j
Tij = (1− 2Φ) a2pδij =(
12f
20 − a2V0
)
δij + ...
Einstein equations in 3+1/SVT:
(background) 3b2 = 8πG(
12f
20 + a2V0
)
∆Φ− 3bΦ = 4πG(
f0f + a2V ′0f + 2a2V0Φ)
Φ + bΦ = 4πGf0f
70
Equations of motion for the field φ = f0(t)+f :
∂µ(√−ggµνφ,ν
)
+√−gV ′(φ) = 0
Compute:√−g = a4 (1− 2Φ);√−gg0νφ,ν = a4 (1− 2Φ) a−2 (1− 2Φ)
(
f0 + f)
=
a2(
f0 + f − 4Φf0)
;√−ggjνφ,ν = −a2 (1− 2Φ) (1− 2Φ) f,j = −a2f,j;V ′(φ) = V ′0+fV ′′0 ; so EOM for φ is a−2[a2(f0+
f −4Φf0)]·−∆f + a2
[
(1− 2Φ)V ′0 + V ′′0 f]
= 0.
Background EOM: a−2[
a2f0]·
+ a2V ′0 = 0
Perturbation EOM:
a−2[
a2f − 4Φa2f0]·−∆f−2Φa2V ′0+a2V ′′0 f = 0
Collect terms with Φ using background EOM:
a−2(
a2f)·− 4Φf0−∆f +2Φa2V ′0 + a2V ′′0 f = 0
Change variables Φ = a−1h, f = a−1x in the
00 and 0j Einstein equations; use background
EOM (3b2Φ = ...; a2V ′0f = ...) and simplify:
∆h− 3bh = 4πG[
−f20h+ f0x− 3bf0x− f0x
]
h = 4πGf0x
Make Fourier transform in (flat) 3-space:[
f20 −
k2
4πG
]
h = f0x− f0x;h
4πG= f0x
EOM for perturbations (f = ...) is a conse-
quence of this 1st order system
The Mukhanov-Sasaki variable
Would like to have a closed 2nd order equation
For quantization, expect x to be fundamental
(occurs as 12x
2 + ... in canonical field action)
Look for a combination v = x+B(t)h such that
v satisfies an equation of the form v+k2v = Cv
Substitute x = v −Bh into 2nd Einstein equ.:
h
4πG= f0 (v −Bh)
Expect relationships of the form
v = A1h+A2h, k2h = A3v+A4v+ 0 · hbecause then we will have
k2v = A1 (A3v+A4v) +A2 (A3v+A4v)·
Compute k2h from 1st Einstein equ.:[
f20 −
k2
4πG
]
h = f0(
v − Bh−Bh)
− f0 (v −Bh)
then substitute h = 4πGf0 (v −Bh) and find
− k2
4πGh = f0v −
(
4πGBf20 + f0
)
v
+ h(
−f0B + 4πGf20B
2 + f0B − f20
)
71
Now we need to cancel the last term:
f0B = f20
(
4πGB2 − 1)
+ f0B
This is a Riccati equation; convert to linear:
B = −f0Q
Q; Q = 4πGf2
0Q;
A suitable solution is Q = a−1 due to the back-ground equation 4πGf2
0 = b2− b (found by sub-tracting b2 + 2b = −8πGp from 3b2 = ...); so
B =f0b, v = x+
f0bh; v =
f0bh+
1
4πGf0h
− k2
4πGh = f0v −
(
4πG
bf30 + f0
)
v
Equation for v: define z ≡ af0b−1, then
v =z−1
4πG
(
zh
f0
)·, − k
2h
4πG= f0z
(
v
z
)·
−k2v =z−1
4πG
(
−k2h zf0
)·= z−1
(
f0z2
f0
(
v
z
)·)·
= v − zzv
Expressing x through v:
x = v − f0bh; h = 4πGf0z
1
∆
(
v
z
)·
Action for v that reproduces the EOM:
S = 12
∫
d4x
(
v2 − v,jv,j +z
zv2)
Qualitative considerations for solutions
Solution v(t) for k2 ≫ z/z is an oscillator ∝ eiktwith constant amplitudeThe function z(t) grows quickly as ≈ a(t), soeventually every mode k will cross the horizonSolution for k2 ≪ z/z has two branches, onegrowing as z, the other decayingThe growing mode means v ∝ z, so field per-turbation is x ∝ z/a = H−1∂tf0 ≈ const butactually slightly growing towards the end of
inflation. (Figure from Mukhanov 2005)
∝ φ/H
f
ttc tf
f0
∝ 1a
Note: describing the evolution of perturba-tions in non-φ dominated epochs requires adifferent calculation! (We used the fact thatthe background is φ-dominated.)
72
Lecture 15, 2007-12-07. Quantization of
free fields in FRW spacetime
(See also my lecture notes of 2006)
Consider a scalar field in a fixed FRW back-
ground spacetime:
S =
∫ √−gd4x[
1
2gαβ (∂αφ)
(
∂βφ)
− 1
2m2φ2
]
Use conformal time: g = a2(t)[
dt2 − dx2]
S =1
2
∫
d3x dt a4
a−2[
φ2 − (∇jφ)2]
−m2φ2
Change variables: χ ≡ a(t)φ, then
a2φ2 = χ2−2χχa
a+χ2
(
a
a
)2
= χ2+χ2a
a−[
χ2a
a
].
So the action is
S =1
2
∫
d3x dt(
χ2 − (∇jχ)2 −m2eff(t)χ2
)
m2eff(t) ≡ m2a2(t)− a(t)
a(t)
Scalar field φ(t,x) in FRW spacetime is equiva-
lent to scalar field χ(t,x) with a time-dependent
mass meff(t) in flat Minkowski spacetime
Note: the action for Mukhanov-Sasaki variable
v(t,x) is of this form with m = 0 and using z(t)
instead of a(t)
73
Quantization of free field with m(t)
Mode expansion: Fourier transform in space
(not in time)
χ (x, t) =
∫
d3k
(2π)3/2χk(t)e
ik·x, (χk)∗ = χ−k
Then a term such as∫
d3x(
∇jχ)2
becomes
∫
d3x(
∇jχ)2
=
∫
d3xd3k d3k′
(2π)3
(
ei(k+k′)·xi2k · k′χkχk′)
=
∫
d3k k2χkχ−k =
∫
d3k |χk|2 k2
So the action for χk(t) becomes
S[χk(t)] =
∫
d3k dt
(
1
2|χk|2 −
1
2|χk|2
[
k2 +m2eff(t)
]
)
Every Reχk, Imχk is a separate harmonic os-
cillator with time-dependent frequency
ω(t) ≡√
k2 +m2eff(t)
It is more convenient to consider complex-valued
variables χk
Equation of motion:
χk + ω2k(t)χk = 0
74
Quantization of time-dependent oscillators
Action for a unit-mass oscillator:
S =
∫
dt
[
1
2q2 − 1
2ω2(t)q2
]
Equation of motion: q+ ω2(t)q = 0
General solution (complex-valued) is in 2-dim.
space of solutions spanned by v(t), v∗(t):q(t) = a+v(t) + a−v∗(t); v+ ω2(t)v = 0
Quantization replaces q(t)→ q(t) and a± → a±
Standard quantization requires commutation
relations [q, p] = i where p = q, hence
[a+v+ a−v∗, a+v+ a−v∗]= [a−, a+](v∗v − vv∗) = i
A mode function is a solution v(t) such that
1 =v∗v − vv∗
i≡ 2Im
(
v∗v)
Note: v(t) must be complex-valued; have one-
parametric family of admissible mode functions
(can set v(t0)/v(t0) = λ arbitrarily if Imλ > 0)
Note: v(t) 6= 0 for all t because Im (v∗v) is a
Wronskian and is constant in time
For a given v(t) we have a− = 1i [v(t)q(t)− v(t)p(t)],
so the definition of a± depends on v(t)
75
Fock space: a− |0〉 = 0; |n〉 = 1√n!
(
a+)n |0〉,
depends on the choice of v(t)
Hamiltonian H = 12p
2 + 12ω
2q2 is expressed as
H(t) =|v|2+ω2|v|2
2
(
2a+a−+ 1)
+v2+ω2v2
2 a+a++
v∗2+ω2v∗22 a−a− [exercise]
Note: in time-independent cases, ω(t) = ω0,
one chooses v(t) = 1√2eiω0t, so H is diagonal in
the Fock basis: H |n〉 = En |n〉.
In general, |n〉 are not eigenstates of H(t), ex-
cept possibly at one t = t0 if we choose v(t)
such that v(t0) = iω(t0)v(t0). The state |0〉 is
then the instantaneous vacuum at t0.
Statement 1: It is impossible to chose v(t)
such that H(t) is diagonal at all times, i.e. v2+
ω2v2 = 0 for all t.
Proof: v = ei∫
ωdt does not satisfy EOM.
Statement 2: The instantaneous vacuum |t0〉minimizes 〈0| H(t) |0〉 over all possible |0〉.Proof: Consider |λ0〉 defined through v(t) such
that v/v = λ at t = t0. Compute 〈λ0| H(t0) |λ0〉 =|v0|2+ω2
0|v0|2
2 =|λ2|+ω2
02(λ−λ∗). If λ = A+ iB, then the
minimum is at A = 0, B = ω0. So v0 = iω0v0.
76
Lecture 16, 2007-12-12. Quantized free
scalar field in FRW
Decompose the Fourier modes as
χk(η) = a−kv∗k(t) + a+−k
vk(t), a+k
=(
a−k
)∗
Apply this to field mode expansion,
χ(x, t) =
∫
d3k
(2π)3/2
[
a−kv∗k(t) + a+−k
vk(t)]
eik·x
=∫
d3k
(2π)3/2
[
a−kv∗k(t)e
ik·x + a+kvk(t)e
−ik·x]
The canonical commutation relation is
[χ(x1, t), ˙χ(x2, t)] = iδ(x1 − x2)
Choose vk(t) for every k (same for equal |k|)
a−k
=vkχk − vk ˙χk
i; [a−
k, a+
k′ ] = δ(
k− k′)
Once vk(t) are chosen, define vacuum: a−k|0〉 =
0, ∀k. Excited states are interpreted as parti-
cles because 〈kn| H |kn〉 = 〈0| H |0〉+ nωk
Note: no instantaneous minimum energy pos-
sible if ω2k < 0. If so, can have e.g. 〈kn| H |kn〉 <
〈0| H |0〉. No particle interpretation possible!
77
Bogoliubov transformations
Consider an oscillator with “in-out” transition:
PSfrag replacements
ω(t)ω2
ω1
t1 t2t
Natural definitions of minimum-energy vacuum:
v1(t)|t<t1 =1√2eiω1t, v2(t)|t>t2 =
1√2eiω2t
Since
v1, v∗1
is a basis, we must have
v2(t) = αv1(t) + βv∗1(t)
(Bogoliubov transformation/coefficients)
Condition Im(
v2v∗2
)
= 12 gives |α|2 − |β|2 = 1
Express a±2 through a±1 , get a−2 = αa−1 − βa+1
The state∣
∣
∣t10⟩
is not vacuum at t = t2 but
has particles. Compute mean particle number:⟨
t10∣
∣
∣ a+2 a−2
∣
∣
∣t10⟩
= |β|2
Numerical computation of β requires knowing
v1(t), v1(t), v2(t), v2(t) at any one moment t0:
β =1
i[v1v2 − v2v1]t=t0
78
Number density for quantum particles
Bogolyubov transformation: uk(t) = αkvk(t)+
βkv∗k(t), and |αk|2 − |βk|2 = 1
Fourier expansions must agree:
eik·x(
v∗k(t)a+k
+ vk(t)a−k
)
= eik·x(
u∗k(t)b+k
+ uk(t)b−k
)
Hence b−k
= αka−k− βka+−k
, b+k
= α∗ka+k− β∗ka
−−k
Mean k-particle number in the ak-vacuum state:
〈a0| b+k b−k|a0〉 = 〈a0| (α∗ka
+k−β∗ka
−−k
)(αka−k−βka+−k
) |a0〉= |βk|2 〈a0| a−−k
a+−k|a0〉 = |βk|2 δ(3D)(0). The
divergent factor δ(3D)(0) represents 3-volume
of the space. Therefore the number density of
k-particles is nk = |βk|2.
Need to calculate the coefficients βk:
βk =ukvk − ukvk
i
Need to know the mode functions uk(t0), vk(t0)
at any one time t0
79
Particle production by gravity in FRW space-time
Toy model: a(t) = const for t < t1 and t > t2,nonstationary regime for t ∈ [t1, t2]
Mode functions are solutions of
vk +
(
k2 +m2a2 − aa
)
vk ≡ vk + ω2k(t)vk = 0
Natural definitions of vacuum for t < t1 andt > t2 are the positive-frequency solutions:
vk =1
√
2ω(1)k
eiω(1)k t for t < t1
uk =1
√
2ω(2)k
eiω(2)k t for t > t2
ω(1,2)k ≡ ωk(t1,2) =
√
k2 +m2a2(t1,2) = const
Initial state is vacuum ⇒ choose vk as modefunctions ⇒ final state has particle density
|βk|2 = |ukvk − ukvk|2 =1
2ω(2)k
∣
∣
∣
∣
vk(t2)− vk(t2)iω(2)k
∣
∣
∣
∣
2
Need to compute vk(t2). If we use WKB,
vk(t2) ≈1
√
2ωk(t2)exp
[
i∫ t2
t1ωk(t)dt
]
then |βk|2 ≈ 0, so this precision is insufficient!
80
Improved WKB approximation: “instanta-
neous squeezed state”
Consider Bogolyubov coefficients αk(t), βk(t)relating instantaneous vacua at t and at t1:
αk(t) = −v∗k − iωkv
∗k
i√ωk
, βk(t) =vk − iωkvk
i√ωk
Define “instantaneous squeezing parameter”
Z(t) ≡ −βk(t)α∗k(t)
=vk − iωkvkvk + iωkvk
Since |βk|2 =|Z|2
1−|Z|2 ≪ 1, we expect Z(t)≪ 1
Equation for Z(t) is
Z + 2iωkZ =ωk2ωk
(
1− Z2)
, Z(t1) = 0
First approximation (neglect Z2):
Z(t) ≈∫ t
t1
ωk(t′)
2ωk(t′)
exp
[
−2i∫ t
t′ωk(t
′′)dt′′]
dt′
Adiabatic estimate of Z(t): for t ∈ [t1, t2] weintegrate by parts and use ωk(t1) = 0,
iZ(t) ≈ exp
[
−2i∫ t
t1ωk(t
′′)dt′′]
×(
ωk4iω2
k
− 1
2iωk
d
dt
(
ωk4iω2
k
)
+ ...
)
81
This is an adiabatic series (the “small param-
eter” is the slowness of ωk, i.e. ωk ≪ ω2k etc.)
Note: this series is divergent! At t > t2 this
series gives Z(t2) ≈ 0 because Z(t2) is smaller
than the precision of this series.
Asymptotic estimate of |βk|2 ≈ |Z(t2)|2
Note: for high-energy particles (k2 ≫ m2a2,
k ≫∣
∣
∣
aa
∣
∣
∣) the effective frequency ωk(t) changes
slowly (is adiabatic)
Assuming t1 → −∞, t2 → +∞, ωk(t) analytic:
Change variable θ(t) ≡ ∫ tt1 ωk(t)dt, then
Z(θ) ≈ e−2iθ∫ θ
−∞ωk2ω2
k
e2iθ′dθ′
If t = t∗is a zero of ωk(t) of n-th order, then
ωk(t) ≈ ω(0)k (t− t∗)n, so we have near t ≈ t∗
that θ ≈ θ∗ +ω
(0)k
n+1 (t− t∗)n+1. The functionωk2ω2
k
≈ n
2ω(0)k (t−t∗)n+1
= n2(n+1)
1θ−θ∗ .For Imθ∗ > 0,
we get a contribution n2(n+1)
e−2Imθ∗
If δt is the characteristic timescale, Imθ∗ ∼ωkδt ≈ kδt, so |βk|2 ∝ e−4kδt is exponentially
small at high energy (ultrarelativistic particles)
Lecture 17, 2007-12-14. De Sitter space,
Bunch-Davies vacuum
De Sitter spacetime: a(t) = − (Ht)−1, so the
effective frequency is ωk(t) = k2+(
m2H−2 − 2)
t−2
Consider m≪ H (relevant in cosmology); then
vk +
(
k2 − 2
t2
)
vk = 0
has the general solution
vk = Ak
(
1 +i
kt
)
eikt +Bk
(
1− i
kt
)
e−ikt
Normalization yields |Ak|2 − |Bk|2 = 12k
How to choose Ak, Bk?
Consider instantaneous vacuum at early time
|t| ≫ k−1, then vk(t) ≈ 1√2keikt is a natural
choice. Interpretation: short-wave modes do
not feel the curvature, behave as in flat space.
Bunch-Davies vacuum state: for every k, choose
vk/vk → iωk at t→ −∞ (Minkowski vacuum at
early times). This yields Ak = 1√2k
, Bk = 0
82
Amplitude of quantum fluctuations
Fluctuations averaged on spatial scales L:
χL(t) ≡∫
d3x χ(t,x)WL(x),
where WL(x) is the window function such that
WL(x) ≪ 1 for |x| & L and∫
d3xWL(x) = 1,
for example WL(x) = 1
(2π)3/2L3exp
[
− |x|2
2L2
]
The quantum expectation value of χLχL is
〈0| χL(t)χL(t) |0〉 =∫
d3k
(2π)3|vk(t)|2 e−|k|
2L2
≈∫ L−1
0
k2dk
2π2|vk(t)|2 ∼
1
2π2|vk(t)|2 k3
∣
∣
∣
∣
k=L−1
This is called the power spectrum δχ2L(k) of
fluctuations in χ on scales L ∼ k−1
Power spectrum of free field in flat space:
vk =ei√k2+m2t
√2(k2 +m2)1/4
, δχ2L(k) =
1
2π2
k3√
k2 +m2
Power spectrum in de Sitter space (BD):
δφL(t) =k3/2 |vk(t)|a(t)π
√2
=H
2π
√
t2k2 + 1
At late times (t→ 0), δφL ≈ H2π = const!
83
Primordial fluctuations from inflation
Plan: 1. Quantize the MS variable v. 2. De-termine vacuum state |0〉. 3. Determine power
spectrum of Φ at the end of inflation.
vk +
(
k2 − zz
)
vk = 0, z(t) ≡ af0b
BD vacuum state for v: mode functions
vk(t) =1√2keikt, t→ −∞
To determine time tc(k) of horizon crossing,
need to have approximate expression for z(t)
Use slow roll approximation, conformal time:
b2 =a2
a=
8πG
3
(
1
2f20 + a2V (f0)
)
f0 + 2bf0 + a2V ′(f0) = 0
Slow-roll parameters: (of order 10−2)
ε =1
16πG
V ′2
V 2, η =
1
8πG
(
V
V
′′− 1
2
V ′2
V 2
)
Slow-roll solution: (note conformal time)
f0 ≈ −(√
V)′
√6πG
a = −a√
2
3V (f0)ε
z(t) ≈ − a√ε√
4πG;
z
z≈ 2b2 ≈ 16πG
3a2V (f0)
84
Approximately treat ε as a constant for now,and estimate a(t) ∝ t−1, z(t) ∝ t−1, z/z ≈ 2t−2
Horizon crossing: ktc(k) ≈√
2Approximate vk(t) after horizon crossing:
vk(t) ≈ Akz(t)[
1 + k2C(t)]
, Ak unknown
Find leading correction of order k2:
−1 ≈ C + 2z
zC = z−2
(
Cz2).
C(t) =
∫ tz−2(t′)
∫ t′z2(t′′)dt′′ ≈ −1
2t2 +O(t3)
vk(t) ≈ Akz(t)[
1− 1
2k2t2
]
, kt≪ 1
To determine Ak, match at horizon crossing:
Akz(tc(k)) ≈1√2keiktc(k), Ak ≈
1√2kz(tc(k))
Substitute vk into the formula for Φ:
Φ =1
a4πGf0
1
∆z
(
v
z
).
Φk(t) ≈ Aka(t)tε(t)√
8πG
3V (f0(t))
≈ 4πGε(t)
3√
kε(tc(k))
a(t)t
a(tc(k))
√
V (f0(t))
≈ 4πGε(t)
3√
kε(tc(k))
√2
k
√
V (f0(t))
85
Power spectrum
δΦk(t) =k3/2
π√
2|Φk(t)| =
4Gε(t)
3√
ε(tc(k))
√
V (f0(t))
Note: at the end of inflation ε(t) ≈ 1, so δΦk ∝ε−1/2(tc(k))
Towards the end of inflation ε grows, so the
spectrum is “red tilted”
86
Lecture 18, 2007-12-19. Null surfaces,
horizons
A smooth hypersurface (a submanifold of
codimension 1) S ⊂M is a set of points p : f(p) = 0where f is a smooth function such that df 6= 0
on S.
Normal vector n ≡ g−1df ; can redefine f →λf with λ 6= 0 on S, then n→ λn.
Tangent vectors: t f = 0, g(n, t) = 0
Note: 1. A null vector cannot be orthogonal to
a timelike vector. 2. If g(n,n) = g(l, l) = 0 and
g(n, l) = 0 then n = λl. 3. A timelike vector
cannot be orthogonal to a timelike vector.
Hypersurface is timelike / spacelike / null
iff the normal vector n is everywhere space-
like / timelike / null. A spacelike hypersurface
has only spacelike tangent vectors. A timelike
hypersurface has timelike, null, and spacelike
tangent vectors. A null hypersurface has 1
null and 2 spacelike basis tangent vectors (no
timelike!). For a null hypersurface, the normal
vector lies within the hypersurface!
Let us call hypersurfaces just simply surfaces.
87
Examples of null surfaces
Example 1. In Minkowski spacetime, consider
f(p) = 0, f(p) ≡ t−√
x2 + y2 + z2 ≡ t− rthen the normal vector is null,
n ≡ g−1df = ∂t +1
r(x∂x + y∂y + z∂z)
This surface is a lightcone focusing through
the origin.
Example 2. In Schwarzschild spacetime,
g =
(
1− 2m
r
)
dt2 −(
1− 2m
r
)−1
dr2 − r2dΩ2,
the surface r = 2m (the horizon H) is a null
surface.
Proof: Tangent space to r = 2m is spanned
by ∂φ, ∂θ, ∂t. The vector ∂t is null at r = 2m
(but not at r 6= 2m). Also, ∂t is normal to ∂θand ∂φ. Hence, ∂t is normal to TH everywhere
on H. Hence, H is a null surface.
Definition: a null function f(p) is such that
n ≡ g−1df is null everywhere.
88
Geodesic generators of null surfaces
Null surfaces are “made of” geodesic lines called
generators. These are the flow lines of the
normal vector n ≡ g−1df . Note: n is the
affine normal to the surface. (Other normals,
n′ = λn with λ 6= const, are not geodesic.)
Statement: If f is a null function then the
normal vector field n ≡ g−1df is geodesic.
Lemma (stronger!): If a vector n is integrable
(n = g−1dλ for some function λ) and normal-
ized (∇xg(n,n) = 0 for ∀x), then n is geodesic.
Proof: If n is integrable then dgn = 0. Let
us apply the zero 2-form dgn to two vectors
(n,x) where x is arbitrary:
0 = dgn(n,x) = ∇ng(n,x)−∇xg(n,n)− g(n, [n,x])= g(∇nn,x)− g(∇xn,n) = g(∇nn,x).
Interpretation of null functions:
Surface f(p) = 0 is (locally) a lightcone emit-
ted by some event at one moment. If we con-
sider a moving source, then f(p) is the retarded
time function, i.e. time at source when the
lightray that crosses p was emitted.
89
Causal properties
A piecewise smooth curve is causal if its tan-
gent vector is everywhere timelike or null.
A piecewise smooth surface is made of pieces
of smooth surfaces. (It is not smooth on sub-
manifolds of smaller dimension.)
Spacelike surfaces contain only spacelike curves;
null surfaces contain null and spacelike curves.
Surfaces of different kinds are useful for differ-
ent things:
1. Spacelike surfaces: for setting initial condi-
tions. A spacelike surface S is (locally) Cauchy
if every causal curve (in a neighborhood of S)intersects S exactly once.
2. Timelike surfaces are worldsheets of 2-
dimensional objects, e.g. boundaries of imagi-
nary boxes where some system is enclosed.
3. Null surfaces are boundaries of causally con-
nected regions.
The future influence domain I+(F) is the
set of points connected to F by future-directed
causal curves starting at F. (Similarly, the past
influence domain I−.)90
Causal boundaries are null surfaces
Statement: The boundary ∂I+(F) of the fu-
ture influence domain of any set F is a null
surface (piecewise, and away from F).
Proof: Consider a point p ∈ ∂I+(F) where
the normal vector n exists and is not null, and
also p 6∈ F. Go to a local Lorentz frame where
∂I+(F) is locally a plane orthogonal to n.
“Condition B” holds: Points infinitesimally
close to ∂I+(F) on one side are in I+(F) but
point infinitesimally close to ∂I+(F) on the
other side are not in I+(F).
If n is timelike, then a curve going along n is
causal and violates condition B.
If n is spacelike, then there exists a timelike
vector t within ∂I+(F). Then t + λn is time-
like for very small λ; a causal curve along t+λn
crosses ∂I+(F) and violates condition B.
Note: statement is not true if p ∈ F or if n is
undefined (a “corner” of a surface)
Definition: Event horizon H is the boundary
of the past influence domain of the asymptotic
far future (can be timelike or null future)
91
Stationary (Killing) horizons
A null surface H is a stationary (Killing) hori-zon for a Killing vector field k if k is normal toH : g(k,x) = 0 for every tangent vector x tothe surface. (Note: k must be hypersurface-orthogonal to admit a horizon)
Properties:1. Killing vector k is null on H; ∇kk = κk2. Surface gravity κ is constant along flowlines of k
3. (without proof) The set of points wherek = 0 is a 2-sphere B (bifurcation 2-surface)4. Surface gravity κ is constant on B5. Surface gravity κ changes sign across B
Derivations:
1. Note: k is proportional to the affine nor-mal n to H, so k is itself null on H. If t istangent to H, we must have 0 = 1
2∇tg(k,k) =g(∇tk,k) = −g(∇kk, t) for every t such thatg(k, t) = 0, hence ∇kk = λk on H. Recallthat z ≡
√
g(k,k), g−1dz = −∇kkz . Assume
g(k,k) > 0 in some domain bounded by H.Surface gravity (defined as force at infinity) is|κ| =
∣
∣
∣g−1dz∣
∣
∣H = limp→H|∇kk|z = |λ|.
Let us define κ (not only |κ|!) by the formula∇kk ≡ κk on H. (Signed surface gravity.)
92
Lecture 19, 2007-12-21. Zero-th law of
black hole thermodynamics
2. Need formula: ∇x∇yk−∇∇xyk = R(x,k)y
(derivation postponed), then ∇k∇kk = κ∇kk
and so 0 = ∇k (∇kk− κk) = −k∇kκ. Hence
κ = const along k (unless k = 0)
Similarly, ∇k
(
κ2)
= limp→H∇kg(∇kk,∇kk)
g(k,k)= 0
3. Assume κ 6= 0, introduce an affine null
normal n to H, write k = µn. By definition,
∇nn = 0, hence ∇kk = µn∇nµ = κµn, so
∇nµ = κ = const along a flow line (orbit) of
n, as long as µ 6= 0. Hence there is a point on
the orbit where µ changes sign. At that point,
k = 0, changes direction (bifurcation point).
Define B as the set of points where µ = 0.
4. Derive another formula for κ. Since (by
the Frobenius theorem) gk ∧ dgk = 0, a useful
identity is found from a double trace with dgk:
0 = Trx,yιxιy (gk ∧ dgk) ιxιy (dgk) = ...
But ιxιkdgk = 2g(∇kk,x), so ιkdgk = 2κgk.
Also 14Trx,ydgk (x,y)dgk (x,y) = Trxg(∇xk,∇xk) =
Trx(∇xg(k,∇xk)−g(k,∇x∇xk)) = 12z2−Ric(k,k).
93
Now ιyιx (gk ∧ dgk) = g(k,x)ιydgk−g(k,y)ιxdgk+gkdgk(x,y), then we compute 0 = 1
4Trx,yιyιx(gk ∧ dgk) ιyιxdgk = gk(|∇•k|2 + 2κ2), where
we denoted |∇•k|2 ≡ Trxg(∇xk,∇xk). Hence
κ2 = −1
2Trxg(∇xk,∇xk) = −1
4z2− 1
2Ric(k,k)
5. For any vector t tangent to B, we have
∇t∇xk = ∇∇txk + R(t,k)x, hence −∇tκ
2 =
Trxg(∇t∇xk,∇xk) = TrxR(t,k,x,∇xk) = 0 since
k = 0 on B.
Derivation of the formula for ∇a∇bk
g(∇a∇bk, c) = g(∇∇abk, c) +R(b, c, a,k)
Assume that all covariant derivatives of vec-
tors a,b, c vanish. We are trying to reverse the
order of vectors a,b by using the Killing prop-
erty g(∇ak,b) = −g(∇bk, a) and the definitionof R(). Write g(∇a∇bk, c) = ∇ag(∇bk, c) =
−∇ag(∇ck,b) = −g(∇a∇ck,b) since ∇a b, c =
0. Replace g(∇a∇ck,b) = g(∇c∇ak,b)+R(a, c,k,b)
and so g(∇a∇bk, c) = R(c, a,k,b)−g(∇c∇ak,b).
Now repeat this for the permutations (abkc)→(cakb)→ (bcka) and find g(∇a∇bk, c) = R(c, a,k,b)−R(b, c,k, a) + R(a,b,k, c) − g(∇a∇bk, c). Now
use the first Bianchi identity to express R(b, c, a,k) =
−R(a,b, c,k)−R(c, a,b,k). Hence we get
2g(∇a∇bk, c) = 2R(b, c, a,k)
Vector fields orthogonal to a single 3-surface
Statement: If a vector field v is either geodesic
or Killing, and is orthogonal to a single 3-
surface∑
(either spacelike or timelike but not
null) then it is everywhere hypersurface-orthogonal.
Proof: We have ∇vg(v,v) = Lvg(v,v) = 0 in
either case. Hence, g(v,v) is constant along
orbits of v. We may rescale v → u = λv
such that g(u,u) = const everywhere. (If v
is Killing, we will not need to rescale; see be-
low.) Now it is sufficient to show that u is
hypersurface-orthogonal everywhere.
Construct a family of surfaces Σ(τ) by trans-
porting Σ along u, where τ is the affine pa-
rameter, u τ = 1. Now prove that tangent
vectors to Σ(τ) remain orthogonal to u.
Choose a connecting vector c ∈ TpΣ such that
[c,u] = 0 everywhere; by construction g(c,u)Σ =
0. It remains to prove that Lug(c,u) = 0:
Lug(c,u) = (Lug) (c,u) + g(Luc,u) + g(c,Luu)
= (Lug) (c,u) = g(∇cu,u) + g(∇uu, c).
This vanishes either when Lug = 0 (Killing
case, set λ ≡ 1) or when g(u,u) = const and
∇uu = 0 (geodesic case).
94
Lecture 20, 2008-01-09. Near-horizon ge-
ometry for Killing horizons
Assumptions: a metric with hypersurface-orthogonal
Killing vector, and a Killing horizon H with
surface gravity κ =∣
∣
∣g−1dz∣
∣
∣ 6= 0, where z ≡√
g(k,k) is the redshift function (defined only
at that side of the horizon where k is timelike)
Want to investigate near-horizon properties.
Use t as time (k = ∂t), due to hypersurface-
orthogonality have g = z2dt2−(spatial metric).
Use z as a spatial coordinate; introduce spatial
coordinates
y1, y2
within surfaces z = const
(at least locally b/c dz 6= 0 on H)
Metric is written as
g = z2dt2−g(∂z, ∂z)dz2−∑
A,B
yAyBγAB(z, y1, y2)
To compute g(∂z, ∂z), consider the inverse met-
ric; the coefficient of g−1 at ∂z⊗∂z is g−1(dz,dz) =∣
∣
∣g−1dz∣
∣
∣
2 ≡ κ2f(z, yA) where f = 1 on H. Hence
g = z2dt2 − f−1κ−2dz2 − γ(...)Note: g is degenerate on H (det g = 0 on H).
95
Local Rindler frame
Coordinate z is undefined beyond the horizon.
Trick: Define ζ ≡ z2 and consider ζ < 0
g = ζdt2 − dζ2
4ζκ2f(ζ)− γ(...)
Near horizon, f ≈ 1, introduce coordinates
T,X via κT ≡ ±√
|ζ| sinhκt, κX ≡ ±√
|ζ| coshκtThen g ≈ dT2−dX2−γ(...). (2D flat spacetime×2D
space)
Rindler spacetime:
T
X
z = 0z =const
ζ > 0ζ > 0
ζ < 0
ζ < 0
z < 0
96
Removing coordinate singularity
Metric g = ζdt2 −(
4κ2ζf(ζ))−1
dζ2 − γ, Killing
vector k = ∂tSingularity det g = 0 at ζ = 0 is not physical
(det g is a coordinate-dependent quantity!)
Introduce (Painlevé) infalling coordinates: τ ≡t + A(ζ), where A(ζ) is chosen so that the
metric is nonsingular on H in coords.
τ, ζ, yA
:
g = ζ(
dτ −A′dζ)2 − γ
= ζdτ2 − 2A′ζdτdζ −(
1
4ζκ2f− ζA′2
)
dζ2 − γ(...)
Infalling means that ∂τ points inwards: g(∂τ , ∂ζ) >
0; this holds if A′(ζ) > 0.
Can choose A(ζ) such that A′ζ 6= 0 and that1
4ζκ2f−ζA′2 is finite as ζ → 0. For example, can
have 14ζκ2f
−ζA′2 → 1 on H. (Need A′(ζ) ≈ 12κζ
near H in any case.)
Examples: spherically symmetric spacetimes
g = Q(r)dt2 − dr2
Q(r)− r2
(
dθ2 + sin2 θdφ2)
Zeros of Q(r) are horizons. Reissner-Nordström:
Q(r) = 1 − 2mr + q2
r2; Schwarzschild-de Sitter:
Q(r) = 1− 2mr −H2r2
97
Computing surface gravity
Exercise: verify ∇kk = κk directly in Painlevé
coordinates
τ, ζ, yB
using the formula g(∇kk,x) =12 (dgk) (k,x).
Note: surface gravity depends on the normal-
ization of the Killing vector k! (This needs to
be fixed ad hoc, say g(k,k)→ 1 at infinity)
For metric of the form
g = P(x)dt2 − dx2
Q(x)− γ(x, yB)
where P(x0) = Q(x0) = 0, while P ′(x0) > 0,
Q′(x0) > 0. (Assuming k = ∂t in the given
coordinate system!)
Need to transform the metric to g = ζdt2 −(
4κ2ζf(ζ))−1
dζ2−γ where ζ = 0 at x = x0 and
f(0) = 1: change coordinates to ζ = P(x),
then
Q(x) ≈ Q′(x0)P ′(x0)
ζ, g = ζdt2 − P ′(x0)Q′(x0)ζ
dζ2
P ′2(x0)
and hence κ = 12
√
P ′(x0)Q′(x0).
98
Unruh effect
Rindler spacetime (2D): g = (1 + ax)2 dt2 −dx2; Killing vector k = ∂t; Killing horizon x =
−a−1; redshift z = 1+ ax (for x > −a−1 only!)
Proper acceleration of stationary observers:
a = −g−1d ln z =a
1 + ax∂x
Observer at x = 0 has constant acceleration a
Interpretation: horizon plane x = −a−1 due to
constant acceleration. Surface gravity κ = a
To be shown: quantum field in Minkowski vac-
uum appears to have thermal spectrum of par-
ticles, as measured by accelerated detectors
Unruh temperature: T = a2π in Planck units
(Analytic continuation arguments: 1. Euclidean
metric a2x2dt2E + dx2 is nonsingular at origin
only if tE is periodic with period 2π/a. 2.
Fields are functions of sin atE and must be pe-
riodic in tE. – Inconclusive!)
99
Lecture 21, 2008-01-11. Hawking radia-
tion. First law for Killing horizons
Metric g = ζdt2 −(
4κ2ζf(ζ))−1
dζ2 − γConsider lightrays in the t− ζ plane near H:
4ζ2κ2f(ζ)t2 = ζ2, t = ±∫
dζ
2κζ√
f(ζ)
Statement: Any null curve in 2D spacetime
is a null geodesic (lightray).
Proof: g(∇nn,n) = 0 but there is only one
vector orthogornal to n, so ∇nn = λn and
hence ∇µn (µn) = 0 for some µ.
Define the tortoise coordinate r ≡ ∫ dζ
2κζ√f(ζ)
,
then lightrays are t ± r = const; 2D metric is
conformally flat, g = ζ(
dt2 − dr2)
.
Consider scalar field (any coupling/potential):
S [g, φ] =
∫
dt dζ d2y
√γ
2κ√f
[
1
2ζφ2 − 2κ2fζ
(
φ,ζ)2
−1
2|∇⊥φ|2 − V (φ,R)
]
=
∫ √γd2y
∫
dt dr
[
1
2φ2 − 1
2φ2,r + (...) ζ
]
Coupling/potential is unimportant at ζ → 0!
Essentially, free field in 2D Rindler spacetime.
100
Unruh effect
Quantization of free field in Rindler spacetime:use two coordinate systems, t, r and T,XTortoise coordinate: r = a−1 ln (1 + ax)
Minkowski frame: aT = ear sinh at, aX = ear cosh at
g = dT2 − dX2 = e2ar(
dt2 − dr2)
Mode functions: uk = 1√2ωk
eiωkt, ωk = |k|Mode expansions for quantization:
φ(T,X) =
∫
dk√2π
1√
2 |k|
[
e−i|k|T+ikX a−k + ei|k|T−ikX a+k
]
φ(t, r) =∫
dk√2π
1√
2 |k|
[
e−i|k|t+ikrb−k + ei|k|t−ikrb+k
]
Use lightcone coordinates: u = t−r, v = t+r,U = T − X, V = T + X and rewrite modeexpansions using ωk as integration variable:
φ(U, V ) =∫ ∞
0
dΩ√2π
1√2Ω
[
e−iΩU a−Ω + eiΩU a+Ω
+e−iΩV a−−Ω + eiΩV a+−Ω
]
φ(u, v) =∫ ∞
0
dω√2π
1√2ω
[
e−iωub−ω + eiωub+ω
+e−iωvb−−ω + eiωvb+−ω]
Note: both expansions are for the same oper-ator φ, so we can express b±±ω through a±±Ω
101
Bogoliubov transformation
Relationship between u, v and U, V :
aU = −e−au, aV = eav
Hence the u-dependent part of φ is equal to
the U-dependent part: (same for v and V )∫ ∞
0
dω√2π
1√2ω
[
e−iωub−ω + eiωub+ω]
=
∫ ∞
0
dΩ√2π
1√2Ω
[
e−iΩU a−Ω + eiΩU a+Ω
]
Compute Fourier transform in u of both sides:
∫ +∞
−∞du√2πeiωu
∫ ∞
0
dω′√2π
1√2ω′
[
e−iω′ub−ω′+ eiω
′ub+ω′]
=1
√
2 |ω|
b−ω , ω > 0
b+|ω|, ω < 0
∫ +∞
−∞du√2πeiωu
∫ ∞
0
dΩ√2π
1√2Ω
[
e−iΩU a−Ω + eiΩU a+Ω
]
=
∫ +∞
−∞du
2π
∫ ∞
0
dΩ√2Ω
[
eiωu−iΩU a−Ω + eiωu+iΩU a+Ω
]
≡∫ ∞
0
dΩ√2Ω
[
F(Ω, ω)a−Ω + F(−Ω, ω)a+Ω
]
,
where we defined
F(Ω, ω) ≡∫ +∞
−∞du
2πexp
[
iωu+ iΩ
ae−au
]
102
Then the Bogoliubov transformation is
b−ω =
∫ +∞
0dΩ
[
αΩωa−Ω + βΩωa
+Ω
]
,
where (for ω > 0,Ω > 0)
αΩω =
√
ω
ΩF(Ω, ω), βΩω =
√
ω
ΩF(−Ω, ω)
Need to compute 〈Ω0| b+ω b−ω |Ω0〉 = ∫+∞0 dΩ |βΩω|2
where |Ω0〉 is the vacuum state annihilated bya−Ω (vacuum in the Minkowski frame T,X)
Trick 1: The function F(Ω, ω) satisfies
F(Ω, ω) = F(−Ω, ω) expπω
a, F ∗(Ω, ω) = F(−Ω,−ω)
Derivation: Shift the integration contour intothe line u = −iπa−1+ t with t ∈ R, then e−au =−e−at and a factor eπω/a appears.Trick 2: Canonical commutation condition,[
b−ω , b+ω′]
= δ(ω − ω′), entails
δ(ω − ω′) =
∫ +∞
−∞dΩ
[
αΩωα∗Ωω′ − βΩωβ∗Ωω′
]
=
∫ +∞
−∞dΩ
√ωω′
ΩF(−Ω, ω)F ∗(−Ω, ω′) exp
π
a
(
ω+ ω′)
The mean particle density is computed by set-ting ω = ω′ and discarding the δ(0) factor:
〈nω〉 =(
exp
[
2πω
a
]
+ 1
)−1
=
(
exp
[
E(k)
T
]
+ 1
)−1
First law for Killing horizons: δE = TδS
where δE is the energy flow through H, T = κ2π
is the temperature, S = 14GA is the “entropy of
the horizon”
Consider an almost Killing horizon (∇kk ≈ κk
on H) and a small amount of matter (Tµν)
that falls into the horizon very slowly.
“Area of the horizon” is the 2-area of the space-
like cross-section of H at t = const
Consider spacelike connecting vectors s1, s2 for
k; they remain approximately orthogonal to k
since Lkg(k, s) = (Lkg) (k, s) ≈ 0
Introduce a null vector l such that g(k, l) = 1,
then ∇kl ≈ −κl near HThe rate of change of 2-area under flow of k:
2-area A = 4-volume spanned by k, l, s1, s2,so A = ε(k, l, s1, s2) and
Lkε(k, l, s1, s2) = (div k)A+ ε(k,Lkl, s1, s2)
The last term approximately vanishes since
Lkl has almost no component parallel to l:
g(k,∇kl−∇lk) ≈ g(k,−κl + κl) = 0. Hence
div k ≈ ∇n lnA
103
Assume that the horizon is everywhere smooth
(this excludes e.g. collision of black holes!)
Then H is (locally) entirely covered by the flow
lines of k. Let t be the flow parameter for k.
Denote θ(t) ≡ div k along flow lines of k. Then
the total change in A is δA =∫+∞−∞ dt
∮
d2A∇n lnA =∫+∞−∞ dt
∮
d2Aθ(t)Compute ∂tθ(t) = ∇kTrxg(∇xk,x) = Ric(k,k)+div∇kk ≈ Ric(k,k) + κθ(t) since ∇kκ is small.
Then θ(t) = − ∫∞t Ric(k,k)eκ(t−t′)dt′ and so
∫ +∞
−∞θ(t)dt = −1
κ
∫ +∞
−∞Ric(k,k)dt′
=8πG
κ
∫ +∞
−∞T(k,k)dt′
The quantity δE ≡ ∮
d2A∫+∞−∞ dt′T(k,k) is the
total flux of energy through the horizon, so
δE =κ
2π
δA
4G= TδS
Remarks: 1. The null energy condition guar-
antees T(k,k) ≥ 0 and thus δS ≥ 0. But this
is only a weak “2nd law” b/c we assumed very
small variations of smooth Killing horizon.
2. Introducing a rotational Killing vector ∂φsuch that k = ∂t + Ω∂φ we get
δE−ΩδJ = TδS, δJ ≡∮
d2A∫ +∞
−∞dt′T(∂φ, ∂φ)
Lecture 22, 2008-01-16. Raychaudhuri equa-tion for timelike geodesics
Motivation: Want to describe the distortion ofa spacelike 3-volume by the Lie flow of a time-like geodesic field, compared with the paralleltransport (comoving flow)
v
γ
ca τ=0
b
τ
a(τ)
b(τ)
c(τ)
Let v be a timelike geodesic field. Let γ(τ) beone flow line. Let c be a connecting vector forv; values of c on γ are c(τ). Let cP (τ) be theresult of parallel transport of c(0) along γ.
104
Distortion of transverse connecting basis
Some useful properties:
Statement 1: A connecting vector c stays
orthogonal to v if v is normalized and geodesic
(or Killing)
Proof:∇vg(v, c) = g(v,∇cv) = 0;Lkg(k, c) = 0
Statement 2: (Distortion of connecting ba-
sis) a) The connecting vector c satisfies ∇vc =
Bv(c) where Bv is the derivative tensor of v;
g(Bv(c),x) ≡ g(∇cv,x). b) The transforma-
tion operator Z(τ) defined by c(τ) = Z(τ)cp(τ)
satisfies Z(0) = 1 and ddτ Z(τ) = Bv(τ)Z(τ)
Proof: g(∇vc,x) = g(∇cv,x) ≡ B(v)(c,x);(
ddτ Z
)
cP = ∇vZcP = ∇cv = Bvc = BvZcP for
any (parallelly transported) basis vector cP .
Statement 3: Transverse 3-volume satisfiesd
dτV = Lvε(v, a,b, c) = (divv)V = V TrBv
Statement 4: Derivative tensor is transverse,
B(v)(v, ·) = B(v)(·,v) = 0
Proof: g(∇xv,v) = 0, g(∇vv,x) = 0
So the tensor B(v) is effectively 3-dimensional.
105
Orthogonal decomposition of Bv for time-
like v
Consider B(v) as a bilinear form in 3-dim. space
with reduced metric hµν ≡ gµν − vµvνNote: divv = Trxg(∇xv,x) = Trxh(∇xv,x);
Trxh(x,x) = 3; h(v, ·) = 0
Decompose B(v) into pure trace, traceless sym-
metric, and antisymmetric parts (w.r.t. h):
B(v) =1
3hdivv + σ(v) + r(v)
where σ(v) is shear and r(v) = 12dgv is rotation
of v. Note: σ(v) and r(v) are transverse to v.
Statement 5: Rotation vanishes for hypersurface-
orthogonal, non-null geodesic v.
Proof: Let ω ≡ gv; by assumption ω ∧ dω = 0
and ω(v) 6= 0. Compute 0 = ιv (ω ∧ dω) =
ω(v)dω − ω ∧ ιvdω = ω(v)dω − ω ∧ ιv2r(v) =
ω(v)dω, hence dω = 0.
Counterexample for null v: set v = y (∂t + ∂x)
in Minkowski spacetime; then v is null, hypersurface-
orthogonal, and geodesic, but r(v) 6= 0.
Practice exercise: check the counterexample
106
Raychaudhuri equation
Compute ddτdivv. (Note: all 1st derivatives of
x vanish under trace, but [v,x] 6= 0.)
∇vTrxg(∇xv,x) = TrxR(v,x,v,x)
+ Trxg(∇x∇vv,x) + Trxg(∇[v,x]v,x)
= Ric(v,v)−Trxg(BvBvx,x)
Now simplify for timelike geodesics v using
the 3-dim. decomposition. Since Tr() is ascalar product in the space of matrices, we
have Tr(AB) = 0 if AT = A and BT = −B.
Traceless matrices are orthogonal to h. Hence
d
dτdivv = Ric(v,v)−1
3(divv)2−Trσvσv−Trrvrv
Comments: shear changes shape but not vol-
ume; rotation does not change shape; Tr canbe computed with h instead of g here; h is
negative-definite.
Focusing conditions
Statement 6: Trσvσv ≥ 0 and Trrvrv ≤ 0 forany timelike v.
Proof: If a matrix A satisfies AT = ±A then
±TrAa = ±Trxh(AAx,x) = Trxh(Ax, Ax) ≤ 0
Strong energy condition: Ric(v,v) ≤ 0 for alltimelike v (attractive nature of gravity)
107
Focusing theorem
Statement 7: a) For a hypersurface-orthogonal
timelike geodesic field v, assuming strong en-
ergy condition, we have
d
dτdivv ≤ −1
3(divv)2
b) If divv = −θ0 < 0 at τ = 0 then divv = −∞at finite time τ = τ0, where τ0 ≤ 3θ−1
0 .
Proof: a) Follows from Statements 5 and 6.
b) Consider auxiliary variable x(τ) ≡ (divv)−1,
then dxdτ ≥
13 and x(0) = −θ−1
0 . It follows that
x(τ0) = 0 at τ0 < 3θ−10 .
Interpretation: The volume of connecting ba-
sis shrinks to zero. So geodesics that are in-
finitesimally nearby focus onto γ(τ) at τ = τ0.
This is congruence-dependent, not just a prop-
erty of spacetime.
Statement 8: Volume vanishes as V ∝ (τ − τ0)nnear τ = τ0, where we may have n = 1,2,3
Proof: 3-volume element V (τ) is proportional
to det Z(τ) while Z satisfies ddτ Z = BvZ; hence
ddτ det Z = (divv) det Z. Since d
dτ det Z is al-
ways finite, we must have det Z = 0 at τ = τ0where divv = −∞. Since all components of Z
are regular at τ = τ0, the determinant vanishes
as polynomial of τ − τ0 of degree at most 3.
108
Lecture 23, 2008-01-18. Rauchaudhuri
equation for null geodesics
Orthogonal decomposition of Bn requires pro-
jecting onto the 2-dim. spacelike subspace or-
thogonal to n. Choose a null connecting vec-
tor l such that g(n, l) = 1,[n, l] = 0 everywhere.
Properties: ∇ng(n, l) = 0; g(∇ln, l) = 0.
Projection onto n⊥ is the operator
Pnx = x− ng(l,x)− lg(n,x)
2-dim. reduced metric h(2)µν = gµν − lµnν − nµlν
(Note: l is defined up to l → l + λn which
affects Pn but not the final results. l is the
lightray going towards inside of the horizon.)
Consider Bn as a bilinear form in 2-dim. space
with reduced metric h(2)
Note: divn = Trxg(∇xn,x) = Trxh(2)(∇xn,x);
Trxh(2)(x,x) = 2; h(2)(n, ·) = h(2)(l, ·) = 0
Decompose Bn into components w.r.t. the pro-
jector Pn: Bn(x,y) = Bn(Pnx, Pny)+Bn(Pnx, l)g(n,y)+
Bn(l, Pny)g(n,x)+Bn(l, l)g(n,x)g(n,y). Denote
for brevity Bn(Pnx, Pny) ≡ B⊥n (x,y). Compute
trace: Trxg(BnBnx,x) = Trx,yBn(x,y)Bn(y,x) =
Trx,yB⊥n (x,y)B⊥n (y,x) because all other com-
ponents generate Pnn = 0 or g(n,n) = 0.
109
Now decompose B⊥n (x,y) according to sym-
metry and trace properties:
B⊥n = 12h
(2)divn + σ⊥(n) + r⊥(n)
σ⊥n ≡ 12[B
⊥n (x,y) +B⊥n (y,x)]− 1
2h(2)(x,y)divn
r⊥n ≡ 12[B
⊥n (x,y)−B⊥n (y,x)]
Raychaudhuri equation:
d
dτdivn = Ric(n,n)−(divn)2
2−Tr
(
σ⊥n σ⊥n
)
−Tr(
r⊥n r⊥n
)
Statement 1: r⊥n = 0 if n is hypersurface-
orthogonal. (Even if rn does not vanish.)
Proof: The tensor r⊥n is equivalently defined
by r⊥n (x,y) = 12dgn(Pnx, Pny). This vanishes
(by construction) if either x or y is not in n⊥.It remains to consider x and y both in n⊥,so that Pnx = x and Pny = y. Then ιxgn =
ιygn = 0 and (by the Frobenius condition) 0 =
ιxιy (gn ∧ dgn) = gnιxιydgn = 2gnr⊥(n)
(x,y). So
r⊥(n)
(x,y) = 0.
Null energy condition (NEC): Ric(n,n) ≤ 0 for
any null n. [Equivalently, T(n,n) ≥ 0]
Focusing theorem: If NEC holds and divn <0 then focusing occurs within finite interval of
affine parameter τProof: Define x ≡ (divn)−1; then dx
dτ ≥12 and
x(0) = −θ−10 , so x(τ0) = 0 where τ0 < 2θ−1
0
Geodesic deviation equation
If c is a connecting vector to a geodesic fieldv then c satisfies the 2nd order equation:
∇v∇vc = R(v, c)v
Choose v as the field of all the geodesics emit-ted from a point p. Solve geodesic deviationequation along a single curve γ. Obtain c(τ)such that c(0) = 0, ∇vc(0) ≡ c 6= 0.The vector c(τ) is a Jacobi field for γ(τ).Point q ∈ γ is conjugate to p ∈ γ if ∃c(τ) notidentically zero, solving GDE, c = 0 at p and q.Interpretation: an infinitesimally nearby geodesiccurve emitted at p will intersect γ again at q.
Define the operator A(τ) as the map fromc(τ = 0) to c(τ) in a parallelly propagated
basis. Then ¨A = GA, where Gx ≡ R(v,x)v.Initial conditions: A(0) = 1− Pv, ˙A(0) = Pv
Conjugate points and focusing
1. A vector field emitted at p is always hypersurface-orthogonal (proof below)2. If divv < 0 at some point p then focus-ing is guaranteed (assuming energy conditions)within a finite distance δτ from p3. How to compute divv for this field: divv =ddτ ln det A. So det A→ 0 when divv→ −∞
110
Lecture 24, 2008-01-23. Extremal prop-
erties of geodesics
To understand the extremal properties it is im-
portant to consider a particular kind of vector
field: namely geodesics emitted from a point.
This field is hypersurface-orthogonal (see be-
low) and hence we may use the bounds on
focusing from the Raychaudhuri equation. Fo-
cusing will control the extremal properties.
Statement 2: A geodesic vector field v emit-
ted at a single point p is hypersurface-orthogonal.
Proof: Let ω ≡ gv; we will show that ω∧dω =
0. a) Suppose that v is timelike. Since dω =
2rv while rv is transverse to v, the only nonzero
possibility is ω∧dω(v, a,b) = ω(v)2rv(a,b) where
a,b ∈ v⊥. However, 2rv(a,b) = g(∇av,b) −g(∇bv, a). Let us choose a basis of vector
fields a,b, c connecting to v; by construc-
tion these fields vanish at p. So rv(a,b)|p =
0. Now compute the derivative of 2rv(a,b)
along v. We find 2∇vrv(a,b) = ∇vg(∇va,b)−∇vg(∇vb,a) = R(v, a,v,b) − R(v,b,v, a) = 0.
Hence rv(a,b) = 0 everywhere.
111
b) Suppose that v is null. Choose a null con-
necting vector l such that g(v, l) = 1. The only
nonzero possibility is ω ∧ dω(l, a,b) = 2rv(a,b)
where a,b ∈ v⊥. Then the same argument in-
volving a connecting basis shows that rv(a,b) =
0 everywhere.
Statement 3: The transformation operator
G is symmetric, GT = G, with respect to the
metric g
Proof: Compute g((G− GT )a,b) = g(Ga,b)−g(a, Gb) = R(v, a,v,b) − R(v,b,v, a) = 0, so
G− GT = 0
Conjugate points γ(0) and γ(τ0) correspond to
focusing of infinitesimally nearby geodesics:
τ = 0
γ(τ)
τ = τ0c(τ)
Second variation of action in mechanics
Consider mechanical system with action
S[x] =
∫ T
0
[
1
2x2 − V (x)
]
dt
Is the action minimized by a trajectory x(τ)
that solves x+ V ′(x) = 0?
Compute first and second variation of S:
δS =
∫ T
0
[
−x− V ′(x)]
δx(t)dt
δ2S = −∫ T
0
[
δx+ V ′′(x)δx]
δx(t)dt
If there exists a solution for “geodesic devi-
ation” such that δx(0) = 0, δx(0) 6= 0, and
δx(t0) = 0 for some t0 > 0, call t0 a conju-
gate point to t = 0. Suppose no conjugate
points exist until t = T . Then the solution
A(t) of A+ V ′′(x)A = 0, A(0) = 0, A(0) = 1
is nonzero for 0 < t ≤ T . For any δx(t) define
s(t) ≡ δx(t)/A(t) and substitute into δ2S:
δ2S = −∫ T
0
[
As+ 2As]
Asdt
=∫ T
0
[
−(
A2ss).
+A2s2]
dt
=
∫ T
0A2s2dt > 0 if s(t) 6= 0 somewhere
Then S has a minimum on x(t).
112
No minimum of action with conjugate points
Statement: If a conjugate point exists at t0 <
T , then S[x] has no minimum at x(t).
Proof: We will present explicitly a δx(t) such
that δ2S < 0. By assumption, a solution c(t)
of c + V ′′(x)c = 0 exists such that c(t0) =
0, c(t0) 6= 0. Choose δx(t) = c(t)θ(t0 − t) −c(t0)ε(t), where ε(t) is very small and positive
and only nonzero in a small neighborhood of
t = t0. Then δx+ V ′′(x)δx ≈ −c(t0)δ(t− t0) +
O(ε) near t− t0, so
δ2S =∫ T
0
[
δx+ V ′′(x)δx]
δx(t)dt
= −∫ T
0
[
c2(t0)ε(t)δ(t− t0) +O(ε2)]
dt
= −c2(t0)ε(t0) +O(ε2) < 0
for sufficiently small ε. (Note: sign of ε is
chosen to “cut the corner.”)
T
c(t)
0
tδx(t)
t0113
Second variation of geodesic length
Consider a timelike curve γ perturbed by a vec-
tor field t. Define Lie-propagated vector field
v such that Ltv = 0. Assume that t = 0 at
endpoints τ1, τ2. Proper length of the curve
γ(τ ;σ) displaced by parameter σ is
L[γ(τ ; σ)] =
∫ τ2
τ1
√
g(v,v)dτ
Compute first and second derivatives w.r.t. σ:
d
dσL[γ;σ] = −
∫ τ2
τ1g
(
t,∇vv
N
)
dτ, N ≡√
g(v,v)
d2
dσ2L[γ;σ] = −
∫ τ2
τ1
[
g
(
∇tt,∇vv
N
)
+ g
(
t,∇t∇vv
N
)]
dτ
For a geodesic curve, ∇vvN = 0, and we can
set N = 1 to simplify calculations. Moreover:
g(t,∇t∇vv) = R(t,v,v, t) + g(t,∇v∇vt)
Hence the 2nd variation evaluated on γ is
d2
dσ2L[γ;σ] =
∫ τ2
τ1g(
t, Gt−∇v∇vt)
dτ
If this is negative for any field t(τ) then L[γ]is locally maximized by the curve γ
Note: if γ has conjugate points τ1, τ2 with a
Jacobi field c(τ), then choosing t = c yields
the absence of maximum! (See next theorem)
114
Theorem of geodesic extremum:
a) If a timelike geodesic curve γ(τ) has no
conjugate point to τ = 0 within an interval
[0, T ], then γ is a local maximum of L[γ] for
curves between γ(0) and γ(T).
b) If τ = τ0 < T is a conjugate point to τ =
0 on γ(τ) then there exists a longer timelike
curve between γ(0) and γ(T).
Proof: a) Consider the operator A(τ) for γ.If there are no conjugate points for γ(0) within
τ ∈ [0, T ], then A(τ) is nonsingular, so A−1(τ)exists for τ ∈ [0, T ]. We will now show that
d2
dσ2L[γ; σ] =
∫ T
0g(
t, Gt−∇v∇vt)
dτ < 0
for all nonzero t(τ) ∈ v⊥ that vanish at τ = 0
and τ = T . Define s(τ) ≡ A−1(τ)t(τ), denote
∇v under the integral by the overdot, and ex-
press the integral through s(τ):∫ T
0g(
As, GAs− (As)··)
dτ = −∫ T
0g(
As,2 ˙As + As)
dτ
Now we add a total derivative ∇vg(As, As) =
g( ˙As + As, As) + g(As, ˙As + As) and obtain
d2
dσ2L[γ; σ] =
∫ T
0g(
As, As)
dτ
+
∫ T
0
[
g(
˙As, As)
− g(
As, ˙As)]
dτ
115
Rewrite g( ˙As, As)−g(As, ˙As) = g(
s, ( ˙AT A− AT ˙A)s)
and examine this Wronskian-like combination:˙AT A− AT ˙A = 0 at τ = 0, while ¨AT = AT GT =
AT G. Hence(
˙AT A− AT ˙A).
= 0 and finally
d2
dσ2L[γ;σ] =
∫ T
0g(
As, As)
dτ < 0
since A is nondegenerate (has no zero eigen-
vectors) and As is always spacelike, while s 6= 0
at least for some τ .
b) If there exists a conjugate point at τ =
τ0 < T , then ∃ Jacobi field c(τ) 6= 0 such that
c(0) = c(τ0) = 0. Using c(τ) we will explicitly
construct a field t(τ) such that d2
dσ2L[γ(τ ;σ)] >0 when evaluated on the geodesic γ. The
idea is to “cut the corner”: set t(τ) ≈ c(τ)for 0 < τ < τ0 and t ≈ 0 for τ > τ0. Explicitly,
t = c(τ)θ(τ0 − τ) − εq(τ)b, where b = const,
g(b, c(τ0)) = −1, and ε > 0, q(τ) > 0. Then
Gt− t(τ) = c(τ0)δ(τ − τ0) +O(ε)
and finally we obtain
d2
dσ2L[γ;σ] =
∫ T
0g(t, Gt− t)dτ
=
∫ T
0g(−εq(τ)b, c(τ0)δ(τ − τ0))dτ
= εq(τ0) +O(ε2) > 0
Lecture 25, 2008-01-25. Extremal prop-
erties of null geodesics
Since null curves cannot be obtained by ex-
tremizing∫
√
g(v,v)dτ , we will use an alterna-
tive variational principle:
L[γ] =
∫ T
0
g(γ, γ)
Ndτ,
where N(τ) is a Lagrange multiplier. Define
the vector field v as the Lie transport of γ by
a perturbation field t, then LtN = 0, Ltv = 0.
First variation w.r.t. parameter σ:
d
dσL[γ; σ] = Lt
∫
g(v,v)
Ndτ =
∫
2g(∇tv,v)
Ndτ
=
∫
2g(∇vt,v
N)dτ =
∫
d
dτ(...)dτ −
∫
2g(t,∇vv
N)dτ
= −2
∫
g(t,∇vv
N)dτ
Second variation (using ∇vvN = 0 on shell):
d2
dσ2L[γ;σ] = −2
∫
g(t,∇t∇vv
N)dτ
= −2
∫ [
R(t,v,v
N, t) + g(t,∇v/N∇v/Nt)
]
dτ
= 2
∫
g(t, G(τ)t− t)Ndτ,
where G(τ)t ≡ R( vN , t)
vN ; overdot means d
d(Nτ).
116
Restrict N = 1 everywhere, so have ∇vg(v, t) =
g(v,∇vt) = g(v,∇tv) = 12∇tg(v,v) = 0 and
since t = 0 at endpoints, must have g(v, t) = 0
everywhere.
So it suffices to consider t ∈ v⊥.Also, t = λv gives no variation since Gv = 0
and g(t, t) ∝ g(v,v) = 0. So it suffices to con-
sider t ∈ v⊥ up to multiples of v.
Introduce a null vector u such that g(u,v) = 1
and fix the projector Pvx ≡ x − vg(x,u) =
ug(x,v). Then it suffices to consider t ∈ (u,v)⊥.Define A(τ) by the equation GA − ¨A = 0,
A(0) = 1 − Pv, ˙A(0) = Pv. Then focus point
τ = τ0 exists iff a solution A(τ) exists with
det A(τ0) = 0.
Now the same argument involving a solution
A(τ) and t(τ) ≡ A(τ)s(τ) shows that
d2
dσ2L =
∫
g(As, As)dτ < 0
since As is in (u,v)⊥ and thus everywhere space-
like. Similarly, if a focus point exists then there
exist a deformation t(τ) such that d2L/dσ2 >0. This indicates that the deformed curve is
at least somewhere timelike. An explicit ge-
ometric consideration indicates that the curve
can be deformed to everywhere timelike.
Deforming the null curve into timelike
1. Near the focus point τ0 we have a solu-
tion c(τ) such that c ∈ (u,v)⊥, c(τ0) = 0,
c(τ0) 6= 0. Consider the curve γ that follows
v + c until τ = τ0 and then v. The vector γ is
piecewise null.
2. Consider the tangent vector w at τ =
τ0− δτ . The vectors v and w are both future-
directed and null, so g(v,w) > 0, g(v,v) =
g(w,w) = 0. Hence b ≡ v − w is spacelike:
g(b,b) = g(v − w,v − w) = −2g(w,v) < 0.
Also g(b,w) > 0, g(b,v) < 0.
3. The deviation vector t(τ) can be chosen as
t = εq(τ)b where ε > 0, q(τ) > 0, signq(τ) =
sign (τ0 − τ) near τ = τ0, for example q(τ) =
1− |τ − τ0|. Then q(τ)g(γ,b) > 0 for all τ .
4. The tangent vector to the deformed curve
is ˙γ = γ + t = γ + εq(τ)b.
5. We verify that ˙γ is everywhere timelike:
g(γ + t, γ + t) = 2εq(τ)g(γ, b) +O(ε2) ≥ 0
117
t
vv
ww
118
Extremal properties for geodesics emitted
from a surface
Consider a spacelike surface Σ and a point p 6∈Σ. What is the shortest distance from p to Σ?
It is the geodesic γ emitted normally from Σ
towards p unless there exists a conjugate point
on γ before p.
Example: let Σ be the half-hyperboloid given
by t2 − x2 − y2 − z2 = 1, t < 0 in Minkowski
spacetime. The normal vector field is n =
t∂t+x∂x+y∂y+z∂z. A straight line (geodesic)
at point t, x, y, z towards the origin is x =
−t,−x,−y,−z and so it is orthogonal to Σ
because x ‖ n.
Hence the origin is the focus of geodesics emit-
ted normally from Σ.
Consequence: There is no (timelike) curve of
maximal proper length connecting 1,0,0,0and Σ
119
The easiest singularity theorem
If Σ is a compact Cauchy surface with (time-
like) normal vector n, such that divn < 0 ev-
erywhere on Σ, and if the strong energy condi-
tion holds to the future of Σ, then no timelike
geodesic to the future from Σ is complete.
Proof: 1. Consider the timelike geodesic field
n emitted normally from Σ. Since Σ is com-
pact, divn has an upper bound such that divn <−θ0 everywhere on Σ with some θ0 > 0. Since
n is hypersurface-orthogonal, by focusing the-
orem every geodesic has a conjugate point to
Σ at τ < τ0 ≡ 3θ−10 .
2. By extremal length theorem, no geodesic
maximizes proper length longer than τ0.3. Since Σ is compact and Cauchy, for any
point p to the future of Σ there is a nonempty
set of timelike or null curves intersecting Σ and
p. These curves intersect Σ at a compact set,
hence there is a point p′ ∈ Σ and a curve γ be-
tween p′ and p that maximizes proper length
between p and Σ. Then the curve γ would
have to be a geodesic emitted normally from
Σ. But such geodesic cannot maximize proper
length if it is longer than τ0. Therefore, no
points p can exist such that there is a timelike
curve from Σ to p with proper length > τ0.
120
Lecture 26, 2008-01-30. Area theorem
2nd law of horizon thermodynamics: The
combined area of event horizons does not de-
crease with time. (Assuming some conditions!)
Event horizon H = boundary of the set Bwhere signals do not escape to infinity.
Statement 1: (a) Event horizon H is locally
a null surface. (b) Null generators of H may
enter H from the past (at “corners”) but can-
not exit H to the future. (Assuming no naked
singularity directly on H, i.e. that the space-
time is asymptotically predictable.)
Proof: (a) There are no timelike curves cross-
ing H from interior outwards. Hence H cannot
contain timelike tangent vectors but must con-
tain a null tangent vector. B must be on the
“future side” from H. (b) Let γ be a null gen-
erator of H; then ∃γ′ that is arbitrarily close
to γ but never enters B, for instance, γ′ couldbe chosen as γ shifted a little bit towards the
past. Suppose p is a point where the null gen-
erator γ leaves H. By assumption p is not a
singularity, so we may continue γ past p; then γ
enters B\H. However, arbitrarily nearby curves
γ′ never enter B at all; contradiction.
121
Statement 2: If the NEC holds and if there
are no naked singularities, then divn ≥ 0 for
affine generators n of the horizon.
Proof: Suppose divn < 0 at some p ∈ H;
then divn < 0 also in a neighborhood of p.Consider the null geodesic γ emitted from p.By focusing theorem, the null generators n will
focus within a finite interval of affine parame-
ter. After focusing, some point q on γ will be
reachable from p by a timelike curve. However,
a timelike curve from p must be entirely within
B (or else p is visible from infinity). Hence, γleaves H since a point q on γ is not in H. How-
ever, Statement 1 says that γ cannot leave H;
contradiction. Area theorem (Hawking): Let Σ1 and Σ2
be two Cauchy surfaces (Σ2 later than Σ1).
Let H be the event horizon. Then the cross-
section (2D) area of H∩Σ2 is not smaller than
the area of H∩Σ1.
Proof: The generators of H emitted from
H ∩ Σ1 cannot leave H and thus all intersect
Σ2, generating a portion of H∩Σ2. The area
of that portion cannot be smaller than the area
of H∩Σ1 since divn ≥ 0. Other disconnected
pieces of H∩Σ2 might have appeared, making
the area of H∩Σ2 even larger. Remark: Since divn ≥ 0 everywhere, the area
of each connected piece of H is non-decreasing.
122
Singularity in collapsing universe
The existence of a singularity = the existence
of a geodesic that cannot be continued beyond
a finite parameter range ∆τ .
Note: If a piecewise timelike or null curve be-
tween points p, q has proper length L, then
there exists a timelike geodesic between p, q
of length at least L. Same for curves between
a point p and a surface Σ.
Statement 3: If Σ is a spacelike Cauchy su-
face with normal vector field n such that divn ≤−θ0 < 0 everywhere, and if the strong energy
condition holds, then every timelike geodesic
from Σ has proper time ∆τ ≤ 3θ−10 .
Proof: Select any point p to the future of Σ
and consider the set of all causal (timelike or
null) geodesic curves leading to p. All these
curves intersect Σ only once (Cauchy prop-
erty of Σ). The set of tangent vectors at p is
compact (it includes the past timelike and null
directions), hence the set of intersection points
with Σ is a compact subset of Σ. Hence, there
123
exists the longest (by proper length) geodesic
curve γ(p)max between Σ and p. Hence, γ
(p)max
is also the longest piecewise causal curve be-
tween Σ and p.
The curve γ(p)max must be timelike (since it has
proper length 6= 0). By the focusing theorem,
every timelike geodesic orthogonal to Σ will
have a focus within ∆τ ≤ 3θ−10 . Hence, no
geodesic longer than 3θ−10 maximizes proper
length. Hence, γ(p)max is not longer than 3θ−1
0 ,
for any point p to the future of Σ.
Singularity in closed universe
Closed universe = containing a compact space-
like 3-surface Σ without boundary (but not
necessarily a Cauchy surface!)
Cauchy horizon of Σ is ∂I+(Σ) and is locally
a null surface such that I+(Σ) is on the future
side of ∂I+(Σ). Null generators of ∂I+(Σ)
cannot leave it (same reason as before).
Statement 4: If a compact spacelike 3-surface
Σ without boundary, with the normal vector
n such that divn < 0 on Σ, and if the SEC
holds, and if the spacetime has no closed time-
like curves (strongly causal), then there is at
least one incomplete timelike geodesic emitted
from Σ.
Proof: Since Σ is compact, divn < −θ0 < 0
for some θ0 everywhere. By the focusing theo-
rem and extremal theorem, no geodesic emit-
ted from Σ that is longer than 3θ−10 maxi-
mizes length. Let us show that, for every point
p ∈ I+(Σ), the set of causal geodesics be-
tween Σ and p is compact. The intersection
124
I−(p) ∩Σ is a subset of a compact set Σ and
is thus bounded. It remains to show that it is
closed. Any sequence in I−(p)∩Σ corresponds
to a sequence of tangent vectors in the past
lightcone at p and thus has an accumulation
point, a tangent vectorv. If the geodesic γ
emitted from p in the direction v does not in-
tersect Σ, then a neighborhood around γ also
does not intersect Σ; this is impossible.
Hence for each p there exists a timelike geodesic
γ(p)max of maximum length, among all geodesics.
But no other causal curve between Σ and p
can be longer than 3θ−10 ; this is so because
any causal curve can be deformed to a smooth
causal curve of no smaller proper length, and
then, if it is not a geodesic, deformed further
to a curve of longer proper length. Hence,
a nongeodesic curve cannot maximize proper
length. The only remaining possibility is that
there are (nongeodesic) causal curves of ar-
bitrarily large proper length. However, this is
impossible because the set of all causal curves
between a compact Σ and p is compact (in the
C0 topology in the space of curves). Proper
length is upper semicontinuous as a functional
of piecewise differentiable curves, and hence
there exist a causal curve of finite maximum
length between Σ and p. This curve must be
a geodesic and thus is shorter than 3θ−10 .
Hence, no p ∈ I+(Σ) can be further than 3θ−10
from Σ. , so I+(Σ) ⊂ Σ ×[
0,3θ−10
]
. Hence
I+(Σ) is compact.
Now consider the Cauchy horizon ∂I+(Σ): it
is either compact or noncompact. If it is com-
pact, then it contains null generators that wind
around infinitely many times. Then ∃ closed
timelike curve. Hence ∂I+(Σ) is noncompact.
But I+(Σ) is compact; contradiction.
Remark: Hawking & Ellis give a longer proof
without requiring the no-CTC condition.
Lecture 27, 2008-02-01. Singular collapse
There are two families of null geodesics emit-ted normally from a spacelike 2-surface; “in-
wards” and “outwards”.Trapped 2-surface = both “inwards” and “out-wards” future null geodesics emitted normally
to the 2-surface have negative divergence.
Trapped surface theorem (Penrose): If a
time-orientable spacetime M has a noncom-pact Cauchy surface C and a compact, space-
like trapped 2-surface S, then there exists anincomplete null geodesic to the future of S.Proof: Consider ∂I+(S): it is generated bynull geodesics n emitted normally from S. SinceS is compact, divn < −θ0 < 0 on S. Suppose
that every null geodesic has length ∆τ ≥ 2θ−10
from S. By the focusing theorem and the ex-
tremal theorem for null geodesics, every pointp on a null geodesic after ∆τ = 2θ−1
0 will beinside I+(S). Hence, ∂I+(S) ⊂ S ×
[
0,2θ−10
]
and so ∂I+(S) is compact.Since M is time-orientable, ∃v : g(v,v) = 1.
Each orbit of v intersects ∂I+(S) and also Cexactly once. Hence orbits of v map 1-into-1 ∂I+(S) → C. So the image of ∂I+(S) is
a compact subset of C, hence its boundary isnonempty, but ∂∂I+(S) = ∅. Contradiction.
125
Topology change ⇒ closed timelike curves
Theorem (Geroch): If the spacetime domain
M between two compact, spacelike, bound-
less surfaces Σ1, Σ2 is compact and admits
no closed timelike curves, then Σ1∼= Σ2 (have
the same topology) and M ∼= Σ1 × [0,1].Proof: Choose a timelike, nonvanishing vec-
tor field v in M. Orbits of v from Σ1 either
end on Σ2 or never reach Σ2. If an orbit never
reaches Σ2, it winds around itself and can be
deformed to a CTC. Hence, all orbits reach
Σ2 in finite parameter time and provide a 1-
to-1 map Σ1 → Σ2. Parameter time can be
rescaled to [0,1].
Theorem (Hawking): If there is a tube-like
domain T connecting spacelike 3-surfaces Σ1
and Σ2 of different topology, and if ∂T con-
sists of a timelike surface and of Σ1,2, then
there is a closed timelike curve in T .
Proof: One can choose a nonvanishing time-
like vector field v in T such that v is tangent to
the timelike side boundary of T . If each orbit
of v from Σ1 ends on Σ2, then one would have
a 1-to-1 map Σ1 → Σ2, but they have different
topology. So there are some orbits that wind
around in T and never reach Σ2. These orbits
can be deformed into CTCs.
126
Holographic principle
Generalized 2nd law of thermodynamics:δS ≥ 0
where S =combined entropy of matter + 14 of
combined area of black hole horizons.
Susskind process: A system can be made into
a black hole with entropy ≤ πR2 by adding
more energy to it.
Spacelike entropy bound: If a system is bounded
by a sphere of radius R then its entropy is not
larger than πR2.
GSL = A black hole is the highest-entropy ob-
ject possible in a given spherical volume.
This suggests that the fundamental degrees of
freedom scale as area rather than as volume;
hence the name “holography”.
127
Spacelike bound is not generally covariant and/or
not generally valid.
Difficulties with generalizing the spacelike bound:
1. Not clear what is inside and what is outside
a sphere if the universe is closed.
2. In an expanding universe, obviously one
can find an arbitrarily large volume and the en-
tropy density is constant on large scales, hence
S ∝ A3/2
3. In a collapsing star, we may enclose it in
arbitrarily small (infalling) sphere and violate
the bound.
4. Choose a spacelike 3-surface (“surface of
constant time”) such that the boundary of the
system is a 2-surface of vanishing 2-area, then
area is arbitrarily small while entropy “enclosed
in it” remains finite. (The entropy bound, as
formulated, is noncovariant!)
These pitfalls are in some sense “kinematical”
since they do not involve a violation of the
Susskind process. Need a covariant formula-
tion of the entropy bound that avoids these
pitfalls.
128
Covariant entropy bound (Bousso)
A lightsheet L for a 2-surface Σ is a null sur-
face made by emitting null geodesics normally
to Σ such that divn ≤ 0 everywhere. (L needs
to be cut if the nonconvergence condition is
violated somewhere.)
Entropy on a lightsheet is the integral over
L of the 3-form dual to the entropy current
s. (Note: integration in the null direction is
independent of the choice of the affine param-
eter!)
Covariant entropy bound: The total entropy
on a lightsheet is smaller than 14 of the area of
the initial surface Σ.
Interpretation: This is the entropy visible from
Σ. This avoids the counterexamples above!
Spacelike projection theorem (Bousso): If
a system is enclosed in Σ and the lightsheet
L emitted from Σ has Σ as its only boundary,
then the entire entropy of the system is not
larger than the entropy on the worldsheet.
Proof: All matter within Σ will pass through
L; the entropy will not decrease until that time.
129
Lecture 28, 2008-02-06. Inflation is not
eternal to the past
Inflation = accelerated expansion of spacetime.
Example: de Sitter spacetime, g = dt2−e2Htd~x2
Expansion of spacetime = ∃ timelike geodesic
vector field v such that divv > 0 everywhere
throughout a spacetime domain between two
Cauchy surfaces. For de Sitter spacetime: v =
∂t. Compute divv = Trxg(∇xv,x) using or-
thonormal basis
v, e−Ht∂j
: since g(∂j, ∂j) =
−e2Ht, we get divv = g(∇vv,v)−∑3j=1 e
−2Ht
×g(∇∂jv, ∂j) = −3e−2Ht12v g(∂j, ∂j) = 3H.
Hence, in a general spacetime we estimate the
local Hubble rate as H ≡ 13divv ≈ −g(n,∇nv)
for any spacelike, normalized n (g(n,n) = −1)
Theorem (Borde & Vilenkin): If a spacetime
is everywhere expanding, i.e. there exists a
timelike geodesic field v such that −g(n,∇nv) ≥H0 > 0 for any spacelike vector n (assuming
g(v,v) = 1, g(n,n) = −1, g(v,n) = 0) every-
where to the past of a Cauchy surface Σ, then
there exists a timelike geodesic that is incom-
plete to the past of Σ.
130
Proof: Visually, assume that spacetime is filled
with particles move along worldlines that are
orbits of v. The incomplete timelike geodesic
will be constructed explicitly. It will be the
worldline of an observer not at rest with re-
spect to the comoving particles. One can choose
a geodesic vector field u such that g(u,u) = 1
and g(v,u) > 1 everywhere. Denote γ ≡ g(v,u)
(this is the gamma factor of special relativity,
γ = (1− ~vrel)−1/2, where ~vrel is the relative ve-
locity of observer w.r.t. comoving particles).
Then u = αv+βn for some constants α, β and
spacelike vector n orthogonal to v. Explicitly,
projecting u onto v⊥, we get
n =u− vg(u,v)
|u− vg(u,v)| =u− γv√
γ2 − 1≡ αu + βv.
The observer can measure ∇uγ by observing
relative velocities of comoving particles. Then
the observer estimates the local expansion fac-
tor as
H = −g(∇nv,n) = −g(∇αu+βvv, αu + βv)
= −α2g(∇uv,u) = −α2∇ug(v,u) = − ∇uγ
γ2 − 1.
Replace ∇uγ = ∂τγ and integrate H along the
observer’s worldline:∫ τ2
τ1Hdτ = −
∫ τ2
τ2
γ
γ2 − 1dt =
1
2lnγ + 1
γ − 1
∣
∣
∣
∣
∣
τ2
τ1
<1
2lnγ(τ2) + 1
γ(τ1)− 1
since γ > 1 and ln(...) > 0. On the other hand,∫ τ2τ1Hdτ ≥ H2(τ2 − τ1). Hence, there is a lower
bound on the value of τ1:
τ1 > τ2 −1
2H0lnγ(τ2) + 1
γ(τ2)− 1≡ τ1min.
Hence, the worldline u cannot be extended
to the past beyond τ1min and is incomplete.
(Note: orbits of v are complete, but all other
geodesics are incomplete.)
Conclusion: Either there exists a singularity
to the past, or inflation is not past-eternal.
Remarks: 1. The theorem is purely kinemat-
ical - no Einstein equations or energy condi-
tions are used.
2. In de Sitter space, either one uses global
coordinates where the universe is initially con-
tracting, or the worldline u reaches the “edge”
where the geodesic field v is not expanding any
more.
Superluminal travel
“Warp drive” (Alcubierre 1994):
g = dt2 − (dx− vsf(rs)dt)2 − dy2 − dz2,
vs(t) ≡dxs
dt, rs ≡
√
(x− xs(t))2 + y2 + z2,
f(r) ≈
1, |r| < R
0, |r| > R
The worldline γ ≡ τ, xs(τ),0,0 is timelike be-
cause dτ = dt on γ (no time dilation!).
Properties: 1. γ is always a geodesic. Proof:
Let u ≡ γ = ∂t + vs(t)∂x. Compute gu = dt,
hence g(∇uu,x) = 12(Lug)(u,x) = g(∇xu,u) =
0 since g(u,u) ≡ 1.
2. The function xs(t) is completely arbitrary,
which allows superluminal travel.
3. All energy conditions are violated. [Omitted
calculation: T(∂t, ∂t) ∝ −v2s f ′2]Negative energy is concentrated in the “skin”
of the bubble.
131
Superluminal travel needs negative energyK. D. Olum, Phys.Rev.Lett.81, p.3567, 1998; gr-qc/9805003
Generic superluminal travel: Assume that there
exists a null geodesic that travels further (in a
fixed, externally defined time) than any neigh-
bor null geodesic.
Formalize this construction. Let lightrays be
emitted orthogonally from an extrinsically flat
(made of spacelike geodesics), spacelike 2-surface
ΣA. The “externally defined time” is a space-
like 3-surface C intersected by the lightrays.
Let the “furthest-reaching” lightray be between
A ∈ ΣA and B ∈ C. Hence, there exists an
extrinsically flat 2-surface ΣB ⊂ C such that
B ∈ ΣB but no other point p ∈ ΣB can be
reached by any lightrays from ΣA. Then call
the path γ between A and B superluminal.
Theorem (K. Olum): The NEC is violated on
a superluminal path γ if Ric(γ, γ) < 0 at some
point on γ.
Proof: Consider a congruence n of geodesics
emitted normally from ΣA. There are no con-
jugate points to ΣA on γ (or else B is timelike-
reachable from A). Hence, there is no focus-
ing, and divn remains finite everywhere on γ.
132
Initially, divn = 0 on ΣA because there exists a
basis of (spacelike) geodesic vector fields ci ∈TΣA and we have g(n, ci) = 0 and g(∇cin, ci) =
∇cig(n, ci)− g(n,∇cici) = 0.
Raychaudhuri’s equation gives
∇ndivn = Ric(n,n)−Tr(σ⊥n σ⊥n )− 1
2(divn)2
Since Ric(n,n) < 0 somewhere on γ, we must
have divn < 0 at B.
Now we show that divn ≥ 0 at B. Define a
basis of connecting fields ci for n, and consider
ci at point B. The fields ci are not geodesic
at B; the orbits of ci must leave ΣB at B
towards the future of B, or else points near
B are in I+(ΣA). Hence, the deviation ∇cicimust point to the future, so g(n,∇cici) ≥ 0.
Also, γ is normal to ΣB at B (or else it can be
deformed to reach other points near B).
Also, g(ci,n) = 0 everywhere because ∇ng(ci,n) =
0 while g(ci,n) = 0 everywhere on ΣA.
To compute divn at B, consider g(∇cin, ci) =
−g(∇cici,n) at B. Since g(∇cici,n) ≥ 0, we
find divn =∑
i
g(∇cin,ci)
g(ci,ci)≥ 0. Contradiction.
Remarks: 1. The NEC is violated directly on
the path of the signal.
2. Singularities do not help prevent this.
133
Lecture 29, 2008-02-08. Conservation laws.
Komar’s global energy integral
Consider a stationary spacetime with a Killing
vector k. The derivative tensor Bk = 12dgk is
a 2-form, hence its divergence is a divergence-
free 1-form (conserved current):
divBk = Trxg(∇x∇xk, ·) = g(k, ·)[Why is it conserved? divBk = −∗d∗Bk, hence
(∗d∗)(∗d∗)Bk = 0.]
More generally: any rank 2 tensor ω satisfies
ωαβ;[µν]
= Rλβµνωαλ+Rλαµνω
λβ, hence a 2-form
ω satisfies ωαβ;αβ = 2Trx,yRic(x,y)ω(x,y) = 0.
The “conserved charge” Q is obtained by inte-
grating the flux of the conserved current q ≡g−1divBk = k through a 3-surface C:
Q ≡∫
Cg(q,n)d3V =
∫
Cg(k,n)d3V
Since k is a total divergence, rewrite this as
Q =
∫
Σ=∂C
(
∇αkβ)
nαvβd2Σ
Can write using forms:∫
C d ∗Bk =∫
∂C (∗Bk)
The conserved charge Q is an integral over an
infinitely large 2-surface enclosing the system.
134
Example: Schwarzschild spacetime, k = ∂t.
The redshift function is z = z(r) =√
1− 2Gmr .
The metric is g = z2dt2 − z−2dr2 − r2dΩ2
Integrate∫
ΣBk(v,n)d2Σ, where Σ is a sphere
of large radius R. Vectors normal to the sphere
are v ≡ z−1k, n = z∂r. Need to compute
Bk(v,n) = g(∇vk,n) = g(∇z−1kk, z∂r). We
know that the proper acceleration of station-
ary observers is ∇kk = −12g−1dz2. Hence
g(∇kk, ∂r) = −1
2∂rz
2 = −Gmr2
Integrating over the 2-sphere of radius R:
Q =
∫
Σ
(
−mr2
)
d2Σ = −4πGm.
This is the mass of the black hole, times (−4πG).
Now we show that this is not a coincidence:
the total energy (including gravity) of a sta-
tionary spacetime is E = − 14πGQ
135
Interpretation of Q as the energy
Statement: In the Newtonian limit, Q = −4πE,
where E is the total mass of the matter sources.
Proof: The property ∇x∇yk = R(x,k)y yields
g(k,x) = Tryg(∇y∇yk,x) = TryR(y,k,y,x) =
Ric(k,x) and hence Q =∫
CRic(k,n)d3V .
“Newtonian limit” = particles move along or-
bits of k, normal to the equal-time surface C;the energy-momentum tensor of matter sources
is Tµν = ρkµkν, where g(k,k) ≈ 1 and ρ is the
distribution of mass density.
By Einstein equations: Ric(k,n) = −8πG(T(k,n)−12(TrT)g(k,n)) ≈ −4πGρg(k,n) since TrT ≈ ρ.Hence Q =
∫
C(−4πG)ρd3V = −4πGE.
Remarks: 1. In the non-Newtonian case, we
get a combined energy of gravity and matter.
2. The energy (or Komar mass) is time-
independent.
3. Can define the total angular momentum
similarly, using the azimuthal Killing vector ∂φ4. If only have approximate Killing vectors at
infinity, can use the formula with the surface
integral and obtain conserved quantities
136
Conservation laws and energy-momentum
pseudotensors
Killing vectors are special and do not always
exist. But there is always invariance under dif-
feomorphisms.
An arbitrary diffeomorphism is generated by a
vector field v. The variation of the metric is
δg = Lvg. The variation of the action gives
δS =
∫
([Einµν + 8πGTµν]δgµν + qα;α)
√−gd4x
where Ein = Ric − 12Rg is the Einstein ten-
sor, and q is the vector field such that qα;α =
gαβδRαβ. (Explicitly, δRic = LvRic and q =
g−1divdgv.) Since the above is an identity,
while δgµν = −vµ;ν − vν;µ, so we have (using
Bianchi identity and conservation of Tµν) that
[Einµν+8πGTµν]δgµν = −2 ([Einµν + 8πGTµν]v
µ);ν
and so
(qν − 2[Einµν + 8πGTµν]vµ);ν = 0.
On-shell (Ein + 8πGT = 0) we have qα;α = 0,
i.e. a conserved current. Up to a total deriva-
tive, one can write
qν = 2[Einµν + 8πGTµν]vµ +
(
vλUλ[µν]
),µ
137
One can add an exact 1-form to qα such that
the result does not contain 2nd derivatives of
g (but contains derivatives of vα):
qν = qν +(
vβUβ[µν]
)
,µ, Uβ
[µν] = ...
Explicitly, one will have
qν = 8πGTµνvµ + τµνv
ν + σµνλvν;λ
for some (non-tensor) quantities τ and σ.
A “conserved density” is obtained as√−gqα
such that(√−gqα),α =
√−gqα;α = 0 on-shell.
Choose 4 different vectors v0,v1,v2,v3 and
obtain a “pseudotensor”. For example, in a co-
ordinate system xα choose v0 = 1,0,0,0,v1 = 0,1,0,0 etc., i.e. v
µα = δ
µα (noncovariant
definition!). Then one obtains four conserved
currents q0, ..., q3. Since the numerically cho-
sen components of vα are constant, we have,
for example,
[q0]ν = 8πGT0ν + τ0ν
and so on; hence
(8πGTµν + τµν),µ = 0
so 18πGτµν is the “energy-momentum pseudoten-
sor for gravity” such that the combined EMT
of gravity and matter is (non-covariantly) con-
served.
In this way, choosing different vectors vα and
different coordinate systems xα, various “energy-
momentum pseudotensors” can be constructed.
These pseudotensors are not tensors for the
following reasons:
1. They are defined through vectors vα given
by numerical components in a fixed coordinate
system.
2. They are collections of four conserved vec-
tors, rather than single conserved rank 2 ob-
jects.