Skip to content

Entanglement Entropy in QFT

Entanglement entropy is one of the places where quantum field theory most clearly remembers that it is a theory with infinitely many local degrees of freedom. It is also one of the places where AdS/CFT turns a hard field-theory question into a geometric one.

The field-theory question is simple to state. Take a quantum state of a QFT on a spatial slice. Divide the slice into a region AA and its complement AcA^c. How much information about the full state is lost when an observer has access only to AA?

The answer is encoded in the reduced density matrix

ρA=TrAcρ,\rho_A = \operatorname{Tr}_{A^c} \rho,

and in particular in the von Neumann entropy

SA=TrAρAlogρA.S_A = -\operatorname{Tr}_A \rho_A \log \rho_A .

This page builds the QFT side carefully. The next page explains why, in holographic theories at large NN, the same quantity is computed by the area of a surface in the bulk.

A spatial slice is divided into a region A and its complement. The reduced density matrix is obtained by tracing over the complement.

A spatial slice Σ\Sigma divided into AA and AcA^c. The entangling surface A\partial A is the locus where UV entanglement across the cut produces the leading area-law divergence.

The original AdS/CFT dictionary relates local boundary operators to bulk fields. Entanglement entropy goes beyond local operators. It asks about the organization of quantum information in a spatial region.

This matters for three reasons.

First, QFT entanglement entropy has a characteristic UV divergence controlled by the area of the entangling surface A\partial A. Holographically, the Ryu–Takayanagi formula turns that divergence into the near-boundary divergence of a bulk area. The match is almost too direct to ignore:

short-distance entanglement across Anear-boundary area divergence of γA.\text{short-distance entanglement across }\partial A \quad \longleftrightarrow \quad \text{near-boundary area divergence of }\gamma_A .

Second, entanglement entropy is sensitive to phases. In large-NN gauge theories, entanglement can diagnose confinement, thermal entropy, and phase transitions. In the bulk, these features become competing extremal surfaces, horizons, and topology.

Third, entanglement is now one of the main languages for emergent spacetime. Later in the course, relative entropy, entanglement wedges, and quantum extremal surfaces will make the slogan more precise: parts of the bulk are encoded in boundary subregions.

Setup: a region and its reduced density matrix

Section titled “Setup: a region and its reduced density matrix”

Let Σ\Sigma be a Cauchy slice of the boundary QFT, and choose a spatial subregion AΣA\subset \Sigma. Its complement is Ac=ΣAA^c=\Sigma\setminus A, and the common boundary

A=Ac\partial A = \partial A^c

is called the entangling surface.

In a finite-dimensional quantum system with Hilbert-space factorization

H=HAHAc,\mathcal H = \mathcal H_A \otimes \mathcal H_{A^c},

the reduced density matrix is unambiguous:

ρA=TrAcρ.\rho_A = \operatorname{Tr}_{A^c}\rho .

For a pure global state ρ=ΨΨ\rho = |\Psi\rangle\langle\Psi|, the entropy SAS_A measures entanglement between AA and AcA^c.

In continuum QFT, one should be more careful. The Hilbert space does not literally factorize across a sharp spatial cut without a regulator. Gauge theories add another subtlety: Gauss constraints couple the two sides of the entangling surface. A practical way to proceed in this foundations course is:

  1. regulate the QFT, for example by putting it on a spatial lattice or introducing a UV cutoff ϵ\epsilon;
  2. define ρA\rho_A in the regulated theory;
  3. study the cutoff dependence and universal terms as ϵ0\epsilon\to 0.

This is the right level of precision for the classical holographic formulas developed here. The divergences are not a bug. They are part of the physics.

The von Neumann entropy of the reduced density matrix is

SA=TrρAlogρA.S_A = -\operatorname{Tr}\rho_A\log\rho_A .

It is often useful to package the same information into Rényi entropies:

Sn(A)=11nlogTrρAn,n>0,n1.S_n(A) = \frac{1}{1-n}\log \operatorname{Tr}\rho_A^n, \qquad n>0, \qquad n\neq 1 .

The entanglement entropy is recovered by analytic continuation:

SA=limn1Sn(A)=nlogTrρAnn=1.S_A = \lim_{n\to 1} S_n(A) = -\left.\partial_n \log \operatorname{Tr}\rho_A^n\right|_{n=1} .

This formula is the seed of the replica trick. In Euclidean QFT, TrρAn\operatorname{Tr}\rho_A^n can often be represented as a path integral on an nn-sheeted geometry branched over A\partial A.

A useful normalized version is

TrρAn=Zn(Z1)n,\operatorname{Tr}\rho_A^n = \frac{Z_n}{(Z_1)^n},

where ZnZ_n is the partition function on the replicated geometry. Then

SA=n(nlogZ1logZn)n=1.S_A = \left.\partial_n\left(n\log Z_1-\log Z_n\right)\right|_{n=1} .

In holography, this replica expression becomes a gravitational problem: find a bulk saddle whose asymptotic boundary is the replicated boundary geometry. The Ryu–Takayanagi area term emerges from that gravitational replica construction.

The most important structural fact is that entanglement entropy in local QFT is UV divergent. For a smooth entangling surface in a dd-dimensional spacetime QFT, the leading divergence scales like

SAαd2Area(A)ϵd2+,d>2,S_A \sim \alpha_{d-2}\frac{\operatorname{Area}(\partial A)}{\epsilon^{d-2}} + \cdots, \qquad d>2,

where ϵ\epsilon is a short-distance cutoff. In d=2d=2, the entangling surface consists of points, and the leading divergence is logarithmic.

The area law is easy to understand physically. Most entanglement between AA and AcA^c comes from short-distance modes straddling the entangling surface. A thin layer of thickness ϵ\epsilon around A\partial A contributes roughly one independent amount of entanglement per cutoff cell. The number of such cells is proportional to Area(A)/ϵd2\operatorname{Area}(\partial A)/\epsilon^{d-2}.

The coefficient αd2\alpha_{d-2} is not universal. It depends on the regulator and microscopic details. Universal information sits in more delicate terms:

  • in even spacetime dimension dd, smooth entangling surfaces can have logarithmic terms whose coefficients are related to conformal anomaly data;
  • in odd spacetime dimension dd, constant terms for special regions, such as spheres, can be universal;
  • mutual information between separated regions is finite and cutoff independent.

This non-universality is important. Entanglement entropy is not usually a finite observable by itself. Universal pieces, variations, inequalities, and differences are often more physical than the full regulated entropy.

For a two-dimensional CFT on the infinite line in the vacuum, the entanglement entropy of an interval of length \ell is

SA=c3logϵ+s0,S_A = \frac{c}{3}\log\frac{\ell}{\epsilon}+s_0,

where cc is the central charge and s0s_0 is a non-universal constant. This formula is one of the basic checks of holographic entanglement entropy. In AdS3_3/CFT2_2, the Ryu–Takayanagi surface is a bulk geodesic, and its regularized length reproduces exactly the logarithm above when one uses the Brown–Henneaux relation

c=3L2G3.c = \frac{3L}{2G_3} .

At finite temperature T=1/βT=1/\beta, the interval entropy becomes

SA=c3log(βπϵsinhπβ)+s0.S_A = \frac{c}{3}\log\left(\frac{\beta}{\pi\epsilon}\sinh\frac{\pi\ell}{\beta}\right)+s_0 .

For β\ell\gg \beta, this grows linearly with \ell:

SAπc3β+,S_A \sim \frac{\pi c}{3\beta}\ell + \cdots,

which is the ordinary thermal entropy density times the interval length. In the bulk, that crossover is tied to geodesics probing a BTZ black-hole geometry.

Pure states, mixed states, and complements

Section titled “Pure states, mixed states, and complements”

If the full state on Σ\Sigma is pure, then AA and AcA^c have the same nonzero eigenvalues in their reduced density matrices. Therefore

SA=SAcS_A = S_{A^c}

for a pure global state.

If the full state is mixed, this equality need not hold. For example, in a thermal state, the entropy of a large region includes ordinary thermal entropy. In holography, this distinction becomes geometric. In a black-hole spacetime, the RT surface for a sufficiently large region may include a contribution related to the horizon.

This is one of the reasons the homology condition in the RT formula matters. The surface is not merely any minimal surface anchored at A\partial A; it must be compatible with a bulk region whose boundary includes AA.

Entanglement entropy obeys powerful inequalities. These are field-theory constraints that holographic areas must reproduce.

For two regions AA and BB, subadditivity says

SA+SBSAB.S_A + S_B \geq S_{A\cup B} .

A sharper inequality is strong subadditivity:

SAB+SBCSB+SABC.S_{A\cup B} + S_{B\cup C} \geq S_B + S_{A\cup B\cup C} .

Equivalently, the conditional mutual information is nonnegative:

I(A:CB)=SAB+SBCSBSABC0.I(A:C|B) = S_{A\cup B}+S_{B\cup C}-S_B-S_{A\cup B\cup C} \geq 0 .

The mutual information is

I(A:B)=SA+SBSAB.I(A:B) = S_A+S_B-S_{A\cup B} .

It measures correlations between AA and BB and is nonnegative. For separated regions in a local QFT, the leading area-law divergences cancel, so I(A:B)I(A:B) is finite.

Holographically, these inequalities become geometric inequalities about minimal areas. This was one of the early signs that the RT formula knows nontrivial quantum-information structure.

It is often useful to write the reduced density matrix as

ρA=eKATreKA,\rho_A = \frac{e^{-K_A}}{\operatorname{Tr}e^{-K_A}},

where KAK_A is the modular Hamiltonian. This is just a definition, but KAK_A is usually highly nonlocal.

There are special cases where KAK_A is local. For the vacuum of a relativistic QFT restricted to a half-space x1>0x^1>0, the modular Hamiltonian is the generator of boosts:

KA=2πx1>0dd1xx1T00(x).K_A = 2\pi\int_{x^1>0} d^{d-1}x\, x^1 T_{00}(x) .

For a spherical ball of radius RR centered at the origin in the vacuum of a CFT,

KA=2πx<Rdd1xR2x22RT00(x).K_A = 2\pi\int_{|\mathbf x|<R} d^{d-1}x\, \frac{R^2-|\mathbf x|^2}{2R}\,T_{00}(x) .

These local formulas are extremely important in holography. They allow one to relate entanglement variations to stress-tensor expectation values, and eventually to gravitational equations in the bulk.

For a small perturbation of the state around a reference density matrix, entanglement entropy satisfies

δSA=δKA.\delta S_A = \delta\langle K_A\rangle .

This is called the first law of entanglement. It is not a dynamical equation; it is a consequence of expanding the von Neumann entropy to first order.

A quick derivation is as follows. Write

KA=logρA(0)K_A = -\log \rho_A^{(0)}

for the modular Hamiltonian of the reference state. For a small perturbation

ρA=ρA(0)+δρA,TrδρA=0,\rho_A = \rho_A^{(0)} + \delta\rho_A, \qquad \operatorname{Tr}\delta\rho_A=0,

one finds

δSA=TrδρAlogρA(0)=TrδρAKA=δKA.\delta S_A = -\operatorname{Tr}\delta\rho_A\log\rho_A^{(0)} = \operatorname{Tr}\delta\rho_A K_A = \delta\langle K_A\rangle .

This formula is a key bridge between entanglement and dynamics. In holography, applying the first law to ball-shaped regions leads to deep relations between linearized Einstein equations and CFT entanglement.

Given two density matrices ρA\rho_A and σA\sigma_A on the same region, the relative entropy is

S(ρAσA)=TrρAlogρATrρAlogσA.S(\rho_A||\sigma_A) = \operatorname{Tr}\rho_A\log\rho_A - \operatorname{Tr}\rho_A\log\sigma_A .

If σA\sigma_A defines a modular Hamiltonian KA(σ)=logσAK_A^{(\sigma)}=-\log\sigma_A up to normalization, then

S(ρAσA)=ΔKA(σ)ΔSA.S(\rho_A||\sigma_A) = \Delta\langle K_A^{(\sigma)}\rangle - \Delta S_A .

Relative entropy is nonnegative:

S(ρAσA)0.S(\rho_A||\sigma_A)\geq 0 .

This inequality is a precise form of the idea that distinguishability cannot increase when we restrict to a subregion. Later, it becomes one of the cleanest ways to connect boundary positivity to bulk energy conditions and gravitational constraints.

CFT spheres and thermal physics on hyperbolic space

Section titled “CFT spheres and thermal physics on hyperbolic space”

For a spherical region in the vacuum of a CFT, there is an especially elegant trick. The causal domain of a ball in flat space is conformally equivalent to

R×Hd1.\mathbb R \times \mathbb H^{d-1} .

Under this conformal map, the reduced density matrix of the ball becomes a thermal density matrix on hyperbolic space at temperature

T=12πR,T = \frac{1}{2\pi R},

where RR is the radius of the ball.

This observation is important for two reasons. First, it gives a field-theory way to understand universal sphere entanglement in CFTs. Second, in holographic CFTs, the corresponding bulk geometry is related to a hyperbolic black hole. The entanglement entropy of the sphere becomes a horizon entropy.

This is one of the cleanest conceptual bridges between entanglement entropy and black-hole entropy.

Entanglement entropy versus thermal entropy

Section titled “Entanglement entropy versus thermal entropy”

Entanglement entropy and thermal entropy are related but distinct.

Entanglement entropy is defined for a spatial subregion. It can be nonzero even in a pure global state. Thermal entropy is the entropy of a mixed thermal state and is extensive in volume.

For small regions in a thermal state, entanglement entropy is dominated by UV entanglement across A\partial A. For very large regions, an extensive thermal contribution can appear. Holographically, this is why RT surfaces in black-brane geometries can run close to the horizon and pick up an area proportional to the region volume.

A useful slogan is:

short-distance entanglementnear-boundary area,thermal entropyhorizon area.\text{short-distance entanglement} \leftrightarrow \text{near-boundary area}, \qquad \text{thermal entropy} \leftrightarrow \text{horizon area}.

Entanglement entropy is hard to compute for several reasons.

The first difficulty is that ρA\rho_A is an operator on a huge Hilbert space. Even in ordinary many-body systems, tracing over AcA^c is computationally expensive.

The second difficulty is that the modular Hamiltonian is usually nonlocal. The expression ρA=eKA/TreKA\rho_A=e^{-K_A}/\operatorname{Tr}e^{-K_A} is formal unless one knows KAK_A.

The third difficulty is UV sensitivity. The leading entropy is divergent and regulator dependent.

The fourth difficulty is real-time dynamics. Time-dependent entanglement requires Lorentzian evolution of density matrices or real-time path integrals.

Holography does not make all of these difficulties disappear. But in the large-NN, classical-bulk limit, it replaces a nonlocal quantum-information computation by a geometric variational problem.

This page gives the boundary-side concepts needed for the geometric dictionary:

QFT conceptHolographic preview
spatial region AAboundary anchor of a bulk surface
entangling surface A\partial Aboundary of the RT surface γA\partial\gamma_A
area-law divergencenear-boundary area divergence
pure-state equality SA=SAcS_A=S_{A^c}homology and complementary surfaces
thermal entropyhorizon-area contribution
replica trickgravitational replica saddle
modular Hamiltoniangenerator related to bulk canonical energy in special cases
relative entropypositive bulk energy/canonical-energy statement

The next page turns the preview into the Ryu–Takayanagi formula.

“Entanglement entropy is finite because the region is finite.”

Section titled ““Entanglement entropy is finite because the region is finite.””

Not in continuum QFT. A finite spatial region still has infinitely many short-distance modes near A\partial A. The entropy is UV divergent unless regulated.

“The area law means entanglement entropy is geometric in any QFT.”

Section titled ““The area law means entanglement entropy is geometric in any QFT.””

The area law is a general short-distance statement, but the RT formula is much stronger. It says that in holographic theories at large NN, the full leading entropy is computed by a dynamical bulk extremal surface area.

“The reduced density matrix always comes from an exact tensor product.”

Section titled ““The reduced density matrix always comes from an exact tensor product.””

This is true for ordinary finite-dimensional tensor-product systems. In continuum QFT and gauge theory, one must regulate or use an algebraic definition. For the purposes of classical holography, the regulated picture captures the essential divergences and universal structures.

“Entanglement entropy and thermal entropy are the same thing.”

Section titled ““Entanglement entropy and thermal entropy are the same thing.””

They can be related, but they are not the same. Entanglement entropy is associated with a subregion. Thermal entropy is associated with a mixed state of the whole system. In holography, both can appear as areas, but of different geometric objects.

“The modular Hamiltonian is usually the physical Hamiltonian.”

Section titled ““The modular Hamiltonian is usually the physical Hamiltonian.””

Usually it is not. The modular Hamiltonian for a general region and state is highly nonlocal. It becomes a simple geometric generator only in special cases, such as half-spaces and spherical regions in the vacuum of a CFT.

Exercise 1: Reduced density matrix of a Bell pair

Section titled “Exercise 1: Reduced density matrix of a Bell pair”

Let

Ψ=12(00+11)|\Psi\rangle = \frac{1}{\sqrt 2}(|00\rangle+|11\rangle)

be a state of two qubits. Trace over the second qubit and compute the entropy of the first.

Solution

The full density matrix is

ρ=ΨΨ=12(0000+0011+1100+1111).\rho = |\Psi\rangle\langle\Psi| =\frac12\left( |00\rangle\langle 00| +|00\rangle\langle 11| +|11\rangle\langle 00| +|11\rangle\langle 11| \right).

Tracing over the second qubit gives

ρ1=Tr2ρ=12(00+11)=12I.\rho_1 = \operatorname{Tr}_2\rho = \frac12\left(|0\rangle\langle0|+|1\rangle\langle1|\right) = \frac12 I .

The eigenvalues are 1/21/2 and 1/21/2, so

S1=2(12log12)=log2.S_1 =-2\left(\frac12\log\frac12\right) =\log 2 .

The global state is pure, but the subsystem is maximally mixed.

Show that

limn111nlogTrρAn=TrρAlogρA.\lim_{n\to 1}\frac{1}{1-n}\log\operatorname{Tr}\rho_A^n = -\operatorname{Tr}\rho_A\log\rho_A .
Solution

Let

f(n)=TrρAn.f(n)=\operatorname{Tr}\rho_A^n .

Since TrρA=1\operatorname{Tr}\rho_A=1, we have f(1)=1f(1)=1. The Rényi entropy is

Sn=logf(n)1n.S_n=\frac{\log f(n)}{1-n}.

Both numerator and denominator vanish as n1n\to1, so use l’Hôpital’s rule:

limn1Sn=limn1f(n)/f(n)1=f(1).\lim_{n\to1}S_n = \lim_{n\to1}\frac{f'(n)/f(n)}{-1} =-f'(1).

But

f(n)=TrρAnlogρA,f'(n)=\operatorname{Tr}\rho_A^n\log\rho_A,

so

f(1)=TrρAlogρA.f'(1)=\operatorname{Tr}\rho_A\log\rho_A .

Therefore

limn1Sn=TrρAlogρA.\lim_{n\to1}S_n =-\operatorname{Tr}\rho_A\log\rho_A .

In a dd-dimensional spacetime QFT, the entangling surface has dimension d2d-2. Explain why the leading divergence should scale as

Area(A)ϵd2.\frac{\operatorname{Area}(\partial A)}{\epsilon^{d-2}} .
Solution

The leading contribution comes from modes localized within a distance of order ϵ\epsilon of the entangling surface. The number of independent cutoff cells along A\partial A is approximately

Area(A)ϵd2,\frac{\operatorname{Area}(\partial A)}{\epsilon^{d-2}},

because A\partial A has dimension d2d-2. If each cell contributes an order-one amount of short-distance entanglement, the leading divergence has the stated form. The coefficient is regulator dependent because the argument depends on the microscopic definition of a cutoff cell.

Let ρ=ρ0+δρ\rho=\rho_0+\delta\rho with Trδρ=0\operatorname{Tr}\delta\rho=0, and define K0=logρ0K_0=-\log\rho_0. Show that

δS=δK0.\delta S = \delta\langle K_0\rangle .
Solution

The entropy is

S(ρ)=Trρlogρ.S(\rho)=-\operatorname{Tr}\rho\log\rho .

To first order around ρ0\rho_0,

δS=Trδρlogρ0Trρ0ρ01δρ.\delta S =-\operatorname{Tr}\delta\rho\log\rho_0 -\operatorname{Tr}\rho_0\rho_0^{-1}\delta\rho .

The second term is Trδρ=0-\operatorname{Tr}\delta\rho=0. Therefore

δS=Trδρlogρ0=TrδρK0=δK0.\delta S =-\operatorname{Tr}\delta\rho\log\rho_0 =\operatorname{Tr}\delta\rho K_0 =\delta\langle K_0\rangle .