Skip to content

Page Curve and Fine-Grained Entropy

The information paradox becomes sharp only after we distinguish two questions that sound similar but are conceptually different.

The first question is thermodynamic: what entropy is carried by the Hawking radiation if we treat it as an approximately thermal flux? This is the entropy computed by the semiclassical Hawking calculation. It grows as more quanta are emitted.

The second question is microscopic: what is the von Neumann entropy of the radiation as a subsystem of a closed quantum system? This is the fine-grained entropy

S(R)=TrρRlogρR,ρR=TrHΨΨ,S(R)=-\operatorname{Tr}\rho_R\log\rho_R, \qquad \rho_R=\operatorname{Tr}_{H}|\Psi\rangle\langle\Psi|,

where RR denotes the Hawking radiation already emitted and HH denotes the remaining black-hole degrees of freedom. If black-hole formation and evaporation are unitary, the total state Ψ|\Psi\rangle is pure at all times. Therefore

S(R)=S(H).S(R)=S(H).

This equality is already enough to explain the qualitative shape of the Page curve. Early in evaporation, the radiation subsystem is small and can be highly mixed. Late in evaporation, the remaining black-hole subsystem is small, so it cannot purify an arbitrarily mixed radiation state. Therefore the fine-grained entropy of the radiation must first rise and then fall.

The result is the Page curve:

Sradfine(t)min{SHawkingcoarse(t),SBH(t)},S_{\rm rad}^{\rm fine}(t) \simeq \min\{S_{\rm Hawking}^{\rm coarse}(t),S_{\rm BH}(t)\},

up to corrections that are subleading compared with SBHS_{\rm BH} in the semiclassical limit. This formula is not yet the island formula. It is the target that the island formula must reproduce. The Page curve is the information-theoretic answer demanded by unitary quantum mechanics, while Hawking’s original calculation gives the semiclassical answer before the late-time correlations are included.

The Page curve compared with Hawking's monotonic entropy growth

The semiclassical Hawking entropy grows monotonically. A unitary evaporation process instead requires the fine-grained radiation entropy to follow the Page curve: it rises until the Page time and then decreases as the remaining black hole becomes the smaller subsystem. The numerical shape is illustrative rather than universal.

We use units with c=kB=1c=k_B=1. We keep GNG_N and \hbar explicit when they clarify scaling. The von Neumann entropy of a density matrix is

S(ρ)=Trρlogρ.S(\rho)=-\operatorname{Tr}\rho\log\rho.

Unless otherwise stated, logarithms are natural logarithms, so entropy is measured in nats. To convert to bits, divide by log2\log 2.

The basic subsystems are:

SymbolMeaning
RRHawking radiation already emitted and collected far from the black hole
HHremaining black-hole degrees of freedom
RearlyR_{\rm early}radiation emitted before some chosen time
RlateR_{\rm late}radiation emitted after that time
BBa newly emitted Hawking mode outside the horizon
CCthe interior partner mode of BB in the semiclassical description
SBHS_{\rm BH}Bekenstein-Hawking entropy A/(4GN)A/(4G_N\hbar)
SHawkingcoarseS_{\rm Hawking}^{\rm coarse}entropy of the radiation computed from the semiclassical Hawking flux
SradfineS_{\rm rad}^{\rm fine}von Neumann entropy of the exact radiation density matrix
tPaget_{\rm Page}time when the fine-grained entropy of radiation reaches its maximum

When RHR\cup H is a complete closed system in a pure state,

S(R)=S(H).S(R)=S(H).

This is a kinematic statement about bipartite pure states. It is not a dynamical assumption about black holes.

Fine-grained versus coarse-grained entropy

Section titled “Fine-grained versus coarse-grained entropy”

The coarse-grained entropy of a gas asks for the entropy after forgetting microscopic data and retaining only macroscopic variables such as energy, volume, and particle number. The fine-grained entropy of a quantum state is instead the von Neumann entropy of the exact density matrix.

For a closed system in a pure state,

ρtot=ΨΨ,S(ρtot)=0.\rho_{\rm tot}=|\Psi\rangle\langle\Psi|, \qquad S(\rho_{\rm tot})=0.

This can be true even when a subsystem has large entropy. If the system factorizes as HAHB\mathcal H_A\otimes\mathcal H_B, then

ρA=TrBΨΨ,ρB=TrAΨΨ,\rho_A=\operatorname{Tr}_B|\Psi\rangle\langle\Psi|, \qquad \rho_B=\operatorname{Tr}_A|\Psi\rangle\langle\Psi|,

and a standard theorem says

S(A)=S(B)S(A)=S(B)

for every pure state on ABA\cup B. Subsystem entropy is entanglement entropy. It measures how much information about AA is stored in BB, not whether the full state is mixed.

This is the first point at which the black-hole problem becomes subtle. The Hawking calculation naturally computes a radiation density matrix after tracing over the interior. It gives a state that looks thermal mode by mode. But unitarity is a statement about the exact global state of the radiation plus whatever degrees of freedom remain. These are not the same question.

The distinction can be summarized as follows:

QuantityWhat it measuresExpected behavior in unitary evaporation
SHawkingcoarse(R)S_{\rm Hawking}^{\rm coarse}(R)entropy of the approximate thermal radiation fluxgrows as more radiation is emitted
Sradfine(R)S_{\rm rad}^{\rm fine}(R)von Neumann entropy of the exact radiation staterises, peaks, and returns to zero
SBH(H)S_{\rm BH}(H)thermodynamic entropy capacity of the remaining black holedecreases during evaporation
S(R:H)S(R:H) or I(R:H)I(R:H)correlations between radiation and remaining holelarge near the Page time

A useful diagnostic is the information deficit

IPage(R)=SHawkingcoarse(R)Sradfine(R).I_{\rm Page}(R) = S_{\rm Hawking}^{\rm coarse}(R)-S_{\rm rad}^{\rm fine}(R).

Before the Page time this is small in the simplest random-state estimate. After the Page time it grows. At the end of unitary evaporation, the radiation may still have a large thermodynamic entropy, but its fine-grained entropy must be zero if the final state is pure.

The Page curve is based on a simple but powerful lesson from random pure states. Let

Htot=HAHB,dimHA=m,dimHB=n,\mathcal H_{\rm tot} = \mathcal H_A\otimes\mathcal H_B, \qquad \dim\mathcal H_A=m, \qquad \dim\mathcal H_B=n,

and choose a pure state Ψ|\Psi\rangle randomly with respect to the Haar measure. Suppose mnm\le n. Page’s result for the average entropy of subsystem AA is

SA=k=n+1mn1km12n.\left\langle S_A\right\rangle = \sum_{k=n+1}^{mn}\frac{1}{k} - \frac{m-1}{2n}.

Equivalently, in terms of harmonic numbers HN=k=1N1/kH_N=\sum_{k=1}^N 1/k,

SA=HmnHnm12n.\left\langle S_A\right\rangle = H_{mn}-H_n-\frac{m-1}{2n}.

For nmn\gg m,

SA=logmm2n+O ⁣(1mn).\left\langle S_A\right\rangle = \log m-\frac{m}{2n}+O\!\left(\frac{1}{mn}\right).

Thus the smaller subsystem is almost maximally mixed:

ρA1m1A,SAlogm.\rho_A\approx \frac{1}{m}\mathbf 1_A, \qquad S_A\approx \log m.

The deviation from maximal entropy,

logmSAm2n,\log m-\left\langle S_A\right\rangle \simeq \frac{m}{2n},

is tiny when the other subsystem is much larger. This is why information can be present in principle but nearly impossible to see in simple observables.

Page theorem for a typical bipartite pure state

For a typical pure state on $\mathcal H_R\otimes\mathcal H_H$, the smaller subsystem is nearly maximally mixed. Applied to black-hole evaporation, this gives the qualitative Page curve once the effective dimensions of radiation and remaining black hole are compared.

The slogan is:

For a typical pure state, the entropy is almost the entropy of the smaller Hilbert space.\boxed{\text{For a typical pure state, the entropy is almost the entropy of the smaller Hilbert space.}}

More precisely, if mnm\le n, then SAlogmS_A\approx \log m. If nmn\le m, then SA=SBlognS_A=S_B\approx \log n. Therefore, for a typical pure state,

SAmin{logm,logn}.S_A \simeq \min\{\log m,\log n\}.

This is a statement about typicality, not about thermal equilibrium in the usual canonical-ensemble sense. It does not say that every state is random. It says that if the dynamics of the black hole scrambles information efficiently enough, then the entropy of a subsystem is well approximated by the entropy of the smaller available Hilbert space.

Now apply this bipartite lesson to an evaporating black hole. The outside observer describes the system as

HtotalHRHH,\mathcal H_{\rm total} \sim \mathcal H_R\otimes\mathcal H_H,

where RR is the radiation already emitted and HH is the remaining black hole. This factorization is a useful effective description, not an exact microscopic statement about gravitational Hilbert spaces. The quantum-gravity lesson is precisely that such factorizations require care. Still, the Page argument is best understood first in this simple language.

The effective dimension of the remaining black-hole Hilbert space is controlled by the Bekenstein-Hawking entropy:

dH(t)eSBH(t).d_H(t) \sim e^{S_{\rm BH}(t)}.

The effective entropy capacity of the emitted radiation is controlled by the coarse-grained entropy of the Hawking flux:

dR(t)eSHawkingcoarse(t).d_R(t) \sim e^{S_{\rm Hawking}^{\rm coarse}(t)}.

If the total state is pure and sufficiently typical, then

Sradfine(t)min{logdR(t),logdH(t)}.S_{\rm rad}^{\rm fine}(t) \simeq \min\{\log d_R(t),\log d_H(t)\}.

Thus

Sradfine(t)min{SHawkingcoarse(t),SBH(t)}.S_{\rm rad}^{\rm fine}(t) \simeq \min\{S_{\rm Hawking}^{\rm coarse}(t),S_{\rm BH}(t)\}.

This is the Page curve.

Entropy budget during unitary black-hole evaporation

The entropy capacity of the radiation grows while the entropy capacity of the remaining black hole decreases. The Page time is the crossover at which the smaller effective Hilbert space changes from radiation to black hole.

Before the Page time,

SHawkingcoarse(t)<SBH(t),S_{\rm Hawking}^{\rm coarse}(t) < S_{\rm BH}(t),

so the radiation is the smaller subsystem and

Sradfine(t)SHawkingcoarse(t).S_{\rm rad}^{\rm fine}(t) \approx S_{\rm Hawking}^{\rm coarse}(t).

The Hawking calculation is then qualitatively correct for the entropy: the radiation looks nearly thermal and the fine-grained entropy grows.

After the Page time,

SBH(t)<SHawkingcoarse(t),S_{\rm BH}(t) < S_{\rm Hawking}^{\rm coarse}(t),

so the remaining black hole is the smaller subsystem and

Sradfine(t)SBH(t).S_{\rm rad}^{\rm fine}(t) \approx S_{\rm BH}(t).

The exact radiation state can no longer be close to an uncorrelated thermal state. It must contain correlations that purify the early radiation. These correlations are not visible in the local semiclassical calculation that treats each outgoing Hawking mode as nearly independent.

For a four-dimensional Schwarzschild black hole,

SBH(M)=A4GN=4πGNM2.S_{\rm BH}(M) = \frac{A}{4G_N\hbar} = \frac{4\pi G_N M^2}{\hbar}.

A dimensional estimate of the evaporation rate gives

dMdtαGN2M2,\frac{dM}{dt} \sim -\frac{\alpha \hbar}{G_N^2 M^2},

where α\alpha is a dimensionless number depending on the particle species and greybody factors. Integrating gives the scaling

M(t)=M0(1ttevap)1/3,M(t) = M_0\left(1-\frac{t}{t_{\rm evap}}\right)^{1/3},

and therefore

SBH(t)=S0(1ttevap)2/3,S0=SBH(0).S_{\rm BH}(t) = S_0\left(1-\frac{t}{t_{\rm evap}}\right)^{2/3}, \qquad S_0=S_{\rm BH}(0).

In the crudest reversible entropy-bookkeeping model,

SHawkingcoarse(t)S0SBH(t).S_{\rm Hawking}^{\rm coarse}(t) \approx S_0-S_{\rm BH}(t).

The Page time is then determined by

SHawkingcoarse(tPage)=SBH(tPage),S_{\rm Hawking}^{\rm coarse}(t_{\rm Page}) = S_{\rm BH}(t_{\rm Page}),

so

SBH(tPage)=S02.S_{\rm BH}(t_{\rm Page}) = \frac{S_0}{2}.

Using the Schwarzschild scaling,

(1tPagetevap)2/3=12,\left(1-\frac{t_{\rm Page}}{t_{\rm evap}}\right)^{2/3} = \frac{1}{2},

hence

tPage=(123/2)tevap0.646tevap.t_{\rm Page} = \left(1-2^{-3/2}\right)t_{\rm evap} \approx 0.646\,t_{\rm evap}.

This number is not universal. The actual thermodynamic entropy carried away by the radiation can be written schematically as

SHawkingcoarse(t)β[S0SBH(t)],S_{\rm Hawking}^{\rm coarse}(t) \approx \beta\,[S_0-S_{\rm BH}(t)],

where β\beta is an order-one coefficient that depends on the evaporation channel, greybody factors, and what one counts as radiation. Then the Page-time condition becomes

β[S0SBH(tPage)]=SBH(tPage),\beta\,[S_0-S_{\rm BH}(t_{\rm Page})] = S_{\rm BH}(t_{\rm Page}),

or

SBH(tPage)=β1+βS0.S_{\rm BH}(t_{\rm Page}) = \frac{\beta}{1+\beta}S_0.

The robust statement is not the precise numerical coefficient. The robust statement is that the Page time occurs when the entropy capacity of the remaining black hole becomes comparable to the entropy capacity of the emitted radiation.

A common misleading phrase is “information begins to come out after the Page time.” This is almost right, but it needs refinement.

In a unitary theory, information was never destroyed. The exact final state is determined by the exact initial state. What changes at the Page time is the accessibility of information from the radiation subsystem.

A useful measure is the difference between the coarse-grained entropy and the fine-grained entropy:

IPage(R)=SHawkingcoarse(R)Sradfine(R).I_{\rm Page}(R) = S_{\rm Hawking}^{\rm coarse}(R) - S_{\rm rad}^{\rm fine}(R).

At early times, in the random-state approximation,

IPage(R)1I_{\rm Page}(R) \ll 1

even though the exact state already contains correlations. These correlations are spread over a huge Hilbert space and are not visible in simple few-body observables. After the Page time,

IPage(R)SHawkingcoarse(R)SBH(t),I_{\rm Page}(R) \sim S_{\rm Hawking}^{\rm coarse}(R)-S_{\rm BH}(t),

so the deficit from thermality becomes macroscopic.

Another useful quantity is mutual information. For two systems AA and BB,

I(A:B)=S(A)+S(B)S(AB).I(A:B)=S(A)+S(B)-S(AB).

For a pure state on RHR\cup H,

S(RH)=0,S(R)=S(H),S(RH)=0, \qquad S(R)=S(H),

so

I(R:H)=2S(R).I(R:H)=2S(R).

Near the Page time this mutual information is large. The radiation and remaining black hole are strongly entangled. Near the end of unitary evaporation, HH becomes trivial and S(R)0S(R)\to0: all the purification must be inside the radiation itself.

This is the sense in which the late radiation cannot be a product of independent thermal emissions. A state can look thermal in every simple local measurement and still be globally pure because the purification is stored in very high-order correlations.

Why the Hawking curve is not the Page curve

Section titled “Why the Hawking curve is not the Page curve”

The semiclassical Hawking calculation treats quantum fields on a classical black-hole background. It predicts that each outgoing mode BB is entangled mainly with an interior partner CC. Repeating this process gives an approximate product structure

ρRHawkingiρithermal,\rho_R^{\rm Hawking} \approx \bigotimes_i \rho_i^{\rm thermal},

and therefore a radiation entropy that grows as more modes are emitted:

SHawkingcoarse(R1R2)iS(ρithermal).S_{\rm Hawking}^{\rm coarse}(R_1\cup R_2\cup\cdots) \approx \sum_i S(\rho_i^{\rm thermal}).

This is the Hawking curve. It is the right answer for a coarse-grained calculation that ignores subtle correlations among radiation quanta. It is not the right answer for the exact fine-grained entropy if evaporation is unitary.

The contradiction becomes sharp after the Page time. The Hawking calculation says that a newly emitted mode BB is purified by its interior partner CC. But the Page argument says that, for an old black hole, the new radiation must be correlated with the early radiation RR in order for S(R)S(R) to decrease. A single quantum system cannot be independently maximally entangled with two different systems. This is the entropy bookkeeping behind the firewall paradox discussed in the previous page.

The Page curve therefore does two things:

  1. It says what unitary evaporation should look like at the level of entropy.
  2. It exposes where the semiclassical Hawking description must be incomplete.

The island formula and replica wormholes will later explain how a gravitational entropy calculation can reproduce this curve.

Hayden-Preskill: old black holes as mirrors

Section titled “Hayden-Preskill: old black holes as mirrors”

The Page curve also changes the operational question: after the Page time, how quickly can information thrown into a black hole come back out?

Suppose the black hole is old, meaning that more than the Page-time amount of radiation has already escaped. The remaining black hole HH is then smaller than its purifier, the early radiation RearlyR_{\rm early}. Alice throws in a kk-qubit message MM. If the black-hole dynamics is well modeled by a rapidly scrambling unitary, then after a scrambling time the message is mixed throughout the black-hole degrees of freedom.

Hayden and Preskill argued that Bob, who possesses the early radiation and then collects a modest amount of new Hawking radiation, can in principle decode Alice’s message. In the ideal random-unitary model, the new radiation need only contain slightly more than kk qubits, up to overheads and errors. The hard part is not that the information is absent. The hard part is decoding it.

Hayden-Preskill mirror intuition for an old black hole

For an old black hole, the early radiation is essential side information. After scrambling, a newly thrown-in message can be recovered in principle from the early radiation plus a relatively small amount of additional Hawking radiation.

The relevant scrambling time is often estimated as

tβH2πlogSBH,t_* \sim \frac{\beta_H}{2\pi}\log S_{\rm BH},

where βH=1/TH\beta_H=1/T_H. This is the fast-scrambler scaling. The precise coefficient and the operational decoding complexity are separate questions. The Page-curve lesson is simply that old black holes have a very different information-theoretic behavior from young black holes.

The Page curve is not itself a gravitational derivation. It is an information-theoretic benchmark.

A complete gravitational calculation should explain why the fine-grained radiation entropy is approximately

Sradfine(t)SHawkingcoarse(t)S_{\rm rad}^{\rm fine}(t) \approx S_{\rm Hawking}^{\rm coarse}(t)

before the Page time, but approximately

Sradfine(t)SBH(t)S_{\rm rad}^{\rm fine}(t) \approx S_{\rm BH}(t)

after the Page time.

The modern answer uses quantum extremal surfaces and islands. In its simplest schematic form, the island rule says

S(R)=minIextI[Area(I)4GN+Smatter(RI)].S(R) = \min_I\,\operatorname{ext}_I \left[ \frac{\operatorname{Area}(\partial I)}{4G_N\hbar} + S_{\rm matter}(R\cup I) \right].

Before the Page time, the dominant saddle has no island, so the result agrees with the Hawking calculation. After the Page time, a new saddle with an island dominates, and the generalized entropy is controlled by an area term comparable to the remaining black-hole entropy. Thus the island formula gives a semiclassical gravitational mechanism for the Page curve.

That statement is profound, but it should not be read too quickly. The Page curve tells us the answer expected from unitarity. The island formula tells us how that answer emerges from a gravitational entropy prescription. Replica wormholes explain why the island saddle appears in the gravitational replica path integral. These are logically distinct steps.

Pitfall 1: “The Page curve is the entropy of the black hole.”
No. The Page curve is usually the fine-grained entropy of the radiation subsystem. It equals the entropy of the remaining black hole only when the combined system is pure.

Pitfall 2: “Hawking radiation is not thermal after the Page time.”
Locally it may still look very close to thermal. The point is that the exact global radiation state cannot be a product of independent thermal states. The purification is hidden in correlations.

Pitfall 3: “The Page theorem proves black-hole unitarity.”
It does not. Page’s argument assumes unitary evaporation and typicality. It derives the expected entropy curve under those assumptions.

Pitfall 4: “The Page time is exactly halfway through evaporation.”
Not in general. It is roughly the time when the radiation entropy capacity and remaining black-hole entropy capacity are comparable. The fraction of the lifetime depends on the evaporation model.

Pitfall 5: “The late radiation alone contains the message.”
In the Hayden-Preskill setup, the early radiation is crucial side information. Without it, the new radiation by itself generally does not reveal the message efficiently.

Pitfall 6: “A decreasing entropy means radiation disappears.”
No. More radiation is emitted, so the coarse-grained entropy can keep increasing. The fine-grained entropy decreases because newly emitted quanta purify correlations already present in the earlier radiation.

Exercise 1: Fine-grained entropy of a Bell pair

Section titled “Exercise 1: Fine-grained entropy of a Bell pair”

Consider the two-qubit Bell state

Φ+=12(00+11).|\Phi^+\rangle = \frac{1}{\sqrt 2} \left(|00\rangle+|11\rangle\right).

Compute the entropy of the full two-qubit state and the entropy of the first qubit.

Solution

The full state is pure:

ρAB=Φ+Φ+,\rho_{AB}=|\Phi^+\rangle\langle\Phi^+|,

so

S(AB)=0.S(AB)=0.

Tracing over the second qubit gives

ρA=TrBρAB=12(00+11)=1A2.\rho_A = \operatorname{Tr}_B\rho_{AB} = \frac{1}{2} \left(|0\rangle\langle0|+|1\rangle\langle1|\right) = \frac{\mathbf 1_A}{2}.

Therefore

S(A)=TrρAlogρA=(212log12)=log2.S(A) = -\operatorname{Tr}\rho_A\log\rho_A = -\left(2\cdot \frac12\log\frac12\right) = \log 2.

This is the simplest example of the Page-curve principle: a globally pure state can have mixed subsystems.

Exercise 2: The large-nn limit of Page’s formula

Section titled “Exercise 2: The large-nnn limit of Page’s formula”

Assume mnm\le n and

SA=HmnHnm12n.\left\langle S_A\right\rangle = H_{mn}-H_n-\frac{m-1}{2n}.

Using

HN=logN+12N+O(N2),H_N=\log N+\frac{1}{2N}+O(N^{-2}),

show that for nmn\gg m,

SA=logmm2n+O ⁣(1mn).\left\langle S_A\right\rangle = \log m-\frac{m}{2n}+O\!\left(\frac{1}{mn}\right).
Solution

Use the asymptotic expansion:

Hmn=log(mn)+12mn+O((mn)2),H_{mn} = \log(mn)+\frac{1}{2mn}+O((mn)^{-2}),

and

Hn=logn+12n+O(n2).H_n = \log n+\frac{1}{2n}+O(n^{-2}).

Then

HmnHn=logm+12mn12n+O(n2).H_{mn}-H_n = \log m+\frac{1}{2mn}-\frac{1}{2n}+O(n^{-2}).

Subtracting the Page correction gives

SA=logm+12mn12nm12n+O(n2).\left\langle S_A\right\rangle = \log m+\frac{1}{2mn}-\frac{1}{2n}-\frac{m-1}{2n}+O(n^{-2}).

The two terms proportional to 1/(2n)1/(2n) combine as

12nm12n=m2n.-\frac{1}{2n}-\frac{m-1}{2n} = -\frac{m}{2n}.

Thus

SA=logmm2n+12mn+O(n2).\left\langle S_A\right\rangle = \log m-\frac{m}{2n}+\frac{1}{2mn}+O(n^{-2}).

For nmn\gg m, the leading correction is

SA=logmm2n+O ⁣(1mn),\left\langle S_A\right\rangle = \log m-\frac{m}{2n}+O\!\left(\frac{1}{mn}\right),

up to terms smaller in the relevant hierarchy.

Exercise 3: A toy Page curve from Hilbert-space dimensions

Section titled “Exercise 3: A toy Page curve from Hilbert-space dimensions”

Let

dR(t)=eγt,dH(t)=eS0γt,d_R(t)=e^{\gamma t}, \qquad d_H(t)=e^{S_0-\gamma t},

for 0tS0/γ0\le t\le S_0/\gamma. Assume the total state is a typical pure state on HRHH\mathcal H_R\otimes\mathcal H_H. Find S(R)S(R) and the Page time.

Solution

For a typical pure state,

S(R)min{logdR,logdH}.S(R) \simeq \min\{\log d_R,\log d_H\}.

Here

logdR=γt,logdH=S0γt.\log d_R=\gamma t, \qquad \log d_H=S_0-\gamma t.

Therefore

S(R)min{γt,S0γt}.S(R) \simeq \min\{\gamma t,S_0-\gamma t\}.

The Page time is the crossing point:

γtPage=S0γtPage.\gamma t_{\rm Page} = S_0-\gamma t_{\rm Page}.

Thus

tPage=S02γ.t_{\rm Page} = \frac{S_0}{2\gamma}.

The maximum entropy is

Smax=S02.S_{\rm max} = \frac{S_0}{2}.

This toy model gives a triangular Page curve. A realistic evaporating black hole has a smoother curve because SBH(t)S_{\rm BH}(t) is not linear in time.

Exercise 4: Schwarzschild Page time in the reversible estimate

Section titled “Exercise 4: Schwarzschild Page time in the reversible estimate”

Assume a four-dimensional Schwarzschild black hole has

SBH(t)=S0(1ttevap)2/3.S_{\rm BH}(t) = S_0\left(1-\frac{t}{t_{\rm evap}}\right)^{2/3}.

In the reversible estimate,

SHawkingcoarse(t)=S0SBH(t).S_{\rm Hawking}^{\rm coarse}(t) = S_0-S_{\rm BH}(t).

Find tPaget_{\rm Page}.

Solution

The Page time is determined by

SHawkingcoarse(tPage)=SBH(tPage).S_{\rm Hawking}^{\rm coarse}(t_{\rm Page}) = S_{\rm BH}(t_{\rm Page}).

Using the assumed reversible estimate,

S0SBH(tPage)=SBH(tPage).S_0-S_{\rm BH}(t_{\rm Page}) = S_{\rm BH}(t_{\rm Page}).

Thus

SBH(tPage)=S02.S_{\rm BH}(t_{\rm Page}) = \frac{S_0}{2}.

Substitute the Schwarzschild scaling:

S0(1tPagetevap)2/3=S02.S_0\left(1-\frac{t_{\rm Page}}{t_{\rm evap}}\right)^{2/3} = \frac{S_0}{2}.

Cancel S0S_0:

(1tPagetevap)2/3=21.\left(1-\frac{t_{\rm Page}}{t_{\rm evap}}\right)^{2/3} = 2^{-1}.

Raise both sides to the power 3/23/2:

1tPagetevap=23/2.1-\frac{t_{\rm Page}}{t_{\rm evap}} = 2^{-3/2}.

Therefore

tPage=(123/2)tevap0.646tevap.t_{\rm Page} = \left(1-2^{-3/2}\right)t_{\rm evap} \approx 0.646\,t_{\rm evap}.

The precise numerical fraction changes if the emitted radiation entropy is not equal to the decrease of SBHS_{\rm BH}.

Exercise 5: Mutual information between radiation and remaining black hole

Section titled “Exercise 5: Mutual information between radiation and remaining black hole”

Suppose RHR\cup H is pure. Show that

I(R:H)=2S(R).I(R:H)=2S(R).

What does this imply near the Page time?

Solution

By definition,

I(R:H)=S(R)+S(H)S(RH).I(R:H) = S(R)+S(H)-S(RH).

Since RHR\cup H is pure,

S(RH)=0.S(RH)=0.

Also, for a bipartite pure state,

S(R)=S(H).S(R)=S(H).

Therefore

I(R:H)=S(R)+S(R)=2S(R).I(R:H) = S(R)+S(R) = 2S(R).

Near the Page time, S(R)S(R) is of order the black-hole entropy scale, so I(R:H)I(R:H) is also large. The radiation and remaining black hole are strongly correlated. After the black hole disappears in a unitary process, HH becomes trivial and S(R)0S(R)\to0.

Exercise 6: Why early radiation is side information

Section titled “Exercise 6: Why early radiation is side information”

In the Hayden-Preskill setup, assume Alice throws a kk-qubit message into an old black hole. Explain why Bob needs the early radiation RER_E as side information. Why is the newly emitted radiation alone not enough in general?

Solution

For an old black hole, the remaining black hole HH is already highly entangled with the early radiation RER_E. The message thrown into the black hole is scrambled into HH, but HH is not an isolated memory system. Its purifier is outside, in RER_E.

After scrambling, a small amount of new radiation RNR_N can carry enough information to complete the decoding, but only for an observer who also has access to the purifying side information RER_E. Without RER_E, the newly emitted radiation is generally still highly mixed and does not by itself contain a clean copy of Alice’s message.

This is how the mirror analogy avoids a naive contradiction. The black hole is not simply returning the message as ordinary visible radiation. Rather, the message is recoverable from a highly nonlocal joint system:

RERN.R_E\cup R_N.

The decoding operation may be extremely complicated even when recovery is possible in principle.