Skip to content

Fast Scrambling and Hayden–Preskill Recovery

The Page curve is a statement about entropy. It says that if black hole evaporation is unitary, then the fine-grained entropy of the radiation cannot keep increasing forever. But the Page curve by itself does not tell us how information comes out, when a newly injected message becomes recoverable, or what kind of observer could recover it.

The Hayden–Preskill thought experiment answers a sharper question. Suppose a black hole is already old, so that it is highly entangled with the radiation it emitted earlier. Alice throws a small quantum system into the black hole. Bob has collected the early radiation and continues to collect the newly emitted Hawking quanta. How long must Bob wait before Alice’s quantum state is recoverable from the radiation?

The striking answer is that an old black hole behaves like a quantum information mirror. After the message has been scrambled, Bob needs to collect only slightly more radiation entropy than the entropy of the message. The delay is not the Page time. It is approximately the scrambling time,

tβ2πlogSBH,t_*\sim {\beta\over 2\pi}\log S_{\rm BH},

plus the time required to radiate the message-size number of quanta.

This page explains what that statement means, why it depends on the black hole being old, and why it does not by itself imply that decoding Hawking radiation is easy.

Why should an old black hole return newly thrown-in quantum information quickly, while information thrown into a young black hole remains hidden until near the Page time?

The answer has two ingredients.

First, an old black hole is already entangled with a large external system: the early radiation. Second, black hole dynamics is expected to scramble information rapidly. Scrambling makes the message inaccessible to any small part of the remaining black hole, while the pre-existing entanglement with the early radiation allows the message to be reconstructed from early radiation plus a small amount of late radiation.

The Hayden-Preskill protocol

The Hayden–Preskill protocol. A message system AA, purified by a reference QQ, is thrown into an old black hole BB that is already entangled with early radiation EE. The black hole dynamics is modeled by a scrambling unitary UABU_{AB}. After the output is divided into late radiation DD and the remaining black hole CC, Bob tries to recover the message from EDE\cup D. Recovery is possible when QQ is decoupled from CC.

A black hole is an efficient thermalizer: low-point observables of matter thrown into the horizon relax quickly. But thermalization and scrambling are not the same.

Thermalization means that simple observables forget detailed initial data. For example, a perturbation of a thermal state may decay on a timescale

tdissβ,t_{\rm diss}\sim \beta,

where

β=1TH\beta={1\over T_H}

is the inverse Hawking temperature.

Scrambling means that initially localized quantum information has spread over so many degrees of freedom that no small subsystem contains a useful record of it. Equivalently, the information is hidden in high-order correlations among many degrees of freedom. Scrambling is therefore a statement about information and entanglement, not merely about expectation values of a few simple operators.

A useful diagnostic is the growth of an out-of-time-order commutator. For two simple operators VV and WW, define schematically

C(t)=[W(t),V(0)][W(t),V(0)]β.C(t)=\left\langle [W(t),V(0)]^\dagger [W(t),V(0)]\right\rangle_\beta.

In a large-NN chaotic system with a holographic black hole dual, one often finds an early exponential regime

C(t)1SBHeλLt,C(t)\sim {1\over S_{\rm BH}}e^{\lambda_L t},

until C(t)C(t) becomes order one. The coefficient 1/SBH1/S_{\rm BH} reflects the fact that one simple perturbation initially affects only a small fraction of the black hole degrees of freedom. The Lyapunov exponent is bounded by

λL2πβ,\lambda_L\leq {2\pi\over \beta},

and Einstein gravity black holes saturate this bound in the classical low-energy regime. Setting C(t)1C(t_*)\sim 1 gives

t1λLlogSBHβ2πlogSBHt_*\sim {1\over \lambda_L}\log S_{\rm BH} \sim {\beta\over 2\pi}\log S_{\rm BH}

for a maximally chaotic black hole.

The important hierarchy for a macroscopic evaporating black hole is therefore

tdissttPage.t_{\rm diss}\ll t_*\ll t_{\rm Page}.

The first inequality says that local equilibration happens before full scrambling. The second says that scrambling is fast compared with evaporation.

The hierarchy of black hole timescales

The hierarchy of timescales for a large evaporating black hole. Local relaxation occurs on a thermal timescale tdissβt_{\rm diss}\sim \beta. Scrambling occurs on t(β/2π)logSBHt_*\sim (\beta/2\pi)\log S_{\rm BH} for a maximally chaotic black hole. The Page time is parametrically longer: for an evaporating Schwarzschild black hole, tPaget_{\rm Page} is of order the evaporation time, while t/tPage(logSBH)/SBHt_*/t_{\rm Page}\sim (\log S_{\rm BH})/S_{\rm BH} up to constants.

The thought experiment uses a clean quantum-information model of an old black hole.

Let AA be the message system Alice throws in. To track whether the message is preserved, purify it by an external reference system QQ:

ψQAHQHA,dimHQ=dimHA=dA.|\psi\rangle_{QA}\in \mathcal H_Q\otimes \mathcal H_A, \qquad \dim \mathcal H_Q=\dim \mathcal H_A=d_A.

The reference QQ never falls into the black hole. It is a bookkeeper. If Bob can reconstruct AA, then Bob can restore the entanglement between AA and QQ.

Let BB denote the old black hole before the message enters, and let EE denote the early Hawking radiation. The old-black-hole assumption is that BB is highly entangled with EE. In the idealized model one takes

ΦBE|\Phi\rangle_{BE}

to be nearly maximally entangled on the relevant code subspace.

The message and black hole then undergo a unitary scrambling dynamics

UAB:HAHBHCHD,U_{AB}:\mathcal H_A\otimes\mathcal H_B \longrightarrow \mathcal H_C\otimes\mathcal H_D,

where DD is the newly emitted Hawking radiation and CC is the remaining black hole. The total state is pure on

QCDE.Q\otimes C\otimes D\otimes E.

Bob has access to EE and DD. He wants a recovery map

REDA\mathcal R_{ED\to A'}

such that AA' is entangled with QQ in the same way that AA originally was.

This is a precise way to ask whether Alice’s quantum state has come out in the Hawking radiation.

The central principle is simple:

Bob can recover the message from EDE\cup D when the reference QQ is decoupled from the remaining black hole CC.

Mathematically, the decoupling condition is

ρQCρQρC.\rho_{QC}\simeq \rho_Q\otimes \rho_C.

Equivalently,

I(Q:C)0,I(Q:C)\simeq 0,

where

I(X:Y)=S(X)+S(Y)S(XY)I(X:Y)=S(X)+S(Y)-S(XY)

is mutual information.

Why is this the right condition? Since the full system QCDEQCDE is pure,

I(Q:DE)=2S(Q)I(Q:C).I(Q:DE)=2S(Q)-I(Q:C).

Thus if I(Q:C)0I(Q:C)\simeq 0, then

I(Q:DE)2S(Q).I(Q:DE)\simeq 2S(Q).

All the entanglement of the reference QQ is carried by EDE\cup D. By Uhlmann’s theorem, or equivalently by the decoupling theorem, there exists a recovery operation on EDE\cup D that reconstructs a system AA' purified by QQ.

Decoupling and Hayden-Preskill recovery

Decoupling is the information-theoretic core of Hayden–Preskill recovery. If the reference QQ has negligible mutual information with the remaining black hole CC, then in the pure state on QCDEQ C D E the purifier of QQ must lie in DED\cup E. The message is not locally visible in a single Hawking quantum; it is encoded in correlations of the radiation.

The dimension counting is the heart of the result.

For a Haar-random scrambling unitary UABU_{AB}, the expected decoupling error in the old-black-hole limit scales schematically as

EUρQCρQρC1dAdD,\mathbb E_U\left\|\rho_{QC}-\rho_Q\otimes\rho_C\right\|_1 \lesssim {d_A\over d_D},

up to constants and small corrections associated with imperfect maximal entanglement between BB and EE. Therefore Bob can recover the message with error at most ϵ\epsilon once

dDdAϵ.d_D\gtrsim {d_A\over \epsilon}.

For a message of kk qubits,

dA=2k.d_A=2^k.

If the late radiation contains mm qubits,

dD=2m.d_D=2^m.

The condition becomes

mk+log21ϵ.m\gtrsim k+\log_2{1\over \epsilon}.

So an old black hole returns a kk-qubit message after it has radiated only slightly more than kk qubits, provided the message has first been scrambled.

This does not mean that the message bounces off the horizon like a classical wave. The message first becomes delocalized in the black hole’s microscopic degrees of freedom. Because the black hole is already entangled with the early radiation, the new outgoing radiation can serve as the final missing share needed for reconstruction.

A compact slogan is:

old black hole+scrambling+early radiationrapid recoverability\boxed{ \text{old black hole} + \text{scrambling} + \text{early radiation} \quad\Longrightarrow\quad \text{rapid recoverability} }

The retention time is therefore approximately

trett+temit(k),t_{\rm ret}\sim t_*+t_{\rm emit}(k),

where temit(k)t_{\rm emit}(k) is the time required to emit of order kk qubits of radiation entropy. For a small message thrown into a large black hole, temit(k)t_{\rm emit}(k) is usually much shorter than the Page time.

Now remove the early radiation. If EE is absent or too small, Bob cannot use a large entangled partner of the black hole as part of his decoder. In that case, a small amount of late radiation DD is not enough. The purifier of the reference QQ remains mostly in the remaining black hole CC.

This is the same physics as the Page curve. For a young black hole, the radiation Hilbert space is still the smaller subsystem. A typical unitary evaporation process makes the radiation almost maximally mixed, and information about the initial state remains inaccessible from the radiation alone. Only after the Page time does the accumulated radiation have enough Hilbert-space capacity and enough entanglement structure to purify new outgoing quanta.

Thus the Hayden–Preskill result is not in tension with the Page curve. It refines it:

  • information thrown in before the black hole is old remains hidden until roughly the Page time;
  • information thrown into an old black hole can be recovered after a scrambling-time delay, assuming access to the early radiation.

The word “old” is doing real work.

The Hayden–Preskill model can be viewed as a noisy quantum channel. The input is the message AA, the channel is the black hole unitary followed by the split into CC and DD, and the decoder is allowed to use the side information EE.

Without side information, the radiation channel may have almost no capacity at early times. With side information, the channel can have nearly optimal capacity once the black hole is old. The early radiation EE is not merely a pile of previous quanta; it is an entangled resource.

This is why the result is often described using quantum error correction. The black hole dynamics encodes Alice’s message into a larger system. The remaining black hole CC is an erased subsystem. Bob succeeds if the encoded message is correctable after losing CC, using DED\cup E as the surviving physical system.

The decoupling condition

I(Q:C)0I(Q:C)\simeq 0

is exactly the condition that the erased subsystem CC contains no information about the logical input.

Hayden–Preskill recovery is an information-theoretic statement. It says that a recovery map exists under idealized assumptions. Bob’s job is not easy.

Bob must:

  1. collect the early Hawking radiation coherently;
  2. continue collecting the appropriate late Hawking radiation;
  3. know the relevant black hole dynamics well enough to construct a decoder;
  4. perform a highly nonlocal quantum operation on the radiation.

The last two requirements are severe. A theorem can say that a message is encoded in the radiation without giving an efficient algorithm for extracting it.

This distinction becomes crucial in the firewall discussion. Harlow and Hayden argued that the decoding operations needed for certain AMPS-style experiments are generically so computationally hard that they cannot be completed before the black hole evaporates. That argument does not deny that the information is present in the radiation. It separates information-theoretic recoverability from efficient recoverability.

There are also idealized models in which explicit decoders can be built. For example, decoders inspired by Yoshida and Kitaev use time reversal, auxiliary entanglement, teleportation-like circuits, and scrambling diagnostics. These constructions are extremely useful as models of the protocol, but they should not be confused with a practical recipe for decoding the Hawking radiation of an astrophysical black hole.

Relation to complementarity and no-cloning

Section titled “Relation to complementarity and no-cloning”

At first sight the mirror behavior sounds dangerous. If Alice jumps in carrying a qubit and Bob later reconstructs that qubit from the radiation, has quantum mechanics produced two copies?

The careful answer is no, at least not in the operational sense used by black hole complementarity. Bob’s reconstruction requires waiting outside, collecting radiation, and applying a nonlocal decoder. Alice’s interior measurement is made along an infalling worldline. No single observer can both verify Alice’s interior copy and complete Bob’s exterior decoding in the simple semiclassical regime where the thought experiment is formulated.

The Hayden–Preskill result nevertheless sharpens the tension. It shows that after the Page time, unitarity wants late radiation to be strongly correlated with early radiation. Semiclassical effective field theory near the horizon wants each outgoing Hawking mode to be entangled with an interior partner. The next page, on complementarity and firewalls, turns this tension into the AMPS monogamy argument.

Relation to islands and entanglement wedges

Section titled “Relation to islands and entanglement wedges”

In modern language, Hayden–Preskill recovery is an ancestor of the island story.

After the Page time, the radiation is not merely an external record of emitted quanta. In holographic and semiclassical island calculations, the entanglement wedge of the radiation can include an island inside the gravitating region. Operators in that island are reconstructible from the radiation in the same broad sense that Hayden–Preskill says a thrown-in message is reconstructible from EDE\cup D.

The two languages emphasize different aspects:

Hayden–Preskill languageIsland / entanglement-wedge language
Random unitary scramblingSemiclassical geometry plus QES extremization
Early radiation as side informationRadiation entanglement wedge includes an island
Decoupling from remaining hole CCReconstruction from the radiation wedge
Recovery after tt_* for old holesIsland appears after the Page transition

Neither language says that the message is locally visible in an individual Hawking quantum. The information is encoded nonlocally, and recovering it requires the correct global operation.

For a young black hole:

small DI(Q:D)0.\text{small }D\quad\Rightarrow\quad I(Q:D)\simeq 0.

For an old black hole with early radiation EE:

small D after scramblingI(Q:DE)2S(Q).\text{small }D\text{ after scrambling} \quad\Rightarrow\quad I(Q:DE)\simeq 2S(Q).

The difference is the entangled side information EE.

The delay is the scrambling time:

tβ2πlogSBH.t_*\sim {\beta\over 2\pi}\log S_{\rm BH}.

The cost is decoding complexity, which can be enormous.

“The message comes out before it enters.”
No. The recovery is possible only after the message has interacted with the black hole dynamics and after enough late radiation has been emitted. The minimal delay is controlled by scrambling.

“A single late Hawking quantum contains the message.”
No. The message is encoded in correlations among early radiation, late radiation, and the black hole state. A small subsystem by itself generally contains no useful information.

“Fast scrambling means fast decoding.”
No. Scrambling is a property of the black hole dynamics. Decoding is an operation performed by Bob on the radiation. The first can be fast while the second is computationally prohibitive.

“The old black hole literally reflects information from the horizon.”
No. The mirror is a quantum-information metaphor. The message is mixed into the black hole degrees of freedom and becomes recoverable because the black hole is already entangled with early radiation.

“Hayden–Preskill solves the information paradox by itself.”
Not quite. It assumes unitary black hole dynamics and rapid scrambling, then derives consequences for recovery. It does not derive unitarity from semiclassical gravity. The island and replica-wormhole developments address a different question: how the gravitational entropy calculation itself changes.

Suppose an out-of-time-order commutator grows as

C(t)1SeλLtC(t)\sim {1\over S}e^{\lambda_L t}

until it becomes order one. Derive the scrambling time. What is tt_* if the chaos bound is saturated?

Solution

Scrambling occurs when

C(t)1.C(t_*)\sim 1.

Using the assumed growth,

1SeλLt1.{1\over S}e^{\lambda_L t_*}\sim 1.

Therefore

eλLtS,e^{\lambda_L t_*}\sim S,

and hence

t1λLlogS.t_*\sim {1\over \lambda_L}\log S.

If the chaos bound is saturated,

λL=2πβ,\lambda_L={2\pi\over \beta},

so

tβ2πlogS.t_*\sim {\beta\over 2\pi}\log S.

For a black hole one identifies SS with SBHS_{\rm BH} up to order-one conventions.

In four-dimensional asymptotically flat Schwarzschild evaporation, use units with GN==c=kB=1G_N=\hbar=c=k_B=1. The inverse temperature and entropy scale as

βM,SBHM2.\beta\sim M, \qquad S_{\rm BH}\sim M^2.

The evaporation time scales as

tevapM3.t_{\rm evap}\sim M^3.

Estimate t/tevapt_*/t_{\rm evap} for a large black hole.

Solution

The scrambling time is

tβlogSBH.t_*\sim \beta\log S_{\rm BH}.

Using

βM,SBHM2,\beta\sim M, \qquad S_{\rm BH}\sim M^2,

we get

tMlogM22MlogM.t_*\sim M\log M^2\sim 2M\log M.

Since

tevapM3,t_{\rm evap}\sim M^3,

we find

ttevaplogMM2.{t_*\over t_{\rm evap}} \sim {\log M\over M^2}.

Equivalently, since SBHM2S_{\rm BH}\sim M^2,

ttevaplogSBHSBH{t_*\over t_{\rm evap}}\sim {\log S_{\rm BH}\over S_{\rm BH}}

up to constants. For a macroscopic black hole this is parametrically tiny.

Let QCDEQCDE be a pure state. Show that

I(Q:DE)=2S(Q)I(Q:C).I(Q:DE)=2S(Q)-I(Q:C).

Explain why I(Q:C)0I(Q:C)\simeq 0 implies that the purifier of QQ lies in DEDE.

Solution

The mutual information between QQ and DEDE is

I(Q:DE)=S(Q)+S(DE)S(QDE).I(Q:DE)=S(Q)+S(DE)-S(QDE).

Since QCDEQCDE is pure,

S(DE)=S(QC),S(QDE)=S(C).S(DE)=S(QC), \qquad S(QDE)=S(C).

Thus

I(Q:DE)=S(Q)+S(QC)S(C).I(Q:DE)=S(Q)+S(QC)-S(C).

Now use

I(Q:C)=S(Q)+S(C)S(QC),I(Q:C)=S(Q)+S(C)-S(QC),

which implies

S(QC)=S(Q)+S(C)I(Q:C).S(QC)=S(Q)+S(C)-I(Q:C).

Substituting gives

I(Q:DE)=S(Q)+S(Q)+S(C)I(Q:C)S(C),I(Q:DE)=S(Q)+S(Q)+S(C)-I(Q:C)-S(C),

so

I(Q:DE)=2S(Q)I(Q:C).I(Q:DE)=2S(Q)-I(Q:C).

If I(Q:C)0I(Q:C)\simeq 0, then

I(Q:DE)2S(Q),I(Q:DE)\simeq 2S(Q),

which is the maximum possible mutual information between QQ and another system when QQ is purified by that system. Therefore the entanglement of QQ is carried by DEDE. More precisely, Uhlmann’s theorem implies the existence of an isometry, or recovery map, acting on DEDE that reconstructs a system entangled with QQ as the original message was.

Exercise 4. Qubit counting in the old-black-hole limit

Section titled “Exercise 4. Qubit counting in the old-black-hole limit”

Assume the decoupling error scales as

ϵdecdAdD.\epsilon_{\rm dec}\sim {d_A\over d_D}.

Alice throws in a kk-qubit message. Bob collects mm late radiation qubits after the scrambling time. How large must mm be to make the error at most ϵ\epsilon?

Solution

A kk-qubit message has Hilbert-space dimension

dA=2k.d_A=2^k.

If Bob collects mm late radiation qubits, then

dD=2m.d_D=2^m.

The assumed error estimate gives

ϵdec2k2m=2km.\epsilon_{\rm dec}\sim {2^k\over 2^m}=2^{k-m}.

To have

ϵdecϵ,\epsilon_{\rm dec}\lesssim \epsilon,

we need

2kmϵ.2^{k-m}\lesssim \epsilon.

Taking logarithms,

kmlog2ϵ,k-m\lesssim \log_2\epsilon,

or

mk+log21ϵ.m\gtrsim k+\log_2{1\over \epsilon}.

Thus Bob needs only slightly more late radiation qubits than the number of message qubits, after the message has been scrambled.

Give an entropy-based explanation of why Bob’s access to early radiation EE changes the recovery problem.

Solution

Without early radiation, Bob tries to recover the message from DD alone. If DD is small, then in a typical unitary evolution the complementary system CC is much larger. The reference QQ remains mostly correlated with CC, so

I(Q:C)I(Q:C)

is not small and decoupling fails.

With early radiation, Bob’s decoding system is DEDE, not DD. For an old black hole, EE already purifies much of the black hole BB. After the message is scrambled, a small amount of late radiation DD can complete the information needed to reconstruct the message. In decoupling language, QQ becomes nearly uncorrelated with the remaining black hole CC:

I(Q:C)0.I(Q:C)\simeq 0.

Then the purifier of QQ must be in DEDE.

This is why the Hayden–Preskill mirror effect is specifically an old-black-hole phenomenon.

Exercise 6. Information-theoretic versus efficient recovery

Section titled “Exercise 6. Information-theoretic versus efficient recovery”

Explain the difference between saying “the message is recoverable from the radiation” and saying “Bob can efficiently recover the message.” Why is this distinction important for firewalls?

Solution

The statement that the message is recoverable from the radiation usually means that there exists some recovery map acting on the radiation that reconstructs the message with small error. This is an information-theoretic statement. It follows from decoupling and does not by itself bound the computational complexity of the recovery map.

Efficient recovery is stronger. It requires a decoding algorithm whose time and resources are not enormous compared with the physical timescale of the problem. Harlow and Hayden argued that the decoding operations relevant to certain firewall thought experiments are generically exponentially hard in the black hole entropy. If so, an exterior observer cannot decode the early radiation quickly enough to perform the AMPS experiment before the black hole evaporates.

Thus the radiation may contain the information in principle while remaining practically, or even computationally, inaccessible to realistic observers.

  • P. Hayden and J. Preskill, “Black Holes as Mirrors: Quantum Information in Random Subsystems,” JHEP 09 (2007), 120. arXiv:0708.4025.
  • Y. Sekino and L. Susskind, “Fast Scramblers,” JHEP 10 (2008), 065. arXiv:0808.2096.
  • J. Maldacena, S. H. Shenker, and D. Stanford, “A Bound on Chaos,” JHEP 08 (2016), 106. arXiv:1503.01409.
  • D. Harlow and P. Hayden, “Quantum Computation vs. Firewalls,” JHEP 06 (2013), 085. arXiv:1301.4504.
  • B. Yoshida and A. Kitaev, “Efficient Decoding for the Hayden–Preskill Protocol.” arXiv:1710.03363.
  • P. Hosur, X.-L. Qi, D. A. Roberts, and B. Yoshida, “Chaos in Quantum Channels,” JHEP 02 (2016), 004. arXiv:1511.04021.
  • A. Almheiri, T. Hartman, J. Maldacena, E. Shaghoulian, and A. Tajdini, “The Entropy of Hawking Radiation,” Reviews of Modern Physics 93 (2021), 035002. arXiv:2006.06872.

The next page turns the Hayden–Preskill lesson into a sharper paradox. If late radiation must purify early radiation after the Page time, but local effective field theory says that a late Hawking mode is entangled with an interior partner, then quantum entanglement seems to be overcommitted. This is the logic behind complementarity, AMPS, and firewalls.