Skip to content

Ensemble Averaging, Baby Universes, and JT Gravity

Guiding question. When Euclidean wormholes make a gravitational answer look like an ensemble average, is the bulk theory literally averaging over boundary Hamiltonians, or is the ensemble language a calculational shadow of a deeper quantum-gravity Hilbert space?

The previous page introduced the factorization puzzle: connected Euclidean saddles seem to correlate decoupled boundaries, while a fixed boundary theory obeys exact factorization. This page explains the main framework in which such correlations are not paradoxical. In ensemble-averaged holography, the bulk path integral computes averaged observables such as

Z(β1)Z(β2)ens,\langle Z(\beta_1)Z(\beta_2)\rangle_{\rm ens},

not the product of partition functions in one fixed theory. A connected Euclidean wormhole is then interpreted as an ensemble covariance,

Z(β1)Z(β2)ensZ(β1)ensZ(β2)ens,\langle Z(\beta_1)Z(\beta_2)\rangle_{\rm ens} - \langle Z(\beta_1)\rangle_{\rm ens}\langle Z(\beta_2)\rangle_{\rm ens},

which is perfectly allowed.

The cleanest controlled example is Jackiw–Teitelboim gravity. Pure JT gravity, summed over Euclidean topologies, is described by a double-scaled random matrix integral. This does not mean that every AdS/CFT dual is an ensemble. It means that some gravitational path integrals naturally compute ensemble-averaged quantities, and JT gravity gives a precise laboratory for studying how wormholes, baby universes, spectral statistics, and factorization fit together.

Fixed theory versus ensemble average.

A fixed theory and an ensemble answer different questions. A fixed Hamiltonian HH gives exact factorization for decoupled copies. An ensemble average over HαH_\alpha can produce nonzero covariances such as Z1Z2Z1Z2\langle Z_1Z_2\rangle-\langle Z_1\rangle\langle Z_2\rangle, naturally represented by connected Euclidean wormholes.

Let α\alpha label theories in an ensemble with probability measure dμ(α)d\mu(\alpha). For each member we have a Hilbert space and Hamiltonian, schematically

Tα:Hα,Hα,Zα(β)=TrHαeβHα.\mathcal T_\alpha: \qquad \mathcal H_\alpha, \qquad H_\alpha, \qquad Z_\alpha(\beta)=\operatorname{Tr}_{\mathcal H_\alpha}e^{-\beta H_\alpha}.

The ensemble average is

Z(β)ens=dμ(α)Zα(β).\langle Z(\beta)\rangle_{\rm ens} = \int d\mu(\alpha)\,Z_\alpha(\beta).

For a single fixed member α\alpha, two decoupled copies factorize:

Zα(β1,β2)=Zα(β1)Zα(β2).Z_\alpha(\beta_1,\beta_2) = Z_\alpha(\beta_1)Z_\alpha(\beta_2).

But after averaging over α\alpha,

Z(β1)Z(β2)ensZ(β1)ensZ(β2)ens\langle Z(\beta_1)Z(\beta_2)\rangle_{\rm ens} \neq \langle Z(\beta_1)\rangle_{\rm ens}\langle Z(\beta_2)\rangle_{\rm ens}

in general. The difference is the covariance of the random variables Zα(β1)Z_\alpha(\beta_1) and Zα(β2)Z_\alpha(\beta_2):

Cov(Z1,Z2)=Z1Z2ensZ1ensZ2ens.\operatorname{Cov}(Z_1,Z_2) = \langle Z_1Z_2\rangle_{\rm ens} - \langle Z_1\rangle_{\rm ens}\langle Z_2\rangle_{\rm ens}.

Thus an ensemble-averaged theory can have connected multi-boundary observables without contradicting factorization in each fixed member. This is the simplest mathematical resolution of the Euclidean-wormhole tension.

The important interpretive question is then:

Does a given gravitational path integral compute a fixed-theory answer, an ensemble average, or an approximation that hides the distinction?

JT gravity gives a case where the ensemble interpretation is not merely suggestive but quantitatively sharp.

The Euclidean JT path integral has the schematic topological expansion

i=1nZ(βi)JT=Σ:Σ=iSβi1eS0χ(Σ)ZJTtrumpet/disc(Σ),\left\langle \prod_{i=1}^n Z(\beta_i)\right\rangle_{ \rm JT } = \sum_{\substack{\Sigma\,:\,\partial\Sigma=\bigsqcup_i S^1_{\beta_i}}} e^{S_0\chi(\Sigma)}\,Z_{\rm JT}^{\rm trumpet/disc}(\Sigma),

where

χ(Σ)=22gn\chi(\Sigma)=2-2g-n

for a genus-gg surface with nn asymptotic boundaries. The constant S0S_0 is the extremal entropy term in the JT action and controls the genus expansion: every handle costs a factor of e2S0e^{-2S_0}.

Saad, Shenker, and Stanford showed that this all-genus expansion is reproduced by a double-scaled random matrix integral. In the matrix-integral language,

Z(β)=TreβH,Z(\beta)=\operatorname{Tr}e^{-\beta H},

and JT computes ensemble averages of products of such traces:

i=1nTreβiHmatrix.\left\langle \prod_{i=1}^n \operatorname{Tr}e^{-\beta_iH}\right\rangle_{ \rm matrix }.

The equality is not merely a slogan. The Weil–Petersson volumes appearing in JT amplitudes obey recursion relations that match the topological recursion of the matrix model. The result is an exact perturbative dictionary between Euclidean JT topology and random-matrix spectral statistics.

JT topological expansion and matrix integral.

Pure JT gravity organizes nn-boundary amplitudes by topology. The same expansion is reproduced by a double-scaled matrix integral computing ensemble averages of iTreβiH\prod_i\operatorname{Tr}e^{-\beta_iH}. The cylinder is the geometric avatar of a connected spectral covariance.

The leading one-boundary answer is the disk. In common conventions the leading spectral density takes the universal Schwarzian/JT form

ρ0(E)sinh ⁣(2π2CE),\rho_0(E)\propto \sinh\!\left(2\pi\sqrt{2CE}\right),

where CC is the Schwarzian coefficient. The precise normalization depends on conventions, but the lesson is invariant: the JT disk computes the smooth, leading large-density approximation to the spectrum. Higher topologies encode spectral correlations and eventually the random-matrix ramp and plateau.

Consider two thermal circles with inverse temperatures β1\beta_1 and β2\beta_2. The matrix model gives

Z(β1)Z(β2)=Z(β1)Z(β2)+Z(β1)Z(β2)conn.\langle Z(\beta_1)Z(\beta_2)\rangle = \langle Z(\beta_1)\rangle\langle Z(\beta_2)\rangle + \langle Z(\beta_1)Z(\beta_2)\rangle_{\rm conn}.

In JT gravity, the disconnected product is represented by two disks. The connected contribution is represented by a Euclidean cylinder connecting the two asymptotic boundaries:

Z(β1)Z(β2)connJT cylinder.\langle Z(\beta_1)Z(\beta_2)\rangle_{\rm conn} \quad\longleftrightarrow\quad \text{JT cylinder}.

This is not a violation of quantum mechanics. It is the expected covariance of spectral observables in an ensemble. For instance, writing

Zα(β)=keβEk(α),Z_\alpha(\beta)=\sum_k e^{-\beta E_k^{(\alpha)}},

we have

Z(β1)Z(β2)ens=dμ(α)k,eβ1Ek(α)β2E(α).\langle Z(\beta_1)Z(\beta_2)\rangle_{\rm ens} = \int d\mu(\alpha) \sum_{k,\ell}e^{-\beta_1E_k^{(\alpha)}-\beta_2E_\ell^{(\alpha)}}.

The connected part measures correlations in the same spectrum. In a fixed theory those correlations are present as spectral correlations, but they do not appear as a nonfactorizing product between two independent copies. The wormhole knows about the ensemble average, not about a literal interaction between the two boundary systems.

4. Spectral form factor: ramp, plateau, and wormholes

Section titled “4. Spectral form factor: ramp, plateau, and wormholes”

A particularly useful diagnostic is the spectral form factor

K(β,t)=Z(β+it)Z(βit).K(\beta,t) = \left\langle Z(\beta+it)Z(\beta-it)\right\rangle.

It probes correlations between energy levels. In chaotic systems, the connected spectral form factor displays the famous slope–ramp–plateau structure. The early-time slope is controlled by the smooth density of states; the ramp reflects universal level repulsion; the plateau reflects the discreteness of the spectrum.

In JT gravity and its matrix-integral completion, these features have geometric avatars:

slopedisconnected saddle,rampconnected cylinder,plateaunonperturbative discreteness.\text{slope} \sim \text{disconnected saddle}, \qquad \text{ramp} \sim \text{connected cylinder}, \qquad \text{plateau} \sim \text{nonperturbative discreteness}.

The plateau is especially important. A finite quantum system has a discrete spectrum, so the spectral form factor cannot decay forever. Perturbative genus expansions can see the ramp but are not, by themselves, enough to produce the exact plateau. The matrix integral supplies a nonperturbative completion in which eigenvalue discreteness is built in.

Spectral form factor and JT saddles.

The spectral form factor links random-matrix physics and Euclidean wormholes. The connected cylinder captures the ramp, while the plateau requires nonperturbative information about the discreteness of the spectrum. The diagram is schematic; the precise time scales depend on the ensemble and normalization.

This is one reason JT gravity is so valuable. It shows how a gravitational sum over topologies can reproduce detailed quantum-chaotic spectral statistics. But it also reveals the danger: a semiclassical wormhole contribution often computes an averaged quantity, and averaged quantities do not obey the same factorization properties as fixed-theory quantities.

5. Baby universes as the Hilbert-space language of ensembles

Section titled “5. Baby universes as the Hilbert-space language of ensembles”

The ensemble interpretation can be rephrased in a more gravitational language using baby universes. The idea is old, but the modern AdS/JT setting gives it a sharper operational meaning.

A baby universe is a closed component of spacetime that can branch off from, or connect to, asymptotic regions in a Euclidean path integral. Instead of saying that wormholes directly couple different boundary theories, one can introduce a baby-universe Hilbert space HBU\mathcal H_{\rm BU} and operators Z^(β)\widehat Z(\beta) that create asymptotic boundaries.

In this language, gravitational amplitudes are expectation values in a state ΨBU|\Psi\rangle_{\rm BU}:

Z(β1)Z(βn)grav=BUΨZ^(β1)Z^(βn)ΨBU.\langle Z(\beta_1)\cdots Z(\beta_n)\rangle_{ \rm grav } = {}_{\rm BU}\langle\Psi| \widehat Z(\beta_1)\cdots\widehat Z(\beta_n) |\Psi\rangle_{\rm BU}.

If the operators Z^(β)\widehat Z(\beta) commute, they can be simultaneously diagonalized. Their common eigenstates are often called α\alpha-states:

Z^(β)α=Zα(β)α.\widehat Z(\beta)|\alpha\rangle = Z_\alpha(\beta)|\alpha\rangle.

If the Hartle–Hawking baby-universe state is a superposition or mixture over α\alpha, then

BUΨZ^(β1)Z^(βn)ΨBU=dμ(α)p(α)iZα(βi).{}_{\rm BU}\langle\Psi| \widehat Z(\beta_1)\cdots\widehat Z(\beta_n) |\Psi\rangle_{\rm BU} = \int d\mu(\alpha)\,p(\alpha) \prod_i Z_\alpha(\beta_i).

This is exactly an ensemble average.

Baby-universe Hilbert space and alpha states.

In the baby-universe description, asymptotic-boundary operators Z^(β)\widehat Z(\beta) are diagonal in α\alpha-states. A Hartle–Hawking state that is not an α\alpha-eigenstate computes an ensemble average over effective boundary theories. Projection onto one α\alpha-sector gives fixed values Zα(β)Z_\alpha(\beta).

This language clarifies the connection between wormholes and ensembles. A connected wormhole is not necessarily a mysterious force between decoupled boundaries. It can be the statement that both boundaries are probing the same hidden baby-universe sector α\alpha.

The analogy is simple. Suppose a hidden random variable α\alpha is sampled once, and two observers measure functions XαX_\alpha and YαY_\alpha. Conditional on α\alpha, the measurements factorize. But after averaging over α\alpha, they are correlated:

XYXY=Covα(Xα,Yα).\langle XY\rangle-\langle X\rangle\langle Y\rangle = \operatorname{Cov}_\alpha(X_\alpha,Y_\alpha).

Baby universes provide a gravitational realization of this logic.

The α\alpha-state perspective is useful because it separates two questions that are often conflated.

First, what does the gravitational path integral state compute? If the state is Hartle–Hawking-like and has support over many α\alpha sectors, it computes ensemble moments.

Second, what does a single exact boundary theory compute? That should correspond, if the baby-universe language applies, to a fixed α\alpha sector or to a more microscopic completion that effectively selects fixed data.

In an α\alpha-state,

αZ^(β1)Z^(β2)α=Zα(β1)Zα(β2),\langle\alpha| \widehat Z(\beta_1)\widehat Z(\beta_2) |\alpha\rangle = Z_\alpha(\beta_1)Z_\alpha(\beta_2),

so factorization is restored.

In a superposition or mixture over α\alpha,

ΨZ^(β1)Z^(β2)ΨΨZ^(β1)ΨΨZ^(β2)Ψ0\langle\Psi| \widehat Z(\beta_1)\widehat Z(\beta_2)|\Psi\rangle - \langle\Psi| \widehat Z(\beta_1)|\Psi\rangle \langle\Psi| \widehat Z(\beta_2)|\Psi\rangle \neq 0

unless the distribution is trivial or the two observables are uncorrelated across the ensemble.

This is the cleanest version of the baby-universe resolution of the factorization puzzle. But it also relocates the hard question: in an ordinary non-ensemble AdS/CFT duality, what selects the fixed α\alpha data, and how is that selection represented in the bulk?

7. Null states and the Marolf–Maxfield lesson

Section titled “7. Null states and the Marolf–Maxfield lesson”

Modern baby-universe discussions also emphasize the role of the gravitational inner product. Naively, one might try to build many states by adding many asymptotic boundaries or closed universes. But the gravitational path integral defines an inner product, and that inner product can make many naive states null.

A schematic version is as follows. Suppose one tries to define states i|i\rangle by inserting asymptotic boundary conditions. The inner product matrix is computed by a gravitational path integral:

Gij=ijgrav.G_{ij}=\langle i|j\rangle_{\rm grav}.

If GG has zero eigenvalues, then some linear combinations are null and must be quotiented out. The physical Hilbert space is not the naive vector space of all topological boundary insertions; it is

Hphys=Hnaive/N,\mathcal H_{\rm phys} = \mathcal H_{\rm naive}/\mathcal N,

where N\mathcal N is the null subspace.

This point matters for black-hole information. A gravitational path integral can look as if it creates too many independent states or too many independent topological sectors. But after quotienting by null states, the physical state space can be smaller and more consistent with unitarity or finite-dimensional boundary expectations.

The moral is not that baby universes automatically solve every factorization problem. The moral is that topology change must be discussed together with the physical inner product, not merely as a sum of classical manifolds.

Ensemble averages are treacherous because nonlinear operations do not commute with averaging. The two most common averages are:

Fann(β)=1βlogZ(β),F_{\rm ann}(\beta) =-\frac{1}{\beta}\log \langle Z(\beta)\rangle,

and

Fque(β)=1βlogZ(β).F_{\rm que}(\beta) =-\frac{1}{\beta}\langle \log Z(\beta)\rangle.

The first is the annealed free energy; the second is the quenched free energy. They are generally different. Since log\log is concave,

logZlogZ,\log\langle Z\rangle\geq \langle\log Z\rangle,

and therefore, for β>0\beta>0,

FannFque.F_{\rm ann}\leq F_{\rm que}.

The same distinction appears in entropy calculations. For a random density matrix ρα\rho_\alpha,

S ⁣(ρα)S(ρα).S\!\left(\langle \rho_\alpha\rangle\right) \neq \left\langle S(\rho_\alpha)\right\rangle.

This is crucial in replica-wormhole physics. The entropy involves a logarithm,

S(ρ)=Trρlogρ,S(\rho)=-\operatorname{Tr}\rho\log\rho,

so one must be careful about whether the gravitational replica path integral computes an annealed, quenched, or fixed-theory quantity. Many apparent paradoxes arise from silently replacing one with another.

Annealed and quenched averages.

Averages and nonlinear operations do not commute. The annealed free energy uses logZ\log\langle Z\rangle, while the quenched free energy uses logZ\langle\log Z\rangle. Fine-grained entropy has the same danger: S(ρ)S(\langle\rho\rangle) is not the same as S(ρ)\langle S(\rho)\rangle.

9. Relation to replica wormholes and islands

Section titled “9. Relation to replica wormholes and islands”

Replica wormholes compute entropies, not ordinary two-boundary partition functions. Nevertheless, the same conceptual infrastructure appears.

For a radiation density matrix ρR\rho_R, the entropy is obtained from

S(R)=limn111nlogTrρRn.S(R)=\lim_{n\to1}\frac{1}{1-n}\log\operatorname{Tr}\rho_R^n.

In gravity, the path integral for TrρRn\operatorname{Tr}\rho_R^n includes saddles with nontrivial topology connecting replica copies. After taking the n1n\to1 limit, these saddles yield the island formula.

The ensemble question is then: are replica wormholes computing the entropy of a fixed theory, an ensemble-averaged entropy, or a good semiclassical approximation to a fixed-theory answer?

The modern view is nuanced.

  • In pure JT gravity, the matrix-integral dual strongly suggests an ensemble interpretation.
  • In standard AdS/CFT with a fixed microscopic CFT, exact factorization should hold, so connected Euclidean wormholes require additional nonperturbative interpretation.
  • In many island calculations, the leading semiclassical result for the Page curve is expected to be robust even if the precise ensemble/fixed-theory completion is subtle.

The key reason is that the island formula is a statement about the dominant generalized-entropy saddle in a semiclassical regime. It can correctly capture the leading fine-grained entropy while leaving open the microscopic mechanism that enforces exact factorization in a fixed UV-complete dual.

10. Eigenbranes, half-wormholes, and fixed-spectrum refinements

Section titled “10. Eigenbranes, half-wormholes, and fixed-spectrum refinements”

Several modern ideas attempt to move beyond the simple ensemble story.

One approach is to condition the matrix integral on additional microscopic data. In JT language, eigenbranes are related to fixing eigenvalues of the random matrix. Fixing more spectral data pushes the ensemble closer to a particular member, and the gravitational description acquires new brane-like ingredients.

Another approach is the half-wormhole idea. In simple models inspired by SYK, connected wormholes can survive in a fixed-coupling description, but additional saddles also appear. These half-wormholes are sensitive to the specific microscopic couplings and restore factorization in the fixed theory. After averaging, the coupling-sensitive half-wormholes vanish, leaving the smoother wormhole contribution.

These ideas are not yet a universal solution for all AdS/CFT factorization puzzles. But they teach an important lesson:

The semiclassical wormhole is often only the smooth, averaged part of a more refined nonperturbative answer.

A fixed theory can contain extra microscopic phases, branes, null-state identifications, or saddles that are invisible in the coarse semiclassical approximation.

Ways to refine the ensemble interpretation.

Ways to refine the ensemble story. The semiclassical wormhole often captures a smooth averaged contribution. A fixed theory may require projection to an α\alpha-sector, conditioning on spectral data, adding half-wormhole-like saddles, or quotienting null states in the baby-universe Hilbert space.

11. What JT gravity proves, and what it does not prove

Section titled “11. What JT gravity proves, and what it does not prove”

JT gravity proves a remarkable statement: a precisely defined sum over Euclidean topologies can reproduce a random matrix ensemble. It also gives a controlled setting where Euclidean wormholes compute connected spectral correlations rather than contradicting quantum mechanics.

But several statements would be too strong.

First, JT gravity does not prove that every theory of quantum gravity is dual to an ensemble. Ordinary AdS/CFT examples, such as maximally supersymmetric Yang–Mills dual to string theory on asymptotically AdS backgrounds, are believed to be fixed theories.

Second, the matrix-integral completion of pure JT is not unique in the same way a full UV-complete string compactification is. It is a powerful nonperturbative completion of a low-dimensional gravitational path integral, not a universal definition of quantum gravity.

Third, baby universes do not automatically imply observable randomness in low-energy couplings. Whether α\alpha-parameters are physical, fixed by boundary conditions, gauged, or absent depends on the theory and on the allowed observables.

Fourth, ensemble averaging is not a loophole in unitarity. Each ensemble member can be perfectly unitary. What changes is that averaged quantities may look mixed, correlated, or nonfactorizing in ways that a single fixed theory would not.

12. Why this matters for black hole information

Section titled “12. Why this matters for black hole information”

The black hole information problem forces us to distinguish three levels of description:

  1. Semiclassical geometry: useful but often computes coarse or averaged quantities.
  2. Generalized entropy saddles: powerful enough to reproduce Page-curve behavior.
  3. Exact microscopic Hilbert space: required for exact factorization, exact unitarity, and the fine structure of late-time observables.

Euclidean wormholes and baby universes live precisely at the interface between these levels. They explain why gravitational path integrals can know about spectral correlations and Page-curve physics, while also warning us not to overinterpret every connected saddle as an exact observable of a fixed boundary theory.

The safest summary is:

JT gravity shows that wormholes naturally compute ensemble moments. Baby-universe language explains those moments as expectation values over α\alpha-sectors. Fixed AdS/CFT duals must recover factorization by selecting, conditioning, or otherwise refining the gravitational answer.

That is why ensemble averaging is not a side topic. It is one of the sharpest probes of what the gravitational path integral is actually computing.

Pitfall 1: “A wormhole means the two boundary theories interact.”

No. In the ensemble interpretation, the connected wormhole computes a covariance over theories or spectra. It does not introduce a dynamical interaction between two decoupled boundary Hamiltonians.

Pitfall 2: “JT gravity proves AdS/CFT is always an ensemble.”

No. Pure JT is special. It has a matrix-integral description, but familiar higher-dimensional AdS/CFT dualities are expected to be fixed theories. The puzzle is how their bulk path integrals encode that fixed-theory nature.

Pitfall 3: “Averaging destroys unitarity.”

No. An ensemble of unitary theories is still an ensemble of unitary theories. Averaged observables may not look like observables in one fixed unitary theory, but that is a statement about averaging, not about nonunitary time evolution.

Pitfall 4: “The island formula depends on literal ensemble averaging.”

Not quite. Replica wormholes and islands are clearest in models with ensemble-like features, but the leading island saddle is expected to capture robust semiclassical entropy physics more generally. The exact fixed-theory interpretation remains subtler.

Exercise 1: Factorization before and after averaging

Section titled “Exercise 1: Factorization before and after averaging”

Let XαX_\alpha and YαY_\alpha be two observables in a theory labeled by α\alpha. Suppose that for fixed α\alpha the two-copy observable factorizes:

XYα=XαYα.\langle XY\rangle_\alpha=X_\alpha Y_\alpha.

Show that the ensemble-averaged connected correlator is the covariance of XαX_\alpha and YαY_\alpha.

Solution

The ensemble average gives

XYens=dμ(α)XαYα.\langle XY\rangle_{\rm ens} = \int d\mu(\alpha)\,X_\alpha Y_\alpha.

The one-point averages are

Xens=dμ(α)Xα,Yens=dμ(α)Yα.\langle X\rangle_{\rm ens}=\int d\mu(\alpha)X_\alpha, \qquad \langle Y\rangle_{\rm ens}=\int d\mu(\alpha)Y_\alpha.

Therefore

XYensXensYens=Covα(Xα,Yα).\langle XY\rangle_{\rm ens} - \langle X\rangle_{\rm ens}\langle Y\rangle_{\rm ens} = \operatorname{Cov}_\alpha(X_\alpha,Y_\alpha).

This is the basic ensemble interpretation of a connected two-boundary wormhole.

Exercise 2: Topological weights in JT gravity

Section titled “Exercise 2: Topological weights in JT gravity”

For a connected orientable surface of genus gg with nn asymptotic boundaries, the Euler character is

χ=22gn.\chi=2-2g-n.

Compute the eS0χe^{S_0\chi} weight for a disk, a cylinder, a pair of pants, and a genus-one surface with one boundary.

Solution

A disk has g=0,n=1g=0,n=1, so

χ=1,weight=eS0.\chi=1, \qquad \text{weight}=e^{S_0}.

A cylinder has g=0,n=2g=0,n=2, so

χ=0,weight=1.\chi=0, \qquad \text{weight}=1.

A pair of pants has g=0,n=3g=0,n=3, so

χ=1,weight=eS0.\chi=-1, \qquad \text{weight}=e^{-S_0}.

A genus-one surface with one boundary has g=1,n=1g=1,n=1, so

χ=1,weight=eS0.\chi=-1, \qquad \text{weight}=e^{-S_0}.

Adding a handle lowers χ\chi by 22 and therefore costs e2S0e^{-2S_0}.

Exercise 3: Annealed versus quenched free energy

Section titled “Exercise 3: Annealed versus quenched free energy”

For positive random ZZ, prove that

FannFqueF_{\rm ann}\leq F_{\rm que}

where

Fann=1βlogZ,Fque=1βlogZ,F_{\rm ann}=-\frac{1}{\beta}\log\langle Z\rangle, \qquad F_{\rm que}=-\frac{1}{\beta}\langle\log Z\rangle,

with β>0\beta>0.

Solution

The logarithm is concave, so Jensen’s inequality gives

logZlogZ.\log\langle Z\rangle\geq \langle\log Z\rangle.

Multiplying by 1/β-1/\beta with β>0\beta>0 reverses the inequality:

1βlogZ1βlogZ.-\frac{1}{\beta}\log\langle Z\rangle \leq -\frac{1}{\beta}\langle\log Z\rangle.

Thus

FannFque.F_{\rm ann}\leq F_{\rm que}.

The inequality becomes sharp when fluctuations of ZZ over the ensemble are important.

Assume commuting baby-universe operators Z^(β)\widehat Z(\beta) with eigenstates α|\alpha\rangle satisfying

Z^(β)α=Zα(β)α.\widehat Z(\beta)|\alpha\rangle=Z_\alpha(\beta)|\alpha\rangle.

Show that connected two-boundary correlations vanish in a fixed α\alpha-state but not in a general superposition or mixture over α\alpha.

Solution

In a fixed α\alpha-state,

αZ^(β1)Z^(β2)α=Zα(β1)Zα(β2),\langle\alpha|\widehat Z(\beta_1)\widehat Z(\beta_2)|\alpha\rangle =Z_\alpha(\beta_1)Z_\alpha(\beta_2),

and

αZ^(β1)ααZ^(β2)α=Zα(β1)Zα(β2).\langle\alpha|\widehat Z(\beta_1)|\alpha\rangle \langle\alpha|\widehat Z(\beta_2)|\alpha\rangle =Z_\alpha(\beta_1)Z_\alpha(\beta_2).

The connected part is zero.

For a diagonal mixture with weights p(α)p(\alpha),

Z^1Z^2=dαp(α)Zα(β1)Zα(β2),\langle \widehat Z_1\widehat Z_2\rangle =\int d\alpha\,p(\alpha)Z_\alpha(\beta_1)Z_\alpha(\beta_2),

while

Z^1Z^2=(dαp(α)Zα(β1))(dαp(α)Zα(β2)).\langle \widehat Z_1\rangle\langle \widehat Z_2\rangle =\left(\int d\alpha\,p(\alpha)Z_\alpha(\beta_1)\right) \left(\int d\alpha\,p(\alpha)Z_\alpha(\beta_2)\right).

The difference is the covariance over α\alpha, which is generally nonzero.

Give a simple reason why

S(ρα)S(ρα)S(\langle\rho_\alpha\rangle) \neq \langle S(\rho_\alpha)\rangle

in general. Illustrate with an ensemble of two pure orthogonal states.

Solution

Take

ρ1=00,ρ2=11,\rho_1=|0\rangle\langle0|, \qquad \rho_2=|1\rangle\langle1|,

with equal probabilities. Each state is pure, so

S(ρ1)=S(ρ2)=0,S(ρα)=0.S(\rho_1)=S(\rho_2)=0, \qquad \langle S(\rho_\alpha)\rangle=0.

But the averaged density matrix is

ρα=1200+1211,\langle\rho_\alpha\rangle =\frac12|0\rangle\langle0|+\frac12|1\rangle\langle1|,

so

S(ρα)=log2.S(\langle\rho_\alpha\rangle)=\log2.

This elementary example shows why averaged density matrices and averaged entropies are different objects. Replica-wormhole entropy calculations must keep track of this distinction.