Enrico M. Malatesta

Could be worse - Random topics in statistical physics, spin glasses, optimization and average case hardness

The Random Energy Model – Part II

22 March 2025 | E. Malatesta

In a previous post we rigorously solved the Random Energy Model (REM) unveiling the presence of a ''condensation'' or ''freezing'' transition at an inverse temperature

βce0J=2ln2J . \beta_c \equiv \frac{e_0}{J} = \sqrt{\frac{2 \ln 2}{J}} \,.

Namely, for β<βc\beta<\beta_c the Gibbs measure is spread over exponentially many configurations as the corresponding entropy of the model is positive, while for β>βc\beta > \beta_c it condenses onto a sparse set of states corresponding to the lowest available energy e0=2Jln2-e_0 = - \sqrt{2J \ln 2}. This is evident in the expression of the energy as a function of temperature

e(β)=(βf)β={βJ if β<βc2Jln2 if ββc . e(\beta) = \frac{\partial (\beta f)}{\partial \beta} = \begin{cases} - \beta J & \; \mathrm{if} \;\beta < \beta_c \\ - \sqrt{2 J \ln 2} & \; \mathrm{if} \; \beta \ge \beta_c \,. \end{cases}

The goal of this post is to show explicitly how this condensation occurs by computing the participation ratio

YN(β)=i=12Nwi2 Y_N(\beta)=\sum_{i=1}^{2^N} w_i^2 %= \frac{Z_N(2\beta)}{Z_N(\beta)^2}.

where wiw_i represents the Boltzmann weight corresponding to the energy level EiE_i

wiμβ(Ei)=eβEiZN(β) w_i \equiv \mu_\beta(E_i) = \frac{e^{-\beta E_i}}{Z_N(\beta)}

Intuitively if the Boltzmann measure is equidistributed over all the energy levels we expect that YN(β)=2NY_N(\beta) = 2^{-N} on average, i.e. it vanishes exponentially fast with NN. If instead YN(β)Y_N(\beta) remains finite in the large NN limit it means that the measure is focused only on a O(1)O(1) number of energy levels.

The high temperature phase

When β<βc\beta < \beta_c it is easy to compute the participation ratio by using the identity

YN(β)=i=12Ne2βEiZN2(β)=ZN(2β)ZN(β)2 Y_N(\beta) = \sum_{i=1}^{2^N} \frac{e^{- 2\beta E_i}}{Z_N^2(\beta)} = \frac{Z_N(2\beta)}{Z_N(\beta)^2}

which expresses it in terms of the partition function found in the previous post which we report here for convenience

ZN(β)=eNβfN(β)={eN(ln2+β2J2) if β<βceNβ2Jln2 if ββc . Z_N(\beta) = e^{-N\beta f_N(\beta)} = \begin{cases} e^{N \left(\ln 2 + \frac{\beta^2 J}{2} \right)} & \; \mathrm{if} \;\beta< \beta_c\\ e^{N \beta \sqrt{2 J \ln 2} } & \; \mathrm{if} \; \beta \ge \beta_c \,. \end{cases}

We have two subcases, depending if 2β<βc2\beta< \beta_c or 2β>βc2\beta > \beta_c, i.e. when the numerator in~(5) is respectivelt in the high phase or the condensed phase.

When 2β<βc2\beta< \beta_c we have

YN(β)=eN(ln2+2β2J)eN(2ln2+β2J)=eN(ln2β2J)<eN(ln2βc2J4)=eN2ln20 Y_N(\beta) = \frac{e^{N \left(\ln 2 + 2\beta^2 J \right)}}{e^{N \left(2\ln 2 + \beta^2 J \right)}} = e^{-N\left( \ln 2 - \beta^2 J \right)} < e^{-N\left( \ln 2 - \frac{\beta_c^2 J}{4} \right)} = e^ {- \frac{N}{2}\ln 2 } \to 0

whereas if β<βc\beta< \beta_c but 2β>βc2\beta > \beta_c the numerator in~(5) is in the low temperature phase

YN(β)=e2Nβ2Jln2eN(2ln2+β2J)=eNJ(βc2+β22ββc)=eNJ(ββc)20 Y_N(\beta) = \frac{e^{2 N \beta \sqrt{2J\ln 2}}}{e^{N \left(2\ln 2 + \beta^2 J \right)}} = e^{-N J \left( \beta_c^2 + \beta^2 - 2\beta \beta_c\right)} = e^{-N J \left( \beta - \beta_c\right)^2} \to 0

Notice how the two expressions match at β=βc2\beta = \frac{\beta_c}{2}.

The condensed phase: extreme energies and the distribution of Boltzmann weights

In order to derive the expression of the participation ratio in the low temperature phase, we need to better control the energy fluctuations near the lowest energy EiNe0E_i \simeq - N e_0.

To this end, we introduce a more general observable, denoted by P(w)P(w), in terms of which the participation ratio can be easily expressed. P(w)P(w) represents the disorder-averaged density of Gibbs weights

P(w)=E[iδ(wwi)] P(w) = \mathbb{E}\left[\sum_{i} \delta(w-w_i) \right]

meaning that P(w) dwP(w)\, dw is the expected number of energy levels having Gibbs weights in [w,w+dw][w, w+dw]. The (expected) participation ratio is expressed in terms of P(w)P(w) as

E[YN(β)]=E[iwi2]=01dw P(w) w2 . \mathbb E[Y_N(\beta)] = E\left[\sum_i w_i^2\right] = \int_0^1 dw\, P(w)\,w^2 \,.

In the following sections we will derive the expression of P(w)P(w) in the condensed phase by controlling the energy fluctuations near the lowest available energy.

Zooming near the lower edge of the spectrum

To study the relevant configurations at low temperature, we therefore write

Ei=Ne0+si, E_i=-Ne_0+ s_i,

where si=O(1)s_i=O(1) measures the small energy shift from the lower edge.

Using the Gaussian density of energy levels

ρN(E)=12πNJeE22NJ, \rho_N(E)=\frac{1}{\sqrt{2\pi NJ}}e^{-\frac{E^2}{2NJ}},

one finds

ρN(Ne0+s)=12πNJe(Ne0s)22NJ=12πNJeNe022J+e0sJ(1+O(1N))AN2Neβcs, \rho_N(-Ne_0+s) = \frac{1}{\sqrt{2\pi NJ}}e^{-\frac{(N e_0 - s)^2}{2NJ}} = \frac{1}{\sqrt{2\pi NJ}}e^{-\frac{N e_0^2}{2J} + \frac{e_0 s }{J}} \left(1 + O\left(\frac{1}{N}\right)\right) \simeq A_N 2^{-N} e^{\beta_c s},

up to a prefactor AN=(2πNJ)1/2A_N= (2\pi N J)^{-1/2} that depends on NN but not on ss. This tells us that with respect to the extreme energy Ne0-N e_0 the energy levels are exponentially distributed in the shift variable ss; namely among the 2N2^N energy levels, the states with a shift s=O(1)s=O(1) occur with a density proportional to eβcs dse^{\beta_c s}\,ds.

The Gumbel law

We are now going to show that in the condensed phase the dominant states live at a distance O(1)O(1) from the spectral edge Ne0-N e_0, and their statistics is governed by extreme-value theory.

Let EE be the minimum among the 2N2^N energies. The probability density of EE is given by the probability of sampling an energy level EE times the probability that all the other 2N12^N-1 energy levels have an energy larger than EE:

ρmin(E)=2NρN(E)[EdEρN(E)]2N1=2NρN(E) [1H(ENJ)]2N1. \rho_{\min}(E)=2^N \rho_N(E)\left[\int^{\infty}_E dE' \rho_N(E')\right]^{2^N-1} = 2^N \rho_N(E) \, \left[1 - H\left(-\frac{E}{\sqrt{NJ}}\right)\right]^{2^N-1}.

where H(x)=12 Erfc(x2)H(x) = \frac{1}{2} \, \mathrm{Erfc}\left(\frac{x}{\sqrt{2}}\right). Now we impose that the minimal energy scales as E=Ne0+sE=-Ne_0+s; we use the fact that

H(x)ex2/22πxif x H(x) \simeq \frac{e^{-x^2/2}}{\sqrt{2\pi} x} \qquad \mathrm{if} \; x \to \infty

so we have

ρmin(Ne0+s)2NρN(Ne0+s) [1e(Ne0+s)22NJ2πNJβc]2N=2NρN(Ne0+s) [11βcρN(Ne0+s)]2N. \begin{split} \rho_{\min}(-N e_0 + s) &\simeq 2^N \rho_N(-N e_0 + s) \, \left[1 - \frac{e^{-\frac{(-N e_0 + s)^2}{2NJ}}}{\sqrt{2\pi N J} \beta_c}\right]^{2^N} \\ &= 2^N \rho_N(-N e_0 + s) \, \left[1 - \frac{1}{\beta_c}\rho_N(-N e_0 + s)\right]^{2^N}. \end{split}

Using the asymptotic form of the density near the edge (13), we therefore find

ρmin(s)=ANeβcsexp[ANβceβcs] \rho_{\min}(s) = A_N e^{\beta_c s} \exp\left[- \frac{A_N}{\beta_c} e^{\beta_c s}\right]

Introducing the rescaled variable

η=βcs+ln(ANβc), \eta=\beta_c s+\ln\left(\frac{A_N}{\beta_c}\right),

one gets

ρmin(η)=ρmin(s(η))dsdη=eηeη. \rho_{\min}(\eta)= \rho_{\min}(s(\eta)) \left|\frac{ds}{d\eta}\right| = e^{\eta-e^\eta}.

which is a Gumbel distribution. The Gumbel law appears in the REM because below TcT_c the Gibbs measure is controlled by the lowest energies, and the minimum of a large collection of independent Gaussian variables belongs to the Gumbel universality class of extreme-value theory (Gumbel (1958)).

From extreme energies to Gibbs weights

Now consider again the scaling Ei=Ne0+uiE_i=-Ne_0+u_i but directly applied to the Gibbs weight wi=eβEiZN(β) w_i=\frac{e^{-\beta E_i}}{Z_N(\beta)}. As I will show here this will allow us to derive the distribution of Gibbs weights P(w)P(w)

Factoring out the common contribution eβNe0e^{\beta N e_0}, we can write

wi=eβsieβsi+Zi, w_i=\frac{e^{-\beta s_i}}{e^{-\beta s_i}+Z_{\ne i}},

where

Zi=jieβsj. Z_{\ne i}=\sum_{j\neq i} e^{-\beta s_j}.

We can find the distribution of wiw_i by expressing it in terms of the distribution of the shift variables sis_i found previously in (13); solving for sis_i we find

βsi=ln(wiZi1wi)=lnZilnwi+ln(1wi) . \beta s_i= - \ln\left(\frac{w_i Z_{\ne i}}{1-w_i}\right) = -\ln Z_{\ne i} - \ln w_i + \ln (1-w_i) \,.

Since the sis_i's are distributed with intensity proportional to eβcse^{\beta_c s}, we can transform variables from ss to ww. Conditioned on the value of ZiZ_{\ne i}, this gives

P(wZi)eβcsdsdw. P(w\mid Z_{\ne i}) \propto e^{\beta_c s}\left|\frac{ds}{dw}\right|.

Using (22) we can write eβcs=(wZi1w)βc/βe^{\beta_c s}=\left(\frac{w Z_{\ne i}}{1-w}\right)^{-\beta_c/\beta} and

dsdw=T(1w+11w)=Tw(1w), \left|\frac{ds}{dw}\right|= T \left( \frac{1}{w} + \frac{1}{1-w} \right)= \frac{T}{w(1-w)},

so that we get

P(wZi)(Zi)mw1m(1w)1+m P(w\mid Z_{\ne i}) \propto (Z_{\ne i})^{-m} w^{-1-m}(1-w)^{-1+m}

where

m=TTc<1 . m=\frac{T}{T_c}<1\,.

Averaging over the random variable ZiZ_{\ne i} only affects the overall prefactor. We can compute it by simply noticing that iwi=1\sum_i w_i = 1 so that

01dw w P(w)=1 . \int_0^1 dw \, w \, P(w) = 1\,.

The normalization can be computed using the beta function. The final result is

P(w)=wm1(1w)m1Γ(m)Γ(1m) ,m=TTc . \boxed{ P(w)=\frac{w^{-m-1}(1-w)^{m-1}}{\Gamma(m)\Gamma(1-m)}\,, \qquad m=\frac{T}{T_c} \,.}

This is the main result. The expression of P(w)P(w) gives a very concrete picture of the low-temperature phase of the REM. First, notice that P(w)P(w) diverges near w0w\to 0 as

P(w)wm1(w0) P(w)\sim w^{-m-1}\quad (w\to 0)

so that

0ϵdw P(w)dw wm1=. \int_0^{\epsilon} d w\, P(w) \sim \int d w\, w^{-m-1} = \infty.

i.e. there are infinitely many configurations carrying extremely small Gibbs weights. At the same time, P(w)P(w) diverges near w=1w=1 as

P(w)(1w)m1(w1). P(w)\sim (1-w)^{m-1}\quad (w\to 1).

so that the expected number of configurations with weight between [1ϵ,1][1-\epsilon, 1] is, when ϵ\epsilon is small equal to

1ϵ1P(w)dwdw (1w)m1=ϵmm \int_{1-\epsilon}^{1} P(w) d w \propto \int dw \, (1-w)^{m-1} = \frac{\epsilon^m}{m}

This shows that a single configuration may carry a finite fraction of the whole Gibbs measure. This is the real signature of condensation: below TcT_c, the measure is no longer democratically spread over exponentially many states, but becomes strongly uneven, with a few configurations carrying a macroscopic fraction of the total Boltzmann measure and many others contributing only weakly. In other words, below TcT_c the measure is dominated by rare states sitting close to the lower edge of the energy spectrum.

These two singular behaviors therefore encode the full geometry of the frozen phase: a small number of dominant low-energy configurations coexist with a very large background of increasingly less important ones. The fact that p(w)p(w) is not integrable is not a pathology, but precisely the mathematical expression of this accumulation of arbitrarily small Gibbs weights. This is precisely the signature of condensation: the Gibbs measure is not evenly spread, but concentrated on a sparse random set of configurations.

Recovering the participation ratio

Plugging in the explicit form of P(w)P(w) in equation (10) we can recover the expected participatio ratio as

E[YN(β)]=1Γ(m)Γ(1m)01dw w1m(1w)m1. \mathbb E[Y_N(\beta)] = \frac{1}{\Gamma(m)\Gamma(1-m)} \int_0^1 dw\, w^{1-m}(1-w)^{m-1}.

Using again the Beta integrals the integral can be performed yielding to

E[YN(β)]=1m=1TTc. \mathbb E[Y_N(\beta)] = 1-m = 1-\frac{T}{T_c}.

We have so far found the participatio ratio to be

limNYN(β)={0,T>Tc,1TTc,T<Tc \lim_{N\to\infty} Y_N(\beta) = \begin{cases} 0, & T>T_c,\\ 1-\frac{T}{T_c}, & T < T_c \end{cases}

A finite participation ratio in the condensed phase signals that the Gibbs measure has condensed onto a small number of relevant low-energy states. This is why the low-temperature phase of the REM provides perhaps the simplest example of a condensed, or ''glassy'', Boltzmann measure.

The REM is special because its energies are independent, but the same structure survives in more complicated mean-field models where the energies are correlated. In particular, in the one-step replica-symmetry-breaking phase of pp-spin models, one recovers the same distribution P(w)P(w) we have derived here[1]. The REM thus captures, in its simplest possible form, a structure that persists far beyond the independent-energy setting

[1] For the readers expert in replica theory, the parameter mm appearing in the expression (28) of P(w)P(w) can be identified with the Parisi's breaking parameter.

References

[1] Derrida, Bernard, "Random-energy model: Limit of a family of disordered models", Physical Review Letters 45.2 (1980): 79.

[2] Gumbel, E. "Statistics of extremes (New York: Columbia Univ. press)." (1958).

© Enrico M. Malatesta. Last modified: March 22, 2026. Built with Franklin.jl.