Density Waves for Haar-Unitary Matrices: A Gaussian Free Field Limit and a Zeros-Only Reconstruction

Metadata

Total Words: 11,788
Export Date: 2026-01-17 06:09:34
Description: Farmer’s carrier-wave/density-wave picture attributes large values of |ζ(1/2 + it)| (and of random unitary characteristic polynomials) primarily to mesoscopic fluctuations in zero/eigenvalue density, rather than to isolated large gaps. We make this picture canonical in the CUE. For Haar-random U ∈ U(N) we study a mesoscopic density-wave observable built from low Fourier modes of the trace process, W_N, J(θ) = ∑_{m ≤ N/J}Re (Tr(U^m)e^−imθ)/m. Under J → ∞ and J = o(N), we prove W_N, J converges in distribution (as a random distribution) to the circle Gaussian free field. We then introduce a purely eigenangle-based discrepancy estimator D_N, J, directly mirroring Farmer’s neighbor-discrepancy density wave, and show that for the intrinsic window J ≍ log N it reconstructs W_N, J uniformly in θ. This provides a rigorous bridge between the ‘zeros-only’ and ‘random-field’ viewpoints on carrier waves, and supplies a foundation for future work on window universality, clustering of maxima, and fast simulation of zeta-like landscapes at astronomical effective heights.

1. Introduction and motivation: carrier waves as mesoscopic density fluctuations; why CUE is the right testbed; statement of results.
2. Background on CUE eigenangles, counting function S_U, and Sobolev spaces H^−s(𝕊¹); definition and properties of the circle GFF.
3. The trace/Fourier density wave W_N, J: invariances, covariance computation, and relation to log |Λ_U(e^iθ)| (regularized).
4. Mesoscopic Gaussianity for traces up to m ≤ N/J: Toeplitz determinant/strong Szegő route (or cumulant method for determinantal point processes) and proof of (i).
5. A zeros-only density-wave functional: defining neighbor discrepancies δ_j(θ) robustly on the circle; basic bounds and measurability; discussion of window choice J ≍ log N.
6. Converting discrepancies to a smoothed counting statistic: derive an explicit convolution representation of D_N, J in terms of S_U and a logarithmic kernel; error terms.
7. Identifying the convolution with the trace field: Fourier analysis linking K_J * S_U to W_N, J; proof of (ii) including uniform-in-θ control (with a small exclusion near eigenangles if needed).
8. Numerical illustrations and sanity checks: simulations for moderate N showing D_N, J ≈ W_N, J and GFF-like covariance; (flagged as computational).
9. Discussion and outlook: canonical carrier wave observables; relation to Farmer’s density-wave plots; implications for extremes and for zeta transfer via the recipe/ratios conjecture.

Content

1. Introduction and motivation: carrier waves as mesoscopic density fluctuations; why CUE is the right testbed; statement of results.

The eigenangles of a Haar unitary form one of the canonical models of a rigid but random point configuration on the circle. At macroscopic scale the empirical measure is essentially uniform, while at microscopic scale the local statistics are governed by the sine kernel. Between these regimes lies the range, where one averages over many spacings but still probes a window whose length vanishes as N → ∞. Our purpose is to isolate, in this intermediate range, a simple random field capturing the dominant fluctuations of the eigenangle density, and to identify its universal limiting object.

A convenient starting point is the observation that the CUE characteristic polynomial
$$ Z_U(\theta):=\det(I-e^{-i\theta}U)=\prod_{j=1}^N \bigl(1-e^{i(\theta_j-\theta)}\bigr) $$
encodes the eigenangles simultaneously as its zeros on 𝕊¹. The logarithm of |Z_U(θ)| is a log-correlated object: its covariance is approximately $-\tfrac12\log|e^{i\theta}-e^{i\phi}|$ after removing the mean mode, which is the covariance kernel of the circle Gaussian free field. This is the sense in which the CUE logarithmic characteristic polynomial is expected to approximate a Gaussian free field at scales not too fine. The viewpoint that there is a slowly varying governing a large portion of the fluctuation of a logarithmic object, with finer-scale corrections superimposed, has appeared repeatedly in the study of log |ζ(1/2 + it)| and, in the random matrix setting, in log |Z_U| and its derivatives. In the CUE context one can make this idea precise by working in Fourier space and truncating to a mesoscopic band of frequencies.

The natural Fourier statistics of the eigenangles are the traces $\Tr(U^m)=\sum_{j=1}^N e^{i m \theta_j}$. They are the Fourier coefficients of the (non-centered) empirical measure, and they appear explicitly in the Fourier expansion of log Z_U through the identity
$$ \log Z_U(\theta)= -\sum_{m\ge 1} \frac{\Tr(U^m)e^{-im\theta}}{m}, $$
valid away from the eigenangles (with the usual choice of branch and an appropriate regularization). Thus, if one truncates this series at some large frequency M_N, one obtains a smoothened random field built from $\Tr(U^m)$ for m ≤ M_N. On the other hand, the global dependence of $\Tr(U^m)$ on all eigenangles makes it nontrivial that such a truncation yields a limit in any functional topology, or that it can be reconstructed from more local (zeros-only) information.

The first theme of this work is that, provided M_N = o(N), the family of traces up to order M_N behaves asymptotically like an independent Gaussian family after the normalization $m^{-1/2}\Tr(U^m)$. This phenomenon goes back to classical results of Diaconis–Shahshahani and Diaconis–Evans and admits several modern proofs. For our purposes it is convenient to phrase it at the level of characteristic functionals of linear combinations of traces, uniformly over families of coefficients supported on {1, …, M_N}. In such a form one can pass from finite-dimensional Gaussianity to convergence of random Fourier series, and hence to the convergence in law of a random field in a negative Sobolev space. The relevant Sobolev topology is dictated by the fact that the limiting field is a distribution rather than a function: the circle Gaussian free field has Fourier coefficients of size |m|^−1/2, so it lies almost surely in H^−s(𝕊¹) for every s > 0 and in no positive Sobolev space. Accordingly, we view mesoscopic density fluctuations as an element of H^−s for arbitrary s > 0.

The resulting mesoscopic field, which we construct as a weighted cosine series with coefficients $\Re(\Tr(U^m)e^{-im\theta})/m$ truncated at m ≤ M_N, should be interpreted as a : it is the component of the centered eigenvalue counting function lying in a frequency band below M_N. In physical terms, the cutoff removes oscillations at or below the microscopic scale 1/N and retains only fluctuations averaged over arcs containing J(N) → ∞ eigenvalues. The parameter J(N) is therefore a mesoscopic window size, and the relation M_N ∼ N/J(N) expresses the duality between spatial averaging and frequency truncation. The condition J(N) = o(N) is precisely what ensures that the retained frequencies satisfy m = o(N), the range where Gaussianity of traces is available with the strength required for field convergence. Under this regime, the covariance kernel of the truncated field is the truncated logarithmic kernel $\frac12\sum_{m\le M_N}\cos(m(\theta-\phi))/m$, and letting M_N → ∞ yields the covariance of the mean-zero circle GFF. Thus the mesoscopic density wave converges, in distribution, to the GFF in H^−s.

The second theme concerns a different representation of the same mesoscopic object, one that uses only the eigenangles themselves and no traces. The motivation here comes from ``zeros-only’’ principles in analytic number theory, where one seeks to express objects defined by primes (or Euler products) purely in terms of zeros, and conversely. In the random matrix setting, traces play the role of prime sums and eigenangles play the role of zeros. Since $\Tr(U^m)$ is a global Fourier statistic, it is natural to ask whether the mesoscopic density wave can be reconstructed from local geometric information about the point configuration, such as neighbor spacings.

To that end we consider symmetric neighbor discrepancies: for a given angle θ we look at the jth point to the right and the jth point to the left (in cyclic order) and compare the actual arc length spanned by these 2j points to its uniform expectation 2π ⋅ (2j)/N. After unfolding by N/(2π), the centered discrepancy δ_j(θ) measures the excess or deficit of eigenangles in a symmetric neighborhood of θ at scale j/N. A single discrepancy δ_j is noisy, but a logarithmically weighted sum ∑_j ≥ Jδ_j(θ)/j exhibits a stabilizing effect reminiscent of harmonic summation, and it is precisely this weighting that matches the 1/m weights in the trace-based Fourier truncation. The resulting field, defined solely from the eigenangle configuration, is a candidate zeros-only density wave.

The main comparison mechanism is an identity of summation-by-parts type connecting the inverse local counting map (which sends a rank increment to a spatial increment) to an integrated error of the counting function. Concretely, one may rewrite the neighbor radius θ_+j(θ) − θ_−j(θ) in terms of an integral involving the centered counting function S_U and a piecewise-linear kernel depending on j. Summing over j then produces a convolution of S_U (or its distributional derivative) against an explicit kernel K_J whose Fourier coefficients approximate 1/|m| on the band 1 ≤ |m| ≤ M_N and decay rapidly outside. Once this reduction is established, Fourier inversion identifies the convolution term with the trace field truncated at M_N. The remaining task is to control the approximation errors uniformly in θ, which is delicate near eigenangles because the counting function has jumps and the inverse counting map has points of non-smoothness.

The strength of the comparison depends on how rapidly J(N) grows. For convergence in distribution in a negative Sobolev topology, it suffices that J(N) → ∞ while J(N) = o(N), since then the relevant kernels are sufficiently regular at the level of H^−s and the exceptional behavior near eigenangles is negligible in that topology. For a uniform-in-θ comparison, however, we need quantitative control of the remainder term and of the stability of the inverse counting map under perturbations of S_U. This is where taking J(N) of logarithmic order is natural: the CUE number variance and rigidity bounds imply that counting errors on arcs containing ≍ log N points are small enough (with high probability) to guarantee that the discrepancy-based reconstruction tracks the trace-based field uniformly. In this regime one can also isolate tiny neighborhoods of eigenangles, or adopt a right-continuity convention, to avoid ambiguous pointwise definitions without changing the limit.

We emphasize that the CUE is the appropriate testbed for these statements for two reasons. First, the eigenangles form a determinantal point process with an explicit kernel, which provides sharp control of linear statistics, number variance, and cumulants. Second, Haar measure yields exact algebraic identities for trace moments and Toeplitz determinant representations for exponential linear statistics; both are well adapted to proving asymptotic Gaussianity in the range m = o(N). While our arguments are presented in the CUE setting, the structure of the conclusions is not specific to unitary symmetry: the appearance of the GFF reflects the universal logarithmic correlations of mesoscopic eigenvalue fluctuations, and the possibility of zeros-only reconstruction reflects the general principle that sufficiently averaged local spacing information encodes low-frequency density modes.

In summary, we establish two complementary descriptions of the same mesoscopic density wave. One description is spectral and uses traces, leading directly to a random Fourier series and hence to the circle GFF limit in H^−s. The other description is geometric and uses only neighbor discrepancies, yielding a zeros-only estimator which, at logarithmic mesoscopic scale, approximates the trace field uniformly. Having motivated these objects and their roles, we now turn to the necessary background: the CUE eigenangle process and its centered counting function, the basic features of Sobolev spaces on 𝕊¹ used to encode distributional convergence, and the definition and elementary properties of the mean-zero circle Gaussian free field.

2. Background on CUE eigenangles, counting function S_U, and Sobolev spaces H^−s(𝕊¹); definition and properties of the circle GFF.

Let U ∼ Haar(U(N)) and write its eigenvalues as e^iθ₁, …, e^iθ_N with
0 ≤ θ₁ < ⋯ < θ_N < 2π.
The joint density of the eigenangles with respect to Lebesgue measure on [0, 2π)^N is given by the classical Weyl formula,
$$ \frac{1}{(2\pi)^N N!}\prod_{1\leq j<k\leq N}\bigl|e^{i\theta_j}-e^{i\theta_k}\bigr|^2, $$
which makes the strong repulsion between nearby points manifest. It is convenient to view the random configuration as the point process
$$ \Xi_U:=\sum_{j=1}^N \delta_{\theta_j} $$
on 𝕊¹ ≃ ℝ/2πℤ. A basic structural fact is that Ξ_U is a determinantal point process with correlation kernel

More precisely, the k-point correlation functions of Ξ_U are
ρ_k(θ₁, …, θ_k) = det (K_N(θ_a, θ_b))_{1 ≤ a, b ≤ k}.
Two immediate consequences will be repeatedly used: first, rotation invariance (Haar invariance) implies that the law of the configuration is invariant under the shift θ_j ↦ θ_j + α mod 2π; second, the determinantal structure provides sharp information on the variance of linear statistics and on concentration of counting functions.

Given an arc I ⊂ 𝕊¹ of length |I|, the expected number of points in I is
$$ \mathbb{E}\,\Xi_U(I)=\int_I K_N(\theta,\theta)\,d\theta=\frac{N}{2\pi}|I|. $$
The fluctuation size of Ξ_U(I) depends on the scale of I. For our mesoscopic purposes we need only the qualitative picture that for arcs containing many points one has sublinear fluctuations. Concretely, determinantal methods yield a number variance of logarithmic order on mesoscopic arcs: for |I| bounded away from 0 and 2π one has Var(Ξ_U(I)) ≍ log N, and more generally the variance can be bounded in terms of K_N by standard determinantal identities. These statements encode a form of rigidity of the CUE configuration: compared to a Poisson process, the counting function is much more stable.

We work with the centered counting function
$$ S_U(\theta):=\#\{j:0\leq \theta_j\leq \theta\}-\frac{N\theta}{2\pi},\qquad \theta\in[0,2\pi], $$
and extend S_U 2π-periodically to ℝ. Since #{j : 0 ≤ θ_j ≤ 0} = 0 and #{j : 0 ≤ θ_j ≤ 2π} = N, we have S_U(0) = S_U(2π) = 0, so the periodic extension is consistent. The function S_U is càdlàg, piecewise affine with slope −N/(2π) and upward jumps of size 1 at the eigenangles.

It is natural to differentiate S_U in the sense of distributions. Let dS_U denote its (periodic) distributional derivative, which is a signed measure on 𝕊¹. A direct computation shows

i.e. the centered empirical measure. Thus, for any smooth 2π-periodic test function f,

Equivalently, by integration by parts on the circle,

whenever f is C¹ and periodic.

Fourier analysis provides a convenient encoding of . For a (signed) measure μ on 𝕊¹ we use the convention
μ̂(m) := ∫_𝕊¹e^−imθ μ(dθ), m ∈ ℤ.
Letting $\mu_U:=\sum_{j=1}^N\delta_{\theta_j}$, we have
$$ \widehat{\mu_U}(m)=\sum_{j=1}^N e^{-im\theta_j}=\mathrm{Tr}(U^{-m})=\overline{\mathrm{Tr}(U^m)}. $$
From , it follows that $\widehat{dS_U}(0)=0$ and, for m ≠ 0,

Since dS_U is the derivative of S_U, we also have the identity

where $\widehat{S_U}(m)$ is now understood in the distributional sense (for instance by pairing S_U against e^−imθ). Formula is the basic bridge between the counting function and trace statistics; it will allow us to identify certain convolutions of dS_U with explicit kernels as Fourier series built from Tr(U^m).

Two further elementary remarks will be useful. First, by rotation invariance, the process dS_U is stationary on 𝕊¹ (mean-zero and shift invariant). Second, the total mass of dS_U vanishes, reflecting that S_U has zero mean mode. This is the reason that the limiting Gaussian field will also be taken mean-zero.

We now recall the Sobolev scale on the circle and fix conventions for Fourier coefficients. For an L² function f on 𝕊¹ we write its Fourier series as
$$ f(\theta)=\sum_{m\in\mathbb{Z}} \widehat{f}(m)e^{im\theta}, \qquad \widehat{f}(m)=\frac{1}{2\pi}\int_0^{2\pi} f(\theta)e^{-im\theta}\,d\theta. $$
For s ∈ ℝ, the Sobolev space H^s(𝕊¹) consists of those distributions f whose Fourier coefficients satisfy
∥f∥_H^s² := ∑_{m ∈ ℤ}(1 + |m|²)^s |f̂(m)|² < ∞.
When s ≥ 0 this agrees with the standard L²-based Sobolev space; when s < 0 it defines a Hilbert space of distributions. We write H₀^s(𝕊¹) for the mean-zero subspace f̂(0) = 0.

The duality pairing between H^−s and H^s (with s > 0) may be realized through Fourier coefficients: if T ∈ H^−s and f ∈ H^s, then

is absolutely convergent and satisfies |⟨T, f⟩| ≤ ∥T∥_H^−s∥f∥_H^s. In particular, random objects with Fourier coefficients of size |m|^−1/2 naturally live in H^−s for every s > 0 but not in L².

We will use this framework to encode convergence in law of random fields that are defined either by Fourier series (as in the trace-based construction) or as convolutions with dS_U (as in the discrepancy-based construction). The basic scheme is as follows: if (T_N) is a sequence of random elements of H^−s and T is a limiting random element of H^−s, then T_N ⇒ T in H^−s means that for every bounded continuous functional F : H^−s → ℝ we have 𝔼F(T_N) → 𝔼F(T). In practice we verify this by showing convergence of finite-dimensional distributions
(⟨T_N, f₁⟩, …, ⟨T_N, f_k⟩) ⇒ (⟨T, f₁⟩, …, ⟨T, f_k⟩)
for suitable test functions f_ℓ ∈ H^s, together with tightness in H^−s. The tightness step reduces to uniform control of 𝔼∥T_N∥_H^−s², which in Fourier terms is an estimate on $\mathbb{E}|\widehat{T_N}(m)|^2$ with weights (1 + |m|²)^−s.

The limiting object for our mesoscopic density wave is the mean-zero Gaussian free field on 𝕊¹, which we denote by 𝒢. We take as definition the centered Gaussian random distribution whose Fourier coefficients satisfy

Equivalently, one may realize 𝒢 as the real random Fourier series

where (a_m, b_m)_m ≥ 1 are i.i.d. standard real Gaussians. The series does not converge pointwise; it converges almost surely in H^−s(𝕊¹) for every s > 0, and in no H^t with t ≥ 0. Indeed,
$$ \mathbb{E}\|\mathcal{G}\|_{H^{-s}}^2 =\sum_{m\neq 0}(1+|m|^2)^{-s}\,\mathbb{E}|\widehat{\mathcal{G}}(m)|^2 \asymp \sum_{m\ge 1}\frac{1}{m^{1+2s}}<\infty\qquad (s>0), $$
which implies 𝒢 ∈ H^−s almost surely by standard Gaussian Hilbert space arguments.

The covariance kernel of 𝒢 can be written in several equivalent ways. For test functions f, g ∈ H₀^s with s > 0, one has

Formally, this corresponds to the logarithmic kernel on the circle. Indeed, if we define the mean-zero Green kernel
$$ G(\theta):=\sum_{m\neq 0}\frac{e^{im\theta}}{2|m|}=\sum_{m\ge 1}\frac{\cos(m\theta)}{m}, $$
then G(θ) is (up to an additive constant) equal to −log |2sin (θ/2)| for θ ≢ 0 (mod 2π), and may be rewritten as
Cov(⟨𝒢, f⟩, ⟨𝒢, g⟩) = ∫_𝕊¹∫_𝕊¹f(θ) G(θ − ϕ) g(ϕ) dθ dϕ,
with the understanding that the integral is interpreted via Fourier series (since G has a logarithmic singularity).

For later comparison with mesoscopic truncations, we also record the standard Fourier cutoff regularization. Define
$$ \mathcal{G}^{(\le M)}(\theta):=\sum_{m=1}^{M}\frac{1}{\sqrt{m}}\bigl(a_m\cos(m\theta)+b_m\sin(m\theta)\bigr). $$
Then 𝒢^( ≤ M) is a smooth random function and 𝒢^( ≤ M) → 𝒢 in H^−s almost surely and in L²(Ω; H^−s) for every s > 0. Its covariance kernel is the truncated logarithmic sum
$$ \mathbb{E}\bigl[\mathcal{G}^{(\le M)}(\theta)\mathcal{G}^{(\le M)}(\phi)\bigr] =\frac12\sum_{m=1}^{M}\frac{\cos(m(\theta-\phi))}{m}, $$
which converges (as M → ∞) to the logarithmic kernel away from the diagonal and in the distributional sense on 𝕊¹ × 𝕊¹ after removing the mean mode. This truncated kernel is the one that will arise from our trace-based density wave once we identify its Fourier coefficients and compute their asymptotic second moments in the regime M = o(N).

Having fixed these conventions, we may now treat the mesoscopic trace field W_N, J as a random element of H^−s(𝕊¹) and compare it directly to 𝒢 via Fourier coefficients and characteristic functionals. We proceed next to the basic properties of W_N, J, beginning with its symmetries and covariance structure and its relation to regularized versions of the logarithm of the characteristic polynomial.

3. The trace/Fourier density wave W_N, J: invariances, covariance computation, and relation to log |Λ_U(e^iθ)| (regularized).

We recall that for a mesoscopic parameter J = J(N) with J → ∞ and J = o(N) we set M_N := ⌊N/J⌋ and define the real-valued trigonometric polynomial
$$ W_{N,J}(\theta):=\sum_{m=1}^{M_N}\frac{\Re\!\big(\Tr(U^m)e^{-im\theta}\big)}{m},\qquad \theta\in\mathbb{S}^1. $$
Since W_N, J is a finite Fourier series, it is a smooth 2π-periodic function for each fixed N.

The observable W_N, J depends only on the spectrum of U. Indeed, writing λ_j = e^iθ_j we have $\Tr(U^m)=\sum_{j=1}^N \lambda_j^m=\sum_{j=1}^N e^{im\theta_j}$, hence

Thus W_N, J is a linear statistic of the eigenangles with the (singular, but truncated) test kernel ∑_{m ≤ M_N}cos (mt)/m evaluated at t = θ_j − θ.

We record the stationarity (rotation invariance) of the field. For any α ∈ ℝ, Haar invariance implies that e^iαU has the same law as U. Since $\Tr((e^{i\alpha}U)^m)=e^{im\alpha}\Tr(U^m)$, we obtain the identity in law
$$ W_{N,J}^{(e^{i\alpha}U)}(\theta)=\sum_{m\le M_N}\frac{\Re\!\big(e^{im\alpha}\Tr(U^m)e^{-im\theta}\big)}{m} =W_{N,J}^{(U)}(\theta-\alpha). $$
Consequently, as a random field on 𝕊¹, W_N, J is stationary: (W_N, J(θ₁ + α), …, W_N, J(θ_k + α)) has the same law as (W_N, J(θ₁), …, W_N, J(θ_k)).

We also note that $\mathbb{E}\Tr(U^m)=0$ for every fixed m ≥ 1 (for instance by multiplying U by a scalar e^iα and averaging over α), and hence 𝔼W_N, J(θ) = 0 for all θ.

It is useful to identify the Fourier coefficients of W_N, J in the conventions fixed earlier. Expanding the real part,
$$ \Re\!\big(\Tr(U^m)e^{-im\theta}\big)=\frac12\Big(\Tr(U^m)e^{-im\theta}+\overline{\Tr(U^m)}\,e^{im\theta}\Big) =\frac12\Big(\Tr(U^m)e^{-im\theta}+\Tr(U^{-m})e^{im\theta}\Big). $$
Therefore

In particular, for m ∈ ℤ \ {0},

Comparing with , we see that for 1 ≤ |m| ≤ M_N,
$$ \widehat{W_{N,J}}(m)=\frac{1}{2|m|}\,\widehat{dS_U}(m). $$

This admits a compact formulation in terms of convolution with dS_U. Let K_M be the 2π-periodic kernel defined by its Fourier coefficients

so that

For a finite signed measure μ on 𝕊¹ we use the standard periodic convolution
(K_M * μ)(θ) := ∫_𝕊¹K_M(θ − ϕ) μ(dϕ),
in which case $\widehat{K_M*\mu}(m)=2\pi\,\widehat{K_M}(m)\widehat{\mu}(m)$. Applying this with μ = dS_U and using –, we obtain for 1 ≤ |m| ≤ M,
$$ \widehat{K_M*dS_U}(m)=2\pi\cdot \frac{1}{4\pi |m|}\cdot \widehat{dS_U}(m)=\frac{1}{2|m|}\widehat{dS_U}(m), $$
and the coefficient vanishes for |m| > M as well as at m = 0. Thus, with M = M_N,

This representation will be the point of contact with the discrepancy-based construction later on: it isolates the universal kernel K_{M_N} and separates it from the random input dS_U.

We next compute the covariance kernel of W_N, J. The main input is the classical second-moment identity for CUE traces: for integers m, n ≥ 1,

and $\mathbb{E}[\Tr(U^m)\Tr(U^n)]=0$ for m, n ≥ 1. Identity may be proved by representation-theoretic methods (Diaconis–Shahshahani) or by determinantal calculations for Fourier modes of the empirical measure.

Using and bilinearity, we write
$$ \mathbb{E}\big[W_{N,J}(\theta)W_{N,J}(\phi)\big] =\sum_{m,n\le M_N}\frac{1}{m n}\, \mathbb{E}\Big[\Re\!\big(\Tr(U^m)e^{-im\theta}\big)\Re\!\big(\Tr(U^n)e^{-in\phi}\big)\Big]. $$
Expanding the real parts and using that mixed moments without conjugation vanish, we obtain

hence

Since M_N = ⌊N/J⌋, we have min (m, N) = m for all m ≤ M_N once N is large enough (and in any case min (m, N) = m + O(N/J − N)⁺), so is asymptotic to the truncated logarithmic sum

uniformly in θ, ϕ ∈ 𝕊¹. We isolate this as a lemma for later reference.

In particular, setting θ = ϕ yields

so the typical amplitude of W_N, J at a point is of order $\sqrt{\log M_N}$.

The covariance matches exactly the covariance of the Fourier cutoff regularization of the mean-zero circle GFF at frequency M_N. Since ∑_m ≥ 1cos (mt)/m is the mean-zero Green kernel on 𝕊¹ (equal to −log |2sin (t/2)| up to an additive constant away from t ≡ 0), the truncated sum may be viewed as a regularized logarithmic kernel. This already suggests that as M_N → ∞ one should obtain a log-correlated Gaussian limit; the remaining task, carried out in the next section, is to justify Gaussianity and tightness in negative Sobolev topologies.

We finally explain the standard connection to the characteristic polynomial. Let
$$ \Lambda_U(z):=\det(I-zU)=\prod_{j=1}^N (1-ze^{i\theta_j}),\qquad z\in\mathbb{C}. $$
For |z| < 1 we may expand log (1 − ze^iθ_j) into its absolutely convergent power series and sum over j, obtaining the identity

where the logarithm is the principal branch near z = 0. Taking real parts, for z = re^−iθ with 0 ≤ r < 1,

Thus the infinite series built from $\Re(\Tr(U^m)e^{-im\theta})/m$ is precisely (minus) the harmonic regularization of log |Λ_U(e^iθ)| obtained by radially approaching the unit circle. The field W_N, J is the sharp-frequency analogue of : it replaces the Poisson weights r^m by a cutoff m ≤ M_N and discards higher modes.

Equivalently, one may interpret W_N, J as the logarithmic potential generated by the centered empirical measure, with a Fourier cutoff removing the singularity at the diagonal. Indeed, combining with and , we may rewrite
$$ W_{N,J}(\theta)=\int_{\mathbb{S}^1} K_{M_N}(\theta-\phi)\,\Big(\sum_{j=1}^N\delta_{\theta_j}-\frac{N}{2\pi}d\phi\Big), $$
so W_N, J is a centered (mean-zero) logarithmic potential field, regularized by the Fourier cutoff in K_{M_N}. Since K_{M_N}(t) converges in the distributional sense (modulo constants) to the Green kernel t ↦ −log |2sin (t/2)|, the observable W_N, J may be viewed as a mesoscopic approximation to the log-correlated field naturally associated with the eigenangles.

This perspective clarifies why W_N, J is the appropriate object for comparison with the circle GFF: the latter is the canonical centered Gaussian log-correlated distribution on 𝕊¹, and shows that the covariance of W_N, J already matches the GFF covariance at the level of Fourier cutoffs. In the next section we justify that the higher joint moments (or, equivalently, characteristic functionals against test functions) are asymptotically Gaussian for modes m ≤ M_N = o(N), which leads to the distributional convergence in H^−s(𝕊¹).

4. Mesoscopic Gaussianity for traces up to m ≤ N/J: Toeplitz determinant/strong Szegő route (or cumulant method for determinantal point processes) and proof of (i).

Throughout this section we assume M_N = o(N), as is the case when M_N = ⌊N/J⌋ with J → ∞ and J = o(N). Our objective is twofold: first, to justify that the collection of trace modes $(\Tr(U^m))_{1\leq m\leq M_N}$ behaves asymptotically as an independent Gaussian family after normalization; second, to deduce from this that the field W_N, J converges in law to the circle GFF in negative Sobolev topology.

Let ψ : 𝕊¹ → ℝ be a real trigonometric polynomial of degree at most M_N,
$$ \psi(\theta)=\sum_{1\leq |m|\leq M_N} c_m e^{im\theta},\qquad c_{-m}=\overline{c_m}. $$
We write the corresponding linear statistic of the eigenangles as
$$ L_N(\psi):=\sum_{j=1}^N \psi(\theta_j). $$
A basic identity (Heine–Szeg) expresses the Laplace transform of L_N(ψ) as a Toeplitz determinant:

where T_N(e^ψ) is the N × N Toeplitz matrix with symbol e^ψ, i.e. $(T_N(e^\psi))_{k\ell}=\widehat{e^\psi}(k-\ell)$ in our Fourier conventions. We emphasize that is exact and uses only that the CUE eigenangles form a determinantal point process with the projection kernel onto Fourier modes {0, 1, …, N − 1}.

The relevance of is that Toeplitz determinants admit precise asymptotics when the symbol is sufficiently regular. In the present mesoscopic regime the symbol depends on N through its Fourier support M_N, and we will use a strong Szeg-type estimate uniform for symbols whose Fourier support is o(N). Concretely, writing
ψ̂(m) = c_m, ψ̂(0) = 0,
the strong Szeg formula suggests
$$ \log \det\big(T_N(e^\psi)\big)=N\widehat{\psi}(0)+\sum_{m\geq 1} m\,\widehat{\psi}(m)\widehat{\psi}(-m)+o(1) =\sum_{m=1}^{M_N} m |c_m|^2 + o(1), $$
provided ψ remains in a bounded set of an appropriate Sobolev class. We record a version tailored to our setting.

We now translate Lemma~ into asymptotic Gaussianity of traces. Observe that
$$ L_N(\psi) =\sum_{j=1}^N \sum_{1\leq |m|\leq M_N} c_m e^{im\theta_j} =\sum_{m=1}^{M_N}\Big(c_m \Tr(U^m)+c_{-m}\Tr(U^{-m})\Big) =2\Re\sum_{m=1}^{M_N} c_m \Tr(U^m). $$
Thus Lemma~ describes the joint Laplace transforms of linear combinations of $(\Tr(U^m))_{m\leq M_N}$.

Fix k ≥ 1 and distinct integers 1 ≤ m₁ < ⋯ < m_k ≤ M_N. For complex coefficients (t₁, …, t_k) define
$$ \psi(\theta):=\sum_{\ell=1}^k \Big(t_\ell e^{i m_\ell \theta}+\overline{t_\ell}e^{-i m_\ell \theta}\Big), $$
so that c_{m_ℓ} = t_ℓ and $c_{-m_\ell}=\overline{t_\ell}$. Then
$$ L_N(\psi)=2\Re\sum_{\ell=1}^k t_\ell \Tr(U^{m_\ell}). $$
Applying Lemma~ gives

uniformly for t_ℓ in bounded sets. The right-hand side is precisely the Laplace transform of a vector of independent complex Gaussians (Z_{m_ℓ})_ℓ = 1^k with 𝔼Z_{m_ℓ} = 0 and 𝔼|Z_{m_ℓ}|² = m_ℓ. In particular, yields the following.

We now deduce the distributional convergence of the field W_N, J in negative Sobolev topology. Since W_N, J is a trigonometric polynomial supported on frequencies |m| ≤ M_N, it suffices to test against f ∈ H^s(𝕊¹) with zero mean and identify the limiting law of ⟨W_N, J, f⟩.

Let f(θ) = ∑_{m ∈ ℤ}f̂(m)e^imθ with f̂(0) = 0. Using the Fourier expansion of W_N, J and orthogonality of exponentials, we obtain

For real-valued f this simplifies to

Thus ⟨W_N, J, f⟩ is a finite linear combination of the trace modes $\Tr(U^m)$ with coefficients of size f̂(m)/m.

To apply Proposition~ uniformly, we note that f ∈ H^s implies
$$ \sum_{m\in\mathbb{Z}} (1+m^2)^{s}|\widehat{f}(m)|^2<\infty, \qquad\text{hence}\qquad \sum_{m=1}^{\infty} m\,\Big|\frac{\widehat{f}(m)}{m}\Big|^2 =\sum_{m=1}^\infty \frac{|\widehat{f}(m)|^2}{m}<\infty, $$
since s > 0 implies |f̂(m)|² ≪ m^{−2s − 1} and therefore |f̂(m)|²/m is summable.

Define a real trigonometric polynomial ψ_f (depending on N through the cutoff M_N) by
$$ \psi_f(\theta):=\sum_{m=1}^{M_N}\frac{\pi}{m}\Big(\widehat{f}(m)e^{im\theta}+\widehat{f}(-m)e^{-im\theta}\Big). $$
Then reads
⟨W_N, J, f⟩ = L_N(ψ_f).
Lemma~ applies since
$$ \sum_{m=1}^{M_N} m \Big|\frac{\pi\widehat{f}(m)}{m}\Big|^2 =\pi^2\sum_{m=1}^{M_N}\frac{|\widehat{f}(m)|^2}{m} \leq \pi^2\sum_{m=1}^{\infty}\frac{|\widehat{f}(m)|^2}{m}<\infty, $$
uniformly in N. Therefore, for each fixed f (and jointly for finitely many such f), ⟨W_N, J, f⟩ converges to a centered Gaussian with variance determined by the quadratic form ∑_m ≥ 1|f̂(m)|²/m in the normalization consistent with the covariance computed previously for W_N, J. This identifies the limiting finite-dimensional distributions with those of the mean-zero circle GFF in our normalization.

It remains to upgrade convergence of finite-dimensional distributions to convergence in law in H^−s(𝕊¹) for any s > 0. We use the standard criterion based on uniform bounds on Sobolev norms. Writing the Fourier coefficients of W_N, J as in , we have $\widehat{W_{N,J}}(0)=0$ and $\widehat{W_{N,J}}(m)=0$ for |m| > M_N. For s > 0,
$$ \|W_{N,J}\|_{H^{-s}}^2 :=\sum_{m\in\mathbb{Z}} (1+m^2)^{-s}\,|\widehat{W_{N,J}}(m)|^2 =\sum_{1\leq |m|\leq M_N} (1+m^2)^{-s}\,|\widehat{W_{N,J}}(m)|^2. $$
Taking expectations and using $\mathbb{E}|\Tr(U^m)|^2=\min(m,N)$, we obtain for N large (so that min (m, N) = m for m ≤ M_N),

Since (1 + m²)^−s ≤ m^−2s for m ≥ 1 and ∑_m ≥ 1m^{−1 − 2s} < ∞ whenever s > 0, the right-hand side is bounded uniformly in N. Hence (W_N, J)_N ≥ 1 is tight in H^−s(𝕊¹).

Combining tightness with the convergence of the finite-dimensional distributions ⟨W_N, J, f⟩ for f ∈ H^s yields the claimed convergence in law in H^−s. This completes the proof of part~(i) of the main theorem.

We now turn to a complementary question: how much of the field W_N, J can be reconstructed from the eigenangle configuration alone without access to traces, by using only symmetric neighbor discrepancies on mesoscopic scales.

5. A zeros-only density-wave functional: defining neighbor discrepancies δ_j(θ) robustly on the circle; basic bounds and measurability; discussion of window choice J ≍ log N.

Our aim in part~(ii) is to construct, from the eigenangle configuration alone, an observable which approximates the trace-based field W_N, J. The input we allow ourselves is purely ``geometric’’: for each point θ ∈ 𝕊¹ we look at the locations of the jth eigenangles immediately to the right and to the left of θ in the cyclic order, and we compare the resulting symmetric arc length to its deterministic value 4πj/N. The corresponding normalized error is the δ_j(θ), and the zeros-only density wave is a harmonic sum of these discrepancies over mesoscopic scales j ≥ J.

A minor issue is that the neighbor maps θ ↦ θ_±j(θ) are not canonically defined at the eigenangles themselves (or, more generally, on a null set where ``ties’’ may occur if one insists on working purely modulo 2π). We therefore begin by fixing a robust circle convention and by recording elementary measurability and deterministic bounds. These points are logically separate from the probabilistic estimates used later, but they clarify that D_N, J is an honest random field, canonically constructed from {θ₁, …, θ_N}.

We work with a fixed realization 0 ≤ θ₁ < ⋯ < θ_N < 2π and introduce a 2π-periodic lift to ℝ by setting
θ̃_k := θ_k, 1 ≤ k ≤ N, and θ̃_k + N := θ̃_k + 2π, k ∈ ℤ.
For each θ ∈ ℝ we define the (right-continuous) index
n(θ) := max {k ∈ ℤ: θ̃_k ≤ θ},
so that θ̃_n(θ) ≤ θ < θ̃_{n(θ) + 1} for all θ which are not themselves eigenangles, and at θ = θ̃_k we have n(θ) = k (this is the right-continuity convention). For integers j ≥ 1 we then set the lifted neighbors
θ̃_+j(θ) := θ̃_{n(θ) + j}, θ̃_−j(θ) := θ̃_{n(θ) − j + 1},
and define θ_±j(θ) ∈ 𝕊¹ as their images modulo 2π. With this choice, the arc from the left neighbor to the right neighbor is represented by the genuine real interval
[θ̃_−j(θ), θ̃_+j(θ)] ⊂ ℝ,
and its length is simply θ̃_+j(θ) − θ̃_−j(θ) > 0. In particular, for 1 ≤ j ≤ ⌊N/2⌋ we always have
0 < θ̃_+j(θ) − θ̃_−j(θ) < 2π,
since the index gap is (n(θ) + j) − (n(θ) − j + 1) = 2j − 1 < N and thus does not cross a full period in the lifted sequence. This is one reason for later restricting j to at most a fixed fraction of N (we shall use ⌊N/4⌋).

For 1 ≤ j ≤ ⌊N/4⌋ and θ ∈ ℝ we define the symmetric j-neighbor discrepancy by

The normalization is chosen so that in the perfectly rigid ``picket fence’’ configuration θ_k = 2πk/N one has θ̃_+j(θ) − θ̃_−j(θ) = 4πj/N and hence δ_j(θ) ≡ 0. In a random configuration, δ_j(θ) records an over-density (negative discrepancy) or under-density (positive discrepancy) of eigenangles around θ on the symmetric scale of approximately 2j points.

We then define, for an integer cutoff J ≥ 1,

This is the zeros-only density-wave functional appearing in part~(ii). The lower cutoff J will be taken to infinity with N; the upper cutoff ⌊N/4⌋ is inessential (any fixed fraction < N/2 would do) and is imposed to avoid large symmetric windows that nearly cover the entire circle, where later summation-by-parts manipulations become notationally less stable.

The definition above yields, for each fixed eigenangle configuration, a function θ ↦ δ_j(θ) which is 2π-periodic and right-continuous with left limits. Indeed, n(θ) is a right-continuous step function with jumps of size 1 at each eigenangle (and constant on each interval between successive eigenangles); hence θ̃_±j(θ) are also right-continuous step functions, and so is their difference. Consequently δ_j is piecewise constant on the connected components of ℝ \ {θ̃_k : k ∈ ℤ}, with jumps only at eigenangles, and the same is true of D_N, J.

The dependence on the randomness is purely through the ordered list (θ₁, …, θ_N). More precisely, if we view θ ↦ D_N, J(θ) as an element of the Skorokhod space D(𝕊¹) of c`adl`ag functions on the circle (or simply as a bounded measurable function on 𝕊¹), then the mapping
(θ₁, …, θ_N) ↦ D_N, J( ⋅ )
is Borel measurable.

In applications we will occasionally want to take suprema over θ. Since δ_j (and hence D_N, J) may jump at eigenangles, it is useful to note that
sup_{θ ∈ 𝕊¹}|D_N, J(θ)| = max_{1 ≤ k ≤ N} sup_{θ ∈ [θ_k, θ_k + 1)}|D_N, J(θ)|,
so one may equivalently take the supremum over any deterministic set of representatives, for instance over midpoints of the eigenangle gaps, without changing the value. This observation will be convenient when we exclude tiny neighborhoods of eigenangles in later uniform estimates.

We record simple bounds which hold for every configuration. Since 0 < θ̃_+j(θ) − θ̃_−j(θ) < 2π for j ≤ N/2, we have

Consequently,

uniformly in θ. These bounds are far from sharp for Haar-typical configurations, but they show that the functional is well-defined without any further truncation and that any probabilistic control in sup norm will necessarily use cancellations beyond the trivial estimate .

A further deterministic point is the invariance under rotation: if we shift all eigenangles by a common amount α (modulo 2π), then δ_j(θ) and D_N, J(θ) shift as functions of θ by the same amount. This is immediate from the cyclic indexing definition and will be consistent with the stationarity of the limiting Gaussian field.

The harmonic weights 1/j in are dictated by the logarithmic structure of the target field. One expects (and later proves, after converting discrepancies to a smoothed counting statistic) that the sum over j behaves as a Fourier multiplier comparable to 1/|m| at frequencies |m| ≲ N/J. In this sense, restricting to j ≥ J is analogous to discarding high Fourier modes, exactly as W_N, J discards trace modes above M_N = ⌊N/J⌋.

From the viewpoint of well-posedness of a limiting object, the condition J → ∞ is also natural. Even if δ_j(θ) stayed of order one, the series ∑_j ≥ 1δ_j(θ)/j would have no reason to converge as N → ∞ because of the logarithmic divergence of ∑1/j. Taking J → ∞ removes the microscopic scales and ensures that the remaining sum corresponds to a mesoscopic smoothing, compatible with convergence in negative Sobolev topology.

For the reconstruction statement in part~(ii), we need a more specific growth rate. The key stability issue is that δ_j(θ) is defined in terms of inverse counting maps: the endpoints θ̃_±j(θ) are quantiles of the point configuration around θ. Inverting the counting function is Lipschitz only on scales larger than the maximal counting error. For Haar(CUE), it is known (and we will use later through standard rigidity/number variance bounds) that the centered counting function satisfies a logarithmic sup bound with high probability,
∥S_U∥_{L^∞(𝕊¹)} ≪ log N.
Heuristically, the local quantile map becomes stable once the number of points in the window dominates these counting fluctuations, i.e. once j ≫ ∥S_U∥_∞. This suggests precisely a logarithmic choice J(N) ≍ log N as the threshold beyond which neighbor discrepancies can be related, uniformly in θ, to smoothed versions of S_U with negligible error.

We therefore adopt in part~(ii) the regime
J(N) = ⌊clog N⌋,
which is minimal up to constants for the ensuing uniform approximation argument. In the weaker distributional comparison (where one tests against smooth functions and works in H^−s rather than sup norm), it is typically sufficient to assume only J → ∞ and J = o(N); nonetheless, even there, the diverging cutoff plays the same conceptual role of separating microscopic irregularities from the mesoscopic density wave.

The remainder of the proof will proceed by rewriting D_N, J as a linear functional of the counting function S_U against an explicit logarithmic-type kernel, up to an error which vanishes under the above choice of J. This reduction is the point where the particular form of becomes effective: symmetric neighbor radii admit a summation-by-parts identity that turns the inverse-quantile information into an integrated counting error, and hence into a convolution amenable to Fourier comparison with W_N, J.

6. Converting discrepancies to a smoothed counting statistic: derive an explicit convolution representation of D_N, J in terms of S_U and a logarithmic kernel; error terms.

Our next task is to replace the inverse–quantile nature of δ_j(θ) by a direct linear statistic of the counting function. The guiding principle is that the discrepancy compares the of a symmetric window (the arc between the two neighbors) to the of the eigenangle configuration in that same window; the difference is precisely the signed mass of the centered counting measure dS_U on the window. After summing in j with harmonic weights, one obtains a logarithmic kernel.

We regard S_U as a 2π-periodic c`adl`ag function and write its associated signed measure on ℝ as
$$ dS_U(t)=\sum_{k\in\mathbb{Z}}\delta_{\widetilde{\theta}_k}(dt)-\frac{N}{2\pi}\,dt, $$
so that for any half-open interval (a, b] ⊂ ℝ we have
$$ \int_{(a,b]} dS_U =\#\{k:\ a<\widetilde{\theta}_k\leq b\}-\frac{N}{2\pi}(b-a). $$
For fixed θ and j ≤ ⌊N/4⌋, set
I_j(θ) := (θ̃_−j(θ), θ̃_+j(θ)] ⊂ ℝ.
By construction, I_j(θ) contains exactly 2j lifted eigenangles (counting the right endpoint and excluding the left endpoint), i.e.
#{k: θ̃_−j(θ) < θ̃_k ≤ θ̃_+j(θ)} = 2j.
Consequently,

and hence

This identity is deterministic and uses only the half-open convention in the definition of the Stieltjes integral; it is precisely at this point that the right-continuity convention for the neighbor maps is convenient, since it provides an unambiguous choice of endpoints.

Substituting into and exchanging the sum with the integral (which is legitimate since the sum is finite) yields the exact formula

Thus D_N, J is a linear statistic of dS_U with a random weight that depends on θ through the random windows I_j(θ).

The right-hand side of becomes a convolution once we replace the random indicator 1_{I_j(θ)}(t) by a deterministic translation-invariant approximation. The natural deterministic window of ``radius j’’ around θ is the symmetric interval
$$ I_j^{(0)}(\theta):=\big(\theta-\tfrac{2\pi j}{N},\,\theta+\tfrac{2\pi j}{N}\big]\subset\mathbb{R}, $$
whose length is exactly 4πj/N. Define the corresponding deterministic weight

extended 2π-periodically to 𝕊¹. Then
$$ \sum_{j=J}^{\lfloor N/4\rfloor}\frac{\mathbf{1}_{I_j^{(0)}(\theta)}(t)}{j} =L_J(t-\theta), $$
and therefore

where convolution is taken on 𝕊¹.

The difference between and is supported near the boundaries of the random windows, and can be bounded in terms of how far the random endpoints θ̃_±j(θ) deviate from the deterministic endpoints θ ± 2πj/N. A convenient way to encode this is through an
$$ \Delta_j(\theta):=\max\Big\{\big|\widetilde{\theta}_{+j}(\theta)-(\theta+\tfrac{2\pi j}{N})\big|,\ \big|\widetilde{\theta}_{-j}(\theta)-(\theta-\tfrac{2\pi j}{N})\big|\Big\}. $$
On the event that Δ_j(θ) ≤ η/N for all θ and all j ≥ J, the symmetric difference of windows satisfies
$$ I_j(\theta)\,\triangle\, I_j^{(0)}(\theta)\subset \big(\theta-\tfrac{2\pi j}{N}-\tfrac{\eta}{N},\,\theta-\tfrac{2\pi j}{N}+\tfrac{\eta}{N}\big] \ \cup\ \big(\theta+\tfrac{2\pi j}{N}-\tfrac{\eta}{N},\,\theta+\tfrac{2\pi j}{N}+\tfrac{\eta}{N}\big], $$
and hence, by ,

The right-hand side involves only the total variation of dS_U on a union of boundary layers of thickness O(η/N). In particular, if we have a rigidity-type control on the number of eigenangles in arcs of length O(η/N), then the contribution of these layers is o(1) after summing the harmonic weights provided J is at least logarithmic. We will implement this probabilistically in the uniform comparison step, but for the present reduction it is enough to isolate the corresponding remainder term.

The weight L_J defined in is explicitly computable in Fourier series. For m ≠ 0, let
$$ \widehat{L}_J(m):=\frac{1}{2\pi}\int_{-\pi}^{\pi}L_J(u)e^{-imu}\,du. $$
Since the summands in are even, L̂_J(m) is real and L̂_J(−m) = L̂_J(m). A direct calculation gives
$$ \widehat{L}_J(m) =\frac{1}{2\pi}\sum_{j=J}^{\lfloor N/4\rfloor}\frac{1}{j}\int_{-2\pi j/N}^{2\pi j/N} e^{-imu}\,du =\sum_{j=J}^{\lfloor N/4\rfloor}\frac{1}{j}\cdot \frac{\sin(2\pi m j/N)}{\pi m}. $$
The sum in j is a truncated harmonic exponential sum, and it is at this point that the mesoscopic scale M_N = ⌊N/J⌋ appears: for |m| ≪ N/J, the phase increment 2πm/N is small compared to the lower limit 1/J in the harmonic sum, so one expects L̂_J(m) to behave like 1/|m| up to an error of order 1/J; for |m| ≫ N/J, oscillation forces additional decay.

Rather than keeping L_J itself, it is conceptually cleaner to replace it by a kernel with Fourier multiplier 1_{1 ≤ |m| ≤ M_N}/|m|, which is the multiplier relevant for W_N, J. Accordingly, define the 2π-periodic kernel

so that its Fourier coefficients satisfy
$$ \widehat{K}_J(m)=\frac{\mathbf{1}_{1\leq |m|\leq M_N}}{2|m|},\qquad m\in\mathbb{Z}\setminus\{0\}, $$
with K̂_J(0) = 0 (the constant term is irrelevant for convolution against dS_U, whose total mass is 0). In particular, K_J is a truncated logarithmic kernel: as M_N → ∞ one has the classical identity
$$ \sum_{m=1}^{M}\frac{\cos(mu)}{m} =-\log\big|2\sin(u/2)\big|+O(1) \quad\text{for }u\not\equiv 0\ (\mathrm{mod}\ 2\pi), $$
with an error uniform on compact subsets of 𝕊¹ \ {0}.

We now record the reduction in a form tailored to later comparison with W_N, J.

Lemma~ isolates two independent mechanisms behind the approximation . The first term in is a : it reflects that the raw harmonic windowing in j produces a kernel L_J whose Fourier multiplier is only approximately 1_{|m| ≤ M_N}/|m|. This contribution is purely analytic and is already small once J → ∞, provided we control ∥S_U∥_L^∞.

The second term in is the genuinely ``zeros-only’’ difficulty: it measures the cost of replacing the random inverse-counting windows I_j(θ) by deterministic windows centered at θ. This term is small precisely when the inverse counting map is stable, i.e. when the endpoint fluctuations θ̃_±j(θ) − (θ ± 2πj/N) are negligible compared to the scale 2πj/N. For CUE, the typical size of the counting error is logarithmic in N, so stability is ensured uniformly once j ≥ J with J a sufficiently large constant multiple of log N. In the next section we will combine this stability with standard rigidity/number-variance bounds to show that the second term in vanishes in probability uniformly in θ (with a harmless exclusion of tiny neighborhoods of eigenangles if one wishes to avoid pointwise issues at jumps). This will complete the conversion of D_N, J into the convolution statistic K_J * dS_U, at which point Fourier inversion will identify it with the trace field W_N, J.

7. Identifying the convolution with the trace field: Fourier analysis linking K_J * S_U to W_N, J; proof of (ii) including uniform-in-θ control (with a small exclusion near eigenangles if needed).

We now verify that the convolution statistic appearing in Lemma~ is the trace field W_N, J. This is a deterministic Fourier identity which uses only the definition of the Stieltjes measure dS_U and the fact that the eigenangles encode the traces of powers of U.

Recall that, as a signed 2π-periodic measure,
$$ dS_U(t)=\sum_{k=1}^N\delta_{\theta_k}(dt)-\frac{N}{2\pi}\,dt, $$
and in particular, for each integer m ≠ 0,

(We used that the Lebesgue term has vanishing mth Fourier coefficient for m ≠ 0.)

Combining Lemma~ and Lemma~, we obtain the exact decomposition

with r_N, J satisfying the structural bound . Thus the proof of the uniform reconstruction statement in (ii) reduces to showing that sup_θ|r_N, J(θ)| → 0 in probability for J = ⌊clog N⌋.

Fix c > 0 and set J = ⌊clog N⌋ and M_N = ⌊N/J⌋. We work with the decomposition
r_N, J(θ) = r_N, J^ker(θ) + r_N, J^inv(θ),
where
$$ r^{\mathrm{ker}}_{N,J}(\theta):=\big((K_J-L_J)*dS_U\big)(\theta), \qquad r^{\mathrm{inv}}_{N,J}(\theta):= -\int_{\mathbb{R}} \Bigg(\sum_{j=J}^{\lfloor N/4\rfloor}\frac{\mathbf{1}_{I_j(\theta)}(t)-\mathbf{1}_{I_j^{(0)}(\theta)}(t)}{j}\Bigg)\,dS_U(t), $$
so that is simply the crude bound |r_N, J^inv(θ)| ≤ (⋯) obtained by total variation. For the argument, we will not rely on this total-variation estimate; instead we treat r_N, J^inv(θ) as a linear statistic of the determinantal process with a random test function and control it through standard rigidity/number-variance input.

The field r_N, J^ker is a trigonometric polynomial with frequencies ≪ M_N, and we may express it directly in Fourier modes. Writing the Fourier series of R_J := K_J − L_J as R_J(u) = ∑_m ≠ 0R̂_J(m)e^imu, we have

For m = o(N) we have $\mathbb{E}|\Tr(U^m)|^2=m$ (indeed min (m, N) = m), and the Fourier analysis preceding Lemma~ yields the pointwise estimate

with additional decay for |m| ≫ M_N (coming from the oscillation in L̂_J(m)). Restricting to |m| ≤ M_N (the only range relevant for the dominant part of the field) and using orthogonality, we obtain the L² bound

Since J ∼ clog N, this gives log M_N/J² → 0, hence r_N, J^ker(θ) → 0 in L², uniformly in θ.

To upgrade this to a supremum over θ, we use the fact that r_N, J^ker is a trigonometric polynomial of degree O(M_N); hence it is Lipschitz with
$$ \| (r^{\mathrm{ker}}_{N,J})'\|_{L^\infty} \ll M_N \sum_{m=1}^{M_N} |\widehat{R}_J(m)|\,|\Tr(U^m)|. $$
By Cauchy–Schwarz and , the right-hand side has bounded second moment of order o(1) ⋅ M_N; consequently, a standard discretization argument (evaluate on a grid of mesh M_N⁻² and use the Lipschitz bound to pass from the grid to the continuum) yields

We omit routine details, since any polynomial mesh finer than M_N⁻¹ suffices once one has an o(1)-control of the L² size of the coefficients in together with Bernstein inequalities for trigonometric polynomials.

The inverse-neighbor maps θ ↦ θ_±j(θ) are well-defined Lebesgue-a.e. and extended by right-continuity. To avoid any pointwise ambiguity at discontinuities of S_U, it is convenient to first prove uniform control on the set
$$ \Omega_\varepsilon:=\Big\{\theta\in\mathbb{S}^1:\ \dist(\theta,\{\theta_1,\dots,\theta_N\})\geq \varepsilon/N\Big\}, $$
and then pass to all θ using right-continuity and the fact that 𝕊¹ \ Ω_ε has Lebesgue measure O(ε) and consists of N tiny arcs. In the present setting one may take, for example, ε = N⁻⁹.

On Ω_ε, the counting function is locally constant and the neighbor endpoints are stable under small perturbations of θ. The key quantitative input is a rigidity/number-variance bound for CUE: for any arc A ⊂ 𝕊¹ of length |A|, the centered count ∫_AdS_U is subgaussian with variance ≍ log (N|A|+2), uniformly in the location of A. In particular, for j ≥ J the arc length 4πj/N is at least of order (log N)/N, so the typical counting error on such arcs is $O(\sqrt{\log j})$, and the relative error $O(\sqrt{\log j}/j)$ tends to 0 as soon as j → ∞.

Using this (together with a union bound on a grid in θ and j and a monotonicity argument to pass from the grid to the continuum), one proves that for every fixed α > 0 there exists C_α < ∞ such that, with probability at least 1 − N^−α,

and in particular Δ_j(θ) = o(j/N) uniformly for j ≥ J and θ ∈ Ω_ε.

Fix θ ∈ Ω_ε and define the random test function
$$ g_\theta(t):=\sum_{j=J}^{\lfloor N/4\rfloor}\frac{\mathbf{1}_{I_j(\theta)}(t)-\mathbf{1}_{I_j^{(0)}(\theta)}(t)}{j}, $$
so that r_N, J^inv(θ) = −∫g_θ(t) dS_U(t). On the event , each term 1_{I_j(θ)} − 1_{I_j⁽⁰⁾(θ)} is supported on a symmetric difference consisting of two short arcs whose total length is O(Δ_j(θ)), with Δ_j(θ) much smaller than the underlying scale j/N. Moreover, since the harmonic weights are summable, g_θ is uniformly bounded:
$$ \|g_\theta\|_{L^\infty}\leq \sum_{j=J}^{\lfloor N/4\rfloor}\frac{2}{j}\ll \log(N/J)\ll \log N. $$
The relevant feature is that g_θ has on low modes once j ≥ J, because each summand is a small translation error of a fixed-shape window. More precisely, one checks from the explicit Fourier transform of an interval indicator that for each m ≠ 0,
$$ |\widehat{g_\theta}(m)| \ll \sum_{j=J}^{\lfloor N/4\rfloor}\frac{1}{j}\cdot |m|\,\Delta_j(\theta)\cdot \frac{1}{|m|} \ll \sum_{j=J}^{\lfloor N/4\rfloor}\frac{\Delta_j(\theta)}{j}, $$
and therefore, using ,
$$ |\widehat{g_\theta}(m)| \ll \frac{1}{N}\sum_{j=J}^{\infty}\frac{\sqrt{\log j}}{j^{3/2}} \ll \frac{\sqrt{\log J}}{N\sqrt{J}}. $$
Inserting this into the determinantal variance formula for CUE linear statistics (in the form $\Var(\int h\,dS_U)\ll \sum_{m\neq 0}\min(|m|,N)\,|\widehat{h}(m)|^2$ for piecewise H^1/2 functions h) yields
$$ \sup_{\theta\in\Omega_\varepsilon}\Var\big(r^{\mathrm{inv}}_{N,J}(\theta)\big) \ll \sum_{m=1}^{N} m\,|\widehat{g_\theta}(m)|^2 \ll N^2 \cdot \Big(\frac{\log J}{N^2 J}\Big) \ll \frac{\log J}{J} \to 0, $$
since J → ∞. Hence, for each fixed θ ∈ Ω_ε, we have r_N, J^inv(θ) → 0 in L², and in particular in probability.

To obtain a bound over θ ∈ Ω_ε, we again discretize. The map θ ↦ r_N, J^inv(θ) is piecewise constant between eigenangles (because the windows I_j(θ) change only when θ crosses an eigenangle), while the deterministic windows I_j⁽⁰⁾(θ) vary continuously in θ. It follows that θ ↦ r_N, J^inv(θ) has controlled oscillation on Ω_ε on scales ≫ 1/N. Choosing a grid {θ_ℓ} of mesh N⁻² contained in Ω_ε and applying a union bound together with the L² estimate above yields

(The passage from the grid to the continuum uses that on Ω_ε the neighbor maps are locally constant and the deterministic part is Lipschitz, so no large fluctuations can occur between nearby grid points.)

Combining and gives
$$ \sup_{\theta\in\Omega_\varepsilon}|r_{N,J}(\theta)|\xrightarrow{\mathbb{P}}0. $$
Finally, if we take the supremum over all θ ∈ 𝕊¹, we may either (a) adopt the right-continuous conventions for θ ↦ θ_±j(θ) and note that the exceptional set consists only of the eigenangles themselves, or (b) keep the supremum over Ω_ε as stated. In either interpretation, recalling we obtain
$$ \sup_{\theta\in\mathbb{S}^1}|D_{N,J}(\theta)-W_{N,J}(\theta)| =\sup_{\theta\in\mathbb{S}^1}|r_{N,J}(\theta)| \xrightarrow{\mathbb{P}}0, $$
which is precisely the uniform reconstruction statement in (ii).

8. Numerical illustrations and sanity checks: simulations for moderate N showing D_N, J ≈ W_N, J and GFF-like covariance; (flagged as computational).

The results proved above are asymptotic and distributional, and the comparison statement in~(ii) is of a uniform, probabilistic nature. It is therefore useful to record a small collection of numerical experiments that (a) illustrate the fields W_N, J and D_N, J for moderate N, (b) visually corroborate the reconstruction phenomenon D_N, J ≈ W_N, J when J ≍ log N, and (c) check the covariance structure and approximate Gaussianity predicted by the GFF limit in~(i). We emphasize that this section is purely computational and plays no role in the proofs; it is best read as a diagnostic guide for implementation choices and for interpreting the observables.

A standard method to sample U ∼ Haar(U(N)) is to draw a complex Ginibre matrix G with i.i.d. entries g_ab ∼ 𝒩(0, 1) + i𝒩(0, 1), perform a QR decomposition G = QR with Q unitary and R upper-triangular, and then normalize the phases by setting
$$ U := Q \cdot \mathrm{diag}\Big(\frac{R_{11}}{|R_{11}|},\dots,\frac{R_{NN}}{|R_{NN}|}\Big)^{-1}. $$
We then compute the eigenvalues of U (or directly its eigenangles) and sort them as 0 ≤ θ₁ < ⋯ < θ_N < 2π. All plots below are based on these eigenangles; the only additional ingredients are finite sums and simple nearest-neighbor manipulations. For the comparison in~(ii), it is numerically preferable to work directly with eigenangles throughout, so that both W_N, J and D_N, J can be evaluated without explicitly forming high powers U^m.

Although W_N, J is defined in terms of traces $\Tr(U^m)$, for moderate N and mesoscopic cutoff M_N = ⌊N/J⌋ one can compute these traces via the eigenangles:
$$ \Tr(U^m)=\sum_{k=1}^N e^{i m \theta_k},\qquad 1\leq m\leq M_N. $$
Thus, for a given grid θ ∈ {θ_ℓ^grid}_ℓ = 1^L ⊂ 𝕊¹, we may evaluate
$$ W_{N,J}(\theta^{\mathrm{grid}}_\ell) =\sum_{m=1}^{M_N}\frac{1}{m}\Re\!\Big(\Big(\sum_{k=1}^N e^{i m \theta_k}\Big)e^{-i m \theta^{\mathrm{grid}}_\ell}\Big). $$
The naive cost of this evaluation is O(NM_N + LM_N) if one first computes all traces and then evaluates the trigonometric polynomial on the grid. In the regime relevant for~(ii), say J = ⌊clog N⌋, we have M_N ∼ N/(clog N), so for N up to a few thousand this is entirely feasible.

Two implementation details are worth noting.

The observable W_N, J is mean-zero and uses only m ≥ 1, so one may simply omit m = 0. Numerically, it is convenient to store the coefficients
$$ A_m:=\frac{\Tr(U^m)}{m},\qquad 1\leq m\leq M_N, $$
and then evaluate ℜ(∑_{m ≤ M_N}A_me^−imθ).

We view angles modulo 2π and use the principal representatives in [0, 2π). When evaluating e^±imθ, the periodicity is automatic, but when comparing with D_N, J(θ) it is convenient to ensure that both are evaluated on the same grid in [0, 2π) and plotted with wrap-around.

The field D_N, J(θ) depends only on the ordered eigenangles and the neighbor maps θ ↦ θ_±j(θ). A practical way to evaluate D_N, J on a grid is as follows. For each θ, find the index k(θ) such that θ_k(θ) ≤ θ < θ_{k(θ) + 1} (with cyclic convention at 2π). Then the jth neighbor to the right is θ_+(j)(θ) = θ_{k(θ) + j} and to the left is θ_−(j)(θ) = θ_{k(θ) − j + 1} (again cyclic), and we set
$$ \delta_j(\theta)=\frac{N}{2\pi}\big(\theta_{+j}(\theta)-\theta_{-j}(\theta)\big)-2j, \qquad D_{N,J}(\theta)=\sum_{j=J}^{\lfloor N/4\rfloor}\frac{\delta_j(\theta)}{j}. $$
In practice, one must interpret θ_+j(θ) − θ_−j(θ) as the positive arc-length from θ_−j(θ) to θ_+j(θ) in cyclic order; equivalently one may lift the eigenangles to an increasing sequence on ℝ by appending θ_k + N = θ_k + 2π and use this lift to compute differences without ambiguity.

Two further numerical remarks parallel the technical points in the proof.

The maps θ ↦ θ_±j(θ) are piecewise constant with jumps at eigenangles. If one evaluates on a grid that includes eigenangles, one may adopt the right-continuous convention used above, or else simply exclude tiny neighborhoods of eigenangles. Numerically, we recommend either evaluating at midpoints $\frac12(\theta_k+\theta_{k+1})$ (which avoids discontinuities by construction), or removing points within distance ε/N of any θ_k, with ε chosen modestly (in experiments, any polynomially small choice behaves similarly).

The upper limit ⌊N/4⌋ is not canonical; it is chosen to keep the symmetric window [θ_−j(θ), θ_+j(θ)] away from wrapping around the circle in a degenerate way and to remain in a mesoscopic regime where the harmonic weights ∑1/j behave as expected. Numerically, varying the upper cutoff within a fixed fraction of N changes only the very low-frequency component of the resulting field.

For a typical run with N between 10³ and 10⁴ and with J = ⌊clog N⌋ (say c between 1/2 and 2), one may compute both fields on a uniform grid θ_ℓ^grid = 2πℓ/L with L on the order of 10³–10⁴, or else on the eigenangle midpoints. Plotting θ ↦ W_N, J(θ) and θ ↦ D_N, J(θ) on the same axes typically reveals two visually indistinguishable log-correlated profiles, and plotting the difference θ ↦ D_N, J(θ) − W_N, J(θ) yields a comparatively flat curve.

In these experiments, the magnitude of the difference is strongly affected by J. If J is taken too small (for instance, bounded or growing very slowly), the discrepancy field D_N, J includes many short-range contributions that are sensitive to the discontinuities of the inverse counting map and the local fluctuations of gaps; the resulting error is visible at moderate N. By contrast, when J is of order log N, the harmonic sum begins sufficiently far out that the stability mechanism (as in the proof of~(ii)) becomes effective, and the difference is dominated by a small-amplitude, slowly varying remainder. This is consistent with the form of the decomposition
D_N, J(θ) − W_N, J(θ) = r_N, J(θ),
where the principal contributions to r_N, J arise from (i) the mismatch between the ideal Fourier cutoff kernel and the discrepancy-induced kernel, and (ii) the random perturbation of window endpoints caused by inverting the local counting function.

From a computational perspective, one may use the empirical quantity
Δ_N, J^(∞) := max_ℓ|D_N, J(θ_ℓ^grid) − W_N, J(θ_ℓ^grid)|
as a proxy for the supremum norm in~(ii). Repeating over many draws of U yields an empirical distribution of Δ_N, J^(∞) which decreases as N increases when J = ⌊clog N⌋, whereas it stabilizes or decreases much more slowly if J is kept fixed.

To check the GFF covariance structure predicted by~(i), we may estimate the covariance function
C_N, J(ϑ) := 𝔼[W_N, J(ϑ) W_N, J(0)]
by averaging W_N, J(ϑ)W_N, J(0) over many independent Haar samples. Lemma~3.1 predicts that
$$ C_{N,J}(\vartheta)\approx \frac12\sum_{m=1}^{M_N}\frac{\cos(m\vartheta)}{m}, $$
up to an error of order 1/J in the regime M_N = o(N). Numerically, the agreement is typically excellent already for moderate N provided one uses enough samples to reduce Monte Carlo noise. Moreover, as M_N grows, the truncated sum approaches the classical logarithmic kernel
$$ \frac12\sum_{m=1}^{M}\frac{\cos(m\vartheta)}{m} = -\frac12\log\big|2\sin(\vartheta/2)\big| + O\Big(\frac{1}{M\,|\sin(\vartheta/2)|}\Big), $$
valid away from ϑ ≡ 0 (mod 2π). Thus, plotting C_N, J(ϑ) against $-\frac12\log|2\sin(\vartheta/2)|$ (after subtracting an M-dependent constant if desired) gives a direct visual confirmation of log-correlation at mesoscopic cutoffs.

One may similarly test the covariance of the zeros-only field D_N, J; when J = ⌊clog N⌋, its covariance function closely tracks that of W_N, J, providing an additional check of the reconstruction phenomenon at the level of second moments.

A complementary diagnostic, closer to the mechanism behind Proposition~C and Theorem~A, is to examine the distribution of normalized traces
$$ X_m:=\frac{\Tr(U^m)}{\sqrt{m}},\qquad 1\leq m\leq M_N, $$
for M_N = o(N). Empirically, for each fixed m the real and imaginary parts of X_m appear approximately $\mathcal{N}(0,\tfrac12)$, and for distinct m the cross-correlations are close to 0. One may quantify this by computing the empirical covariance matrix of (X_m)_{m ≤ M_N} over repeated draws of U, or by producing Q–Q plots of ℜ(X_m) and ℑ(X_m) against standard normals. While such tests are necessarily limited by sample size and by the fact that M_N is not asymptotically negligible for very small N, they align well with the principle that the low Fourier modes of the CUE counting measure behave essentially as independent Gaussians up to the o(N) threshold.

These checks also illuminate the meaning of the H^−s topology in~(i): the Sobolev norm effectively weights the mth Fourier coefficient by |m|^−s, so that convergence is robust to the omission of high modes and to modest distortions near the cutoff.

Finally, it is helpful to verify that both observables behave correctly on simple deterministic configurations, independent of Haar randomness.

If we take θ_k = 2πk/N, then S_U ≡ 0, hence (K_J * dS_U) ≡ 0 for any kernel K_J with mean zero, and therefore W_N, J ≡ 0. On the other hand, θ_+j(θ) − θ_−j(θ) = 4πj/N for all θ away from the grid points, giving δ_j(θ) = 0 and thus D_N, J ≡ 0. This confirms that both observables measure deviations from uniform density rather than absolute density.

If we artificially place a macroscopic fraction of the points in a shorter arc while leaving the remaining points sparse elsewhere, then the symmetric neighbor window around θ inside the dense region has length smaller than 4πj/N, yielding δ_j(θ) < 0 for a range of j and hence a negative excursion of D_N, J(θ). The trace field W_N, J exhibits the same qualitative sign and profile, consistent with the interpretation of both fields as density waves.

In summary, these numerical illustrations serve three purposes: they provide a concrete view of the random log-correlated profiles generated by W_N, J; they demonstrate that the zeros-only construction D_N, J captures the same mesoscopic field when J ≍ log N; and they offer practical checks (covariance matching and trace Gaussianity) that mirror the theoretical inputs used in the proofs. The next section places these observables in a broader context, relating them to carrier-wave heuristics and to the corresponding constructions for the Riemann zeta function.

9. Discussion and outlook: canonical carrier wave observables; relation to Farmer’s density-wave plots; implications for extremes and for zeta transfer via the recipe/ratios conjecture.

We close by interpreting the observables introduced above as canonical ``carrier wave’’ functionals, by relating the zeros-only reconstruction to Farmer’s density-wave methodology, and by indicating a few consequences and possible extensions, especially those suggested by the random-matrix–to–zeta dictionary.

The definition
$$ W_{N,J}(\theta)=\sum_{m=1}^{M_N}\frac{\Re\big(\Tr(U^m)e^{-im\theta}\big)}{m} $$
is not ad hoc: it is precisely the low-frequency part of the log-characteristic polynomial. Indeed, on the unit circle one has the identity (in a principal value sense)
$$ \log\big|\det(I-e^{-i\theta}U)\big| =-\sum_{m\geq 1}\frac{\Re\big(\Tr(U^m)e^{-im\theta}\big)}{m}, $$
so that W_N, J is the truncation of −log |det (I − e^−iθU)| to Fourier modes 1 ≤ m ≤ M_N. The weight 1/m is therefore forced by harmonic analysis and by the logarithm. From this perspective, Theorem~A is a mesoscopic version of the well-known principle that the log-characteristic polynomial is asymptotically a log-correlated Gaussian field after suitable regularization: we are not approximating the full field at microscopic resolution, but rather its smoothed/low-frequency content, and in that regime a clean convergence in H^−s is available.

The cutoff M_N = ⌊N/J⌋ is also natural. The trace statistics $\Tr(U^m)$ are asymptotically Gaussian only up to m = o(N), and the determinantal/Szeg inputs used in Proposition~C become stable precisely when the Fourier support is o(N). Thus J → ∞ is not an auxiliary technical condition: it encodes the requirement that we stay below the ``Nyquist’’ threshold where the finite-size constraints of U(N) begin to correlate the modes.

Theorem~B provides a zeros-only proxy D_N, J which, for J ≍ log N, uniformly reconstructs the same mesoscopic field as W_N, J. Conceptually, the existence of such a proxy rests on two complementary facts.

The map θ ↦ θ_±j(θ) is an inverse to the local counting function, and symmetric windows [θ_−j(θ), θ_+j(θ)] encode integrated counting errors. Summation-by-parts identities then convert harmonic weights ∑δ_j(θ)/j into convolutions of S_U against an explicit kernel. This mechanism is deterministic and is the analytic heart of Lemma~6.1.

Although inversion is discontinuous at eigenangles, it becomes stable when we average over scales j ≥ J with J → ∞. One may view J as a regularization parameter: small j are sensitive to individual gaps, while large j average over enough points to suppress the local instabilities of the inverse counting map. The requirement J ≍ log N in the uniform statement is then consistent with the familiar heuristic that log N is the scale at which union bounds and rigidity estimates begin to control suprema over θ.

In this light, the field D_N, J is not merely an alternative estimator of W_N, J; it is an intrinsic object attached to the point process itself. It uses only nearest-neighbor geometry, and it does so in the uniquely natural way dictated by the harmonic weights that correspond to a logarithm.

Farmer’s density-wave plots for zeta are based on the idea that fluctuations in the zeros (encoded through neighbor discrepancies) carry a coherent long-range signal, which he interprets as a ``wave’’ modulating the local density. The observable
$$ \sum_{j\geq J}\frac{\delta(\gamma_{-j},\gamma_{j})}{j} $$
in that context is the direct analogue of our D_N, J, and the qualitative features observed numerically (large-scale oscillations, apparent log-correlation, and stability once J is taken moderately large) match the behavior of D_N, J in the CUE setting.

Theorem~B may therefore be read as a random-matrix justification of the following interpretation: the density wave is the low-frequency component of the log-characteristic polynomial (or, on the zeta side, the low-frequency component of log |ζ| after regularization), and the neighbor-discrepancy functional is a zeros-only method to recover that component. In the CUE model this is an identity up to a small remainder, and the size of the remainder is controlled uniformly when J ≍ log N.

We emphasize a subtle but important point. The zeros-only reconstruction does not claim that information in the characteristic polynomial is encoded in the local gaps. Rather, it identifies a particular, canonical projection of the field—the Fourier band 1 ≤ m ≤ N/J with the logarithmic weights—as being recoverable from neighbor statistics. This is exactly the ``carrier’’ part in carrier-wave heuristics: the component that varies on macroscopic/mesoscopic scales and can be viewed as a random environment in which microscopic fluctuations live.

Once one accepts the GFF limit in Theorem~A, it is natural to ask what this implies for extremes of the underlying fields. For log-correlated fields, maxima typically exhibit a log –log log structure (with model-dependent constants), and the characteristic polynomial of CUE is conjectured to fall into this universality class. Our results do not directly address the maximum of the full log |det (I − e^−iθU)|, but they do yield two consequences at the mesoscopic level.

Because D_N, J and W_N, J are uniformly close for J ≍ log N, any functional that is continuous with respect to the supremum norm (in particular, the maximum over θ on a fine grid, or smoothed maxima) has the same asymptotics for both fields. Thus the extreme-value behavior of W_N, J can, in principle, be studied using only eigenangles, without traces.

The truncation parameter J separates the log-correlated coarse field'' from thefine field’’ coming from modes m > M_N. In many arguments for log-correlated maxima, one decomposes the field into a sum of approximately independent increments across scales. Here J provides an explicit coarse component with a clean GFF limit. A natural next problem is to quantify the dependence between the coarse field W_N, J and the complementary high-frequency tail, and to determine whether D_N, J can be augmented with additional local terms to capture some of that tail.

A plausible working conjecture is that, at least for J in a polylogarithmic range, the maxima of W_N, J already exhibit the correct leading-order growth for the maxima of the fully regularized log-characteristic polynomial, with an error that can be controlled as J → ∞. Establishing such a statement would require nontrivial multiscale control (and is well beyond the scope of the present work), but Theorem~B indicates that the relevant coarse component can be accessed from zeros alone.

The motivation for a zeros-only carrier wave is that, for $\zeta(\tfrac12+it)$, the prime side'' and thezero side’’ are connected by the explicit formula. Heuristically, one expects a decomposition in which a smoothed log |ζ| behaves like a log-correlated Gaussian field whose Fourier modes correspond to prime sums, while the zeros encode the same object through fluctuations of the counting function of zeros on the critical line. Farmer’s methodology suggests that one can bypass primes and recover the carrier wave directly from zeros; Theorem~B proves the analogous statement in the CUE model.

To make an analogous theorem for zeta precise, one would need replacements for the two main inputs used here.

Proposition~C is a statement about joint Gaussianity of $\Tr(U^m)$ for m = o(N). On the zeta side, one would seek joint Gaussianity (or at least asymptotic quadratic log-moments) for smoothed prime sums
$$ \sum_{p\leq T^{1/J}} \frac{p^{-it}}{\sqrt{p}}\, w(\log p), $$
in a range corresponding to m ≤ T^1/J, with J → ∞ and J ≍ log log T playing the role of log N. The ``recipe’’ and the ratios conjecture are designed precisely to predict such joint moments for Euler products and logarithmic derivatives, and thus provide a plausible substitute for the Szeg/Toeplitz machinery available in CUE.

Lemma~6.1 is essentially deterministic and only uses that the zeros form a simple point configuration on a circle. For zeta, the zeros lie on the line and the relevant object is the centered zero counting function S(t), together with the inverse map giving the location of the jth zero to the left/right of a point. The same summation-by-parts mechanism should convert neighbor discrepancies into a convolution of dS against a kernel with Fourier transform 1_{|m| ≤ M}/|m|. The analytic difficulty is then to control the remainder uniformly, which in CUE is done using rigidity/number-variance bounds. On the zeta side one would require uniform control on the fluctuations of S(t) on short intervals (at least in probability with respect to t), at a strength comparable to what is provided unconditionally for CUE by determinantal structure.

If these two ingredients can be established (even conditionally), the CUE theorem suggests a precise formulation of Farmer’s principle: at a cutoff J comparable to log log T, the density-wave functional built from neighbor discrepancies of zeros should approximate the low-frequency part of $\log|\zeta(\tfrac12+it)|$, uniformly in t over long ranges after excluding tiny neighborhoods of zeros.

We record a few concrete directions suggested by the present results.

The statements above are qualitative (o(1) in probability). It is natural to ask for quantitative rates in N and the dependence on c in J = ⌊clog N⌋. In particular, one expects a tradeoff: larger J improves the stability of inversion but removes more low frequencies from the reconstruction. Making this tradeoff explicit would clarify which portion of the field is practically recoverable from zeros at a given N.

The determinantal structure of CUE is convenient but not conceptually essential for the discrepancy-to-convolution step. One may ask whether analogous zeros-only reconstructions hold for other circular ensembles, for β-ensembles on the line, or for mesoscopic statistics in general log-gas models where rigidity is available. In such settings one expects the limiting log-correlated field to persist (with β-dependent variance), and the reconstruction should then identify a canonical coarse field determined by the configuration alone.

Theorem~A identifies the H^−s limit of the coarse field. A sharper description of the joint law of (W_N, J, tail), and of how much of the tail is encoded in higher-order local statistics of eigenangles, would bring the discussion closer to full carrier-wave decompositions.

In sum, the picture emerging from (i)–(ii) is that there is a canonical mesoscopic carrier wave associated with the eigenangle configuration, and that it admits two equivalent representations: a trace/Fourier representation W_N, J and a purely geometric neighbor-discrepancy representation D_N, J. The former connects directly to the log-characteristic polynomial and the circle GFF; the latter connects directly to zeros-only methodologies and, in particular, to Farmer’s density-wave plots. The random matrix model thus supplies a setting in which the carrier-wave philosophy can be formulated as a theorem, and it suggests a concrete route by which analogous statements for ζ may be attacked under appropriate probabilistic input on zero statistics and prime sums.

Density Waves for Haar-Unitary Matrices: A Gaussian Free Field Limit and a Zeros-Only Reconstruction

Metadata

Table of Contents

Content

1. Introduction and motivation: carrier waves as mesoscopic density fluctuations; why CUE is the right testbed; statement of results.

2. Background on CUE eigenangles, counting function SU, and Sobolev spaces H−s(𝕊1); definition and properties of the circle GFF.

3. The trace/Fourier density wave WN, J: invariances, covariance computation, and relation to log |ΛU(eiθ)| (regularized).

4. Mesoscopic Gaussianity for traces up to m ≤ N/J: Toeplitz determinant/strong Szegő route (or cumulant method for determinantal point processes) and proof of (i).

5. A zeros-only density-wave functional: defining neighbor discrepancies δj(θ) robustly on the circle; basic bounds and measurability; discussion of window choice J ≍ log N.

6. Converting discrepancies to a smoothed counting statistic: derive an explicit convolution representation of DN, J in terms of SU and a logarithmic kernel; error terms.

7. Identifying the convolution with the trace field: Fourier analysis linking KJ * SU to WN, J; proof of (ii) including uniform-in-θ control (with a small exclusion near eigenangles if needed).

8. Numerical illustrations and sanity checks: simulations for moderate N showing DN, J ≈ WN, J and GFF-like covariance; (flagged as computational).