Cosmology & General Relativity

Statistical Field Theory
for Weak Gravitational Lensing

Lensing as a path integral over the Sachs optical scalars
11 June 2026 · Old Prison of Aegina, Greece
based on the following work (in prep.):
Statistical Field Theory for Weak Gravitational Lensing
Z. Zhang, P. Bull, C. Clarkson, A. Nicola
SFT-WICK: Automating Wick contractions & Feynman diagrams for stochastic field theories
Z. Zhang
CANOES: Cosmological ANgular Observables and Estimator Suite
Z. Zhang, P. Bull, A. Nicola
background κ-map: Gevolution (ΛCDM N-body)
The observable

A decade of weak lensing is arriving

Surveys are mapping billions of galaxy shapes:

  • Rubin / LSST · ~20 billion galaxies imaged (~2 billion WL shapes)
  • Euclid · 1.5 billion shapes, deep IR
  • Roman · high-resolution space shear

Cosmic shear: the tiny, spatially-coherent stretching of galaxy shapes by all the intervening mass — a spin-2 field on the sky. The Stage-IV precision jump puts the modelling under real pressure.

spin-2 cosmic shear whisker field
Galaxy ellipses tangentially stretched around unseen mass — coherent shear "whiskers".
The standard picture

Lensing is treated as a projection

line-of-sight projection schematic
Sum the matter density along the line of sight, weighted by a geometric kernel.
\[ \kappa(\hat{\mathbf n}) = \int_0^{\chi_s}\! d\chi\; W(\chi)\,\delta\!\big(\chi\hat{\mathbf n},\chi\big) \]

A clean, linear hierarchy follows:

\( \xi_\kappa \leftarrow P_{\rm m}(k) \qquad \zeta_\kappa \leftarrow B_{\rm m}(k_1,k_2,k_3) \)

The Born & Limber approximations: simple, fast, the workhorse behind every Stage-IV forecast.

Motivation of this work

But the projection is an approximation

Its known corrections are each bolted on separately, in their own formalisms:

post-Born / lens–lens coupling
beyond-Limber radial mixing
higher-order corrections of the matter field

What does the projection throw away,
and when does it matter?

Ray-tracing combines them — at the price of expensive runs, sample variance, and no clean answer to which mechanism dominates a residual.

Part 1 · Back to the geometry

A photon bundle focuses and shears

\[ \dot\theta = -\theta^2-\sigma_+^2-\sigma_\times^2+\textcolor{#f5b942}{\Phi_{00}} \qquad \dot\sigma_{+,\times} = -2\,\theta\,\sigma_{+,\times}+\textcolor{#4cc9f0}{\mathcal{W}_{1,2}} \]
Sachs optical scalars and the driving-field dictionary
\(\Phi_{00}=-4\pi G\,T_{\mu\nu}k^\mu k^\nu\) — Ricci, matter density.  ·  \(\Psi_0=-C_{\alpha\beta\gamma\delta}k^\alpha Z^\beta k^\gamma Z^\delta\) — Weyl, tides. Standard lensing merges these into one \(\kappa\); we keep them separate.
Part 1 · Reframing

The spacetime is a random field

one realization: delta to Phi to Phi00 to |Psi0|
One realization of the field chain the bundle integrates through: \(\delta \to \Phi \to \Phi_{00},\,|\Psi_0|\).

Structure formation makes the curvature a draw from an ensemble: the Sachs sources \(\Phi_{00},\Psi_0\) become random fields, so the optical scalars are not a fixed integral but a stochastic dynamical system.

Part 1 · Reframing

Null-geodesic dynamics as a stochastic field theory

A distribution over the driving fields induces a functional probability over trajectories. The Sachs system is purely nonlinear, with no linear term to expand around, so first expand around the mean field: the fluctuation \(X = \vec X - X^{\rm(sa)}\) about the background saddle obeys

\[ \dot X_a = A_{ab}\,X_b + F_{abc}\,X_b\,X_c + \Delta S_a \]

A genuine linear drift appears — on FLRW just the background focusing, \(A_{ab}=-2\,\delta_{ab}\,\theta^{\rm(sa)}\), \(\theta^{\rm(sa)}=\dot{\bar D}/\bar D\). For each driving realization the trajectory is fixed, so a functional \(\delta\) pins it:

\[ P[X\,|\,\Delta S]=\prod_a \delta\!\big(\dot X_a - A_{ab}X_b - F_{abc}X_bX_c - \Delta S_a\big) \]

Martin–Siggia–Rose writes the probability of \(X\) as a path integral over \(X\) and a response field \(\tilde X\):

\[ P[X]\;\propto\!\int\!\mathcal{D}\tilde X\; e^{-S} \]
\[ S=\underbrace{\,i\!\int\!\tilde X_a(\dot X_a-A_{ab}X_b)}_{\text{linear}}-\,i\!\int\! F_{abc}\tilde X_a X_b X_c-W[i\tilde X] \]

Marginalising over the driving-field statistics \(P[\Delta S]\) collapses the source integral into its cumulant generating functional \(W\):

\[ W[i\tilde X]=\sum_{n\ge2}\frac{i^n}{n!}\int \kappa^{(n)}_{a_1\cdots a_n}\,\tilde X_{a_1}\!\cdots\tilde X_{a_n} \]

\(\kappa^{(n)}=\langle\Delta S_{a_1}\!\cdots\Delta S_{a_n}\rangle_c\): the connected driving-field cumulants — the only cosmological input. (\(F\): nonlinear vertex.)

SFT‑WICK — Automating Wick contractions & Feynman diagrams for stochastic field theories · Z. Zhang (in prep)
Part 1 · Reframing

Lensing observables are moments of that path integral

rotating convergence map (slow)
The lensing field whose moments we want (gevolution, ΛCDM).

Convergence \(\kappa\) and shear \(\gamma\) are line-of-sight integrals of the fluctuation Sachs scalars \(X=(\delta\theta,\delta\sigma_+,\delta\sigma_\times)\):

\[ \kappa=-\!\int_0^{\lambda}\!\delta\theta\,d\lambda',\quad \gamma_+\!\pm i\gamma_\times=-\!\int_0^{\lambda}\!(\delta\sigma_+\!\pm i\delta\sigma_\times)\,d\lambda' \]

So every statistic is a moment of \(X\), taken directly against the path integral:

\[ \langle X_{a_1}\!\cdots X_{a_n}\rangle=\!\int\!\mathcal{D}X\,\mathcal{D}\tilde X\;X_{a_1}\!\cdots X_{a_n}\,e^{-S} \]

No projection step is assumed: the moment is of the full stochastic solution.

Part 2 · The engine

Split the action, then Wick-contract

Split the action into a Gaussian reference plus interactions:

\[ \begin{aligned} S &= S_{\rm ref}+S_{\rm int},\\[2pt] S_{\rm int} &= \underbrace{-\,i\!\int\!F_{abc}\tilde X_a X_bX_c}_{F\ \text{vertex}}-\underbrace{\textstyle\sum_{k\ge3}W^{(k)}[i\tilde X]}_{K\ \text{vertices}} \end{aligned} \]
\[ S_{\rm ref}=i\!\int\!\tilde X_a(\dot X_a-A_{ab}X_b)-W^{(2)}[i\tilde X] \]

Expand each moment about the reference; Wick's theorem factorises it into the two free propagators, \(C\) (solid) and \(R\) (dashed, causal):

\[ \langle O\rangle_S=\sum_{n}\frac{(-1)^n}{n!}\,\big\langle O\,S_{\rm int}^{\,n}\big\rangle_{\rm ref} \]

The reference theory has exactly two free 2-point functions (and \(\langle\tilde X\tilde X\rangle_{\rm ref}=0\)):

\[ \langle X_a X_b\rangle_{\rm ref}=\textcolor{#4cc9f0}{C_{ab}},\qquad \langle \tilde X_a X_b\rangle_{\rm ref}=-i\,\textcolor{#f5b942}{R_{ab}} \]
\[ \textcolor{#f5b942}{R_{ab}}=\delta_{ab}\,\delta(\hat n'{-}\hat n)\,\Theta(\lambda'{-}\lambda)\,e^{-2\int_\lambda^{\lambda'}\theta^{\rm(sa)}d\tau} \]
\[ \textcolor{#4cc9f0}{C}=\textcolor{#f5b942}{R}\,\kappa^{(2)}\,\textcolor{#f5b942}{R}\quad(\kappa^{(2)}\text{: driving 2-cumulant}) \]
2pt order-2 diagrams O0/FF/FK in sft-wick convention
The contractions are the Feynman diagrams (sft-wick convention).
Part 2 · Does it work?

On a toy model, the diagram sum converges to the truth

A 2-field stochastic system with a known answer:

\[ \dot\varphi_1=-\varphi_1+\varphi_2^2+\eta_1,\qquad \dot\varphi_2=-\varphi_2+\varphi_1\varphi_2+\eta_2 \]
driving/solution waterfall
Driving noise (bottom) imprints on the solution (top).
diagram sum converging to Monte-Carlo truth
Markers: direct Monte-Carlo (\(10^5\) realizations). Curve: the diagram sum, order 0→2→4, climbing onto the truth.
Part 2 · Application to cosmological weak lensing

From the response propagator to the lensing efficiency kernel

response propagator and lensing efficiency kernel vs comoving distance

Correlation propagator \(C\): the response folded twice around the driving 2-cumulant \(\kappa^{(2)}\):

\[ C_{ij}=\!\int_0^{\lambda_1}\!\!\!d\lambda'\!\int_0^{\lambda_2}\!\!\!d\lambda''\,\Big(\tfrac{\bar D(\lambda')}{\bar D(\lambda_1)}\Big)^{\!2}\kappa^{(2)}_{ij}\,\Big(\tfrac{\bar D(\lambda'')}{\bar D(\lambda_2)}\Big)^{\!2} \]

Projected onto convergence, the two responses fold into kernels — the textbook projection drops out:

\[ \xi_\kappa=\!\int_0^{\lambda_s}\!\!\!d\lambda'\!\int_0^{\lambda_s'}\!\!\!d\lambda''\,\textcolor{#f5b942}{K(\lambda',\lambda_s)}\,\textcolor{#f5b942}{K(\lambda'',\lambda_s')}\,\kappa^{(2)}_{11}(\gamma;\lambda',\lambda'') \]

Exact — no Limber, no projection assumed: \(K\) is the order-0 response.

On FLRW the response is causal, direction-local, a ratio of angular-diameter distances \(\bar D=a\chi\):

\[ R_{ij}=\delta_{ij}\,\delta(\hat{\mathbf n}-\hat{\mathbf n}')\Big(\tfrac{\bar D(\lambda)}{\bar D(\lambda')}\Big)^{2},\ \ \lambda<\lambda' \]

Its window has two equivalent forms — an affine-parameter integral and a closed comoving expression, the textbook lensing efficiency kernel:

\[ K=\bar D^2(\lambda)\!\!\int_\lambda^{\lambda_s}\!\!\tfrac{d\lambda_1}{\bar D^2(\lambda_1)}=a^2\,\tfrac{\chi(\chi_s-\chi)}{\chi_s} \]

The conventional weak-lensing window is the order-0 response — not an input, a consequence.

Part 2 · Application to cosmological weak lensing

Driving-field statistics, from one input \(P_\delta(k)\)

2-cumulant (Gaussian power). The Ricci (\(\Phi_{00}\)) and Weyl (\(\Psi_0\)) driving fields are screen-space derivatives of the light-cone potentials \(\Phi,\Psi\):

\[ \Phi_{00}=-\tfrac{E_0^2(1{+}z)^2}{a^2}\Big[\tfrac12\nabla_\perp^2(\Phi{+}\Psi)+\mathcal D^2\Phi+\cdots\Big] \]
\[ \Psi_0=-\tfrac{E_0^2(1{+}z)^2}{a^2}\,\hat s^i\hat s^j\,\partial_i\partial_j(\Phi{+}\Psi) \]

Per multipole each derivative is a single angular multiplier — the spin weight is explicit:

\[ \Phi_{00}:\ \frac{L^2}{\chi^2},\qquad \Psi_0:\ \frac{\sqrt{L^2(L^2{-}2)}}{\chi^2},\qquad L^2\equiv\ell(\ell{+}1) \]

Spin-0 trace (focusing) vs spin-2 \(\eth^2\) shear (\(\to 0\) for \(\ell<2\)); all from \(P_\delta(k)\).

3-cumulant \(\zeta\) (non-Gaussian). Tree-level matter bispectrum (\(\propto D^4\)),

\[ B_\delta=2\,F_2^{(s)}(\mathbf k_1,\mathbf k_2)\,P_\delta P_\delta+\text{cyc.} \]

projected to the modulus spin channels that survive the contraction,

\[ \zeta_B=\langle\Phi_{00}|\Psi_0|^2\rangle,\quad \zeta_D=\langle\Psi_0|\Psi_0|^2\rangle \]

Equal-shell (squeezed) limit, \(<\)0.5% error. Built by canoes.

Part 3 · The lensing 2PCF

Example: The convergence 2PCF, order by order

O0 / FF / FK diagrams for the 2PCF

No single vertex feeds a connected 2-point function, so the leading correction is order 2, in three groups — a preview of the insights:

O0
A unified view — the conventional kernel, \(\xi_\kappa=\int\!\!\int C_{11}\,d\lambda_1 d\lambda_2\)
FF
Nonlinear propagation
FK
Statistical Hierarchy Mixing
Part 3 · Insight A

A unified view: the textbook kernel is Order-0

Order-0 vs PyCCL, few-percent agreement

Order-0 (no vertices) reduces analytically to the standard non-Limber projection — not an analogy, the same integral.

Matches an independent PyCCL computation at the few-percent level (\(0.5'\!-\!2000'\), all four 2-point functions).

\(z_s=5\); \(|\xi|\) on log axes, filled \(=\xi>0\)/open \(=\xi<0\). The ratio departs only at the sign-flip zero-crossing.

Part 3 · Insight B

Mixing Hierarchy & A Selection Rule

cumulant-observable mixing hierarchy and selection rule

Mixing Hierarchy

A driving-field \(n\)-cumulant can feed different-order observables — but at a definite cost: each step its index runs ahead of the observable's adds one \(F\)-vertex, i.e. one higher order.

A Selection Rule

At a fixed order an \(n\)-point observable couples to exactly one cumulant: leading is \(\kappa^{(n)}\); the first correction reaches \(\kappa^{(n+1)}\) through one \(F\)-vertex. Higher cumulants are forbidden until higher order.

For \(\xi\) this singles out the bispectrum \(\kappa^{(3)}\) — structure from perturbative expansion, not a truncation by hand.

Part 3 · Insight C

A squeezed 3-cumulant leaks into the 2PCF at large scales

O0, FF, FK and full (O0+FF+FK) at z_s=5
Order-0, FF, FK and the full sum O0+FF+FK (\(z_s=5\)).

FK \(>\) FF: the bispectrum term exceeds the nonlinear-propagation (FF) term by ~2× at arcmin scales (several× by tens of arcmin).

On a \(z_s=5\) plane the full sum departs from Order-0, FK overtaking beyond ~120' (~2°) (for our tree-level bispectrum model).

which 2PCF FK feeds

Separable: FK lives only in \(\xi_\kappa,\xi_+\).

C_ell kappa-kappa: full vs Order-0 vs PyCCL
Harmonic-space image: the FK leak is excess power at low \(\ell\) in \(C_\ell^{\kappa\kappa}\).
Part 3 · Insight C

The leakage grows with source redshift

Order-0 vs full (O0+FF+FK), source planes z=0.5 to 5

Full (O0+FF+FK) vs Order-0, swept over source planes \(z_s=0.5\to5\).

The turning point — where FK overtakes Order-0 — marches to smaller scales as \(z_s\) grows: from \(\sim\)280' at \(z_s=0.5\) to \(\sim\)120' (\(\sim\)2°) at \(z_s=5\).

Most consequential for deep, high-redshift samples (Euclid/Roman tails, CMB lensing).

Part 3 · Insight C · validation

Checked against direct simulation

full MC xi vs SFT O0+FF+FK

A brute-force Monte-Carlo of the full non-Gaussian Sachs system (the \(K\) vertex is active) reproduces the SFT full prediction O0+FF+FK.

\(|\xi|\) on log axes (filled \(=\xi>0\), open \(=\xi<0\)): the direct simulation tracks the diagram sum, sign change and all. The expansion is not just internally consistent — it matches the numbers you get by brute force.

Summary

Not a substitute for conventional methods.

But a necessary reorganization & paradigm improvement.

Bad news Large-scale two-point lensing is contaminated by non-Gaussian structure — small-scale power leaks to large angles along the squeezed \(\kappa^{(3)}\).

Good news You can probe cosmic non-Gaussianity with the two-point function itself.

The same machinery describes any null congruence in a random spacetime — e.g. lensing of the stochastic GW background.

Take-aways

Three things to remember

  1. The textbook lensing kernel is the zeroth-order diagram of one systematic expansion.
  2. A selection rule ties the observable hierarchy to the driving-field cumulant hierarchy.
  3. Small-scale non-Gaussianity (FK) contaminates large-angle 2-pt lensing — but is separable by spin, and grows with \(z_s\).

Thank you — questions welcome.   zheng.zhang@manchester.ac.uk

with the support of The University of Manchester European Research Council (ERC)
Backup

Backup slides

Non-Limber propagators · 3-pt diagrams · squeezed bispectrum · input 3-cumulant validation · coordinate maps

(press ↓ to browse)

Non-Limber propagators

Response propagator \(R\)
Correlation propagator \(C\) slices

Three-point diagrams (Order-1)

order-1 three-point diagrams
One \(F\) or one \(K\) vertex feeds the convergence 3-point function.

Squeezed driving-field bispectrum

Equal-shell 3rd cumulant on the squeezed configuration; small-scale modes set the sign below ~40'.

Input 3-cumulant validation (vs fastnc)

cosmic-shear 3PCF amplitude: SFT zeta_D vs fastnc, within a factor of 2
Cosmic-shear 3PCF amplitude on isosceles triangles (\(\gamma=50'\), opening angle \(\varphi\)): our SFT \(\zeta_D\) (cyan) vs the independent tree-level code fastnc (Sugiyama+2024, amber). Agreement within a factor of 2 across triangle shapes validates the input driving-field 3-cumulant.

Light-cone coordinate maps

\(\chi(\lambda)\) and \(z(\lambda)\) along the past light cone.