Cross-View Variance Field — Po-Ting Lin (林柏廷)

Goal

Test a hunch: that the per-pixel variance fields of a path-traced stereo pair are correlated once aligned by ground-truth disparity — even though the per-sample Monte Carlo noise streams are independent by construction — and that the correlation is strong enough to act as a learnable matching cue.

Progress

The correlation is real. Across 20 Mitsuba 3 scenes the warped Pearson correlation reaches ρ ≈ 0.754, invariant over a 16× sampling range, strongest in Lambertian regions and weaker in glass. A block-shuffle intervention showed the structure functions as a usable cue at the cost-volume level — a sim-to-real gap mechanism unique to rendered data.

That finding is written up and posted as a preprint: Cross-View Variance Correlation in Path-Traced Stereo (preprint v2) (Zenodo, DOI 10.5281/zenodo.19907884).

Where it’s going

The preprint establishes that the shortcut exists. The open question — and the reason this stays in progress — is whether stereo networks actually exploit it in practice, and what that implies for training on rendered data. That’s the thread I’m still pulling.