Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Flowing with Confidence

About

Generative models can produce nonsensical text, unrealistic images, and unstable materials faster than simulation or human review can absorb; without per-sample confidence, trust erodes. Existing fixes run $k$ ensembles or stochastic trajectories at $k\times$ compute, measuring variability between models, not model confidence. We propose Flow Matching with Confidence (FMwC). FMwC injects input-dependent multiplicative noise at selected layers, propagates its variance through the network in closed form, and integrates it along the ODE trajectory, yielding a per-sample confidence score at standard sampling cost. The score supports multiple uses: filtering improves image quality and thermodynamic stability of crystals; editing rewinds trajectories to the points where the model commits and redirects them; and adaptive stepping concentrates ODE compute where the flow is ambiguous. We find that the confidence score correlates with the magnitude of the divergence of the learned velocity field, which gives us a window to understand the generative process, opening up surgical forms of guidance that target the moments that matter, new sampling algorithms and interpretability of generative models.

Friso de Kruiff, Dario Coscia, Max Welling, Erik Bekkers• 2026

Related benchmarks

TaskDatasetResultRank
Conditional GenerationMNIST
FID4.22
9
Confidence-guided retentionCheckerboard Toy
AUPRC83
9
Confidence-guided retentionMNIST
AUPRC9
9
Confidence-guided retentionCRYSTAL
AUPRC6
7
Generative Modeling2D Checkerboard
Misplacement (%)4.6
4
Crystal GenerationInorganic Crystals
e.a.h.0.21
2
Showing 6 of 6 rows

Other info

Follow for update