Distill-Belief: Closed-Loop Inverse Source Localization and Characterization in Physical Fields
About
{Closed-loop inverse source localization and characterization (ISLC) requires a mobile agent to select measurements that localize sources and infer latent field parameters under strict time constraints.} {The core challenge lies in the belief-space objective: valid uncertainty estimation requires expensive Bayesian inference, whereas using fast learned belief model leads to reward hacking, in which the policy exploits approximation errors rather than actually reducing uncertainty.} {We propose \textbf{Distill-Belief}, a teacher--student framework that decouples correctness from efficiency. A Bayes-correct particle-filter teacher maintains the posterior and supplies a dense information-gain signal, while a compact student distills the posterior into belief statistics for control and an uncertainty certificate for stopping. At deployment, only the student is used, yielding constant per-step cost.} {Experiments on seven field modalities and two stress tests show that Distill-Belief consistently reduces sensing cost and improves success, posterior contraction, and estimation accuracy over baselines, while mitigating reward hacking.}
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Source Localization | Single-Source ISLC Temperature field (ID held-out) | SR95 | 8 | |
| Source Localization | Single-Source ISLC Concentration field ID held-out | SR94 | 8 | |
| Source Localization | Single-Source ISLC Magnetic field ID held-out | SR94 | 8 | |
| Source Localization | Single-Source ISLC Electric field ID held-out | SR82 | 8 | |
| Source Localization | Single-Source ISLC Gas field (ID held-out) | SR96 | 8 | |
| Source Localization | Single-Source ISLC Energy field ID held-out | SR63 | 8 | |
| Source Localization | Single-Source ISLC Noise field ID held-out | Source Recovery (SR)94 | 8 | |
| Multi-source localization | Temperature field 2 sources | SR77 | 7 | |
| Multi-source localization | Temperature field 3 sources | SR70 | 7 | |
| Multi-source localization | Temperature field 4 sources | Success Rate (SR)61 | 7 |