
Adding Thermal Awareness to Visual Systems in Real-Time via Distilled Diffusion Models

About

Purely RGB-based vision models often fail to provide reliable cues in challenging scenarios such as nighttime and fog, leading to degraded performance and safety risks. Infrared imaging captures heat-emitting sources and provides critical complementary information, but existing high-fidelity fusion methods suffer from prohibitive latency, rendering them impractical for real-time edge deployment. To address this, we propose FusionProxy, a real-time image fusion module designed as a fully independent, plug-and-play component with diffusion-level quality. FusionProxy exploits two complementary statistics of a teacher sample ensemble: per-pixel variance in raw image space, used to weight pixel-level supervision, and per-pixel variance inside frozen foundation backbones, used to route feature-level alignment spatially. Once trained, FusionProxy can be directly integrated into any visual perception system without joint optimization. Extensive experiments demonstrate that our method achieves superior performance on static recognition tasks and significantly enhances robustness in dynamic tasks, including closed-loop autonomous driving. Crucially, FusionProxy achieves real-time inference speeds on diverse platforms, from high-end GPUs to commodity hardware, providing a flexible and generalizable solution for all-day perception.
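The first of the two ensemble statistics, per-pixel variance in raw image space used to weight pixel-level supervision, can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract does not give the weighting formula, so the inverse-variance form, the L1 distance, the ensemble mean as target, and the names `variance_weighted_l1` and `eps` are all assumptions made for illustration.

```python
import numpy as np


def variance_weighted_l1(student, teacher_samples, eps=1e-3):
    """Hypothetical variance-weighted pixel loss (illustrative only).

    student:         array of shape (H, W), the student's fused image.
    teacher_samples: array of shape (K, H, W), K fused images sampled
                     from a teacher for the same input pair.
    eps:             stabiliser that keeps the weights finite when the
                     teacher samples agree exactly.
    """
    student = np.asarray(student, dtype=np.float64)
    teacher_samples = np.asarray(teacher_samples, dtype=np.float64)

    # Ensemble statistics computed independently for every pixel.
    target = teacher_samples.mean(axis=0)
    variance = teacher_samples.var(axis=0)

    # ASSUMPTION: pixels where the teacher samples disagree (high
    # variance) are trusted less, so they receive a smaller weight.
    weights = 1.0 / (variance + eps)
    weights = weights / weights.sum()  # normalise to sum to 1

    loss = float((weights * np.abs(student - target)).sum())
    return loss, weights


if __name__ == "__main__":
    # Two teacher samples of a 1x2 image: they agree on the first
    # pixel and disagree on the second.
    samples = np.array([[[0.5, 0.0]], [[0.5, 1.0]]])
    loss, weights = variance_weighted_l1(np.array([[0.5, 1.0]]), samples)
    print(loss, weights)
```

Under these assumptions, an error at a pixel where the teacher samples agree costs more than the same error at a pixel where they disagree. The second statistic, variance inside frozen backbone features, could be handled the same way on feature maps, but how the paper routes feature-level alignment is not specified in the abstract.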

Yuchen Guo, Junli Gong, Wenjun Dong, Yiuming Cheung, Weifeng Su • 2026

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Semantic segmentation | MSRS | mIoU | 65.4 | 93 |
| Infrared-Visible Image Fusion | MSRS | MUSIQ | 48.22 | 11 |
| Object Detection | MSRS | mAP | 76.5 | 11 |
| Inference Speed Evaluation | Commodity Hardware Inference Efficiency Benchmark | Latency (ms) | 11.91 | 6 |
| Closed-loop Autonomous Driving | CARLA Fog (Town02) | Success Rate | 96.5 | 2 |
| Closed-loop Autonomous Driving | CARLA Town03 Fog | Success Rate | 93.8 | 2 |
| Closed-loop Autonomous Driving | CARLA Town02 Night | Success Rate | 94.2 | 2 |
| Closed-loop Autonomous Driving | CARLA Town03 Night | Success Rate | 91.5 | 2 |