Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Color When It Counts: Grayscale-Guided Online Triggering for Always-On Streaming Video Sensing

About

Always-on sensing is essential for next-generation edge/wearable AI systems, yet continuous high-fidelity RGB video capture remains prohibitively expensive for resource-constrained mobile and edge platforms. We present a new paradigm for efficient streaming video understanding: grayscale-always, color-on-demand. Through preliminary studies, we discover that color is not always necessary. Sparse RGB frames suffice for comparable performance when temporal structure is preserved via continuous grayscale streams. Building on this insight, we propose ColorTrigger, an online training-free trigger that selectively activates color capture based on windowed grayscale affinity analysis. Designed for real-time edge deployment, ColorTrigger uses lightweight quadratic programming to detect chromatic redundancy causally, coupled with credit-budgeted control and dynamic token routing to jointly reduce sensing and inference costs. On streaming video understanding benchmarks, ColorTrigger achieves 91.6% of full-color baseline performance while using only 8.1% RGB frames, demonstrating substantial color redundancy in natural videos and enabling practical always-on video sensing on resource-constrained devices.

Weitong Cai, Hang Zhang, Yukai Huang, Shitong Sun, Jiankang Deng, Songcen Xu, Jifei Song, Zhensong Zhang• 2026

Related benchmarks

TaskDatasetResultRank
Video UnderstandingVideo-MME
Overall Score67.6
96
Real-Time Visual UnderstandingStreamingBench
Overall Score75.24
96
Real-Time Visual UnderstandingStreamingBench Real-Time Visual Understanding (test)
OP81.57
33
Long-form Video UnderstandingVideo-MME long-form duration
Overall Performance66.1
12
Showing 4 of 4 rows

Other info

Follow for update