StreamVLA: Breaking the Reason-Act Cycle via Completion-State Gating

About

Long-horizon robotic manipulation requires bridging the gap between high-level planning (System 2) and low-level control (System 1). Current Vision-Language-Action (VLA) models often entangle these processes, performing redundant multimodal reasoning at every timestep, which leads to high latency and goal instability. To address this, we present StreamVLA, a dual-system architecture that unifies textual task decomposition, visual goal imagination, and continuous action generation within a single parameter-efficient backbone. We introduce a "Lock-and-Gated" mechanism to modulate computation: the model triggers slow thinking only when a sub-task transition is detected, generating a textual instruction and imagining the specific visual completion state rather than generic future frames. Crucially, this completion state serves as a time-invariant goal anchor, making the policy robust to variations in execution speed. During steady execution, these high-level intents are locked to condition a Flow Matching action head, allowing the model to bypass expensive autoregressive decoding for 72% of timesteps. This hierarchical abstraction keeps the policy focused on the current sub-goal while significantly reducing inference latency. Extensive evaluations demonstrate that StreamVLA achieves state-of-the-art performance, with a 98.5% success rate on the LIBERO benchmark and robust recovery in real-world interference scenarios, while reducing latency by 48% compared to full-reasoning baselines.
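
The gating idea described above can be illustrated with a minimal Python sketch. This is not the authors' implementation: the names `LockedIntent`, `GatedController`, `detect_transition`, `slow_reason`, and `fast_act` are hypothetical, and the toy one-dimensional task merely stands in for the real transition detector, reasoning module, and Flow Matching action head. It shows only the control flow: the slow path runs when a sub-task transition is detected, and the resulting intent stays locked for every other step.

```python
import math
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class LockedIntent:
    """High-level intent held fixed while one sub-task executes (hypothetical)."""
    instruction: str   # stand-in for the textual sub-task instruction
    goal_state: float  # stand-in for the imagined visual completion state


class GatedController:
    """Runs the slow path only on sub-task transitions; otherwise reuses the lock."""

    def __init__(
        self,
        detect_transition: Callable[[float, LockedIntent], bool],
        slow_reason: Callable[[float], LockedIntent],
        fast_act: Callable[[float, LockedIntent], float],
    ) -> None:
        self.detect_transition = detect_transition
        self.slow_reason = slow_reason
        self.fast_act = fast_act
        self.intent: Optional[LockedIntent] = None
        self.slow_calls = 0
        self.steps = 0

    def step(self, obs: float) -> float:
        self.steps += 1
        # Gate: re-plan only at the first step or when the sub-task has changed.
        if self.intent is None or self.detect_transition(obs, self.intent):
            self.intent = self.slow_reason(obs)
            self.slow_calls += 1
        # Fast path: the action is conditioned on the locked intent.
        return self.fast_act(obs, self.intent)

    @property
    def bypass_fraction(self) -> float:
        """Fraction of steps that skipped the slow path."""
        return 1.0 - self.slow_calls / self.steps if self.steps else 0.0


def run_toy(num_steps: int = 12):
    """Toy 1-D task (an assumption for illustration): reach 1.0, then 2.0, ..."""
    ctrl = GatedController(
        detect_transition=lambda obs, intent: obs >= intent.goal_state,
        slow_reason=lambda obs: LockedIntent(
            instruction=f"move to {math.floor(obs) + 1.0}",
            goal_state=math.floor(obs) + 1.0,
        ),
        fast_act=lambda obs, intent: min(0.25, intent.goal_state - obs),
    )
    obs = 0.0
    for _ in range(num_steps):
        obs += ctrl.step(obs)
    return ctrl, obs


if __name__ == "__main__":
    ctrl, final_obs = run_toy()
    print(ctrl.slow_calls, ctrl.bypass_fraction, final_obs)
```

In this toy run the slow path fires once per sub-goal, so the bypass fraction depends entirely on how many steps each sub-task takes; the 72% figure reported in the abstract is an empirical property of the authors' tasks and is not reproduced by this sketch.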

Tongqing Chen, Hang Wu, Jiasen Wang, Xiaotao Li, Lu Fang • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Robot Manipulation | LIBERO | Goal Achievement: 98.6 | 494 |
| Robot Manipulation | RoboTwin Hard 2.0 | Beat Block Hammer Success Rate: 42 | 8 |
| Dual-arm manipulation | RoboTwin Easy 2.0 | BBH Success Rate: 79 | 6 |
| Insertion | Real-world | Success Rate: 70 | 4 |
| Interference Spelling | Real-world | Success Rate: 55 | 4 |
| Spelling | Real-world | Success Rate: 90 | 4 |
