Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Resolving State Ambiguity in Robot Manipulation via Adaptive Working Memory Recoding

About

State ambiguity is common in robotic manipulation. Identical observations may correspond to multiple valid behavior trajectories. The visuomotor policy must correctly extract the appropriate types and levels of information from the history to identify the current task phase. However, naively extending the history window is computationally expensive and may cause severe overfitting. Inspired by the continuous nature of human reasoning and the recoding of working memory, we introduce PAM, a novel visuomotor Policy equipped with Adaptive working Memory. With minimal additional training cost in a two-stage manner, PAM supports a 300-frame history window while maintaining high inference speed. Specifically, a hierarchical frame feature extractor yields two distinct representations for motion primitives and temporal disambiguation. For compact representation, a context router with range-specific queries is employed to produce compact context features across multiple history lengths. And an auxiliary objective of reconstructing historical information is introduced to ensure that the context router acts as an effective bottleneck. We meticulously design 7 tasks and verify that PAM can handle multiple scenarios of state ambiguity simultaneously. With a history window of approximately 10 seconds, PAM still supports stable training and maintains inference speeds above 20Hz. Project website: https://tinda24.github.io/pam/

Qingda Hu, Ziheng Qiu, Zijun Xu, Kaizhao Zhang, Xizhou Bu, Zuolei Sun, Bo Zhang, Jieru Zhao, Zhongxue Gan, Wenchao Ding• 2025

Related benchmarks

TaskDatasetResultRank
Robotic ManipulationLibero-Long 10 tasks
Success Rate84.7
5
Buttons in SequenceReal-world robotic manipulation tasks
Success Rate80
4
Exchange ObjectsReal-world robotic manipulation tasks
Success Rate90
4
Guessing GameReal-world robotic manipulation tasks
Success Rate97
4
Hold the Pot LidReal-world robotic manipulation tasks
Success Rate95
4
Overall Average PerformanceReal-world robotic manipulation tasks
Avg Success Rate0.91
4
Sponge and SquareReal-world robotic manipulation tasks
Success Rate96
4
Wipe the Table OnceReal-world robotic manipulation tasks
Success Rate100
4
Wipe the Table TwiceReal-world robotic manipulation tasks
Success Rate92
4
Showing 9 of 9 rows

Other info

Follow for update