Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MTIL: Encoding Full History with Mamba for Temporal Imitation Learning

About

Standard imitation learning (IL) methods have achieved considerable success in robotics, yet often rely on the Markov assumption, which falters in long-horizon tasks where history is crucial for resolving perceptual ambiguity. This limitation stems not only from a conceptual gap but also from a fundamental computational barrier: prevailing architectures like Transformers are often constrained by quadratic complexity, rendering the processing of long, high-dimensional observation sequences infeasible. To overcome this dual challenge, we introduce Mamba Temporal Imitation Learning (MTIL). Our approach represents a new paradigm for robotic learning, which we frame as a practical synthesis of World Model and Dynamical System concepts. By leveraging the linear-time recurrent dynamics of State Space Models (SSMs), MTIL learns an implicit, action-oriented world model that efficiently encodes the entire trajectory history into a compressed, evolving state. This allows the policy to be conditioned on a comprehensive temporal context, transcending the confines of Markovian approaches. Through extensive experiments on simulated benchmarks (ACT, Robomimic, LIBERO) and on challenging real-world tasks, MTIL demonstrates superior performance against SOTA methods like ACT and Diffusion Policy, particularly in resolving long-term temporal ambiguities. Our findings not only affirm the necessity of full temporal context but also validate MTIL as a powerful and a computationally feasible approach for learning long-horizon, non-Markovian behaviors from high-dimensional observations.

Yulin Zhou, Yuankai Lin, Fanzhe Peng, Jiahui Chen, Kaiji Huang, Hua Yang, Zhouping Yin• 2025

Related benchmarks

TaskDatasetResultRank
Guessing GameReal-world robotic manipulation tasks
Success Rate60
4
Buttons in SequenceReal-world robotic manipulation tasks
Success Rate45
4
Exchange ObjectsReal-world robotic manipulation tasks
Success Rate44
4
Hold the Pot LidReal-world robotic manipulation tasks
Success Rate20
4
Overall Average PerformanceReal-world robotic manipulation tasks
Avg Success Rate0.45
4
Sponge and SquareReal-world robotic manipulation tasks
Success Rate60
4
Wipe the Table OnceReal-world robotic manipulation tasks
Success Rate50
4
Wipe the Table TwiceReal-world robotic manipulation tasks
Success Rate35
4
Showing 8 of 8 rows

Other info

Follow for update