MTIL: Encoding Full History with Mamba for Temporal Imitation Learning
About
Standard imitation learning (IL) methods have achieved considerable success in robotics, yet often rely on the Markov assumption, which falters in long-horizon tasks where history is crucial for resolving perceptual ambiguity. This limitation stems not only from a conceptual gap but also from a fundamental computational barrier: prevailing architectures like Transformers are often constrained by quadratic complexity, rendering the processing of long, high-dimensional observation sequences infeasible. To overcome this dual challenge, we introduce Mamba Temporal Imitation Learning (MTIL). Our approach represents a new paradigm for robotic learning, which we frame as a practical synthesis of World Model and Dynamical System concepts. By leveraging the linear-time recurrent dynamics of State Space Models (SSMs), MTIL learns an implicit, action-oriented world model that efficiently encodes the entire trajectory history into a compressed, evolving state. This allows the policy to be conditioned on a comprehensive temporal context, transcending the confines of Markovian approaches. Through extensive experiments on simulated benchmarks (ACT, Robomimic, LIBERO) and on challenging real-world tasks, MTIL demonstrates superior performance against SOTA methods like ACT and Diffusion Policy, particularly in resolving long-term temporal ambiguities. Our findings not only affirm the necessity of full temporal context but also validate MTIL as a powerful and a computationally feasible approach for learning long-horizon, non-Markovian behaviors from high-dimensional observations.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Guessing Game | Real-world robotic manipulation tasks | Success Rate60 | 4 | |
| Buttons in Sequence | Real-world robotic manipulation tasks | Success Rate45 | 4 | |
| Exchange Objects | Real-world robotic manipulation tasks | Success Rate44 | 4 | |
| Hold the Pot Lid | Real-world robotic manipulation tasks | Success Rate20 | 4 | |
| Overall Average Performance | Real-world robotic manipulation tasks | Avg Success Rate0.45 | 4 | |
| Sponge and Square | Real-world robotic manipulation tasks | Success Rate60 | 4 | |
| Wipe the Table Once | Real-world robotic manipulation tasks | Success Rate50 | 4 | |
| Wipe the Table Twice | Real-world robotic manipulation tasks | Success Rate35 | 4 |