Implicit Maximum Likelihood Estimation for Real-time Generative Model Predictive Control
About
Diffusion-based models have recently shown strong performance in trajectory planning, as they are capable of capturing diverse, multimodal distributions of complex behaviors. A key limitation of these models is their slow inference speed, which results from the iterative denoising process. This makes them less suitable for real-time applications such as closed-loop model predictive control (MPC), where plans must be generated quickly and adapted continuously to a changing environment. In this paper, we investigate Implicit Maximum Likelihood Estimation (IMLE) as an alternative generative modeling approach for planning. IMLE offers strong mode coverage while enabling inference that is two orders of magnitude faster, making it particularly well suited for real-time MPC tasks. Our results demonstrate that IMLE achieves competitive performance on standard offline reinforcement learning benchmarks compared to the standard diffusion-based planner, while substantially improving planning speed in both open-loop and closed-loop settings. We further validate IMLE in a closed-loop human navigation scenario, operating in real-time, demonstrating how it enables rapid and adaptive plan generation in dynamic environments. Real-world videos and code are available at https://gmpc-imle.github.io/.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| hopper locomotion | D4RL hopper medium-replay | Normalized Score85 | 66 | |
| walker2d locomotion | D4RL walker2d medium-replay | Normalized Score69.7 | 63 | |
| Locomotion | D4RL walker2d-medium-expert | Normalized Score107.9 | 63 | |
| Locomotion | D4RL HalfCheetah Medium-Replay | Normalized Score0.395 | 61 | |
| Locomotion | D4RL Halfcheetah medium | Normalized Score43.1 | 60 | |
| Locomotion | D4RL Walker2d medium | Normalized Score78.3 | 60 | |
| Locomotion | D4RL halfcheetah-medium-expert | Normalized Score91.9 | 53 | |
| hopper locomotion | D4RL hopper-medium-expert | Normalized Score104.2 | 48 | |
| Locomotion | D4RL Hopper medium | Normalized Score85 | 30 | |
| Goal-conditioned Planning | Maze2D Single Task | Performance Score129.2 | 6 |