Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Implicit Maximum Likelihood Estimation for Real-time Generative Model Predictive Control

About

Diffusion-based models have recently shown strong performance in trajectory planning, as they are capable of capturing diverse, multimodal distributions of complex behaviors. A key limitation of these models is their slow inference speed, which results from the iterative denoising process. This makes them less suitable for real-time applications such as closed-loop model predictive control (MPC), where plans must be generated quickly and adapted continuously to a changing environment. In this paper, we investigate Implicit Maximum Likelihood Estimation (IMLE) as an alternative generative modeling approach for planning. IMLE offers strong mode coverage while enabling inference that is two orders of magnitude faster, making it particularly well suited for real-time MPC tasks. Our results demonstrate that IMLE achieves competitive performance on standard offline reinforcement learning benchmarks compared to the standard diffusion-based planner, while substantially improving planning speed in both open-loop and closed-loop settings. We further validate IMLE in a closed-loop human navigation scenario, operating in real-time, demonstrating how it enables rapid and adaptive plan generation in dynamic environments. Real-world videos and code are available at https://gmpc-imle.github.io/.

Grayson Lee, Minh Bui, Shuzi Zhou, Yankai Li, Mo Chen, Ke Li• 2026

Related benchmarks

TaskDatasetResultRank
hopper locomotionD4RL hopper medium-replay
Normalized Score85
66
walker2d locomotionD4RL walker2d medium-replay
Normalized Score69.7
63
LocomotionD4RL walker2d-medium-expert
Normalized Score107.9
63
LocomotionD4RL HalfCheetah Medium-Replay
Normalized Score0.395
61
LocomotionD4RL Halfcheetah medium
Normalized Score43.1
60
LocomotionD4RL Walker2d medium
Normalized Score78.3
60
LocomotionD4RL halfcheetah-medium-expert
Normalized Score91.9
53
hopper locomotionD4RL hopper-medium-expert
Normalized Score104.2
48
LocomotionD4RL Hopper medium
Normalized Score85
30
Goal-conditioned PlanningMaze2D Single Task
Performance Score129.2
6
Showing 10 of 15 rows

Other info

Follow for update