Implicit Maximum Likelihood Estimation for Real-time Generative Model Predictive Control

About

Diffusion-based models have recently shown strong performance in trajectory planning, as they are capable of capturing diverse, multimodal distributions of complex behaviors. A key limitation of these models is their slow inference speed, which results from the iterative denoising process. This makes them less suitable for real-time applications such as closed-loop model predictive control (MPC), where plans must be generated quickly and adapted continuously to a changing environment. In this paper, we investigate Implicit Maximum Likelihood Estimation (IMLE) as an alternative generative modeling approach for planning. IMLE offers strong mode coverage while enabling inference that is two orders of magnitude faster, making it particularly well suited for real-time MPC tasks. Our results demonstrate that IMLE achieves competitive performance on standard offline reinforcement learning benchmarks compared to the standard diffusion-based planner, while substantially improving planning speed in both open-loop and closed-loop settings. We further validate IMLE in a closed-loop human navigation scenario, operating in real-time, demonstrating how it enables rapid and adaptive plan generation in dynamic environments. Real-world videos and code are available at https://gmpc-imle.github.io/.

Grayson Lee, Minh Bui, Shuzi Zhou, Yankai Li, Mo Chen, Ke Li• 2026

Related benchmarks

Task	Dataset	Result
Locomotion	D4RL walker2d-medium-expert	Normalized Score107.9	90
walker2d locomotion	D4RL walker2d medium-replay	Normalized Score69.7	78
hopper locomotion	D4RL hopper medium-replay	Normalized Score85	71
Locomotion	D4RL Halfcheetah medium	Normalized Score43.1	70
Locomotion	D4RL Walker2d medium	Normalized Score78.3	70
Locomotion	D4RL HalfCheetah Medium-Replay	Normalized Score0.395	68
Locomotion	D4RL halfcheetah-medium-expert	Normalized Score91.9	53
hopper locomotion	D4RL hopper-medium-expert	Normalized Score104.2	53
Locomotion	D4RL Hopper medium	Normalized Score85	30
Goal-conditioned Planning	Maze2D Single Task	Performance Score129.2	6

Showing 10 of 15 rows

Other info

Follow for update

@wizwand_team Discord