Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PPGuide: Steering Diffusion Policies with Performance Predictive Guidance

About

Diffusion policies have shown to be very efficient at learning complex, multi-modal behaviors for robotic manipulation. However, errors in generated action sequences can compound over time which can potentially lead to failure. Some approaches mitigate this by augmenting datasets with expert demonstrations or learning predictive world models which might be computationally expensive. We introduce Performance Predictive Guidance (PPGuide), a lightweight, classifier-based framework that steers a pre-trained diffusion policy away from failure modes at inference time. PPGuide makes use of a novel self-supervised process: it uses attention-based multiple instance learning to automatically estimate which observation-action chunks from the policy's rollouts are relevant to success or failure. We then train a performance predictor on this self-labeled data. During inference, this predictor provides a real-time gradient to guide the policy toward more robust actions. We validated our proposed PPGuide across a diverse set of tasks from the Robomimic and MimicGen benchmarks, demonstrating consistent improvements in performance.

Zixing Wang, Devesh K. Jha, Ahmed H. Qureshi, Diego Romeres• 2026

Related benchmarks

TaskDatasetResultRank
Coffee Making/HandlingRobomimic MimicGen Coffee (D2)
Success Rate60
25
Mug CleanupRobomimic MimicGen Mug Cleanup (D1)
Success Rate36
20
Coffee PreparationRobomimic/MimicGen Coffee Prep. (D1)
Success Rate24
20
Kitchen manipulationRobomimic/MimicGen Kitchen (D1)
Success Rate54
10
Object TransportRobomimic Transport
Success Rate76
10
Square Nut InsertionRobomimic Square
Success Rate72
10
StackingRobomimic MimicGen Stack D1
Success Rate94
10
Stacking Three BlocksRobomimic MimicGen Stack Three (D1)
Success Rate36
10
Robot Manipulationsquare
Success Rate70
8
Robot ManipulationTransport
Success Rate74
8
Showing 10 of 10 rows

Other info

Follow for update