Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Accelerated Diffusion Models via Speculative Sampling

About

Speculative sampling is a popular technique for accelerating inference in Large Language Models by generating candidate tokens using a fast draft model and accepting or rejecting them based on the target model's distribution. While speculative sampling was previously limited to discrete sequences, we extend it to diffusion models, which generate samples via continuous, vector-valued Markov chains. In this context, the target model is a high-quality but computationally expensive diffusion model. We propose various drafting strategies, including a simple and effective approach that does not require training a draft model and is applicable out of the box to any diffusion model. Our experiments demonstrate significant generation speedup on various diffusion models, halving the number of function evaluations, while generating exact samples from the target model.

Valentin De Bortoli, Alexandre Galashov, Arthur Gretton, Arnaud Doucet• 2025

Related benchmarks

TaskDatasetResultRank
Robotic Manipulation (Square)Square PH
Success Rate70
16
Robotic Manipulation (Lift)Lift PH--
11
Multi-stage taskKitchen & BP
Success Rate (BP p1)0.86
10
Robot ManipulationMixed Human (MH)
Success Rate (Lift)66
10
Robot Manipulation (Can)Proficient Human (PH)
Success Rate (1)82
5
Robot Manipulation (Overall Suite Performance)Proficient Human (PH)
Avg Success Rate (Trial 1)56
5
Robot Manipulation (Push-T)Proficient Human (PH)
Success Rate (Criteria 1)16
5
Robot Manipulation (Tool)Proficient Human (PH)
Success Rate (Crit 1)26
5
Robot Manipulation (Transport)Proficient Human (PH)
Success Rate 150
5
Showing 9 of 9 rows

Other info

Follow for update