
Trajectory-Consistent Calibration for Cache-Accelerated Diffusion Models

About

Diffusion Transformers require repeated denoiser evaluations during iterative sampling, making inference computationally expensive. Cache-based acceleration reduces this cost by reusing intermediate representations across denoising steps, but can introduce representation deviations and degrade generation quality. In this paper, we analyze these deviations and show that effective calibration should consider both the direct mismatch caused by reuse and the subsequent trajectory shift induced by earlier corrections. To address this challenge, we propose Trajectory-Consistent Calibration (TCC), a training-free method that calibrates cached representations toward their full-computation counterparts. Specifically, rather than estimating all calibration priors from a single uncorrected cache trajectory, TCC uses an offline iterative procedure so that each prior accounts for the trajectory shift induced by preceding calibrations. Experiments on PixArt-alpha and DiT-XL/2 show that TCC consistently improves FID across representative cache-based acceleration methods while preserving their underlying reuse policies. Notably, in a representative PixArt-alpha cache-acceleration setting based on FORA, TCC reduces FID from 29.83 to 27.35, slightly surpassing the full-computation baseline.
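The contrast drawn above, between priors estimated from a single uncorrected cache trajectory and priors estimated iteratively, can be illustrated with a toy sketch. It is not the authors' implementation. The abstract does not state the form of the calibration priors, so the per-step additive mean offset used here is an assumption, and `feature`, `step`, `fit_priors`, and `sample` are hypothetical names operating on a scalar toy sampler rather than a Diffusion Transformer.

```python
import math
import random

def feature(x, t):
    """Hypothetical stand-in for an expensive intermediate representation."""
    return math.tanh(x) * (1.0 + 0.5 * t)

def step(x, feat):
    """Hypothetical sampler update driven by the representation."""
    return x - 0.2 * feat

def fit_priors(batch, num_steps, interval, consistent):
    """Fit one additive offset (assumed prior form) per cached step, offline.

    consistent=False: offsets are measured along the uncorrected cache trajectory.
    consistent=True: each offset is applied before moving on, so later offsets
    are measured on the trajectory already shifted by earlier calibrations.
    """
    xs = list(batch)
    cache = [0.0] * len(xs)
    priors = {}
    for t in range(num_steps):
        if t % interval == 0:
            # Full-computation step: refresh the cache.
            cache = [feature(x, t) for x in xs]
            used = cache
        else:
            # Cached step: the full feature is computed only for calibration.
            gaps = [feature(x, t) - c for x, c in zip(xs, cache)]
            priors[t] = sum(gaps) / len(gaps)
            shift = priors[t] if consistent else 0.0
            used = [c + shift for c in cache]
        xs = [step(x, f) for x, f in zip(xs, used)]
    return priors

def sample(x0, num_steps, interval=None, priors=None):
    """Run the sampler; interval=None means full computation at every step."""
    x, cache = x0, 0.0
    for t in range(num_steps):
        if interval is None or t % interval == 0:
            cache = feature(x, t)
            used = cache
        else:
            # Same reuse schedule; only a stored offset is added to the cache.
            used = cache + (priors.get(t, 0.0) if priors else 0.0)
        x = step(x, used)
    return x

rng = random.Random(0)
calib = [rng.gauss(1.0, 0.5) for _ in range(256)]
one_shot = fit_priors(calib, num_steps=6, interval=3, consistent=False)
iterative = fit_priors(calib, num_steps=6, interval=3, consistent=True)
```

In this sketch the two variants agree at the first cached step, where no earlier calibration exists, and diverge afterwards because the iterative variant measures each offset on the already-corrected trajectory. At inference, `sample(x0, 6, interval=3, priors=iterative)` leaves the reuse schedule untouched and only adds the stored offsets.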

Mingyu Liang, Dingkun Xu, Jingwei Xu • 2026

Related benchmarks

Task | Dataset | Metric | Result | Rank
Class-conditional Image Generation | ImageNet 256x256 (val) | Inception Score (IS) | 224.5 | 493
Class-conditional Image Generation | ImageNet 512x512 (val) | FID (Val) | 6.03 | 102
Class-conditional Image Generation | ImageNet class-conditional 256x256 | Inception Score (IS) | 245.2 | 61
Text-to-Image Generation | MS-COCO 2017 (test) | FID | 27.35 | 21
