Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MagCache: Fast Video Generation with Magnitude-Aware Cache

About

Existing acceleration techniques for video diffusion models often rely on uniform heuristics or time-embedding variants to skip timesteps and reuse cached features. These approaches typically require extensive calibration with curated prompts and risk inconsistent outputs due to prompt-specific overfitting. In this paper, we introduce a novel and robust discovery: a unified magnitude law observed across different models and prompts. Specifically, the magnitude ratio of successive residual outputs decreases monotonically, steadily in most timesteps while rapidly in the last several steps. Leveraging this insight, we introduce a Magnitude-aware Cache (MagCache) that adaptively skips unimportant timesteps using an error modeling mechanism and adaptive caching strategy. Unlike existing methods requiring dozens of curated samples for calibration, MagCache only requires a single sample for calibration. Experimental results show that MagCache achieves 2.10x-2.68x speedups on Open-Sora, CogVideoX, Wan 2.1, and HunyuanVideo, while preserving superior visual fidelity. It significantly outperforms existing methods in LPIPS, SSIM, and PSNR, under similar computational budgets.

Zehong Ma, Longhui Wei, Feng Wang, Shiliang Zhang, Qi Tian• 2025

Related benchmarks

TaskDatasetResultRank
Text-to-Image GenerationFLUX.1 (dev)
PSNR29.96
8
Text-to-Image GenerationFLUX.1 T=50 (dev)
GenEval64.61
7
Text-to-Image GenerationBAGEL T=50
GenEval87.29
7
Image-to-VideoHunyuanVideo 81 frames, 480p 1.5
FLOPs (P)3.42
6
Text-to-VideoWan 81 frames 720p 2.2
FLOPs (P)19.57
6
Text-to-VideoWan 81 frames, 480p 2.2
FLOPs (P)5.03
6
Text-to-Video GenerationWan 1.3B T2V 2.1
PSNR25.55
4
Showing 7 of 7 rows

Other info

Follow for update