Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation

About

Synthesizing high-quality dynamic medical videos remains a significant challenge due to the need for modeling both spatial consistency and temporal dynamics. Existing Transformer-based approaches face critical limitations, including insufficient channel interactions, high computational complexity from self-attention, and coarse denoising guidance from timestep embeddings when handling varying noise levels. In this work, we propose FEAT, a full-dimensional efficient attention Transformer, which addresses these issues through three key innovations: (1) a unified paradigm with sequential spatial-temporal-channel attention mechanisms to capture global dependencies across all dimensions, (2) a linear-complexity design for attention mechanisms in each dimension, utilizing weighted key-value attention and global channel attention, and (3) a residual value guidance module that provides fine-grained pixel-level guidance to adapt to different noise levels. We evaluate FEAT on standard benchmarks and downstream tasks, demonstrating that FEAT-S, with only 23\% of the parameters of the state-of-the-art model Endora, achieves comparable or even superior performance. Furthermore, FEAT-L surpasses all comparison methods across multiple datasets, showcasing both superior effectiveness and scalability. Code is available at https://github.com/Yaziwel/FEAT.

Huihan Wang, Zhiwen Yang, Hui Zhang, Dan Zhao, Bingzheng Wei, Yan Xu• 2025

Related benchmarks

TaskDatasetResultRank
Colonoscopy Video GenerationColonoscopic
FVD351
7
Colonoscopy Video GenerationSUN-SEG
FVD356
7
Colonoscopy Video GenerationHyperKvasir
FVD511
7
Colonoscopy Video GenerationHospital Database
FVD402
7
Video GenerationColonoscopic 25
FVD351.1
6
Video GenerationKvasir-Capsule 26
FVD59.2
6
Semi-supervised Video ClassificationPolyDiag (test)
F1 Score91.3
5
Showing 7 of 7 rows

Other info

Follow for update