LongCat-Video Technical Report

About

Video generation is a critical pathway toward world models, with efficient long video inference as a key capability. Toward this end, we introduce LongCat-Video, a foundational video generation model with 13.6B parameters, delivering strong performance across multiple video generation tasks. It particularly excels in efficient and high-quality long video generation, representing our first step toward world models. Key features include: Unified architecture for multiple tasks: Built on the Diffusion Transformer (DiT) framework, LongCat-Video supports Text-to-Video, Image-to-Video, and Video-Continuation tasks with a single model; Long video generation: Pretraining on Video-Continuation tasks enables LongCat-Video to maintain high quality and temporal coherence in the generation of minutes-long videos; Efficient inference: LongCat-Video generates 720p, 30fps videos within minutes by employing a coarse-to-fine generation strategy along both the temporal and spatial axes. Block Sparse Attention further enhances efficiency, particularly at high resolutions; Strong performance with multi-reward RLHF: Multi-reward RLHF training enables LongCat-Video to achieve performance on par with the latest closed-source and leading open-source models. Code and model weights are publicly available to accelerate progress in the field.

Meituan LongCat Team: Xunliang Cai, Qilong Huang, Zhuoliang Kang, Hongyu Li, Shijun Liang, Liya Ma, Siyu Ren, Xiaoming Wei, Rixu Xie, Tong Zhang• 2025

Related benchmarks

Task	Dataset	Result
Robotic Video Generation	R-Bench	Average Score43.7	44
Video Generation	short videos 81-frames 240 prompts	Total Score6.3	38
Long Video Generation	120, 240, 720 and 1440-frames long videos	Total Score6.54	20
Interactive Video Generation	Matrix-Game 3.0	Self PSNR16.66	5
Interactive Video Generation	HY-WorldPlay	PSNR (Self-Comparison)15.44	5
Dyadic conversational video generation	curated dyadic conversational dataset (test)	FID45.4698	4

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord