Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AnimateAnyMesh: A Feed-Forward 4D Foundation Model for Text-Driven Universal Mesh Animation

About

Recent advances in 4D content generation have attracted increasing attention, yet creating high-quality animated 3D models remains challenging due to the complexity of modeling spatio-temporal distributions and the scarcity of 4D training data. In this paper, we present AnimateAnyMesh, the first feed-forward framework that enables efficient text-driven animation of arbitrary 3D meshes. Our approach leverages a novel DyMeshVAE architecture that effectively compresses and reconstructs dynamic mesh sequences by disentangling spatial and temporal features while preserving local topological structures. To enable high-quality text-conditional generation, we employ a Rectified Flow-based training strategy in the compressed latent space. Additionally, we contribute the DyMesh Dataset, containing over 4M diverse dynamic mesh sequences with text annotations. Experimental results demonstrate that our method generates semantically accurate and temporally coherent mesh animations in a few seconds, significantly outperforming existing approaches in both quality and efficiency. Our work marks a substantial step forward in making 4D content creation more accessible and practical. All the data, code, and models will be open-released.

Zijie Wu, Chaohui Yu, Fan Wang, Xiang Bai• 2025

Related benchmarks

TaskDatasetResultRank
Single-object 4D Motion GenerationUser Study Single-object 4D Motion Generation 1.0 (test)
Prompt Alignment1
36
4D GenerationVBench
Dynamic Degree62.5
13
4D Scene Motion GenerationSix diverse dynamic scenes animation set 1.0 (test)
Alignment1.01
6
Physically-inspired 4D GenerationWorldScore
CLIP Score73
5
Physically-inspired 4D GenerationUser Study
Alignment Score5.83
5
Video-guided Mesh AnimationVideo-RDMesh Generated reference subset
PSNR13.5
5
3D Motion Generation20 static meshes (test)
OC0.155
4
Text-to-motion generationBIMO
Text-to-Motion Agreement (TA)2.314
4
3D mesh animation from 2D sketches100 hand-drawn sketches dataset
T2VA0.1967
3
4D Human-Object Interaction Generation10 HOI Scenarios
VQA0.19
3
Showing 10 of 10 rows

Other info

Follow for update