Scaling Mesh Generation via Compressive Tokenization

About

We propose a compressive yet effective mesh representation, Blocked and Patchified Tokenization (BPT), facilitating the generation of meshes exceeding 8k faces. BPT compresses mesh sequences by employing block-wise indexing and patch aggregation, reducing their length by approximately 75\% compared to the original sequences. This compression milestone unlocks the potential to utilize mesh data with significantly more faces, thereby enhancing detail richness and improving generation robustness. Empowered with the BPT, we have built a foundation mesh generative model training on scaled mesh data to support flexible control for point clouds and images. Our model demonstrates the capability to generate meshes with intricate details and accurate topology, achieving SoTA performance on mesh generation and reaching the level for direct product usage.

Haohan Weng, Zibo Zhao, Biwen Lei, Xianghui Yang, Jian Liu, Zeqiang Lai, Zhuo Chen, Yuhong Liu, Jie Jiang, Chunchao Guo, Tong Zhang, Shenghua Gao, C. L. Philip Chen• 2024

Related benchmarks

Task	Dataset	Result
3D Geometry Synthesis	Toys4K (test)	CD5.71	19
3D Mesh Generation	Objaverse	Chamfer Distance0.027	18
Mesh Reconstruction	Toys4k	Chamfer Distance0.037	16
3D Head Mesh Reconstruction	D (test)	CD0.0417	15
Mesh Tokenization	3D Mesh Representation	Compression Ratio0.26	12
3D Object Generation	ShapeNet	Chamfer Distance (CD)0.003	10
Mesh Reconstruction	Objaverse (test)	Chamfer Distance0.043	10
3D Mesh Reconstruction	Artistic meshes	Chamfer Distance (L2)0.052	10
Mesh Tokenization	Toys4K (test)	Compression Ratio0.26	8
Mesh Generation	Hunyuan3D Dense Meshes 2.5	Chamfer Distance (CD)0.109	7

Showing 10 of 30 rows

Other info

Code

Follow for update

@wizwand_team Discord