Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

VideoMatGen: PBR Materials through Joint Generative Modeling

About

We present a method for generating physically-based materials for 3D shapes based on a video diffusion transformer architecture. Our method is conditioned on input geometry and a text description, and jointly models multiple material properties (base color, roughness, metallicity, height map) to form physically plausible materials. We further introduce a custom variational auto-encoder which encodes multiple material modalities into a compact latent space, which enables joint generation of multiple modalities without increasing the number of tokens. Our pipeline generates high-quality materials for 3D shapes given a text prompt, compatible with common content creation tools.

Jon Hasselgren, Zheng Zeng, Milos Hasan, Jacob Munkberg• 2026

Related benchmarks

TaskDatasetResultRank
Material generationBlenderVault (test)
CLIP-FID4.032
8
Showing 1 of 1 rows

Other info

Follow for update