Feed-forward Gaussian Registration for Head Avatar Creation and Editing

About

We present MATCH (Multi-view Avatars from Topologically Corresponding Heads), a multi-view Gaussian registration method for high-quality head avatar creation and editing. State-of-the-art multi-view head avatar methods require time-consuming head tracking followed by expensive avatar optimization, often resulting in a total creation time of more than one day. MATCH, in contrast, directly predicts Gaussian splat textures in correspondence from calibrated multi-view images in just 0.5 seconds per frame, without requiring data preprocessing. The learned intra-subject correspondence across frames enables fast creation of personalized head avatars, while correspondence across subjects supports applications such as expression transfer, optimization-free tracking, semantic editing, and identity interpolation. We establish these correspondences end-to-end using a transformer-based model that predicts Gaussian splat textures in the fixed UV layout of a template mesh. To achieve this, we introduce a novel registration-guided attention block, where each UV-map token attends exclusively to image tokens depicting its corresponding mesh region. This design improves efficiency and performance compared to dense cross-view attention. MATCH outperforms existing methods in novel-view synthesis, geometry registration, and head avatar generation, while making avatar creation 10 times faster than the closest competing baseline. The code and model weights are available on the project website.
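The registration-guided attention idea described above, where each UV-map token attends only to the image tokens that depict its mesh region, can be illustrated as masked cross-attention. The following is a minimal NumPy sketch under stated assumptions, not the authors' implementation: the function name `masked_cross_attention`, the boolean `mask` input, and the single-head form without learned projections are all hypothetical simplifications. How MATCH actually derives the token-to-region assignment is not specified here, so the mask is simply taken as given.

```python
import numpy as np

def masked_cross_attention(uv_tokens, img_tokens, mask):
    """Single-head cross-attention restricted by a boolean mask.

    uv_tokens:  (U, D) query tokens, one per UV-map patch (assumed layout).
    img_tokens: (I, D) key/value tokens from the input views (assumed layout).
    mask:       (U, I) bool, True where a UV token may attend to an image token.
                Hypothetical input: the rule that produces it is not given here.
    Returns the attended features (U, D) and the attention weights (U, I).
    """
    d = uv_tokens.shape[-1]
    # Scaled dot-product scores between every UV token and every image token.
    scores = uv_tokens @ img_tokens.T / np.sqrt(d)
    # Disallowed pairs get -inf so they receive zero weight after the softmax.
    scores = np.where(mask, scores, -np.inf)
    # Numerically stable softmax; rows with no allowed image token stay all-zero.
    row_max = np.max(scores, axis=-1, keepdims=True)
    row_max = np.where(np.isfinite(row_max), row_max, 0.0)
    weights = np.exp(scores - row_max)
    denom = weights.sum(axis=-1, keepdims=True)
    weights = np.divide(weights, denom, out=np.zeros_like(weights), where=denom > 0)
    return weights @ img_tokens, weights

# Toy usage: 3 UV tokens, 5 image tokens, 4 feature channels.
rng = np.random.default_rng(0)
uv = rng.normal(size=(3, 4))
img = rng.normal(size=(5, 4))
mask = np.zeros((3, 5), dtype=bool)
mask[0, :2] = True   # UV token 0 may see image tokens 0 and 1
mask[1, 2:] = True   # UV token 1 may see image tokens 2, 3 and 4
out, weights = masked_cross_attention(uv, img, mask)  # UV token 2 sees nothing
```

Compared with dense cross-view attention, each query in this sketch only distributes weight over its permitted subset of image tokens. A practical implementation would also skip computing the masked scores entirely, which is where an efficiency gain would come from.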

Malte Prinzler, Paulo Gotardo, Siyu Tang, Timo Bolkart • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Novel View Synthesis | NeRSemble v2 (test) | LPIPS 0.152 | 7 |
| Novel View Synthesis | Ava-256 v1 (test) | LPIPS 0.163 | 6 |
| Cross-Reenactment | Ava-256 held-out sequences (test) | CSIM 0.813 | 4 |
| Self-Reenactment | Ava-256 held-out sequences (test) | LPIPS 0.174 | 4 |