Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Better Rigs, Not Bigger Networks: A Body Model Ablation for Gaussian Avatars

About

Recent 3D Gaussian splatting methods built atop SMPL achieve remarkable visual fidelity while continually increasing the complexity of the overall training architecture. We demonstrate that much of this complexity is unnecessary: by replacing SMPL with the Momentum Human Rig (MHR), estimated via SAM-3D-Body, a minimal pipeline with no learned deformations or pose-dependent corrections achieves the highest reported PSNR and competitive or superior LPIPS and SSIM on PeopleSnapshot and ZJU-MoCap. To disentangle pose estimation quality from body model representational capacity, we perform two controlled ablations: translating SAM-3D-Body meshes to SMPL-X, and translating the original dataset's SMPL poses into MHR both retrained under identical conditions. These ablations confirm that body model expressiveness has been a primary bottleneck in avatar reconstruction, with both mesh representational capacity and pose estimation quality contributing meaningfully to the full pipeline's gains.

Derek Austin• 2026

Related benchmarks

TaskDatasetResultRank
View SynthesisPeople-Snapshot male-3-casual
PSNR36.94
15
View SynthesisPeople-Snapshot male-4-casual
PSNR34.2
15
View SynthesisPeople-Snapshot female-3-casual
PSNR37.56
15
Novel Pose SynthesisPeopleSnapshot Female-4-casual
PSNR37.01
7
Novel Pose SynthesisPeopleSnapshot Average
PSNR36.43
7
Novel View SynthesisZJU-MoCap single-camera training protocol 1.0 (test)
PSNR (Frame 377)34.26
5
Showing 6 of 6 rows

Other info

Follow for update