Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning the 3D Fauna of the Web

About

Learning 3D models of all animals on the Earth requires massively scaling up existing solutions. With this ultimate goal in mind, we develop 3D-Fauna, an approach that learns a pan-category deformable 3D animal model for more than 100 animal species jointly. One crucial bottleneck of modeling animals is the limited availability of training data, which we overcome by simply learning from 2D Internet images. We show that prior category-specific attempts fail to generalize to rare species with limited training images. We address this challenge by introducing the Semantic Bank of Skinned Models (SBSM), which automatically discovers a small set of base animal shapes by combining geometric inductive priors with semantic knowledge implicitly captured by an off-the-shelf self-supervised feature extractor. To train such a model, we also contribute a new large-scale dataset of diverse animal species. At inference time, given a single image of any quadruped animal, our model reconstructs an articulated 3D mesh in a feed-forward fashion within seconds.

Zizhang Li, Dor Litvak, Ruining Li, Yunzhi Zhang, Tomas Jakab, Christian Rupprecht, Shangzhe Wu, Andrea Vedaldi, Jiajun Wu• 2024

Related benchmarks

TaskDatasetResultRank
3D Shape ReconstructionAnimodel (test)
Chamfer Distance (Horse)3.13
12
Point cloud generationAnimodel-Points (Cow)
Chamfer Distance (cm)3.2
10
Point cloud generationAnimodel-Points Sheep
Chamfer Distance (cm)3.06
10
Point Cloud ReconstructionAnimodel-Points
RMS CD (Horse, cm)11.86
6
3D Animal Pose and Shape EstimationAnimal3D (test)--
6
Keypoint TransferPASCAL VOC 12 (test)
PCK (Horse)53.9
5
3D Animal ReconstructionPASCAL VOC Horses 10 (test)
KT-PCK@0.153.9
4
3D Animal ReconstructionAPT-36K (test)
PCK@0.184.1
3
3D Animal ReconstructionPASCAL VOC Multiple Species 10 (test)
PCK@0.178.2
2
Showing 9 of 9 rows

Other info

Code

Follow for update