Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MagicPony: Learning Articulated 3D Animals in the Wild

About

We consider the problem of predicting the 3D shape, articulation, viewpoint, texture, and lighting of an articulated animal like a horse given a single test image as input. We present a new method, dubbed MagicPony, that learns this predictor purely from in-the-wild single-view images of the object category, with minimal assumptions about the topology of deformation. At its core is an implicit-explicit representation of articulated shape and appearance, combining the strengths of neural fields and meshes. In order to help the model understand an object's shape and pose, we distil the knowledge captured by an off-the-shelf self-supervised vision transformer and fuse it into the 3D model. To overcome local optima in viewpoint estimation, we further introduce a new viewpoint sampling scheme that comes at no additional training cost. MagicPony outperforms prior work on this challenging task and demonstrates excellent generalisation in reconstructing art, despite the fact that it is only trained on real images.

Shangzhe Wu, Ruining Li, Tomas Jakab, Christian Rupprecht, Andrea Vedaldi• 2022

Related benchmarks

TaskDatasetResultRank
3D Shape ReconstructionAnimodel (test)
Chamfer Distance (Horse)2.58
12
Point cloud generationAnimodel-Points (Cow)
Chamfer Distance (cm)2.53
10
Point cloud generationAnimodel-Points Sheep
Chamfer Distance (cm)3
10
Keypoint TransferCUB Bird (test)
PCK@0.155.5
8
Keypoint TransferCUB Bird excluding 50 aquatic bird classes (test)
PCK@0.163.5
6
Point Cloud ReconstructionAnimodel-Points
RMS CD (Horse, cm)11.19
6
3D Animal Pose and Shape EstimationAnimal3D (test)--
6
Keypoint TransferPASCAL VOC 12 (test)
PCK (Horse)42.9
5
3D Animal ReconstructionPASCAL VOC Horses 10 (test)
KT-PCK@0.142.9
4
Point correspondencePASCAL VOS Cow
PCK42.5
4
Showing 10 of 13 rows

Other info

Code

Follow for update