Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ZoomNAS: Searching for Whole-body Human Pose Estimation in the Wild

About

This paper investigates the task of 2D whole-body human pose estimation, which aims to localize dense landmarks on the entire human body including body, feet, face, and hands. We propose a single-network approach, termed ZoomNet, to take into account the hierarchical structure of the full human body and solve the scale variation of different body parts. We further propose a neural architecture search framework, termed ZoomNAS, to promote both the accuracy and efficiency of whole-body pose estimation. ZoomNAS jointly searches the model architecture and the connections between different sub-modules, and automatically allocates computational complexity for searched sub-modules. To train and evaluate ZoomNAS, we introduce the first large-scale 2D human whole-body dataset, namely COCO-WholeBody V1.0, which annotates 133 keypoints for in-the-wild images. Extensive experiments demonstrate the effectiveness of ZoomNAS and the significance of COCO-WholeBody V1.0.

Lumin Xu, Sheng Jin, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang• 2022

Related benchmarks

TaskDatasetResultRank
Facial Landmark DetectionAFLW Full
NME1.42
101
Facial Landmark DetectionCOFW (test)
NME3.36
93
Whole-body Pose EstimationCOCO-Wholebody 1.0 (val)
Body AP74
64
Face AlignmentAFLW Frontal
NME (%)1.27
22
Whole-body Pose EstimationCOCO-WholeBody 1.0
Whole-body AP65.4
20
Pose EstimationHumans-5K (test)
Body AP59.7
13
Whole-body Pose EstimationCOCO-WholeBody V1.0 (test)
Body AP74.5
10
Hand Pose EstimationWholeBody-Hand (WBH) (test)
PCK (%)80.2
7
Facial Landmark DetectionWholeBody-Face (WBF)--
7
2D hand pose estimationPanoptic 7 (test)
PCK99.9
4
Showing 10 of 13 rows

Other info

Code

Follow for update