SimCC: a Simple Coordinate Classification Perspective for Human Pose Estimation

About

The 2D heatmap-based approaches have dominated Human Pose Estimation (HPE) for years due to high performance. However, the long-standing quantization error problem in the 2D heatmap-based methods leads to several well-known drawbacks: 1) The performance for the low-resolution inputs is limited; 2) To improve the feature map resolution for higher localization precision, multiple costly upsampling layers are required; 3) Extra post-processing is adopted to reduce the quantization error. To address these issues, we aim to explore a brand new scheme, called \textit{SimCC}, which reformulates HPE as two classification tasks for horizontal and vertical coordinates. The proposed SimCC uniformly divides each pixel into several bins, thus achieving \emph{sub-pixel} localization precision and low quantization error. Benefiting from that, SimCC can omit additional refinement post-processing and exclude upsampling layers under certain settings, resulting in a more simple and effective pipeline for HPE. Extensive experiments conducted over COCO, CrowdPose, and MPII datasets show that SimCC outperforms heatmap-based counterparts, especially in low-resolution settings by a large margin.

Yanjie Li, Sen Yang, Peidong Liu, Shoukui Zhang, Yunxiao Wang, Zhicheng Wang, Wankou Yang, Shu-Tao Xia• 2021

Related benchmarks

Task	Dataset	Result
Pose Estimation	COCO 2017 (val)	AP76.1	71
2D Human Pose Estimation	MPII (val)	--	61
Keypoint Detection	COCO (val)	AP76.5	60
Bleeding Region Detection	SurgBlood (test)	IoU51.02	14
Bleeding Region Detection	HemoSet	IoU51.62	14
Bleeding Point Detection	SurgBlood (test)	PCK @ 2%3.62	14
Pose Estimation	Cattle Mounting Dataset (test)	AP87.4	10
Keypoint Localization	Human-Art (cross-dataset)	AP51.7	6
Keypoint Localization	MPII (cross-dataset)	Acc (Shoulder)92.2	6

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord