AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time
About
Accurate whole-body multi-person pose estimation and tracking is an important yet challenging topic in computer vision. To capture the subtle actions of humans for complex behavior analysis, whole-body pose estimation including the face, body, hand and foot is essential over conventional body-only pose estimation. In this paper, we present AlphaPose, a system that can perform accurate whole-body pose estimation and tracking jointly while running in realtime. To this end, we propose several new techniques: Symmetric Integral Keypoint Regression (SIKR) for fast and fine localization, Parametric Pose Non-Maximum-Suppression (P-NMS) for eliminating redundant human detections and Pose Aware Identity Embedding for jointly pose estimation and tracking. During training, we resort to Part-Guided Proposal Generator (PGPG) and multi-domain knowledge distillation to further improve the accuracy. Our method is able to localize whole-body keypoints accurately and tracks humans simultaneously given inaccurate bounding boxes and redundant detections. We show a significant improvement over current state-of-the-art methods in both speed and accuracy on COCO-wholebody, COCO, PoseTrack, and our proposed Halpe-FullBody pose estimation dataset. Our model, source codes and dataset are made publicly available at https://github.com/MVIG-SJTU/AlphaPose.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 2D Human Pose Estimation | COCO 2017 (val) | AP72.6 | 386 | |
| Action Recognition | Diving-48 (test) | Top-1 Acc70.66 | 81 | |
| Whole-body Pose Estimation | COCO-Wholebody 1.0 (val) | Body AP70.6 | 64 | |
| Keypoint Detection | COCO (val) | AP73.3 | 60 | |
| Gait Recognition | CASIA-B (test) | Rank-1 Accuracy (NM)96.82 | 44 | |
| Whole-body Pose Estimation | COCO-WholeBody 1.0 | Whole-body AP59.2 | 20 |