Pose estimator and tracker using temporal flow maps for limbs
About
For human pose estimation in videos, it is significant how to use temporal information between frames. In this paper, we propose temporal flow maps for limbs (TML) and a multi-stride method to estimate and track human poses. The proposed temporal flow maps are unit vectors describing the limbs' movements. We constructed a network to learn both spatial information and temporal information end-to-end. Spatial information such as joint heatmaps and part affinity fields is regressed in the spatial network part, and the TML is regressed in the temporal network part. We also propose a data augmentation method to learn various types of TML better. The proposed multi-stride method expands the data by randomly selecting two frames within a defined range. We demonstrate that the proposed method efficiently estimates and tracks human poses on the PoseTrack 2017 and 2018 datasets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Human Pose Estimation | PoseTrack 2018 (val) | Total Score74.6 | 78 | |
| Human Pose Estimation | PoseTrack 2017 (val) | Total Accuracy71.5 | 54 | |
| Multi-person Pose Estimation | PoseTrack 2017 (val) | mAP (Total)71.5 | 39 | |
| Multi-person Pose Estimation | PoseTrack 2017 (test) | Total mAP67.8 | 39 | |
| Multi-person pose tracking | PoseTrack 2017 (val) | mAP61.3 | 30 | |
| Multi-person pose tracking | PoseTrack 2018 (val) | mAP74.6 | 25 | |
| Multi-person Pose Estimation | PoseTrack 2018 (test) | Ankle Joint Accuracy56.9 | 19 | |
| Pose Estimation and Tracking | PoseTrack 2017 (test) | MOTA54.5 | 13 | |
| Pose Tracking | PoseTrack 2018 (test) | MOTA54.9 | 11 | |
| Multi-person pose tracking | PoseTrack 2017 (test) | MOTA54.5 | 8 |