Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

EfficientHRNet: Efficient Scaling for Lightweight High-Resolution Multi-Person Pose Estimation

About

There is an increasing demand for lightweight multi-person pose estimation for many emerging smart IoT applications. However, the existing algorithms tend to have large model sizes and intense computational requirements, making them ill-suited for real-time applications and deployment on resource-constrained hardware. Lightweight and real-time approaches are exceedingly rare and come at the cost of inferior accuracy. In this paper, we present EfficientHRNet, a family of lightweight multi-person human pose estimators that are able to perform in real-time on resource-constrained devices. By unifying recent advances in model scaling with high-resolution feature representations, EfficientHRNet creates highly accurate models while reducing computation enough to achieve real-time performance. The largest model is able to come within 4.4% accuracy of the current state-of-the-art, while having 1/3 the model size and 1/6 the computation, achieving 23 FPS on Nvidia Jetson Xavier. Compared to the top real-time approach, EfficientHRNet increases accuracy by 22% while achieving similar FPS with 1/3 the power. At every level, EfficientHRNet proves to be more computationally efficient than other bottom-up 2D human pose estimation approaches, while achieving highly competitive accuracy.

Christopher Neff, Aneri Sheth, Steven Furgurson, Hamed Tabkhi• 2020

Related benchmarks

TaskDatasetResultRank
2D Human Pose EstimationCOCO 2017 (val)
AP52.9
386
Multi-person Pose EstimationCrowdPose (test)
AP56.3
177
Multi-person Pose EstimationCOCO 2017 (test-dev)
AP52.8
99
Showing 3 of 3 rows

Other info

Follow for update