Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Rethinking pose estimation in crowds: overcoming the detection information-bottleneck and ambiguity

About

Frequent interactions between individuals are a fundamental challenge for pose estimation algorithms. Current pipelines either use an object detector together with a pose estimator (top-down approach), or localize all body parts first and then link them to predict the pose of individuals (bottom-up). Yet, when individuals closely interact, top-down methods are ill-defined due to overlapping individuals, and bottom-up methods often falsely infer connections to distant bodyparts. Thus, we propose a novel pipeline called bottom-up conditioned top-down pose estimation (BUCTD) that combines the strengths of bottom-up and top-down methods. Specifically, we propose to use a bottom-up model as the detector, which in addition to an estimated bounding box provides a pose proposal that is fed as condition to an attention-based top-down model. We demonstrate the performance and efficiency of our approach on animal and human pose estimation benchmarks. On CrowdPose and OCHuman, we outperform previous state-of-the-art models by a significant margin. We achieve 78.5 AP on CrowdPose and 48.5 AP on OCHuman, an improvement of 8.6% and 7.8% over the prior art, respectively. Furthermore, we show that our method strongly improves the performance on multi-animal benchmarks involving fish and monkeys. The code is available at https://github.com/amathislab/BUCTD

Mu Zhou, Lucas Stoffl, Mackenzie Weygandt Mathis, Alexander Mathis• 2023

Related benchmarks

TaskDatasetResultRank
Pose EstimationCOCO (val)
AP77.8
319
Multi-person Pose EstimationCrowdPose (test)
AP78.5
177
Pose EstimationOCHuman (test)
AP48.3
95
Pose EstimationOCHuman (val)
AP48.8
24
Object Keypoint DetectionOCHuman v1.0 (test)
AP48.5
14
Object Keypoint DetectionOCHuman v1.0 (val)
AP49
13
Animal Pose EstimationMarmosets (test)
AP93.7
12
Animal Pose EstimationTri-Mouse (test)
AP99.1
12
Animal Pose EstimationSchoolingFish (test)
AP88.7
12
Showing 9 of 9 rows

Other info

Code

Follow for update