Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Objects as Points

About

Detection identifies objects as axis-aligned boxes in an image. Most successful object detectors enumerate a nearly exhaustive list of potential object locations and classify each. This is wasteful, inefficient, and requires additional post-processing. In this paper, we take a different approach. We model an object as a single point --- the center point of its bounding box. Our detector uses keypoint estimation to find center points and regresses to all other object properties, such as size, 3D location, orientation, and even pose. Our center point based approach, CenterNet, is end-to-end differentiable, simpler, faster, and more accurate than corresponding bounding box based detectors. CenterNet achieves the best speed-accuracy trade-off on the MS COCO dataset, with 28.1% AP at 142 FPS, 37.4% AP at 52 FPS, and 45.1% AP with multi-scale testing at 1.4 FPS. We use the same approach to estimate 3D bounding box in the KITTI benchmark and human pose on the COCO keypoint dataset. Our method performs competitively with sophisticated multi-stage methods and runs in real-time.

Xingyi Zhou, Dequan Wang, Philipp Kr\"ahenb\"uhl• 2019

Related benchmarks

TaskDatasetResultRank
Object DetectionCOCO 2017 (val)
AP40.2
2643
Object DetectionCOCO (test-dev)
mAP47
1239
3D Object DetectionnuScenes (val)
NDS32.8
981
3D Object DetectionnuScenes (test)
mAP33.8
874
Object DetectionMS COCO (test-dev)
mAP@.563.9
677
Object DetectionCOCO (val)
mAP37.6
633
Object DetectionCOCO v2017 (test-dev)
mAP42.1
499
Human Pose EstimationCOCO (test-dev)
AP63
432
2D Human Pose EstimationCOCO 2017 (val)
AP64
386
Pose EstimationCOCO (val)
AP64
319
Showing 10 of 79 rows
...

Other info

Code

Follow for update