Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Microsoft COCO: Common Objects in Context

About

We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding. This is achieved by gathering images of complex everyday scenes containing common objects in their natural context. Objects are labeled using per-instance segmentations to aid in precise object localization. Our dataset contains photos of 91 objects types that would be easily recognizable by a 4 year old. With a total of 2.5 million labeled instances in 328k images, the creation of our dataset drew upon extensive crowd worker involvement via novel user interfaces for category detection, instance spotting and instance segmentation. We present a detailed statistical analysis of the dataset in comparison to PASCAL, ImageNet, and SUN. Finally, we provide baseline performance analysis for bounding box and segmentation detection results using a Deformable Parts Model.

Tsung-Yi Lin, Michael Maire, Serge Belongie, Lubomir Bourdev, Ross Girshick, James Hays, Pietro Perona, Deva Ramanan, C. Lawrence Zitnick, Piotr Doll\'ar• 2014

Related benchmarks

TaskDatasetResultRank
Object DetectionCOCO 2017 (val)--
2643
Instance SegmentationCOCO 2017 (val)--
1201
Image-to-Text RetrievalFlickr30K 1K (test)
R@192.5
491
Text-to-Image RetrievalFlickr30K 1K (test)
R@180.4
432
Natural Language Visual ReasoningNLVR2 (test-p)
Accuracy87
346
Image-to-Text RetrievalMS-COCO 5K (test)
R@166.3
320
Text-to-Image RetrievalMSCOCO 5K (test)
R@151.2
308
Natural Language Visual ReasoningNLVR2 (dev)
Accuracy86.1
307
3D Human Mesh Recovery3DPW (test)
MPJPE121.2
299
Instance SegmentationCityscapes (val)--
239
Showing 10 of 41 rows

Other info

Follow for update