Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LVIS: A Dataset for Large Vocabulary Instance Segmentation

About

Progress on object detection is enabled by datasets that focus the research community's attention on open challenges. This process led us from simple images to complex scenes and from bounding boxes to segmentation masks. In this work, we introduce LVIS (pronounced `el-vis'): a new dataset for Large Vocabulary Instance Segmentation. We plan to collect ~2 million high-quality instance segmentation masks for over 1000 entry-level object categories in 164k images. Due to the Zipfian distribution of categories in natural images, LVIS naturally has a long tail of categories with few training samples. Given that state-of-the-art deep learning methods for object detection perform poorly in the low-sample regime, we believe that our dataset poses an important and exciting new scientific challenge. LVIS is available at http://www.lvisdataset.org.

Agrim Gupta, Piotr Doll\'ar, Ross Girshick• 2019

Related benchmarks

TaskDatasetResultRank
Object DetectionLVIS v1.0 (val)
APbbox27.3
518
Instance SegmentationLVIS v1.0 (val)
AP (Rare)17.5
189
Object DetectionCOCO (minival)--
184
Object DetectionCOCO standard (5% labeled)
mAP17.71
70
Instance SegmentationLVIS
mAP (Mask)23.1
68
Object DetectionLVIS v0.5 (val)
APr14.8
61
Object DetectionLVIS
APr11.6
59
Instance SegmentationLVIS 0.5 (val)
APr14.6
58
Object DetectionCOCO standard (1%)
mAP9.25
44
Object DetectionCOCO standard (2%)
mAP12.7
42
Showing 10 of 17 rows

Other info

Follow for update