Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Focal Loss for Dense Object Detection

About

The highest accuracy object detectors to date are based on a two-stage approach popularized by R-CNN, where a classifier is applied to a sparse set of candidate object locations. In contrast, one-stage detectors that are applied over a regular, dense sampling of possible object locations have the potential to be faster and simpler, but have trailed the accuracy of two-stage detectors thus far. In this paper, we investigate why this is the case. We discover that the extreme foreground-background class imbalance encountered during training of dense detectors is the central cause. We propose to address this class imbalance by reshaping the standard cross entropy loss such that it down-weights the loss assigned to well-classified examples. Our novel Focal Loss focuses training on a sparse set of hard examples and prevents the vast number of easy negatives from overwhelming the detector during training. To evaluate the effectiveness of our loss, we design and train a simple dense detector we call RetinaNet. Our results show that when trained with the focal loss, RetinaNet is able to match the speed of previous one-stage detectors while surpassing the accuracy of all existing state-of-the-art two-stage detectors. Code is at: https://github.com/facebookresearch/Detectron.

Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, Piotr Doll\'ar• 2017

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100 (test)--
3518
Image ClassificationCIFAR-10 (test)
Accuracy98.49
3381
Object DetectionCOCO 2017 (val)
AP43.8
2643
Semantic segmentationPASCAL VOC 2012 (val)
Mean IoU69.99
2142
Object DetectionCOCO (test-dev)
mAP40.8
1239
Instance SegmentationCOCO 2017 (val)
APm0.428
1201
Image ClassificationCIFAR-10 (test)
Accuracy93.03
906
Image ClassificationMNIST (test)
Accuracy99.3
894
Object DetectionPASCAL VOC 2007 (test)
mAP77.3
844
Image ClassificationCIFAR-100 (val)
Accuracy59
776
Showing 10 of 620 rows
...

Other info

Code

Follow for update