Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

About

YOLOv7 surpasses all known object detectors in both speed and accuracy in the range from 5 FPS to 160 FPS and has the highest accuracy 56.8% AP among all known real-time object detectors with 30 FPS or higher on GPU V100. YOLOv7-E6 object detector (56 FPS V100, 55.9% AP) outperforms both transformer-based detector SWIN-L Cascade-Mask R-CNN (9.2 FPS A100, 53.9% AP) by 509% in speed and 2% in accuracy, and convolutional-based detector ConvNeXt-XL Cascade-Mask R-CNN (8.6 FPS A100, 55.2% AP) by 551% in speed and 0.7% AP in accuracy, as well as YOLOv7 outperforms: YOLOR, YOLOX, Scaled-YOLOv4, YOLOv5, DETR, Deformable DETR, DINO-5scale-R50, ViT-Adapter-B and many other object detectors in speed and accuracy. Moreover, we train YOLOv7 only on MS COCO dataset from scratch without using any other datasets or pre-trained weights. Source code is released in https://github.com/WongKinYiu/yolov7.

Chien-Yao Wang, Alexey Bochkovskiy, Hong-Yuan Mark Liao• 2022

Related benchmarks

TaskDatasetResultRank
Object DetectionCOCO 2017 (val)
AP56.8
2454
Object DetectionMS COCO (test-dev)
mAP@.574.4
677
Object DetectionCOCO (val)--
613
Object DetectionMS-COCO 2017 (val)--
237
Object DetectionMS-COCO (val)--
138
Object DetectionM3FD dataset
mAP@0.578.1
48
Object DetectionMSCOCO (val)
AP51.2
43
Object DetectionTesla T4 GPU TensorRT 8.2
FPS (bs=1)464
36
Grasp point detectionViCoS Towel Dataset (test)
Precision44.8
26
Object DetectionDOTA v1.0--
24
Showing 10 of 28 rows

Other info

Code

Follow for update