DQ-DETR: DETR with Dynamic Query for Tiny Object Detection

About

Despite previous DETR-like methods having performed successfully in generic object detection, tiny object detection is still a challenging task for them since the positional information of object queries is not customized for detecting tiny objects, whose scale is extraordinarily smaller than general objects. Also, DETR-like methods using a fixed number of queries make them unsuitable for aerial datasets, which only contain tiny objects, and the numbers of instances are imbalanced between different images. Thus, we present a simple yet effective model, named DQ-DETR, which consists of three different components: categorical counting module, counting-guided feature enhancement, and dynamic query selection to solve the above-mentioned problems. DQ-DETR uses the prediction and density maps from the categorical counting module to dynamically adjust the number of object queries and improve the positional information of queries. Our model DQ-DETR outperforms previous CNN-based and DETR-like methods, achieving state-of-the-art mAP 30.2% on the AI-TOD-V2 dataset, which mostly consists of tiny objects. Our code will be available at https://github.com/hoiliu-0801/DQ-DETR.

Yi-Xin Huang, Hou-I Liu, Hong-Han Shuai, Wen-Huang Cheng• 2024

Related benchmarks

Task	Dataset	Result
Referring Expression Comprehension	RefCOCO v1 (val)	Top-1 Accuracy88.63	49
Tiny Object Detection	AI-TOD v2 (test)	AP30.2	7
Surgical instrument counting	SurgCount-HD (test)	MAE4.24	6
Localization	SurgCount-HD (test)	L2 Distance (Mean)5.84	4
Spatial Counting	SurgCount-HD (test)	GAME-L10.68	4

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord