Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multitask AET with Orthogonal Tangent Regularity for Dark Object Detection

About

Dark environment becomes a challenge for computer vision algorithms owing to insufficient photons and undesirable noise. To enhance object detection in a dark environment, we propose a novel multitask auto encoding transformation (MAET) model which is able to explore the intrinsic pattern behind illumination translation. In a self-supervision manner, the MAET learns the intrinsic visual structure by encoding and decoding the realistic illumination-degrading transformation considering the physical noise model and image signal processing (ISP). Based on this representation, we achieve the object detection task by decoding the bounding box coordinates and classes. To avoid the over-entanglement of two tasks, our MAET disentangles the object and degrading features by imposing an orthogonal tangent regularity. This forms a parametric manifold along which multitask predictions can be geometrically formulated by maximizing the orthogonality between the tangents along the outputs of respective tasks. Our framework can be implemented based on the mainstream object detection architecture and directly trained end-to-end using normal target detection datasets, such as VOC and COCO. We have achieved the state-of-the-art performance using synthetic and real-world datasets. Code is available at https://github.com/cuiziteng/MAET.

Ziteng Cui, Guo-Jun Qi, Lin Gu, Shaodi You, Zenghui Zhang, Tatsuya Harada• 2022

Related benchmarks

TaskDatasetResultRank
Object DetectionCOCO 2017 (val)
AP33
2454
Object DetectionVOC 2007 (test)
AP@5078.8
52
Object DetectionExDark (test)
mAP (Mean Average Precision)74
25
Object DetectionExDark
mAP (Mean Average Precision)77.7
17
Face DetectionUG2+ DARK FACE (test)
mAP0.558
7
Face DetectionDARK FACE (val)
mAP55.8
7
Image ClassificationCODaN
Top-1 Acc56.48
4
Showing 7 of 7 rows

Other info

Code

Follow for update