Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Row-Column Separated Attention Based Low-Light Image/Video Enhancement

About

U-Net structure is widely used for low-light image/video enhancement. The enhanced images result in areas with large local noise and loss of more details without proper guidance for global information. Attention mechanisms can better focus on and use global information. However, attention to images could significantly increase the number of parameters and computations. We propose a Row-Column Separated Attention module (RCSA) inserted after an improved U-Net. The RCSA module's input is the mean and maximum of the row and column of the feature map, which utilizes global information to guide local information with fewer parameters. We propose two temporal loss functions to apply the method to low-light video enhancement and maintain temporal consistency. Extensive experiments on the LOL, MIT Adobe FiveK image, and SDSD video datasets demonstrate the effectiveness of our approach. The code is publicly available at https://github.com/cq-dong/URCSA.

Chengqi Dong, Zhiyuan Cao, Tuoshi Qi, Kexin Wu, Yixing Gao, Fan Tang• 2026

Related benchmarks

TaskDatasetResultRank
Low-light Image EnhancementLOL
PSNR24.77
122
Low-light Video EnhancementSDSD
PSNR27.01
9
Low-light Image EnhancementLSRW (test)
PSNR17.97
6
Showing 3 of 3 rows

Other info

Follow for update