Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SIRR-LMM: Single-image Reflection Removal via Large Multimodal Model

About

Glass surfaces create complex interactions of reflected and transmitted light, making single-image reflection removal (SIRR) challenging. Existing datasets suffer from limited physical realism in synthetic data or insufficient scale in real captures. We introduce a synthetic dataset generation framework that path-traces 3D glass models over real background imagery to create physically accurate reflection scenarios with varied glass properties, camera settings, and post-processing effects. To leverage the capabilities of Large Multimodal Model (LMM), we concatenate the image layers into a single composite input, apply joint captioning, and fine-tune the model using task-specific LoRA rather than full-parameter training. This enables our approach to achieve improved reflection removal and separation performance compared to state-of-the-art methods.

Yu Guo, Zhiqiang Lao, Xiyun Song, Yubin Zhou, Heather Yu• 2026

Related benchmarks

TaskDatasetResultRank
Single Image Reflection RemovalReal 20 55 (test)
PSNR24.89
7
Single Image Reflection RemovalSIR^2 141 36 (test)
PSNR (Postcard)26.2
7
Single Image Reflection RemovalNature 27 (test)
PSNR25.14
7
Single Image Reflection RemovalReaL
Win Rate57.32
4
Single Image Reflection RemovalNature
Win Rate35
4
Single Image Reflection RemovalPostcard
Win Rate67.03
4
Single Image Reflection RemovalSolidObject
Win Rate41.43
4
Single Image Reflection RemovalWildscene
Win Rate48.57
4
Showing 8 of 8 rows

Other info

Follow for update