Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization

About

We present a simple yet effective self-supervised pre-training method for image harmonization which can leverage large-scale unannotated image datasets. To achieve this goal, we first generate pre-training data online with our Label-Efficient Masked Region Transform (LEMaRT) pipeline. Given an image, LEMaRT generates a foreground mask and then applies a set of transformations to perturb various visual attributes, e.g., defocus blur, contrast, saturation, of the region specified by the generated mask. We then pre-train image harmonization models by recovering the original image from the perturbed image. Secondly, we introduce an image harmonization model, namely SwinIH, by retrofitting the Swin Transformer [27] with a combination of local and global self-attention mechanisms. Pre-training SwinIH with LEMaRT results in a new state of the art for image harmonization, while being label-efficient, i.e., consuming less annotated data for fine-tuning than existing methods. Notably, on iHarmony4 dataset [8], SwinIH outperforms the state of the art, i.e., SCS-Co [16] by a margin of 0.4 dB when it is fine-tuned on only 50% of the training data, and by 1.0 dB when it is trained on the full training dataset.

Sheng Liu, Cong Phuoc Huynh, Cong Chen, Maxim Arap, Raffay Hamid• 2023

Related benchmarks

TaskDatasetResultRank
Image HarmonizationiHarmony4 HFlickr
MSE40.7
58
Image HarmonizationiHarmony4 (all)
MSE20.9
53
Image HarmonizationiHarmony4 Hday2night
MSE42.3
51
Image HarmonizationiHarmony4 HAdobe5k
MSE18.8
43
Image HarmonizationiHarmony4 HCOCO
MSE10.1
38
Image HarmonizationHAdobe5k iHarmony4 (test)
MSE18.8
37
Image HarmonizationiHarmony4
MSE16.8
27
Image HarmonizationiHarmony4 HCOCO 1.0 (test)
PSNR41
11
Image HarmonizationiHarmony4 HFlickr 1.0 (test)
PSNR35.3
11
Image HarmonizationiHarmony4 HD2N 1.0 (test)
PSNR38.1
11
Showing 10 of 12 rows

Other info

Follow for update