Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

The Spatially-Correlative Loss for Various Image Translation Tasks

About

We propose a novel spatially-correlative loss that is simple, efficient and yet effective for preserving scene structure consistency while supporting large appearance changes during unpaired image-to-image (I2I) translation. Previous methods attempt this by using pixel-level cycle-consistency or feature-level matching losses, but the domain-specific nature of these losses hinder translation across large domain gaps. To address this, we exploit the spatial patterns of self-similarity as a means of defining scene structure. Our spatially-correlative loss is geared towards only capturing spatial relationships within an image rather than domain appearance. We also introduce a new self-supervised learning method to explicitly learn spatially-correlative maps for each specific translation task. We show distinct improvement over baseline models in all three modes of unpaired I2I translation: single-modal, multi-modal, and even single-image translation. This new loss can easily be integrated into existing network architectures and thus allows wide applicability.

Chuanxia Zheng, Tat-Jen Cham, Jianfei Cai• 2021

Related benchmarks

TaskDatasetResultRank
Image-to-Image TranslationHorse -> Zebra
FID43.4
23
Object DetectionFoggy Cityscapes to Cityscapes (test)
AP (person)40.9
21
Unpaired Image-to-Image TranslationCat → Dog v1 (test)
FID72.8
14
Unpaired Image-to-Image TranslationCityscapes
Pixel Accuracy73.2
8
Unpaired Image-to-Image TranslationHorse-to-Zebra
FID38
8
Artistic Style TransferWikiArt Cezanne
FID141.3
8
Artistic Style TransferGeneral Content Images
Inference Time (s)0.0365
8
Artistic Style TransferWikiArt Van Gogh
FID105.2
8
Artistic Style TransferWikiArt Ukiyoe
FID130.8
8
Artistic Style TransferWikiArt Gauguin
FID172.2
8
Showing 10 of 18 rows

Other info

Code

Follow for update