Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation

About

Cross-view image translation is challenging because it involves images with drastically different views and severe deformation. In this paper, we propose a novel approach named Multi-Channel Attention SelectionGAN (SelectionGAN) that makes it possible to generate images of natural scenes in arbitrary viewpoints, based on an image of the scene and a novel semantic map. The proposed SelectionGAN explicitly utilizes the semantic information and consists of two stages. In the first stage, the condition image and the target semantic map are fed into a cycled semantic-guided generation network to produce initial coarse results. In the second stage, we refine the initial results by using a multi-channel attention selection mechanism. Moreover, uncertainty maps automatically learned from attentions are used to guide the pixel loss for better network optimization. Extensive experiments on Dayton, CVUSA and Ego2Top datasets show that our model is able to generate significantly better results than the state-of-the-art methods. The source code, data and trained models are available at https://github.com/Ha0Tang/SelectionGAN.

Hao Tang, Dan Xu, Nicu Sebe, Yanzhi Wang, Jason J. Corso, Yan Yan• 2019

Related benchmarks

TaskDatasetResultRank
Image-to-Image TranslationCelebA-HQ
FID34.67
28
Aerial-to-Ground Image SynthesisCVUSA
Top-1 Acc0.6551
23
Aerial-to-Ground Image SynthesisDayton 256x256
Top-1 Accuracy0.6812
22
Aerial-to-Ground Image SynthesisDayton 64x64
Top-1 Accuracy0.79
18
Ground-to-Aerial Image SynthesisDayton 64x64
Top-1 Accuracy0.5181
18
Ground-to-Aerial Image SynthesisDayton 256x256
Top-1 Accuracy33.7
14
Aerial-to-Ground Image TranslationCVUSA (test)
Top-1 Accuracy41.52
10
Cross-View Image TranslationDayton 64x64 (test)
SSIM0.6865
9
Cross-view Image Translation (aerial-to-ground)Dayton (test)
Top-1 Accuracy42.11
9
Image-to-Image TranslationADE20K (train val)
FID35.1
9
Showing 10 of 28 rows

Other info

Code

Follow for update