Single Stage Virtual Try-on via Deformable Attention Flows
About
Virtual try-on aims to generate a photo-realistic fitting result given an in-shop garment and a reference person image. Existing methods usually build up multi-stage frameworks to deal with clothes warping and body blending respectively, or rely heavily on intermediate parser-based labels which may be noisy or even inaccurate. To solve the above challenges, we propose a single-stage try-on framework by developing a novel Deformable Attention Flow (DAFlow), which applies the deformable attention scheme to multi-flow estimation. With pose keypoints as the guidance only, the self- and cross-deformable attention flows are estimated for the reference person and the garment images, respectively. By sampling multiple flow fields, the feature-level and pixel-level information from different semantic areas are simultaneously extracted and merged through the attention mechanism. It enables clothes warping and body synthesizing at the same time which leads to photo-realistic results in an end-to-end manner. Extensive experiments on two try-on datasets demonstrate that our proposed method achieves state-of-the-art performance both qualitatively and quantitatively. Furthermore, additional experiments on the other two image editing tasks illustrate the versatility of our method for multi-view synthesis and image animation.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Virtual Try-On | VITON (test) | SSIM0.85 | 14 | |
| Virtual Try-On | VITON-HD unpaired 1.0 (test) | FID33.511 | 14 | |
| Virtual Try-On | VITON-HD high-resolution (test) | SSIM0.8821 | 13 | |
| Virtual Try-On | DressCode Dresses | SSIM0.8776 | 9 | |
| Virtual Try-On | Street2Street (test) | FID42.432 | 9 | |
| Virtual Try-On | Model2Street (test) | FID44.537 | 9 | |
| Virtual Try-On | Model2Model (test) | FID14.316 | 9 | |
| Virtual Try-On | Shop2Model VITON-HD (test) | FID9.4 | 8 | |
| Virtual Try-On | Shop2Street (test) | FID62.735 | 7 | |
| Virtual Try-On | VITON unpaired | FID9.46 | 6 |