CAT-DM: Controllable Accelerated Virtual Try-on with Diffusion Model
About
Generative Adversarial Networks (GANs) dominate the research field in image-based virtual try-on, but have not resolved problems such as unnatural deformation of garments and the blurry generation quality. While the generative quality of diffusion models is impressive, achieving controllability poses a significant challenge when applying it to virtual try-on and multiple denoising iterations limit its potential for real-time applications. In this paper, we propose Controllable Accelerated virtual Try-on with Diffusion Model (CAT-DM). To enhance the controllability, a basic diffusion-based virtual try-on network is designed, which utilizes ControlNet to introduce additional control conditions and improves the feature extraction of garment images. In terms of acceleration, CAT-DM initiates a reverse denoising process with an implicit distribution generated by a pre-trained GAN-based model. Compared with previous try-on methods based on diffusion models, CAT-DM not only retains the pattern and texture details of the inshop garment but also reduces the sampling steps without compromising generation quality. Extensive experiments demonstrate the superiority of CAT-DM against both GANbased and diffusion-based methods in producing more realistic images and accurately reproducing garment patterns.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Virtual Try-On | VITON-HD (test) | SSIM87.7 | 48 | |
| Virtual Try-On | VITON-HD 1.0 (test) | FID6.1394 | 27 | |
| Virtual Try-On | DressCode 1.0 (test) | FID3.2755 | 14 | |
| Virtual Try-On | DressCode Upper (unpaired and paired) | FIDu14.772 | 13 | |
| Virtual Try-On | StreetTryOn Shop-to-Street | FID37.484 | 13 | |
| Virtual Try-On | DressCode Lower unpaired and paired | FID (Unpaired)21.99 | 13 | |
| Virtual Try-On | DressCode Dresses (unpaired and paired) | FIDu34.61 | 13 | |
| Video Virtual Try-on | ViViD (test) | SSIM0.826 | 13 | |
| Virtual Try-On | Handfit-3K (test) | FID10.6966 | 12 | |
| Virtual Try-On | WildVTON (test) | FIDu42.16 | 11 |