Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Training-free Clothing Region of Interest Self-correction for Virtual Try-On

About

VTON (Virtual Try-ON) aims at synthesizing the target clothing on a certain person, preserving the details of the target clothing while keeping the rest of the person unchanged. Existing methods suffer from the discrepancies between the generated clothing results and the target ones, in terms of the patterns, textures and boundaries. Therefore, we propose to use an energy function to impose constraints on the attention map extracted through the generation process. Thus, at each generation step, the attention can be more focused on the clothing region of interest, thereby influencing the generation results to be more consistent with the target clothing details. Furthermore, to address the limitation that existing evaluation metrics concentrate solely on image realism and overlook the alignment with target elements, we design a new metric, Virtual Try-on Inception Distance (VTID), to bridge this gap and ensure a more comprehensive assessment. On the VITON-HD and DressCode datasets, our approach has outperformed the previous state-of-the-art (SOTA) methods by 1.4%, 2.3%, 12.3%, and 5.8% in the traditional metrics of LPIPS, FID, KID, and the new VTID metrics, respectively. Additionally, by applying the generated data to downstream Clothing-Change Re-identification (CC-Reid) methods, we have achieved performance improvements of 2.5%, 1.1%, and 1.6% on the LTCC, PRCC, VC-Clothes datasets in the metrics of Rank-1. The code of our method is public at https://github.com/MrWhiteSmall/CSC-VTON.git.

Shengjie Lu, Zhibin Wan, Jiejie Liu, Quan Zhang, Mingjie Sun• 2025

Related benchmarks

TaskDatasetResultRank
Person Re-IdentificationLTCC Clothing Change (test)
Top-1 Acc48.4
21
Person Re-IdentificationPRCC Clothes-Changing (CC) (test)
R-1 Accuracy62.7
20
Clothing-Change Person Re-identificationVC-Clothes (test)
Rank-190.8
8
Virtual Try-OnVITON-HD paired
LPIPS0.07
7
Virtual Try-OnVITON-HD unpaired
LPIPS0.086
7
Virtual Try-OnDressCode Upper-body (paired) 1.0 (test)
LPIPS0.076
5
Virtual Try-OnDressCode Lower-body (paired) 1.0 (test)
LPIPS0.082
5
Virtual Try-OnDressCode Dresses (paired) 1.0 (test)
LPIPS0.085
5
Virtual Try-OnDressCode Upper-body (unpaired)
LPIPS0.095
5
Virtual Try-OnDressCode Lower-body (unpaired)
LPIPS0.097
5
Showing 10 of 11 rows

Other info

Follow for update