TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models

About

This paper introduces Virtual Try-Off (VTOFF), a novel task generating standardized garment images from single photos of clothed individuals. Unlike Virtual Try-On (VTON), which digitally dresses models, VTOFF extracts canonical garment images, demanding precise reconstruction of shape, texture, and complex patterns, enabling robust evaluation of generative model fidelity. We propose TryOffDiff, adapting Stable Diffusion with SigLIP-based visual conditioning to deliver high-fidelity reconstructions. Experiments on VITON-HD and Dress Code datasets show that TryOffDiff outperforms adapted pose transfer and VTON baselines. We observe that traditional metrics such as SSIM inadequately reflect reconstruction quality, prompting our use of DISTS for reliable assessment. Our findings highlight VTOFF's potential to improve e-commerce product imagery, advance generative model evaluation, and guide future research on high-fidelity reconstruction. Demo, code, and models are available at: https://rizavelioglu.github.io/tryoffdiff

Riza Velioglu, Petra Bevandic, Robin Chan, Barbara Hammer• 2024

Related benchmarks

Task	Dataset	Result
Image Virtual Try-on	VITON-HD	LPIPS39.56	41
Virtual Try-Off	VITON-HD	FID18.1	12
Virtual Try-Off	VITON-HD (test)	SSIM80.3	11
try-off	Omni-TryOn	CLIP-I86.06	10
Virtual Try-On	VITON-HD high-resolution (1024 x 768) (test)	FID22.54	8
Garment Reconstruction	VITON-HD 1024x768 (test)	LPIPS0.212	7
Virtual Try-On	VITON-HD (try-off)	CLIP-I0.9177	5
try-off	DressCode	CLIP Score0.9257	4
Virtual Try-Off	DressCode upper-body	SSIM76.6	3
Virtual Try-Off	DressCode (test)	SSIM80.8	2

Showing 10 of 10 rows

Other info

Code

Follow for update

@wizwand_team Discord