Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MGT: Extending Virtual Try-Off to Multi-Garment Scenarios

About

Computer vision is transforming fashion industry through Virtual Try-On (VTON) and Virtual Try-Off (VTOFF). VTON generates images of a person in a specified garment using a target photo and a standardized garment image, while a more challenging variant, Person-to-Person Virtual Try-On (p2p-VTON), uses a photo of another person wearing the garment. VTOFF, in contrast, extracts standardized garment images from photos of clothed individuals. We introduce Multi-Garment TryOffDiff (MGT), a diffusion-based VTOFF model capable of handling diverse garment types, including upper-body, lower-body, and dresses. MGT builds on a latent diffusion architecture with SigLIP-based image conditioning to capture garment characteristics such as shape, texture, and pattern. To address garment diversity, MGT incorporates class-specific embeddings, achieving state-of-the-art VTOFF results on VITON-HD and competitive performance on DressCode. When paired with VTON models, it further enhances p2p-VTON by reducing unwanted attribute transfer, such as skin tone, ensuring preservation of person-specific characteristics. Demo, code, and models are available at: https://rizavelioglu.github.io/tryoffdiff/

Riza Velioglu, Petra Bevandic, Robin Chan, Barbara Hammer• 2025

Related benchmarks

TaskDatasetResultRank
Image Virtual Try-onVITON-HD
LPIPS36.3
41
Virtual Try-OffVITON-HD
FID21.9
12
try-offOmni-TryOn
CLIP-I87.22
10
Virtual Try-OnVITON-HD high-resolution (1024 x 768) (test)
FID24
8
Virtual Try-OnDress Code All (test)
SSIM77.77
6
Virtual Try-OnVITON-HD (try-off)
CLIP-I0.9015
5
try-offDressCode
CLIP Score0.9259
4
Virtual Try-OffVITON-HD and Dress Code (test)
Competitor Wins16.38
4
Virtual Try-OffDressCode Lower-Body (test)
FID22.15
3
Virtual Try-OffDressCode Dresses (test)
FID20.35
3
Showing 10 of 11 rows

Other info

Follow for update