Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Neural Clothing Tryer: Customized Virtual Try-On via Semantic Enhancement and Controlling Diffusion Model

About

This work aims to address a novel Customized Virtual Try-ON (Cu-VTON) task, enabling the superimposition of a specified garment onto a model that can be customized in terms of appearance, posture, and additional attributes. Compared with traditional VTON task, it enables users to tailor digital avatars to their individual preferences, thereby enhancing the virtual fitting experience with greater flexibility and engagement. To address this task, we introduce a Neural Clothing Tryer (NCT) framework, which exploits the advanced diffusion models equipped with semantic enhancement and controlling modules to better preserve semantic characterization and textural details of the garment and meanwhile facilitating the flexible editing of the model's postures and appearances. Specifically, NCT introduces a semantic-enhanced module to take semantic descriptions of garments and utilizes a visual-language encoder to learn aligned features across modalities. The aligned features are served as condition input to the diffusion model to enhance the preservation of the garment's semantics. Then, a semantic controlling module is designed to take the garment image, tailored posture image, and semantic description as input to maintain garment details while simultaneously editing model postures, expressions, and various attributes. Extensive experiments on the open available benchmark demonstrate the superior performance of the proposed NCT framework.

Zhijing Yang, Weiwei Zhang, Mingliang Yang, Siyuan Peng, Yukai Shi, Junpeng Tan, Tianshui Chen, Liruo Zhong• 2026

Related benchmarks

TaskDatasetResultRank
Customized Virtual Try-OnDress Code Dresses
Clothing Naturalness65.65
5
Customized Virtual Try-OnDress Code Upper Body
Clothing Naturalness62.5
5
Customized Virtual Try-OnDress Code Lower Body
Clothing Naturalness56.27
5
Virtual Try-OnDress Code Dresses (test)
CLIP Image Similarity0.766
4
Virtual Try-OnDress Code Upper body (test)
CLIP-I0.766
4
Virtual Try-OnDress Code Lower body (test)
CLIP-I0.794
4
Showing 6 of 6 rows

Other info

Follow for update