Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ChatGarment: Garment Estimation, Generation and Editing via Large Language Models

About

We introduce ChatGarment, a novel approach that leverages large vision-language models (VLMs) to automate the estimation, generation, and editing of 3D garments from images or text descriptions. Unlike previous methods that struggle in real-world scenarios or lack interactive editing capabilities, ChatGarment can estimate sewing patterns from in-the-wild images or sketches, generate them from text descriptions, and edit garments based on user instructions, all within an interactive dialogue. These sewing patterns can then be draped on a 3D body and animated. This is achieved by finetuning a VLM to directly generate a JSON file that includes both textual descriptions of garment types and styles, as well as continuous numerical attributes. This JSON file is then used to create sewing patterns through a programming parametric model. To support this, we refine the existing programming model, GarmentCode, by expanding its garment type coverage and simplifying its structure for efficient VLM fine-tuning. Additionally, we construct a large-scale dataset of image-to-sewing-pattern and text-to-sewing-pattern pairs through an automated data pipeline. Extensive evaluations demonstrate ChatGarment's ability to accurately reconstruct, generate, and edit garments from multimodal inputs, highlighting its potential to simplify workflows in fashion and gaming applications. Code and data are available at https://chatgarment.github.io/ .

Siyuan Bian, Chenghao Xu, Yuliang Xiu, Artur Grigorev, Zhen Liu, Cewu Lu, Michael J. Black, Yao Feng• 2024

Related benchmarks

TaskDatasetResultRank
Sewing pattern estimationCloSE single layer
Chamfer Distance2.94
7
Sewing Pattern ReconstructionDress4D single layer
CD3.12
7
Human evaluation of garment reconstructionASOS 5K Multi-layer
Mean Score2.327
5
Human evaluation of garment reconstructionASOS_5K Single-layer
Mean Score2.556
5
Clothing reconstruction4D-Dress Lower (test)
CD28.9
4
Clothing reconstruction4D-Dress Upper (test)
Chamfer Distance28.1
4
image-to-garment predictionFTAG Jumping Jack sequence (test)
CD88.9
4
image-to-garment predictionFTAG Joyful Jump sequence (test)
CD7.7
4
image-to-garment predictionFTAG Hit Reaction sequence (test)
CD109
4
image-to-garment predictionFTAG Average across sequences (test)
CD182.7
4
Showing 10 of 13 rows

Other info

Follow for update