Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

AIpparel: A Multimodal Foundation Model for Digital Garments

About

Apparel is essential to human life, offering protection, mirroring cultural identities, and showcasing personal style. Yet, the creation of garments remains a time-consuming process, largely due to the manual work involved in designing them. To simplify this process, we introduce AIpparel, a multimodal foundation model for generating and editing sewing patterns. Our model fine-tunes state-of-the-art large multimodal models (LMMs) on a custom-curated large-scale dataset of over 120,000 unique garments, each with multimodal annotations including text, images, and sewing patterns. Additionally, we propose a novel tokenization scheme that concisely encodes these complex sewing patterns so that LLMs can learn to predict them efficiently. AIpparel achieves state-of-the-art performance in single-modal tasks, including text-to-garment and image-to-garment prediction, and enables novel multimodal garment generation applications such as interactive garment editing. The project website is at https://georgenakayama.github.io/AIpparel/.

Kiyohiro Nakayama, Jan Ackermann, Timur Levent Kesdogan, Yang Zheng, Maria Korosteleva, Olga Sorkine-Hornung, Leonidas J. Guibas, Guandao Yang, Gordon Wetzstein• 2024

Related benchmarks

TaskDatasetResultRank
image-to-garment predictionFTAG Hit Reaction sequence (test)
CD57.6
4
image-to-garment predictionFTAG Northern Spin sequence (test)
CD257
4
image-to-garment predictionFTAG Average across sequences (test)
CD114.9
4
image-to-garment predictionFTAG Jumping Jack sequence (test)
CD98.1
4
image-to-garment predictionFTAG Joyful Jump sequence (test)
CD46.8
4
Clothing reconstruction4D-Dress Lower (test)
CD380
4
Clothing reconstruction4D-Dress Upper (test)
Chamfer Distance380
4
Sewing Pattern Panel Quality EstimationGCD (test)
IoU0.834
2
Stitching predictionGCD
F1 Score82.1
2
Showing 9 of 9 rows

Other info

Follow for update