Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SVGen: Interpretable Vector Graphics Generation with Large Language Models

About

Scalable Vector Graphics (SVG) is widely used in front-end development and UI/UX design due to its scalability, editability, and rendering efficiency. However, turning creative ideas into precise vector graphics remains a time-consuming challenge. To address this, we introduce SVG-1M, a large-scale dataset of high-quality SVGs paired with natural language descriptions. Through advanced data augmentation and annotation, we create well-aligned Text to SVG training pairs, including a subset with Chain of Thought annotations for enhanced semantic guidance. Based on this dataset, we propose SVGen, an end-to-end model that generates SVG code from natural language inputs. Our approach ensures semantic accuracy and structural completeness, supported by curriculum learning and reinforcement learning optimization. Experiments show that SVGen outperforms general large models and traditional rendering methods in both effectiveness and efficiency. Code, model, and dataset are available on GitHub.

Feiyu Wang, Zhiyuan Zhao, Yuandong Liu, Da Zhang, Junyu Gao, Hao Sun, Xuelong Li• 2025

Related benchmarks

TaskDatasetResultRank
Text-to-SVGText-to-SVG (eval)
Avg. Token Length1.53e+3
13
Text-to-SVG GenerationText2SVG (test)
CLIP Score0.223
10
Text-to-SVGMMSVG-Icon
FID129.2
9
Text-to-SVGMMSVG Illustration
FID139.5
9
Text-to-SVG GenerationMMSVG-Bench
CLIP-T2I0.2235
4
Showing 5 of 5 rows

Other info

Follow for update