SVGDreamer: Text Guided SVG Generation with Diffusion Model
About
Text-guided scalable vector graphics (SVG) synthesis has broad applications in icon and sketch generation. However, existing text-to-SVG methods often suffer from limited editability, suboptimal visual quality, and low sample diversity. To address these challenges, we propose \textbf{SVGDreamer}, a novel framework for text-guided vector graphics synthesis. Our method introduces a \textbf{semantic-driven image vectorization (SIVE)} process, which decomposes the generation procedure into foreground objects and background elements, thereby improving structural controllability and editability. In particular, SIVE incorporates attention-based primitive control and an attention-mask loss to facilitate fine-grained manipulation of individual vector elements. To further improve generation quality and diversity, we propose \textbf{Vectorized Particle-based Score Distillation (VPSD)}, which models SVGs as distributions over control points and colors. Compared with existing text-to-SVG optimization methods, VPSD alleviates over-smoothed shapes, over-saturated colors, limited diversity, and slow convergence. Moreover, VPSD leverages a reward model to reweight vector particles, leading to better visual aesthetics and faster convergence. Extensive experiments demonstrate that SVGDreamer consistently outperforms existing baselines in editability, visual quality, and diversity. Project page: https://ximinng.github.io/SVGDreamer-project/
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Text-to-SVG Generation | Text-to-SVG Benchmark | FID72.68 | 16 | |
| Text-to-SVG | SArena Icon | FID26.612 | 15 | |
| Text-to-SVG Generation | SVGX-DwT-10k 1.0 (test) | Validation Accuracy100 | 14 | |
| Text-to-SVG | SVGX-Dataset | FID70.1 | 14 | |
| Text-to-SVG | SVG-Hub-5M (test) | FID56.743 | 13 | |
| Image-guided portrait sketch generation | CelebA-HQ 1024x1024 (randomly sampled 50 images) | LPIPS0.555 | 9 | |
| Text-to-SVG | MMSVG-Icon | FID308.9 | 9 | |
| Text-to-SVG | MMSVG Illustration | FID419.7 | 9 | |
| Text-to-SVG | Text-to-SVG Evaluation Prompts (test) | FID59.13 | 7 | |
| SVG Generation | Reason-SVG Human Evaluation Set | Semantic Accuracy3.6 | 7 |