Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models

Wu, Ronghuan; Su, Wanchao; Liao, Jing

Computer Science > Computer Vision and Pattern Recognition

arXiv:2411.16602 (cs)

[Submitted on 25 Nov 2024]

Title:Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models

Authors:Ronghuan Wu, Wanchao Su, Jing Liao

View PDF HTML (experimental)

Abstract:Scalable Vector Graphics (SVG) has become the de facto standard for vector graphics in digital design, offering resolution independence and precise control over individual elements. Despite their advantages, creating high-quality SVG content remains challenging, as it demands technical expertise with professional editing software and a considerable time investment to craft complex shapes. Recent text-to-SVG generation methods aim to make vector graphics creation more accessible, but they still encounter limitations in shape regularity, generalization ability, and expressiveness. To address these challenges, we introduce Chat2SVG, a hybrid framework that combines the strengths of Large Language Models (LLMs) and image diffusion models for text-to-SVG generation. Our approach first uses an LLM to generate semantically meaningful SVG templates from basic geometric primitives. Guided by image diffusion models, a dual-stage optimization pipeline refines paths in latent space and adjusts point coordinates to enhance geometric complexity. Extensive experiments show that Chat2SVG outperforms existing methods in visual fidelity, path regularity, and semantic alignment. Additionally, our system enables intuitive editing through natural language instructions, making professional vector graphics creation accessible to all users.

Comments:	Project Page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
Cite as:	arXiv:2411.16602 [cs.CV]
	(or arXiv:2411.16602v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2411.16602

Submission history

From: Ronghuan Wu [view email]
[v1] Mon, 25 Nov 2024 17:31:57 UTC (1,922 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators