Automatically Generating UI Code from Screenshot: A Divide-and-Conquer-Based Approach

Wan, Yuxuan; Wang, Chaozheng; Dong, Yi; Wang, Wenxuan; Li, Shuqing; Huo, Yintong; Lyu, Michael R.

arXiv:2406.16386 (cs)

[Submitted on 24 Jun 2024 (v1), last revised 25 Oct 2024 (this version, v2)]

Abstract: Websites are critical in today's digital world, with over 1.11 billion currently active and approximately 252,000 new sites launched daily. Converting a website layout design into functional UI code is a time-consuming yet indispensable step of website development, and doing so manually presents significant challenges, especially for non-experts. To explore automatic design-to-code solutions, we first conduct a motivating study on GPT-4o and identify three types of issues in generating UI code: element omission, element distortion, and element misarrangement. We further reveal that focusing on smaller visual segments can help multimodal large language models (MLLMs) mitigate these failures in the generation process. In this paper, we propose DCGen, a divide-and-conquer-based approach to automate the translation of webpage designs into UI code. DCGen first divides a screenshot into manageable segments, then generates a description for each segment, and finally reassembles the descriptions into complete UI code for the entire screenshot. We conduct extensive testing on a dataset of real-world websites with various MLLMs and demonstrate that DCGen achieves up to a 14% improvement in visual similarity over competing methods. To the best of our knowledge, DCGen is the first segment-aware prompt-based approach for generating UI code directly from screenshots.
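The divide, describe, and reassemble flow summarized in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the screenshot is modeled as a small grid of integers, the segmentation rule (cutting along fully blank rows or columns) is an assumption made for illustration, and `describe` is a hypothetical stand-in for the MLLM call that would produce code for one segment.

```python
from typing import Callable, List

# Toy stand-in for a screenshot: 0 = background, non-zero = content.
Image = List[List[int]]


def split_rows(img: Image) -> List[Image]:
    """Split an image into horizontal bands separated by all-background rows."""
    bands: List[Image] = []
    current: Image = []
    for row in img:
        if any(row):
            current.append(row)
        elif current:
            bands.append(current)
            current = []
    if current:
        bands.append(current)
    return bands


def transpose(img: Image) -> Image:
    """Swap rows and columns so column splits can reuse split_rows."""
    return [list(col) for col in zip(*img)]


def generate(img: Image, describe: Callable[[Image], str],
             horizontal: bool = True, depth: int = 0, max_depth: int = 2) -> str:
    """Divide the image, handle each leaf segment, and combine the results."""
    if horizontal:
        parts = split_rows(img)
    else:
        parts = [transpose(p) for p in split_rows(transpose(img))]
    # Leaf: nothing left to split, or the recursion budget is used up.
    if depth >= max_depth or len(parts) <= 1:
        return describe(img)
    # Alternate the split direction at each level of the recursion.
    children = [generate(p, describe, not horizontal, depth + 1, max_depth)
                for p in parts]
    direction = "column" if horizontal else "row"
    return (f'<div style="display:flex;flex-direction:{direction}">'
            + "".join(children) + "</div>")


def fake_describe(segment: Image) -> str:
    """Hypothetical placeholder for an MLLM call; reports the segment size."""
    return f"<section>{len(segment)}x{len(segment[0])}</section>"


# A header band, a blank separator row, then two side-by-side blocks.
page = [
    [1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0],
    [2, 2, 0, 3, 3],
    [2, 2, 0, 3, 3],
]
html_out = generate(page, fake_describe)
print(html_out)
```

In this sketch the recursion alternates between horizontal and vertical cuts, so the nesting of the generated containers mirrors the nesting of the segments. A real pipeline would replace `fake_describe` with a model call and would need a segmentation rule robust to real screenshots.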

Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2406.16386 [cs.SE]
	(or arXiv:2406.16386v2 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2406.16386

Submission history

From: Yuxuan Wan
[v1] Mon, 24 Jun 2024 07:58:36 UTC (11,778 KB)
[v2] Fri, 25 Oct 2024 11:22:53 UTC (23,233 KB)
