ArchCode: Incorporating Software Requirements in Code Generation with Large Language Models

Han, Hojae; Kim, Jaejin; Yoo, Jaeseok; Lee, Youngwon; Hwang, Seung-won

arXiv:2408.00994 (cs)

[Submitted on 2 Aug 2024]

Abstract: This paper aims to extend the code generation capability of large language models (LLMs) to automatically manage comprehensive software requirements from given textual descriptions. Such requirements include both functional requirements (i.e., achieving the expected behavior for inputs) and non-functional requirements (e.g., time/space performance, robustness, maintainability). However, textual descriptions can express requirements verbosely or may even omit some of them. We introduce ARCHCODE, a novel framework that leverages in-context learning to organize the requirements observed in descriptions and to extrapolate unexpressed requirements from them. ARCHCODE generates requirements from given descriptions and conditions on them to produce code snippets and test cases. Each test case is tailored to one of the requirements, so code snippets can be ranked by how well their execution results comply with the requirements. On public benchmarks, ARCHCODE better satisfies functional requirements, significantly improving Pass@k scores. Furthermore, we introduce HumanEval-NFR, the first evaluation of how well LLMs satisfy non-functional requirements in code generation, and use it to demonstrate ARCHCODE's superiority over baseline methods. The implementation of ARCHCODE and the HumanEval-NFR benchmark are both publicly accessible.
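The ranking step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: in ARCHCODE the requirements, code snippets, and test cases are generated by an LLM, whereas here they are hand-written stand-ins. All names (`compliance`, `rank_candidates`, the three candidates, and the requirement labels) are hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical types for this sketch: a candidate is a function under test,
# and each test is tagged with the single requirement it checks.
Candidate = Callable[[List[int]], List[int]]
RequirementTest = Tuple[str, Callable[[Candidate], bool]]


def compliance(candidate: Candidate, tests: List[RequirementTest]) -> Dict[str, bool]:
    """Run every requirement-specific test; a crash counts as non-compliance."""
    results: Dict[str, bool] = {}
    for requirement, check in tests:
        try:
            results[requirement] = bool(check(candidate))
        except Exception:
            results[requirement] = False
    return results


def rank_candidates(
    candidates: Dict[str, Candidate], tests: List[RequirementTest]
) -> List[Tuple[str, int]]:
    """Order candidates by the number of requirements their execution satisfies."""
    scored = [(name, sum(compliance(fn, tests).values())) for name, fn in candidates.items()]
    # sorted() is stable, so ties keep their original order.
    return sorted(scored, key=lambda item: -item[1])


# --- Hand-written stand-ins for LLM-generated code snippets (assumption) ---
def sorted_copy(xs: List[int]) -> List[int]:
    return sorted(xs)


def sorts_in_place(xs: List[int]) -> List[int]:
    xs.sort()  # mutates the caller's list
    return xs


def crashes_on_empty(xs: List[int]) -> List[int]:
    _ = xs[0]  # raises IndexError for an empty list
    return sorted(xs)


def leaves_input_untouched(fn: Candidate) -> bool:
    data = [2, 1]
    fn(data)
    return data == [2, 1]


# --- Hand-written stand-ins for requirement-tailored test cases (assumption) ---
TESTS: List[RequirementTest] = [
    ("functional: sorts ascending", lambda fn: fn([3, 1, 2]) == [1, 2, 3]),
    ("robustness: handles empty input", lambda fn: fn([]) == []),
    ("maintainability: no side effects on input", leaves_input_untouched),
]

CANDIDATES: Dict[str, Candidate] = {
    "sorted_copy": sorted_copy,
    "sorts_in_place": sorts_in_place,
    "crashes_on_empty": crashes_on_empty,
}

ranking = rank_candidates(CANDIDATES, TESTS)
print(ranking)
```

Because each test is tied to one requirement, the per-requirement results returned by `compliance` also show which requirement a lower-ranked candidate violates, not just how many tests it failed.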

Comments:	Accepted by ACL 2024 main conference
Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2408.00994 [cs.SE]
	(or arXiv:2408.00994v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2408.00994

Submission history

From: Hojae Han
[v1] Fri, 2 Aug 2024 03:54:36 UTC (316 KB)
