Dafny as Verification-Aware Intermediate Language for Code Generation

Li, Yue Chen; Zetzsche, Stefan; Somayyajula, Siva

Computer Science > Software Engineering

arXiv:2501.06283 (cs)

[Submitted on 10 Jan 2025]

Title:Dafny as Verification-Aware Intermediate Language for Code Generation

Authors:Yue Chen Li, Stefan Zetzsche, Siva Somayyajula

View PDF HTML (experimental)

Abstract:Using large language models (LLMs) to generate source code from natural language prompts is a popular and promising idea with a wide range of applications. One of its limitations is that the generated code can be faulty at times, often in a subtle way, despite being presented to the user as correct. In this paper, we explore ways in which formal methods can assist with increasing the quality of code generated by an LLM. Instead of emitting code in a target language directly, we propose that the user guides the LLM to first generate an opaque intermediate representation, in the verification-aware language Dafny, that can be automatically validated for correctness against agreed on specifications. The correct Dafny program is then compiled to the target language and returned to the user. All user-system interactions throughout the procedure occur via natural language; Dafny code is never exposed. We describe our current prototype and report on its performance on the HumanEval Python code generation benchmarks.

Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Logic in Computer Science (cs.LO); Programming Languages (cs.PL)
Cite as:	arXiv:2501.06283 [cs.SE]
	(or arXiv:2501.06283v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2501.06283

Submission history

From: Stefan Zetzsche [view email]
[v1] Fri, 10 Jan 2025 17:23:14 UTC (72 KB)

Computer Science > Software Engineering

Title:Dafny as Verification-Aware Intermediate Language for Code Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Dafny as Verification-Aware Intermediate Language for Code Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators