The Impact of Prompt Programming on Function-Level Code Generation

Khojah, Ranim; Neto, Francisco Gomes de Oliveira; Mohamad, Mazen; Leitner, Philipp

Computer Science > Software Engineering

arXiv:2412.20545 (cs)

[Submitted on 29 Dec 2024]

Title:The Impact of Prompt Programming on Function-Level Code Generation

Authors:Ranim Khojah, Francisco Gomes de Oliveira Neto, Mazen Mohamad, Philipp Leitner

View PDF HTML (experimental)

Abstract:Large Language Models (LLMs) are increasingly used by software engineers for code generation. However, limitations of LLMs such as irrelevant or incorrect code have highlighted the need for prompt programming (or prompt engineering) where engineers apply specific prompt techniques (e.g., chain-of-thought or input-output examples) to improve the generated code. Despite this, the impact of different prompt techniques -- and their combinations -- on code generation remains underexplored. In this study, we introduce CodePromptEval, a dataset of 7072 prompts designed to evaluate five prompt techniques (few-shot, persona, chain-of-thought, function signature, list of packages) and their effect on the correctness, similarity, and quality of complete functions generated by three LLMs (GPT-4o, Llama3, and Mistral). Our findings show that while certain prompt techniques significantly influence the generated code, combining multiple techniques does not necessarily improve the outcome. Additionally, we observed a trade-off between correctness and quality when using prompt techniques. Our dataset and replication package enable future research on improving LLM-generated code and evaluating new prompt techniques.

Comments:	CodePromptEval dataset and replication package on GitHub: this https URL
Subjects:	Software Engineering (cs.SE); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
Cite as:	arXiv:2412.20545 [cs.SE]
	(or arXiv:2412.20545v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2412.20545

Submission history

From: Ranim Khojah [view email]
[v1] Sun, 29 Dec 2024 18:34:10 UTC (5,542 KB)

Computer Science > Software Engineering

Title:The Impact of Prompt Programming on Function-Level Code Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:The Impact of Prompt Programming on Function-Level Code Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators