Large Language Models as Code Executors: An Exploratory Study

Lyu, Chenyang; Yan, Lecheng; Xing, Rui; Li, Wenxi; Samih, Younes; Ji, Tianbo; Wang, Longyue

arXiv:2410.06667 (cs)

[Submitted on 9 Oct 2024 (v1), last revised 10 Oct 2024 (this version, v2)]

Abstract: The capabilities of Large Language Models (LLMs) have significantly evolved, extending from natural language processing to complex tasks like code understanding and generation. We expand the scope of LLMs' capabilities to a broader context, using LLMs to execute code snippets to obtain the output. This paper pioneers the exploration of LLMs as code executors, where code snippets are directly fed to the models for execution, and outputs are returned. We are the first to comprehensively examine this feasibility across various LLMs, including OpenAI's o1, GPT-4o, GPT-3.5, DeepSeek, and Qwen-Coder. Notably, the o1 model achieved over 90% accuracy in code execution, while others demonstrated lower accuracy levels. Furthermore, we introduce an Iterative Instruction Prompting (IIP) technique that processes code snippets line by line, enhancing the accuracy of weaker models by an average of 7.22% (with the highest improvement of 18.96%) and an absolute average improvement of 3.86% against CoT prompting (with the highest improvement of 19.46%). Our study not only highlights the transformative potential of LLMs in coding but also lays the groundwork for future advancements in automated programming and the completion of complex tasks.
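The abstract says only that IIP "processes code snippets line by line"; it does not give the prompt wording or the scoring procedure. The following is a minimal sketch of what such a loop could look like, not the authors' implementation. The function names (`run_iteratively`, `exact_match_accuracy`), the prompt text, and the `model` callable are all hypothetical; `model` stands in for any function that sends a prompt to an LLM and returns its reply.

```python
from typing import Callable, List


def run_iteratively(code: str, model: Callable[[str], str]) -> List[str]:
    """Feed a snippet to `model` one line at a time (hypothetical IIP-style loop).

    Each prompt carries the model's previous reply so that the reported
    program state is passed forward from line to line.
    """
    # Keep indentation intact; skip only blank lines.
    lines = [ln for ln in code.splitlines() if ln.strip()]
    replies: List[str] = []
    for number, line in enumerate(lines, start=1):
        previous = replies[-1] if replies else "none"
        # Assumed prompt wording; the paper's actual instructions are not in the abstract.
        prompt = (
            f"Previously reported state: {previous}\n"
            f"Line {number} of {len(lines)}: {line}\n"
            "Report the program state and any output after executing this line."
        )
        replies.append(model(prompt))
    return replies


def exact_match_accuracy(predictions: List[str], references: List[str]) -> float:
    """Fraction of predictions equal to the reference output, ignoring outer whitespace."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        return 0.0
    hits = sum(p.strip() == r.strip() for p, r in zip(predictions, references))
    return hits / len(references)
```

In use, `model` would wrap a call to whichever LLM is being evaluated, and the last reply would be compared against the output of actually running the snippet. Exact-match scoring is an assumption here; the abstract reports accuracy figures without stating the metric.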

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2410.06667 [cs.CL]
	(or arXiv:2410.06667v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.06667

Submission history

From: Chenyang Lyu
[v1] Wed, 9 Oct 2024 08:23:22 UTC (1,488 KB)
[v2] Thu, 10 Oct 2024 05:12:44 UTC (1,484 KB)
