Do Large Language Models Pay Similar Attention Like Human Programmers When Generating Code?

Kou, Bonan; Chen, Shengmai; Wang, Zhijie; Ma, Lei; Zhang, Tianyi

doi:10.1145/3660807

arXiv:2306.01220 (cs)

[Submitted on 2 Jun 2023 (v1), last revised 23 May 2024 (this version, v2)]

Abstract: Large Language Models (LLMs) have recently been widely used for code generation. Due to the complexity and opacity of LLMs, little is known about how these models generate code. We made the first attempt to bridge this knowledge gap by investigating whether LLMs attend to the same parts of a task description as human programmers during code generation. An analysis of six LLMs, including GPT-4, on two popular code generation benchmarks revealed a consistent misalignment between LLMs' and programmers' attention. We manually analyzed 211 incorrect code snippets and found five attention patterns that can be used to explain many code generation errors. Finally, a user study showed that model attention computed by a perturbation-based method is often favored by human programmers. Our findings highlight the need for human-aligned LLMs for better interpretability and programmer trust.

Comments:	To appear in the 2024 ACM International Conference on the Foundations of Software Engineering (FSE '24)
Subjects:	Software Engineering (cs.SE); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
Cite as:	arXiv:2306.01220 [cs.SE]
	(or arXiv:2306.01220v2 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2306.01220
Related DOI:	https://doi.org/10.1145/3660807

Submission history

From: Zhijie Wang
[v1] Fri, 2 Jun 2023 00:57:03 UTC (1,993 KB)
[v2] Thu, 23 May 2024 17:27:12 UTC (362 KB)
