Generating High-Precision Feedback for Programming Syntax Errors using Large Language Models

Phung, Tung; Cambronero, José; Gulwani, Sumit; Kohn, Tobias; Majumdar, Rupak; Singla, Adish; Soares, Gustavo

Computer Science > Programming Languages

arXiv:2302.04662 (cs)

[Submitted on 24 Jan 2023 (v1), last revised 28 Apr 2023 (this version, v2)]

Title:Generating High-Precision Feedback for Programming Syntax Errors using Large Language Models

Authors:Tung Phung, José Cambronero, Sumit Gulwani, Tobias Kohn, Rupak Majumdar, Adish Singla, Gustavo Soares

View PDF

Abstract:Large language models (LLMs), such as Codex, hold great promise in enhancing programming education by automatically generating feedback for students. We investigate using LLMs to generate feedback for fixing syntax errors in Python programs, a key scenario in introductory programming. More concretely, given a student's buggy program, our goal is to generate feedback comprising a fixed program along with a natural language explanation describing the errors/fixes, inspired by how a human tutor would give feedback. While using LLMs is promising, the critical challenge is to ensure high precision in the generated feedback, which is imperative before deploying such technology in classrooms. The main research question we study is: Can we develop LLMs-based feedback generation techniques with a tunable precision parameter, giving educators quality control over the feedback that students receive? To this end, we introduce PyFiXV, our technique to generate high-precision feedback powered by Codex. The key idea behind PyFiXV is to use a novel run-time validation mechanism to decide whether the generated feedback is suitable for sharing with the student; notably, this validation mechanism also provides a precision knob to educators. We perform an extensive evaluation using two real-world datasets of Python programs with syntax errors and show the efficacy of PyFiXV in generating high-precision feedback.

Comments:	Published in International Conference on Educational Data Mining (EDM) 2023
Subjects:	Programming Languages (cs.PL); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2302.04662 [cs.PL]
	(or arXiv:2302.04662v2 [cs.PL] for this version)
	https://doi.org/10.48550/arXiv.2302.04662

Submission history

From: Adish Singla [view email]
[v1] Tue, 24 Jan 2023 13:00:25 UTC (572 KB)
[v2] Fri, 28 Apr 2023 11:33:44 UTC (574 KB)

Computer Science > Programming Languages

Title:Generating High-Precision Feedback for Programming Syntax Errors using Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Programming Languages

Title:Generating High-Precision Feedback for Programming Syntax Errors using Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators