Investigating the Transferability of Code Repair for Low-Resource Programming Languages

Wong, Kyle; Amayuelas, Alfonso; Pan, Liangming; Wang, William Yang

Computer Science > Machine Learning

arXiv:2406.14867 (cs)

[Submitted on 21 Jun 2024 (v1), last revised 16 Oct 2024 (this version, v2)]

Title:Investigating the Transferability of Code Repair for Low-Resource Programming Languages

Authors:Kyle Wong, Alfonso Amayuelas, Liangming Pan, William Yang Wang

View PDF

Abstract:Large language models (LLMs) have shown remarkable performance on code generation tasks. A recent use case is iterative code repair, where an LLM fixes an incorrect program by rationalizing about errors and generating new code. Recent works augment the code repair process by integrating modern techniques such as chain-of-thought reasoning or distillation, but only study their benefits on high-resource languages like Python, and ignore low-resource languages like Perl. To address this gap of knowledge, we investigate the benefits of distilling code repair for both high and low resource languages to determine if the techniques that are effective in a high resource setting are also applicable in a low resource setting. Our evaluation shows that distilling the ability to repair code has language dependent benefits. To explain this behavior, we perform a further analysis and find that contrary to preexisting beliefs, the correlation between reasoning ability and code correction ability is weak. We hypothesize this weak correlation is magnified in low-resource settings where base models lack deep knowledge of a programming language, leading to wavering benefits of code repair.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2406.14867 [cs.LG]
	(or arXiv:2406.14867v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.14867

Submission history

From: Kyle Wong [view email]
[v1] Fri, 21 Jun 2024 05:05:39 UTC (796 KB)
[v2] Wed, 16 Oct 2024 05:03:04 UTC (806 KB)

Computer Science > Machine Learning

Title:Investigating the Transferability of Code Repair for Low-Resource Programming Languages

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Investigating the Transferability of Code Repair for Low-Resource Programming Languages

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators