Katana: Dual Slicing-Based Context for Learning Bug Fixes

Sintaha, Mifta; Nashid, Noor; Mesbah, Ali


arXiv:2205.00180 (cs)

[Submitted on 30 Apr 2022 (v1), last revised 5 Dec 2022 (this version, v3)]



Abstract: Contextual information plays a vital role for software developers when understanding and fixing a bug. Consequently, deep learning-based program repair techniques leverage context for bug fixes. However, existing techniques treat context in an arbitrary manner, by extracting code in close proximity to the buggy statement within the enclosing file, class, or method, without any analysis to find actual relations with the bug. To reduce noise, they use a predefined maximum limit on the number of tokens to be used as context. We present a program slicing-based approach, in which instead of arbitrarily including code as context, we analyze statements that have a control or data dependency on the buggy statement. We propose a novel concept called dual slicing, which leverages the context of both buggy and fixed versions of the code to capture relevant repair ingredients. We present our technique and tool called Katana, the first to apply slicing-based context for a program repair task. The results show Katana effectively preserves sufficient information for a model to choose contextual information while reducing noise. We compare against four recent state-of-the-art context-aware program repair techniques. Our results show Katana fixes between 1.5 and 3.7 times more bugs than existing techniques.
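The slicing idea in the abstract can be illustrated with a small sketch. It is not Katana's implementation: the abstract does not say how dependencies are computed, so the dependency graphs below are hand-written for a toy program, and the names `backward_slice`, `dual_slice`, `BUGGY_DEPS`, and `FIXED_DEPS` are hypothetical. Each graph maps a statement to the statements it has a control or data dependency on. Slicing backward from the buggy statement keeps only related code, and doing the same on the fixed version gives the second half of a dual slice.

```python
from collections import deque

def backward_slice(deps, criterion):
    """Collect every statement the criterion transitively depends on.

    `deps` maps a statement id to the ids it has a control or data
    dependency on (assumed to be given; a real tool would derive it
    through program analysis).
    """
    seen = {criterion}
    queue = deque([criterion])
    while queue:
        stmt = queue.popleft()
        for parent in deps.get(stmt, ()):
            if parent not in seen:
                seen.add(parent)
                queue.append(parent)
    return sorted(seen)

def dual_slice(buggy_deps, fixed_deps, criterion):
    """Slice the buggy and fixed versions from the same statement."""
    return {
        "buggy": backward_slice(buggy_deps, criterion),
        "fixed": backward_slice(fixed_deps, criterion),
    }

# Toy program; statement ids are line numbers.
# 1: total = 0
# 2: count = len(items)
# 3: limit = 10
# 4: for x in items:
# 5:     total += x
# 6: avg = total / limit    # buggy; the fix is: avg = total / count
BUGGY_DEPS = {6: [1, 5, 3], 5: [1, 4], 4: []}
FIXED_DEPS = {6: [1, 5, 2], 5: [1, 4], 4: []}

context = dual_slice(BUGGY_DEPS, FIXED_DEPS, criterion=6)
print(context["buggy"])  # statement 2 is absent: line 6 never reads count
print(context["fixed"])  # statement 3 is absent: the fix no longer reads limit
```

In this toy case the two slices differ in exactly one statement, which is the kind of repair ingredient that proximity-based context with a fixed token limit could either miss or bury in unrelated code.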

Subjects:	Software Engineering (cs.SE)
Cite as:	arXiv:2205.00180 [cs.SE]
	(or arXiv:2205.00180v3 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2205.00180

Submission history

From: Mifta Sintaha
[v1] Sat, 30 Apr 2022 07:04:41 UTC (378 KB)
[v2] Thu, 2 Jun 2022 20:43:53 UTC (385 KB)
[v3] Mon, 5 Dec 2022 22:09:36 UTC (767 KB)
