Repository-level Code Translation Benchmark Targeting Rust

Ou, Guangsheng; Liu, Mingwei; Chen, Yuxuan; Peng, Xin; Zheng, Zibin

arXiv:2411.13990 (cs)

[Submitted on 21 Nov 2024 (v1), last revised 27 Mar 2025 (this version, v5)]

Abstract: Recent advancements in large language models (LLMs) have demonstrated impressive capabilities in code translation, typically evaluated on benchmarks such as CodeTransOcean. However, these benchmarks fail to capture real-world complexity: they focus primarily on simple function-level translations and overlook repository-level context (e.g., dependencies). Moreover, the effectiveness of LLMs in translating to newer, low-resource languages like Rust remains largely underexplored. To address this gap, we introduce RustRepoTrans, the first repository-level code translation benchmark, comprising 375 tasks that translate into Rust from C++, Java, and Python. Using this benchmark, we evaluate four state-of-the-art LLMs and analyze their errors to assess their limitations in complex translation scenarios. Among them, Claude-3.5 performs best with 43.5% Pass@1, excelling in both basic functionality and additional translation abilities such as noise robustness and syntactical difference identification. However, even Claude-3.5 drops by 30.8 percentage points in Pass@1 (from 74.3% to 43.5%) when handling repository-level context, compared with previous benchmarks that lack such context. We also find that LLMs struggle with language differences in complex tasks, and that dependencies further increase translation difficulty.
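The Pass@1 figures quoted in the abstract can be made concrete with a small sketch. It assumes that Pass@1 is the share of tasks whose single generated translation passes all of its tests; the paper's exact evaluation procedure is not described on this page, and the function names and the toy outcome list below are hypothetical. The sketch also separates the absolute drop (in percentage points) from the relative drop, since the abstract's 30.8 figure is the difference between 74.3% and 43.5%.

```python
def pass_at_1(results):
    """Percentage of tasks whose single generated translation passed all tests.

    `results` holds one boolean per task (assumed definition of Pass@1).
    """
    if not results:
        raise ValueError("results must contain at least one task outcome")
    return 100.0 * sum(1 for ok in results if ok) / len(results)


def absolute_drop(before, after):
    """Difference between two percentages, in percentage points."""
    return before - after


def relative_drop(before, after):
    """Drop expressed as a percentage of the starting value."""
    return 100.0 * (before - after) / before


# Hypothetical outcomes for 8 tasks (not data from the paper).
toy_results = [True, False, True, True, False, False, True, False]
print(pass_at_1(toy_results))

# Figures quoted in the abstract: 74.3% without repository-level context,
# 43.5% with it.
print(round(absolute_drop(74.3, 43.5), 1))
print(round(relative_drop(74.3, 43.5), 1))
```

Under these assumptions the absolute drop is 30.8 percentage points, which corresponds to a relative drop of roughly 41.5% of the original Pass@1.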

Subjects:	Software Engineering (cs.SE)
Cite as:	arXiv:2411.13990 [cs.SE]
	(or arXiv:2411.13990v5 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2411.13990

Submission history

From: Guangsheng Ou
[v1] Thu, 21 Nov 2024 10:00:52 UTC (1,263 KB)
[v2] Mon, 25 Nov 2024 06:57:25 UTC (1,263 KB)
[v3] Tue, 26 Nov 2024 13:21:44 UTC (1,263 KB)
[v4] Mon, 24 Mar 2025 03:15:28 UTC (1,350 KB)
[v5] Thu, 27 Mar 2025 07:12:39 UTC (1,359 KB)
