Scaling Test-time Compute for Low-resource Languages: Multilingual Reasoning in LLMs

Tran, Khanh-Tung; O'Sullivan, Barry; Nguyen, Hoang D.

arXiv:2504.02890 (cs)

[Submitted on 2 Apr 2025]

Abstract: Recent advances in test-time compute scaling have enabled Large Language Models (LLMs) to tackle deep reasoning tasks by generating a chain-of-thought (CoT) that includes trial and error, backtracking, and intermediate reasoning steps before producing the final answer. However, these techniques have been applied predominantly to popular languages, such as English, leaving reasoning in low-resource languages underexplored and misaligned. In this work, we investigate the multilingual mechanism by which LLMs internally operate in a latent space biased toward their inherently dominant language. To leverage this phenomenon for low-resource languages, we train models to generate the CoT in English while outputting the final response in the target language, given input in the low-resource language. Our experiments demonstrate that this approach, named English-Pivoted CoT Training, outperforms other baselines, including training to generate both the CoT and the final response solely in the target language, with up to 28.33% improvement. Further analysis provides novel insights into the relationship between reasoning and multilinguality in LLMs, motivating better approaches to developing multilingual large reasoning models.
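As an illustration of the data layout the abstract describes (input in the low-resource language, CoT in English, final response in the target language), a single supervised training pair might be assembled roughly as follows. This is a minimal sketch and not the authors' implementation: the `PivotExample` class, the `format_training_pair` helper, the `<think>` delimiters, and the `prompt`/`completion` field names are all hypothetical choices made for this example, and the strings are placeholders rather than real data.

```python
from dataclasses import dataclass


@dataclass
class PivotExample:
    """One training example (hypothetical structure, not from the paper)."""
    question: str      # prompt written in the low-resource target language
    english_cot: str   # reasoning trace written in English
    final_answer: str  # final response written in the target language


def format_training_pair(ex: PivotExample,
                         think_open: str = "<think>",
                         think_close: str = "</think>") -> dict:
    """Build a prompt/completion pair with an English CoT and a target-language answer.

    The delimiter strings are assumptions; a real setup would use whatever
    reasoning markers the base model's chat template expects.
    """
    completion = (
        f"{think_open}\n"
        f"{ex.english_cot.strip()}\n"
        f"{think_close}\n"
        f"{ex.final_answer.strip()}"
    )
    return {"prompt": ex.question.strip(), "completion": completion}


# Placeholder strings stand in for real text in each language.
example = PivotExample(
    question="[question in the target language]",
    english_cot="[step-by-step reasoning in English]",
    final_answer="[final answer in the target language]",
)
pair = format_training_pair(example)
print(pair["completion"])
```

The target-language-only baseline mentioned in the abstract would differ only in that the reasoning trace is also written in the target language; the pair layout itself could stay the same.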

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2504.02890 [cs.CL]
	(or arXiv:2504.02890v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2504.02890

Submission history

From: Hoang D. Nguyen
[v1] Wed, 2 Apr 2025 16:58:36 UTC (915 KB)
