Large Language Models Are Cross-Lingual Knowledge-Free Reasoners

Hu, Peng; Liu, Sizhe; Gao, Changjiang; Huang, Xin; Han, Xue; Feng, Junlan; Deng, Chao; Huang, Shujian

arXiv:2406.16655 (cs)

[Submitted on 24 Jun 2024 (v1), last revised 15 Oct 2024 (this version, v2)]

Abstract: Large Language Models have demonstrated impressive reasoning capabilities across multiple languages. However, the relationship between capabilities in different languages is less explored. In this work, we decompose the process of reasoning tasks into two separate components: knowledge retrieval and knowledge-free reasoning, and analyze the relationship between cross-lingual transferability and these two components. With adapted commonsense reasoning datasets and constructed knowledge-free reasoning datasets, we show that the knowledge-free reasoning capability can be nearly perfectly transferred across various source-target language directions, despite the secondary impact of resources in some specific target languages, while cross-lingual knowledge retrieval significantly hinders the transfer. Moreover, by analyzing the hidden states and feed-forward network neuron activation during reasoning, we show that higher similarity of hidden representations and larger overlap of activated neurons could explain the better cross-lingual transferability of knowledge-free reasoning compared with knowledge retrieval. Thus, we hypothesize that knowledge-free reasoning shares similar neurons in different languages for reasoning, while knowledge is stored separately in different languages. Our code and data are available at: this https URL.
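
The two interpretability measurements named in the abstract, similarity of hidden representations and overlap of activated neurons, can be illustrated with a minimal sketch. The abstract does not specify the exact metrics, so everything here is an assumption made for illustration: cosine similarity for hidden states, Jaccard overlap for activated-neuron sets, the activation threshold of 0.0, and the function names themselves. The vectors are toy values, not data from the paper.

```python
import math
from typing import Sequence, Set


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two hidden-state vectors of equal length."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # convention chosen here for all-zero vectors
    return dot / (norm_u * norm_v)


def activated_neurons(activations: Sequence[float], threshold: float = 0.0) -> Set[int]:
    """Indices of neurons whose activation exceeds the threshold (an assumed criterion)."""
    return {i for i, a in enumerate(activations) if a > threshold}


def neuron_overlap(acts_a: Sequence[float], acts_b: Sequence[float],
                   threshold: float = 0.0) -> float:
    """Jaccard overlap between the activated-neuron sets of two inputs."""
    set_a = activated_neurons(acts_a, threshold)
    set_b = activated_neurons(acts_b, threshold)
    union = set_a | set_b
    if not union:
        return 1.0  # convention chosen here: two empty sets count as identical
    return len(set_a & set_b) / len(union)


# Toy hidden states and feed-forward activations for the same question
# posed in two languages (hypothetical values).
hidden_lang_a = [1.0, 2.0, 0.0]
hidden_lang_b = [2.0, 4.0, 0.0]
ffn_lang_a = [0.5, 0.0, 1.2, -0.3]
ffn_lang_b = [0.1, 0.7, 0.9, -0.1]

similarity = cosine_similarity(hidden_lang_a, hidden_lang_b)
overlap = neuron_overlap(ffn_lang_a, ffn_lang_b)
print(f"hidden-state cosine similarity: {similarity:.3f}")
print(f"activated-neuron overlap: {overlap:.3f}")
```

Under the abstract's hypothesis, both numbers would tend to be higher for knowledge-free reasoning inputs than for knowledge-retrieval inputs when the same item is presented in different languages.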

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2406.16655 [cs.CL]
	(or arXiv:2406.16655v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.16655

Submission history

From: Peng Hu [view email]
[v1] Mon, 24 Jun 2024 14:03:04 UTC (211 KB)
[v2] Tue, 15 Oct 2024 13:08:01 UTC (639 KB)
