Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations

Xiao, Chenghao; Chan, Hou Pong; Zhang, Hao; Aljunied, Mahani; Bing, Lidong; Moubayed, Noura Al; Rong, Yu

Computer Science > Computation and Language

arXiv:2504.13816 (cs)

[Submitted on 18 Apr 2025]

Title:Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations

Authors:Chenghao Xiao, Hou Pong Chan, Hao Zhang, Mahani Aljunied, Lidong Bing, Noura Al Moubayed, Yu Rong

View PDF HTML (experimental)

Abstract:While understanding the knowledge boundaries of LLMs is crucial to prevent hallucination, research on knowledge boundaries of LLMs has predominantly focused on English. In this work, we present the first study to analyze how LLMs recognize knowledge boundaries across different languages by probing their internal representations when processing known and unknown questions in multiple languages. Our empirical studies reveal three key findings: 1) LLMs' perceptions of knowledge boundaries are encoded in the middle to middle-upper layers across different languages. 2) Language differences in knowledge boundary perception follow a linear structure, which motivates our proposal of a training-free alignment method that effectively transfers knowledge boundary perception ability across languages, thereby helping reduce hallucination risk in low-resource languages; 3) Fine-tuning on bilingual question pair translation further enhances LLMs' recognition of knowledge boundaries across languages. Given the absence of standard testbeds for cross-lingual knowledge boundary analysis, we construct a multilingual evaluation suite comprising three representative types of knowledge boundary data. Our code and datasets are publicly available at this https URL.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2504.13816 [cs.CL]
	(or arXiv:2504.13816v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2504.13816

Submission history

From: Chenghao Xiao [view email]
[v1] Fri, 18 Apr 2025 17:44:12 UTC (30,155 KB)

Computer Science > Computation and Language

Title:Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators