Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models

Yang, Yukang; Campbell, Declan; Huang, Kaixuan; Wang, Mengdi; Cohen, Jonathan; Webb, Taylor

arXiv:2502.20332 (cs)

[Submitted on 27 Feb 2025]

Abstract: Many recent studies have found evidence for emergent reasoning capabilities in large language models, but debate persists concerning the robustness of these capabilities, and the extent to which they depend on structured reasoning mechanisms. To shed light on these issues, we perform a comprehensive study of the internal mechanisms that support abstract rule induction in an open-source language model (Llama3-70B). We identify an emergent symbolic architecture that implements abstract reasoning via a series of three computations. In early layers, symbol abstraction heads convert input tokens to abstract variables based on the relations between those tokens. In intermediate layers, symbolic induction heads perform sequence induction over these abstract variables. Finally, in later layers, retrieval heads predict the next token by retrieving the value associated with the predicted abstract variable. These results point toward a resolution of the longstanding debate between symbolic and neural network approaches, suggesting that emergent reasoning in neural networks depends on the emergence of symbolic mechanisms.
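
The three computations named in the abstract can be pictured with a toy sketch. This is only an illustration under stated assumptions, not the authors' code and not a description of how attention heads compute anything: the task format (identity rules such as ABA or ABB over arbitrary tokens) and every function name below are hypothetical, chosen to make the abstraction, induction, and retrieval steps concrete in plain Python.

```python
from typing import Dict, List, Sequence


def abstract_symbols(tokens: Sequence[str]) -> List[str]:
    """Stage 1 (toy): map each token to an abstract variable (A, B, ...)
    based only on its identity relations to earlier tokens."""
    names = "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
    seen: Dict[str, str] = {}
    variables = []
    for tok in tokens:
        if tok not in seen:
            seen[tok] = names[len(seen)]  # first occurrence gets a new variable
        variables.append(seen[tok])
    return variables


def induce_next_variable(examples: Sequence[Sequence[str]], prefix_len: int) -> str:
    """Stage 2 (toy): find the abstract pattern shared by the in-context
    examples and return the variable expected at position prefix_len."""
    patterns = {tuple(abstract_symbols(ex)) for ex in examples}
    if len(patterns) != 1:
        raise ValueError("examples do not share a single abstract pattern")
    return patterns.pop()[prefix_len]


def retrieve_value(query_prefix: Sequence[str], variable: str) -> str:
    """Stage 3 (toy): look up the concrete token bound to the predicted
    variable in the query. Raises KeyError if the variable is unbound."""
    bindings = dict(zip(abstract_symbols(query_prefix), query_prefix))
    return bindings[variable]


def predict_next(examples: Sequence[Sequence[str]], query_prefix: Sequence[str]) -> str:
    """Chain the three toy stages: abstract, induce, retrieve."""
    variable = induce_next_variable(examples, len(query_prefix))
    return retrieve_value(query_prefix, variable)
```

With ABA-style examples such as `[["iac", "ilege", "iac"], ["ptest", "yi", "ptest"]]` and the query prefix `["la", "dee"]`, `predict_next` returns `"la"`; with ABB-style examples it returns `"dee"`. The point of the sketch is that the induction step never sees the concrete tokens, only the variables, which is the sense of "symbolic" that the abstract describes.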

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2502.20332 [cs.CL]
	(or arXiv:2502.20332v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.20332

Submission history

From: Yukang Yang
[v1] Thu, 27 Feb 2025 18:02:15 UTC (1,964 KB)
