Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking

Zhang, Yifan; Du, Wenyu; Jin, Dongming; Fu, Jie; Jin, Zhi

Computer Science > Computation and Language

arXiv:2502.20129 (cs)

[Submitted on 27 Feb 2025]

Title:Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking

Authors:Yifan Zhang, Wenyu Du, Dongming Jin, Jie Fu, Zhi Jin

View PDF HTML (experimental)

Abstract:Chain-of-Thought (CoT) significantly enhances the performance of large language models (LLMs) across a wide range of tasks, and prior research shows that CoT can theoretically increase expressiveness. However, there is limited mechanistic understanding of the algorithms that Transformer+CoT can learn. In this work, we (1) evaluate the state tracking capabilities of Transformer+CoT and its variants, confirming the effectiveness of CoT. (2) Next, we identify the circuit, a subset of model components, responsible for tracking the world state, finding that late-layer MLP neurons play a key role. We propose two metrics, compression and distinction, and show that the neuron sets for each state achieve nearly 100% accuracy, providing evidence of an implicit finite state automaton (FSA) embedded within the model. (3) Additionally, we explore three realistic settings: skipping intermediate steps, introducing data noise, and testing length generalization. Our results demonstrate that Transformer+CoT learns robust algorithms (FSA), highlighting its resilience in challenging scenarios.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2502.20129 [cs.CL]
	(or arXiv:2502.20129v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.20129

Submission history

From: Yifan Zhang [view email]
[v1] Thu, 27 Feb 2025 14:24:51 UTC (23,591 KB)

Computer Science > Computation and Language

Title:Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators