Back Attention: Understanding and Enhancing Multi-Hop Reasoning in Large Language Models

Yu, Zeping; Belinkov, Yonatan; Ananiadou, Sophia

arXiv:2502.10835 (cs)

[Submitted on 15 Feb 2025]

Abstract: We investigate how large language models perform latent multi-hop reasoning in prompts like "Wolfgang Amadeus Mozart's mother's spouse is". To analyze this process, we introduce logit flow, an interpretability method that traces how logits propagate across layers and positions toward the final prediction. Using logit flow, we identify four distinct stages in single-hop knowledge prediction: (A) entity subject enrichment, (B) entity attribute extraction, (C) relation subject enrichment, and (D) relation attribute extraction. Extending this analysis to multi-hop reasoning, we find that failures often stem from the relation attribute extraction stage, where conflicting logits reduce prediction accuracy. To address this, we propose back attention, a novel mechanism that enables lower layers to leverage higher-layer hidden states from different positions during attention computation. With back attention, a 1-layer transformer achieves the performance of a 2-layer transformer. Applied to four LLMs, back attention improves accuracy on five reasoning datasets, demonstrating its effectiveness in enhancing latent multi-hop reasoning ability.
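
The abstract describes back attention only at a high level, so the following is a minimal sketch of one plausible reading, not the paper's formulation. It assumes that queries come from a lower layer's hidden states, that keys and values come from a higher layer's hidden states at the same or earlier positions, and that the attended result is added back to the lower layer as a residual. The function name `back_attention`, the projection matrices `w_q`, `w_k`, `w_v`, `w_o`, the single-head design, and the causal mask are all illustrative assumptions.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def back_attention(h_low, h_high, w_q, w_k, w_v, w_o):
    """Single-head sketch (assumed form, not taken from the paper).

    h_low:  (seq_len, d_model) hidden states from a lower layer.
    h_high: (seq_len, d_model) hidden states from a higher layer.
    Returns the updated lower-layer states and the attention weights.
    """
    q = h_low @ w_q    # queries from the lower layer
    k = h_high @ w_k   # keys from the higher layer
    v = h_high @ w_v   # values from the higher layer

    scores = q @ k.T / np.sqrt(q.shape[-1])

    # Assumption: causal mask, so position i only reads positions <= i.
    seq_len = h_low.shape[0]
    future = np.triu(np.ones((seq_len, seq_len), dtype=bool), k=1)
    scores = np.where(future, -np.inf, scores)

    weights = softmax(scores, axis=-1)

    # Assumption: residual update of the lower-layer states.
    return h_low + (weights @ v) @ w_o, weights


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    seq_len, d_model, d_head = 5, 8, 4
    h_low = rng.normal(size=(seq_len, d_model))
    h_high = rng.normal(size=(seq_len, d_model))
    w_q, w_k, w_v = (rng.normal(size=(d_model, d_head)) for _ in range(3))
    w_o = rng.normal(size=(d_head, d_model))
    out, weights = back_attention(h_low, h_high, w_q, w_k, w_v, w_o)
    print(out.shape, weights.shape)
```

In this sketch the only difference from ordinary self-attention is the source of the keys and values: they are computed from `h_high` rather than from the same layer as the queries, which is what lets a lower layer read information that has already been processed higher up at other positions.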

Comments:	preprint
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2502.10835 [cs.CL]
	(or arXiv:2502.10835v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.10835

Submission history

From: Zeping Yu
[v1] Sat, 15 Feb 2025 15:36:42 UTC (1,095 KB)
