Promote, Suppress, Iterate: How Language Models Answer One-to-Many Factual Queries

Yan, Tianyi Lorena; Jia, Robin

Computer Science > Computation and Language

arXiv:2502.20475 (cs)

[Submitted on 27 Feb 2025]

Title:Promote, Suppress, Iterate: How Language Models Answer One-to-Many Factual Queries

Authors:Tianyi Lorena Yan, Robin Jia

View PDF HTML (experimental)

Abstract:To answer one-to-many factual queries (e.g., listing cities of a country), a language model (LM) must simultaneously recall knowledge and avoid repeating previous answers. How are these two subtasks implemented and integrated internally? Across multiple datasets and models, we identify a promote-then-suppress mechanism: the model first recalls all answers, and then suppresses previously generated ones. Specifically, LMs use both the subject and previous answer tokens to perform knowledge recall, with attention propagating subject information and MLPs promoting the answers. Then, attention attends to and suppresses previous answer tokens, while MLPs amplify the suppression signal. Our mechanism is corroborated by extensive experimental evidence: in addition to using early decoding and causal tracing, we analyze how components use different tokens by introducing both \emph{Token Lens}, which decodes aggregated attention updates from specified tokens, and a knockout method that analyzes changes in MLP outputs after removing attention to specified tokens. Overall, we provide new insights into how LMs' internal components interact with different input tokens to support complex factual recall. Code is available at this https URL.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2502.20475 [cs.CL]
	(or arXiv:2502.20475v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.20475

Submission history

From: Tianyi Yan [view email]
[v1] Thu, 27 Feb 2025 19:23:15 UTC (24,687 KB)

Computer Science > Computation and Language

Title:Promote, Suppress, Iterate: How Language Models Answer One-to-Many Factual Queries

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Promote, Suppress, Iterate: How Language Models Answer One-to-Many Factual Queries

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators