Outlier dimensions favor frequent tokens in language models

Macocco, Iuri; Graichen, Nora; Boleda, Gemma; Baroni, Marco

arXiv:2503.21718 (cs)

[Submitted on 27 Mar 2025 (v1), last revised 9 Apr 2025 (this version, v3)]

Abstract: We study last-layer outlier dimensions, i.e., dimensions that display extreme activations for the majority of inputs. We show that outlier dimensions arise in many different modern language models, and trace their function back to the heuristic of constantly predicting frequent words. We further show how a model can block this heuristic when it is not contextually appropriate, by assigning a counterbalancing weight mass to the remaining dimensions, and we investigate which model parameters boost outlier dimensions and when they arise during training. We conclude that outlier dimensions are a specialized mechanism discovered by many distinct models to implement a useful token prediction heuristic.
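
To make the abstract's definition concrete, the following is a minimal sketch of how one might flag dimensions that show extreme activations for the majority of inputs. It is an illustration under stated assumptions, not the authors' procedure: the function name `find_outlier_dimensions`, the z-score criterion, and the default values of `z_threshold` and `min_fraction` are all hypothetical choices, and the toy vectors stand in for real last-layer activations.

```python
from statistics import mean, pstdev


def find_outlier_dimensions(activations, z_threshold=3.0, min_fraction=0.5):
    """Return indices of dimensions that are extreme for most inputs.

    activations: list of equal-length lists of floats, one vector per input.
    z_threshold and min_fraction are illustrative assumptions, not values
    taken from the paper.
    """
    if not activations:
        return []
    n_dims = len(activations[0])
    extreme_counts = [0] * n_dims
    for vector in activations:
        mu = mean(vector)
        sigma = pstdev(vector)
        if sigma == 0:
            continue  # a constant vector has no extreme dimension
        for d, value in enumerate(vector):
            # A dimension is "extreme" for this input if it lies far from
            # the mean of the vector, measured in standard deviations.
            if abs(value - mu) > z_threshold * sigma:
                extreme_counts[d] += 1
    n_inputs = len(activations)
    return [d for d, c in enumerate(extreme_counts) if c / n_inputs > min_fraction]


def make_toy_vector(spike_dim, n_dims=16):
    """Build a toy activation vector with one large value at spike_dim."""
    vector = [0.1 * (i % 3) for i in range(n_dims)]
    vector[spike_dim] = 50.0
    return vector


# Dimension 3 spikes in three of four toy inputs, dimension 7 in only one.
toy_inputs = [make_toy_vector(3), make_toy_vector(3),
              make_toy_vector(3), make_toy_vector(7)]
print(find_outlier_dimensions(toy_inputs))
```

In this toy setup only dimension 3 is extreme in more than half of the inputs, so it is the only one reported; dimension 7 is extreme once and is ignored.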

Comments:	9 pages, 4 figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
ACM classes:	I.2.7
Cite as:	arXiv:2503.21718 [cs.CL]
	(or arXiv:2503.21718v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2503.21718

Submission history

From: Iuri Macocco
[v1] Thu, 27 Mar 2025 17:30:50 UTC (3,720 KB)
[v2] Fri, 28 Mar 2025 14:55:05 UTC (3,720 KB)
[v3] Wed, 9 Apr 2025 14:37:48 UTC (3,720 KB)
