How to Dissect a Muppet: The Structure of Transformer Embedding Spaces

Mickus, Timothee; Paperno, Denis; Constant, Mathieu

Computer Science > Computation and Language

arXiv:2206.03529 (cs)

[Submitted on 7 Jun 2022]

Title:How to Dissect a Muppet: The Structure of Transformer Embedding Spaces

Authors:Timothee Mickus, Denis Paperno, Mathieu Constant

View PDF

Abstract:Pretrained embeddings based on the Transformer architecture have taken the NLP community by storm. We show that they can mathematically be reframed as a sum of vector factors and showcase how to use this reframing to study the impact of each component. We provide evidence that multi-head attentions and feed-forwards are not equally useful in all downstream applications, as well as a quantitative overview of the effects of finetuning on the overall embedding space. This approach allows us to draw connections to a wide range of previous studies, from vector space anisotropy to attention weights.

Comments:	Accepted at TACL (pre-MIT Press publication version)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2206.03529 [cs.CL]
	(or arXiv:2206.03529v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2206.03529

Submission history

From: Timothee Mickus [view email]
[v1] Tue, 7 Jun 2022 18:24:46 UTC (448 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2022-06

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:How to Dissect a Muppet: The Structure of Transformer Embedding Spaces

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:How to Dissect a Muppet: The Structure of Transformer Embedding Spaces

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators