Garden-Path Traversal in GPT-2

Jurayj, William; Rudman, William; Eickhoff, Carsten

Computer Science > Computation and Language

arXiv:2205.12302 (cs)

[Submitted on 24 May 2022 (v1), last revised 20 Oct 2022 (this version, v2)]

Title:Garden-Path Traversal in GPT-2

Authors:William Jurayj, William Rudman, Carsten Eickhoff

View PDF

Abstract:In recent years, large-scale transformer decoders such as the GPT-x family of models have become increasingly popular. Studies examining the behavior of these models tend to focus only on the output of the language modeling head and avoid analysis of the internal states of the transformer decoder. In this study, we present a collection of methods to analyze the hidden states of GPT-2 and use the model's navigation of garden path sentences as a case study. To enable this, we compile the largest currently available dataset of garden path sentences. We show that Manhattan distances and cosine similarities provide more reliable insights compared to established surprisal methods that analyze next-token probabilities computed by a language modeling head. Using these methods, we find that negating tokens have minimal impacts on the model's representations for unambiguous forms of sentences with ambiguity solely over what the object of a verb is, but have a more substantial impact of representations for unambiguous sentences whose ambiguity would stem from the voice of a verb. Further, we find that analyzing the decoder model's hidden states reveals periods of ambiguity that might conclude in a garden path effect but happen not to, whereas surprisal analyses routinely miss this detail.

Comments:	9 pages, 6 figures, Accepted to EMNLP BlackBox NLP 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2205.12302 [cs.CL]
	(or arXiv:2205.12302v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.12302
Journal reference:	ACL 2022, BlackBox NLP, pages 305-313 https://aclanthology.org/2022.blackboxnlp-1.25/

Submission history

From: William Jurayj [view email]
[v1] Tue, 24 May 2022 18:21:58 UTC (7,665 KB)
[v2] Thu, 20 Oct 2022 16:21:06 UTC (7,690 KB)

Computer Science > Computation and Language

Title:Garden-Path Traversal in GPT-2

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Garden-Path Traversal in GPT-2

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators