Garden-Path Traversal within GPT-2

Jurayj, William; Rudman, William; Eickhoff, Carsten

Computer Science > Computation and Language

arXiv:2205.12302v1 (cs)

[Submitted on 24 May 2022 (this version), latest version 20 Oct 2022 (v2)]

Title:Garden-Path Traversal within GPT-2

Authors:William Jurayj, William Rudman, Carsten Eickhoff

View PDF

Abstract:In recent years, massive language models consisting exclusively of transformer decoders, led by the GPT-x family, have become increasingly popular. While studies have examined the behavior of these models, they tend to only focus on the output of the language model, avoiding analyzing their internal states despite such analyses being popular tools used within BERTology to study transformer encoders. We present a collection of methods for analyzing GPT-2's hidden states, and use the model's navigation of garden path sentences as a case study to demonstrate the utility of studying this model's behavior beyond its output alone. To support this analysis, we introduce a novel dataset consisting of 3 different types of garden path sentences, along with scripts to manipulate them. We find that measuring Manhattan distances and cosine similarities between hidden states shows that GPT-2 navigates these sentences more intuitively than conventional methods that predict from the model's output alone.

Comments:	7 pages, 6 figures, preprint
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2205.12302 [cs.CL]
	(or arXiv:2205.12302v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.12302

Submission history

From: William Jurayj [view email]
[v1] Tue, 24 May 2022 18:21:58 UTC (7,665 KB)
[v2] Thu, 20 Oct 2022 16:21:06 UTC (7,690 KB)

Computer Science > Computation and Language

Title:Garden-Path Traversal within GPT-2

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Garden-Path Traversal within GPT-2

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators