Can language models handle recursively nested grammatical structures? A case study on comparing models and humans

Lampinen, Andrew Kyle

arXiv:2210.15303v1 (cs)

[Submitted on 27 Oct 2022 (this version), latest version 16 Feb 2023 (v3)]

Abstract: How should we compare the capabilities of language models and humans? Here, I consider a case study: processing of recursively nested grammatical structures. Prior work has suggested that language models cannot handle these structures as reliably as humans can. However, the humans were provided with instructions and training before being evaluated, while the language models were evaluated zero-shot. I therefore attempt to more closely match the evaluation paradigms by providing language models with few-shot prompts. A simple prompt, which contains substantially less content than the human training, allows large language models to consistently outperform the human results. The same prompt even allows extrapolation to more-deeply-nested conditions than have been tested in humans. Further, a reanalysis of the prior human experiments suggests that the humans may not perform above chance at the difficult structures initially. These results suggest that large language models can in fact process recursively nested grammatical structures comparably to humans. This case study highlights how discrepancies in the quantity of experiment-specific context can confound comparisons of language models and humans. I use this case study to reflect on the broader challenge of comparing human and model capabilities, and to suggest that there is an important difference between evaluating cognitive models of a specific phenomenon and evaluating broadly-trained models.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2210.15303 [cs.CL]
	(or arXiv:2210.15303v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.15303

Submission history

From: Andrew Lampinen
[v1] Thu, 27 Oct 2022 10:25:12 UTC (138 KB)
[v2] Tue, 1 Nov 2022 16:03:04 UTC (222 KB)
[v3] Thu, 16 Feb 2023 14:58:19 UTC (223 KB)
