Probing Quantifier Comprehension in Large Language Models: Another Example of Inverse Scaling

Gupta, Akshat

Computer Science > Computation and Language

arXiv:2306.07384 (cs)

[Submitted on 12 Jun 2023 (v1), last revised 30 Nov 2023 (this version, v3)]

Title:Probing Quantifier Comprehension in Large Language Models: Another Example of Inverse Scaling

Authors:Akshat Gupta

View PDF

Abstract:With their increasing size, large language models (LLMs) are becoming increasingly good at language understanding tasks. But even with high performance on specific downstream task, LLMs fail at simple linguistic tests for negation or quantifier understanding. Previous work on quantifier understanding in LLMs show inverse scaling in understanding few-type quantifiers. In this paper, we question the claims of of previous work and show that it is a result of inappropriate testing methodology. We also present alternate methods to measure quantifier comprehension in LLMs and show that LLMs are able to better understand the difference between the meaning of few-type and most-type quantifiers as their size increases, although they are not particularly good at it. We also observe inverse scaling for most-type quantifier understanding, which is contrary to human psycho-linguistic experiments and previous work, where the model's understanding of most-type quantifier gets worse as the model size increases. We do this evaluation on models ranging from 125M-175B parameters, which suggests that LLMs do not do as well as expected with quantifiers. We also discuss the possible reasons for this and the relevance of quantifier understanding in evaluating language understanding in LLMs.

Comments:	Accepted to BlackboxNLP (EMNLP 2023)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2306.07384 [cs.CL]
	(or arXiv:2306.07384v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2306.07384

Submission history

From: Akshat Gupta [view email]
[v1] Mon, 12 Jun 2023 19:20:18 UTC (1,315 KB)
[v2] Tue, 15 Aug 2023 18:40:20 UTC (1,316 KB)
[v3] Thu, 30 Nov 2023 01:01:32 UTC (1,316 KB)

Computer Science > Computation and Language

Title:Probing Quantifier Comprehension in Large Language Models: Another Example of Inverse Scaling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Probing Quantifier Comprehension in Large Language Models: Another Example of Inverse Scaling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators