How do we get there? Evaluating transformer neural networks as cognitive models for English past tense inflection

Ma, Xiaomeng; Gao, Lingyu

Computer Science > Computation and Language

arXiv:2210.09167 (cs)

[Submitted on 17 Oct 2022 (v1), last revised 13 May 2023 (this version, v2)]

Title:How do we get there? Evaluating transformer neural networks as cognitive models for English past tense inflection

Authors:Xiaomeng Ma, Lingyu Gao

View PDF

Abstract:There is an ongoing debate on whether neural networks can grasp the quasi-regularities in languages like humans. In a typical quasi-regularity task, English past tense inflections, the neural network model has long been criticized that it learns only to generalize the most frequent pattern, but not the regular pattern, thus can not learn the abstract categories of regular and irregular and is dissimilar to human performance. In this work, we train a set of transformer models with different settings to examine their behavior on this task. The models achieved high accuracy on unseen regular verbs and some accuracy on unseen irregular verbs. The models' performance on the regulars is heavily affected by type frequency and ratio but not token frequency and ratio, and vice versa for the irregulars. The different behaviors on the regulars and irregulars suggest that the models have some degree of symbolic learning on the regularity of the verbs. In addition, the models are weakly correlated with human behavior on nonce verbs. Although the transformer model exhibits some level of learning on the abstract category of verb regularity, its performance does not fit human data well, suggesting that it might not be a good cognitive model.

Comments:	AACL-IJCNLP 2022 camera-ready
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.09167 [cs.CL]
	(or arXiv:2210.09167v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.09167

Submission history

From: Lingyu Gao [view email]
[v1] Mon, 17 Oct 2022 15:13:35 UTC (1,328 KB)
[v2] Sat, 13 May 2023 21:01:37 UTC (516 KB)

Computer Science > Computation and Language

Title:How do we get there? Evaluating transformer neural networks as cognitive models for English past tense inflection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:How do we get there? Evaluating transformer neural networks as cognitive models for English past tense inflection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators