Confidence Regularized Masked Language Modeling using Text Length

Ji, Seunghyun; Lee, Soowon

arXiv:2504.06037 (cs)

[Submitted on 8 Apr 2025 (v1), last revised 9 Apr 2025 (this version, v2)]

Abstract: Masked language modeling is a widely used method for learning language representations, where the model predicts a randomly masked word in each input. However, this approach typically considers only a single correct answer during training, ignoring the variety of plausible alternatives that humans might choose. This issue becomes more pronounced when the input text is short, as the possible word distribution tends to have higher entropy, potentially causing the model to become overconfident in its predictions. To mitigate this, we propose a novel confidence regularizer that adaptively adjusts the regularization strength based on the input length. Experiments on the GLUE and SQuAD benchmarks show that our method improves both accuracy and expected calibration error.
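
The abstract reports expected calibration error (ECE) without defining it. As a rough illustration only, the sketch below computes the commonly used equal-width-bin form of ECE: predictions are grouped by confidence, and the gap between mean confidence and accuracy in each bin is averaged with weights proportional to bin size. The function name, the default of 10 bins, and the half-open binning convention are assumptions of this sketch and are not taken from the paper, which may use a different variant.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Equal-width-bin ECE (illustrative sketch, not the paper's code).

    confidences: predicted probability of the chosen label, each in [0, 1].
    correct:     booleans, True where the chosen label was right.
    n_bins:      number of equal-width confidence bins (assumed default: 10).
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    n = len(confidences)
    if n == 0:
        return 0.0

    # Per-bin running sums of confidence, number correct, and sample count.
    conf_sum = [0.0] * n_bins
    hit_sum = [0.0] * n_bins
    count = [0] * n_bins

    for c, ok in zip(confidences, correct):
        # Bins are [k/n_bins, (k+1)/n_bins); confidence 1.0 goes to the last bin.
        idx = min(int(c * n_bins), n_bins - 1)
        conf_sum[idx] += c
        hit_sum[idx] += 1.0 if ok else 0.0
        count[idx] += 1

    # (count/n) * |hits/count - conf/count| simplifies to |hits - conf| / n.
    return sum(
        abs(hit_sum[k] - conf_sum[k]) / n for k in range(n_bins) if count[k]
    )


if __name__ == "__main__":
    # Four predictions at 90% confidence, three of them right.
    print(expected_calibration_error([0.9, 0.9, 0.9, 0.9], [True, True, True, False]))
```

A lower value means that stated confidence tracks observed accuracy more closely; an overconfident model, the failure mode the abstract describes for short inputs, shows mean confidence above accuracy in its high-confidence bins.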

Comments:	10 pages, 1 figure
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2504.06037 [cs.CL]
	(or arXiv:2504.06037v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2504.06037

Submission history

From: Seunghyun Ji
[v1] Tue, 8 Apr 2025 13:37:08 UTC (601 KB)
[v2] Wed, 9 Apr 2025 02:32:58 UTC (607 KB)
