A Chinese Spelling Check Framework Based on Reverse Contrastive Learning

Lin, Nankai; Wu, Hongyan; Fu, Sihui; Jiang, Shengyi; Yang, Aimin

Computer Science > Computation and Language

arXiv:2210.13823 (cs)

[Submitted on 25 Oct 2022 (v1), last revised 6 Jul 2023 (this version, v2)]

Title:A Chinese Spelling Check Framework Based on Reverse Contrastive Learning

Authors:Nankai Lin, Hongyan Wu, Sihui Fu, Shengyi Jiang, Aimin Yang

View PDF

Abstract:Chinese spelling check is a task to detect and correct spelling mistakes in Chinese text. Existing research aims to enhance the text representation and use multi-source information to improve the detection and correction capabilities of models, but does not pay too much attention to improving their ability to distinguish between confusable words. Contrastive learning, whose aim is to minimize the distance in representation space between similar sample pairs, has recently become a dominant technique in natural language processing. Inspired by contrastive learning, we present a novel framework for Chinese spelling checking, which consists of three modules: language representation, spelling check and reverse contrastive learning. Specifically, we propose a reverse contrastive learning strategy, which explicitly forces the model to minimize the agreement between the similar examples, namely, the phonetically and visually confusable characters. Experimental results show that our framework is model-agnostic and could be combined with existing Chinese spelling check models to yield state-of-the-art performance.

Comments:	11 pages
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.13823 [cs.CL]
	(or arXiv:2210.13823v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.13823

Submission history

From: Nankai Lin [view email]
[v1] Tue, 25 Oct 2022 08:05:38 UTC (223 KB)
[v2] Thu, 6 Jul 2023 07:34:14 UTC (218 KB)

Computer Science > Computation and Language

Title:A Chinese Spelling Check Framework Based on Reverse Contrastive Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:A Chinese Spelling Check Framework Based on Reverse Contrastive Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators