Towards Accurate Translation via Semantically Appropriate Application of Lexical Constraints

Baek, Yujin; Lee, Koanho; Ki, Dayeon; Lee, Hyoung-Gyu; Park, Cheonbok; Choo, Jaegul

Computer Science > Computation and Language

arXiv:2306.12089 (cs)

[Submitted on 21 Jun 2023]

Title:Towards Accurate Translation via Semantically Appropriate Application of Lexical Constraints

Authors:Yujin Baek (1), Koanho Lee (1), Dayeon Ki (2), Hyoung-Gyu Lee (3), Cheonbok Park (3), Jaegul Choo (1) ((1) KAIST, (2) Korea University, (3) Papago, Naver Corp.)

View PDF

Abstract:Lexically-constrained NMT (LNMT) aims to incorporate user-provided terminology into translations. Despite its practical advantages, existing work has not evaluated LNMT models under challenging real-world conditions. In this paper, we focus on two important but under-studied issues that lie in the current evaluation process of LNMT studies. The model needs to cope with challenging lexical constraints that are "homographs" or "unseen" during training. To this end, we first design a homograph disambiguation module to differentiate the meanings of homographs. Moreover, we propose PLUMCOT, which integrates contextually rich information about unseen lexical constraints from pre-trained language models and strengthens a copy mechanism of the pointer network via direct supervision of a copying score. We also release HOLLY, an evaluation benchmark for assessing the ability of a model to cope with "homographic" and "unseen" lexical constraints. Experiments on HOLLY and the previous test setup show the effectiveness of our method. The effects of PLUMCOT are shown to be remarkable in "unseen" constraints. Our dataset is available at this https URL

Comments:	Findings of ACL2023. 15 pages
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2306.12089 [cs.CL]
	(or arXiv:2306.12089v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2306.12089

Submission history

From: Yujin Baek [view email]
[v1] Wed, 21 Jun 2023 08:08:15 UTC (1,085 KB)

Computer Science > Computation and Language

Title:Towards Accurate Translation via Semantically Appropriate Application of Lexical Constraints

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Towards Accurate Translation via Semantically Appropriate Application of Lexical Constraints

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators