Improved Data Augmentation for Translation Suggestion

Zhang, Hongxiao; Lai, Siyu; Zhang, Songming; Huang, Hui; Chen, Yufeng; Xu, Jinan; Liu, Jian

Computer Science > Computation and Language

arXiv:2210.06138 (cs)

[Submitted on 12 Oct 2022]

Title:Improved Data Augmentation for Translation Suggestion

Authors:Hongxiao Zhang, Siyu Lai, Songming Zhang, Hui Huang, Yufeng Chen, Jinan Xu, Jian Liu

View PDF

Abstract:Translation suggestion (TS) models are used to automatically provide alternative suggestions for incorrect spans in sentences generated by machine translation. This paper introduces the system used in our submission to the WMT'22 Translation Suggestion shared task. Our system is based on the ensemble of different translation architectures, including Transformer, SA-Transformer, and DynamicConv. We use three strategies to construct synthetic data from parallel corpora to compensate for the lack of supervised data. In addition, we introduce a multi-phase pre-training strategy, adding an additional pre-training phase with in-domain data. We rank second and third on the English-German and English-Chinese bidirectional tasks, respectively.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.06138 [cs.CL]
	(or arXiv:2210.06138v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.06138

Submission history

From: Hongxiao Zhang [view email]
[v1] Wed, 12 Oct 2022 12:46:43 UTC (94 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2022-10

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:Improved Data Augmentation for Translation Suggestion

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Improved Data Augmentation for Translation Suggestion

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators