Modeling Homophone Noise for Robust Neural Machine Translation

Qin, Wenjie; Li, Xiang; Sun, Yuhui; Xiong, Deyi; Cui, Jianwei; Wang, Bin

Computer Science > Computation and Language

arXiv:2012.08396 (cs)

[Submitted on 15 Dec 2020]

Title:Modeling Homophone Noise for Robust Neural Machine Translation

Authors:Wenjie Qin, Xiang Li, Yuhui Sun, Deyi Xiong, Jianwei Cui, Bin Wang

View PDF

Abstract:In this paper, we propose a robust neural machine translation (NMT) framework. The framework consists of a homophone noise detector and a syllable-aware NMT model to homophone errors. The detector identifies potential homophone errors in a textual sentence and converts them into syllables to form a mixed sequence that is then fed into the syllable-aware NMT. Extensive experiments on Chinese->English translation demonstrate that our proposed method not only significantly outperforms baselines on noisy test sets with homophone noise, but also achieves a substantial improvement on clean text.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2012.08396 [cs.CL]
	(or arXiv:2012.08396v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2012.08396

Submission history

From: Xiang Li [view email]
[v1] Tue, 15 Dec 2020 16:12:04 UTC (55 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2020-12

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Wenjie Qin
Xiang Li
Deyi Xiong
Bin Wang

export BibTeX citation

Computer Science > Computation and Language

Title:Modeling Homophone Noise for Robust Neural Machine Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Modeling Homophone Noise for Robust Neural Machine Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators