Non-Autoregressive Document-Level Machine Translation

Bao, Guangsheng; Teng, Zhiyang; Zhou, Hao; Yan, Jianhao; Zhang, Yue

Computer Science > Computation and Language

arXiv:2305.12878 (cs)

[Submitted on 22 May 2023 (v1), last revised 9 Dec 2023 (this version, v3)]

Title:Non-Autoregressive Document-Level Machine Translation

Authors:Guangsheng Bao, Zhiyang Teng, Hao Zhou, Jianhao Yan, Yue Zhang

View PDF HTML (experimental)

Abstract:Non-autoregressive translation (NAT) models achieve comparable performance and superior speed compared to auto-regressive translation (AT) models in the context of sentence-level machine translation (MT). However, their abilities are unexplored in document-level MT, hindering their usage in real scenarios. In this paper, we conduct a comprehensive examination of typical NAT models in the context of document-level MT and further propose a simple but effective design of sentence alignment between source and target. Experiments show that NAT models achieve high acceleration on documents, and sentence alignment significantly enhances their performance.
However, current NAT models still have a significant performance gap compared to their AT counterparts. Further investigation reveals that NAT models suffer more from the multi-modality and misalignment issues in the context of document-level MT, and current NAT models struggle with exploiting document context and handling discourse phenomena. We delve into these challenges and provide our code at \url{this https URL}.

Comments:	EMNLP2023 Findings camera-ready version. Review soundness 443 and excitement 443
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2305.12878 [cs.CL]
	(or arXiv:2305.12878v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.12878

Submission history

From: Guangsheng Bao [view email]
[v1] Mon, 22 May 2023 09:59:59 UTC (32 KB)
[v2] Sun, 8 Oct 2023 08:03:11 UTC (296 KB)
[v3] Sat, 9 Dec 2023 11:31:32 UTC (301 KB)

Computer Science > Computation and Language

Title:Non-Autoregressive Document-Level Machine Translation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Non-Autoregressive Document-Level Machine Translation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators