Unified Model Learning for Various Neural Machine Translation

Liang, Yunlong; Meng, Fandong; Xu, Jinan; Wang, Jiaan; Chen, Yufeng; Zhou, Jie

Computer Science > Computation and Language

arXiv:2305.02777 (cs)

[Submitted on 4 May 2023 (v1), last revised 18 May 2023 (this version, v2)]

Title:Unified Model Learning for Various Neural Machine Translation

Authors:Yunlong Liang, Fandong Meng, Jinan Xu, Jiaan Wang, Yufeng Chen, Jie Zhou

View PDF

Abstract:Existing neural machine translation (NMT) studies mainly focus on developing dataset-specific models based on data from different tasks (e.g., document translation and chat translation). Although the dataset-specific models have achieved impressive performance, it is cumbersome as each dataset demands a model to be designed, trained, and stored. In this work, we aim to unify these translation tasks into a more general setting. Specifically, we propose a ``versatile'' model, i.e., the Unified Model Learning for NMT (UMLNMT) that works with data from different tasks, and can translate well in multiple settings simultaneously, and theoretically it can be as many as possible. Through unified learning, UMLNMT is able to jointly train across multiple tasks, implementing intelligent on-demand translation. On seven widely-used translation tasks, including sentence translation, document translation, and chat translation, our UMLNMT results in substantial improvements over dataset-specific models with significantly reduced model deployment costs. Furthermore, UMLNMT can achieve competitive or better performance than state-of-the-art dataset-specific methods. Human evaluation and in-depth analysis also demonstrate the superiority of our approach on generating diverse and high-quality translations. Additionally, we provide a new genre translation dataset about famous aphorisms with 186k Chinese->English sentence pairs.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2305.02777 [cs.CL]
	(or arXiv:2305.02777v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.02777

Submission history

From: Yunlong Liang [view email]
[v1] Thu, 4 May 2023 12:21:52 UTC (1,049 KB)
[v2] Thu, 18 May 2023 11:53:14 UTC (1,049 KB)

Computer Science > Computation and Language

Title:Unified Model Learning for Various Neural Machine Translation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Unified Model Learning for Various Neural Machine Translation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators