Exploiting Linguistic Resources for Neural Machine Translation Using Multi-task Learning

Niehues, Jan; Cho, Eunah

Computer Science > Computation and Language

arXiv:1708.00993 (cs)

[Submitted on 3 Aug 2017]

Title:Exploiting Linguistic Resources for Neural Machine Translation Using Multi-task Learning

Authors:Jan Niehues, Eunah Cho

View PDF

Abstract:Linguistic resources such as part-of-speech (POS) tags have been extensively used in statistical machine translation (SMT) frameworks and have yielded better performances. However, usage of such linguistic annotations in neural machine translation (NMT) systems has been left under-explored.
In this work, we show that multi-task learning is a successful and a easy approach to introduce an additional knowledge into an end-to-end neural attentional model. By jointly training several natural language processing (NLP) tasks in one system, we are able to leverage common information and improve the performance of the individual task.
We analyze the impact of three design decisions in multi-task learning: the tasks used in training, the training schedule, and the degree of parameter sharing across the tasks, which is defined by the network architecture. The experiments are conducted for an German to English translation task. As additional linguistic resources, we exploit POS information and named-entities (NE). Experiments show that the translation quality can be improved by up to 1.5 BLEU points under the low-resource condition. The performance of the POS tagger is also improved using the multi-task learning scheme.

Comments:	9 pages, Second Conference on Machine Translation(WMT17)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1708.00993 [cs.CL]
	(or arXiv:1708.00993v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1708.00993

Submission history

From: Jan Niehues [view email]
[v1] Thu, 3 Aug 2017 04:30:37 UTC (427 KB)

Computer Science > Computation and Language

Title:Exploiting Linguistic Resources for Neural Machine Translation Using Multi-task Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Exploiting Linguistic Resources for Neural Machine Translation Using Multi-task Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators