On the use of BERT for Neural Machine Translation

Clinchant, Stéphane; Jung, Kweon Woo; Nikoulina, Vassilina

arXiv:1909.12744 (cs)

[Submitted on 27 Sep 2019]

Abstract: Exploiting large pretrained models for various NMT tasks has gained a lot of visibility recently. In this work, we study how pretrained BERT models can be exploited for supervised Neural Machine Translation. We compare various ways of integrating a pretrained BERT model with an NMT model, and we study the impact of the monolingual data used for BERT training on final translation quality. We use the WMT-14 English-German, IWSLT15 English-German, and IWSLT14 English-Russian datasets for these experiments. In addition to evaluating on the standard test set of each task, we evaluate on out-of-domain test sets and noise-injected test sets in order to assess how pretrained BERT representations affect model robustness.

Comments:	10 pages
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1909.12744 [cs.CL]
	(or arXiv:1909.12744v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1909.12744

Submission history

From: Stéphane Clinchant
[v1] Fri, 27 Sep 2019 15:23:17 UTC (43 KB)
