Putting Machine Translation in Context with the Noisy Channel Model

Yu, Lei; Sartran, Laurent; Stokowiec, Wojciech; Ling, Wang; Kong, Lingpeng; Blunsom, Phil; Dyer, Chris

Computer Science > Computation and Language

arXiv:1910.00553v1 (cs)

[Submitted on 1 Oct 2019 (this version), latest version 2 Jul 2020 (v2)]

Title:Putting Machine Translation in Context with the Noisy Channel Model

Authors:Lei Yu, Laurent Sartran, Wojciech Stokowiec, Wang Ling, Lingpeng Kong, Phil Blunsom, Chris Dyer

View PDF

Abstract:We show that Bayes' rule provides a compelling mechanism for controlling unconditional document language models, using the long-standing challenge of effectively leveraging document context in machine translation. In our formulation, we estimate the probability of a candidate translation as the product of the unconditional probability of the candidate output document and the ``reverse translation probability'' of translating the candidate output back into the input source language document---the so-called ``noisy channel'' decomposition. A particular advantage of our model is that it requires only parallel sentences to train, rather than parallel documents, which are not always available. Using a new beam search reranking approximation to solve the decoding problem, we find that document language models outperform language models that assume independence between sentences, and that using either a document or sentence language model outperforms comparable models that directly estimate the translation probability. We obtain the best-published results on the NIST Chinese--English translation task, a standard task for evaluating document translation. Our model also outperforms the benchmark Transformer model by approximately 2.5 BLEU on the WMT19 Chinese--English translation task.

Comments:	14 pages
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1910.00553 [cs.CL]
	(or arXiv:1910.00553v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1910.00553

Submission history

From: Lei Yu [view email]
[v1] Tue, 1 Oct 2019 17:30:56 UTC (946 KB)
[v2] Thu, 2 Jul 2020 10:47:17 UTC (987 KB)

Computer Science > Computation and Language

Title:Putting Machine Translation in Context with the Noisy Channel Model

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Putting Machine Translation in Context with the Noisy Channel Model

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators