Joint Source-Target Self Attention with Locality Constraints

Fonollosa, José A. R.; Casas, Noe; Costa-jussà, Marta R.

Computer Science > Computation and Language

arXiv:1905.06596 (cs)

[Submitted on 16 May 2019]

Title:Joint Source-Target Self Attention with Locality Constraints

Authors:José A. R. Fonollosa, Noe Casas, Marta R. Costa-jussà

View PDF

Abstract:The dominant neural machine translation models are based on the encoder-decoder structure, and many of them rely on an unconstrained receptive field over source and target sequences. In this paper we study a new architecture that breaks with both conventions. Our simplified architecture consists in the decoder part of a transformer model, based on self-attention, but with locality constraints applied on the attention receptive field. As input for training, both source and target sentences are fed to the network, which is trained as a language model. At inference time, the target tokens are predicted autoregressively starting with the source sequence as previous tokens. The proposed model achieves a new state of the art of 35.7 BLEU on IWSLT'14 German-English and matches the best reported results in the literature on the WMT'14 English-German and WMT'14 English-French translation benchmarks.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1905.06596 [cs.CL]
	(or arXiv:1905.06596v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1905.06596

Submission history

From: José A. R. Fonollosa [view email]
[v1] Thu, 16 May 2019 08:35:12 UTC (60 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-05

Change to browse by:

cs
cs.CL

References & Citations

DBLP - CS Bibliography

listing | bibtex

José A. R. Fonollosa
Noe Casas
Marta R. Costa-jussà

export BibTeX citation

Computer Science > Computation and Language

Title:Joint Source-Target Self Attention with Locality Constraints

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Joint Source-Target Self Attention with Locality Constraints

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators