Learning compositionally through attentive guidance

Hupkes, Dieuwke; Singh, Anand; Korrel, Kris; Kruszewski, German; Bruni, Elia

Computer Science > Computation and Language

arXiv:1805.09657 (cs)

[Submitted on 20 May 2018 (v1), last revised 5 Jul 2019 (this version, v4)]

Title:Learning compositionally through attentive guidance

Authors:Dieuwke Hupkes, Anand Singh, Kris Korrel, German Kruszewski, Elia Bruni

View PDF

Abstract:While neural network models have been successfully applied to domains that require substantial generalisation skills, recent studies have implied that they struggle when solving the task they are trained on requires inferring its underlying compositional structure. In this paper, we introduce Attentive Guidance, a mechanism to direct a sequence to sequence model equipped with attention to find more compositional solutions. We test it on two tasks, devised precisely to assess the compositional capabilities of neural models, and we show that vanilla sequence to sequence models with attention overfit the training distribution, while the guided versions come up with compositional solutions that fit the training and testing distributions almost equally well. Moreover, the learned solutions generalise even in cases where the training and testing distributions strongly diverge. In this way, we demonstrate that sequence to sequence models are capable of finding compositional solutions without requiring extra components. These results helps to disentangle the causes for the lack of systematic compositionality in neural networks, which can in turn fuel future work.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1805.09657 [cs.CL]
	(or arXiv:1805.09657v4 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1805.09657

Submission history

From: Dieuwke Hupkes [view email]
[v1] Sun, 20 May 2018 10:33:00 UTC (3,592 KB)
[v2] Fri, 7 Sep 2018 09:46:30 UTC (4,819 KB)
[v3] Mon, 10 Sep 2018 12:02:27 UTC (3,465 KB)
[v4] Fri, 5 Jul 2019 12:41:30 UTC (5,035 KB)

Computer Science > Computation and Language

Title:Learning compositionally through attentive guidance

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Learning compositionally through attentive guidance

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators