CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning

Suglia, Alessandro; Konstas, Ioannis; Vanzo, Andrea; Bastianelli, Emanuele; Elliott, Desmond; Frank, Stella; Lemon, Oliver

Computer Science > Computation and Language

arXiv:2006.02174 (cs)

[Submitted on 3 Jun 2020]

Title:CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning

Authors:Alessandro Suglia, Ioannis Konstas, Andrea Vanzo, Emanuele Bastianelli, Desmond Elliott, Stella Frank, Oliver Lemon

View PDF

Abstract:Approaches to Grounded Language Learning typically focus on a single task-based final performance measure that may not depend on desirable properties of the learned hidden representations, such as their ability to predict salient attributes or to generalise to unseen situations. To remedy this, we present GROLLA, an evaluation framework for Grounded Language Learning with Attributes with three sub-tasks: 1) Goal-oriented evaluation; 2) Object attribute prediction evaluation; and 3) Zero-shot evaluation. We also propose a new dataset CompGuessWhat?! as an instance of this framework for evaluating the quality of learned neural representations, in particular concerning attribute grounding. To this end, we extend the original GuessWhat?! dataset by including a semantic layer on top of the perceptual one. Specifically, we enrich the VisualGenome scene graphs associated with the GuessWhat?! images with abstract and situated attributes. By using diagnostic classifiers, we show that current models learn representations that are not expressive enough to encode object attributes (average F1 of 44.27). In addition, they do not learn strategies nor representations that are robust enough to perform well when novel scenes or objects are involved in gameplay (zero-shot best accuracy 50.06%).

Comments:	Accepted to the Annual Conference of the Association for Computational Linguistics (ACL) 2020
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2006.02174 [cs.CL]
	(or arXiv:2006.02174v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2006.02174

Submission history

From: Alessandro Suglia [view email]
[v1] Wed, 3 Jun 2020 11:21:42 UTC (2,904 KB)

Computer Science > Computation and Language

Title:CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators