Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations

Zhu, Wanrong; Wang, Xin Eric; Narayana, Pradyumna; Sone, Kazoo; Basu, Sugato; Wang, William Yang

arXiv:2010.03644 (cs)

[Submitted on 7 Oct 2020]

Abstract: A major challenge in visually grounded language generation is to build robust benchmark datasets and models that generalize well in real-world settings. To do this, it is critical to ensure that our evaluation protocols are correct and our benchmarks are reliable. In this work, we design a set of experiments to understand an important but often ignored problem in visually grounded language generation: given that humans have different utilities and visual attention, how does the sample variance in multi-reference datasets affect models' performance? Empirically, we study several multi-reference datasets and the corresponding vision-and-language tasks. We show that it is of paramount importance to report variance in experiments; that human-generated references can vary drastically across datasets and tasks, revealing the nature of each task; and that, metric-wise, CIDEr shows systematically larger variance than other metrics. Our evaluations on reference-per-instance shed light on the design of reliable datasets in the future.

Comments:	EMNLP 2020
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2010.03644 [cs.CL]
	(or arXiv:2010.03644v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2010.03644

Submission history

From: Wanrong Zhu
[v1] Wed, 7 Oct 2020 20:45:14 UTC (9,426 KB)
