On the use of human reference data for evaluating automatic image descriptions

van Miltenburg, Emiel

Computer Science > Computation and Language

arXiv:2006.08792 (cs)

[Submitted on 15 Jun 2020]

Title:On the use of human reference data for evaluating automatic image descriptions

Authors:Emiel van Miltenburg

View PDF

Abstract:Automatic image description systems are commonly trained and evaluated using crowdsourced, human-generated image descriptions. The best-performing system is then determined using some measure of similarity to the reference data (BLEU, Meteor, CIDER, etc). Thus, both the quality of the systems as well as the quality of the evaluation depends on the quality of the descriptions. As Section 2 will show, the quality of current image description datasets is insufficient. I argue that there is a need for more detailed guidelines that take into account the needs of visually impaired users, but also the feasibility of generating suitable descriptions. With high-quality data, evaluation of image description systems could use reference descriptions, but we should also look for alternatives.

Comments:	Originally presented as a (non-archival) poster at the VizWiz 2020 workshop, collocated with CVPR 2020. See: this https URL
Subjects:	Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2006.08792 [cs.CL]
	(or arXiv:2006.08792v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2006.08792

Submission history

From: Emiel Van Miltenburg [view email]
[v1] Mon, 15 Jun 2020 21:57:27 UTC (67 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2020-06

Change to browse by:

cs
cs.CV
cs.HC

References & Citations

DBLP - CS Bibliography

listing | bibtex

Emiel van Miltenburg

export BibTeX citation

Computer Science > Computation and Language

Title:On the use of human reference data for evaluating automatic image descriptions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:On the use of human reference data for evaluating automatic image descriptions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators