How to Evaluate Your Dialogue Models: A Review of Approaches

Li, Xinmeng; Wu, Wansen; Qin, Long; Yin, Quanjun

arXiv:2108.01369 (cs)

[Submitted on 3 Aug 2021]

Abstract: Evaluating the quality of a dialogue system is an understudied problem. The recent evolution of evaluation methods motivated this survey, which seeks an explicit and comprehensive analysis of the existing methods. We first divide the evaluation methods into three classes: automatic evaluation, human-involved evaluation, and user-simulator-based evaluation. Each class is then covered with its main features and related evaluation metrics. Existing benchmarks suitable for evaluating dialogue techniques are also discussed in detail. Finally, some open issues are pointed out to bring evaluation methods to a new frontier.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2108.01369 [cs.CL]
	(or arXiv:2108.01369v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2108.01369

Submission history

From: Xinmeng Li
[v1] Tue, 3 Aug 2021 08:52:33 UTC (1,358 KB)
