Evaluating Conversational Recommender Systems via User Simulation

Zhang, Shuo; Balog, Krisztian

doi:10.1145/3394486.3403202

arXiv:2006.08732 (cs)

[Submitted on 15 Jun 2020]

Abstract: Conversational information access is an emerging research area. Currently, human evaluation is used for end-to-end system evaluation, which is both time- and resource-intensive at scale and thus becomes a bottleneck to progress. As an alternative, we propose automated evaluation by means of simulating users. Our user simulator aims to generate responses that a real human would give by considering both individual preferences and the general flow of interaction with the system. We evaluate our simulation approach on an item recommendation task by comparing three existing conversational recommender systems. We show that preference modeling and task-specific interaction models both contribute to more realistic simulations, and can help achieve high correlation between automatic evaluation measures and manual human assessments.

Comments:	Proceedings of the 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '20), 2020
Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:2006.08732 [cs.IR]
	(or arXiv:2006.08732v1 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2006.08732
Related DOI:	https://doi.org/10.1145/3394486.3403202

Submission history

From: Shuo Zhang
[v1] Mon, 15 Jun 2020 20:05:39 UTC (959 KB)
