Evaluating Chatbots to Promote Users' Trust -- Practices and Open Problems

Srivastava, Biplav; Lakkaraju, Kausik; Koppel, Tarmo; Narayanan, Vignesh; Kundu, Ashish; Joshi, Sachindra

Computer Science > Human-Computer Interaction

arXiv:2309.05680 (cs)

[Submitted on 9 Sep 2023 (v1), last revised 14 Sep 2023 (this version, v2)]

Title:Evaluating Chatbots to Promote Users' Trust -- Practices and Open Problems

Authors:Biplav Srivastava, Kausik Lakkaraju, Tarmo Koppel, Vignesh Narayanan, Ashish Kundu, Sachindra Joshi

View PDF

Abstract:Chatbots, the common moniker for collaborative assistants, are Artificial Intelligence (AI) software that enables people to naturally interact with them to get tasks done. Although chatbots have been studied since the dawn of AI, they have particularly caught the imagination of the public and businesses since the launch of easy-to-use and general-purpose Large Language Model-based chatbots like ChatGPT. As businesses look towards chatbots as a potential technology to engage users, who may be end customers, suppliers, or even their own employees, proper testing of chatbots is important to address and mitigate issues of trust related to service or product performance, user satisfaction and long-term unintended consequences for society. This paper reviews current practices for chatbot testing, identifies gaps as open problems in pursuit of user trust, and outlines a path forward.

Subjects:	Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
Cite as:	arXiv:2309.05680 [cs.HC]
	(or arXiv:2309.05680v2 [cs.HC] for this version)
	https://doi.org/10.48550/arXiv.2309.05680

Submission history

From: Kausik Lakkaraju [view email]
[v1] Sat, 9 Sep 2023 22:40:30 UTC (2,668 KB)
[v2] Thu, 14 Sep 2023 01:38:49 UTC (2,668 KB)

Computer Science > Human-Computer Interaction

Title:Evaluating Chatbots to Promote Users' Trust -- Practices and Open Problems

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Human-Computer Interaction

Title:Evaluating Chatbots to Promote Users' Trust -- Practices and Open Problems

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators