Evaluation and Continual Improvement for an Enterprise AI Assistant

Maharaj, Akash V.; Qian, Kun; Bhattacharya, Uttaran; Fang, Sally; Galatanu, Horia; Garg, Manas; Hanessian, Rachel; Kapoor, Nishant; Russell, Ken; Vaithyanathan, Shivakumar; Li, Yunyao

arXiv:2407.12003 (cs)

[Submitted on 15 Jun 2024]

Abstract: The development of conversational AI assistants is an iterative process with multiple components. As such, the evaluation and continual improvement of these assistants is a complex and multifaceted problem. This paper introduces the challenges in evaluating and improving a generative AI assistant for enterprises, which is under active development, and how we address these challenges. We also share preliminary results and discuss lessons learned.

Comments:	Accepted to DaSH Workshop at NAACL 2024
Subjects:	Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2407.12003 [cs.HC]
	(or arXiv:2407.12003v1 [cs.HC] for this version)
	https://doi.org/10.48550/arXiv.2407.12003

Submission history

From: Kun Qian
[v1] Sat, 15 Jun 2024 06:00:23 UTC (2,087 KB)
