GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants

Fischer, Sophie; Gemmell, Carlos; Tecklenburg, Niklas; Mackie, Iain; Rossetto, Federico; Dalton, Jeffrey

doi:10.1145/3637528.3671622

Computer Science > Information Retrieval

arXiv:2402.07647 (cs)

[Submitted on 12 Feb 2024 (v1), last revised 28 Jun 2024 (this version, v2)]

Title:GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants

Authors:Sophie Fischer, Carlos Gemmell, Niklas Tecklenburg, Iain Mackie, Federico Rossetto, Jeffrey Dalton

View PDF HTML (experimental)

Abstract:We tackle the challenge of building real-world multimodal assistants for complex real-world tasks. We describe the practicalities and challenges of developing and deploying GRILLBot, a leading (first and second prize winning in 2022 and 2023) system deployed in the Alexa Prize TaskBot Challenge. Building on our Open Assistant Toolkit (OAT) framework, we propose a hybrid architecture that leverages Large Language Models (LLMs) and specialised models tuned for specific subtasks requiring very low latency. OAT allows us to define when, how and which LLMs should be used in a structured and deployable manner. For knowledge-grounded question answering and live task adaptations, we show that LLM reasoning abilities over task context and world knowledge outweigh latency concerns. For dialogue state management, we implement a code generation approach and show that specialised smaller models have 84% effectiveness with 100x lower latency. Overall, we provide insights and discuss tradeoffs for deploying both traditional models and LLMs to users in complex real-world multimodal environments in the Alexa TaskBot challenge. These experiences will continue to evolve as LLMs become more capable and efficient -- fundamentally reshaping OAT and future assistant architectures.

Comments:	11 pages, KDD Preprint
Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:2402.07647 [cs.IR]
	(or arXiv:2402.07647v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2402.07647
Related DOI:	https://doi.org/10.1145/3637528.3671622

Submission history

From: Sophie Fischer [view email]
[v1] Mon, 12 Feb 2024 13:42:11 UTC (2,449 KB)
[v2] Fri, 28 Jun 2024 05:32:23 UTC (2,446 KB)

Computer Science > Information Retrieval

Title:GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators