Multi-LLM QA with Embodied Exploration

Patel, Bhrij; Dorbala, Vishnu Sashank; Bedi, Amrit Singh; Manocha, Dinesh

arXiv:2406.10918 (cs)

[Submitted on 16 Jun 2024 (v1), last revised 18 Oct 2024 (this version, v5)]



Abstract: Large language models (LLMs) have grown in popularity due to their natural language interface and pre-trained knowledge, leading to rapidly increasing success in question-answering (QA) tasks. More recently, multi-agent systems with LLM-based agents (Multi-LLM) have been used increasingly for QA. In these scenarios, the models may each answer the question and reach a consensus, or each model may be specialized to answer questions from a different domain. However, most prior work on Multi-LLM QA has focused on scenarios where the models are queried in a zero-shot manner or are given information sources from which to extract the answer. To answer questions about an unknown environment, embodied exploration of that environment is first needed. This skill is necessary for personalizing embodied AI to environments such as households. There is a lack of insight into whether a Multi-LLM system can handle question answering based on observations from embodied exploration. In this work, we address this gap by investigating the use of Multi-Embodied LLM Explorers (MELE) for QA in an unknown environment. Multiple LLM-based agents independently explore and then answer queries about a household environment. We analyze different aggregation methods to generate a single, final answer for each query: debating, majority voting, and training a central answer module (CAM). Using CAM, we observe $46\%$ higher accuracy compared with the non-learning-based aggregation methods. We provide code and the query dataset for further research.
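
Of the aggregation methods named in the abstract, majority voting is the simplest to illustrate. The following is a minimal sketch, not the authors' implementation: the function name `majority_vote`, the free-text answer format, and the tie-breaking rule (the answer seen first wins) are all assumptions made for illustration.

```python
from collections import Counter


def majority_vote(answers):
    """Return the most frequent answer among the agents.

    Assumption: answers are short strings that can be compared after
    trimming whitespace and lowercasing. Ties go to the answer that
    appears first in the list.
    """
    if not answers:
        raise ValueError("need at least one agent answer")
    normalized = [a.strip().lower() for a in answers]
    # most_common(1) returns a list holding the single (answer, count) pair
    return Counter(normalized).most_common(1)[0][0]


# Hypothetical answers from three independently exploring agents
agent_answers = ["Yes", "no", "yes "]
final_answer = majority_vote(agent_answers)
print(final_answer)
```

Unlike a learned module such as CAM, this rule weighs every agent equally, regardless of how much of the environment each agent actually observed.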

Comments:	16 pages, 9 Figures, 5 Tables
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2406.10918 [cs.LG]
	(or arXiv:2406.10918v5 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.10918

Submission history

From: Bhrij Patel
[v1] Sun, 16 Jun 2024 12:46:40 UTC (4,384 KB)
[v2] Tue, 18 Jun 2024 01:18:46 UTC (4,384 KB)
[v3] Tue, 25 Jun 2024 10:50:09 UTC (4,863 KB)
[v4] Mon, 16 Sep 2024 07:12:12 UTC (1,819 KB)
[v5] Fri, 18 Oct 2024 12:27:07 UTC (5,821 KB)
