Efficient Sequential Decision Making with Large Language Models

Chen, Dingyang; Zhang, Qi; Zhu, Yinglun

arXiv:2406.12125 (cs)

[Submitted on 17 Jun 2024]

Abstract: This paper focuses on extending the success of large language models (LLMs) to sequential decision making. Existing efforts either (i) retrain or fine-tune LLMs for decision making, or (ii) design prompts for pretrained LLMs. The former approach suffers from the computational burden of gradient updates, and the latter approach does not show promising results. In this paper, we propose a new approach that leverages online model selection algorithms to efficiently incorporate LLM agents into sequential decision making. Statistically, our approach significantly outperforms both traditional decision making algorithms and vanilla LLM agents. Computationally, our approach avoids the need for expensive gradient updates of LLMs, and throughout the decision making process, it requires only a small number of LLM calls. We conduct extensive experiments to verify the effectiveness of our proposed approach. As an example, on a large-scale Amazon dataset, our approach achieves more than a 6x performance gain over baselines while calling LLMs in only 1.5% of the time steps.
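
The abstract does not spell out which online model selection algorithm is used, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes an Exp3-style selector that, at each step, picks between two base policies: a stubbed LLM policy and a UCB1 bandit learner. The names `run_model_selection`, `ucb1_arm`, and `llm_policy`, the Bernoulli reward simulation, and the parameter `gamma` are all hypothetical, and no real LLM is called.

```python
import math
import random


def ucb1_arm(counts, sums, t):
    """Pick an arm with the UCB1 rule; untried arms are pulled first."""
    for arm, count in enumerate(counts):
        if count == 0:
            return arm
    return max(
        range(len(counts)),
        key=lambda a: sums[a] / counts[a] + math.sqrt(2 * math.log(t) / counts[a]),
    )


def run_model_selection(arm_means, horizon, llm_policy, seed=0, gamma=0.1):
    """Exp3-style choice between a stubbed LLM policy (index 0) and UCB1 (index 1).

    Assumption: rewards are simulated Bernoulli draws with means `arm_means`.
    Returns (total_reward, number_of_llm_calls).
    """
    rng = random.Random(seed)
    counts = [0] * len(arm_means)   # pulls per arm, used by UCB1
    sums = [0.0] * len(arm_means)   # reward totals per arm, used by UCB1
    log_weights = [0.0, 0.0]        # selector weights over the two base policies
    total_reward, llm_calls = 0.0, 0

    for t in range(1, horizon + 1):
        # Softmax of the log-weights, mixed with uniform exploration.
        top = max(log_weights)
        exp_w = [math.exp(w - top) for w in log_weights]
        norm = sum(exp_w)
        probs = [(1 - gamma) * e / norm + gamma / 2 for e in exp_w]

        base = 0 if rng.random() < probs[0] else 1
        if base == 0:
            arm = llm_policy(t)     # hypothetical stand-in for an LLM call
            llm_calls += 1
        else:
            arm = ucb1_arm(counts, sums, t)

        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        total_reward += reward

        # UCB1 learns from every observed reward, including LLM-chosen arms.
        counts[arm] += 1
        sums[arm] += reward

        # Importance-weighted Exp3 update for the base policy that acted.
        log_weights[base] += (gamma / 2) * reward / probs[base]

    return total_reward, llm_calls
```

For instance, `run_model_selection([0.1, 0.2, 0.9], 1000, lambda t: 2)` simulates a three-armed problem in which the stubbed LLM policy always proposes arm 2. Because this sketch keeps a fixed exploration rate `gamma`, it does not attempt to reproduce the small LLM call budget reported in the abstract.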

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:2406.12125 [cs.LG]
	(or arXiv:2406.12125v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.12125

Submission history

From: Yinglun Zhu
[v1] Mon, 17 Jun 2024 22:13:22 UTC (590 KB)
