Uncertainty-Guided Optimization on Large Language Model Search Trees

Grosse, Julia; Wu, Ruotian; Rashid, Ahmad; Hennig, Philipp; Poupart, Pascal; Kristiadi, Agustinus

Computer Science > Machine Learning

arXiv:2407.03951 (cs)

[Submitted on 4 Jul 2024 (v1), last revised 9 Oct 2024 (this version, v2)]

Title:Uncertainty-Guided Optimization on Large Language Model Search Trees

Authors:Julia Grosse, Ruotian Wu, Ahmad Rashid, Philipp Hennig, Pascal Poupart, Agustinus Kristiadi

View PDF HTML (experimental)

Abstract:Tree search algorithms such as greedy and beam search are the standard when it comes to finding sequences of maximum likelihood in the decoding processes of large language models (LLMs). However, they are myopic since they do not take the complete root-to-leaf path into account. Moreover, they are agnostic to prior knowledge available about the process: For example, it does not consider that the objective being maximized is a probability and thereby has specific properties like being bound in the unit interval. Taking a probabilistic approach, we define prior beliefs over LLMs' transition probabilities and obtain posterior beliefs over the most promising paths in each iteration. These beliefs are useful for defining a sample-based, non-myopic acquisition function that allows for a more data-efficient exploration scheme than standard search algorithms on LLMs. Crucially, unlike expensive simulation-based non-myopic methods like the Monte Carlo tree search, our method only requires samples from the beliefs. Our formulation thus views LLM decoding as Bayesian optimization on trees. We discuss how to select the prior and the acquisition function, and demonstrate in experiments with various LLMs that our method achieves higher efficiency than recent baselines: Our method achieves the same or a higher likelihood while expanding fewer nodes.

Comments:	10 pages
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2407.03951 [cs.LG]
	(or arXiv:2407.03951v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.03951

Submission history

From: Julia Grosse [view email]
[v1] Thu, 4 Jul 2024 14:08:50 UTC (2,191 KB)
[v2] Wed, 9 Oct 2024 08:16:18 UTC (2,280 KB)

Computer Science > Machine Learning

Title:Uncertainty-Guided Optimization on Large Language Model Search Trees

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Uncertainty-Guided Optimization on Large Language Model Search Trees

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators