Functional multi-armed bandit and the best function identification problems

Dorn, Yuriy; Katrutsa, Aleksandr; Latypov, Ilgam; Soboleva, Anastasiia

Computer Science > Machine Learning

arXiv:2503.00509 (cs)

[Submitted on 1 Mar 2025]

Title:Functional multi-armed bandit and the best function identification problems

Authors:Yuriy Dorn, Aleksandr Katrutsa, Ilgam Latypov, Anastasiia Soboleva

View PDF HTML (experimental)

Abstract:Bandit optimization usually refers to the class of online optimization problems with limited feedback, namely, a decision maker uses only the objective value at the current point to make a new decision and does not have access to the gradient of the objective function. While this name accurately captures the limitation in feedback, it is somehow misleading since it does not have any connection with the multi-armed bandits (MAB) problem class. We propose two new classes of problems: the functional multi-armed bandit problem (FMAB) and the best function identification problem. They are modifications of a multi-armed bandit problem and the best arm identification problem, respectively, where each arm represents an unknown black-box function. These problem classes are a surprisingly good fit for modeling real-world problems such as competitive LLM training. To solve the problems from these classes, we propose a new reduction scheme to construct UCB-type algorithms, namely, the F-LCB algorithm, based on algorithms for nonlinear optimization with known convergence rates. We provide the regret upper bounds for this reduction scheme based on the base algorithms' convergence rates. We add numerical experiments that demonstrate the performance of the proposed scheme.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:2503.00509 [cs.LG]
	(or arXiv:2503.00509v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2503.00509

Submission history

From: Aleksandr Katrutsa [view email]
[v1] Sat, 1 Mar 2025 14:28:52 UTC (515 KB)

Computer Science > Machine Learning

Title:Functional multi-armed bandit and the best function identification problems

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Functional multi-armed bandit and the best function identification problems

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators