Thompson sampling for improved exploration in GFlowNets

Rector-Brooks, Jarrid; Madan, Kanika; Jain, Moksh; Korablyov, Maksym; Liu, Cheng-Hao; Chandar, Sarath; Malkin, Nikolay; Bengio, Yoshua

arXiv:2306.17693 (cs)

[Submitted on 30 Jun 2023]

Abstract: Generative flow networks (GFlowNets) are amortized variational inference algorithms that treat sampling from a distribution over compositional objects as a sequential decision-making problem with a learnable action policy. Unlike other algorithms for hierarchical sampling that optimize a variational bound, GFlowNet algorithms can stably run off-policy, which can be advantageous for discovering modes of the target distribution. Despite this flexibility in the choice of behaviour policy, the optimal way of efficiently selecting trajectories for training has not yet been systematically explored. In this paper, we view the choice of trajectories for training as an active learning problem and approach it using Bayesian techniques inspired by methods for multi-armed bandits. The proposed algorithm, Thompson sampling GFlowNets (TS-GFN), maintains an approximate posterior distribution over policies and samples trajectories from this posterior for training. We show in two domains that TS-GFN yields improved exploration and thus faster convergence to the target distribution than the off-policy exploration strategies used in past work.
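
For readers unfamiliar with the bandit technique the abstract refers to, the following is a minimal sketch of classical Thompson sampling on a toy Bernoulli bandit. It is not the TS-GFN algorithm and is not taken from the paper: the function name `thompson_sampling`, the Beta(1, 1) prior, the arm probabilities, and the seed are all illustrative assumptions. It only shows the core idea of keeping a posterior, sampling from it, and acting on the sample; TS-GFN applies that idea to a posterior over policies.

```python
import random


def thompson_sampling(true_probs, n_rounds, seed=0):
    """Beta-Bernoulli Thompson sampling on a toy bandit (illustrative only)."""
    rng = random.Random(seed)
    n_arms = len(true_probs)
    # Beta(1, 1) prior on each arm's success probability (an assumption).
    successes = [1] * n_arms
    failures = [1] * n_arms
    pulls = [0] * n_arms
    for _ in range(n_rounds):
        # Draw one plausible success rate per arm from its current posterior.
        samples = [rng.betavariate(successes[a], failures[a]) for a in range(n_arms)]
        # Act greedily with respect to the sampled rates.
        arm = max(range(n_arms), key=lambda a: samples[a])
        # Observe a Bernoulli reward and update that arm's posterior counts.
        reward = 1 if rng.random() < true_probs[arm] else 0
        successes[arm] += reward
        failures[arm] += 1 - reward
        pulls[arm] += 1
    return pulls


# Hypothetical three-armed bandit; the last arm has the highest success rate.
pulls = thompson_sampling([0.2, 0.5, 0.8], n_rounds=2000)
print(pulls)
```

Because each action comes from a posterior sample and not from a point estimate, uncertain arms are still tried occasionally, while pulls concentrate on the best arm as evidence accumulates.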

Comments:	Structured Probabilistic Inference and Generative Modeling (SPIGM) workshop @ ICML 2023
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2306.17693 [cs.LG]
	(or arXiv:2306.17693v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.17693

Submission history

From: Nikolay Malkin
[v1] Fri, 30 Jun 2023 14:19:44 UTC (2,745 KB)