Adaptive Sampling for Discovery

Xu, Ziping; Shim, Eunjae; Tewari, Ambuj; Zimmerman, Paul

arXiv:2205.14829 (stat)

[Submitted on 30 May 2022 (v1), last revised 2 Jan 2023 (this version, v3)]

Abstract: In this paper, we study a sequential decision-making problem called Adaptive Sampling for Discovery (ASD). Starting with a large unlabeled dataset, algorithms for ASD adaptively label the points with the goal of maximizing the sum of responses. This problem has wide applications to real-world discovery problems, for example, drug discovery with the help of machine learning models. ASD algorithms face the well-known exploration-exploitation dilemma: the algorithm needs to choose points that yield information to improve model estimates, but it also needs to exploit the model. We rigorously formulate the problem and propose a general information-directed sampling (IDS) algorithm. We provide theoretical guarantees for the performance of IDS in linear, graph, and low-rank models. The benefits of IDS are shown in both simulation experiments and real-data experiments for discovering chemical reaction conditions.
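
The ASD loop described in the abstract can be illustrated with a minimal sketch. It is not the paper's IDS algorithm: as a stand-in it uses an upper-confidence-bound score on a ridge-regression model, which is one simple way to trade off exploration and exploitation in a linear model. The function name `run_asd` and the parameters `beta` and `lam` are hypothetical, introduced only for illustration. The feature ASD is defined by is kept: each point in the pool can be labeled at most once, and the objective is the sum of the responses of the labeled points.

```python
import numpy as np


def run_asd(X, y, budget, beta=1.0, lam=1.0):
    """Label `budget` distinct points from the pool, one at a time.

    X: (n, d) features of the unlabeled pool.
    y: (n,) hidden responses; y[i] is revealed only once point i is labeled.
    beta: exploration weight (beta=0 gives a purely greedy rule).
    lam: ridge regularization strength.
    NOTE: UCB-style stand-in for illustration, not the paper's IDS rule.
    """
    n, d = X.shape
    if budget > n:
        raise ValueError("budget cannot exceed the pool size")
    A = lam * np.eye(d)                  # regularized Gram matrix of labeled points
    b = np.zeros(d)                      # sum of response * features over labeled points
    unlabeled = np.ones(n, dtype=bool)
    chosen, total = [], 0.0
    for _ in range(budget):
        A_inv = np.linalg.inv(A)
        theta = A_inv @ b                # ridge estimate of the linear model
        mean = X @ theta                 # predicted response for every point
        # Per-point uncertainty sqrt(x^T A^{-1} x), clipped for numerical safety.
        quad = np.einsum("ij,jk,ik->i", X, A_inv, X)
        width = np.sqrt(np.maximum(quad, 0.0))
        score = mean + beta * width      # exploit (mean) plus explore (width)
        score[~unlabeled] = -np.inf      # a point can be labeled at most once
        i = int(np.argmax(score))
        unlabeled[i] = False
        chosen.append(i)
        total += float(y[i])             # objective: sum of observed responses
        A += np.outer(X[i], X[i])        # update the model with the new label
        b += y[i] * X[i]
    return chosen, total
```

Calling `run_asd(X, y, budget=20)` on a synthetic pool returns the indices labeled, in order, and the sum of their responses. Comparing `beta=0` against a positive `beta` gives a rough feel for the dilemma the abstract describes, though it says nothing about how IDS itself behaves.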

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2205.14829 [stat.ML]
	(or arXiv:2205.14829v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2205.14829

Submission history

From: Ziping Xu
[v1] Mon, 30 May 2022 03:30:45 UTC (3,120 KB)
[v2] Fri, 3 Jun 2022 00:57:47 UTC (3,120 KB)
[v3] Mon, 2 Jan 2023 23:44:58 UTC (3,174 KB)
