Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization

Qiu, Jiahao; Yuan, Hui; Zhang, Jinghong; Chen, Wentao; Wang, Huazheng; Wang, Mengdi

Quantitative Biology > Biomolecules

arXiv:2401.06173 (q-bio)

[Submitted on 8 Jan 2024]

Title:Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization

Authors:Jiahao Qiu, Hui Yuan, Jinghong Zhang, Wentao Chen, Huazheng Wang, Mengdi Wang

View PDF HTML (experimental)

Abstract:While modern biotechnologies allow synthesizing new proteins and function measurements at scale, efficiently exploring a protein sequence space and engineering it remains a daunting task due to the vast sequence space of any given protein. Protein engineering is typically conducted through an iterative process of adding mutations to the wild-type or lead sequences, recombination of mutations, and running new rounds of screening. To enhance the efficiency of such a process, we propose a tree search-based bandit learning method, which expands a tree starting from the initial sequence with the guidance of a bandit machine learning model. Under simplified assumptions and a Gaussian Process prior, we provide theoretical analysis and a Bayesian regret bound, demonstrating that the combination of local search and bandit learning method can efficiently discover a near-optimal design. The full algorithm is compatible with a suite of randomized tree search heuristics, machine learning models, pre-trained embeddings, and bandit techniques. We test various instances of the algorithm across benchmark protein datasets using simulated screens. Experiment results demonstrate that the algorithm is both sample-efficient and able to find top designs using reasonably small mutation counts.

Comments:	AAAI 2024
Subjects:	Biomolecules (q-bio.BM); Machine Learning (cs.LG)
Cite as:	arXiv:2401.06173 [q-bio.BM]
	(or arXiv:2401.06173v1 [q-bio.BM] for this version)
	https://doi.org/10.48550/arXiv.2401.06173

Submission history

From: Huazheng Wang [view email]
[v1] Mon, 8 Jan 2024 06:33:27 UTC (32,376 KB)

Quantitative Biology > Biomolecules

Title:Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Quantitative Biology > Biomolecules

Title:Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators