Data efficient protein backmapping with backbone-to-side chain transformers

Chennakesavalu, Shriram; Rotskoff, Grant M.

Condensed Matter > Statistical Mechanics

arXiv:2311.11459v1 (cond-mat)

[Submitted on 19 Nov 2023 (this version), latest version 2 Feb 2024 (v2)]

Title:Data efficient protein backmapping with backbone-to-side chain transformers

Authors:Shriram Chennakesavalu, Grant M. Rotskoff

View PDF

Abstract:Excitement at the prospect of using data-driven generative models to sample configurational ensembles of biomolecular systems stems from the extraordinary success of these models on a diverse set of high-dimensional sampling tasks. Unlike image generation or even the closely related problem of protein structure prediction, there are not currently data sources with sufficient breadth to parameterize generative models for conformational ensembles. To enable discovery, a fundamentally different approach to building generative models is required: models should be able to propose rare, albeit physical, conformations that may not arise in even the largest data sets. Here we introduce a modular strategy to generate conformations based on ``backmapping'' from a fixed protein backbone that 1) maintains conformational diversity of the side chains and 2) couples the side chain fluctuations using global information about the protein conformation. Our model combines simple statistical models of side chain conformations based on rotamer libraries with the now ubiquitous transformer architecture to sample with atomistic accuracy. Together, these ingredients provide a strategy for rapid data acquistion and hence a crucial ingredient for scalable physical simulation with generative neural networks.

Subjects:	Statistical Mechanics (cond-mat.stat-mech)
Cite as:	arXiv:2311.11459 [cond-mat.stat-mech]
	(or arXiv:2311.11459v1 [cond-mat.stat-mech] for this version)
	https://doi.org/10.48550/arXiv.2311.11459

Submission history

From: Shriram Chennakesavalu [view email]
[v1] Sun, 19 Nov 2023 23:50:13 UTC (6,986 KB)
[v2] Fri, 2 Feb 2024 20:18:06 UTC (8,553 KB)

Condensed Matter > Statistical Mechanics

Title:Data efficient protein backmapping with backbone-to-side chain transformers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Condensed Matter > Statistical Mechanics

Title:Data efficient protein backmapping with backbone-to-side chain transformers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators