Predicting aggregate morphology of sequence-defined macromolecules with Recurrent Neural Networks

Bhattacharya, Debjyoti; Kleeblatt, Devon C.; Statt, Antonia; Reinhart, Wesley F.


arXiv:2204.04502 (cond-mat)

[Submitted on 9 Apr 2022 (v1), last revised 7 Jun 2022 (this version, v2)]


Abstract: Self-assembly of dilute sequence-defined macromolecules is a complex phenomenon in which the local arrangement of chemical moieties can lead to the formation of long-range structure. The dependence of this structure on the sequence necessarily implies that a mapping between the two exists, yet it has been difficult to model so far. Predicting the aggregation behavior of these macromolecules is challenging due to the lack of effective order parameters, a vast design space, inherent variability, and high computational costs associated with currently available simulation techniques. Here, we accurately predict the morphology of aggregates self-assembled from sequence-defined macromolecules using supervised machine learning. We find that regression models with implicit representation learning perform significantly better than those based on engineered features such as $k$-mer counting, and a Recurrent-Neural-Network-based regressor performs the best out of nine model architectures we tested. Furthermore, we demonstrate the high-throughput screening of monomer sequences using the regression model to identify candidates for self-assembly into selected morphologies. Our strategy is shown to successfully identify multiple suitable sequences in every test we performed, so we hope the insights gained here can be extended to other increasingly complex design scenarios in the future, such as the design of sequences under polydispersity and at varying environmental conditions.
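For readers unfamiliar with the engineered-feature baseline named in the abstract, the following is a minimal sketch of $k$-mer counting. It is an illustration only, not the authors' implementation: the two-letter monomer alphabet `"AB"`, the function name `kmer_counts`, and the example sequence are all assumptions made here, since the abstract does not specify them.

```python
from collections import Counter
from itertools import product


def kmer_counts(sequence, k, alphabet="AB"):
    """Return a fixed-length vector of k-mer counts for `sequence`.

    `alphabet` is a hypothetical two-monomer alphabet chosen for illustration.
    """
    # Enumerate every possible k-mer in a fixed order so that all sequences
    # map to feature vectors of the same length (len(alphabet) ** k).
    vocabulary = ["".join(p) for p in product(alphabet, repeat=k)]
    # Slide a window of width k along the sequence and tally each substring.
    observed = Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))
    # A Counter returns 0 for k-mers that never occur.
    return [observed[kmer] for kmer in vocabulary]


# 2-mers of "AABAB" are AA, AB, BA, AB; the vocabulary order is AA, AB, BA, BB.
print(kmer_counts("AABAB", k=2))  # → [1, 2, 1, 0]
```

A vector like this discards where each $k$-mer sits along the chain, which is one plausible reason a model that reads the sequence in order, such as a recurrent network, could capture information that counting alone misses.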

Comments:	38 pages, 11 figures
Subjects:	Soft Condensed Matter (cond-mat.soft); Disordered Systems and Neural Networks (cond-mat.dis-nn); Materials Science (cond-mat.mtrl-sci); Other Condensed Matter (cond-mat.other)
Cite as:	arXiv:2204.04502 [cond-mat.soft]
	(or arXiv:2204.04502v2 [cond-mat.soft] for this version)
	https://doi.org/10.48550/arXiv.2204.04502

Submission history

From: Debjyoti Bhattacharya
[v1] Sat, 9 Apr 2022 15:56:43 UTC (40,240 KB)
[v2] Tue, 7 Jun 2022 02:26:45 UTC (14,352 KB)
