Optimal Invariant Bases for Atomistic Machine Learning

Allen, Alice E. A.; Shinkle, Emily; Bujack, Roxana; Lubbers, Nicholas

Physics > Chemical Physics

arXiv:2503.23515 (physics)

[Submitted on 30 Mar 2025 (v1), last revised 3 Apr 2025 (this version, v2)]

Abstract: The representation of atomic configurations for machine learning models has led to the development of numerous descriptors, often to describe the local environment of atoms. However, many of these representations are incomplete and/or functionally dependent. Incomplete descriptor sets are unable to represent all meaningful changes in the atomic environment. Complete constructions of atomic environment descriptors, on the other hand, often suffer from a high degree of functional dependence, where some descriptors can be written as functions of the others. These redundant descriptors do not provide additional power to discriminate between different atomic environments and increase the computational burden. By applying techniques from the pattern recognition literature to existing atomistic representations, we remove descriptors that are functions of other descriptors to produce the smallest possible set that satisfies completeness. We apply this in two ways. First, we refine an existing description, the Atomistic Cluster Expansion, and show that this yields a more efficient subset of descriptors. Second, we augment an incomplete construction based on a scalar neural network, yielding a new message-passing network architecture that can recognize up to 5-body patterns in each neuron by taking advantage of an optimal set of Cartesian tensor invariants. This architecture shows strong accuracy on state-of-the-art benchmarks while retaining low computational cost. Our results not only yield improved models but also point the way to classes of invariant bases that minimize cost while maximizing expressivity for a host of applications.
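To make the notion of functional dependence concrete, here is a minimal illustrative sketch. It is not the authors' method. It assumes a toy environment of two neighbor vectors and four hand-picked rotation invariants, the last of which is a function of the first three, since |r1 x r2|^2 = (r1.r1)(r2.r2) - (r1.r2)^2. The rank of the Jacobian of the descriptors with respect to the coordinates counts how many are functionally independent at a generic point. The names `descriptors`, `jacobian`, `matrix_rank`, and `x0` are hypothetical and chosen only for this example.

```python
def descriptors(x):
    """Four rotation-invariant descriptors of two neighbor vectors.

    x holds the six Cartesian coordinates (r1 followed by r2).
    The fourth descriptor, |r1 x r2|^2, is redundant by construction.
    """
    r1, r2 = x[:3], x[3:]

    def dot(a, b):
        return sum(p * q for p, q in zip(a, b))

    cross = (
        r1[1] * r2[2] - r1[2] * r2[1],
        r1[2] * r2[0] - r1[0] * r2[2],
        r1[0] * r2[1] - r1[1] * r2[0],
    )
    return [dot(r1, r1), dot(r2, r2), dot(r1, r2), dot(cross, cross)]


def jacobian(f, x, h=1e-5):
    """Central finite-difference Jacobian of f at x (rows: outputs)."""
    n_out = len(f(x))
    jac = [[0.0] * len(x) for _ in range(n_out)]
    for j in range(len(x)):
        x_plus, x_minus = list(x), list(x)
        x_plus[j] += h
        x_minus[j] -= h
        f_plus, f_minus = f(x_plus), f(x_minus)
        for i in range(n_out):
            jac[i][j] = (f_plus[i] - f_minus[i]) / (2.0 * h)
    return jac


def matrix_rank(matrix, tol=1e-6):
    """Numerical rank via Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]
    n_rows, n_cols = len(a), len(a[0])
    rank = 0
    for col in range(n_cols):
        if rank == n_rows:
            break
        pivot = max(range(rank, n_rows), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < tol:
            continue  # no usable pivot in this column
        a[rank], a[pivot] = a[pivot], a[rank]
        for r in range(rank + 1, n_rows):
            factor = a[r][col] / a[rank][col]
            for c in range(col, n_cols):
                a[r][c] -= factor * a[rank][c]
        rank += 1
    return rank


# A generic (non-collinear) pair of neighbor vectors, chosen arbitrarily.
x0 = [1.0, 0.2, -0.3, 0.4, 1.1, 0.5]

full_rank = matrix_rank(jacobian(descriptors, x0))
reduced_rank = matrix_rank(jacobian(lambda x: descriptors(x)[:3], x0))

print("descriptors:", len(descriptors(x0)), "independent:", full_rank)
print("after dropping the redundant one, independent:", reduced_rank)
```

In this toy case the four descriptors have only three independent gradients, and dropping the fourth leaves the rank unchanged, which is the sense in which a redundant descriptor adds cost without adding discriminating power. A rank test of this kind detects dependence but says nothing about completeness, which the abstract treats as a separate requirement.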

Comments:	Update cross-reference to companion paper
Subjects:	Chemical Physics (physics.chem-ph); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:2503.23515 [physics.chem-ph]
	(or arXiv:2503.23515v2 [physics.chem-ph] for this version)
	https://doi.org/10.48550/arXiv.2503.23515

Submission history

From: Nicholas Lubbers
[v1] Sun, 30 Mar 2025 16:52:29 UTC (3,711 KB)
[v2] Thu, 3 Apr 2025 16:35:44 UTC (3,711 KB)
