Multimodal Language and Graph Learning of Adsorption Configuration in Catalysis

Ock, Janghoon; Badrinarayanan, Srivathsan; Magar, Rishikesh; Antony, Akshay; Farimani, Amir Barati

Computer Science > Computational Engineering, Finance, and Science

arXiv:2401.07408 (cs)

[Submitted on 15 Jan 2024 (v1), last revised 12 Oct 2024 (this version, v4)]

Title:Multimodal Language and Graph Learning of Adsorption Configuration in Catalysis

Authors:Janghoon Ock, Srivathsan Badrinarayanan, Rishikesh Magar, Akshay Antony, Amir Barati Farimani

View PDF HTML (experimental)

Abstract:Adsorption energy is a reactivity descriptor that must be accurately predicted for effective machine learning (ML) application in catalyst screening. This process involves determining the lowest energy across various adsorption configurations on a catalytic surface, which can exhibit very similar energy values. While graph neural networks (GNNs) have shown great success in computing the energy of catalyst systems, they rely heavily on atomic spatial coordinates. In contrast, transformer-based language models can directly use human-readable text inputs, potentially bypassing the need for detailed atomic positions. However, these language models often struggle with accurately predicting the energy of adsorption configurations. Our study addresses this limitation by introducing a self-supervised multi-modal learning approach called graph-assisted pretraining, which connects well-established GNNs with emerging language model applications. This method reduces the MAE of energy prediction for adsorption configurations by about 10%. Furthermore, our findings demonstrate that graph-assisted pretraining enhances fine-tuning with different datasets, indicating the transferability of this approach. This method also redirects the model's attention toward adsorption configuration, rather than individual adsorbate and catalyst information, similar to common domain knowledge. Building on this, we propose using generative large language models to create text inputs for the predictive model, based solely on chemical composition and surface orientation, without relying on exact atomic positions. This demonstrates a potential use case of language models in energy prediction without geometric information.

Comments:	manuscript updated
Subjects:	Computational Engineering, Finance, and Science (cs.CE)
Cite as:	arXiv:2401.07408 [cs.CE]
	(or arXiv:2401.07408v4 [cs.CE] for this version)
	https://doi.org/10.48550/arXiv.2401.07408

Submission history

From: Janghoon Ock [view email]
[v1] Mon, 15 Jan 2024 01:11:46 UTC (2,692 KB)
[v2] Wed, 28 Feb 2024 20:46:18 UTC (1,463 KB)
[v3] Thu, 8 Aug 2024 05:53:21 UTC (9,448 KB)
[v4] Sat, 12 Oct 2024 17:57:20 UTC (12,855 KB)

Computer Science > Computational Engineering, Finance, and Science

Title:Multimodal Language and Graph Learning of Adsorption Configuration in Catalysis

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computational Engineering, Finance, and Science

Title:Multimodal Language and Graph Learning of Adsorption Configuration in Catalysis

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators