Bi-level Contrastive Learning for Knowledge-Enhanced Molecule Representations

Jiang, Pengcheng; Xiao, Cao; Fu, Tianfan; Sun, Jimeng

Computer Science > Machine Learning

arXiv:2306.01631v2 (cs)

[Submitted on 2 Jun 2023 (v1), revised 16 Aug 2023 (this version, v2), latest version 16 Feb 2025 (v6)]

Title:Bi-level Contrastive Learning for Knowledge-Enhanced Molecule Representations

Authors:Pengcheng Jiang, Cao Xiao, Tianfan Fu, Jimeng Sun

View PDF

Abstract:Molecule representation learning underpins diverse downstream applications such as molecular property and side effect understanding and prediction. In this paper, we recognize the two-level structure of individual molecule as having intrinsic graph structure as well as being a node in a large molecule knowledge graph, and present GODE, a new approach that seamlessly integrates graph representations of individual molecules with multi-domain biomedical data from knowledge graphs. By pre-training two graph neural networks (GNNs) on different graph structures, combined with contrastive learning, GODE adeptly fuses molecular structures with their corresponding knowledge graph substructures. This fusion results in a more robust and informative representation, enhancing molecular property prediction by harnessing both chemical and biological information. Finetuned on 11 chemical property tasks, our model surpasses benchmarks, achieving an average ROC-AUC improvement of 14.5%, 9.8%, and 7.3% on BBBP, SIDER, and Tox21 datasets. In regression tasks on ESOL and QM7 datasets, we achieve average improvements of 21.0% and 29.6% improvements in RMSE and MAE, setting a new field benchmark.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
Cite as:	arXiv:2306.01631 [cs.LG]
	(or arXiv:2306.01631v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.01631

Submission history

From: Pengcheng Jiang [view email]
[v1] Fri, 2 Jun 2023 15:49:45 UTC (878 KB)
[v2] Wed, 16 Aug 2023 12:30:27 UTC (4,937 KB)
[v3] Fri, 29 Sep 2023 19:35:24 UTC (5,812 KB)
[v4] Sat, 20 Jan 2024 03:22:43 UTC (6,189 KB)
[v5] Tue, 10 Dec 2024 01:31:17 UTC (4,964 KB)
[v6] Sun, 16 Feb 2025 05:22:45 UTC (6,003 KB)

Computer Science > Machine Learning

Title:Bi-level Contrastive Learning for Knowledge-Enhanced Molecule Representations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Bi-level Contrastive Learning for Knowledge-Enhanced Molecule Representations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators