Pre-training with Fractional Denoising to Enhance Molecular Property Prediction

Ni, Yuyan; Feng, Shikun; Hong, Xin; Sun, Yuancheng; Ma, Wei-Ying; Ma, Zhi-Ming; Ye, Qiwei; Lan, Yanyan

arXiv:2407.11086 (cs)

[Submitted on 14 Jul 2024]

Abstract: Deep learning methods have been considered promising for accelerating molecular screening in drug discovery and material design. Due to the limited availability of labelled data, various self-supervised molecular pre-training methods have been presented. While many existing methods utilize common pre-training tasks from computer vision (CV) and natural language processing (NLP), they often overlook the fundamental physical principles governing molecules. In contrast, applying denoising in pre-training can be interpreted as equivalent to force learning, but the limited noise distribution introduces bias into the molecular distribution. To address this issue, we introduce a molecular pre-training framework called fractional denoising (Frad), which decouples noise design from the constraints imposed by force learning equivalence. In this way, the noise becomes customizable, allowing chemical priors to be incorporated to significantly improve molecular distribution modeling. Experiments demonstrate that our framework consistently outperforms existing methods, establishing state-of-the-art results across force prediction, quantum chemical properties, and binding affinity tasks. The refined noise design enhances force accuracy and sampling coverage, which contribute to the creation of physically consistent molecular representations, ultimately leading to superior predictive performance.
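
The abstract states that denoising can be read as force learning but does not show why. A minimal sketch of the standard argument follows, assuming isotropic Gaussian coordinate noise and a Boltzmann distribution over conformations. The symbols $E$, $F$, $k_B T$, $\sigma$, $s_\theta$, and $p_\sigma$ are introduced here for illustration and are not taken from the abstract; Frad's own noise design is more general than this special case.

```latex
% Assumption: equilibrium conformations x follow a Boltzmann distribution
% with energy E(x), so the score is proportional to the force F(x).
\[
  p(x) \propto \exp\!\left(-\frac{E(x)}{k_B T}\right)
  \quad\Longrightarrow\quad
  \nabla_x \log p(x) = -\frac{\nabla_x E(x)}{k_B T} = \frac{F(x)}{k_B T}.
\]

% Assumption: coordinates are perturbed with isotropic Gaussian noise.
\[
  \tilde{x} = x + \varepsilon, \qquad
  \varepsilon \sim \mathcal{N}(0, \sigma^2 I), \qquad
  \nabla_{\tilde{x}} \log p(\tilde{x} \mid x)
    = -\frac{\tilde{x} - x}{\sigma^2}
    = -\frac{\varepsilon}{\sigma^2}.
\]

% Denoising score matching: regressing onto the (rescaled) noise is
% equivalent, up to a constant C independent of \theta, to regressing
% onto the score of the noised marginal p_\sigma.
\[
  \mathbb{E}_{p(x)\,p(\tilde{x} \mid x)}
    \left\| s_\theta(\tilde{x}) + \frac{\varepsilon}{\sigma^2} \right\|^2
  =
  \mathbb{E}_{p_\sigma(\tilde{x})}
    \left\| s_\theta(\tilde{x}) - \nabla_{\tilde{x}} \log p_\sigma(\tilde{x}) \right\|^2
  + C.
\]
```

Under these assumptions, a network trained to predict the added noise learns the score of the noised distribution $p_\sigma$, which matches the force field up to scaling only to the extent that $p_\sigma$ approximates the true Boltzmann distribution. This is the sense in which a restricted noise distribution can bias the learned molecular distribution.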

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Chemical Physics (physics.chem-ph)
Cite as:	arXiv:2407.11086 [cs.LG]
	(or arXiv:2407.11086v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.11086

Submission history

From: Yuyan Ni
[v1] Sun, 14 Jul 2024 11:09:42 UTC (6,166 KB)
