可解释性
计算机科学
人工智能
机器学习
生成模型
财产(哲学)
特征学习
代表(政治)
图形
遮罩(插图)
生成语法
特征(语言学)
模式识别(心理学)
理论计算机科学
艺术
视觉艺术
哲学
认识论
政治
法学
语言学
政治学
作者
Yingxu Liu,Qing Fan,Chengcheng Xu,Xiangzhen Ning,Yu Wang,Yang Liu,Yanmin Zhang,Yadong Chen,Haichun Liu
标识
DOI:10.1002/minf.202400146
摘要
Abstract Background : Effective molecular feature representation is crucial for drug property prediction. Recent years have seen increased attention on graph neural networks (GNNs) that are pre‐trained using self‐supervised learning techniques, aiming to overcome the scarcity of labeled data in molecular property prediction. Traditional GNNs in self‐supervised molecular property prediction typically perform a single masking operation on the nodes and edges of the input molecular graph, masking only local information and insufficient for thorough self‐supervised training. Method : Hence, we propose a model for molecular property prediction based on generative double‐masking self‐supervised learning, termed as GDMol. This integrates generative learning into the self‐supervised learning framework for latent representation, and applies a second round of masking to these latent representations, enabling the model to better capture global information and semantic knowledge of the molecules for a richer, more informative representation, thereby achieving more accurate and robust molecular property prediction. Results : Our experiments on 5 datasets demonstrated superior performance of GDMol in predicting molecular properties across different domains. Moreover, we used the masking operation to traverse through the gradient changes of each node, the magnitude and sign of which reflect the positive and negative contribution respectively of the local structure in the molecule to the prediction outcome. This in‐depth interpretative analysis not only enhances the model's interpretability, but also provides more targeted insights and direction for optimizing drug molecules. Conclusions : In summary, this research offers novel insights on improving molecular property prediction tasks, and paves the way for further research on the application of generative learning and self‐supervised learning in the field of chemistry.
科研通智能强力驱动
Strongly Powered by AbleSci AI