化学
可解释性
量子化学
化学空间
概化理论
分子描述符
人工神经网络
分子图
图形
机器学习
QM/毫米
集合(抽象数据类型)
数量结构-活动关系
计算化学
人工智能
理论计算机科学
分子动力学
有机化学
分子
数学
药物发现
计算机科学
生物化学
统计
程序设计语言
作者
Shih‐Cheng Li,Haoyang Wu,Angiras Menon,Kevin Spiekermann,Yi‐Pei Li,William H. Green
摘要
Deep graph neural networks are extensively utilized to predict chemical reactivity and molecular properties. However, because of the complexity of chemical space, such models often have difficulty extrapolating beyond the chemistry contained in the training set. Augmenting the model with quantum mechanical (QM) descriptors is anticipated to improve its generalizability. However, obtaining QM descriptors often requires CPU-intensive computational chemistry calculations. To identify when QM descriptors help graph neural networks predict chemical properties, we conduct a systematic investigation of the impact of atom, bond, and molecular QM descriptors on the performance of directed message passing neural networks (D-MPNNs) for predicting 16 molecular properties. The analysis surveys computational and experimental targets, as well as classification and regression tasks, and varied data set sizes from several hundred to hundreds of thousands of data points. Our results indicate that QM descriptors are mostly beneficial for D-MPNN performance on small data sets, provided that the descriptors correlate well with the targets and can be readily computed with high accuracy. Otherwise, using QM descriptors can add cost without benefit or even introduce unwanted noise that can degrade model performance. Strategic integration of QM descriptors with D-MPNN unlocks potential for physics-informed, data-efficient modeling with some interpretability that can streamline
科研通智能强力驱动
Strongly Powered by AbleSci AI