Three-Dimensional Convolutional Neural Networks Utilizing Molecular Topological Features for Accurate Atomization Energy Predictions

计算机科学 卷积神经网络 背景(考古学) 水准点(测量) 代表(政治) 人工神经网络 拓扑(电路) 过程(计算) 化学空间 集合(抽象数据类型) 功能(生物学) 人工智能 化学 数学 药物发现 古生物学 生物化学 大地测量学 组合数学 进化生物学 政治 政治学 法学 生物 程序设计语言 地理 操作系统
作者
Ankur K. Gupta,Krishnan Raghavachari
出处
期刊:Journal of Chemical Theory and Computation [American Chemical Society]
卷期号:18 (4): 2132-2143 被引量:5
标识
DOI:10.1021/acs.jctc.1c00504
摘要

Deep learning methods provide a novel way to establish a correlation between two quantities. In this context, computer vision techniques such as three-dimensional (3D)-convolutional neural networks become a natural choice to associate a molecular property with its structure due to the inherent 3D nature of a molecule. However, traditional 3D input data structures are intrinsically sparse in nature, which tend to induce instabilities during the learning process, which in turn may lead to underfitted results. To address this deficiency, in this project, we propose to use quantum-chemically derived molecular topological features, namely, localized orbital locator and electron localization function, as molecular descriptors, which provide a relatively denser input representation in a 3D space. Such topological features provide a detailed picture of the atomic and electronic configuration and interatomic interactions in the molecule and hence are ideal for predicting properties that are highly dependent on the physical or electronic structure of the molecule. Herein, we demonstrate the efficacy of our proposed model by applying it to the task of predicting atomization energies for the QM9-G4MP2 data set, which contains ∼134k molecules. Furthermore, we incorporated the Δ-machine learning approach into our model, which enabled us to reach beyond benchmark accuracy levels (∼1.0 kJ mol-1). As a result, we consistently obtain impressive mean absolute errors of the order 0.1 kcal mol-1 (∼0.42 kJ mol-1) versus the G4(MP2) theory using relatively modest models, which could potentially be improved further in a systematic manner using additional compute resources.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
临风完成签到,获得积分10
1秒前
李新宁完成签到,获得积分10
1秒前
1秒前
阿六发布了新的文献求助10
3秒前
时间完成签到,获得积分10
3秒前
3秒前
jiahaixu关注了科研通微信公众号
4秒前
T-SL完成签到,获得积分10
4秒前
4秒前
qz完成签到,获得积分10
6秒前
AAA王哥发布了新的文献求助50
7秒前
orixero应助超级天晴采纳,获得10
7秒前
科研通AI6.2应助友好的储采纳,获得10
8秒前
万能图书馆应助macart采纳,获得10
8秒前
直率曼荷完成签到,获得积分10
8秒前
9秒前
10秒前
lalala完成签到,获得积分10
10秒前
蜂蜜发布了新的文献求助10
11秒前
11秒前
Always完成签到,获得积分10
12秒前
NexusExplorer应助Markov采纳,获得10
13秒前
leosong完成签到,获得积分10
13秒前
xiaosu发布了新的文献求助30
14秒前
14秒前
短腿柯基完成签到,获得积分10
14秒前
盒子发布了新的文献求助20
14秒前
15秒前
赘婿应助DTS采纳,获得10
15秒前
18秒前
酷波er应助阿六采纳,获得10
19秒前
顾矜应助超帅的又槐采纳,获得10
20秒前
wakaka应助李秋莉采纳,获得10
21秒前
阿喔完成签到,获得积分10
21秒前
大神瓜完成签到,获得积分10
21秒前
21秒前
风思雅完成签到,获得积分10
22秒前
行知完成签到,获得积分10
22秒前
牛马发布了新的文献求助10
23秒前
诚心的蜗牛完成签到,获得积分10
23秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
PowerCascade: A Synthetic Dataset for Cascading Failure Analysis in Power Systems 2000
Picture this! Including first nations fiction picture books in school library collections 1500
Instituting Science: The Cultural Production of Scientific Disciplines 666
Signals, Systems, and Signal Processing 610
The Organization of knowledge in modern America, 1860-1920 / 600
Unlocking Chemical Thinking: Reimagining Chemistry Teaching and Learning 555
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6360351
求助须知:如何正确求助?哪些是违规求助? 8174573
关于积分的说明 17218162
捐赠科研通 5415407
什么是DOI,文献DOI怎么找? 2865917
邀请新用户注册赠送积分活动 1843138
关于科研通互助平台的介绍 1691313