Three-Dimensional Convolutional Neural Networks Utilizing Molecular Topological Features for Accurate Atomization Energy Predictions

计算机科学 卷积神经网络 背景(考古学) 水准点(测量) 代表(政治) 人工神经网络 拓扑(电路) 过程(计算) 化学空间 集合(抽象数据类型) 功能(生物学) 人工智能 化学 数学 药物发现 古生物学 生物化学 大地测量学 组合数学 进化生物学 政治 政治学 法学 生物 程序设计语言 地理 操作系统
作者
Ankur K. Gupta,Krishnan Raghavachari
出处
期刊:Journal of Chemical Theory and Computation [American Chemical Society]
卷期号:18 (4): 2132-2143 被引量:5
标识
DOI:10.1021/acs.jctc.1c00504
摘要

Deep learning methods provide a novel way to establish a correlation between two quantities. In this context, computer vision techniques such as three-dimensional (3D)-convolutional neural networks become a natural choice to associate a molecular property with its structure due to the inherent 3D nature of a molecule. However, traditional 3D input data structures are intrinsically sparse in nature, which tend to induce instabilities during the learning process, which in turn may lead to underfitted results. To address this deficiency, in this project, we propose to use quantum-chemically derived molecular topological features, namely, localized orbital locator and electron localization function, as molecular descriptors, which provide a relatively denser input representation in a 3D space. Such topological features provide a detailed picture of the atomic and electronic configuration and interatomic interactions in the molecule and hence are ideal for predicting properties that are highly dependent on the physical or electronic structure of the molecule. Herein, we demonstrate the efficacy of our proposed model by applying it to the task of predicting atomization energies for the QM9-G4MP2 data set, which contains ∼134k molecules. Furthermore, we incorporated the Δ-machine learning approach into our model, which enabled us to reach beyond benchmark accuracy levels (∼1.0 kJ mol-1). As a result, we consistently obtain impressive mean absolute errors of the order 0.1 kcal mol-1 (∼0.42 kJ mol-1) versus the G4(MP2) theory using relatively modest models, which could potentially be improved further in a systematic manner using additional compute resources.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
star发布了新的文献求助10
2秒前
善良书蝶完成签到,获得积分10
2秒前
海蓝云天应助nono采纳,获得50
3秒前
我是老大应助文静灵阳采纳,获得10
3秒前
亮仔完成签到,获得积分10
3秒前
Augustines完成签到,获得积分10
4秒前
huanghuang发布了新的文献求助10
5秒前
6秒前
后知后觉完成签到,获得积分10
7秒前
CodeCraft应助star采纳,获得10
7秒前
AYing完成签到,获得积分10
7秒前
7秒前
joshar完成签到,获得积分10
7秒前
斯文败类应助单于靖荷采纳,获得10
8秒前
蓝天小小鹰完成签到 ,获得积分10
8秒前
8秒前
liuy发布了新的文献求助10
9秒前
9秒前
10秒前
ambition完成签到,获得积分20
10秒前
One完成签到,获得积分0
10秒前
11秒前
12秒前
12秒前
JRoon完成签到,获得积分10
12秒前
12秒前
12秒前
bkagyin应助文静灵阳采纳,获得10
13秒前
orixero应助仁爱问芙采纳,获得10
13秒前
13秒前
烟染完成签到,获得积分10
14秒前
14秒前
14秒前
归一完成签到,获得积分10
15秒前
15秒前
yinan完成签到,获得积分20
15秒前
16秒前
16秒前
星星发布了新的文献求助10
16秒前
琅琊为刃完成签到,获得积分10
16秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
PowerCascade: A Synthetic Dataset for Cascading Failure Analysis in Power Systems 2000
Picture this! Including first nations fiction picture books in school library collections 1500
Signals, Systems, and Signal Processing 610
Unlocking Chemical Thinking: Reimagining Chemistry Teaching and Learning 555
CLSI M100 Performance Standards for Antimicrobial Susceptibility Testing 36th edition 400
How to Design and Conduct an Experiment and Write a Lab Report: Your Complete Guide to the Scientific Method (Step-by-Step Study Skills) 333
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6363461
求助须知:如何正确求助?哪些是违规求助? 8177390
关于积分的说明 17232734
捐赠科研通 5418609
什么是DOI,文献DOI怎么找? 2867125
邀请新用户注册赠送积分活动 1844328
关于科研通互助平台的介绍 1691850