Three-Dimensional Convolutional Neural Networks Utilizing Molecular Topological Features for Accurate Atomization Energy Predictions

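Keywords: Computer science; Convolutional neural network; Context (archaeology); Benchmark (surveying); Representation (politics); Artificial neural network; Topology (electrical circuits); Process (computing); Chemical space; Set (abstract data type); Function (biology); Artificial intelligence; Chemistry; Mathematics; Drug discovery; Paleontology; Biochemistry; Geodesy; Combinatorics; Evolutionary biology; Politics; Political science; Law; Biology; Programming language; Geography; Operating system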
Authors
Ankur K. Gupta,Krishnan Raghavachari
Source
Journal: Journal of Chemical Theory and Computation [American Chemical Society]
Volume (issue): 18 (4): 2132-2143 Cited by: 5
Identifier
DOI:10.1021/acs.jctc.1c00504
Abstract

Deep learning methods provide a novel way to establish a correlation between two quantities. In this context, computer vision techniques such as three-dimensional (3D) convolutional neural networks become a natural choice to associate a molecular property with its structure, owing to the inherent 3D nature of a molecule. However, traditional 3D input data structures are intrinsically sparse, which tends to induce instabilities during the learning process and may in turn lead to underfitted results. To address this deficiency, in this project, we propose to use quantum-chemically derived molecular topological features, namely, the localized orbital locator and the electron localization function, as molecular descriptors, which provide a relatively denser input representation in 3D space. Such topological features provide a detailed picture of the atomic and electronic configuration and interatomic interactions in the molecule and hence are ideal for predicting properties that are highly dependent on the physical or electronic structure of the molecule. Herein, we demonstrate the efficacy of our proposed model by applying it to the task of predicting atomization energies for the QM9-G4MP2 data set, which contains ∼134k molecules. Furthermore, we incorporated the Δ-machine learning approach into our model, which enabled us to reach beyond benchmark accuracy levels (∼1.0 kJ mol⁻¹). As a result, we consistently obtain impressive mean absolute errors of the order of 0.1 kcal mol⁻¹ (∼0.42 kJ mol⁻¹) versus the G4(MP2) theory using relatively modest models, which could potentially be improved further in a systematic manner using additional compute resources.
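
The Δ-machine learning idea mentioned in the abstract can be illustrated with a minimal sketch: rather than predicting the high-level energy directly, a model is trained on the difference between a high-level and a cheaper low-level energy, and the learned correction is added back to the low-level value. The example below is not the authors' model. It replaces the 3D convolutional network with a one-feature least-squares line, and every number in it (the descriptor values and both sets of energies) is made-up toy data chosen only to make the arithmetic easy to follow. It also checks the unit conversion quoted in the abstract.

```python
# Illustrative sketch of Δ-machine learning. All data are hypothetical toy
# values, and the linear fit is a stand-in for the 3D CNN used in the paper.

KCAL_TO_KJ = 4.184  # 1 thermochemical kcal = 4.184 kJ


def kcal_to_kj(value_kcal):
    """Convert an energy from kcal/mol to kJ/mol."""
    return value_kcal * KCAL_TO_KJ


def fit_line(xs, ys):
    """Ordinary least-squares fit y ≈ a*x + b for a single feature."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    a = sxy / sxx
    b = mean_y - a * mean_x
    return a, b


def mean_absolute_error(predicted, reference):
    """Mean absolute error between two equally long sequences."""
    return sum(abs(p - r) for p, r in zip(predicted, reference)) / len(reference)


# Hypothetical per-molecule descriptor, cheap baseline energies, and
# expensive reference energies (all energies in kcal/mol).
features = [1.0, 2.0, 3.0, 4.0]
e_low = [-100.0, -200.0, -300.0, -400.0]
e_high = [-101.5, -202.5, -303.5, -404.5]

# Δ-learning target: the correction from the low level to the high level.
deltas = [high - low for high, low in zip(e_high, e_low)]

# Train the stand-in model on the correction, not on the total energy.
slope, intercept = fit_line(features, deltas)

# Final prediction = low-level energy + learned correction.
e_pred = [low + slope * x + intercept for low, x in zip(e_low, features)]

mae_kcal = mean_absolute_error(e_pred, e_high)
print(f"MAE: {mae_kcal:.6f} kcal/mol = {kcal_to_kj(mae_kcal):.6f} kJ/mol")
print(f"0.1 kcal/mol = {kcal_to_kj(0.1):.4f} kJ/mol")
```

The design point is that the correction is usually a smaller and smoother quantity than the total energy, so a relatively modest model can fit it well. In this toy case the correction is exactly linear in the descriptor, so the fit reproduces the reference energies; real data would leave a residual error.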
