波函数
水准点(测量)
哈密顿量(控制论)
量子化学
密度泛函理论
计算机科学
量子
功能(生物学)
统计物理学
一般化
航程(航空)
计算复杂性理论
比例(比率)
分子
计算化学
算法
化学
量子力学
物理
数学
数学优化
材料科学
超分子化学
生物
数学分析
进化生物学
复合材料
大地测量学
地理
作者
Kuzma Khrabrov,Ilya Shenbin,Alexander Ryabov,Artem Tsypin,Alexander Telepov,Anton Alekseev,Alexander Grishin,П. В. Страшнов,Petr Zhilyaev,Sergey I. Nikolenko,Artur Kadurin
摘要
Electronic wave function calculation is a fundamental task of computational quantum chemistry. Knowledge of the wave function parameters allows one to compute physical and chemical properties of molecules and materials. Unfortunately, it is infeasible to compute the wave functions analytically even for simple molecules. Classical quantum chemistry approaches such as the Hartree-Fock method or density functional theory (DFT) allow to compute an approximation of the wave function but are very computationally expensive. One way to lower the computational complexity is to use machine learning models that can provide sufficiently good approximations at a much lower computational cost. In this work we: (1) introduce a new curated large-scale dataset of electron structures of drug-like molecules, (2) establish a novel benchmark for the estimation of molecular properties in the multi-molecule setting, and (3) evaluate a wide range of methods with this benchmark. We show that the accuracy of recently developed machine learning models deteriorates significantly when switching from the single-molecule to the multi-molecule setting. We also show that these models lack generalization over different chemistry classes. In addition, we provide experimental evidence that larger datasets lead to better ML models in the field of quantum chemistry.
科研通智能强力驱动
Strongly Powered by AbleSci AI