计算机科学
Atom(片上系统)
水准点(测量)
电子密度
缩放比例
基础(线性代数)
统计物理学
试验装置
集合(抽象数据类型)
算法
电子
物理
数学
人工智能
量子力学
几何学
大地测量学
程序设计语言
嵌入式系统
地理
作者
Andrea Grisafi,Alan M. Lewis,Mariana Rossi,Michele Ceriotti
标识
DOI:10.1021/acs.jctc.2c00850
摘要
The electron density of a molecule or material has recently received major attention as a target quantity of machine-learning models. A natural choice to construct a model that yields transferable and linear-scaling predictions is to represent the scalar field using a multicentered atomic basis analogous to that routinely used in density fitting approximations. However, the nonorthogonality of the basis poses challenges for the learning exercise, as it requires accounting for all the atomic density components at once. We devise a gradient-based approach to directly minimize the loss function of the regression problem in an optimized and highly sparse feature space. In so doing, we overcome the limitations associated with adopting an atom-centered model to learn the electron density over arbitrarily complex data sets, obtaining very accurate predictions using a comparatively small training set. The enhanced framework is tested on 32-molecule periodic cells of liquid water, presenting enough complexity to require an optimal balance between accuracy and computational efficiency. We show that starting from the predicted density a single Kohn-Sham diagonalization step can be performed to access total energy components that carry an error of just 0.1 meV/atom with respect to the reference density functional calculations. Finally, we test our method on the highly heterogeneous QM9 benchmark data set, showing that a small fraction of the training data is enough to derive ground-state total energies within chemical accuracy.
科研通智能强力驱动
Strongly Powered by AbleSci AI