计算机科学
水准点(测量)
热化学
标杆管理
离群值
化学空间
人工智能
热力学
化学
物理
大地测量学
生物化学
药物发现
业务
营销
地理
作者
Sambit Das,Sabyasachi Chakraborty,Raghunathan Ramakrishnan
摘要
First-principles calculation of the standard formation enthalpy, ΔHf° (298 K), in such a large scale as required by chemical space explorations, is amenable only with density functional approximations (DFAs) and certain composite wave function theories (cWFTs). Unfortunately, the accuracies of popular range-separated hybrid, “rung-4” DFAs, and cWFTs that offer the best accuracy-vs-cost trade-off have until now been established only for datasets predominantly comprising small molecules; their transferability to larger systems remains vague. In this study, we present an extended benchmark dataset of ΔHf° for structurally and electronically diverse molecules. We apply quartile-ranking based on boundary-corrected kernel density estimation to filter outliers and arrive at probabilistically pruned enthalpies of 1694 compounds (PPE1694). For this dataset, we rank the prediction accuracies of G4, G4(MP2), ccCA, CBS-QB3, and 23 popular DFAs using conventional and probabilistic error metrics. We discuss systematic prediction errors and highlight the role an empirical higher-level correction plays in the G4(MP2) model. Furthermore, we comment on uncertainties associated with the reference empirical data for atoms and the systematic errors stemming from these that grow with the molecular size. We believe that these findings will aid in identifying meaningful application domains for quantum thermochemical methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI