氯
数量结构-活动关系
量子化学
化学
反应速率常数
线性回归
生物系统
分子描述符
训练集
量子化学
Atom(片上系统)
计算化学
计算机科学
分子
人工智能
机器学习
有机化学
立体化学
物理
量子力学
动力学
超分子化学
生物
嵌入式系统
作者
Shanshan Zheng,Wenlei Qin,He Ji,Wanqian Guo,Jingyun Fang
标识
DOI:10.1021/acs.estlett.3c00494
摘要
Reactive chlorine species (RCS), such as chlorine (HOCl/OCl–), chlorine dioxide (ClO2), chlorine atom (Cl•), and dichlorine radical (Cl2•–), play a crucial role in oxidation and disinfection worldwide. In this study, we developed machine learning (ML)-based quantitative structure–activity relationship (QSAR) models to predict the rate constants of RCS toward organic compounds by using quantum chemical descriptors (QDs) and Morgan fingerprints (MFs) as input features along with three tree-based ML algorithms. The ML-based models (RMSEtest = 0.528–1.131) outperform multiple linear regression-based models (RMSEtest = 0.772–4.837). Moreover, the QSAR models developed by combining QDs and MFs as input features (RMSEtest = 0.528–0.948) show better prediction performance than that by QDs (RMSEtest = 0.616–1.875) or MFs alone (RMSEtest = 0.636–1.439) for all four RCS. The SHapely Additive exPlanation (SHAP) analysis reveals that the energy of the highest occupied molecular orbital (EHOMO), charge, and −O––NH2 and −CO are the most important descriptors affecting the rate constants of RCS. This study demonstrates that the combination of QDs and MFs as input features achieves much better model prediction performance for RCS, which can be extrapolated to other oxidants in water treatment.
科研通智能强力驱动
Strongly Powered by AbleSci AI