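Nanofiltration
Reverse osmosis
Membrane
Interpretability
Fingerprint (computing)
Chemistry
Biological system
Molecular descriptor
Ultrafiltration (renal)
Osmosis
Work (physics)
Quantitative structure-activity relationship
Process engineering
Computer science
Chromatography
Artificial intelligence
Machine learning
Thermodynamics
Engineering
Biochemistry
Physics
Biology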
Authors
Sangsuk Lee,Michael R. Shirts,Anthony P. Straub
Identifier
DOI:10.1016/j.memsci.2024.122927
Abstract
Reverse osmosis and nanofiltration are used to purify feedwaters that contain a range of harmful organic solutes. The rejection of many of these solutes is poorly understood due to our limited ability to experimentally measure removal of any given compound. In this work, we present a machine learning approach that predicts organic solute rejection using molecular fingerprints that encode chemical structure features, such as functional groups and rings, into simple binary vectors. We trained machine learning models on a database of 1906 membrane rejection measurements including 228 organic compounds and 39 types of reverse osmosis and nanofiltration membranes. Three types of molecular fingerprint models (structural key, circular, and path based) were compared, and we observed that the Molecular Access System (MACCS) structural key had high performance (coefficient of determination of 0.87 with the testing set), fast calculation time due to its short bit-length, and easy interpretability. In addition to evaluating prediction performance, Shapley Additive Explanations (SHAP) analysis was implemented to gain a better molecular-scale understanding of membrane rejection, identifying molecular substructures that are important in determining their rejection. Overall, this work presents a method to predict the rejection of compounds that uses readily available molecular structure information and improves our ability to understand rejection mechanisms.
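To make the idea of a structural key fingerprint concrete, the following is a minimal sketch of how the presence or absence of chemical features can be encoded as a short binary vector. It is not the MACCS key set and not the authors' pipeline: the six keys, the function name `toy_structural_key`, and the crude SMILES string checks are all assumptions chosen for illustration. A real workflow would match substructure patterns with a cheminformatics toolkit instead of inspecting characters.

```python
# Toy structural-key fingerprint: one bit per hand-picked feature.
# ASSUMPTION: the keys and the string-based checks below are illustrative
# only; they are NOT the MACCS keys and will misclassify many SMILES
# strings (e.g. digits inside bracket atoms, element symbols such as "Na").

HALOGEN_SYMBOLS = ("F", "Cl", "Br", "I")

# Ordered list of (key name, predicate on a SMILES string).
TOY_KEYS = [
    ("ring", lambda s: any(ch.isdigit() for ch in s)),        # ring-closure digit
    ("aromatic_carbon", lambda s: "c" in s),                  # lowercase c
    ("oxygen", lambda s: "O" in s or "o" in s),
    ("nitrogen", lambda s: "N" in s or "n" in s),
    ("halogen", lambda s: any(h in s for h in HALOGEN_SYMBOLS)),
    ("double_bonded_oxygen", lambda s: "=O" in s),
]


def toy_structural_key(smiles: str) -> list[int]:
    """Return a binary vector with one bit per entry in TOY_KEYS."""
    return [1 if predicate(smiles) else 0 for _, predicate in TOY_KEYS]


if __name__ == "__main__":
    examples = {
        "ethanol": "CCO",
        "phenol": "Oc1ccccc1",
        "acetic acid": "CC(=O)O",
        "chlorobenzene": "Clc1ccccc1",
    }
    print("keys:", [name for name, _ in TOY_KEYS])
    for name, smiles in examples.items():
        print(f"{name:14s} {smiles:12s} {toy_structural_key(smiles)}")
```

Because each bit corresponds to a named feature, a vector like this can be fed to a regression model as one feature row per compound, and an attribution for a given bit maps directly back to a substructure. That correspondence is what the abstract refers to as easy interpretability.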