生物信息学
工作流程
贝叶斯概率
计算机科学
一致性(知识库)
水准点(测量)
异方差
数据挖掘
贝叶斯定理
化学
机器学习
人工智能
数据库
基因
地理
生物化学
大地测量学
作者
Timothy F. M. Rodgers,Joseph O. Okeme,J. Mark Parnis,Kyle Girdhari,Terry F. Bidleman,Yuchao Wan,Liisa M. Jantunen,Miriam L. Diamond
标识
DOI:10.1021/acs.est.1c01418
摘要
Accurate values of physicochemical properties are essential for screening semivolatile organic compounds for human and environmental hazard and risk. In silico approaches for estimation are widely used, but the accuracy of these and measured values can be difficult to ascertain. Final adjusted values (FAVs) harmonize literature-reported measurements to ensure consistency and minimize uncertainty. We propose a workflow, including a novel Bayesian approach, for estimating FAVs that combines measurements using direct and indirect methods and in silico values. The workflow was applied to 74 compounds across nine classes to generate recommended FAVs (FAVRs). Estimates generated by in silico methods (OPERA, COSMOtherm, EPI Suite, SPARC, and polyparameter linear free energy relationships (pp-LFER) models) differed by orders of magnitude for some properties and compounds and performed systematically worse for larger, more polar compounds. COSMOtherm and OPERA generally performed well with low bias although no single in silico method performed best across all compound classes and properties. Indirect measurement methods produced highly accurate and precise estimates compared with direct measurement methods. Our Bayesian method harmonized measured and in silico estimated physicochemical properties without introducing observable biases. We thus recommend use of the FAVRs presented here and that the proposed Bayesian workflow be used to generate FAVRs for SVOCs beyond those in this study.
科研通智能强力驱动
Strongly Powered by AbleSci AI