数量结构-活动关系
计算机科学
机器学习
水准点(测量)
任务(项目管理)
人工智能
随机森林
适用范围
水生毒理学
资源(消歧)
毒性
化学
工程类
计算机网络
有机化学
大地测量学
系统工程
地理
作者
Thalea Schlender,Markus Viljanen,Jan N. van Rijn,Felix Mohr,Willie J.G.M. Peijnenburg,Holger H. Hoos,Emiel Rorije,Albert Wong
标识
DOI:10.1021/acs.est.3c00334
摘要
Toxicological information as needed for risk assessments of chemical compounds is often sparse. Unfortunately, gathering new toxicological information experimentally often involves animal testing. Simulated alternatives, e.g., quantitative structure–activity relationship (QSAR) models, are preferred to infer the toxicity of new compounds. Aquatic toxicity data collections consist of many related tasks─each predicting the toxicity of new compounds on a given species. Since many of these tasks are inherently low-resource, i.e., involve few associated compounds, this is challenging. Meta-learning is a subfield of artificial intelligence that can lead to more accurate models by enabling the utilization of information across tasks. In our work, we benchmark various state-of-the-art meta-learning techniques for building QSAR models, focusing on knowledge sharing between species. Specifically, we employ and compare transformational machine learning, model-agnostic meta-learning, fine-tuning, and multi-task models. Our experiments show that established knowledge-sharing techniques outperform single-task approaches. We recommend the use of multi-task random forest models for aquatic toxicity modeling, which matched or exceeded the performance of other approaches and robustly produced good results in the low-resource settings we studied. This model functions on a species level, predicting toxicity for multiple species across various phyla, with flexible exposure duration and on a large chemical applicability domain.
科研通智能强力驱动
Strongly Powered by AbleSci AI