ADME Properties Evaluation in Drug Discovery: Prediction of Caco-2 Cell Permeability Using a Combination of NSGA-II and Boosting

数量结构-活动关系 适用范围 偏最小二乘回归 支持向量机 Boosting(机器学习) 分子描述符 药物发现 试验装置 交叉验证 人工智能 多元统计 化学 计算机科学 线性回归 特征选择 生物系统 机器学习 数学 生物 生物化学
作者
Ningning Wang,Jie Dong,Yin-Hua Deng,Minfeng Zhu,Ming Wen,Zhi‐Jiang Yao,Aiping Lü,Jianbing Wang,Dongsheng Cao
出处
期刊:Journal of Chemical Information and Modeling [American Chemical Society]
卷期号:56 (4): 763-773 被引量:229
标识
DOI:10.1021/acs.jcim.5b00642
摘要

The Caco-2 cell monolayer model is a popular surrogate in predicting the in vitro human intestinal permeability of a drug due to its morphological and functional similarity with human enterocytes. A quantitative structure-property relationship (QSPR) study was carried out to predict Caco-2 cell permeability of a large data set consisting of 1272 compounds. Four different methods including multivariate linear regression (MLR), partial least-squares (PLS), support vector machine (SVM) regression and Boosting were employed to build prediction models with 30 molecular descriptors selected by nondominated sorting genetic algorithm-II (NSGA-II). The best Boosting model was obtained finally with R(2) = 0.97, RMSEF = 0.12, Q(2) = 0.83, RMSECV = 0.31 for the training set and RT(2) = 0.81, RMSET = 0.31 for the test set. A series of validation methods were used to assess the robustness and predictive ability of our model according to the OECD principles and then define its applicability domain. Compared with the reported QSAR/QSPR models about Caco-2 cell permeability, our model exhibits certain advantage in database size and prediction accuracy to some extent. Finally, we found that the polar volume, the hydrogen bond donor, the surface area and some other descriptors can influence the Caco-2 permeability to some extent. These results suggest that the proposed model is a good tool for predicting the permeability of drug candidates and to perform virtual screening in the early stage of drug development.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
周美言完成签到,获得积分10
1秒前
李健应助endeavor采纳,获得10
1秒前
bkagyin应助zuoyou采纳,获得10
1秒前
2秒前
粗心的人生完成签到,获得积分10
2秒前
承序完成签到,获得积分10
2秒前
2秒前
2秒前
遗忘者关注了科研通微信公众号
2秒前
科研通AI6.4应助ryen采纳,获得10
3秒前
xxx发布了新的文献求助10
4秒前
7秒前
勤恳雅香完成签到,获得积分10
7秒前
学术脑袋发布了新的文献求助10
9秒前
颜一完成签到,获得积分10
10秒前
11秒前
xx发布了新的文献求助10
11秒前
wddd333333完成签到,获得积分10
16秒前
16秒前
16秒前
史文韬发布了新的文献求助10
17秒前
彭于晏应助Pearson采纳,获得10
17秒前
18秒前
Hello应助moon采纳,获得10
19秒前
Dawn发布了新的文献求助10
20秒前
孙泉发布了新的文献求助10
21秒前
zhangjworks发布了新的文献求助10
21秒前
CodeCraft应助xx采纳,获得10
23秒前
昏睡的凝雁完成签到,获得积分20
24秒前
熊仔一百完成签到,获得积分0
24秒前
24秒前
谦让的紫蓝完成签到,获得积分10
25秒前
沐Mu完成签到,获得积分10
26秒前
29秒前
上官若男应助30采纳,获得10
30秒前
31秒前
SciGPT应助一个迷途小书童采纳,获得10
32秒前
烟花应助en采纳,获得10
34秒前
eeush完成签到,获得积分10
34秒前
35秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
PowerCascade: A Synthetic Dataset for Cascading Failure Analysis in Power Systems 2000
Picture this! Including first nations fiction picture books in school library collections 1500
Signals, Systems, and Signal Processing 610
Unlocking Chemical Thinking: Reimagining Chemistry Teaching and Learning 555
Photodetectors: From Ultraviolet to Infrared 500
Cancer Targets: Novel Therapies and Emerging Research Directions (Part 1) 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6359773
求助须知:如何正确求助?哪些是违规求助? 8173861
关于积分的说明 17215784
捐赠科研通 5414746
什么是DOI,文献DOI怎么找? 2865640
邀请新用户注册赠送积分活动 1842949
关于科研通互助平台的介绍 1691148