广告
下部结构
计算机科学
支持向量机
人工智能
数量结构-活动关系
模式识别(心理学)
生物信息学
数据挖掘
分子描述符
训练集
试验装置
机器学习
计算生物学
化学
生物信息学
工程类
生物
药代动力学
基因
结构工程
生物化学
作者
Jie Shen,Feixiong Cheng,You Xu,Weihua Li,Yun Tang
摘要
Over the past decade, absorption, distribution, metabolism, and excretion (ADME) property evaluation has become one of the most important issues in the process of drug discovery and development. Since in vivo and in vitro evaluations are costly and laborious, in silico techniques had been widely used to estimate ADME properties of chemical compounds. Traditional prediction methods usually try to build a functional relationship between a set of molecular descriptors and a given ADME property. Although traditional methods have been successfully used in many cases, the accuracy and efficiency of molecular descriptors must be concerned. Herein, we report a new classification method based on substructure pattern recognition, in which each molecule is represented as a substructure pattern fingerprint based on a predefined substructure dictionary, and then a support vector machine (SVM) algorithm is applied to build the prediction model. Therefore, a direct connection between substructures and molecular properties is built. The most important substructure patterns can be identified via the information gain analysis, which could help to interpret the models from a medicinal chemistry perspective. Afterward, this method was verified with two data sets, one for blood-brain barrier (BBB) penetration and the other for human intestinal absorption (HIA). The results demonstrated that the overall predictive accuracies of the best HIA model for the training and test sets were 98.5 and 98.8%, and the overall predictive accuracies of the best BBB model for the training and test sets were 98.8 and 98.4%, which confirmed the reliability of our method. In the additional validations, the predictive accuracies were 94 and 69.5% for the HIA and the BBB models, respectively. Moreover, some of the representative key substructure patterns which significantly correlated with the HIA and BBB penetration properties were also presented.
科研通智能强力驱动
Strongly Powered by AbleSci AI