特征选择
数量结构-活动关系
人工智能
计算机科学
稳健性(进化)
模式识别(心理学)
机器学习
降维
维数之咒
二进制数
特征(语言学)
算法
数学
生物
哲学
基因
算术
生物化学
语言学
作者
Ronghe Zhou,Yong Zhang,Kai He
标识
DOI:10.1016/j.eswa.2023.121015
摘要
High dimensionality is one of the main challenges in Quantitative Structure-Activity Relationship (QSAR) classification modeling, and feature selection as an effective dimensionality reduction method plays an important role in machine learning, particularly in fields such as chemometrics. In this paper, for feature selection in QSAR classification modeling, a hybrid whale optimization algorithm (WOA) with a chameleon hunting mechanism (HWOA-CHM) is proposed, and its binary version is used to find the best subset for wrapper feature selection in the QSAR classification model. First, a chaos weighting factor is introduced and used as a perturbation factor to increase the diversity of populations. Second, a retractable transformation strategy is designed to prevent the HWOA-CHM from falling into a local optimum. Third, the chameleon predation mechanism is introduced to improve the convergence accuracy of the HWOA-CHM. The performance of HWOA-CHM is evaluated and compared with state-of-the-art classical algorithms and well-known WOA variants. Then, a binary HWOA-CHM (BHWOA-CHM) was designed to solve the feature selection, the BHWOA-CHM is validated using the UCI machine learning repository and compared with binary version WOA, and well-known WOA variants in terms of accuracy, number of features, and time. Finally, BHWOA-CHM was used to solve the high-dimensional feature selection problem in the drug-induced liver injury classification model. It has shown excellent results in terms of feature selection compared to other methods. The proposed method effectively improves the robustness of QSAR predictions while reducing the complexity of the feature sets, demonstrating its potential for improving the accuracy of QSAR models.
科研通智能强力驱动
Strongly Powered by AbleSci AI