过度拟合
判别式
人工智能
支持向量机
亚细胞定位
编码(社会科学)
过采样
机器学习
相互信息
模式识别(心理学)
计算机科学
数据挖掘
人工神经网络
数学
生物化学
统计
化学
细胞质
带宽(计算)
计算机网络
作者
Zhao‐Yue Zhang,Zi‐Jie Sun,Yu-He Yang,Hao Lin
标识
DOI:10.1007/s11704-021-1015-3
摘要
The spatial distribution pattern of long non-coding RNA (lncRNA) in cell is tightly related to their function. With the increment of publicly available subcellular location data, a number of computational methods have been developed for the recognition of the subcellular localization of lncRNA. Unfortunately, these computational methods suffer from the low discriminative power of redundant features or overfitting of oversampling. To address those issues and enhance the prediction performance, we present a support vector machine-based approach by incorporating mutual information algorithm and incremental feature selection strategy. As a result, the new predictor could achieve the overall accuracy of 91.60%. The highly automated web-tool is available at lin-group.cn/server/iLoc-LncRNA(2.0)/website. It will help to get the knowledge of lncRNA subcellular localization.
科研通智能强力驱动
Strongly Powered by AbleSci AI