小RNA
分类器(UML)
计算机科学
人工智能
鉴定(生物学)
计算生物学
机器学习
模式识别(心理学)
数据挖掘
生物信息学
生物
基因
遗传学
生态学
作者
Shu-Hao Wang,Yan Zhao,Chun-Chun Wang,Fei Chu,Lianying Miao,Li Zhang,Linlin Zhuo,Xing Chen
标识
DOI:10.1016/j.compbiomed.2024.108177
摘要
With the increasing number of microRNAs (miRNAs), identifying essential miRNAs has become an important task that needs to be solved urgently. However, there are few computational methods for essential miRNA identification. Here, we proposed a novel framework called Rotation Forest for Essential MicroRNA identification (RFEM) to predict the essentiality of miRNAs in mice. We first constructed 1264 miRNA features of all miRNA samples by fusing 38 miRNA features obtained from the PESM paper and 1226 miRNA functional features calculated based on miRNA-target gene interactions. Then, we employed 182 training samples with 1264 features to train the rotation forest model, which was applied to compute the essentiality scores of the candidate samples. The main innovations of RFEM were as follows: 1) miRNA functional features were introduced to enrich the diversity of miRNA features; 2) the rotation forest model used decision tree as the base classifier and could increase the difference among base classifiers through feature transformation to achieve better ensemble results. Experimental results show that RFEM significantly outperformed two previous models with the AUC (AUPR) of 0.942 (0.944) in three comparison experiments under 5-fold cross validation, which proved the model's reliable performance. Moreover, ablation study was further conducted to demonstrate the effectiveness of the novel miRNA functional features. Additionally, in the case studies of assessing the essentiality of unlabeled miRNAs, experimental literature confirmed that 7 of the top 10 predicted miRNAs have crucial biological functions in mice. Therefore, RFEM would be a reliable tool for identifying essential miRNAs.
科研通智能强力驱动
Strongly Powered by AbleSci AI