生物
亚细胞定位
核苷酸
计算生物学
遗传学
基因
作者
Ahsan Ahmad,Hao Lin,Swakkhar Shatabda
出处
期刊:Genomics
[Elsevier]
日期:2020-05-01
卷期号:112 (3): 2583-2589
被引量:27
标识
DOI:10.1016/j.ygeno.2020.02.011
摘要
Knowledge of the sub-cellular localization of the most diverse class of transcribed RNA, long non-coding RNAs (lncRNAs) will lead us to identify different types of cancers and other diseases as lncRNAs play key role in related cellular functions. In recent days with the exponential growth of known records, it becomes essential to establish new machine learning based techniques to identify the new one due to faster and cheaper solutions provided compared to laboratory methods. In this paper, we propose Locate-R, a novel method for predicting the sub-cellular location of lncRNAs. We have used only n -gapped l -mer composition and l -mer composition as features and select best 655 features to build the model. This model is based locally deep support vector machines which significantly enhance the prediction accuracy with respect to exiting state-of-the-art methods. Our predictor is readily available for use as a stand-alone web application from: http://locate-r.azurewebsites.net/ . • An efficient feature extraction and selection procedure. • An extensible framework/methodology for binary and multi-class classification problems. • A prediction tool available for RNA location prediction.
科研通智能强力驱动
Strongly Powered by AbleSci AI