随机森林
计算机科学
决策树
机器学习
鉴定(生物学)
人工智能
树(集合论)
计算生物学
生物
数学
植物
数学分析
作者
Md. Inzamam-Ul-Hossain,Md. Rafiqul Islam
出处
期刊:Communications in computer and information science
日期:2021-01-01
卷期号:: 211-223
被引量:1
标识
DOI:10.1007/978-3-030-91244-4_17
摘要
For the survival and reproduction of organisms, essential proteins are crucial. Identification of the essential protein is important for cell working and drug design. Essential proteins are predicted from many protein-protein interactions (PPI) networks that are developed using high-throughput techniques. Computational methods are used by many existing proposed techniques to identify essential proteins. Many of them considered topological features for essential protein prediction. Some of the research works consider both topological and biological features to identify essential proteins. In this paper, we have proposed a method using machine learning techniques to accomplish the purpose. Here the Saccharomyces Cerevisiae dataset is considered for essential protein prediction. Three classifiers such as XGBoost, Random Forest, and decision tree have been used to predict the essential proteins. We also apply ensemble methods combined with the three classifiers XGBoost, Random Forest, and decision tree for essential protein prediction. The ensemble method gives the highest accuracy to identify essential proteins compared with the other existing methods. On the other hand, XGBoost gives the highest F1 score.
科研通智能强力驱动
Strongly Powered by AbleSci AI