计算机科学
蛋白质-蛋白质相互作用
数据挖掘
机器学习
人工智能
生物
遗传学
作者
Changqing Mei,Yuan‐Yuan Wang,Kun Lü,Bing Wang,Peng Chen
标识
DOI:10.1109/itme.2018.00077
摘要
Protein-protein interactions play essential roles in various biological progresses. Identifying protein interaction sites can facilitate researchers to understand life activities and therefore will be helpful for drug design. However, the unbalance between the negative and positive samples comes from the current definitions of interaction sites places restriction on the prediction of protein interaction sites by computational approaches. In this work, we proposed three imbalance data processing strategies to improve the performance of protein interaction sites prediction. We first proposed the extraction of relevant features based on the evolutionary conservation of amino acids to predict protein interaction sites. At the same time, three methods are proposed to deal with the imbalance of positive and negative samples in data sets. Experimental results demonstrated the effectiveness of our method.
科研通智能强力驱动
Strongly Powered by AbleSci AI