杠杆(统计)
计算机科学
功能(生物学)
融合
源代码
数据挖掘
特征(语言学)
人工智能
组分(热力学)
机器学习
生物
语言学
哲学
物理
热力学
进化生物学
操作系统
作者
xuemin li,Yurong Qian,Yue Hu,Jiaying Chen,Haitao Yue,Lei Deng
标识
DOI:10.1021/acs.jcim.3c01794
摘要
Protein function prediction is essential for disease treatment and drug development; yet, traditional biological experimental methods are less efficient in annotating protein function, and existing automated methods fail to fully leverage protein multisource data. Here, we present MSF-PFP, a computational framework that fuses multisource data features to predict protein function with high accuracy. Our framework designs specific models for feature extraction based on the characteristics of various data sources, including a global-local-individual strategy for local location features. MSF-PFP then integrates extracted features through a multisource feature fusion model, ultimately categorizing protein functions. Experimental results demonstrate that MSF-PFP outperforms eight state-of-the-art models, achieving FMax scores of 0.542, 0.675, and 0.624 for the biological process (BP), molecular function (MF), and cellular component (CC), respectively. The source code and data set for MSF-PFP are available at https://swanhub.co/TianGua/MSF-PFP, facilitating further exploration and validation of the proposed framework. This study highlights the potential of multisource data fusion in enhancing protein function prediction, contributing to improved disease therapy and medication discovery strategies.
科研通智能强力驱动
Strongly Powered by AbleSci AI