计算机科学
一般化
序列(生物学)
图形
蛋白质结构预测
源代码
蛋白质结构数据库
人工智能
蛋白质结构
机器学习
数据挖掘
计算生物学
理论计算机科学
生物
程序设计语言
序列数据库
数学
数学分析
基因
生物化学
作者
Yitian Fang,Yi Jiang,Leyi Wei,Qin Ma,Zhixiang Ren,Qianmu Yuan,Dong‐Qing Wei
出处
期刊:Bioinformatics
[Oxford University Press]
日期:2023-11-28
卷期号:39 (12)
标识
DOI:10.1093/bioinformatics/btad718
摘要
Identifying the functional sites of a protein, such as the binding sites of proteins, peptides, or other biological components, is crucial for understanding related biological processes and drug design. However, existing sequence-based methods have limited predictive accuracy, as they only consider sequence-adjacent contextual features and lack structural information.In this study, DeepProSite is presented as a new framework for identifying protein binding site that utilizes protein structure and sequence information. DeepProSite first generates protein structures from ESMFold and sequence representations from pretrained language models. It then uses Graph Transformer and formulates binding site predictions as graph node classifications. In predicting protein-protein/peptide binding sites, DeepProSite outperforms state-of-the-art sequence- and structure-based methods on most metrics. Moreover, DeepProSite maintains its performance when predicting unbound structures, in contrast to competing structure-based prediction methods. DeepProSite is also extended to the prediction of binding sites for nucleic acids and other ligands, verifying its generalization capability. Finally, an online server for predicting multiple types of residue is established as the implementation of the proposed DeepProSite.The datasets and source codes can be accessed at https://github.com/WeiLab-Biology/DeepProSite. The proposed DeepProSite can be accessed at https://inner.wei-group.net/DeepProSite/.
科研通智能强力驱动
Strongly Powered by AbleSci AI