计算机科学
图形
集合(抽象数据类型)
数据集
人工神经网络
人工智能
计算生物学
数据挖掘
机器学习
理论计算机科学
生物
程序设计语言
作者
Zhiyuan Zhou,Yueming Yin,Hao Han,Yiping Jia,Jun Hong Koh,Adams Wai‐Kin Kong,Yuguang Mu
标识
DOI:10.1021/acs.jcim.4c01850
摘要
Protein-protein interactions (PPIs) are crucial for understanding biological processes and disease mechanisms, contributing significantly to advances in protein engineering and drug discovery. The accurate determination of binding affinities, essential for decoding PPIs, faces challenges due to the substantial time and financial costs involved in experimental and theoretical methods. This situation underscores the urgent need for more effective and precise methodologies for predicting binding affinity. Despite the abundance of research on PPI modeling, the field of quantitative binding affinity prediction remains underexplored, mainly due to a lack of comprehensive data. This study seeks to address these needs by manually curating pairwise interaction labels on available 3D structures of protein complexes, with experimentally determined binding affinities, creating the largest data set for structure-based pairwise protein interaction with binding affinity to date. Subsequently, we introduce ProAffinity-GNN, a novel deep learning framework using protein language model and graph neural network (GNN) to improve the accuracy of prediction of structure-based protein-protein binding affinities. The evaluation results across several benchmark test sets and an additional case study demonstrate that ProAffinity-GNN not only outperforms existing models in terms of accuracy but also shows strong generalization capabilities.
科研通智能强力驱动
Strongly Powered by AbleSci AI