卷积神经网络
计算机科学
分子
氢键
源代码
生物系统
蛋白质结构预测
人工神经网络
蛋白质结构
功能(生物学)
人工智能
蛋白质结晶
化学
生物
生物化学
进化生物学
操作系统
有机化学
结晶
作者
Sangwoo Park,Chaok Seok
标识
DOI:10.1021/acs.jcim.2c00306
摘要
Proteins interact with numerous water molecules to perform their physiological functions in biological organisms. Most water molecules act as solvent media; hence, their roles may be considered implicitly in theoretical treatments of protein structure and function. However, some water molecules interact intimately with proteins and require explicit treatment to understand their effects. Most physics-based computational methods are limited in their ability to accurately locate water molecules on protein surfaces because of inaccurate energy functions. Instead of relying on an energy function, this study attempts to learn the locations of water molecules from structural data. GalaxyWater-convolutional neural network (CNN) predicts water positions on protein chains, protein-protein interfaces, and protein-compound binding sites using a 3D-CNN model that is trained to generate a water score map on a given protein structure. The training data are compiled from high-resolution protein crystal structures resolved together with water molecules. GalaxyWater-CNN shows improved water prediction performance both in the coverage of crystal water molecules and in the accuracy of the predicted water positions when compared with previous energy-based methods. This method shows a superior performance in predicting water molecules that form hydrogen-bond networks precisely. The web service and the source code of this water prediction method are freely available at https://galaxy.seoklab.org/gwcnn and https://github.com/seoklab/GalaxyWater-CNN, respectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI