SPIN-CGNN: Improved fixed backbone protein design with contact map-based graph construction and contact graph neural network

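Computer science, Artificial intelligence, Graph, Deep learning, Convolutional neural network, Artificial neural network, Protein structure prediction, Perplexity, Pattern recognition (psychology), Theoretical computer science, Algorithm, Protein structure, Language model, Biology, Biochemistry
Authors
Xing Zhang, Yin Hong-mei, Fei Ling, Jian Zhan, Yaoqi Zhou
Source
Journal: PLOS Computational Biology [Public Library of Science]
Volume (issue): 19 (12): e1011330 Cited by: 4
Identifier
DOI: 10.1371/journal.pcbi.1011330
Abstract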

Recent advances in deep learning have significantly improved the ability to infer protein sequences directly from protein structures for fixed-backbone design. The methods have evolved from the early use of multi-layer perceptrons to convolutional neural networks, transformers, and graph neural networks (GNNs). However, the conventional approach of constructing a K-nearest-neighbors (KNN) graph for GNNs has limited the utilization of edge information, which plays a critical role in network performance. Here we introduce SPIN-CGNN, which uses protein contact maps to define nearest neighbors. Together with auxiliary edge updates and selective kernels, we found that SPIN-CGNN provided refolding ability by AlphaFold2 comparable to the current state-of-the-art techniques, but a significant improvement over them in terms of sequence recovery, perplexity, deviation from amino-acid compositions of native sequences, conservation of hydrophobic positions, and low complexity regions, according to tests on unseen structures, “hallucinated” structures, and structures generated by diffusion models. The results suggest that low complexity regions in sequences designed by deep learning, for generated structures in particular, remain to be improved when compared to native sequences.
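
The difference between the two graph constructions mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the helper names (`knn_edges`, `contact_edges`), the use of one point per residue (for example a C-alpha atom), and the 8 Å cutoff are all assumptions chosen for illustration, since the abstract does not specify how contacts are defined. A KNN graph gives every residue exactly k neighbors regardless of distance, whereas a contact-map graph links only residue pairs that are spatially close, so the number of neighbors varies with the local environment.

```python
import math

def pairwise_distances(coords):
    """Euclidean distance matrix for a list of (x, y, z) points."""
    return [[math.dist(a, b) for b in coords] for a in coords]

def knn_edges(coords, k):
    """Directed edges i -> j linking each residue to its k nearest neighbors."""
    dist = pairwise_distances(coords)
    edges = set()
    for i, row in enumerate(dist):
        others = [j for j in range(len(row)) if j != i]
        others.sort(key=lambda j: row[j])
        edges.update((i, j) for j in others[:k])
    return edges

def contact_edges(coords, cutoff):
    """Directed edges i -> j for every residue pair closer than `cutoff`."""
    dist = pairwise_distances(coords)
    n = len(coords)
    return {(i, j) for i in range(n) for j in range(n)
            if i != j and dist[i][j] < cutoff}

# Toy trace (hypothetical coordinates): four residues spaced 3.8 Å apart
# along a line, plus one isolated residue far from the rest.
coords = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0),
          (11.4, 0.0, 0.0), (40.0, 0.0, 0.0)]

knn = knn_edges(coords, k=2)               # assumed k, for illustration only
contacts = contact_edges(coords, cutoff=8.0)  # assumed 8 Å cutoff

# KNN forces the isolated residue to connect to distant residues;
# the contact graph leaves it without edges.
print(sorted(e for e in knn if e[0] == 4))
print(sorted(e for e in contacts if 4 in e))
```

In this toy case the KNN graph links the isolated residue to neighbors roughly 30 Å away, while the contact graph contains only physically close pairs. How SPIN-CGNN turns such a contact graph into edge features and updates them is described in the paper itself and is not reproduced here.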
