A conditional protein diffusion model generates artificial programmable endonuclease sequences with enhanced activity

计算生物学 火球菌属 生物信息学 蛋白质工程 合成生物学 蛋白质设计 定向进化 蛋白质结构域 生物 蛋白质结构 计算机科学 生物化学 基因 突变体 古细菌
作者
Bingxin Zhou,Lirong Zheng,Banghao Wu,Kai Yi,Bozitao Zhong,Yang Tan,Qian Liu,Píetro Lió,Liang Hong
出处
期刊:Cell discovery [Springer Nature]
卷期号:10 (1)
标识
DOI:10.1038/s41421-024-00728-2
摘要

Abstract Deep learning-based methods for generating functional proteins address the growing need for novel biocatalysts, allowing for precise tailoring of functionalities to meet specific requirements. This advancement leads to the development of highly efficient and specialized proteins with diverse applications across scientific, technological, and biomedical fields. This study establishes a pipeline for protein sequence generation with a conditional protein diffusion model, namely CPDiffusion, to create diverse sequences of proteins with enhanced functions. CPDiffusion accommodates protein-specific conditions, such as secondary structures and highly conserved amino acids. Without relying on extensive training data, CPDiffusion effectively captures highly conserved residues and sequence features for specific protein families. We applied CPDiffusion to generate artificial sequences of Argonaute (Ago) proteins based on the backbone structures of wild-type (WT) Kurthia massiliensis Ago (KmAgo) and Pyrococcus furiosus Ago (PfAgo), which are complex multi-domain programmable endonucleases. The generated sequences deviate by up to nearly 400 amino acids from their WT templates. Experimental tests demonstrated that the majority of the generated proteins for both KmAgo and PfAgo show unambiguous activity in DNA cleavage, with many of them exhibiting superior activity as compared to the WT. These findings underscore CPDiffusion’s remarkable success rate in generating novel sequences for proteins with complex structures and functions in a single step, leading to enhanced activity. This approach facilitates the design of enzymes with multi-domain molecular structures and intricate functions through in silico generation and screening, all accomplished without the need for supervision from labeled data.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
小马甲应助牛仔很忙采纳,获得10
1秒前
自由代芹发布了新的文献求助10
2秒前
沉默水蓝完成签到 ,获得积分10
2秒前
2秒前
ZSH发布了新的文献求助10
2秒前
Owen应助杨荣花采纳,获得10
2秒前
sice完成签到,获得积分10
2秒前
3秒前
Dannerys完成签到 ,获得积分10
3秒前
NIU完成签到,获得积分10
4秒前
healer完成签到,获得积分10
6秒前
淡淡灯泡发布了新的文献求助20
7秒前
姜汁完成签到,获得积分10
7秒前
搜集达人应助yu采纳,获得10
7秒前
笑点低绝义完成签到,获得积分10
10秒前
王鑫发布了新的文献求助10
10秒前
13秒前
顾矜应助healer采纳,获得10
13秒前
13秒前
TaoJ应助汉堡麻麻采纳,获得10
14秒前
Cc完成签到,获得积分10
14秒前
KK完成签到 ,获得积分10
15秒前
呵呵完成签到,获得积分10
16秒前
16秒前
17秒前
晚风发布了新的文献求助10
17秒前
17秒前
ardejiang发布了新的文献求助10
19秒前
张磊完成签到,获得积分10
19秒前
蜡笔小鑫完成签到,获得积分10
20秒前
六六完成签到 ,获得积分10
20秒前
莫言发布了新的文献求助50
20秒前
shikaly发布了新的文献求助21
20秒前
张秋贤完成签到,获得积分10
21秒前
fane完成签到,获得积分10
21秒前
LIUJIE发布了新的文献求助10
22秒前
NIU关注了科研通微信公众号
22秒前
琉璃苣发布了新的文献求助10
22秒前
金先生发布了新的文献求助10
23秒前
张磊发布了新的文献求助10
23秒前
高分求助中
LNG地下式貯槽指針(JGA指-107-19)(Recommended practice for LNG inground storage) 1000
rhetoric, logic and argumentation: a guide to student writers 1000
QMS18Ed2 | process management. 2nd ed 1000
Eric Dunning and the Sociology of Sport 850
Operative Techniques in Pediatric Orthopaedic Surgery 510
Generalized Linear Mixed Models 第二版 500
人工地层冻结稳态温度场边界分离方法及新解答 500
热门求助领域 (近24小时)
化学 医学 材料科学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 物理化学 催化作用 免疫学 细胞生物学 电极
热门帖子
关注 科研通微信公众号,转发送积分 2921018
求助须知:如何正确求助?哪些是违规求助? 2563349
关于积分的说明 6933820
捐赠科研通 2221229
什么是DOI,文献DOI怎么找? 1180754
版权声明 588757
科研通“疑难数据库(出版商)”最低求助积分说明 577670