亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Machine-Learning-Guided Library Design Cycle for Directed Evolution of Enzymes: The Effects of Training Data Composition on Sequence Space Exploration

定向进化 序列空间 序列(生物学) 定向分子进化 蛋白质工程 作文(语言) 系列(地层学) 蛋白质测序 化学空间 功能(生物学) 计算机科学 计算生物学 生物 人工智能 生物信息学 遗传学 肽序列 生物化学 数学 基因 药物发现 语言学 突变体 古生物学 哲学 巴拿赫空间 纯数学
作者
Yutaka Saitô,Misaki Oikawa,T. Sato,Hikaru Nakazawa,Tsuyoshi Ito,Tomoshi Kameda,Koji Tsuda,Mitsuo Umetsu
出处
期刊:ACS Catalysis [American Chemical Society]
卷期号:11 (23): 14615-14624 被引量:17
标识
DOI:10.1021/acscatal.1c03753
摘要

Machine learning (ML) is becoming an attractive tool in mutagenesis-based protein engineering because of its ability to design a variant library containing proteins with a desired function. However, it remains unclear how ML guides directed evolution in sequence space depending on the composition of training data. Here, we present a ML-guided directed evolution study of an enzyme to investigate the effects of a known “highly positive” variant (i.e., variant known to have high enzyme activity) in training data. We performed two separate series of ML-guided directed evolution of Sortase A with and without a known highly positive variant called 5M in training data. In each series, two rounds of ML were conducted: variants predicted by the initial round were experimentally evaluated and used as additional training data for the second-round of prediction. The improvements in enzyme activity were comparable between the two series, both achieving enzyme activity 2.2–2.5 times higher than 5M. Intriguingly, the sequences of the improved variants were largely different between the two series, indicating that ML guided the directed evolution to the distinct regions of sequence space depending on the presence/absence of the highly positive variant in the training data. This suggests that the sequence diversity of improved variants can be expanded not only by conventional ML using the whole training data but also by ML using a subset of the training data even when it lacks highly positive variants. In summary, this study demonstrates the importance of regulating the composition of training data in ML-guided directed evolution.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
17秒前
18秒前
25秒前
酷波er应助整齐惜芹采纳,获得10
29秒前
啵啵鸡完成签到,获得积分20
31秒前
麻花阳应助科研通管家采纳,获得10
35秒前
整齐惜芹完成签到,获得积分10
39秒前
乐乐应助啵啵鸡采纳,获得10
42秒前
明理仰发布了新的文献求助10
46秒前
彪壮的幻丝完成签到 ,获得积分0
49秒前
zxh发布了新的文献求助10
51秒前
59秒前
zxh完成签到,获得积分10
1分钟前
苹果尔柳发布了新的文献求助10
1分钟前
1分钟前
苹果尔柳完成签到,获得积分10
1分钟前
Zimba完成签到,获得积分20
1分钟前
zxzb完成签到 ,获得积分10
2分钟前
2分钟前
充电宝应助Chloe采纳,获得10
2分钟前
muzi完成签到 ,获得积分10
2分钟前
魔幻的易梦发布了新的文献求助100
2分钟前
muzi关注了科研通微信公众号
2分钟前
2分钟前
jinshiyu58发布了新的文献求助10
2分钟前
香蕉觅云应助科研通管家采纳,获得10
2分钟前
领导范儿应助科研通管家采纳,获得30
2分钟前
CipherSage应助科研通管家采纳,获得10
2分钟前
Owen应助科研通管家采纳,获得10
2分钟前
英俊的铭应助科研通管家采纳,获得10
2分钟前
共享精神应助科研通管家采纳,获得10
2分钟前
2分钟前
今后应助muzi采纳,获得10
3分钟前
3分钟前
瞬间发布了新的文献求助10
3分钟前
3分钟前
Chloe发布了新的文献求助10
3分钟前
墨月白完成签到,获得积分10
3分钟前
3分钟前
3分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Handbook of pharmaceutical excipients, Ninth edition 5000
Aerospace Standards Index - 2026 ASIN2026 2000
Digital Twins of Advanced Materials Processing 2000
晋绥日报合订本24册(影印本1986年)【1940年9月–1949年5月】 1000
Social Cognition: Understanding People and Events 1000
Polymorphism and polytypism in crystals 1000
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 纳米技术 有机化学 物理 生物化学 化学工程 计算机科学 复合材料 内科学 催化作用 光电子学 物理化学 电极 冶金 遗传学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 6034207
求助须知:如何正确求助?哪些是违规求助? 7736690
关于积分的说明 16205516
捐赠科研通 5180694
什么是DOI,文献DOI怎么找? 2772573
邀请新用户注册赠送积分活动 1755724
关于科研通互助平台的介绍 1640537