Precise Prediction of Promoter Strength Based on a De Novo Synthetic Promoter Library Coupled with Machine Learning

发起人 突变体 计算生物学 生物 遗传学 人工智能 数学 基因 机器学习 计算机科学 基因表达
作者
Mei Zhao,Zhenqi Yuan,Longtao Wu,Shenghu Zhou,Yu Deng
出处
期刊:ACS Synthetic Biology [American Chemical Society]
卷期号:11 (1): 92-102 被引量:58
标识
DOI:10.1021/acssynbio.1c00117
摘要

Promoters are one of the most critical regulatory elements controlling metabolic pathways. However, the fast and accurate prediction of promoter strength remains challenging, leading to time- and labor-consuming promoter construction and characterization processes. This dilemma is caused by the lack of a big promoter library that has gradient strengths, broad dynamic ranges, and clear sequence profiles that can be used to train an artificial intelligence model of promoter strength prediction. To overcome this challenge, we constructed and characterized a mutant library of Trc promoters (Ptrc) using 83 rounds of mutation-construction-screening-characterization engineering cycles. After excluding invalid mutation sites, we established a synthetic promoter library that consisted of 3665 different variants, displaying an intensity range of more than two orders of magnitude. The strongest variant was ∼69-fold stronger than the original Ptrc and 1.52-fold stronger than a 1 mM isopropyl-β-d-thiogalactoside-driven PT7 promoter, with an ∼454-fold difference between the strongest and weakest expression levels. Using this synthetic promoter library, different machine learning models were built and optimized to explore the relationships between promoter sequences and transcriptional strength. Finally, our XgBoost model exhibited optimal performance, and we utilized this approach to precisely predict the strength of artificially designed promoter sequences (R2 = 0.88, mean absolute error = 0.15, and Pearson correlation coefficient = 0.94). Our work provides a powerful platform that enables the predictable tuning of promoters to achieve optimal transcriptional strength.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
袁珊发布了新的文献求助10
刚刚
情怀应助akun采纳,获得10
1秒前
JBZ完成签到,获得积分10
1秒前
WANGCHU发布了新的文献求助10
2秒前
小松奈奈完成签到,获得积分10
2秒前
gry真棒完成签到,获得积分10
3秒前
3秒前
5秒前
5秒前
小海绵发布了新的文献求助10
5秒前
科研通AI6.3应助Narcissus153采纳,获得10
5秒前
沉静立辉完成签到,获得积分10
6秒前
英姑应助bibi11采纳,获得10
7秒前
量子星尘发布了新的文献求助10
7秒前
7秒前
8秒前
liangliang发布了新的文献求助10
8秒前
10秒前
10秒前
科研通AI6.3应助liam采纳,获得10
10秒前
烟花应助Labixix采纳,获得10
10秒前
搜集达人应助xxxhhh采纳,获得10
11秒前
11秒前
Ghiocel发布了新的文献求助10
11秒前
11秒前
孤独的素发布了新的文献求助30
11秒前
万能图书馆应助coffee采纳,获得10
12秒前
能干巨人发布了新的文献求助10
13秒前
Eve关注了科研通微信公众号
14秒前
mmgf完成签到,获得积分10
14秒前
核桃发布了新的文献求助10
14秒前
哈哈哈发布了新的文献求助10
14秒前
小小自由发布了新的文献求助10
15秒前
搜集达人应助胜哥的歌采纳,获得10
15秒前
蒋美桥发布了新的文献求助10
15秒前
mason发布了新的文献求助10
16秒前
天天快乐应助yuting采纳,获得30
16秒前
16秒前
顺心钥匙完成签到,获得积分10
17秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Relation between chemical structure and local anesthetic action: tertiary alkylamine derivatives of diphenylhydantoin 1000
Signals, Systems, and Signal Processing 610
Discrete-Time Signals and Systems 610
Principles of town planning : translating concepts to applications 500
Iron‐Sulfur Clusters: Biogenesis and Biochemistry 400
Healable Polymer Systems: Fundamentals, Synthesis and Applications 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 纳米技术 有机化学 物理 生物化学 化学工程 计算机科学 复合材料 内科学 催化作用 光电子学 物理化学 电极 冶金 遗传学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 6070918
求助须知:如何正确求助?哪些是违规求助? 7902455
关于积分的说明 16338351
捐赠科研通 5211548
什么是DOI,文献DOI怎么找? 2787372
邀请新用户注册赠送积分活动 1770147
关于科研通互助平台的介绍 1648083