Increasing methionine content of soybean using CRISPR/Cas9 and developing machine learning predictive models

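Methionine; Amino acid; Limiting; Food science; Biotechnology; Biology; Essential amino acid; Biochemistry; Engineering; Mechanical engineering
Author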
Adama R. Tukuli
Identifier
DOI:10.32469/10355/93992
Abstract

Soybean [Glycine max (L.) Merr.] is an important protein source for both humans and animals. Its relatively low cost and excellent nutritive value have made soybean the world's dominant protein feed ingredient. However, soybean protein is relatively poor in sulfur-containing essential amino acids (SCEAA), especially methionine (Met). Met is central to protein synthesis: it is encoded by the start codon that initiates translation and is therefore essential in all living organisms, including plants. It is the most limiting amino acid, and poultry and swine producers spend roughly US$100 million annually to supplement animal feed with Met. Leached Met supplements are degraded by bacteria into undesirable volatile sulfides, which can harm the environment. Hence, a goal of soybean research has been to improve the quality of soy protein by increasing Met levels to create more complete, high-quality food and feed items. Although a variety of attempts have been made, these efforts have largely failed, yielding little or no increase in soybean seed Met levels, which suggests a need for new strategies. The low abundance of Met codons in seed storage protein (SSP) genes and Met catabolism (degradation) are major factors that limit total Met production in seeds. In this dissertation, a 'push' and 'pull' strategy was used. Push refers to efforts to increase the pool of free Met (FM) available for incorporation into soybean SSP by blocking Met catabolism. Pull refers to efforts to increase the levels of SSP rich in Met codons by knocking out the soybean β-conglycinin genes (Gm7s), which encode the relatively Met-poor 7S SSP. Through protein rebalancing, the lack of 7S proteins can be compensated by increased production of the relatively Met-rich 11S proteins.
These efforts made broad use of CRISPR/Cas9 gene editing tools to knock out the genes for methionine γ-lyase (MGL), a Met catabolic enzyme, and for the 7S SSPs. Consistent with newly emerging literature, a positive connection between high Met content and the synthesis of other amino acids was observed in the generated mutant genotypes. The initial milestone of increasing overall amino acid content in soybean was achieved, as gene-edited mutant lines showed higher 11S and higher Met levels. The exact relationship between free amino acids (FAA) and protein-bound amino acids (PBAA), particularly in soybean, remains an open question. Moreover, prediction of total free amino acids (TFAA) and total protein-bound amino acids (TPBAA) from individual amino acid (AA) metabolic data is critical for planning AA biofortification, especially when designing CRISPR/Cas9 edits in which multiple genes or pathways can be targeted. Machine learning (ML) algorithms are particularly useful for studying complex biological systems because they can efficiently capture non-linear relationships and complex interactions among the driving variables. ML predictive models for TFAA and TPBAA were developed. The TFAA model achieved an R² of 0.86, with FAA such as arginine, asparagine, and isoleucine ranking highest in importance for TFAA prediction. The TPBAA model achieved an R² of 0.95, with PBAA such as Asx (i.e., the combined aspartic acid and asparagine measured after hydrolysis), leucine, and alanine ranking highest in importance for TPBAA prediction. Mathematical equations were generated to describe the relationship of TPBAA with TFAA (TPBAA = B0 + B1 × TFAA) and of protein-bound Met (PBM) with FM (PBM = B0 + B1 × FM), where B1 is the coefficient (slope) and B0 is the intercept of each equation. An ML classification model that distinguishes mutants from controls based on AA metabolomic data was also developed; it achieved an accuracy of 1 with a robust classification report.
The results presented here show that the dual-gRNA CRISPR/Cas9 system offers a rapid and highly efficient genetic tool for knocking out multiple genes simultaneously. Knockout mutations in three GmMGL genes (GmMGL1, GmMGL2, and GmMGL3) were created simultaneously and, as predicted, the resulting soybean genotypes were 'pushed' toward increased FM content. Simultaneous knockout mutations in the 7S genes were also generated to produce protein-rebalanced soybean genotypes. Furthermore, ML predictive models were developed by mining AA metabolomic data; these models can aid in planning soybean AA composition biofortification experiments, especially those using the CRISPR/Cas9 system, in which multiple genes (pathways) can be targeted simultaneously.
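
The abstract describes linear equations of the form TPBAA = B0 + B1 × TFAA and PBM = B0 + B1 × FM, but it does not report the fitted coefficients or the fitting procedure. As an illustration only, the sketch below shows how an intercept B0, a slope B1, and R² could be estimated by ordinary least squares. The function name `fit_line` and the numeric values are hypothetical placeholders, not data or methods from the dissertation.

```python
from statistics import mean


def fit_line(x, y):
    """Ordinary least squares fit of y = b0 + b1 * x.

    Returns (b0, b1, r2): intercept, slope, and coefficient of determination.
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have equal length of at least 2")
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("x must contain at least two distinct values")
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    b1 = sxy / sxx              # slope (B1)
    b0 = y_bar - b1 * x_bar     # intercept (B0)
    ss_res = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - y_bar) ** 2 for yi in y)
    r2 = 1.0 - ss_res / ss_tot if ss_tot else 1.0
    return b0, b1, r2


# Placeholder values for illustration only; NOT measurements from the study.
tfaa = [1.0, 2.0, 3.0, 4.0, 5.0]      # hypothetical total free AA per sample
tpbaa = [3.1, 4.9, 7.2, 8.8, 11.0]    # hypothetical total protein-bound AA

b0, b1, r2 = fit_line(tfaa, tpbaa)
print(f"TPBAA = {b0:.3f} + {b1:.3f} * TFAA (R^2 = {r2:.3f})")
```

The same function could be applied to FM and PBM values to obtain the second equation. A simple closed-form fit is used here because the abstract states a single-predictor linear form; the ML models it mentions for TFAA and TPBAA prediction are a separate, non-linear analysis that this sketch does not attempt to reproduce.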
