Increasing methionine content of soybean using CRISPR/Cas9 and developing machine learning predictive models

蛋氨酸 氨基酸 限制 食品科学 生物技术 生物 必需氨基酸 生物化学 工程类 机械工程
作者
Adama R. Tukuli
标识
DOI:10.32469/10355/93992
摘要

Soybean [Glycine max (L.) Merr.] is an important protein source for both humans and animals. Its relatively low cost combined with its excellent nutritive value has enabled soybeans to attain elite stature as the world's dominant protein feed ingredient. However, soybean protein is relatively poor in sulfur-containing essential amino acids (SCEAA), especially methionine (Met). The SCEAA Met is central to protein synthesis, and it is encoded by the first codon that initiates protein synthesis and, hence, is essential in all living organisms including plants. It is the most limiting amino acid and roughly US$100 million are spent annually by poultry and swine producers to supplement animal feed with Met. The leaching of Met supplements leads to the formation of undesirable volatile sulfides due to bacterial degradation, which can have negative effects on the environment. Hence, a goal of soybean research has been to improve the quality of soy protein by increasing the levels of Met to create a more complete, high-quality food and feed items. However, although a variety of attempts have been made, these efforts have largely failed, with little or no increase in soybean seed Met levels, suggesting a need for new strategies. Low abundance of Met codons in seed storage proteins (SSP) genes and Met catabolism (or degradation) are major factors that limit the production of total Met in seeds. In this dissertation, a 'push' and 'pull' strategy was used. Push refers to efforts to increase the pool levels of free Met (FM) to be incorporated into soybean SSP by blocking Met catabolism. Pull refers to efforts to increase the levels of SSP rich in Met codons by knocking out the soybean [eszett] -conglycinin genes (Gm7s), which encodes SSP that are relatively Met-poor (7S). Through protein rebalancing, the lack of 7S proteins can be compensated by increased production of the relatively Met-rich 11S proteins. These efforts made broad use of CRISPR/Cas9 gene editing tools to knock-out the genes for Methionine [gamma]-lyase (MGL), a Met catabolic enzyme, and 7S SSPs. Consistent with newly emerging literature, a positive connection between high Met content and the synthesis of other amino acids was observed in the generated mutant genotypes. The initial milestone of increasing overall amino acid content in soybean was achieved as gene edited mutant lines showed higher 11s and higher Met levels. The exact relationship between free amino acids (FAA) and protein bound amino acids (PBAA), particularly for soybean, is an open question. Moreover, prediction of total free amino acid (TFAA) and total protein bound amino acids (TPBAA) from individual AA metabolic data is critical for planning AA biofortification, especially in designing CRISPR/Cas9 edits where multiple genes or pathways can be targeted. Machine learning (ML) algorithms are particularly useful for studying complex biological systems, as they can efficiently capture non-linear relationships and complex interactions among the driving variables. ML predictive models for TFAA and TPBAA were developed. TFAA model shows R2 of 0.86 with FAA such as arginine, asparagine, and isoleucine showing top importance in TFAA predictions. TPBAA model shows R2 of 0.95 with PBAA such as Asx (i.e., output of glutamine and asparagine after hydrolysis), leucine and alanine show top importance in TPBAA predictions. Mathematical equations were generated to explain the relationship of TPBAA with TFAA (TPBAA = B0 + B1TFAA) and protein bound Met (PBM) with FM (PBM = B0 + B1FM) where B1 are coefficients (slopes) and B0 are intercepts. Also, ML classification model to differentiate mutant from controls based on AA metabolomic data was developed with accuracy of 1 and robust classification report. Results presented here showed that the dual-gRNA CRISPR/Cas9 system indeed offers a rapid and highly efficient genetic tool to knockout multiple genes simultaneously. Knock out mutations in three GmMGLs genes (GmMGL1, GmMGL2 and GmMGL3) were simultaneously created and, as predicted, the resulting soybean genotypes were 'pushed' for increased FM content. Simultaneous knock out mutations in 7S genes were also created to create protein rebalanced soybean genotypes. Furthermore, ML predictive models developed from AA metabolomic data mining which can aid in planning soybean AA composition biofortification experiments especially CRISPR/Cas9 system where multiple genes (pathways) can be targeted simultaneously.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
dzz完成签到,获得积分10
1秒前
Baymax给Baymax的求助进行了留言
1秒前
2秒前
李健应助liu采纳,获得10
2秒前
realwd发布了新的文献求助10
2秒前
3秒前
4秒前
烟花应助12345采纳,获得10
4秒前
5秒前
8秒前
8秒前
8秒前
飞云完成签到,获得积分10
8秒前
LHC发布了新的文献求助10
9秒前
小蘑菇应助边伯贤采纳,获得10
9秒前
天天快乐应助Zdh同学采纳,获得10
9秒前
9秒前
香菜丸子完成签到,获得积分10
9秒前
你是八戒呀完成签到,获得积分10
10秒前
10秒前
10秒前
liaoyu完成签到,获得积分10
11秒前
MarcoPolo发布了新的文献求助100
11秒前
李昕123发布了新的文献求助10
11秒前
11秒前
JamesPei应助wangnan采纳,获得10
12秒前
12秒前
希望天下0贩的0应助feihu采纳,获得10
12秒前
Biophysics完成签到,获得积分20
13秒前
13秒前
香菜丸子发布了新的文献求助10
13秒前
勤奋的灵波完成签到 ,获得积分10
13秒前
13秒前
在水一方应助谦让雨珍采纳,获得10
14秒前
14秒前
15秒前
可研完成签到 ,获得积分10
15秒前
15秒前
15秒前
15秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Modern Epidemiology, Fourth Edition 5000
Kinesiophobia : a new view of chronic pain behavior 5000
Molecular Biology of Cancer: Mechanisms, Targets, and Therapeutics 3000
Digital Twins of Advanced Materials Processing 2000
Weaponeering, Fourth Edition – Two Volume SET 2000
Signals, Systems, and Signal Processing 610
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 纳米技术 化学工程 生物化学 物理 计算机科学 内科学 复合材料 催化作用 物理化学 光电子学 电极 冶金 细胞生物学 基因
热门帖子
关注 科研通微信公众号,转发送积分 6018734
求助须知:如何正确求助?哪些是违规求助? 7609016
关于积分的说明 16160056
捐赠科研通 5166454
什么是DOI,文献DOI怎么找? 2765313
邀请新用户注册赠送积分活动 1746922
关于科研通互助平台的介绍 1635411