Increasing methionine content of soybean using CRISPR/Cas9 and developing machine learning predictive models

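Methionine, Amino acid, Limitation, Food science, Biotechnology, Biology, Essential amino acid, Biochemistry, Engineering, Mechanical engineering
Author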
Adama R. Tukuli
Identifier
DOI:10.32469/10355/93992
Abstract

Soybean [Glycine max (L.) Merr.] is an important protein source for both humans and animals. Its relatively low cost, combined with its excellent nutritive value, has made soybean the world's dominant protein feed ingredient. However, soybean protein is relatively poor in sulfur-containing essential amino acids (SCEAA), especially methionine (Met). Met is central to protein synthesis: it is encoded by the start codon that initiates protein synthesis and is therefore essential in all living organisms, including plants. It is the most limiting amino acid, and poultry and swine producers spend roughly US$100 million annually to supplement animal feed with Met. Met supplements that leach out are degraded by bacteria into undesirable volatile sulfides, which can have negative effects on the environment. Hence, a goal of soybean research has been to improve the quality of soy protein by increasing Met levels, creating more complete, high-quality food and feed products. Although a variety of attempts have been made, these efforts have largely failed, with little or no increase in soybean seed Met levels, suggesting a need for new strategies. The low abundance of Met codons in seed storage protein (SSP) genes and Met catabolism (degradation) are major factors that limit the production of total Met in seeds. In this dissertation, a 'push' and 'pull' strategy was used. 'Push' refers to efforts to increase the pool of free Met (FM) available for incorporation into soybean SSP by blocking Met catabolism. 'Pull' refers to efforts to increase the levels of SSP rich in Met codons by knocking out the soybean β-conglycinin genes (Gm7s), which encode relatively Met-poor SSP (7S). Through protein rebalancing, the lack of 7S proteins can be compensated for by increased production of the relatively Met-rich 11S proteins.
These efforts made broad use of CRISPR/Cas9 gene editing tools to knock out the genes for methionine γ-lyase (MGL), a Met catabolic enzyme, and for the 7S SSPs. Consistent with newly emerging literature, a positive connection between high Met content and the synthesis of other amino acids was observed in the generated mutant genotypes. The initial milestone of increasing overall amino acid content in soybean was achieved, as gene-edited mutant lines showed higher 11S and higher Met levels. The exact relationship between free amino acids (FAA) and protein-bound amino acids (PBAA), particularly for soybean, is an open question. Moreover, prediction of total free amino acids (TFAA) and total protein-bound amino acids (TPBAA) from individual AA metabolic data is critical for planning AA biofortification, especially when designing CRISPR/Cas9 edits in which multiple genes or pathways can be targeted. Machine learning (ML) algorithms are particularly useful for studying complex biological systems, as they can efficiently capture non-linear relationships and complex interactions among the driving variables. ML predictive models for TFAA and TPBAA were developed. The TFAA model has an R² of 0.86, with FAA such as arginine, asparagine, and isoleucine showing the highest importance in TFAA predictions. The TPBAA model has an R² of 0.95, with PBAA such as Asx (i.e., aspartate plus asparagine, which are measured together after hydrolysis), leucine, and alanine showing the highest importance in TPBAA predictions. Mathematical equations were generated to describe the relationship of TPBAA with TFAA (TPBAA = B0 + B1 × TFAA) and of protein-bound Met (PBM) with FM (PBM = B0 + B1 × FM), where B1 is the coefficient (slope) and B0 is the intercept of each equation. An ML classification model that differentiates mutants from controls based on AA metabolomic data was also developed, with an accuracy of 1 and a robust classification report.
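The linear relationships quoted above (TPBAA = B0 + B1 × TFAA and PBM = B0 + B1 × FM) have the form of a simple least-squares line. The following is a minimal sketch of how such an intercept, slope, and R² could be estimated. It assumes ordinary least squares, which the abstract does not confirm as the fitting method; the function name `fit_line` and the sample values are hypothetical and are not data from the dissertation.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = b0 + b1 * x.

    Returns (b0, b1, r2): intercept, slope, and coefficient of determination.
    """
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")

    mean_x = sum(x) / n
    mean_y = sum(y) / n

    # Sum of squares of x and cross-products about the means.
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("x values must not all be identical")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))

    b1 = sxy / sxx               # slope (B1)
    b0 = mean_y - b1 * mean_x    # intercept (B0)

    # R^2 = 1 - SS_res / SS_tot
    ss_res = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")
    return b0, b1, r2


if __name__ == "__main__":
    # Synthetic placeholder values, NOT measurements from the dissertation.
    tfaa = [1.0, 2.0, 3.0, 4.0, 5.0]
    tpbaa = [12.1, 13.9, 16.2, 17.8, 20.0]
    b0, b1, r2 = fit_line(tfaa, tpbaa)
    print(f"TPBAA = {b0:.3f} + {b1:.3f} * TFAA (R^2 = {r2:.3f})")
```

The same function would apply to the PBM versus FM relationship by passing FM values as `x` and PBM values as `y`.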
The results presented here show that the dual-gRNA CRISPR/Cas9 system offers a rapid and highly efficient genetic tool for knocking out multiple genes simultaneously. Knockout mutations in three GmMGL genes (GmMGL1, GmMGL2, and GmMGL3) were created simultaneously and, as predicted, the resulting soybean genotypes were 'pushed' toward increased FM content. Simultaneous knockout mutations in the 7S genes were also created to produce protein-rebalanced soybean genotypes. Furthermore, the ML predictive models developed by mining AA metabolomic data can aid in planning soybean AA composition biofortification experiments, especially those using the CRISPR/Cas9 system, in which multiple genes (pathways) can be targeted simultaneously.
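As a rough illustration of the first step in planning gRNAs for such knockouts, the sketch below scans a DNA sequence for candidate target sites. It assumes the commonly used SpCas9 convention of a 20-nt protospacer followed by an NGG PAM; the abstract does not state which Cas9 variant or design tool was used. The function name `find_spcas9_sites` and the example sequence are hypothetical, and only the forward strand is scanned.

```python
import re


def find_spcas9_sites(seq, protospacer_len=20):
    """List candidate forward-strand target sites followed by an NGG PAM.

    Returns a list of (start, protospacer, pam) tuples, where start is the
    0-based index of the first protospacer base in seq.
    """
    seq = seq.upper()
    # A zero-width lookahead lets overlapping candidate sites all be reported.
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % protospacer_len)
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(seq)]


if __name__ == "__main__":
    # Made-up sequence for illustration only, not a soybean gene.
    example = "ATGCATGCATGCATGCATGCAGGTTTACGT"
    for start, protospacer, pam in find_spcas9_sites(example):
        print(start, protospacer, pam)
```

In a dual-gRNA design, two such sites flanking the region of interest would be chosen so that the intervening fragment can be deleted; off-target screening and reverse-strand sites are outside the scope of this sketch.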
