Increasing methionine content of soybean using CRISPR/Cas9 and developing machine learning predictive models

蛋氨酸 氨基酸 限制 食品科学 生物技术 生物 必需氨基酸 生物化学 工程类 机械工程
作者
Adama R. Tukuli
标识
DOI:10.32469/10355/93992
摘要

Soybean [Glycine max (L.) Merr.] is an important protein source for both humans and animals. Its relatively low cost combined with its excellent nutritive value has enabled soybeans to attain elite stature as the world's dominant protein feed ingredient. However, soybean protein is relatively poor in sulfur-containing essential amino acids (SCEAA), especially methionine (Met). The SCEAA Met is central to protein synthesis, and it is encoded by the first codon that initiates protein synthesis and, hence, is essential in all living organisms including plants. It is the most limiting amino acid and roughly US$100 million are spent annually by poultry and swine producers to supplement animal feed with Met. The leaching of Met supplements leads to the formation of undesirable volatile sulfides due to bacterial degradation, which can have negative effects on the environment. Hence, a goal of soybean research has been to improve the quality of soy protein by increasing the levels of Met to create a more complete, high-quality food and feed items. However, although a variety of attempts have been made, these efforts have largely failed, with little or no increase in soybean seed Met levels, suggesting a need for new strategies. Low abundance of Met codons in seed storage proteins (SSP) genes and Met catabolism (or degradation) are major factors that limit the production of total Met in seeds. In this dissertation, a 'push' and 'pull' strategy was used. Push refers to efforts to increase the pool levels of free Met (FM) to be incorporated into soybean SSP by blocking Met catabolism. Pull refers to efforts to increase the levels of SSP rich in Met codons by knocking out the soybean [eszett] -conglycinin genes (Gm7s), which encodes SSP that are relatively Met-poor (7S). Through protein rebalancing, the lack of 7S proteins can be compensated by increased production of the relatively Met-rich 11S proteins. These efforts made broad use of CRISPR/Cas9 gene editing tools to knock-out the genes for Methionine [gamma]-lyase (MGL), a Met catabolic enzyme, and 7S SSPs. Consistent with newly emerging literature, a positive connection between high Met content and the synthesis of other amino acids was observed in the generated mutant genotypes. The initial milestone of increasing overall amino acid content in soybean was achieved as gene edited mutant lines showed higher 11s and higher Met levels. The exact relationship between free amino acids (FAA) and protein bound amino acids (PBAA), particularly for soybean, is an open question. Moreover, prediction of total free amino acid (TFAA) and total protein bound amino acids (TPBAA) from individual AA metabolic data is critical for planning AA biofortification, especially in designing CRISPR/Cas9 edits where multiple genes or pathways can be targeted. Machine learning (ML) algorithms are particularly useful for studying complex biological systems, as they can efficiently capture non-linear relationships and complex interactions among the driving variables. ML predictive models for TFAA and TPBAA were developed. TFAA model shows R2 of 0.86 with FAA such as arginine, asparagine, and isoleucine showing top importance in TFAA predictions. TPBAA model shows R2 of 0.95 with PBAA such as Asx (i.e., output of glutamine and asparagine after hydrolysis), leucine and alanine show top importance in TPBAA predictions. Mathematical equations were generated to explain the relationship of TPBAA with TFAA (TPBAA = B0 + B1TFAA) and protein bound Met (PBM) with FM (PBM = B0 + B1FM) where B1 are coefficients (slopes) and B0 are intercepts. Also, ML classification model to differentiate mutant from controls based on AA metabolomic data was developed with accuracy of 1 and robust classification report. Results presented here showed that the dual-gRNA CRISPR/Cas9 system indeed offers a rapid and highly efficient genetic tool to knockout multiple genes simultaneously. Knock out mutations in three GmMGLs genes (GmMGL1, GmMGL2 and GmMGL3) were simultaneously created and, as predicted, the resulting soybean genotypes were 'pushed' for increased FM content. Simultaneous knock out mutations in 7S genes were also created to create protein rebalanced soybean genotypes. Furthermore, ML predictive models developed from AA metabolomic data mining which can aid in planning soybean AA composition biofortification experiments especially CRISPR/Cas9 system where multiple genes (pathways) can be targeted simultaneously.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
木末发布了新的文献求助20
1秒前
顾矜应助墨与白采纳,获得10
2秒前
打打应助lucky采纳,获得10
3秒前
3秒前
Arlene发布了新的文献求助10
3秒前
3秒前
Hello应助英勇的书本采纳,获得10
4秒前
李健应助小江采纳,获得10
4秒前
4秒前
葛力发布了新的文献求助10
4秒前
5秒前
6秒前
6秒前
体贴的如之完成签到,获得积分10
7秒前
8秒前
8秒前
8秒前
8秒前
搜集达人应助鲨鱼辣椒采纳,获得10
9秒前
許1111发布了新的文献求助10
9秒前
耍酷芙蓉发布了新的文献求助10
10秒前
辛勤枕头完成签到,获得积分10
10秒前
渡边曜发布了新的文献求助10
10秒前
11秒前
12秒前
lmq发布了新的文献求助10
12秒前
12秒前
隐形期待发布了新的文献求助10
12秒前
脑洞疼应助哈哈采纳,获得10
12秒前
万能图书馆应助llxgjx采纳,获得10
13秒前
没天赋发布了新的文献求助10
13秒前
科研通AI6.1应助zc采纳,获得10
13秒前
冰阔落发布了新的文献求助10
14秒前
不易完成签到,获得积分10
15秒前
忧心的若云完成签到,获得积分10
15秒前
zhonglv7应助苏州河采纳,获得10
15秒前
15秒前
辛勤枕头发布了新的文献求助10
15秒前
刘小蕊发布了新的文献求助10
16秒前
16秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Modern Epidemiology, Fourth Edition 5000
Handbook of pharmaceutical excipients, Ninth edition 5000
Kinesiophobia : a new view of chronic pain behavior 5000
Molecular Biology of Cancer: Mechanisms, Targets, and Therapeutics 3000
Digital Twins of Advanced Materials Processing 2000
Weaponeering, Fourth Edition – Two Volume SET 2000
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 纳米技术 化学工程 生物化学 物理 计算机科学 内科学 复合材料 催化作用 物理化学 光电子学 电极 冶金 细胞生物学 基因
热门帖子
关注 科研通微信公众号,转发送积分 6019078
求助须知:如何正确求助?哪些是违规求助? 7611249
关于积分的说明 16160998
捐赠科研通 5166790
什么是DOI,文献DOI怎么找? 2765444
邀请新用户注册赠送积分活动 1747168
关于科研通互助平台的介绍 1635478