生物
基因型
表型
基因
贮藏蛋白
生物技术
编目
遗传学
计算机科学
图书馆学
作者
Zhaoheng Zhang,Dan Liu,Binyong Li,Wenxi Wang,Jize Zhang,Mingming Xin,Zhaorong Hu,Jie Liu,Jinkun Du,Huiru Peng,Chenyang Hao,Xueyong Zhang,Zhongfu Ni,Qixin Sun,Weilong Guo,Yingyin Yao
标识
DOI:10.1016/j.molp.2024.05.006
摘要
Wheat is a staple food for more than 35% of the world's population, with wheat flour used to make hundreds of baked goods. Superior end-use quality is a major breeding target; however, improving it is especially time-consuming and expensive. Furthermore, genes encoding seed-storage proteins (SSPs) form multi-gene families and are repetitive, with gaps commonplace in several genome assemblies. To overcome these barriers and efficiently identify superior wheat SSP alleles, we developed "PanSK" (Pan-SSP k-mer) for genotype-to-phenotype prediction based on an SSP-based pangenome resource. PanSK uses 29-mer sequences that represent each SSP gene at the pangenomic level to reveal untapped diversity across landraces and modern cultivars. Genome-wide association studies with k-mers identified 23 SSP genes associated with end-use quality that represent novel targets for improvement. We evaluated the effect of rye secalin genes on end-use quality and found that removal of ω-secalins from 1BL/1RS wheat translocation lines is associated with enhanced end-use quality. Finally, using machine-learning-based prediction inspired by PanSK, we predicted the quality phenotypes with high accuracy from genotypes alone. This study provides an effective approach for genome design based on SSP genes, enabling the breeding of wheat varieties with superior processing capabilities and improved end-use quality.
科研通智能强力驱动
Strongly Powered by AbleSci AI