主成分分析
混合的
最佳线性无偏预测
基因组选择
数据集
粮食产量
农学
统计
数学
环境科学
生物技术
生物
选择(遗传算法)
单核苷酸多态性
计算机科学
基因型
机器学习
遗传学
基因
作者
Jingxin Wang,Liwei Liu,Kunhui He,Takele Weldu Gebrewahid,Shang Gao,Qiu Tian,Zhanyi Li,Yiqun Song,Y. Y. Guo,Yanwei Li,Qinxin Cui,Luyan Zhang,Jiankang Wang,Changling Huang,Liang Li,Tingting Guo,Huihui Li
摘要
ABSTRACT Incorporating genotype‐by‐environment (GE) interaction effects into genomic prediction (GP) models with multi‐environment climate data can improve selection accuracy to accelerate crop breeding but has received little research attention. Here, we conducted a cross‐region GP study of grain moisture content (GMC) and grain yield (GY) in maize hybrids in two major Chinese growing regions using data for 19 climatic factors across 34 environments in 2020 and 2021. Predictions were conducted in 2,126 hybrids generated from 475 maize inbred lines, using 9,355 single nucleotide polymorphism markers for genotyping. Models based on genomic best linear unbiased prediction (GBLUP) incorporating GE interaction effects of 19 climatic factors associated with day length, transpiration, temperature, and radiation (GBLUP‐GE 19CF ) trained on whole data set outperformed the traditional GBLUP or BayesB models in predicting GMC or GY by 10‐fold cross‐validation, achieving prediction accuracies of 0.731 and 0.331, respectively. To refine the climate data, we examined 84 statistical features associated with these climatic factors and identified nine factors most correlated with GMC or GY. Principal component analysis of climate data yielded nine principal components responsible for 97% of the variability in the data. Incorporating these nine factors or principal components into the GBLUP‐GE framework with a similarity matrix of environments (GBLUP‐GE 9CF and GBLUP‐GE PCA ) provided similar prediction accuracies but could reduce the computational burden. In addition, increasing the number of test set environments in the training set from 8 to 14 increased the prediction accuracy of GBLUP‐GE 19CF trained with monthly average climate data for 2020–2021. Examining prediction accuracy based on concordance, the proportion of overlapping hybrids between the top 50% of predicted and observed values for GMC and GY, indicated that concordance exceeded 50% for the GBLUP‐GE 19CF model, confirming the reliability of our predictions. This study can provide practical guidance for optimizing GPs for maize breeding programs in multi‐environment selection.
科研通智能强力驱动
Strongly Powered by AbleSci AI