物候学
核(代数)
加性模型
水准点(测量)
核方法
计算机科学
计算
支持向量机
基因组学
最佳线性无偏预测
高斯分布
生物
线性模型
机器学习
计算生物学
人工智能
基因组
算法
数学
遗传学
地理
选择(遗传算法)
大地测量学
量子力学
物理
组合数学
基因
作者
Germano Costa‐Neto,Roberto Fritsche‐Neto,José Crossa
出处
期刊:Heredity
[Springer Nature]
日期:2020-08-27
卷期号:126 (1): 92-106
被引量:111
标识
DOI:10.1038/s41437-020-00353-1
摘要
Abstract Modern whole-genome prediction (WGP) frameworks that focus on multi-environment trials (MET) integrate large-scale genomics, phenomics, and envirotyping data. However, the more complex the statistical model, the longer the computational processing times, which do not always result in accuracy gains. We investigated the use of new kernel methods and modeling structures involving genomics and nongenomic sources of variation in two MET maize data sets. Five WGP models were considered, advancing in complexity from a main-effect additive model (A) to more complex structures, including dominance deviations (D), genotype × environment interaction (AE and DE), and the reaction-norm model using environmental covariables (W) and their interaction with A and D (AW + DW). A combination of those models built with three different kernel methods, Gaussian kernel (GK), Deep kernel (DK), and the benchmark genomic best linear-unbiased predictor (GBLUP/GB), was tested under three prediction scenarios: newly developed hybrids (CV1), sparse MET conditions (CV2), and new environments (CV0). GK and DK outperformed GB in prediction accuracy and reduction of computation time (~up to 20%) under all model–kernel scenarios. GK was more efficient in capturing the variation due to A + AE and D + DE effects and translated it into accuracy gains (~up to 85% compared with GB). DK provided more consistent predictions, even for more complex structures such as W + AW + DW. Our results suggest that DK and GK are more efficient in translating model complexity into accuracy, and more suitable for including dominance and reaction-norm effects in a biologically accurate and faster way.
科研通智能强力驱动
Strongly Powered by AbleSci AI