Large-scale phenotyping in dairy sector using milk MIR spectra: Key factors affecting the quality of predictions

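Robustness (evolution), Predictive modelling, Computer science, Mean squared error, Statistics, Econometrics, Scale (ratio), Biology, Quality (philosophy), Biotechnology, Mathematics, Geography, Cartography, Biochemistry, Philosophy, Epistemology, Gene
Authors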
Clément Grelet,Pierre Dardenne,Hélène Soyeurt,J.Á. Fernández,Amélie Vanlierde,François Stevens,Nicolas Gengler,Frédéric Dehareng
Source
Journal: Methods [Elsevier BV]
Volume: 186, pp. 97-111 Cited by: 60
Identifier
DOI:10.1016/j.ymeth.2020.07.012
Abstract

Methods and technologies enabling the large-scale estimation of important traits for the dairy sector are of great interest. These phenotypes are necessary to improve herd management, animal genetic evaluation, and milk quality control. In recent years, research has been very active in predicting new phenotypes from the mid-infrared (MIR) analysis of milk. Models have been developed to predict phenotypes such as fine milk composition, milk technological properties, and traits related to cow health, fertility and environmental impact. Most models were developed within research contexts and were often not designed for routine use. Implementing models at a large scale to predict new traits of interest brings new challenges, as the factors influencing the robustness of models are poorly documented. The first objective of this work is to highlight the impact on prediction accuracy of factors such as the variability of the spectral and reference data, the spectral regions used, and the complexity of models. The second objective is to emphasize methods and indicators for evaluating the quality of models and the quality of predictions generated under routine conditions. The last objective is to outline the issues and solutions linked with the use and transfer of models on a large number of instruments. Based on partial least squares regression and 10 datasets including milk MIR spectra and reference quantitative values for 57 traits of interest, the impact of the different factors is illustrated by evaluating their influence on the validation root mean square error of prediction (RMSEP). In the displayed examples, all factors, when well set up, increase the quality of predictions, with an improvement of the RMSEP ranging from 12% to 43%. This work also aims to underline the need for, and the complementarity between, different validation procedures, statistical parameters and quality assurance methods. Finally, when using and transferring models, the impact of spectral standardization on prediction reproducibility is highlighted, with an improvement of up to 86% with the tested models, and the monitoring of individual spectrometer stability over time appears essential. This list, inspired by our experience, is of course not exhaustive. The displayed results are only examples, not general rules, and other aspects play a role in the quality of final predictions. However, this work highlights good practices, methods and indicators to increase and evaluate the quality of phenotypes predicted at a large scale. The results obtained argue for the development of guidelines at the international level, as well as international collaborations, in order to constitute large and robust datasets and enable the use of models in routine conditions.
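The abstract reports its results as percentage improvements of the validation RMSEP. As a minimal sketch of how such a figure can be computed, the Python below uses the standard definition of RMSEP (square root of the mean squared difference between predicted and reference values). The function names `rmsep` and `relative_improvement` and all numeric values are illustrative assumptions, not taken from the paper, and the paper's own validation procedure may differ in detail.

```python
import math


def rmsep(reference, predicted):
    """Root mean square error of prediction over a validation set."""
    if len(reference) != len(predicted) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(p - r) ** 2 for r, p in zip(reference, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def relative_improvement(rmsep_before, rmsep_after):
    """Percentage reduction of the RMSEP relative to its starting value."""
    if rmsep_before <= 0:
        raise ValueError("rmsep_before must be positive")
    return 100.0 * (rmsep_before - rmsep_after) / rmsep_before


# Made-up reference values and two sets of predictions (hypothetical data,
# standing in for a model before and after one factor is better set up).
reference = [3.0, 4.0, 5.0, 6.0]
baseline_pred = [3.4, 3.6, 5.4, 5.6]
improved_pred = [3.2, 3.8, 5.2, 5.8]

before = rmsep(reference, baseline_pred)
after = rmsep(reference, improved_pred)
print(f"RMSEP before: {before:.3f}, after: {after:.3f}")
print(f"Improvement: {relative_improvement(before, after):.1f}%")
```

A lower RMSEP means predictions lie closer to the reference values, so a positive percentage here corresponds to the kind of improvement the abstract describes.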
