风味
机器学习
计算机科学
人工智能
班级(哲学)
随机森林
重采样
过程(计算)
数据挖掘
食品科学
化学
操作系统
作者
Fabio Herrera‐Rocha,Miguel Fernández-Niño,Jorge Duitama,Mónica P. Cala,María José Chica,Ludger A. Wessjohann,Mehdi D. Davari,Andrés Fernando González Barrios
出处
期刊:Research Square - Research Square
日期:2024-08-18
标识
DOI:10.21203/rs.3.rs-4768238/v1
摘要
Abstract Flavor is the main factor driving consumers acceptance of food products. However, tracking the biochemistry of flavor is a formidable challenge due to the complexity of food composition. Current methodologies for linking individual molecules to flavor in foods and beverages are expensive and time-consuming. Predictive models based on machine learning (ML) are emerging as an alternative to speed up this process. Nonetheless, the optimal approach to predict flavor features of molecules remains elusive. In this work we present FlavorMiner, an ML-based multilabel flavor predictor. FlavorMiner seamlessly integrates different combinations of algorithms and mathematical representations, augmented with class balance strategies to address the inherent class of the input dataset. Notably, Random Forest and K-Nearest Neighbors combined with Extended Connectivity Fingerprint and RDKit molecular descriptors consistently outperform other combinations in most cases. Resampling strategies surpass weight balance methods in mitigating bias associated with class imbalance. FlavorMiner exhibits remarkable accuracy, with an average ROC AUC score of 0.88. This algorithm was used to analyze cocoa metabolomics data, unveiling its profound potential to help extract valuable insights from intricate food metabolomics data. FlavorMiner can be used for flavor mining in any food product, drawing from a diverse training dataset that spans over 934 distinct food products.
科研通智能强力驱动
Strongly Powered by AbleSci AI