朴素贝叶斯分类器
支持向量机
随机森林
人工智能
类黄酮
机器学习
计算机科学
涩的
公共化学
化学
模式识别(心理学)
食品科学
生物化学
品味
抗氧化剂
作者
Tianyang Guo,Fei Pan,Zhiyong Cui,Zichen Yang,Qiong Chen,Lei Zhao,Huanlu Song
标识
DOI:10.1021/acs.jafc.2c08822
摘要
Astringency is a puckering or velvety sensation mainly derived from flavonoid compounds in food. The traditional experimental approach for astringent compound discovery was labor-intensive and cost-consuming, while machine learning (ML) can greatly accelerate this procedure. Herein, we propose the Flavonoid Astringency Prediction Database (FAPD) based on ML. First, the Molecular Fingerprint Similarities (MFSs) and thresholds of flavonoid compounds were hierarchically clustering analyzed. For the astringency threshold prediction, four regressions models (i.e., Gaussian Process Regression (GPR), Support Vector Regression (SVR), Random Forest (RF), and Gradient Boosted Decision Tree (GBDT)) were established, and the best model was RF which was interpreted by the SHapley Additive exPlanations (SHAP) approach. For the astringency type prediction, six classification models (i.e., RF, GBDT, Gaussian Naive Bayes (GNB), Support Vector Machine (SVM), k-Nearest Neighbor (kNN), and Stochastic Gradient Descent (SGD)) were established, and the best model was SGD. Furthermore, over 1200 natural flavonoid compounds were discovered and built into the customized FAPD. In FAPD, the astringency thresholds were achieved by RF; the astringency types were distinguished by SGD, and the real and predicted astringency types were verified by t-Distributed Stochastic Neighbor Embedding (t-SNE). Therefore, ML models can be used to predict the astringency threshold and astringency type of flavonoid compounds, which provides a new paradigm to research the molecular structure-flavor property relationship of food components.
科研通智能强力驱动
Strongly Powered by AbleSci AI