人工智能
机器学习
计算机科学
鉴定(生物学)
贝叶斯定理
工作流程
朴素贝叶斯分类器
保留时间
分子描述符
人工神经网络
数据挖掘
支持向量机
化学
数量结构-活动关系
贝叶斯概率
数据库
植物
生物
色谱法
作者
Joshua Klingberg,Adam Cawley,Ronald Shimmon,Shanlin Fu
摘要
The constant evolution of the illicit drug market makes the identification of unknown compounds problematic. Obtaining certified reference materials for a broad array of new analogues can be difficult and cost prohibitive. Machine learning provides a promising avenue to putatively identify a compound before confirmation against a standard. In this study, machine learning approaches were used to develop class prediction and retention time prediction models. The developed class prediction model used a naïve Bayes architecture to classify opioids as belonging to either the fentanyl analogues, AH series or U series, with an accuracy of 89.5%. The model was most accurate for the fentanyl analogues, most likely due to their greater number in the training data. This classification model can provide guidance to an analyst when determining a suspected structure. A retention time prediction model was also trained for a wide array of synthetic opioids. This model utilised Gaussian process regression to predict the retention time of analytes based on multiple generated molecular features with 79.7% of the samples predicted within ±0.1 min of their experimental retention time. Once the suspected structure of an unknown compound is determined, molecular features can be generated and input for the prediction model to compare with experimental retention time. The incorporation of machine learning prediction models into a compound identification workflow can assist putative identifications with greater confidence and ultimately save time and money in the purchase and/or production of superfluous certified reference materials.
科研通智能强力驱动
Strongly Powered by AbleSci AI