人工智能
计算机科学
试验装置
支持向量机
分析物
保留时间
数据集
分子描述符
机器学习
集合(抽象数据类型)
数据挖掘
数量结构-活动关系
模式识别(心理学)
化学
色谱法
程序设计语言
作者
Lina Wu,Xiao Fu,Xiaomin Luo,Keming Yun,Di Wen,Jiaman Lin,Shuo Yang,Tianle Li,Ping Xiang,Yan Shi
出处
期刊:Heliyon
[Elsevier BV]
日期:2023-05-25
卷期号:9 (6): e16671-e16671
标识
DOI:10.1016/j.heliyon.2023.e16671
摘要
Abuse of Synthetic Cannabinoids (SCs) has become a serious threat to public health. Due to the various structural and chemical group modified by criminals, their detection is a major challenge in forensic toxicological identification. Therefore, rapid and efficient identification of SCs is important for forensic toxicology and drug bans. The prediction of an analyte's retention time in liquid chromatography is an important index for the qualitative analysis of compounds and can provide informatics solutions for the interpretation of chromatographic data.In this study, experimental data from high-resolution mass spectrometry (HRMS) are used to construct a regression model for predicting the retention time of SCs using machine learning methods. The prediction ability of the model is improved by adopting a strategy that combines different descriptors in different independent machine-learning methods.The best model was obtained with a method that combined Substructure Fingerprint Count and Finger printer features and the support vector regression (SVR) method, as it exhibited an R2 value of 0.81 for the validation set and 0.83 for the test set. In addition, 4 new SCs were predicted by the optimized model, with a prediction error within 3%.Our study provides a model that can predict the retention time of compounds and it can be used as a filter to reduce false-positive candidates when used in combination with LC-HRMS, especially in the absence of reference standards. This can improve the confidence of identification in non-targeted analysis and the reliability of identifying unknown substances.
科研通智能强力驱动
Strongly Powered by AbleSci AI