随机森林
支持向量机
朴素贝叶斯分类器
人工智能
计算机科学
机器学习
数据集
试验装置
逻辑回归
算法
试验数据
多元统计
蛋白质组学
数据挖掘
化学
生物化学
基因
程序设计语言
作者
Hyunsoo Kim,Yoseop Kim,Buhm Han,Jin‐Young Jang,Youngsoo Kim
标识
DOI:10.1021/acs.jproteome.9b00268
摘要
Deep learning (DL), a type of machine learning approach, is a powerful tool for analyzing large sets of data that are derived from biomedical sciences. However, it remains unknown whether DL is suitable for identifying contributing factors, such as biomarkers, in quantitative proteomics data. In this study, we describe an optimized DL-based analytical approach using a data set that was generated by selected reaction monitoring-mass spectrometry (SRM-MS), comprising SRM-MS data from 1008 samples for the diagnosis of pancreatic cancer, to test its classification power. Its performance was compared with that of 5 conventional multivariate and machine learning methods: random forest (RF), support vector machine (SVM), logistic regression (LR), k-nearest neighbors (k-NN), and naïve Bayes (NB). The DL method yielded the best classification (AUC 0.9472 for the test data set) of all approaches. We also optimized the parameters of DL individually to determine which factors were the most significant. In summary, the DL method has advantages in classifying the quantitative proteomics data of pancreatic cancer patients, and our results suggest that its implementation can improve the performance of diagnostic assays in clinical settings.
科研通智能强力驱动
Strongly Powered by AbleSci AI