计算机科学
寨卡病毒
主成分分析
人工智能
数据挖掘
机器学习
数据集
支持向量机
逻辑回归
投影寻踪
监督学习
模式识别(心理学)
人工神经网络
医学
病毒
病毒学
作者
Enrique Peláez,Fernanda Bertuccez Cordeiro,Washington Cardenas Medranda,Michael P. Barrett,Mildred Zambrano,Mary Regato
标识
DOI:10.1109/la-cci47412.2019.9037029
摘要
Data analysis for metabolomic studies is challenging considering the number of statistical tools and standardization processes, which provides different results and projection in a single study. In addition, generation of high complexity data is common for untargeted metabolomics, requiring careful analysis and interpretation of results. In order to propose an innovative method for the analysis of a mass spectrometry metabolomics dataset, data from a Zika virus study was used. The analysis of this dataset combined principal component analysis and supervised learning methods such as support vector machines and logistic regression, to provide a truthful prediction model for discriminating samples of individuals with Zika virus infection and healthy controls. These supervised methods were used to learn the features that configured the "fingerprint" for the viral infection, showing over 98% of accuracy in a validation set. This model could be used as a fast and reliable test for determining Zika virus infections as part of healthcare services. Furthermore, this novel method shows potential for diagnosing other arboviral diseases.
科研通智能强力驱动
Strongly Powered by AbleSci AI