反褶积
离群值
计算机科学
肿瘤浸润淋巴细胞
噪音(视频)
人口
计算生物学
模式识别(心理学)
数据挖掘
人工智能
免疫系统
生物
算法
免疫学
免疫疗法
医学
图像(数学)
环境卫生
作者
Yuning Hao,Ming Yan,Blake R. Heath,Yu Lei,Yuying Xie
标识
DOI:10.1371/journal.pcbi.1006976
摘要
Gene-expression deconvolution is used to quantify different types of cells in a mixed population. It provides a highly promising solution to rapidly characterize the tumor-infiltrating immune landscape and identify cold cancers. However, a major challenge is that gene-expression data are frequently contaminated by many outliers that decrease the estimation accuracy. Thus, it is imperative to develop a robust deconvolution method that automatically decontaminates data by reliably detecting and removing outliers. We developed a new machine learning tool, Fast And Robust DEconvolution of Expression Profiles (FARDEEP), to enumerate immune cell subsets from whole tumor tissue samples. To reduce noise in the tumor gene expression datasets, FARDEEP utilizes an adaptive least trimmed square to automatically detect and remove outliers before estimating the cell compositions. We show that FARDEEP is less susceptible to outliers and returns a better estimation of coefficients than the existing methods with both numerical simulations and real datasets. FARDEEP provides an estimate related to the absolute quantity of each immune cell subset in addition to relative percentages. Hence, FARDEEP represents a novel robust algorithm to complement the existing toolkit for the characterization of tissue-infiltrating immune cell landscape. The source code for FARDEEP is implemented in R and available for download at https://github.com/YuningHao/FARDEEP.git.
科研通智能强力驱动
Strongly Powered by AbleSci AI