管道(软件)
计算机科学
人工神经网络
数据类型
人工智能
软件
大数据
深度学习
机器学习
数据挖掘
作者
Yating Lin,Haojun Li,Xiaowei Xu,Lei Zhang,Kejia Wang,Jingbo Zhao,Minshu Wang,Frank Zheng,Minwei Zhang,Wenxian Yang,Jiahuai Han,Rongshan Yu
出处
期刊:Patterns
[Elsevier]
日期:2022-02-01
卷期号:: 100440-100440
标识
DOI:10.1016/j.patter.2022.100440
摘要
Summary
Understanding the immune cell abundance of cancer and other disease-related tissues has an important role in guiding disease treatments. Computational cell type proportion estimation methods have been previously developed to derive such information from bulk RNA sequencing data. Unfortunately, our results show that the performance of these methods can be seriously plagued by the mismatch between training data and real-world data. To tackle this issue, we propose the DAISM-DNNXMBD (XMBD: Xiamen Big Data, a biomedical open software initiative in the National Institute for Data Science in Health and Medicine, Xiamen University, China.) (denoted as DAISM-DNN) pipeline that trains a deep neural network (DNN) with dataset-specific training data populated from a certain amount of calibrated samples using DAISM, a novel data augmentation method with an in silico mixing strategy. The evaluation results demonstrate that the DAISM-DNN pipeline outperforms other existing methods consistently and substantially for all the cell types under evaluation in real-world datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI