表型
计算生物学
单细胞分析
生物
单细胞测序
移液管
细胞
计算机科学
遗传学
基因
化学
物理化学
外显子组测序
作者
Xinjia Ruan,C. Yu,Yuqing Ye,Yuhang Wang,Xinyi Chen,Yuqing Yang,Tiantian Liu,Fangrong Yan
摘要
Single-cell RNA sequencing has revealed cellular heterogeneity in complex tissues, notably benefiting research on diseases such as cancer. However, the integration of single-cell data from small samples with extensive clinical features in bulk data remains underexplored. In this study, we introduce PIPET, an algorithmic method for predicting relevant subpopulations in single-cell data based on multivariate phenotypic information from bulk data. PIPET generates feature vectors for each phenotype from differentially expressed genes in bulk data and then identifies relevant cellular subpopulations by assessing the similarity between single-cell data and these vectors. Subsequently, phenotype-related cell states can be analyzed based on these subpopulations. In simulated datasets, PIPET showed robust performance in predicting multiclassification cellular subpopulations. Application of PIPET to lung adenocarcinoma single-cell RNA sequencing data revealed cellular subpopulations with poor survival and associations with TP53 mutations. Similarly, in breast cancer single-cell data, PIPET identified cellular subpopulations associated with the PAM50 clinical subtypes and triple-negative breast cancer subtypes. Overall, PIPET effectively identified relevant cellular subpopulations in single-cell data, guided by phenotypic information from bulk data. This approach comprehensively delineates the molecular characteristics of each cellular subpopulation, offering insights into disease-related subpopulations and guiding personalized treatment strategies.
科研通智能强力驱动
Strongly Powered by AbleSci AI