计算生物学
生物
肿瘤异质性
基因
RNA序列
基因表达谱
癌症
转录组
基因表达
遗传学
作者
Yusha Liu,Peter Carbonetto,Jason Willwerscheid,Scott A. Oakes,Kay F. Macleod,Matthew Stephens
标识
DOI:10.1101/2023.08.15.553436
摘要
Profiling tumors with single-cell RNA sequencing (scRNA-seq) has the potential to identify recurrent patterns of transcription variation related to cancer progression, and produce new therapeutically relevant insights. However, the presence of strong inter-tumor heterogeneity often obscures more subtle patterns that are shared across tumors, some of which may characterize clinically relevant disease subtypes. Here we introduce a new statistical method, generalized binary covariance decomposition (GBCD), to address this problem. We show that GBCD can help decompose transcriptional heterogeneity into interpretable components — including patient-specific, dataset-specific and shared components relevant to disease subtypes — and that, in the presence of strong inter-tumor heterogeneity, it can produce more interpretable results than existing methods. Applied to data from three studies on pancreatic cancer adenocarcinoma (PDAC), GBCD produces a refined characterization of existing tumor subtypes (e.g., classical vs. basal), and identifies a new gene expression program (GEP) that is prognostic of poor survival independent of established prognostic factors such as tumor stage and subtype. The new GEP is enriched for genes involved in a variety of stress responses, and suggests a potentially important role for the integrated stress response in PDAC development and prognosis.
科研通智能强力驱动
Strongly Powered by AbleSci AI