亚型
计算生物学
聚类分析
计算机科学
生物
数据挖掘
人工智能
程序设计语言
作者
Yabin Kuang,Minzhu Xie,Z Zhao,De‐Yao Deng,Ergude Bao
出处
期刊:Methods
[Elsevier]
日期:2024-10-17
卷期号:232: 1-8
标识
DOI:10.1016/j.ymeth.2024.09.016
摘要
The identification of cancer subtypes is crucial for advancing precision medicine, as it facilitates the development of more effective and personalized treatment and prevention strategies. With the development of high-throughput sequencing technologies, researchers now have access to a wealth of multi-omics data from cancer patients, making computational cancer subtyping increasingly feasible. One of the main challenges in integrating multi-omics data is handling missing data, since not all biomolecules are consistently measured across all samples. Current computational models based on multi-omics data for cancer subtyping often struggle with the challenge of weakly paired omics data. To address this challenge, we propose a novel unsupervised cancer subtyping model named Subtype-MVCC. This model leverages graph convolutional networks to extract and represent low-dimensional features from each omics data type, using intra-view and inter-view contrastive learning approaches. By incorporating a weighted average fusion strategy to unify the dimension of each sample, Subtype-MVCC effectively handles weakly paired multi-omics datasets. Comprehensive evaluations on established benchmark datasets demonstrate that Subtype-MVCC outperforms nine leading models in this domain. Additionally, simulations with varying levels of missing data highlight the model's robust performance in handling weakly paired omics data. The clinical relevance and survival outcomes associated with the identified subtypes further validate the interpretability and reliability of the clustering results produced by Subtype-MVCC.
科研通智能强力驱动
Strongly Powered by AbleSci AI