聚类分析
计算机科学
约束(计算机辅助设计)
秩(图论)
表达式(计算机科学)
DNA甲基化
张量(固有定义)
一致性(知识库)
数据集成
利用
代表(政治)
投影(关系代数)
表观遗传学
人工智能
计算生物学
数据挖掘
理论计算机科学
机器学习
基因
生物
基因表达
算法
数学
遗传学
组合数学
几何学
政治
计算机安全
程序设计语言
法学
纯数学
政治学
作者
Xiaowei Gao,Yan Wang,Weimin Hou,Zaiyi Liu,Xiaoke Ma
标识
DOI:10.1109/tcbb.2022.3229678
摘要
The accumulated DNA methylation and gene expression provide a great opportunity to exploit the epigenetic patterns of genes, which is the foundation for revealing the underlying mechanisms of biological systems. Current integrative algorithms are criticized for undesirable performance because they fail to address the heterogeneity of expression and methylation data, and the intrinsic relations among them. To solve this issue, a novel multi-view clustering with self-representation learning and low-rank tensor constraint (MCSL-LTC) is proposed for the integration of gene expression and DNA methylation data, which are treated as complementary views. Specifically, MCSL-LTC first learns the low-dimensional features for each view with the linear projection, and then these features are fused in a unified tensor space with low-rank constraints. In this case, the complementary information of various views is precisely captured, where the heterogeneity of omic data is avoided, thereby enhancing the consistency of different views. Finally, MCSL-LTC obtains a consensus cluster of genes reflecting the structure and features of various views. Experimental results demonstrate that the proposed approach outperforms state-of-the-art baselines in terms of accuracy on both the social and cancer data, which provides an effective and efficient method for the integration of heterogeneous genomic data.
科研通智能强力驱动
Strongly Powered by AbleSci AI