聚类分析
嵌入
计算机科学
计算生物学
数据挖掘
人工智能
数据科学
生物
作者
Zuqi Li,Sam F. L. Windels,Noël Malod‐Dognin,Seth M. Weinberg,Mary L. Marazita,Susan Walsh,Mark D. Shriver,David W. Fardo,Peter Claes,Nataša Pržulj,Kristel Van Steen
标识
DOI:10.1101/2024.09.23.614478
摘要
Motivation: Combining omics and images, can lead to a more comprehensive clustering of individuals than classic single-view approaches. Among the various approaches for multi-view clustering, nonnegative matrix tri-factorization (NMTF) and nonnegative Tucker decomposition (NTD) are advantageous in learning low-rank embeddings with promising interpretability. Besides, there is a need to handle unwanted drivers of clusterings (i.e. confounders). Results: In this work, we introduce a novel multi-view clustering method based on NMTF and NTD, named INMTD, that integrates omics and 3D imaging data to derive unconfounded subgroups of individuals. In the application to real-life facial-genomic data, INMTD generated biologically relevant embeddings for individuals, genetics and facial morphology. By removing confounded embedding vectors, we derived an unconfounded clustering with better internal and external quality; the genetic and facial annotations of each derived subgroup highlighted distinctive characteristics. In conclusion, INMTD can effectively integrate omics data and 3D images for unconfounded clustering with biologically meaningful interpretation. Availability and implementation: https://github.com/ZuqiLi/INMTD
科研通智能强力驱动
Strongly Powered by AbleSci AI