降维
计算机科学
生物学数据
探索性数据分析
数据挖掘
维数之咒
步伐
还原(数学)
数据缩减
高维数据聚类
可视化
多维数据
人工智能
模式识别(心理学)
数学
聚类分析
生物信息学
几何学
大地测量学
生物
地理
作者
Tamasha Malepathirana,Damith Senanayake,Rajith Vidanaarachchi,Vini Gautam,Saman Halgamuge
出处
期刊:BioSystems
[Elsevier]
日期:2022-07-30
卷期号:220: 104749-104749
被引量:6
标识
DOI:10.1016/j.biosystems.2022.104749
摘要
High throughput technologies used in experimental biological sciences produce data with a vast number of variables at a rapid pace, making large volumes of high-dimensional data available. The exploratory analysis of such high-dimensional data can be aided by human interpretable low-dimensional visualizations. This work investigates how both discrete and continuous structures in biological data can be captured using the recently proposed dimensionality reduction method SONG, and compares the results with commonly used methods UMAP and PHATE. Using simulated and real-world datasets, we observe that SONG produces insightful visualizations by preserving various patterns, including discrete clusters, continuums, and branching structures in all considered datasets. More importantly, for datasets containing both discrete and continuous structures, SONG performs better at preserving both the structures compared to UMAP and PHATE. Furthermore, our quantitative evaluation of the three methods using downstream analysis validates the on par quality of the SONG's low-dimensional embeddings compared to the other methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI