计算机科学
可视化
范畴变量
降维
嵌入
数据挖掘
高维数据聚类
数据可视化
维数之咒
高维
算法
理论计算机科学
人工智能
机器学习
聚类分析
作者
Jayesh Soni,Nagarajan Prabakar,Himanshu Upadhyay
出处
期刊:Transactions on computational science and computational intelligence
日期:2020-01-01
卷期号:: 189-206
被引量:16
标识
DOI:10.1007/978-3-030-43981-1_9
摘要
Data visualization is a powerful tool and widely adopted by organizations for its effectiveness to abstract the right information, understand, and interpret results clearly and easily. The real challenge in any data science exploration is to visualize it. Visualizing a discrete, categorical data attribute using bar plots, pie charts are a few of the effective ways for data exploration. Most of the datasets have a large number of features. In other words, data is distributed across a high number of dimensions. Visually exploring such high-dimensional data can then become challenging and even practically impossible to do manually. Hence it is essential to understand how to visualize high-dimensional datasets. t-Distributed stochastic neighbor embedding (t-SNE) is a technique for dimensionality reduction and explicitly applicable to the visualization of high-dimensional datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI