降维
判别式
聚类分析
计算机科学
图形
人工智能
特征向量
模式识别(心理学)
维数之咒
可视化
特征学习
数据结构
高维数据聚类
理论计算机科学
数据挖掘
程序设计语言
作者
Qi Mao,Li Wang,Steve Goodison,Yijun Sun
出处
期刊:Knowledge Discovery and Data Mining
日期:2015-08-07
卷期号:: 765-774
被引量:94
标识
DOI:10.1145/2783258.2783309
摘要
We present a new dimensionality reduction setting for a large family of real-world problems. Unlike traditional methods, the new setting aims to explicitly represent and learn an intrinsic structure from data in a high-dimensional space, which can greatly facilitate data visualization and scientific discovery in downstream analysis. We propose a new dimensionality-reduction framework that involves the learning of a mapping function that projects data points in the original high-dimensional space to latent points in a low-dimensional space that are then used directly to construct a graph. Local geometric information of the projected data is naturally captured by the constructed graph. As a showcase, we develop a new method to obtain a discriminative and compact feature representation for clustering problems. In contrast to assumptions used in traditional clustering methods, we assume that centers of clusters should be close to each other if they are connected in a learned graph, and other cluster centers should be distant. Extensive experiments are performed that demonstrate that the proposed method is able to obtain discriminative feature representations yielding superior clustering performance, and correctly recover the intrinsic structures of various real-world datasets including curves, hierarchies and a cancer progression path.
科研通智能强力驱动
Strongly Powered by AbleSci AI