数学
核希尔伯特再生空间
非线性系统
非线性降维
核(代数)
核方法
聚类分析
算法
嵌入
核主成分分析
分布的核嵌入
希尔伯特空间
收敛速度
歧管(流体力学)
应用数学
人工智能
计算机科学
降维
数学分析
支持向量机
离散数学
钥匙(锁)
机械工程
统计
物理
量子力学
工程类
计算机安全
摘要
We propose a kernel-spectral embedding algorithm for learning low-dimensional nonlinear structures from noisy and high-dimensional observations, where the data sets are assumed to be sampled from a nonlinear manifold model and corrupted by high-dimensional noise. The algorithm employs an adaptive bandwidth selection procedure which does not rely on prior knowledge of the underlying manifold. The obtained low-dimensional embeddings can be further utilized for downstream purposes such as data visualization, clustering and prediction. Our method is theoretically justified and practically interpretable. Specifically, for a general class of kernel functions, we establish the convergence of the final embeddings to their noiseless counterparts when the dimension grows polynomially with the size, and characterize the effect of the signal-to-noise ratio on the rate of convergence and phase transition. We also prove the convergence of the embeddings to the eigenfunctions of an integral operator defined by the kernel map of some reproducing kernel Hilbert space capturing the underlying nonlinear structures. Our results hold even when the dimension of the manifold grows with the sample size. Numerical simulations and analysis of real data sets show the superior empirical performance of the proposed method, compared to many existing methods, on learning various nonlinear manifolds in diverse applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI