Missing data
Multivariate statistics
Multivariate analysis
Statistics
Exploratory data analysis
Computer science
Mathematics
Authors
Julie Josse,François Husson
Source
Journal: Le Centre pour la Communication Scientifique Directe - HAL - Diderot
Date: 2012-01-01
Citations: 25
Abstract
This paper is a written version of the talk Julie Josse delivered at the 44th Journées de Statistique (Bruxelles,
2012), when she was awarded the Marie-Jeanne Laurent-Duhamel prize for her Ph.D. dissertation by the French Statistical
Society. It presents an overview of some results from Julie Josse and François Husson’s papers, as well as new
challenges in the field of handling missing values in exploratory multivariate data analysis methods and especially in
principal component analysis (PCA). First we describe a regularized iterative PCA algorithm to provide point estimates
of the principal axes and components and to overcome the major issue of overfitting. Then, we give insight into the
variance of the parameters using a nonparametric multiple imputation procedure. Finally, we discuss the problem of the
choice of the number of dimensions and we detail cross-validation approximation criteria. The proposed methodology
is implemented in the R package missMDA.
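
To make the idea of an iterative PCA with regularization more concrete, here is a minimal NumPy sketch. It is not the missMDA implementation and not necessarily the exact algorithm of the paper. The function name `iterative_pca_impute`, its parameters, and the shrinkage rule (the noise variance is taken as the mean of the discarded squared singular values) are assumptions made for illustration only. The sketch also assumes every column has at least one observed value.

```python
import numpy as np


def iterative_pca_impute(X, ncp=2, regularized=True, max_iter=1000, tol=1e-6):
    """Fill NaN entries of X by alternating a rank-`ncp` PCA fit and imputation.

    Hypothetical helper for illustration; names and defaults are assumptions.
    """
    X = np.asarray(X, dtype=float)
    missing = np.isnan(X)

    # Step 0: start from the column means of the observed values.
    filled = np.where(missing, np.nanmean(X, axis=0), X)

    for _ in range(max_iter):
        # Step 1: PCA (via SVD) of the centered, currently completed matrix.
        mean = filled.mean(axis=0)
        U, d, Vt = np.linalg.svd(filled - mean, full_matrices=False)

        d_kept = d[:ncp].copy()
        if regularized and ncp < len(d):
            # Assumed shrinkage: estimate the noise variance from the
            # discarded dimensions and shrink the kept singular values.
            sigma2 = np.mean(d[ncp:] ** 2)
            d_kept = np.maximum(d_kept**2 - sigma2, 0.0) / np.maximum(d_kept, 1e-12)

        # Step 2: low-rank reconstruction, then overwrite only the missing cells.
        fitted = (U[:, :ncp] * d_kept) @ Vt[:ncp] + mean
        new_filled = np.where(missing, fitted, X)

        # Step 3: stop when the imputed values no longer change.
        change = np.sum((new_filled - filled) ** 2)
        filled = new_filled
        if change < tol:
            break

    return filled


if __name__ == "__main__":
    # Small example: a rank-2 table with two cells removed.
    a = np.arange(10.0)
    X_full = np.outer(a, [1.0, 2.0, 3.0, 4.0]) + np.outer(a**2, [1.0, 0.0, -1.0, 2.0])
    X_obs = X_full.copy()
    X_obs[0, 1] = np.nan
    X_obs[3, 2] = np.nan
    print(iterative_pca_impute(X_obs, ncp=2))
```

Shrinking the kept singular values pulls the imputed cells toward the column means when the signal is weak relative to the noise, which is one way to limit the overfitting mentioned in the abstract. Setting `regularized=False` gives the plain iterative PCA for comparison.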