MNIST数据库
模式识别(心理学)
人工智能
计算机科学
局部二进制模式
卷积神经网络
面部识别系统
深度学习
直方图
上下文图像分类
联营
线性判别分析
图像(数学)
作者
Tsung-Han Chan,Kui Jia,Shenghua Gao,Jiwen Lu,Zinan Zeng,Yi Ma
出处
期刊:IEEE transactions on image processing
[Institute of Electrical and Electronics Engineers]
日期:2015-09-01
卷期号:24 (12): 5017-5032
被引量:1480
标识
DOI:10.1109/tip.2015.2475625
摘要
In this work, we propose a very simple deep learning network for image classification which comprises only the very basic data processing components: cascaded principal component analysis (PCA), binary hashing, and block-wise histograms. In the proposed architecture, PCA is employed to learn multistage filter banks. It is followed by simple binary hashing and block histograms for indexing and pooling. This architecture is thus named as a PCA network (PCANet) and can be designed and learned extremely easily and efficiently. For comparison and better understanding, we also introduce and study two simple variations to the PCANet, namely the RandNet and LDANet. They share the same topology of PCANet but their cascaded filters are either selected randomly or learned from LDA. We have tested these basic networks extensively on many benchmark visual datasets for different tasks, such as LFW for face verification, MultiPIE, Extended Yale B, AR, FERET datasets for face recognition, as well as MNIST for hand-written digits recognition. Surprisingly, for all tasks, such a seemingly naive PCANet model is on par with the state of the art features, either prefixed, highly hand-crafted or carefully learned (by DNNs). Even more surprisingly, it sets new records for many classification tasks in Extended Yale B, AR, FERET datasets, and MNIST variations. Additional experiments on other public datasets also demonstrate the potential of the PCANet serving as a simple but highly competitive baseline for texture classification and object recognition.
科研通智能强力驱动
Strongly Powered by AbleSci AI