过度拟合
卷积神经网络
计算机科学
人工智能
深度学习
联营
规范化(社会学)
模式识别(心理学)
卷积(计算机科学)
人工神经网络
机器学习
人类学
社会学
作者
Shengwei Zhou,Caikou Chen,Guojiang Han,Xielian Hou
标识
DOI:10.23919/chicc.2019.8865226
摘要
Since Alex Krizhevsky won the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) 2012 competition by building a very intelligent deep convolutional neural network (D-CNNs), more and more researchers have been engaged in the research and development of deep convolutional neural network (D-CNNs). However, recent researches on deep convolutional neural networks are mostly based on ImageNet datasets. The network model based on such a large dataset is mostly blind to increase the number of network layers, ignoring that most data sets in application are far from the order of magnitude of ImageNet datasets. Such deep networks tend to perform poorly in small datasets (CIFAR-10), since deep models are easy to overfitting. In this paper, we've applied some of the more efficient methods that have been proposed in recent years to traditional deep convolutional neural networks. We proposed a modified Alex network and used this model to fit CIFAR-10. By adding Batch Normalization, using Dilated Convolution and replacing Fully Connected layer (FC) with Global Average Pooling (GAP), we achieved 8.6% error rate on CIFAR-10 without severe overfitting. Our results show that the deep CNN can be used to fit small datasets with proper modifications and the results are much better than before.
科研通智能强力驱动
Strongly Powered by AbleSci AI