过度拟合
MNIST数据库
计算机科学
人工智能
乙状窦函数
激活函数
深度学习
机器学习
人工神经网络
一般化
卷积神经网络
感知器
模式识别(心理学)
数学
数学分析
作者
Haidong Li,Jiongcheng Li,Guan Xiaoming,Liang Binghao,Lai Yuting,Xinglong Luo
标识
DOI:10.1109/cis.2019.00025
摘要
Deep learning has been widely used in search engines, data mining, machine learning, natural language processing, multimedia learning, voice recognition, recommendation system, and other related fields. In this paper, a deep neural network based on multilayer perceptron and its optimization algorithm are studied. MNIST handwritten digital datasets were used to verify the reliability of the model and the optimization algorithm. The important correlation between the overfitting and the activation functions, network structures, training epochs, learning rates is verified. The study of overfitting is of great significance to reduce generalization error. This paper proposes an innovative activation function called: modified-sigmoid which is based on the well-known sigmoid function. This activation function can effectively improve the accuracy of the model and inhibit the overfitting problem.
科研通智能强力驱动
Strongly Powered by AbleSci AI