计算机科学
卷积(计算机科学)
卷积神经网络
人工智能
深度学习
建筑
模式识别(心理学)
上下文图像分类
图像(数学)
机器学习
比例(比率)
计算机视觉
人工神经网络
地图学
艺术
视觉艺术
地理
作者
Karen Simonyan,Andrew Zisserman
出处
期刊:Computer Vision and Pattern Recognition
日期:2014-09-04
被引量:71827
摘要
In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3x3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision.
科研通智能强力驱动
Strongly Powered by AbleSci AI