判别式
联营
人工智能
计算机科学
卷积神经网络
模式识别(心理学)
跳跃式监视
最小边界框
深度学习
上下文图像分类
代表(政治)
图像(数学)
简单
对象(语法)
图层(电子)
注释
哲学
化学
有机化学
认识论
政治
政治学
法学
作者
Bolei Zhou,Aditya Khosla,Àgata Lapedriza,Aude Oliva,Antonio Torralba
标识
DOI:10.1109/cvpr.2016.319
摘要
In this work, we revisit the global average pooling layer proposed in [13], and shed light on how it explicitly enables the convolutional neural network (CNN) to have remarkable localization ability despite being trained on imagelevel labels. While this technique was previously proposed as a means for regularizing training, we find that it actually builds a generic localizable deep representation that exposes the implicit attention of CNNs on an image. Despite the apparent simplicity of global average pooling, we are able to achieve 37.1% top-5 error for object localization on ILSVRC 2014 without training on any bounding box annotation. We demonstrate in a variety of experiments that our network is able to localize the discriminative image regions despite just being trained for solving classification task1.
科研通智能强力驱动
Strongly Powered by AbleSci AI