卷积神经网络
帕斯卡(单位)
目标检测
模式识别(心理学)
深度学习
任务(项目管理)
作者
Jifeng Dai,Kai He,Jian Sun
出处
期刊:Cornell University - arXiv
日期:2015-12-01
被引量:788
标识
DOI:10.1109/iccv.2015.191
摘要
Recent leading approaches to semantic segmentation rely on deep convolutional networks trained with human-annotated, pixel-level segmentation masks. Such pixel-accurate supervision demands expensive labeling effort and limits the performance of deep networks that usually benefit from more training data. In this paper, we propose a method that achieves competitive accuracy but only requires easily obtained bounding box annotations. The basic idea is to iterate between automatically generating region proposals and training convolutional networks. These two steps gradually recover segmentation masks for improving the networks, and vise versa. Our method, called "BoxSup", produces competitive results (e.g., 62.0% mAP for validation) supervised by boxes only, on par with strong baselines (e.g., 63.8% mAP) fully supervised by masks under the same setting. By leveraging a large amount of bounding boxes, BoxSup further yields state-of-the-art results on PASCAL VOC 2012 and PASCAL-CONTEXT [26].
科研通智能强力驱动
Strongly Powered by AbleSci AI