计算机科学
帕斯卡(单位)
定位
人工智能
分割
跳跃式监视
最小边界框
目标检测
对象(语法)
背景(考古学)
计算机视觉
图像分割
模式识别(心理学)
图像(数学)
地理
考古
程序设计语言
作者
Tsung-Yi Lin,Michael Maire,Serge Belongie,James Hays,Pietro Perona,Deva Ramanan,Piotr Dollár,C. Lawrence Zitnick
标识
DOI:10.1007/978-3-319-10602-1_48
摘要
We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding. This is achieved by gathering images of complex everyday scenes containing common objects in their natural context. Objects are labeled using per-instance segmentations to aid in precise object localization. Our dataset contains photos of 91 objects types that would be easily recognizable by a 4 year old. With a total of 2.5 million labeled instances in 328k images, the creation of our dataset drew upon extensive crowd worker involvement via novel user interfaces for category detection, instance spotting and instance segmentation. We present a detailed statistical analysis of the dataset in comparison to PASCAL, ImageNet, and SUN. Finally, we provide baseline performance analysis for bounding box and segmentation detection results using a Deformable Parts Model.
科研通智能强力驱动
Strongly Powered by AbleSci AI