分割
计算机科学
帕斯卡(单位)
目标检测
人工智能
推论
任务(项目管理)
对象(语法)
机器学习
深度学习
尺度空间分割
多任务学习
计算机视觉
基于分割的对象分类
图像分割
模式识别(心理学)
经济
管理
程序设计语言
作者
Wenwen Zhang,Kunfeng Wang,Yutong Wang,Lan Yan,Fei–Yue Wang
标识
DOI:10.1016/j.neucom.2020.11.024
摘要
Scene understanding comes in many flavors, two of the most popular being object detection and semantic segmentation, which act as two important aspects for scene understanding, and are applied to many areas, such as autonomous driving and intelligent surveillance. Although much progress has already been made, the two tasks of object detection and semantic segmentation are often investigated independently. In practice, scene understanding is complicated, and comprises many sub-tasks, so that research of learning multiple tasks simultaneously with a single model is feasible. With the interrelated goals of these two tasks, there is a strong motivation to improve the object detection accuracy with the help of semantic segmentation, and vice versa. In this paper, we propose a loss-balanced multi-task model for simultaneous object detection and semantic segmentation. We explore multi-task learning with sharing parameters based on deep learning to realize improved object detection and segmentation, and propose a single-stage deep architecture based on multi-task learning, jointly performing object detection and semantic segmentation to boost each other. With no more computation load in the inference compared with the baselines of SSD and FCN, we show that these two tasks, object detection and semantic segmentation, benefit from each other. Experimental results on Pascal VOC and COCO show that our method improves much in object detection and semantic segmentation compared with the corresponding baselines of both tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI