计算机科学
人工智能
突出
概括性
监督学习
机器学习
编码(集合论)
特征(语言学)
解码方法
人工神经网络
对象(语法)
目标检测
任务(项目管理)
模式识别(心理学)
源代码
算法
哲学
经济
集合(抽象数据类型)
管理
程序设计语言
心理治疗师
操作系统
语言学
心理学
作者
Chang‐De Gong,Gang Yang,Hongbin Dong
标识
DOI:10.1007/978-981-99-4761-4_6
摘要
Weakly supervised salient object detection aims to address the limitations of fully supervised methods that heavily rely on pixel-level data. However, the sparse nature of weak labels often results in suboptimal detection accuracy. Drawing inspiration from human visual attention mechanisms, we propose a Mixed-Supervised Learning method to mitigate this issue. Mixed-Supervised Learning refers to training a neural network with hybrid data. Specifically, we propose a two-stage training strategy. In stage I, the model is supervised by a large number of scribble annotations so that it can roughly locate salient objects. In stage II, a small number of pixel-level labels are used for learning to endow the model with detail decoding capability. Our training strategy decomposes the SOD task into two sub-tasks, object localization and detail refinement, and we design a corresponding network, LRNet, which includes the supplementary detail information, a Feature Attention module (FA), and a Detail Refinement module (DF). The two-stage training strategy is simple and generalizable. Extensive experiments demonstrate the effectiveness of the training strategy, and our model detection accuracy surpasses the existing state-of-the-art models of weakly supervised learning, even reaching fully supervised results. Besides, experiments on COD and RSI SOD tasks demonstrate the generality of our method. Our code will be released at https://github.com/nightmengna/LRNet .
科研通智能强力驱动
Strongly Powered by AbleSci AI