人工智能
计算机科学
目标检测
模式识别(心理学)
突出
特征(语言学)
融合
计算机视觉
RGB颜色模型
对象(语法)
特征提取
哲学
语言学
作者
Rui Huang,Qingyi Zhao,Yan Xing,Sihua Gao,Weifeng Xu,Yuxiang Zhang,Wei Fan
标识
DOI:10.1109/icassp48485.2024.10447807
摘要
Multiscale convolutional neural network (CNN) has demonstrated remarkable capabilities in solving various vision problems. However, fusing features of different scales always results in large model sizes, impeding the application of multiscale CNNs in RGB-D saliency detection. In this paper, we propose a customized feature fusion module, called Saliency Enhanced Feature Fusion (SEFF), for RGB-D saliency detection. SEFF utilizes saliency maps of the neighboring scales to enhance the necessary features for fusing, resulting in more representative fused features. Our multiscale RGB-D saliency detector uses SEFF and processes images with three different scales. SEFF is used to fuse the features of RGB and depth images, as well as the features of decoders at different scales. Extensive experiments on five benchmark datasets have demonstrated the superiority of our method over ten SOTA saliency detectors.
科研通智能强力驱动
Strongly Powered by AbleSci AI