计算机科学
RGB颜色模型
特征(语言学)
比例(比率)
情态动词
人工智能
突出
对象(语法)
接头(建筑物)
模式识别(心理学)
算法
工程类
高分子化学
化学
哲学
物理
量子力学
语言学
建筑工程
作者
Xian Fang,Mingfeng Jiang,Jinchao Zhu,Xiuli Shao,Hongpeng Wang
标识
DOI:10.1016/j.patcog.2022.109139
摘要
• A nested dual attention module (NDAM) is proposed to explicitly exploit the combined features of RGB and depth flows. • An adjacent interactive aggregation module (AIAM) is proposed to gradually integrate the neighbor features of high, middle and low levels. • A joint hybrid optimization loss (JHOL) is proposed to make the predictions have a prominent outline. • A novel multi-modal and multi-scale refined network (M2RNet) is proposed for salient object detection. • Extensive experiments demonstrate that our method achieves consistently superior performance against 12 state-of-the-art approaches. Salient object detection is a fundamental topic in computer vision, which has promising application prospects. The previous methods based on RGB-D may potentially suffer from the incompatibility of multi-modal feature fusion and the insufficiency of multi-scale feature aggregation. To tackle these two dilemmas, we propose a novel multi-modal and multi-scale refined network (M 2 RNet). Specifically, three essential components are presented in this network. The nested dual attention module (NDAM) explicitly exploits the combined features of RGB and depth flows. The adjacent interactive aggregation module (AIAM) gradually integrates the neighbor features of high, middle and low levels. The joint hybrid optimization loss (JHOL) makes the predictions have a prominent outline. Extensive experiments quantitatively and qualitatively demonstrate that our method outperforms other state-of-the-art approaches.
科研通智能强力驱动
Strongly Powered by AbleSci AI