计算机科学
保险丝(电气)
人工智能
RGB颜色模型
互补性(分子生物学)
情态动词
卷积神经网络
模式识别(心理学)
成对比较
工程类
遗传学
生物
电气工程
化学
高分子化学
作者
Fasheng Wang,Yiming Su,Ruimin Wang,Jing Sun,Fuming Sun,Haojie Li
出处
期刊:IEEE transactions on artificial intelligence
[Institute of Electrical and Electronics Engineers]
日期:2023-11-20
卷期号:: 1-15
被引量:5
标识
DOI:10.1109/tai.2023.3333827
摘要
Most existing RGB-D salient object detection methods utilize the Convolutional Neural Networks (CNNs) to extract features. However, they fail to extract global information due to the inherent defect of sliding window. On the other hand, with the emergence of depth clues, how to effectively incorporate cross-modal features has become an underlying challenge. In addition, in terms of cross-level feature fusion, most methods do not fully consider the complementarity between different layers and usually adopt simple fusion strategies, thereby leading to the missing of detailed information. To relieve these issues, a Cross-modal and Cross-level Attention Interaction Network (CAINet) is proposed. First, different from most existing methods, we adopt a two-stream Swin Transformers to extract RGB and depth features. Second, a High-level Context Refinement Module (HCRM) is designed to further extract refined features and give accurate guidance in early prediction stage. Third, we design a Cross-modal Interaction Enhancement Module (CIEM) to explore the complementarity of different modalities via co-attention. In terms of fusion for high-level and low-level features in decoding, a Multi-scale Attention Induced Decoder (MAID) is designed to extract and fuse the complementary information at different scales. Finally, the Edge Enhancement Module (EEM) is employed to compensate the dilution of edge. Our proposed CAINet achieves excellent performance compared to other state-of-the-art (SOTA) methods on seven widely used datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI