Edge-Guided Recurrent Positioning Network for Salient Object Detection in Optical Remote Sensing Images

GSM演进的增强数据速率突出计算机科学编码器人工智能对象（语法）计算机视觉解码方法特征（语言学）代表（政治）过程（计算）遥感地理算法政治操作系统哲学语言学法学政治学

作者

Xiaofei Zhou,Kunye Shen,Li Weng,Runmin Cong,Bolun Zheng,Jiyong Zhang,Chenggang Yan

出处

期刊：IEEE transactions on cybernetics [Institute of Electrical and Electronics Engineers]
日期：2022-04-13 卷期号：53 (1): 539-552 被引量：105

链接

nih.govdoi.org

标识

DOI：10.1109/tcyb.2022.3163152

摘要

Optical remote sensing images (RSIs) have been widely used in many applications, and one of the interesting issues about optical RSIs is the salient object detection (SOD). However, due to diverse object types, various object scales, numerous object orientations, and cluttered backgrounds in optical RSIs, the performance of the existing SOD models often degrade largely. Meanwhile, cutting-edge SOD models targeting optical RSIs typically focus on suppressing cluttered backgrounds, while they neglect the importance of edge information which is crucial for obtaining precise saliency maps. To address this dilemma, this article proposes an edge-guided recurrent positioning network (ERPNet) to pop-out salient objects in optical RSIs, where the key point lies in the edge-aware position attention unit (EPAU). First, the encoder is used to give salient objects a good representation, that is, multilevel deep features, which are then delivered into two parallel decoders, including: 1) an edge extraction part and 2) a feature fusion part. The edge extraction module and the encoder form a U-shape architecture, which not only provides accurate salient edge clues but also ensures the integrality of edge information by extra deploying the intraconnection. That is to say, edge features can be generated and reinforced by incorporating object features from the encoder. Meanwhile, each decoding step of the feature fusion module provides the position attention about salient objects, where position cues are sharpened by the effective edge information and are used to recurrently calibrate the misaligned decoding process. After that, we can obtain the final saliency map by fusing all position attention cues. Extensive experiments are conducted on two public optical RSIs datasets, and the results show that the proposed ERPNet can accurately and completely pop-out salient objects, which consistently outperforms the state-of-the-art SOD models.

求助该文献

最长约 10秒，即可获得该文献文件

Edge-Guided Recurrent Positioning Network for Salient Object Detection in Optical Remote Sensing Images

今日热心研友