判别式
计算机科学
人工智能
平滑的
图像检索
残余物
特征(语言学)
模式识别(心理学)
卷积神经网络
卷积(计算机科学)
计算机视觉
图像(数学)
人工神经网络
算法
语言学
哲学
作者
Dongyang Hou,Siyuan Wang,Xueqing Tian,Huaqiao Xing
标识
DOI:10.1109/jstars.2022.3208107
摘要
The discriminative ability of image features plays a decisive role in content-based remote sensing image retrieval (CBRSIR). However, the widely-used convolutional neural networks cannot focus on the discriminative features of important scenes, resulting in unsatisfactory retrieval performance in complex contexts. In this paper, an attention-enhanced end-to-end discriminative network with multiscale learning for CBRSIR is proposed to solve this issue. First, a multiscale dilated convolution module is embedded into some of ResNet50's residual blocks to increase the perceptual field and capture the multiscale features of remote sensing image scenes. Then, a lightweight and efficient triplet attention module is added behind each residual block to capture the salient features of remote sensing images and establish the inter-dimensional dependencies using residual transform. In addition, the end-to-end training approach is performed using an online label smoothing loss to reduce the intra-class variance of features and enhance inter-class differentiability. Experimental results on four publicly available remote sensing image datasets show that our network achieves state-of-the-art or competitive performance, especially on complex scene dataset UCMD with an average retrieval precision improvement of 3.23% to 29.35% compared to other new methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI