计算机科学
人工智能
图像分割
计算机视觉
分割
模式识别(心理学)
机制(生物学)
变压器
电压
工程类
哲学
电气工程
认识论
作者
Ju He,Jianfeng Chen,Hu Xu,Yang Yu
出处
期刊:IEEE Transactions on Geoscience and Remote Sensing
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:62: 1-17
被引量:2
标识
DOI:10.1109/tgrs.2024.3368659
摘要
Forward-looking sonar (FLS) image segmentation plays a significant role in ocean engineering. However, the existing image segmentation algorithms present difficulties in extracting features from FLS images with weak semantic information, complex backgrounds and strong environmental noises. Convolutional neural networks (CNNs) have demonstrated remarkable capabilities in semantic segmentation tasks, but the locality of convolution limits the ability to extract global context and long-range semantic information. The effective extraction of global contextual information is indispensable for achieving accurate segmentation results in sonar image processing. In this paper, we propose a novel semantic segmentation architecture for forward-looking sonar images called SonarNet. SonarNet is based on a hybrid CNN-Transformer-HOG framework and comprises four modules. 1) The Global-Local Encoder can extract both global and detailed feature information of the underwater target; 2) the Network Decoder converts the high-semantic feature map into a pixel-level classification; 3) as a bridge between dual encoders, the Global-Local Fusion Module ensures semantic consistency between different encoders; 4) the HOG Feature Encoder and Fusion can extract traditional manual features and perform feature alignment. We conducted comprehensive ablation experiments to validate the efficacy of the designed modules. Finally, experimentation revealed that SonarNet significantly outperforms other CNN-based and CNN-Transformer FLS image segmentation methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI