计算机科学
快速傅里叶变换
变压器
编码器
卷积神经网络
分割
雷达
人工智能
电子工程
电气工程
算法
电信
工程类
操作系统
电压
作者
Raktim Ghosh,Francesca Bovolo
摘要
Radar Sounders (RSs) are sensors operating in the nadir-looking geometry (with HF or VHF bands) by transmitting modulated electromagnetic (EM) pulses and receiving the backscattering response from different subsurface targets. Recently, convolutional neural network (CNN) architectures were established for characterizing RS signals under the semantic segmentation framework. In this paper, we design a Fast Fourier Transform (FFT) based CNN-Transformer encoder to effectively capture the long-range contexts in the radargram. In our hybrid architecture, CNN models the high-dimensional local spatial contexts, and the Transformer establishes the global spatial contexts between the local spatial ones. To overcome Transformer complex self-attention layers by reducing learnable parameters; - we replace the self-attention mechanism of the Transformer with unparameterized FFT modules as depicted in FNet architecture for Natural Language Processing (NLP). The experimental results on the MCoRDS dataset indicate the capability of the CNN-Transformer encoder along with the unparameterized FFT modules to characterize the radargram with limited accuracy cost and by reducing the time consumption. A comparative analysis is carried out with the state-of-the-art Transformer-based architecture.
科研通智能强力驱动
Strongly Powered by AbleSci AI