F3Net: Fast Fourier Filter Network for Hyperspectral Image Classification

快速傅里叶变换计算机科学高光谱成像傅里叶变换滤波器（信号处理）人工智能离散傅里叶变换（通用）频域卷积（计算机科学）模式识别（心理学）算法块（置换群论）计算机视觉人工神经网络傅里叶分析数学短时傅里叶变换数学分析几何学

作者

Hao Shi,Guo Cao,Youqiang Zhang,Zixian Ge,Yanbo Liu,Di Yang

出处

期刊：IEEE Transactions on Instrumentation and Measurement [Institute of Electrical and Electronics Engineers]
日期：2023-01-01 卷期号：72: 1-18 被引量：2

标识

DOI：10.1109/tim.2023.3277100

摘要

In the hyperspectral image (HSI) classification, there are numerous deep learning-based research routes that have emerged recently. Among them, two methodologies attract our attention. One is CNN-based classification and the other is transformer-based classification. The essence of these two methodologies is to interchange information locally or at a long distance for HSI pixels in the spatial or spectral-spatial domain. There are two principles underlying this essence—the information mixing mechanism and the information mixing domain. Although both CNN-based and transformer-based have made efforts in these two principles and obtained favorable classification results, there is still room for improvement in terms of accuracy and efficiency. To further enhance the accuracy and efficiency under the two principles, fast Fourier transform (FFT) is introduced to HSI classification and a fast Fourier filter is designed to mix information efficiently in the frequency domain by means of FFT. The parametric-free characteristic and fast computation of FFT can assist us in efficiently learning interactions among features in the frequency domain. Furthermore, a fast Fourier filter block is built upon the fast Fourier filter for repeatedly using as a basic block. In addition, we propose a spectral-spatial convolution tokenizer (SSCT) to extract shallow features and prepare spectral-spatial tokens for fast Fourier filter blocks. Finally, by employing SSCT and fast Fourier filter blocks, a novel deep neural network architecture—fast Fourier filter network (F ³ Net) is proposed for HSI classification. F ³ Net-P as a pyramidal variant of F ³ Net is also investigated. Experimental results on four datasets comprehensively evaluate our models and indicate that they are competitive with several current state-of-the-art methods, especially when the training samples are extremely limited. Specifically, F ³ Net-P achieves the highest accuracy of 97.25%, 98.08%, 97.49% and 97.95% on the four datasets, respectively, outperforming second best compared model by 1.49%, 2.03%, 2.14% and 1.94%.

求助该文献

最长约 10秒，即可获得该文献文件

F3Net: Fast Fourier Filter Network for Hyperspectral Image Classification

今日热心研友