快速傅里叶变换
计算机科学
高光谱成像
傅里叶变换
滤波器(信号处理)
人工智能
离散傅里叶变换(通用)
频域
卷积(计算机科学)
模式识别(心理学)
算法
块(置换群论)
计算机视觉
人工神经网络
傅里叶分析
数学
短时傅里叶变换
数学分析
几何学
作者
Hao Shi,Guo Cao,Youqiang Zhang,Zixian Ge,Yanbo Liu,Di Yang
出处
期刊:IEEE Transactions on Instrumentation and Measurement
[Institute of Electrical and Electronics Engineers]
日期:2023-01-01
卷期号:72: 1-18
被引量:2
标识
DOI:10.1109/tim.2023.3277100
摘要
In the hyperspectral image (HSI) classification, there are numerous deep learning-based research routes that have emerged recently. Among them, two methodologies attract our attention. One is CNN-based classification and the other is transformer-based classification. The essence of these two methodologies is to interchange information locally or at a long distance for HSI pixels in the spatial or spectral-spatial domain. There are two principles underlying this essence—the information mixing mechanism and the information mixing domain. Although both CNN-based and transformer-based have made efforts in these two principles and obtained favorable classification results, there is still room for improvement in terms of accuracy and efficiency. To further enhance the accuracy and efficiency under the two principles, fast Fourier transform (FFT) is introduced to HSI classification and a fast Fourier filter is designed to mix information efficiently in the frequency domain by means of FFT. The parametric-free characteristic and fast computation of FFT can assist us in efficiently learning interactions among features in the frequency domain. Furthermore, a fast Fourier filter block is built upon the fast Fourier filter for repeatedly using as a basic block. In addition, we propose a spectral-spatial convolution tokenizer (SSCT) to extract shallow features and prepare spectral-spatial tokens for fast Fourier filter blocks. Finally, by employing SSCT and fast Fourier filter blocks, a novel deep neural network architecture—fast Fourier filter network (F 3 Net) is proposed for HSI classification. F 3 Net-P as a pyramidal variant of F 3 Net is also investigated. Experimental results on four datasets comprehensively evaluate our models and indicate that they are competitive with several current state-of-the-art methods, especially when the training samples are extremely limited. Specifically, F 3 Net-P achieves the highest accuracy of 97.25%, 98.08%, 97.49% and 97.95% on the four datasets, respectively, outperforming second best compared model by 1.49%, 2.03%, 2.14% and 1.94%.
科研通智能强力驱动
Strongly Powered by AbleSci AI