高光谱成像
计算机科学
人工智能
卷积(计算机科学)
模式识别(心理学)
上下文图像分类
计算机视觉
遥感
图像(数学)
地质学
人工神经网络
作者
Yufan Wang,Xiaodong Yu,Xiaoyan Wen,Xiaohui Li,Hongbin Dong,Shuying Zang
出处
期刊:IEEE Geoscience and Remote Sensing Letters
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:: 1-1
被引量:4
标识
DOI:10.1109/lgrs.2024.3365615
摘要
Hyperspectral image classification is an important but challenging task. Conventional convolutional neural networks (CNNs) are able to extract local spectral spatial features but ignore long-range dependencies and global features. To address this problem, we propose a new model combining 3D-CNN and Convolutional Vision Transformer, aiming to improve the performance of the image recognition task by utilizing the advantages of CNN in local feature extraction while retaining the advantages of Transformer in long-range dependency processing. Our model is tested on three publicly available hyperspectral image datasets, and the results show that our model outperforms other state-of-the-art models in terms of classification accuracy and robustness. The source code for our work is available at [https://github.com/Dreamvai/ViT-Convolution]. The model proposed in this letter provides a new idea for hyperspectral image classification and expands a new field for the application of convolutional transformers. In the future, we intend to further explore the performance of the convolutional transformer and the possibility of combining it with other types of data.
科研通智能强力驱动
Strongly Powered by AbleSci AI