高光谱成像
计算机科学
卷积(计算机科学)
人工智能
遥感
模式识别(心理学)
上下文图像分类
图像(数学)
人工神经网络
地质学
作者
Qixing Yu,Weibo Wei,Dantong Li,Zhenkuan Pan,Chenyu Li,Danfeng Hong
出处
期刊:IEEE Transactions on Geoscience and Remote Sensing
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:62: 1-18
被引量:3
标识
DOI:10.1109/tgrs.2024.3362471
摘要
In hyperspectral images (HSIs), both local and non-local features play crucial roles in classification tasks. Vision Transformer (VIT) can extract non-local features through attention mechanisms, while Convolutional Neural Networks (CNN) excel at handling local components. However, in traditional dual-branch models based on VIT and CNN, there is a lack of interaction during feature processing, leading to potential compatibility issues when merging the two types of features. In this article, we propose HyperSINet, a Synergetic Interaction Network that combines VIT and CNN to establish interaction between the two branches, enabling mutual compensation between local and non-local features during the training process and ultimately enhancing the performance of classification tasks. Specifically, we devise a pair of interactors, namely Conv2Trans and Trans2Conv, which serve as intermediaries between the two branches, enabling the VIT branch to refine its local details, while allowing the CNN branch to process larger receptive field non-local features. Typical feature Maps are implemented to visualize the function of the interactors. Furthermore, within the VIT branch, a VIT Encoder with the local mask is developed to strike a balance between emphasizing non-local features and preserving local details, while a lightweight CNN block is designed to process spectral and spatial features in the CNN branch. Extensive experiments conducted on four real-world datasets demonstrate that, under a reasonable count of parameters, HyperSINet surpasses several current state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI