变压器
计算机科学
分割
试验装置
人工智能
图像分割
模式识别(心理学)
数据挖掘
工程类
电压
电气工程
作者
Di Wang,Ronghao Yang,Zhenxin Zhang,Hanhu Liu,Junxiang Tan,Shaoda Li,Xiaoxia Yang,Xiao Wang,Kangqi Tang,Yichun Qiao,Po-Chyi Su
标识
DOI:10.1016/j.cageo.2023.105340
摘要
With the recent development of remote sensing technology and deep learning, semantic segmentation methods have been increasingly used in land cover classification. However, this method is faced with the challenge of incomplete recognition caused by big differences in scale of ground objects. Owing to multi-head self-attention, the Swin Transformer Network (Swin) has a large receptive field at its shallow level, which is conducive to the identification of large-scale objects. However, Swin does not fully mine the context information of features, so it is easy to cause incomplete recognition. Based on Swin, we propose a parallel window-based Transformer Network, Parallel Swin Transformer Network (P-Swin). The core of P-Swin is a Parallel Swin Transformer Block (PST Block), which includes Window-based Self Attention Interaction (WSAI) and Feed Forward Network (FFN). WSAI can not only calculate the relationship within windows, but also establish the relationship between windows. Therefore, it improves the ability of network to obtain feature context information. P-Swin outperformed Swin and reached the highest level, with 76.42% mIoU for the test set in the ISPRS Potsdam 2D dataset (Swin: 75.95%), 65.13% mIoU for the test set in the Gaofen Image Dataset (Swin: 63.41%), and 64.61% mIoU for the test set in the WHDLD Dataset (Swin: 63.01%)
科研通智能强力驱动
Strongly Powered by AbleSci AI