计算机科学
分割
人工智能
特征提取
卷积(计算机科学)
编解码器
模式识别(心理学)
核(代数)
图像分割
编码器
特征(语言学)
计算机视觉
人工神经网络
数学
组合数学
操作系统
哲学
语言学
计算机硬件
作者
Ke Yang,Jizheng Yi,Aibin Chen,Jiaqi Liu,Wenjie Chen
出处
期刊:IEEE Geoscience and Remote Sensing Letters
[Institute of Electrical and Electronics Engineers]
日期:2021-07-09
卷期号:19: 1-5
被引量:32
标识
DOI:10.1109/lgrs.2021.3093101
摘要
Extracting roads from aerial images is an issue that has attracted much attention. Using semantic segmentation methods to extract roads often faces the problem of narrow and occluded roads. In this letter, we propose a network called ConDinet++, which improves the general codec architecture. In the encoder part, the VGG16 with pretraining parameters is utilized for the feature extraction. In the decoder part, we perform a feature fusion mechanism on the full-scale feature map. In order to improve the ability of the network to extract and integrate semantic information and further increase the receptive field, we recommend adopting the conditional dilated convolution blocks (CDBs) in the encoder, and each CDB consists of a group of cascaded conditional dilated convolutions. More importantly, the designed codec architecture can adjust the number of convolutions and the parameters of the convolution kernel according to the input data. For a slender area like a road, which occupies a small area in the picture, we use the joint loss function and introduce the joint loss of Lovasz loss and cross-entropy loss to avoid the segmentation model having a serious bias caused by highly unbalanced object sizes between roads and background. The proposed method was tested on two public datasets Massachusetts Roads Dataset and Mini DeepGlobe Road Extraction Challenge. Compared with some previous semantic segmentation networks, the proposed ConDinet++ achieved the best values of recall, F-score, and mIoU.
科研通智能强力驱动
Strongly Powered by AbleSci AI