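Computer science
Segmentation
Artificial intelligence
Feature (linguistics)
Image segmentation
Computational complexity theory
Pattern recognition (psychology)
Convolutional neural network
Computer vision
Algorithm
Philosophy
Linguistics
Authors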
Yanyan Liu,Xiaotian Bai,Jiafei Wang,Guoning Li,Jin Li,Zengming Lv
Identifier
DOI:10.1016/j.engappai.2023.107260
Abstract
Image semantic segmentation distinguishes the different kinds of objects in an image by assigning each pixel a category label based on its semantics. The widely used DeepLabv3+ semantic segmentation method has high computational complexity and large memory consumption, which makes it difficult to deploy on embedded platforms with limited computational power. When extracting image features, DeepLabv3+ also struggles to make full use of multiscale information, which can lose fine detail and reduce segmentation accuracy. An improved image semantic segmentation method based on the DeepLabv3+ network is proposed, with the lightweight MobileNetv2 serving as the model's backbone. The ECA-Net channel attention mechanism is applied to low-level features, reducing computational complexity and improving target boundary clarity. A polarized self-attention mechanism is introduced after the ASPP module to improve the spatial feature representation of the feature map. Validated on the VOC2012 dataset, the improved model achieved an mIoU of 69.29% and a mAP of 80.41%. These results indicate that it predicts finer semantic segmentation results and improves the trade-off between model complexity and segmentation accuracy.
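The abstract reports segmentation quality as mIoU (mean intersection over union). As a minimal sketch of how that metric is commonly computed, and not the authors' evaluation code, the example below builds a confusion matrix from flattened label lists and averages the per-class IoU. The function names, the `ignore_index` value of 255 (the void label conventionally used in VOC annotations), and the choice to skip classes absent from both prediction and ground truth are assumptions made for illustration.

```python
def confusion_matrix(y_true, y_pred, num_classes, ignore_index=255):
    """Count (true class, predicted class) pairs over flattened pixel labels.

    ignore_index is an assumed void label; pixels carrying it are skipped.
    """
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        if t == ignore_index:
            continue
        cm[t][p] += 1
    return cm


def mean_iou(cm):
    """Average IoU = TP / (TP + FP + FN) over classes that occur at all."""
    ious = []
    for c in range(len(cm)):
        tp = cm[c][c]
        fn = sum(cm[c]) - tp                   # true class c, predicted otherwise
        fp = sum(row[c] for row in cm) - tp    # predicted c, true class otherwise
        denom = tp + fp + fn
        if denom == 0:
            continue  # class absent from both labels and predictions
        ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example with two classes and four pixels.
y_true = [0, 0, 1, 1]
y_pred = [0, 1, 1, 1]
cm = confusion_matrix(y_true, y_pred, num_classes=2)
score = mean_iou(cm)  # class 0 IoU = 1/2, class 1 IoU = 2/3
```

In practice the confusion matrix is accumulated over every image in the validation set before the per-class IoU values are averaged, so that small images do not dominate the score.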