计算机科学
分割
背景(考古学)
联营
块(置换群论)
骨干网
人工智能
特征(语言学)
棱锥(几何)
推论
编码器
增采样
深度学习
网络体系结构
卷积神经网络
模式识别(心理学)
计算机网络
图像(数学)
哲学
物理
古生物学
光学
操作系统
生物
语言学
数学
几何学
作者
Saquib Mazhar,Nadeem Atif,M. K. Bhuyan,Shaik Rafi Ahamed
标识
DOI:10.1016/j.engappai.2023.107086
摘要
Deep-learning-based semantic segmentation networks typically incorporate object classification networks in their backbone. This leads to a loss of context because classification networks have a smaller field of view. The architecture has been extended to recover context with additional downsampling feature maps, a parallel context branch, or pyramid pooling modules after the backbone. However, these extensions increase multiply–accumulate operations and memory requirements, thus, making them unsuitable for resource-constrained devices. To overcome this limitation, a novel convolutional building block with attention-based context guidance is proposed. The block is repeated to build an efficient encoder–decoder network. Our network runs in real-time, has a lightweight design with only 0.72 Million parameters, and achieves 70.1%, and 66.3% mean intersection-over-union scores on the highly competitive Cityscapes and CamVid datasets, respectively. An efficient decoder is also designed to replace other semantic segmentation network decoders with minimal performance loss. The performance measures on mobile platforms show that our network suits resource-constrained devices. Further, experimental results show that the proposed method can optimally balance the model size-inference speed and segmentation accuracy.
科研通智能强力驱动
Strongly Powered by AbleSci AI