外观
计算机科学
解析
棱锥(几何)
人工智能
卷积神经网络
分割
背景(考古学)
任务(项目管理)
特征(语言学)
语义学(计算机科学)
核(代数)
模式识别(心理学)
计算机视觉
自然语言处理
地理
数学
程序设计语言
哲学
组合数学
经济
考古
语言学
管理
几何学
作者
Wenguang Ma,Wei Ma,Shibiao Xu,Hongbin Zha
出处
期刊:IEEE Geoscience and Remote Sensing Letters
[Institute of Electrical and Electronics Engineers]
日期:2020-05-20
卷期号:18 (6): 1009-1013
被引量:21
标识
DOI:10.1109/lgrs.2020.2993451
摘要
The semantic parsing of building facade images is a fundamental yet challenging task in urban scene understanding. Existing works sought to tackle this task by using facade grammars or convolutional neural networks (CNNs). The former can hardly generate parsing results coherent with real images while the latter often fails to capture relationships among facade elements. In this letter, we propose a pyramid atrous large kernel (ALK) network (ALKNet) for the semantic segmentation of facade images. The pyramid ALKNet captures long-range dependencies among building elements by using ALK modules in multiscale feature maps. It makes full use of the regular structures of facades to aggregate useful nonlocal context information and thereby is capable of dealing with challenging image regions caused by occlusions, ambiguities, and so on. Experiments on both rectified and unrectified facade data sets show that ALKNet has better performances than those of state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI