人工智能
计算机科学
特征提取
特征(语言学)
判别式
粒度
计算机视觉
频道(广播)
端到端原则
块(置换群论)
模式识别(心理学)
骨干网
数学
操作系统
哲学
语言学
计算机网络
几何学
作者
Jiaqi Yin,Kun Dai,Lan Cheng,Xin Xu,Zhe Zhang
标识
DOI:10.1109/ccdc58219.2023.10326912
摘要
Loop closure detection(LCD) is a key component in VSLAM systems to eliminate cumulative errors. We propose an end-to-end image feature extraction-aggregate LCD network, Res2Net-SE-NetVLAD, to extract discriminative multi-scale fusion features for VSLAM. In Res2Net-SE-NetVLAD, the deep learning network Res2Net is chosen as the backbone, and the channel attention mechanism SE-block is implemented to obtain multiple perceptual fields with different granularity. Based on this, the channel optimization module is used to quantify the feature maps from the channel level, and the NetVLAD layer is further fused to implement the scale feature extraction network Res2Net-SE-NetVLAD, for which end-to-end training can be performed to achieve LCD. The experimental results show that the proposed model outperforms other deep learning-based LCD methods in scenes with loop closure attributes.
科研通智能强力驱动
Strongly Powered by AbleSci AI