计算机科学
人工智能
模式识别(心理学)
深度学习
水准点(测量)
棱锥(几何)
特征(语言学)
人工神经网络
特征提取
卷积神经网络
网络体系结构
数学
地理
地图学
几何学
哲学
语言学
计算机安全
作者
Jun Yu,Chuhong Zhu,Jian Zhang,Qingming Huang,Dacheng Tao
出处
期刊:IEEE transactions on neural networks and learning systems
[Institute of Electrical and Electronics Engineers]
日期:2020-02-01
卷期号:31 (2): 661-674
被引量:203
标识
DOI:10.1109/tnnls.2019.2908982
摘要
We propose an end-to-end place recognition model based on a novel deep neural network. First, we propose to exploit the spatial pyramid structure of the images to enhance the vector of locally aggregated descriptors (VLAD) such that the enhanced VLAD features can reflect the structural information of the images. To encode this feature extraction into the deep learning method, we build a spatial pyramid-enhanced VLAD (SPE-VLAD) layer. Next, we impose weight constraints on the terms of the traditional triplet loss (T-loss) function such that the weighted T-loss (WT-loss) function avoids the suboptimal convergence of the learning process. The loss function can work well under weakly supervised scenarios in that it determines the semantically positive and negative samples of each query through not only the GPS tags but also the Euclidean distance between the image representations. The SPE-VLAD layer and the WT-loss layer are integrated with the VGG-16 network or ResNet-18 network to form a novel end-to-end deep neural network that can be easily trained via the standard backpropagation method. We conduct experiments on three benchmark data sets, and the results demonstrate that the proposed model defeats the state-of-the-art deep learning approaches applied to place recognition.
科研通智能强力驱动
Strongly Powered by AbleSci AI