变压器
计算机科学
嵌入
人工智能
模式识别(心理学)
注意力网络
图形
特征(语言学)
人工神经网络
特征匹配
混合神经网络
特征提取
理论计算机科学
工程类
电压
语言学
哲学
电气工程
作者
Youcheng Cai,Lin Li,Dong Wang,Xinjie Li,Xiaoping Liu
标识
DOI:10.1016/j.sigpro.2022.108859
摘要
Local feature matching plays a vital role in various computer vision tasks. In this work, we present a novel network that combines feature matching and outlier rejection for finding reliable correspondences between image pairs. The proposed method is a hybrid transformer-based graph neural network (GNN), termed HTMatch, which aims to achieve high accuracy and efficient feature matching. Specifically, we first propose a hybrid transformer that integrates self- and cross-attention together to condition the feature descriptors between image pairs. By doing so, the intra/inter-graph attentional aggregation can be realized by a single transformer layer, which achieves more efficient message passing. Then, we introduce a new spatial embedding module to enhance the spatial constraints across images. The spatial information from one image is embedded into another, which can significantly improve matching performance. Finally, we adopt a seeded GNN architecture for establishing a sparse graph, which improves both efficiency and effectiveness. Experiments show that HTMatch reaches state-of-the-art results on several public benchmarks.
科研通智能强力驱动
Strongly Powered by AbleSci AI