计算机科学
模态(人机交互)
水准点(测量)
匹配(统计)
保险丝(电气)
人工智能
传感器融合
情态动词
对偶(语法数字)
维数(图论)
计算
模式
算法
工程类
数学
艺术
社会科学
统计
化学
文学类
大地测量学
社会学
高分子化学
纯数学
电气工程
地理
作者
Weikuo Guo,Xiangwei Kong
标识
DOI:10.1109/icassp49357.2023.10096438
摘要
Cross-modal matching is one of the most fundamental and widely studied tasks in the field of data science. To have a better understanding of the complicated cross-modal correspondences, the powerful attention mechanism has been widely used recently. In this paper, we propose a novel Dual Gated Attention Fusion (DGAF) unit to save cross-modal matching from heavy attention computation. Specifically, the attention unit in the main information flow is alternated to a single-head low-dimension light-weighted attention bypass which serves as a gate to selectively cast away noise in both modality. To strengthen the interaction between modalities, an auxiliary memory unit is appended. A gated memory fusion unit is designed to fuse the memorized inter-modality information into both modality streams. Extensive experiments on two benchmark datasets show that the proposed DGAF achieves good balance between the efficiency and the effectiveness.
科研通智能强力驱动
Strongly Powered by AbleSci AI