Keywords
Computer science; Optics (focusing); Modality (human–computer interaction); Focus; Artificial intelligence; Pattern; Matching (statistics); Natural language processing; Basis point; Mathematics; Social science; Statistics; Optics; Physics; Sociology
Authors
Chunxiao Liu, Zhendong Mao, Tianzhu Zhang, An-An Liu, Bin Wang, Yongdong Zhang
Identifier
DOI:10.1109/tmm.2020.3046855
Abstract
The key point in multimodal learning is to learn semantic alignment, i.e., to find the correspondence between sub-elements of instances from different modalities. The attention mechanism has shown its power in semantic alignment learning, as it densely associates sub-elements across modalities. However, for each sub-element, existing attention aligns it with all sub-elements from the other modality, even though most of them have no correspondence with it, i.e., they are irrelevant sub-elements. If these irrelevant sub-elements are also attended, they distract the semantic alignment. In this paper, we propose a novel focal attention mechanism to learn more accurate semantic alignment. Focal attention sparsely attends to a subset of sub-elements, identified as relevant according to their posterior probabilities given each sub-element from the other modality. Based on the observation that relevant sub-elements mostly describe the same semantics, the posterior probability can precisely distinguish relevant from irrelevant ones by taking interactions within the same modality into account, so that relevant sub-elements receive higher and closer posterior probabilities while irrelevant ones receive lower probabilities. This design learns better semantic alignment by preventing interference from irrelevant sub-elements, and it benefits subsequent multimodal tasks that demand semantic alignment. To validate the effectiveness of focal attention, we conduct extensive experiments on image-text matching and text-to-image generation, for which we propose a bidirectional and a stacked version of focal attention, respectively. Experimental results on benchmarks show that focal attention significantly and consistently outperforms state-of-the-art methods.
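The sparse-selection idea in the abstract can be sketched as follows. This is a minimal NumPy illustration, not the authors' implementation: the relevance rule used here (keeping only keys whose posterior probability exceeds the uniform baseline 1/n) is an assumed stand-in for the paper's actual criterion, and all names are hypothetical.

```python
import numpy as np

def focal_attention(queries, keys, values):
    """Sparse ("focal") attention sketch: each query attends only to the
    subset of keys judged relevant, instead of all keys as in dense attention."""
    # Cross-modal similarity between each query (modality A) and key (modality B).
    sim = queries @ keys.T                                  # (n_q, n_k)
    # Posterior probability of each key given the query (softmax over keys).
    post = np.exp(sim - sim.max(axis=1, keepdims=True))
    post /= post.sum(axis=1, keepdims=True)
    # Relevance mask: keep keys more probable than the uniform prior 1/n_k
    # (assumed selection rule for illustration only).
    mask = post > (1.0 / keys.shape[0])
    # Zero out irrelevant keys and renormalize the remaining weights.
    sparse = post * mask
    sparse /= np.maximum(sparse.sum(axis=1, keepdims=True), 1e-12)
    # Aggregate values with the sparse weights: one context vector per query.
    return sparse @ values                                  # (n_q, d)
```

Compared with dense attention, the masking step is what keeps irrelevant sub-elements from diluting the attended context.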