Computer science
Modal verb
Artificial intelligence
Modality (human-computer interaction)
Embedding
Encoder
Question answering
Feature (linguistics)
Information retrieval
Natural language processing
Pattern recognition (psychology)
Linguistics
Operating system
Philosophy
Chemistry
Polymer chemistry
Authors
Zhi Chen, Beiji Zou, Yulan Dai, Chengzhang Zhu, Guilan Kong, Wensheng Zhang
Identifier
DOI: 10.1016/j.bspc.2023.105049
Abstract
The purpose of medical visual question answering (Med-VQA) is to provide accurate answers to clinical questions related to the visual content of medical images. However, previous attempts have neglected to take full advantage of the information interaction between medical images and clinical questions, which hinders further progress in Med-VQA. Addressing this issue requires focusing on critical information interaction within each modality and on relevant information interaction between modalities. In this paper, we use the multiple meta-model quantifying model as the visual encoder and GloVe word embeddings followed by an LSTM as the textual encoder to form our feature extraction module. We then design a symmetric interaction attention module to construct dense and deep intra- and inter-modal information interaction between medical images and clinical questions for the Med-VQA task. Specifically, the symmetric interaction attention module consists of multiple symmetric interaction attention blocks that contain two basic units, i.e., self-attention and interaction attention. Technically, self-attention is introduced for intra-modal information interaction, while interaction attention is constructed for inter-modal information interaction. In addition, we develop a multi-modal fusion scheme that leverages cross-modal gating to effectively fuse multi-modal information and avoid redundant information after sufficient intra- and inter-modal information interaction. Experimental results on the VQA-RAD and PathVQA datasets show that our method outperforms other state-of-the-art Med-VQA models, achieving accuracies of 74.7% and 48.7% and F1-scores of 73.5% and 46.0%, respectively.
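To make the architecture described in the abstract more concrete, the following is a minimal PyTorch sketch of a symmetric interaction attention block (self-attention within each modality plus cross-modal "interaction" attention) followed by a cross-modal gated fusion of the two pooled features. All module names, dimensions, the mean-pooling step, and the answer-set size are illustrative assumptions for this sketch, not the authors' released implementation.

# Minimal sketch, assuming standard multi-head attention; not the paper's exact code.
import torch
import torch.nn as nn


class SymmetricInteractionBlock(nn.Module):
    """One block: intra-modal self-attention, then inter-modal interaction attention."""

    def __init__(self, dim=512, heads=8):
        super().__init__()
        self.self_attn_v = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.self_attn_q = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_attn_v = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_attn_q = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm_v = nn.LayerNorm(dim)
        self.norm_q = nn.LayerNorm(dim)

    def forward(self, v, q):
        # Intra-modal information interaction (self-attention) with residual connections.
        v = v + self.self_attn_v(v, v, v)[0]
        q = q + self.self_attn_q(q, q, q)[0]
        # Inter-modal information interaction: each modality attends to the other.
        v2 = v + self.cross_attn_v(v, q, q)[0]
        q2 = q + self.cross_attn_q(q, v, v)[0]
        return self.norm_v(v2), self.norm_q(q2)


class GatedFusion(nn.Module):
    """Cross-modal gating: each pooled feature is re-weighted by a sigmoid gate
    computed from the other modality before the two are summed (one plausible
    reading of the gating scheme)."""

    def __init__(self, dim=512):
        super().__init__()
        self.gate_v = nn.Sequential(nn.Linear(dim, dim), nn.Sigmoid())
        self.gate_q = nn.Sequential(nn.Linear(dim, dim), nn.Sigmoid())

    def forward(self, v, q):
        v_pooled = v.mean(dim=1)   # (B, D) pooled visual feature
        q_pooled = q.mean(dim=1)   # (B, D) pooled textual feature
        return self.gate_q(q_pooled) * v_pooled + self.gate_v(v_pooled) * q_pooled


if __name__ == "__main__":
    B, Nv, Nq, D = 2, 49, 12, 512            # batch, visual tokens, word tokens, feature dim
    v = torch.randn(B, Nv, D)                 # e.g. visual-encoder region features
    q = torch.randn(B, Nq, D)                 # e.g. LSTM hidden states over GloVe embeddings
    blocks = nn.ModuleList([SymmetricInteractionBlock(D) for _ in range(3)])
    for blk in blocks:                        # stack of symmetric interaction attention blocks
        v, q = blk(v, q)
    fused = GatedFusion(D)(v, q)              # joint representation after gated fusion
    logits = nn.Linear(D, 458)(fused)         # 458 = hypothetical candidate answer set size
    print(logits.shape)                       # torch.Size([2, 458])

The stacking of several blocks corresponds to the "dense and deep" intra- and inter-modal interaction mentioned in the abstract; the final linear layer stands in for whatever answer classifier the full model uses.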