Computer science
Semantics (computer science)
Artificial intelligence
Generalization
Question answering
Machine learning
Modality
Chemistry
Mathematics
Polymer chemistry
Mathematical analysis
Programming language
Authors
Yong Li, Qihao Yang, Fu Lee Wang, Lap-Kei Lee, Yingying Qu, Tianyong Hao
Identifier
DOI:10.1016/j.artmed.2023.102667
Abstract
Insufficient training data is a common barrier to effectively learning multimodal information interactions and question semantics in existing medical Visual Question Answering (VQA) models. This paper proposes a new Asymmetric Cross Modal Attention network, called ACMA, which constructs an image-guided attention and a question-guided attention to improve multimodal interactions learned from insufficient data. In addition, a newly designed Semantic Understanding Auxiliary (SUA) in the question-guided attention learns rich semantic embeddings that improve question understanding by integrating word-level and sentence-level information. Moreover, we propose a new data augmentation method called Multimodal Augmented Mixup (MAM) to train the ACMA, denoted as ACMA-MAM. MAM combines various data augmentations with a vanilla mixup strategy to generate more non-repetitive data, which avoids time-consuming manual data annotation and improves model generalization. ACMA-MAM outperforms state-of-the-art models on three publicly accessible medical VQA datasets (VQA-Rad, VQA-Slake, and PathVQA) with accuracies of 76.14%, 83.13%, and 53.83%, improvements of 2.00%, 1.32%, and 1.59% respectively. It also achieves F1 scores of 78.33%, 82.83%, and 51.86%, surpassing state-of-the-art models by 2.80%, 1.15%, and 1.37% respectively.
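The MAM method described in the abstract builds on the vanilla mixup strategy, which forms convex combinations of pairs of training samples and their labels. The sketch below shows how such a mixup could be applied to paired image and question features; it is a minimal illustration under assumed tensor shapes, function names, and a Beta(alpha, alpha) coefficient, not the authors' implementation.

# Minimal sketch of vanilla mixup applied to paired multimodal inputs.
# Shapes, names, and alpha are illustrative assumptions, not the paper's code.
import torch


def multimodal_mixup(img_feats, ques_feats, labels, alpha=0.4):
    """Convex-combine each sample with a randomly permuted partner.

    img_feats:  (B, D_img) image feature batch
    ques_feats: (B, D_txt) question feature batch
    labels:     (B, C)     one-hot answer labels
    """
    # Single mixing coefficient drawn from Beta(alpha, alpha)
    lam = torch.distributions.Beta(alpha, alpha).sample().item()
    perm = torch.randperm(img_feats.size(0))

    # The same coefficient is used for both modalities and the labels,
    # so each mixed sample stays internally consistent.
    mixed_img = lam * img_feats + (1.0 - lam) * img_feats[perm]
    mixed_ques = lam * ques_feats + (1.0 - lam) * ques_feats[perm]
    mixed_labels = lam * labels + (1.0 - lam) * labels[perm]
    return mixed_img, mixed_ques, mixed_labels


if __name__ == "__main__":
    B, D_img, D_txt, C = 8, 512, 300, 10
    img = torch.randn(B, D_img)
    ques = torch.randn(B, D_txt)
    y = torch.eye(C)[torch.randint(0, C, (B,))]
    mi, mq, my = multimodal_mixup(img, ques, y)
    print(mi.shape, mq.shape, my.shape)

Because the mixed labels are soft targets, training would use them with a soft cross-entropy (or KL) loss rather than hard class indices.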