答疑
计算机科学
主流
任务(项目管理)
人工智能
工程类
哲学
神学
系统工程
作者
Qi Wu,Peng Wang,Xin Wang,Xiaodong He,Wenwu Zhu
出处
期刊:Advances in computer vision and pattern recognition
日期:2022-01-01
卷期号:: 165-176
被引量:3
标识
DOI:10.1007/978-981-19-0964-1_11
摘要
Inspired by the rise of VQA research in general domain, the task of Medical VQA has received great attention from computer vision, natural language processing and biomedical research communities in recent years. Given a medical image and clinically related question about the visual elements in the medical image, a Medical VQA system is required to deeply comprehend both the medical image and the asked question to predict the correct answer. In this chapter, we first introduce mainstream datasets used for Medical VQA tasks, such as VQA-RAD, VQA-Med, PathVQA and SLAKE datasets. Then, we elaborate the prevalent methods for Medical VQA tasks in detail. These methods can be classified into three categories based on their main characteristics: classical VQA methods, meta-learning methods and BERT-based methods for Medical VQA.
科研通智能强力驱动
Strongly Powered by AbleSci AI