计算机科学
领域(数学分析)
过程(计算)
质量(理念)
自然语言处理
人工智能
语言模型
机器学习
程序设计语言
数学
认识论
数学分析
哲学
作者
He Zhu,Ren Togo,Takahiro Ogawa,Miki Haseyama
标识
DOI:10.1109/icce-taiwan58799.2023.10227045
摘要
This paper proposes a medical visual question generation model for generating higher-quality questions from medical images. The visual question generation model can guide the diagnostic process and improve the utilization of medical resources by reducing the dependence on physician involvement. Our model uses cross-attention and the large language model to preserve inherent information and addresses the issue of inferior generation performance in the medical domain due to a lack of data. We also control the category of generated questions by setting guidance sentences that include interrogative words. The experimental results demonstrate that our method generates higher-quality questions than previous approaches.
科研通智能强力驱动
Strongly Powered by AbleSci AI