计算机科学
可靠性(半导体)
置信区间
人工智能
隐藏字幕
深度学习
医学物理学
机器学习
图像(数学)
放射科
数据挖掘
医学
统计
数学
功率(物理)
物理
量子力学
作者
Yixin Wang,Zihao Lin,Zhe Xu,Haoyu Dong,Jie Luo,Jiang Tian,Zhongchao Shi,Lifu Huang,Y.S. Zhang,Jianping Fan,Zhiqiang He
出处
期刊:Neurocomputing
[Elsevier]
日期:2024-02-08
卷期号:578: 127374-127374
被引量:2
标识
DOI:10.1016/j.neucom.2024.127374
摘要
Medical imaging plays a pivotal role in diagnosis and treatment in clinical practice. Inspired by the significant progress in automatic image captioning, various deep learning (DL)-based methods have been proposed to generate radiology reports for medical images. Despite promising results, previous works overlook the uncertainties of their models and are thus unable to provide clinicians with the reliability/confidence of the generated radiology reports to assist their decision-making. In this paper, we propose a novel method to explicitly quantify both the visual uncertainty and the textual uncertainty for DL-based radiology report generation. Such multi-modal uncertainties can sufficiently capture the model's confidence degree at both the report level and sentence level, which can be further leveraged to weight the losses for more comprehensive model optimization. Experimental results have demonstrated that the proposed method for model uncertainty characterization and estimation can produce more reliable confidence scores for radiology report generation, and the modified loss function, which takes into account the uncertainties, leads to better model performance on two public radiology report datasets based on both standard evaluation metrics and experienced radiologists.
科研通智能强力驱动
Strongly Powered by AbleSci AI