An efficient but effective writer: Diffusion-based semi-autoregressive transformer for automated radiology report generation

计算机科学 人工智能 推论 自然语言处理 变压器 连贯性(哲学赌博策略) 词(群论) 语音识别 语言学 量子力学 物理 哲学 电压
作者
Yuhao Tang,Dacheng Wang,Liyan Zhang,Yuan Yuan
出处
期刊:Biomedical Signal Processing and Control [Elsevier]
卷期号:88: 105651-105651
标识
DOI:10.1016/j.bspc.2023.105651
摘要

It is firmly believed that manually diagnosing radiology images is clinically critical but labour-intensive and error-prone. Therefore, an automatic radiology report generation method is highly desired for alleviating the burden imposed on doctors. However, a typical report contains numerous template descriptions and only a few abnormal sentences. This unbalanced distribution makes the generation of template sentences more likely. Additionally, describing an entire report in a word-by-word manner can lead to significant latency during the inference step. Besides, the existing datasets are limited to conventional pneumonia, making them incomplete and one-sided. This work is concerned with forming a better trade-off between generation performance. One key design is an abnormal semantic diffusion module, which progressively absorbs the semantics of abnormal medical terminology and strengthens the linguistic coherence between local tokens. In detail, the generated report is refined by enhancing the incorporation of informative words with limited occurrence frequencies, which alleviates the monotony of template-based generation. Another design is a length-controllable self-attention decoder, which regulates the input length of the sentences used for target word generation. This framework preserves the autoregressive nature of word generation while also maintaining a controllable range, ensuring the efficiency of report generation. Moreover, a novel XRG-COVID-19 clinical dataset is tailored, which includes X-ray scans and professional diagnostic reports of 8676 patients. The experimental results demonstrate the proposed model achieves a better trade-off between performance and speed than those of carefully designed baselines on both the IU X-ray dataset and the proposed XRG-COVID-19 dataset.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
英吉利25发布了新的文献求助10
1秒前
ckeong89应助杨谊采纳,获得10
1秒前
munire发布了新的文献求助50
1秒前
jason完成签到 ,获得积分10
2秒前
3秒前
曲幻梅发布了新的文献求助10
4秒前
YY发布了新的文献求助10
4秒前
5秒前
5秒前
712关闭了712文献求助
6秒前
6秒前
年轻凌瑶完成签到,获得积分10
6秒前
yc完成签到,获得积分10
7秒前
8秒前
独特小凡关注了科研通微信公众号
8秒前
AAAfuwj发布了新的文献求助10
8秒前
舒适的藏花完成签到 ,获得积分10
9秒前
9秒前
林夕发布了新的文献求助10
9秒前
脑洞疼应助欢喜冷S亦A采纳,获得10
9秒前
KP完成签到,获得积分10
10秒前
Orange应助路过采纳,获得10
11秒前
11秒前
12秒前
12秒前
北辰关注了科研通微信公众号
13秒前
奥沙利楠完成签到,获得积分10
13秒前
建建发布了新的文献求助10
14秒前
15秒前
张柔完成签到,获得积分10
15秒前
16秒前
王志鹏完成签到 ,获得积分10
16秒前
17秒前
饱满完成签到,获得积分10
18秒前
18秒前
19秒前
20秒前
Diliam应助小米粥24采纳,获得30
20秒前
20秒前
852应助munire采纳,获得10
20秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Handbook of pharmaceutical excipients, Ninth edition 5000
Aerospace Standards Index - 2026 ASIN2026 2000
Digital Twins of Advanced Materials Processing 2000
晋绥日报合订本24册(影印本1986年)【1940年9月–1949年5月】 1000
Social Cognition: Understanding People and Events 1000
Polymorphism and polytypism in crystals 1000
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 纳米技术 有机化学 物理 生物化学 化学工程 计算机科学 复合材料 内科学 催化作用 光电子学 物理化学 电极 冶金 遗传学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 6032705
求助须知:如何正确求助?哪些是违规求助? 7722753
关于积分的说明 16201263
捐赠科研通 5179362
什么是DOI,文献DOI怎么找? 2771782
邀请新用户注册赠送积分活动 1755051
关于科研通互助平台的介绍 1640057