计算机科学
变压器
编码器
人工智能
卷积神经网络
聚类分析
高斯分布
模式识别(心理学)
医学影像学
量子力学
操作系统
物理
电压
作者
Bin Yan,Mingtao Pei,Meng Zhao,Caifeng Shan,Zhaoxing Tian
出处
期刊:IEEE Journal of Biomedical and Health Informatics
[Institute of Electrical and Electronics Engineers]
日期:2022-08-08
卷期号:26 (11): 5631-5640
被引量:13
标识
DOI:10.1109/jbhi.2022.3197162
摘要
In this paper, we propose a prior guided transformer for accurate radiology reports generation. In the encoder part, a radiograph is firstly represented by a set of patch features, which is obtained through a convolutional neural network and a traditional transformer encoder. Then an Additive Gaussian model is applied to represent the prior knowledge based on unsupervised clustering and sparse attention. In the decoder part, prior embeddings are acquired by probabilistically sampling from the radiograph prior. Then the visual features, language embeddings, and prior embeddings are fused by our proposed Prior Guided Attention to generate accurate radiology reports. Experiment results show that our method achieves better performance than state-of-the-art methods on two public radiology datasets, which proves the effectiveness of our prior guided transformer.
科研通智能强力驱动
Strongly Powered by AbleSci AI