计算机科学
人工智能
注释
自编码
医学诊断
图像自动标注
特征(语言学)
模式识别(心理学)
对象(语法)
隐藏字幕
图像(数学)
图像检索
组分(热力学)
深度学习
机器学习
医学
物理
哲学
病理
热力学
语言学
作者
Yidong Chai,Hongyan Liu,Jie Xu,Sagar Samtani,Yuanchun Jiang,Haoxin Liu
出处
期刊:ACM transactions on management information systems
[Association for Computing Machinery]
日期:2023-01-25
卷期号:14 (2): 1-21
被引量:9
摘要
Medical image annotation aims to automatically describe the content of medical images. It helps doctors to understand the content of medical images and make better informed decisions like diagnoses. Existing methods mainly follow the approach for natural images and fail to emphasize the object abnormalities, which is the essence of medical images annotation. In light of this, we propose to transform the medical image annotation to a multi-label classification problem, where object abnormalities are focused directly. However, extant multi-label classification studies rely on arduous feature engineering, or do not solve label correlation issues well in medical images. To solve these problems, we propose a novel deep learning model where a frequent pattern mining component and an adversarial-based denoising autoencoder component are introduced. Extensive experiments are conducted on a real retinal image dataset to evaluate the performance of the proposed model. Results indicate that the proposed model significantly outperforms image captioning baselines and multi-label classification baselines.
科研通智能强力驱动
Strongly Powered by AbleSci AI