人工智能
计算机科学
鉴定(生物学)
文字嵌入
可视化
深度学习
段落
模式识别(心理学)
图像(数学)
嵌入
人工神经网络
上下文图像分类
编码器
医学影像学
特征提取
机器学习
植物
万维网
生物
操作系统
作者
Bokai Yang,Yujie Yang,Qi Li,Denan Lin,Ye Li,Jing Zheng,Yunpeng Cai
出处
期刊:Tsinghua Science & Technology
[Tsinghua University Press]
日期:2023-01-06
卷期号:28 (4): 613-627
被引量:3
标识
DOI:10.26599/tst.2022.9010012
摘要
The lack of labeled image data poses a serious challenge to the application of artificial intelligence (AI) in medical image diagnosis.Medical image notes contain valuable patient information that could be used to label images for machine learning tasks.However, most image note texts are unstructured with heterogeneity and short-paragraph characters, which fail traditional keyword-based techniques.We utilized a deep learning approach to recover missing labels for medical image notes automatically by using a combination of deep word embedding and deep neural network classifiers.Bidirectional encoder representations from transformers trained on medical image notes corpus (MinBERT) were proposed.We applied the proposed techniques to two typical classification tasks: Medical image type identification and clinical diagnosis identification.The two methods significantly outperformed baseline methods and presented high accuracies of 99.56% and 99.72% in image type identification and of 94.56% and 92.45% in clinical diagnosis identification.Visualization analysis further indicated that word embedding could efficiently capture semantic similarities and regularities across diverse expressions.Results indicated that our proposed framework could accurately recover the missing label information of medical images through the automatic extraction of electronic medical record information.Hence, it could serve as a powerful tool for exploring useful training data in various medical AI applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI