FusionM4Net: A multi-stage multi-modal learning algorithm for multi-label skin lesion classification

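Computer science; Artificial intelligence; Modality (human-computer interaction); Pattern recognition (psychology); Construct (Python library); Feature (linguistics); Modal verb; Support vector machine; Stage (stratigraphy); Machine learning; Philosophy; Paleontology; Biochemistry; Polymer chemistry; Programming language; Linguistics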

Authors

Peng Tang, Xintong Yan, Yang Nan, Xiang Shao, Sebastian Krammer, Tobias Lasser

Source

Journal: Medical Image Analysis [Elsevier]
Date: 2022-02-01; Volume/Issue: 76: 102307-102307; Citations: 39

Links

nih.gov, doi.org

Identifiers

DOI: 10.1016/j.media.2021.102307

Abstract

Skin disease is one of the most common diseases in the world. Deep learning-based methods have achieved excellent skin lesion recognition performance, but most of them rely on dermoscopy images alone. Recent works that use multi-modality data (patient meta-data, clinical images, and dermoscopy images) adopt a one-stage fusion approach and optimize information fusion only at the feature level. Because these methods do not fuse information at the decision level, they cannot fully use the data of all modalities. This work proposes a novel two-stage multi-modal learning algorithm (FusionM4Net) for multi-label skin disease classification. In the first stage, we construct a FusionNet, which extracts and integrates the representations of clinical and dermoscopy images at the feature level and then uses Fusion Scheme 1 to fuse information at the decision level. In the second stage, to further incorporate the patient's meta-data, we propose Fusion Scheme 2, which integrates the multi-label predictive information from the first stage with the patient's meta-data to train an SVM cluster. The final diagnosis is formed by fusing the predictions from the first and second stages. Our algorithm was evaluated on the seven-point checklist dataset, a well-established multi-modality multi-label skin disease dataset. Without using the patient's meta-data, the first stage of FusionM4Net (FusionM4Net-FS) achieved an average accuracy of 75.7% for multi-classification tasks and 74.9% for diagnostic tasks, which is more accurate than other state-of-the-art methods. By further fusing the patient's meta-data in the second stage (FusionM4Net-SS), the entire FusionM4Net boosts the average accuracy to 77.0% and the diagnostic accuracy to 78.5%, which indicates robust classification performance on the label-imbalanced dataset.
The corresponding code is available at: https://github.com/pixixiaonaogou/MLSDR.
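
The two fusion steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how Fusion Scheme 1 combines branch outputs, so a weighted average of per-branch class probabilities is assumed here, and the function names (`fuse_decisions`, `second_stage_features`), the three branches, and all numeric values are hypothetical.

```python
from typing import List, Optional, Sequence


def fuse_decisions(
    branch_probs: Sequence[Sequence[float]],
    weights: Optional[Sequence[float]] = None,
) -> List[float]:
    """Decision-level fusion, ASSUMED to be a weighted average of the
    class-probability vectors produced by each branch."""
    n_branches = len(branch_probs)
    if n_branches == 0:
        raise ValueError("at least one branch is required")
    if weights is None:
        weights = [1.0] * n_branches  # equal weighting by default
    if len(weights) != n_branches:
        raise ValueError("one weight per branch is required")
    n_classes = len(branch_probs[0])
    if any(len(p) != n_classes for p in branch_probs):
        raise ValueError("all branches must predict the same classes")
    total = float(sum(weights))
    return [
        sum(w * p[c] for w, p in zip(weights, branch_probs)) / total
        for c in range(n_classes)
    ]


def second_stage_features(
    stage_one_probs: Sequence[Sequence[float]],
    meta: Sequence[float],
) -> List[float]:
    """Concatenate the stage-one probability vectors of all labels with the
    encoded meta-data, giving one input vector for a second-stage classifier."""
    features: List[float] = []
    for probs in stage_one_probs:
        features.extend(probs)
    features.extend(meta)
    return features


# Hypothetical outputs of three branches for one 3-class label.
clinical = [0.5, 0.3, 0.2]
dermoscopy = [0.2, 0.6, 0.2]
feature_fused = [0.3, 0.5, 0.2]

fused = fuse_decisions([clinical, dermoscopy, feature_fused])
predicted_class = max(range(len(fused)), key=fused.__getitem__)

# Hypothetical second label (2 classes) and a hypothetical encoded meta-data vector.
other_label = [0.7, 0.3]
meta = [1.0, 0.0]
features = second_stage_features([fused, other_label], meta)
```

In a second stage of this kind, `features` would be the input to one SVM per label (for example scikit-learn's `SVC`), and the SVM outputs would then be fused with the stage-one predictions. The exact fusion rule and weights used by FusionM4Net are given in the paper and the linked repository, not in this sketch.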
