Probabilistic Attention Based on Gaussian Processes for Deep Multiple Instance Learning

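Overfitting; Computer science; Artificial intelligence; Probabilistic logic; Machine learning; MNIST database; Gaussian process; Robustness (evolution); Deep learning; Gaussian distribution; Uncertainty quantification; Artificial neural network; Quantum mechanics; Gene; Biochemistry; Physics; Chemistry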

Authors

Arne Schmidt, Pablo Morales-Álvarez, Rafael Molina

Source

Journal: IEEE Transactions on Neural Networks and Learning Systems [Institute of Electrical and Electronics Engineers]
Date: 2024-01-01 | Volume/Issue: not listed | Pages: 1-14 | Citations: 3

Links

arxiv.org, nih.gov, doi.org

Identifier

DOI: 10.1109/tnnls.2023.3245329

Abstract

Multiple instance learning (MIL) is a weakly supervised learning paradigm that is becoming increasingly popular because it requires less labeling effort than fully supervised methods. This is especially interesting for areas where the creation of large annotated datasets remains challenging, as in medicine. Although recent deep learning MIL approaches have obtained state-of-the-art results, they are fully deterministic and do not provide uncertainty estimates for their predictions. In this work, we introduce the attention Gaussian process (AGP) model, a novel probabilistic attention mechanism based on Gaussian processes (GPs) for deep MIL. AGP provides accurate bag-level predictions as well as instance-level explainability and can be trained end-to-end. Moreover, its probabilistic nature guarantees robustness to overfitting on small datasets and provides uncertainty estimates for the predictions. The latter is especially important in medical applications, where decisions have a direct impact on the patient's health. The proposed model is validated experimentally as follows. First, its behavior is illustrated in two synthetic MIL experiments based on the well-known MNIST and CIFAR-10 datasets, respectively. Then, it is evaluated in three different real-world cancer detection experiments. AGP outperforms state-of-the-art MIL approaches, including deterministic deep learning ones. It shows strong performance even on a small dataset with fewer than 100 labels and generalizes better than competing methods on an external test set. Moreover, we experimentally show that predictive uncertainty correlates with the risk of wrong predictions and is therefore a good indicator of reliability in practice. Our code is publicly available.
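
To make the idea of probabilistic attention over a bag of instances more concrete, here is a minimal NumPy sketch. It is not the authors' AGP model: the abstract does not describe the architecture in detail, so this sketch replaces the Gaussian process with independent Gaussian noise on linear attention logits and estimates uncertainty by Monte Carlo sampling. The function name, the parameters w_mean, logit_std, and w_clf, and the linear classifier are all hypothetical choices made for illustration only.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    e = np.exp(shifted)
    return e / e.sum(axis=axis, keepdims=True)


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def probabilistic_attention_predict(bag, w_mean, logit_std, w_clf,
                                    n_samples=200, seed=0):
    """Illustrative stand-in for a probabilistic attention MIL head.

    bag:       (n_instances, d) array of instance features.
    w_mean:    (d,) weights giving the mean attention logit per instance
               (hypothetical; a GP would supply this in the actual model).
    logit_std: standard deviation of the Gaussian noise on the logits
               (hypothetical replacement for the GP posterior variance).
    w_clf:     (d,) weights of a linear bag-level classifier (hypothetical).

    Returns the mean bag probability, its standard deviation across the
    Monte Carlo samples, and the mean attention weight per instance.
    """
    rng = np.random.default_rng(seed)
    mean_logits = bag @ w_mean                          # (n_instances,)
    noise = rng.normal(size=(n_samples, bag.shape[0]))  # one row per sample
    logits = mean_logits + logit_std * noise            # (n_samples, n_instances)
    attention = softmax(logits, axis=1)                 # rows sum to 1
    bag_embeddings = attention @ bag                    # (n_samples, d)
    probs = sigmoid(bag_embeddings @ w_clf)             # (n_samples,)
    return probs.mean(), probs.std(), attention.mean(axis=0)


# Usage on a random bag of 5 instances with 3 features each.
demo_rng = np.random.default_rng(1)
demo_bag = demo_rng.normal(size=(5, 3))
p_mean, p_std, att = probabilistic_attention_predict(
    demo_bag, np.ones(3), 0.5, np.ones(3))
```

In this sketch, the averaged attention weights play the role of the instance-level explanation, and the spread of the sampled bag probabilities plays the role of the predictive uncertainty mentioned in the abstract.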
