计算机科学
人工智能
卷积神经网络
深度学习
目标检测
分割
蒙特卡罗方法
不确定度量化
机器学习
变压器
贝叶斯推理
贝叶斯概率
模式识别(心理学)
数学
工程类
统计
电气工程
电压
作者
Sai Harsha Yelleni,Deepshikha Kumari,P. K. Srijith,C. Krishna Mohan
标识
DOI:10.1016/j.patcog.2023.110003
摘要
With the advancements made in deep learning, computer vision problems have seen a great improvement in performance. However, in many real-world applications such as autonomous driving vehicles, the risk associated with incorrect predictions of objects or segmentation of images is very high. Standard deep learning models for object detection and segmentation such as YOLO models are often overconfident in their predictions and do not take into account the uncertainty in predictions on out-of-distribution data. In this work, we propose an efficient and effective approach, Monte-Carlo DropBlock (MC-DropBlock), to model uncertainty in YOLO and convolutional vision Transformers for object detection. The proposed approach applies drop-block during training time and testing time on the convolutional layer of the deep learning models such as YOLO and convolutional transformer. We theoretically show that this leads to a Bayesian convolutional neural network capable of capturing the epistemic uncertainty in the model. Additionally, we capture the aleatoric uncertainty in the data using a Gaussian likelihood. We demonstrate the effectiveness of the proposed approach on modeling uncertainty in object detection and segmentation tasks using out-of-distribution experiments. Experimental results show that MC-DropBlock improves the generalization, calibration, and uncertainty modeling capabilities of YOLO models and convolutional Transformer models for object detection and segmentation.
科研通智能强力驱动
Strongly Powered by AbleSci AI