最小边界框
计算机科学
基本事实
推论
跳跃式监视
回归
功能(生物学)
度量(数据仓库)
边距(机器学习)
对象(语法)
趋同(经济学)
人工智能
机器学习
过程(计算)
算法
数学
数据挖掘
统计
图像(数学)
进化生物学
经济
生物
经济增长
操作系统
出处
期刊:Cornell University - arXiv
日期:2022-01-01
被引量:132
标识
DOI:10.48550/arxiv.2205.12740
摘要
The effectiveness of Object Detection, one of the central problems in computer vision tasks, highly depends on the definition of the loss function - a measure of how accurately your ML model can predict the expected outcome. Conventional object detection loss functions depend on aggregation of metrics of bounding box regression such as the distance, overlap area and aspect ratio of the predicted and ground truth boxes (i.e. GIoU, CIoU, ICIoU etc). However, none of the methods proposed and used to date considers the direction of the mismatch between the desired ground box and the predicted, "experimental" box. This shortage results in slower and less effective convergence as the predicted box can "wander around" during the training process and eventually end up producing a worse model. In this paper a new loss function SIoU was suggested, where penalty metrics were redefined considering the angle of the vector between the desired regression. Applied to conventional Neural Networks and datasets it is shown that SIoU improves both the speed of training and the accuracy of the inference. The effectiveness of the proposed loss function was revealed in a number of simulations and tests.
科研通智能强力驱动
Strongly Powered by AbleSci AI