帕斯卡(单位)
计算机科学
跳跃式监视
单调函数
趋同(经济学)
功能(生物学)
最小边界框
回归
算法
人工智能
数学
统计
图像(数学)
经济增长
进化生物学
生物
数学分析
经济
程序设计语言
作者
Liu Can,Kaige Wang,Qing Li,Fazhan Zhao,Kun Zhao,Hongtu Ma
标识
DOI:10.1016/j.neunet.2023.11.041
摘要
Bounding box regression (BBR) is one of the core tasks in object detection, and the BBR loss function significantly impacts its performance. However, we have observed that existing IoU-based loss functions suffer from unreasonable penalty factors, leading to anchor boxes expanding during regression and significantly slowing down convergence. To address this issue, we intensively analyzed the reasons for anchor box enlargement. In response, we propose a Powerful-IoU (PIoU) loss function, which combines a target size-adaptive penalty factor and a gradient-adjusting function based on anchor box quality. The PIoU loss guides anchor boxes to regress along efficient paths, resulting in faster convergence than existing IoU-based losses. Additionally, we investigate the focusing mechanism and introduce a non-monotonic attention layer that was combined with PIoU to obtain a new loss function PIoU v2. PIoU v2 loss enhances the capability to focus on anchor boxes of medium quality. By incorporating PIoU v2 into popular object detectors such as YOLOv8 and DINO, we achieved an increase in average precision (AP) and improved performance compared to their original loss functions on the MS COCO and PASCAL VOC datasets, thus validating the effectiveness of our proposed improvement strategies.
科研通智能强力驱动
Strongly Powered by AbleSci AI