最小边界框
跳跃式监视
交叉口(航空)
计算机科学
模式识别(心理学)
马克西玛
公制(单位)
聚类分析
人工智能
汇流
边距(机器学习)
算法
数学
图像(数学)
机器学习
工程类
艺术
程序设计语言
运营管理
表演艺术
艺术史
航空航天工程
作者
Andrew Shepley,Greg Falzon,Paul Kwan,Ljiljana Branković
标识
DOI:10.1109/tpami.2023.3273210
摘要
Confluence is a novel non-Intersection over Union (IoU) alternative to Non-Maxima Suppression (NMS) in bounding box post-processing in object detection. It overcomes the inherent limitations of IoU-based NMS variants to provide a more stable, consistent predictor of bounding box clustering by using a normalized Manhattan Distance inspired proximity metric to represent bounding box clustering. Unlike Greedy and Soft NMS, it does not rely solely on classification confidence scores to select optimal bounding boxes, instead selecting the box which is closest to every other box within a given cluster and removing highly confluent neighboring boxes. Confluence is experimentally validated on the MS COCO and CrowdHuman benchmarks, improving Average Precision by 0.2--2.7% and 1--3.8% respectively and Average Recall by 1.3--9.3 and 2.4--7.3% when compared against Greedy and Soft-NMS variants. Quantitative results are supported by extensive qualitative analysis and threshold sensitivity analysis experiments support the conclusion that Confluence is more robust than NMS variants. Confluence represents a paradigm shift in bounding box processing, with potential to replace IoU in bounding box regression processes.
科研通智能强力驱动
Strongly Powered by AbleSci AI