计算机科学
人工智能
蒸馏
任务(项目管理)
一般化
回归
机器学习
模式识别(心理学)
对象(语法)
目标检测
分类器(UML)
数学
统计
化学
数学分析
管理
有机化学
经济
作者
Ruining Tang,Zhenyu Liu,Yangguang Li,Yiguo Song,Hui Liu,Qide Wang,Jing Shao,Guifang Duan,Jianrong Tan
标识
DOI:10.1016/j.patcog.2023.109320
摘要
Mainstream object detectors are commonly constituted of two sub-tasks, including classification and regression tasks, implemented by two parallel heads. This classic design paradigm inevitably leads to inconsistent spatial distributions between classification score and localization quality (IOU). Therefore, this paper alleviates this misalignment in the view of knowledge distillation. First, we observe that the massive teacher achieves a higher proportion of harmonious predictions than the lightweight student. Based on this intriguing observation, a novel Harmony Score (HS) is devised to estimate the alignment of classification and regression qualities. HS models the relationship between two sub-tasks and is seen as prior knowledge to promote harmonious predictions for the student. Second, this spatial misalignment will result in inharmonious region selection when distilling features. To alleviate this problem, a novel Task-decoupled Feature Distillation (TFD) is proposed by flexibly balancing the contributions of classification and regression tasks. Eventually, HD and TFD constitute the proposed method, named Task-Balanced Distillation (TBD). Extensive experiments demonstrate the considerable potential and generalization of the proposed method. Notably, when equipped with TBD, the performances of RetinaNet-R18/RetinaNet-R50/Faster-RCNN-R18 can be boosted from 33.2/37.4/34.5 to 37.3/41.2/37.7, outperforming the recent KD-based methods like FRS, FGD, and MGD.
科研通智能强力驱动
Strongly Powered by AbleSci AI