计算机科学
初始化
人工智能
模式识别(心理学)
特征(语言学)
嵌入
分类器(UML)
特征提取
机器学习
哲学
语言学
程序设计语言
作者
Weiqiu Wang,Zhicheng Zhao,Pingyu Wang,Fei Su,Hongying Meng
标识
DOI:10.1109/tcsvt.2022.3161427
摘要
Deep neural networks have achieved great success on many visual recognition tasks. However, training data with a long-tailed distribution dramatically degenerates the performance of recognition models. In order to relieve this imbalance problem, an effective Long-Tailed Visual Recognition (LTVR) framework is proposed based on learned balance and robust features under long-tailed distribution circumstances. In this framework, a plug-and-play Attentive Feature Augmentation (AFA) module is designed to mine class-related and variation-related features of original samples via a novel hierarchical channel attention mechanism. Then, those features are aggregated to synthesize fake features to cope with the imbalance of the original dataset. Moreover, a Lay-Back Learning Schedule (LBLS) is developed to ensure a good initialization of feature embedding. Extensive experiments are conducted with a two-stage training method to verify the effectiveness of the proposed framework on both feature learning and classifier rebalancing in the long-tailed image recognition task. Experimental results show that, when trained with imbalanced datasets, the proposed framework achieves superior performance over the state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI