计算机科学
联营
判别式
特征(语言学)
人工智能
分类
卷积神经网络
代表(政治)
模式识别(心理学)
统计模型
协方差
特征提取
特征学习
机器学习
领域(数学)
数学
统计
政治学
哲学
政治
法学
纯数学
语言学
作者
Qingtao Wang,Ke Zhang,Jin Fan,Shaoli Huang,Lianbo Zhang
标识
DOI:10.1109/icpr48806.2021.9412537
摘要
Fine-grained visual categorization aims to learn a robust image representation modeling subtle differences from similar categories. Existing methods in this field tackle the problem by designing complex frameworks, which produce high-level features by performing first-order or second-order pooling. Despite the impressive performance achieved by these strategies, the single-order networks only carry linear or non-linear information of the last convolutional layer, neglecting the fact that features from different orders are mutually complementary. In this paper, we propose a multi-order feature statistical method (MOFS), which learns fine-grained features characterizing multiple orders. Specifically, the MOFS consists of two sub-modules: (i) a first-order module modeling both mid-level and high-level features. (ii) a covariance feature statistical module capturing high-order features. By deploying these two sub-modules on the top of existing backbone networks, MOFS simultaneously captures multi-level of discriminative patters including local, global and co-related patters. We evaluate the proposed method on three challenging benchmarks, namely CUB-200-2011, Stanford Cars, and FGVC-Aircraft. Compared with state-of-the-art methods, experiment results exhibit superior performance in recognizing fine-grained objects.
科研通智能强力驱动
Strongly Powered by AbleSci AI