计算机科学
分类
判别式
人工智能
卷积神经网络
深度学习
特征提取
特征(语言学)
机器学习
光学(聚焦)
人工神经网络
模式识别(心理学)
特征学习
相似性(几何)
领域(数学)
图像(数学)
语言学
光学
物理
哲学
数学
纯数学
作者
Yuxiang Xie,Quanzhi Gong,Xidao Luan,Jie Yan,Jiahui Zhang
出处
期刊:Chinese Journal of Systems Engineering and Electronics
[Institute of Electrical and Electronics Engineers]
日期:2023-01-01
卷期号:: 1-20
标识
DOI:10.23919/jsee.2022.000155
摘要
Deep learning has achieved excellent results in various tasks in the field of computer vision, especially in fine-grained visual categorization. It aims to distinguish the subordinate categories of the label-level categories. Due to high intra-class variances and high inter-class similarity, the fine-grained visual categorization is extremely challenging. This paper first briefly introduces and analyzes the related public datasets. After that, some of the latest methods are reviewed. Based on the feature types, the feature processing methods, and the overall structure used in the model, we divide them into three types of methods: methods based on general convolutional neural network (CNN) and strong supervision of parts, methods based on single feature processing, and methods based on multiple feature processing. Most methods of the first type have a relatively simple structure, which is the result of the initial research. The methods of the other two types include models that have special structures and training processes, which are helpful to obtain discriminative features. We conduct a specific analysis on several methods with high accuracy on public datasets. In addition, we support that the focus of the future research is to solve the demand of existing methods for the large amount of the data and the computing power. In terms of technology, the extraction of the subtle feature information with the burgeoning vision transformer (ViT) network is also an important research direction.
科研通智能强力驱动
Strongly Powered by AbleSci AI