A survey of fine-grained visual categorization based on deep learning

计算机科学 分类 判别式 人工智能 卷积神经网络 深度学习 特征提取 特征(语言学) 机器学习 光学(聚焦) 人工神经网络 模式识别(心理学) 特征学习 相似性(几何) 领域(数学) 图像(数学) 语言学 光学 物理 哲学 数学 纯数学
作者
Yuxiang Xie,Quanzhi Gong,Xidao Luan,Jie Yan,Jiahui Zhang
出处
期刊:Chinese Journal of Systems Engineering and Electronics [Institute of Electrical and Electronics Engineers]
卷期号:: 1-20
标识
DOI:10.23919/jsee.2022.000155
摘要

Deep learning has achieved excellent results in various tasks in the field of computer vision, especially in fine-grained visual categorization. It aims to distinguish the subordinate categories of the label-level categories. Due to high intra-class variances and high inter-class similarity, the fine-grained visual categorization is extremely challenging. This paper first briefly introduces and analyzes the related public datasets. After that, some of the latest methods are reviewed. Based on the feature types, the feature processing methods, and the overall structure used in the model, we divide them into three types of methods: methods based on general convolutional neural network (CNN) and strong supervision of parts, methods based on single feature processing, and methods based on multiple feature processing. Most methods of the first type have a relatively simple structure, which is the result of the initial research. The methods of the other two types include models that have special structures and training processes, which are helpful to obtain discriminative features. We conduct a specific analysis on several methods with high accuracy on public datasets. In addition, we support that the focus of the future research is to solve the demand of existing methods for the large amount of the data and the computing power. In terms of technology, the extraction of the subtle feature information with the burgeoning vision transformer (ViT) network is also an important research direction.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
甘木木木木关注了科研通微信公众号
刚刚
1秒前
2秒前
peike完成签到,获得积分10
2秒前
komorebi发布了新的文献求助10
2秒前
yyy完成签到 ,获得积分10
3秒前
小蘑菇应助丸子采纳,获得10
4秒前
赵正洁发布了新的文献求助10
4秒前
田様应助朴实忆安采纳,获得10
4秒前
风笛发布了新的文献求助10
5秒前
111发布了新的文献求助10
5秒前
5秒前
atun完成签到,获得积分10
5秒前
小关发布了新的文献求助10
6秒前
7秒前
7秒前
那年丶看夕阳完成签到,获得积分10
7秒前
8秒前
9秒前
9秒前
9秒前
zho应助科研通管家采纳,获得10
9秒前
orixero应助科研通管家采纳,获得10
9秒前
9秒前
科研通AI2S应助科研通管家采纳,获得10
9秒前
共享精神应助科研通管家采纳,获得10
9秒前
zho应助科研通管家采纳,获得10
9秒前
xie应助科研通管家采纳,获得10
10秒前
orixero应助科研通管家采纳,获得10
10秒前
搜集达人应助科研通管家采纳,获得10
10秒前
10秒前
10秒前
NexusExplorer应助科研通管家采纳,获得10
10秒前
10秒前
TT应助科研通管家采纳,获得10
10秒前
jinshijie应助科研通管家采纳,获得10
10秒前
ding应助科研通管家采纳,获得10
10秒前
深情安青应助科研通管家采纳,获得10
10秒前
思源应助科研通管家采纳,获得10
10秒前
10秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Kinesiophobia : a new view of chronic pain behavior 3000
Les Mantodea de guyane 2500
Molecular Biology of Cancer: Mechanisms, Targets, and Therapeutics 2000
Standard: In-Space Storable Fluid Transfer for Prepared Spacecraft (AIAA S-157-2024) 1000
Signals, Systems, and Signal Processing 510
Discrete-Time Signals and Systems 510
热门求助领域 (近24小时)
化学 材料科学 生物 医学 工程类 计算机科学 有机化学 物理 生物化学 纳米技术 复合材料 内科学 化学工程 人工智能 催化作用 遗传学 数学 基因 量子力学 物理化学
热门帖子
关注 科研通微信公众号,转发送积分 5948968
求助须知:如何正确求助?哪些是违规求助? 7119799
关于积分的说明 15914362
捐赠科研通 5082096
什么是DOI,文献DOI怎么找? 2732368
邀请新用户注册赠送积分活动 1692792
关于科研通互助平台的介绍 1615538