计算机科学
分类
人工智能
突出
机制(生物学)
过程(计算)
计算机图形学
人类视觉系统模型
计算机视觉
人机交互
图像(数学)
认识论
操作系统
哲学
作者
Meng-Hao Guo,Tianxing Xu,Jiangjiang Liu,Zheng-Ning Liu,Peng-Tao Jiang,Tai‐Jiang Mu,Song-Hai Zhang,Ralph R. Martin,Ming‐Ming Cheng,Shi‐Min Hu
标识
DOI:10.1007/s41095-022-0271-y
摘要
Humans can naturally and effectively find salient regions in complex scenes. Motivated by this observation, attention mechanisms were introduced into computer vision with the aim of imitating this aspect of the human visual system. Such an attention mechanism can be regarded as a dynamic weight adjustment process based on features of the input image. Attention mechanisms have achieved great success in many visual tasks, including image classification, object detection, semantic segmentation, video understanding, image generation, 3D vision, multi-modal tasks and self-supervised learning. In this survey, we provide a comprehensive review of various attention mechanisms in computer vision and categorize them according to approach, such as channel attention, spatial attention, temporal attention and branch attention; a related repository https://github.com/MenghaoGuo/Awesome-Vision-Attentions is dedicated to collecting related work. We also suggest future directions for attention mechanism research.
科研通智能强力驱动
Strongly Powered by AbleSci AI