计算机科学
变压器
人工智能
分割
图像分割
像素
计算机视觉
卷积神经网络
模式识别(心理学)
作者
Xiaojie Cui,Xuehua Chen,Jian Zhou,Dong Lin
摘要
Different from convolutional neural network, transformer is able to model the long-distance relationship between the image pixels, thus it is now widely used in computer vision and remote sensing community. This paper comprehensively reviews the development of transformer models in automatic image interpretation tasks, especially the applications in image classification, object detection and semantic segmentation. Specifically, the popular transformer models are thoroughly analyzed and compared to acquire their advantages and limitations. Finally, current challenges and future works are concluded.
科研通智能强力驱动
Strongly Powered by AbleSci AI