变压器
人工智能
机器视觉
工程类
计算机视觉
冶金
材料科学
计算机科学
电气工程
电压
作者
Vinod Vasan,Naveen Venkatesh Sridharan,V. Sugumaran,Mohammadreza Aghaei
出处
期刊:Heliyon
[Elsevier]
日期:2024-10-01
卷期号:10 (19): e38498-e38498
标识
DOI:10.1016/j.heliyon.2024.e38498
摘要
This study proposes a vision transformer to detect visual defects on steel surfaces. The proposed approach utilizes an open-source image dataset to classify steel surface conditions into six fault categories namely, crazing, inclusion, rolled in, pitted surface, scratches and patches. The defect images are first subject to resizing and then fed into a vision transformer subject to different hyperparameter configurations to determine the most optimal setting to render highest classification performance. The performance of the model is evaluated for different hyperparameter configurations, and the most optimal configuration is examined using the associated confusion matrices. It was observed that the proposed model presents a high overall accuracy of 96.39 % for detection and classification of steel surface faults. The study presents a descriptive insight into the vision transformer architecture and in addition, compares the performance of the current model with the results of other approaches suggested for application in literature. Vision transformers can serve as standalone approaches and suitable alternatives to the widely used convolution neural networks (CNNs) by actuating complex defect detection and classification tasks in real-time, enabling efficient and robust condition monitoring of a wide range of defects.
科研通智能强力驱动
Strongly Powered by AbleSci AI