计算机科学
过度拟合
人工智能
计算机辅助设计
稳健性(进化)
管道(软件)
机器学习
推论
自动化
最小边界框
模式识别(心理学)
数据挖掘
图像(数学)
工程制图
人工神经网络
机械工程
生物化学
化学
工程类
基因
程序设计语言
作者
Igor Garcia Ballhausen Sampaio,José Viterbo,Joris Guérin
摘要
Abstract Object detection (OD) is used for visual quality control in factories. Images that compose training datasets are often collected directly from the production line and labeled with bounding boxes manually. Such data represent well the inference context but might lack diversity, implying a risk of overfitting. To address this issue, we propose a dataset construction method based on an automated pipeline, which receives a CAD model of an object and returns a set of realistic synthetic labeled images (code publicly available). Our approach can be easily used by non‐expert users and is relevant for industrial applications, where CAD models are widely available. We performed experiments to compare the use of datasets obtained by the two different ways—collecting and labeling real images or applying the proposed automated pipeline—in the classification of five different industrial parts. To ensure that both approaches can be used without deep learning expertise, all training parameters were kept fixed during these experiments. In our results, both methods were successful for some objects but failed for others. However, we have shown that the combined use of real and synthetic images led to better results. This finding has the potential to make industrial OD models more robust to poor data collection and labeling errors, without increasing the difficulty of the training process.
科研通智能强力驱动
Strongly Powered by AbleSci AI