自编码
计算机科学
人工智能
Boosting(机器学习)
模式识别(心理学)
半导体器件制造
故障检测与隔离
分类器(UML)
原始数据
机器学习
数据挖掘
深度学习
薄脆饼
工程类
电气工程
执行机构
程序设计语言
作者
Shu‐Kai S. Fan,Du‐Ming Tsai,Yeh Pei-Chi
标识
DOI:10.1109/tsm.2023.3238555
摘要
In current semiconductor manufacturing, limited raw trace data pertaining to defective wafers make fault detection (FD) assignments extremely difficult due to the data imbalance in wafer classification. To mitigate, this paper proposes using a variational autoencoder (VAE) as a data augmentation strategy for resolving data imbalance of temporal raw trace data. A VAE with few defective samples is first trained. By means of extracting the latent variables that characterize the distribution of the defective samples, we make use of the statistical randomness of the latent variables to generate synthesized defective samples via the decoder scheme in the trained VAE. Two data representations and VAE modeling strategies, concatenation of multiple and individual raw trace data as the input of the VAE during the training stage, are investigated. A real-data plasma enhanced chemical vapor deposition (PECVD) process having only few defective samples is used to illustrate the performance enhancement to wafer classification arising from the proposed data augmentation framework. Based on the computational comparisons between noted classification models, the proposed generative VAE model via the individual strategy enables the adaptive boosting (AdaBoost) classifier to achieve perfect performances in every metrics if the 80% and 100% over-sampling ratios are adopted.
科研通智能强力驱动
Strongly Powered by AbleSci AI