A Zero-Shot Learning Method Using Artificial Neural Network for Drift Calibration of Gas Sensor Array

电子鼻 计算机科学 分类器(UML) 人工智能 模式识别(心理学) 人工神经网络 概念漂移 传感器阵列 校准 独立成分分析 学习迁移 数据挖掘 机器学习 数据流挖掘 数学 统计
作者
Yu‐Chieh Cheng,Ting-I Chou,Jye-Luen Lee,Shih-Wen Chiu,Kea Tiong Tang
出处
期刊:Meeting abstracts 卷期号:MA2020-01 (26): 1856-1856
标识
DOI:10.1149/ma2020-01261856mtgabs
摘要

Introduction Electronic nose (E-nose) has many applications in gas detection and classification such as identifying toxic gases from the environment or detecting breath biomarkers for various cancer diseases. The E-noses are usually designed using an array of gas sensors and a machine learning classifier, which is comprised of various models to distinguish the gas sensor data. However, the response of the gas sensors often suffers from unpredictable and uncertain drift issues due to sensor aging, process variation, and environmental interference. From the perspective of pattern recognition, drift causes test data distribution to differ from prior data distribution and reduce the classification accuracy. To solve the above-mentioned problems, drift correction methods such as component correction based on principle component analysis (CC-PCA) and orthogonal signal correction (OSC) can be employed. The correction methods try to remove some components from the data, while the drift is a dynamic and nonlinear one that cannot be easily separated from the data. The other method can be adaption in which new labeled data collected at different time or from different devices are required to update the classifier. The adaption method has better classification capability than the correction method; however, collecting new labeled data or transferring data is a laborious job. Hence, this paper proposes a transfer learning method to adapt different distributions of data without any additional data. Since no transfer data is required, the learning method can also be named as zero-shot learning. Method Fig. 1 shows the proposed method that considers both sensor response correction and adaption. This method combines auto-encoder as well as a neural network (NN) classifier with certain restrictions. The auto-encoder comprises an encoder and a decoder made by a 64-30-20 fully-connected neural network and a 20-30-64 fully-connected neural network, respectively. The purpose of the auto-encoder is to find latent vectors that can highly represent the input data. To find a highly representative latent vector, the encoder encodes the input sample to a 20-dimension latent vector, and then this latent vector is used to reconstruct a 64-dimension vector through the decoder. By forcing the reconstructed vector close to the input data, the latent vector can extract useful information from the input data and represent the corresponding input data. However, a highly representative latent space made up of latent vectors is not equivalent to a space that is easy to be clustered into different classes. For seeking a better classifiable latent space, the performance of the classifier is taken into consideration. In this work, a 20-10-6 fully-connected neural network is adopted as a classifier. In addition, statistical measurements are introduced to determine the transfer sample. The proposed method can be divided into two main stages: (1) Training stage (2) Calibration stage. During the training stage, training data are used to train the auto-encoder and the classifier along with the objective function including reconstruction error, classification error, and statistic error. The reconstruction error is the mean square error between the reconstructed data and the input. The classification error is determined by the classifier, which would be the binary cross-entropy error between NN output and the true label. Further, the statistic error measures the sparsity of the data distribution. For the calibration stage, some unknown samples are treated as test data. First, the test data will be given as the input of the model, consequently, the output of the model is obtained in the form of latent vector and classification result. Then, the statistic error can be calculated by the latent vector and the training data distribution in latent space. According to the customized qualification, the sample is determined to be reliable or not. If it is reliable, then it is put into the transfer dataset with its classification result as the label. Finally, the transfer dataset is used to calibrate the pre-train model through utilizing the transfer dataset as the training data in the first stage. Results and Conclusion The proposed method is evaluated on an open dataset discuss in [1], which collects 13,910 samples using a 16 metal-oxide (MOX) gas sensor array over a period of 36 months. While the dataset is processed using the proposed method, attention should be paid in determining what kind of sample is reliable. If less reliable samples are used, the model might collapse. On the other hand, if the transfer dataset contains only highly reliable samples, the classifier may be stuck with a similar distribution of training data, which means it is unable to adopt the new data distribution and leads to a big hurdle. Hence, the constraints of selecting transfer data are varied with calibration epochs. This work is the first zero-shot transfer learning method applied to E-nose data. The experimental results show that the performance is improved after data transfer. Besides, the other calibration methods on this dataset usually select the transfer data which can describe the overall data distribution in the test batch. Yet, the proposed method can be more practical and general, because a similar operation can be also applied to calibrate the instrumental variation. Furthermore, the proposed method lacks human labeling after the pre-trained model. That is, it is possible to obtain a better result working together with the traditional method and collecting newly labeled transfer data once in a while. Reference [1] A. Vergara, S. Vembu, T. Ayhan, M.A. Ryan, M.L. Homer, R. Huerta, Chemical gas sensor drift compensation using classifier ensembles, Sens. Actuators B: Chem. 166 (2012) 320-329 Figure 1

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
chenyuyuan完成签到,获得积分10
1秒前
1秒前
Liang完成签到,获得积分10
1秒前
华仔应助PATTOM采纳,获得10
2秒前
NN完成签到,获得积分10
2秒前
鲤鱼寻菡完成签到,获得积分10
3秒前
4秒前
ferritin发布了新的文献求助30
4秒前
开放灭绝发布了新的文献求助10
5秒前
健壮夏天发布了新的文献求助30
5秒前
6秒前
在水一方应助Mercury采纳,获得10
8秒前
大喜完成签到,获得积分10
8秒前
可爱的函函应助nashanbei采纳,获得10
8秒前
10秒前
qiao发布了新的文献求助10
11秒前
Akim应助Ma_Fangru采纳,获得30
12秒前
13秒前
十月的天空完成签到,获得积分10
14秒前
15秒前
星星轨迹发布了新的文献求助10
17秒前
18秒前
钦林发布了新的文献求助10
21秒前
21秒前
22秒前
heheheli发布了新的文献求助10
22秒前
23秒前
在水一方应助车灵波采纳,获得10
23秒前
24秒前
FashionBoy应助xiaoxiaoz采纳,获得10
25秒前
舒适访风发布了新的文献求助10
25秒前
hmgdktf发布了新的文献求助10
26秒前
大木头发布了新的文献求助10
26秒前
27秒前
wj完成签到,获得积分10
27秒前
28秒前
29秒前
30秒前
30秒前
赵凌完成签到,获得积分10
32秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
计划经济时代的工厂管理与工人状况(1949-1966)——以郑州市国营工厂为例 500
INQUIRY-BASED PEDAGOGY TO SUPPORT STEM LEARNING AND 21ST CENTURY SKILLS: PREPARING NEW TEACHERS TO IMPLEMENT PROJECT AND PROBLEM-BASED LEARNING 500
Sociologies et cosmopolitisme méthodologique 400
Why America Can't Retrench (And How it Might) 400
Another look at Archaeopteryx as the oldest bird 390
Partial Least Squares Structural Equation Modeling (PLS-SEM) using SmartPLS 3.0 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 生物化学 物理 纳米技术 计算机科学 内科学 化学工程 复合材料 物理化学 基因 催化作用 遗传学 冶金 电极 光电子学
热门帖子
关注 科研通微信公众号,转发送积分 4624923
求助须知:如何正确求助?哪些是违规求助? 4024171
关于积分的说明 12456546
捐赠科研通 3708857
什么是DOI,文献DOI怎么找? 2045726
邀请新用户注册赠送积分活动 1077723
科研通“疑难数据库(出版商)”最低求助积分说明 960238