A Zero-Shot Learning Method Using Artificial Neural Network for Drift Calibration of Gas Sensor Array

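electronic nose; computer science; classifier (UML); artificial intelligence; pattern recognition (psychology); artificial neural network; concept drift; sensor array; calibration; independent component analysis; transfer learning; data mining; machine learning; data stream mining; mathematics; statistics
Authors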
Yu‐Chieh Cheng,Ting-I Chou,Jye-Luen Lee,Shih-Wen Chiu,Kea Tiong Tang
Source
Journal: Meeting Abstracts, Volume/Issue: MA2020-01 (26): 1856-1856
Identifier
DOI:10.1149/ma2020-01261856mtgabs
Abstract

Introduction

An electronic nose (E-nose) has many applications in gas detection and classification, such as identifying toxic gases in the environment or detecting breath biomarkers for various cancers. E-noses are usually built from an array of gas sensors and a machine learning classifier, which comprises various models to distinguish the gas sensor data. However, the gas sensor response often suffers from unpredictable drift caused by sensor aging, process variation, and environmental interference. From the perspective of pattern recognition, drift causes the test data distribution to differ from the prior data distribution and reduces classification accuracy.

To address these problems, drift correction methods such as component correction based on principal component analysis (CC-PCA) and orthogonal signal correction (OSC) can be employed. Correction methods try to remove some components from the data, but drift is dynamic and nonlinear and cannot be easily separated from the data. The alternative is adaptation, in which new labeled data collected at a different time or from different devices are required to update the classifier. Adaptation methods classify better than correction methods; however, collecting new labeled data or transfer data is laborious. Hence, this paper proposes a transfer learning method that adapts to different data distributions without any additional data. Since no transfer data is required, the method can also be called zero-shot learning.

Method

Fig. 1 shows the proposed method, which considers both sensor response correction and adaptation. The method combines an auto-encoder with a neural network (NN) classifier under certain restrictions. The auto-encoder comprises an encoder and a decoder, implemented as a 64-30-20 fully-connected neural network and a 20-30-64 fully-connected neural network, respectively.
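The layer sizes above can be illustrated with a minimal NumPy forward pass. This is only a sketch under stated assumptions: the abstract gives the layer widths but not the activation function, weight initialization, or training procedure, so the tanh hidden activations, the random weights, the synthetic input, and the helper names `init_mlp` and `forward` are all hypothetical.

```python
import numpy as np

def init_mlp(sizes, rng):
    """Return a list of (weight, bias) pairs for a fully-connected network."""
    return [(rng.normal(0.0, 0.1, size=(n_in, n_out)), np.zeros(n_out))
            for n_in, n_out in zip(sizes[:-1], sizes[1:])]

def forward(params, x):
    """Apply each layer in turn.

    ASSUMPTION: tanh on hidden layers and a linear output layer;
    the abstract does not specify the activations.
    """
    for i, (w, b) in enumerate(params):
        x = x @ w + b
        if i < len(params) - 1:
            x = np.tanh(x)
    return x

rng = np.random.default_rng(0)
encoder = init_mlp([64, 30, 20], rng)  # 64-30-20 encoder
decoder = init_mlp([20, 30, 64], rng)  # 20-30-64 decoder

x = rng.normal(size=(8, 64))           # 8 synthetic samples, 64 features each
z = forward(encoder, x)                # latent vectors
x_hat = forward(decoder, z)            # reconstructions
reconstruction_mse = float(np.mean((x_hat - x) ** 2))
```

With untrained weights the reconstruction error is meaningless; the point is only that the encoder maps 64 inputs to a 20-dimensional latent vector and the decoder maps it back to 64 outputs.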
The purpose of the auto-encoder is to find latent vectors that represent the input data well. To do so, the encoder encodes the input sample into a 20-dimensional latent vector, and the decoder then reconstructs a 64-dimensional vector from it. Forcing the reconstructed vector to be close to the input makes the latent vector extract useful information from the input and represent it. However, a highly representative latent space is not necessarily one that is easy to cluster into classes. To obtain a more classifiable latent space, the performance of the classifier is also taken into account. In this work, a 20-10-6 fully-connected neural network is adopted as the classifier. In addition, statistical measurements are introduced to determine the transfer samples.

The proposed method is divided into two main stages: (1) a training stage and (2) a calibration stage. During the training stage, the training data are used to train the auto-encoder and the classifier with an objective function that includes a reconstruction error, a classification error, and a statistic error. The reconstruction error is the mean squared error between the reconstructed data and the input. The classification error is the binary cross-entropy between the NN output and the true label. The statistic error measures the sparsity of the data distribution.

In the calibration stage, unknown samples are treated as test data. First, the test data are fed to the model, which outputs a latent vector and a classification result. Then, the statistic error is calculated from the latent vector and the training data distribution in latent space. Based on a customized qualification criterion, each sample is judged reliable or not.
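The three error terms and the reliability check can be sketched as follows. The abstract does not define the statistic error or the qualification criterion, so this example substitutes a simple stand-in: the squared distance of a latent vector to its class centroid, with a distance threshold for reliability. The function names, the loss weights `w_cls` and `w_stat`, the threshold `max_dist`, and the toy centroids are all assumptions for illustration, not the authors' definitions.

```python
import numpy as np

def reconstruction_error(x, x_hat):
    """Mean squared error between the input and its reconstruction."""
    return float(np.mean((x - x_hat) ** 2))

def classification_error(y_true, y_prob, eps=1e-12):
    """Binary cross-entropy averaged over samples and class outputs."""
    p = np.clip(y_prob, eps, 1.0 - eps)
    return float(-np.mean(y_true * np.log(p) + (1 - y_true) * np.log(1 - p)))

def statistic_error(z, labels, centroids):
    """ASSUMPTION: mean squared distance of each latent vector to its
    class centroid, used here as a stand-in for the unspecified measure."""
    return float(np.mean(np.sum((z - centroids[labels]) ** 2, axis=1)))

def total_loss(x, x_hat, y_true, y_prob, z, labels, centroids,
               w_cls=1.0, w_stat=1.0):
    """Weighted sum of the three terms (the weights are hypothetical)."""
    return (reconstruction_error(x, x_hat)
            + w_cls * classification_error(y_true, y_prob)
            + w_stat * statistic_error(z, labels, centroids))

def select_reliable(z_test, pred_labels, centroids, max_dist):
    """ASSUMPTION: a test sample is reliable when its latent vector lies
    within max_dist of the training centroid of its predicted class."""
    dist = np.linalg.norm(z_test - centroids[pred_labels], axis=1)
    return dist <= max_dist

# Toy example: six hypothetical class centroids in a 20-D latent space.
centroids = np.eye(6, 20) * 5.0
z_test = np.vstack([centroids[0] + 0.1,   # close to class 0
                    centroids[1] + 3.0])  # far from class 1
pred = np.array([0, 1])
mask = select_reliable(z_test, pred, centroids, max_dist=1.0)
transfer_z, transfer_labels = z_test[mask], pred[mask]  # pseudo-labeled set
```

In this toy case only the first sample passes the check, so only it would enter the transfer dataset, carrying its predicted class as the label.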
If a sample is reliable, it is added to the transfer dataset with its classification result as the label. Finally, the transfer dataset is used to calibrate the pre-trained model by serving as the training data of the first stage.

Results and Conclusion

The proposed method is evaluated on the open dataset discussed in [1], which contains 13,910 samples collected with an array of 16 metal-oxide (MOX) gas sensors over a period of 36 months. When applying the method to this dataset, care must be taken in deciding which samples are reliable. If less reliable samples are used, the model might collapse. On the other hand, if the transfer dataset contains only highly reliable samples, the classifier may stay stuck on a distribution similar to the training data, so it cannot adapt to the new data distribution. Hence, the constraints for selecting transfer data vary with the calibration epochs.

This work is the first zero-shot transfer learning method applied to E-nose data. The experimental results show that performance improves after data transfer. Other calibration methods on this dataset usually select transfer data that describe the overall data distribution in the test batch. The proposed method can be more practical and general, because a similar operation can also be applied to calibrate instrumental variation. Furthermore, the proposed method requires no human labeling after the model is pre-trained. It is therefore possible to obtain better results by combining it with traditional methods and collecting newly labeled transfer data once in a while.

Reference

[1] A. Vergara, S. Vembu, T. Ayhan, M.A. Ryan, M.L. Homer, R. Huerta, Chemical gas sensor drift compensation using classifier ensembles, Sens. Actuators B: Chem. 166 (2012) 320-329.

Figure 1
