计算机科学
合成数据
原始数据
人工智能
生成语法
动态时间归整
过程(计算)
机器学习
生成对抗网络
对抗制
数据挖掘
模式识别(心理学)
深度学习
程序设计语言
操作系统
作者
Marcos Lupión,Federico Cruciani,Ian Cleland,Chris Nugent,Pilar M. Ortigosa
出处
期刊:IEEE Journal of Biomedical and Health Informatics
[Institute of Electrical and Electronics Engineers]
日期:2024-02-12
卷期号:28 (4): 2350-2361
被引量:2
标识
DOI:10.1109/jbhi.2024.3364910
摘要
Currently, Human Activity Recognition (HAR) applications need a large volume of data to be able to generalize to new users and environments.However, the availability of labeled data is usually limited and the process of recording new data is costly and time-consuming.Synthetically increasing datasets using Generative Adversarial Networks (GANs) has been proposed, outperforming cropping, time-warping, and jittering techniques on raw signals.Incorporating GAN-generated synthetic data into datasets has been demonstrated to improve the accuracy of trained models.Regardless, currently, there is no optimal GAN architecture to generate accelerometry signals, neither a proper evaluation methodology to assess signal quality or accuracy using synthetic data.This work is the first to propose conditional Wasserstein Generative Adversarial Networks (cWGANs) to generate synthetic HAR accelerometry signals.Furthermore, we calculate quality metrics from the literature and study the impact of synthetic data on a large HAR dataset involving 395 users.Results show that i) cWGAN outperforms original Conditional Generative Adversarial Networks (cGANs), being 1D convolutional layers appropriate for generating accelerometry signals, ii) the performance improvement incorporating synthetic data is more significant as the dataset size is smaller, and iii) the quantity of synthetic data required is inversely proportional to the quantity of real data.
科研通智能强力驱动
Strongly Powered by AbleSci AI