波形
合成数据
集合(抽象数据类型)
数据集
任务(项目管理)
数据挖掘
数据质量
训练集
机器学习
噪音(视频)
质量(理念)
人工智能
事件(粒子物理)
计算机科学
图像(数学)
工程类
电信
量子力学
运营管理
雷达
系统工程
程序设计语言
公制(单位)
哲学
物理
认识论
作者
Tiantong Wang,Daniel T. Trugman,Youzuo Lin
摘要
Abstract Detecting earthquake arrivals within seismic time series can be a challenging task. Visual, human detection has long been considered the gold standard but requires intensive manual labor that scales poorly to large data sets. In recent years, automatic detection methods based on machine learning have been developed to improve the accuracy and efficiency. However, the accuracy of those methods relies on access to a sufficient amount of high‐quality labeled training data, often tens of thousands of records or more. We aim to resolve this dilemma by answering two questions: (1) provided with a limited amount of reliable labeled data, can we use them to generate additional, realistic synthetic waveform data? and (2) can we use those synthetic data to further enrich the training set through data augmentation, thereby enhancing detection algorithms? To address these questions, we use a generative adversarial network (GAN), a type of machine learning model which has shown supreme capability in generating high‐quality synthetic samples in multiple domains. Once trained, our GAN model is capable of producing realistic seismic waveforms of multiple labels (noise and event classes). Applied to real Earth seismic data sets in Oklahoma, we show that data augmentation from our GAN‐generated synthetic waveforms can be used to improve earthquake detection algorithms in instances when only small amounts of labeled training data are available.
科研通智能强力驱动
Strongly Powered by AbleSci AI