计算机科学
人工智能
机器学习
深度学习
自编码
卷积神经网络
发作性
预处理器
脑电图
模式识别(心理学)
医学
精神科
作者
Duong Nhu,Mubeen Janmohamed,Ana Antonic‐Baker,Piero Perucca,Terence J. O’Brien,Amanda Gilligan,Patrick Kwan,Chang Wei Tan,Levin Kuhlmann
出处
期刊:Journal of Neural Engineering
[IOP Publishing]
日期:2022-10-01
卷期号:19 (5): 051002-051002
被引量:15
标识
DOI:10.1088/1741-2552/ac9644
摘要
Abstract Automated interictal epileptiform discharge (IED) detection has been widely studied, with machine learning methods at the forefront in recent years. As computational resources become more accessible, researchers have applied deep learning (DL) to IED detection with promising results. This systematic review aims to provide an overview of the current DL approaches to automated IED detection from scalp electroencephalography (EEG) and establish recommendations for the clinical research community. We conduct a systematic review according to the PRISMA guidelines. We searched for studies published between 2012 and 2022 implementing DL for automating IED detection from scalp EEG in major medical and engineering databases. We highlight trends and formulate recommendations for the research community by analyzing various aspects: data properties, preprocessing methods, DL architectures, evaluation metrics and results, and reproducibility. The search yielded 66 studies, and 23 met our inclusion criteria. There were two main DL networks, convolutional neural networks in 14 studies and long short-term memory networks in three studies. A hybrid approach combining a hidden Markov model with an autoencoder was employed in one study. Graph convolutional network was seen in one study, which considered a montage as a graph. All DL models involved supervised learning. The median number of layers was 9 (IQR: 5–21). The median number of IEDs was 11 631 (IQR: 2663–16 402). Only six studies acquired data from multiple clinical centers. AUC was the most reported metric (median: 0.94; IQR: 0.94–0.96). The application of DL to IED detection is still limited and lacks standardization in data collection, multi-center testing, and reporting of clinically relevant metrics (i.e. F1, AUCPR, and false-positive/minute). However, the performance is promising, suggesting that DL might be a helpful approach. Further testing on multiple datasets from different clinical centers is required to confirm the generalizability of these methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI