Can Left Atrial Enlargement Be Estimated From the Electrocardiogram? An Artificial Intelligence Approach to Screening Risk of Atrial Fibrillation and Heart Disease

医学 心房颤动 左心房扩大 心脏病学 内科学 P波 心脏病 疾病 窦性心律
作者
Vijay S. Chauhan
出处
期刊:Canadian Journal of Cardiology [Elsevier]
卷期号:40 (4): 595-597
标识
DOI:10.1016/j.cjca.2023.11.015
摘要

Assessing the risk of cardiovascular disease is challenging, but essential to appropriate clinical work-up and management. Among the plethora of cardiac risk markers available to primary care physicians, internists, and cardiologist, left atrial enlargement (LAE) is rarely considered, but it has been shown to independently predict atrial fibrillation (AF), heart failure, coronary artery disease, and all-cause mortality.1Bouzas-Mosquera A. Broullón F.J. Álvarez-García N. et al.Left atrial size and risk for all-cause mortality and ischemic stroke.CMAJ. 2011; 183: r657-r664Crossref PubMed Scopus (122) Google Scholar,2Gardin J.M. McClelland R. Kitzman D. et al.M-Mode echocardiographic predictors of six- to seven-year incidence of coronary heart disease, stroke, congestive heart failure, and mortality in an elderly cohort (the Cardiovascular Health Study).Am J Cardiol. 2001; 87: 1051-1057Abstract Full Text Full Text PDF PubMed Scopus (352) Google Scholar In fact, severe LAE (left atrial diameter [LAD] ≥ 50 mm) is associated with a 4-fold increased risk of new-onset AF.3Psaty B.M. Manolio T.A. Kuller L.H. et al.Incidence of and risk factors for atrial fibrillation in older adults.Circulation. 1997; 96: 2455-2461Crossref PubMed Scopus (1255) Google Scholar As such, there may be value in screening for LAE even before it is apparent on echocardiography. The 12-lead electrocardiogram (ECG) is a readily available diagnostic tool, but standard criteria for LAE are insensitive because only a few P-wave dimensions in leads II and V1 are considered.4Munuswamy K. Alpert M.A. Martin R.H. et al.Sensitivity and specificity of commonly used electrocardiographic criteria for left atrial enlargement determined by M-mode echocardiography.Am J Cardiol. 1984; 53: 829-832Abstract Full Text PDF PubMed Scopus (72) Google Scholar,5Lee K.S. Appleton C.P. Lester S.J. et al.Relation of electrocardiographic criteria for left atrial enlargement to two-dimensional echocardiographic left atrial volume measurements.Am J Cardiol. 2007; 99: 113-118Abstract Full Text Full Text PDF PubMed Scopus (38) Google Scholar In the past decade, artificial intelligence using deep learning has been applied to a variety of complex biological signals, including the ECG, to enhance the diagnosis and prognosis of cardiovascular disease.6Siontis K.C. Noseworthy P.A. Attia Z.I. Friedman P.A. Artificial intelligence-enhanced electrocardiography in cardiovascular disease management.Nate Rev. 2021; 18: 465-478Google Scholar Given the nuances in abnormal P-wave morphology and its company with the QRS-T wave, analysis of the 12-lead ECG waveform with the use of deep learning may improve ECG-based detection of LAE. In this issue of the Canadian Journal of Cardiology, Chou et al.7Chou C.C. Liu Z.Y. Chang P.C. et al.Comparing artificial intelligence-enabled electrocardiogram models in identifying left atrium enlargement and long-term cardiovascular risk.Can J Cardiol. 2024; 40: 585-594Abstract Full Text Full Text PDF Scopus (1) Google Scholar develop a deep learning–enabled ECG model to detect LAE ≥ 50 mm and its long-term cardiovascular risk. Their single-centre, retrospective study included 382,593 consecutive adults each with a 12-lead ECG and echocardiogram performed within 2 weeks of one another. The echocardiogram provided the gold standard for classifying LAE (ie, LAD in the parasternal long-axis view). A convolutional neural network (CNN) was trained using 1 of 4 different 10-second ECG data sets—12-lead P-QRS-T waves, 12-lead segmented P-waves, and single-lead I or V2 P-QRS-T waves—to discriminate LAD ranging in size from 38 to 50 mm (ECGs for training 35%, internal validation 15%, testing 50%). External validation was performed using a separate cohort of 91,425 patient ECGs from another hospital. Cardiovascular events associated with LAE ≥ 50 mm were assessed from administrative International Classification of Diseases codes. The main study finding was that all 4 deep learning ECG models performed similarly in classifying LAE, with performance improving as LAE cutoff points increased from 38 mm (receiver operating characteristic (ROC) area under curve (AUC) 0.74-0.79) to 50 mm (ROC AUCs 0.86-0.88). The models performed equally well with external validation (ROC AUCs 0.85-0.88). When all 4 deep learning ECG models were developed separately on the 338,264 sinus rhythm ECGs vs 60,766 non–sinus rhythm ECGs (eg, AF, flutter, frequent premature atrial beats), model performance was better for the sinus rhythm ECGs in detecting LAE ≥ 50 mm (ROC AUCs 0.85-0.73). With deep learning ECG–detected LAE ≥ 50 mm, the hazard ratios (HRs) for predicting new-onset AF (HR 9.7-10.1), heart failure (HR 4.6-5.3), stroke (HR 1.5-1.6) and acute myocardial infarction (HR 1.7-2.1) over 1-year follow-up was similar with all 4 ECG models. These HRs were higher than those for echocardiography-detected LAE ≥ 50 mm, due in part to a lower predicted event rate among patients classified as LAE < 50 mm using the CNN ECG models. Overall, this is a well designed, rigourous study with novel findings that build on the literature of deep learning–enabled ECG analysis for diagnostic screening and prognostication. The selection of the database used for training significantly affects the quality and generalisability of machine learning models. Here, a large, diverse real-world ECG database was used that included sinus, nonsinus and even noisy ECGs unique to each patient. In addition, the model was externally validated to improve generalisability. The optimal ECG input format for deep neural network training remains an open question, but the signal-based ECG used in this study permitted single lead training without ECG grid interference, and this may offer higher fidelity than 2-dimensional ECG images. The CNN architecture itself is well described and necessary for the multidimensional ECG waveform analysis that would not be possible with supervised machine learning. With CNN-assisted diagnosis comes the concern of whether the output can be trusted and interpreted by the clinician and end-user. Chou et al.6Siontis K.C. Noseworthy P.A. Attia Z.I. Friedman P.A. Artificial intelligence-enhanced electrocardiography in cardiovascular disease management.Nate Rev. 2021; 18: 465-478Google Scholar demonstrate better performance with sinus rhythm ECGs than non–sinus rhythm ECGs, suggesting that the P-wave is relevant in their CNN-based classification of LAE ≥ 50 mm. Among all ECGs, the 12-lead P-QRS-T waves only marginally improved model performance compared with the 12-lead segmented P waves, further supporting the relevance of the P-wave in model classification. Before considering the clinical relevance and application of this study, it is important to understand its shortcomings. Foremost, although a large number of ECGs were included with random allocation to training, validation, and testing, significant class imbalance exists in these cohorts because only 5% of all ECGs were from patients with LAE ≥ 50 mm. Standard classification metrics such as ROC AUC do not adequately quantify model performance when data sets are imbalanced. Instead, the precision-recall AUC and F1 score (ie, harmonic mean of precision and recall) are more accurate,8Saito T. Rehmsmeier M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets.PLoS One. 2015; 10e0118432Crossref Scopus (2226) Google Scholar and in this study both were low (F1 score 0.27, precision-recall AUC 0.31) indicating poor performance in the internal testing and external validation cohorts. Data augmentation strategies9Perez L, Wang J. The effectiveness of data augmentation in image classification using deep learning [preprint; December 13, 2017]. arXiv:1712.04621.Google Scholar can help to improve class imbalance, but these were not considered. Data augmentation increases the number of cases by adding slightly modified copies of already existing data or newly created synthetic data from existing data. This approach was used by Liao et al.10Liao S. Bokhari M. Chakraborty P. et al.Use of wearable technology and deep learning to improve the diagnosis of Brugada syndrome.JACC Clin Electrophysiol. 2022; 8: 1010-1020Crossref PubMed Scopus (9) Google Scholar when developing a CNN model to classify the type 1 Brugada pattern from 12-lead Holters because only a limited number of cases were available for model training. Another important consideration is selecting the optimal deep learning architecture for training and inference. Although CNNs have dominated the field until now, recent advancements in attention-based generative pretrained transformers and vision transformers offer promising avenues for improving machine learning–enhanced ECG classification.11Pan S.J. Yang Q. A survey on transfer learning.IEEE Trans Knowl Data Eng. 2010; 22: 1345-1359Crossref Scopus (16223) Google Scholar,12van Steenkiste G. van Loon G. Crevecoeur G. Transfer learning in ECG classification from human to horse using a novel parallel neural network architecture.Sci Rep. 2020; 10: 186Crossref PubMed Scopus (38) Google Scholar Pretraining transformer models on large unlabelled data sets may enable the creation of specialised models when there are limited cases or rare diseases. Using a similar approach with transfer learning, Liu et al.13Liu C.M. Liu C.L. Hu K.W. et al.A deep learning–enabled electrocardiogram model for the identification of a rare inherited arrhythmia: Brugada syndrome.Can J Cardiol. 2022; 38: 152-159Abstract Full Text Full Text PDF PubMed Scopus (23) Google Scholar trained a CNN to accurately classify the type 1 Brugada pattern from 12-lead ECGs. Their model parameters from a source model trained on 12-lead ECGs of right bundle branch block were transferred to a target model trained on type 1 Brugada ECGs to enhance its performance. Besides model development, the study has limitations in how clinical metrics and outcomes were evaluated. Although a large real-world ECG database was collected, there may still be referral bias among patients undergoing both ECG and echocardiography within 2 weeks. As such, the performance of the model in an unselected population for LAE screening may be quite different, and class imbalance would become even more significant. Second, detecting LAE based on LAD is far less sensitive than using LA volume, which is now more commonly reported. Third, LAE often does not exist in isolation, but rather coexists with structural heart disease (eg, mitral regurgitation, cardiomyopathy, left ventricular hypertrophy) and clinical conditions (eg, congestive heart failure, AF, hypertension). Without adjusting for these confounders, it is unknown whether LAE independently predicts cardiovascular events or is simply a marker of underlying disease that is driving adverse outcomes. Finally, the interpretation of the higher hazard ratio for cardiovascular events with the CNN ECG model–detected LAE ≥ 50 mm compared with echocardiography-detected LAE ≥ 50 mm is complex and needs clarification. The CNN ECG models classified 5- to 6-fold more patients with LAE > 50 mm compared with echocardiography, indicating many false positives. However, the cardiovascular event rate with CNN ECG–detected LAE < 50 mm was lower (high negative predictive value of 0.96) compared with echocardiography. This difference contributed to the higher hazard ratio with the CNN ECG model–detected LAE ≥ 50 mm, especially for AF and heart failure. Notwithstanding these limitations, the study findings provide a framework for future development in this field that may ultimately pave the way for ECG-enabled LAE screening and prognostication. Interestingly, the performance of the single leads I and V2 ECG models (accuracy ∼ 77%, ROC AUC ∼ 0.85) in discriminating LAE ≥ 50 mm was similar to that of the 12-lead ECG model (accuracy 78%, ROC AUC 0.88). However, single-lead recordings, for example, from wearables, may not be relevant in detecting LAE, unlike arrhythmias, and may compromise ECG signal quality. Model performance also did not improve significantly with inclusion of the QRS-T complex. In contrast, the machine learning–enabled single-lead II ECG model of Hsu et al.14Hsu Cy Liu P.Y. Liu S.H. et al.Machine learning for electrocardiographic features to identify left atrial enlargement in young adults: CHIEF Heart study.Front Cardiovasc Med. 2022; 9840585Crossref Scopus (10) Google Scholar identified LAE > 40 mm based on features from the QRS complex rather than P-wave. It is unexpected that a full 12-lead P-QRS-T disclosure was not more discriminatory for LAE compared with the single-lead P-wave. The QRS-T complex would be influenced by ventricular pathology (eg, left ventricular hypertrophy, cardiomyopathy), which is an important cause of LAE. Furthermore, P-wave morphology would be affected by LA fibrosis burden independently from LA volume, leading to changes in P-wave morphology in multiple leads. As such, deep learning–enabled 12-lead P-QRS-T ECG models may hold the greatest promise in discriminating LAE. The premise of this study is that deep learning ECG models will detect severe LAE in a relatively asymptomatic population without atrial arrhythmias or cardiovascular disease to inform early work-up and treatment. Although this is a rational premise, it may not be clinically relevant because the prevalence of this population is very low (∼ 3%), and it is uncertain whether early intervention will actually change cardiovascular outcomes. Identifying a cohort with a higher pretest probability of severe LAE, such as those with more cardiovascular risk factors, may improve diagnostic yield. Ultimately, echocardiography will still be required to confirm deep learning–ECG detection of severe LAE. Therefore, minimising false positives with appropriate patient selection and a robust deep learning model are essential for cost-effective screening. Chou et al.6Siontis K.C. Noseworthy P.A. Attia Z.I. Friedman P.A. Artificial intelligence-enhanced electrocardiography in cardiovascular disease management.Nate Rev. 2021; 18: 465-478Google Scholar are to be commended for their novel application of deep learning–enabled ECG analysis with inclusion of the P-wave to detect severe LAE. Future studies in diverse populations will be needed to confirm deep learning model performance and the patient populations best served by this screening tool. Although in its infancy, this point of care approach to characterising abnormal atrial electroanatomic substrate has the potential to enhance the prediction of heart failure, AF, stroke, and even response to AF catheter ablation. Whether this prompts early intervention and changes clinical outcomes remains to be determined.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
daydayup发布了新的文献求助10
1秒前
大模型应助tianruiyi采纳,获得10
1秒前
淡淡盼旋完成签到,获得积分10
1秒前
廖同学完成签到 ,获得积分10
2秒前
巴黎的防发布了新的文献求助10
2秒前
谭语君发布了新的文献求助10
2秒前
669完成签到,获得积分10
3秒前
ww完成签到,获得积分10
3秒前
大模型应助zzx采纳,获得10
4秒前
CipherSage应助乐观的鞋垫采纳,获得10
5秒前
5秒前
赘婿应助十一采纳,获得10
5秒前
大牛完成签到,获得积分10
5秒前
crowd_lpy发布了新的文献求助10
6秒前
追寻啤酒发布了新的文献求助10
6秒前
情怀应助舒庆春采纳,获得10
6秒前
科研通AI2S应助野性的凡蕾采纳,获得10
7秒前
9秒前
含蓄可乐发布了新的文献求助20
9秒前
希望天下0贩的0应助楚楚采纳,获得10
9秒前
10秒前
暴躁的梦岚完成签到,获得积分10
10秒前
可爱的函函应助平常雨泽采纳,获得10
10秒前
10秒前
执着正豪完成签到,获得积分10
11秒前
梅子完成签到,获得积分10
11秒前
nanda完成签到,获得积分0
11秒前
11秒前
11秒前
12秒前
彭于晏应助谭语君采纳,获得10
12秒前
13秒前
salapao完成签到,获得积分10
13秒前
huangjs完成签到,获得积分10
13秒前
阿怪发布了新的文献求助10
14秒前
年轻乘云发布了新的文献求助10
14秒前
along完成签到,获得积分10
14秒前
15秒前
迟大猫应助好难啊采纳,获得10
15秒前
zzx发布了新的文献求助10
15秒前
高分求助中
Continuum Thermodynamics and Material Modelling 3000
Production Logging: Theoretical and Interpretive Elements 2700
Les Mantodea de Guyane Insecta, Polyneoptera 1000
Conference Record, IAS Annual Meeting 1977 820
England and the Discovery of America, 1481-1620 600
Fault identification method of electrical automation distribution equipment in distribution networks based on neural network 560
Teaching language in context (Third edition) by Derewianka, Beverly; Jones, Pauline 550
热门求助领域 (近24小时)
化学 材料科学 生物 医学 工程类 有机化学 生物化学 物理 纳米技术 计算机科学 内科学 化学工程 复合材料 基因 遗传学 物理化学 催化作用 量子力学 光电子学 冶金
热门帖子
关注 科研通微信公众号,转发送积分 3581274
求助须知:如何正确求助?哪些是违规求助? 3150774
关于积分的说明 9484947
捐赠科研通 2852548
什么是DOI,文献DOI怎么找? 1568177
邀请新用户注册赠送积分活动 734562
科研通“疑难数据库(出版商)”最低求助积分说明 720699