医学
梅德林
样本量测定
指南
样品(材料)
临床决策支持系统
决策支持系统
医学物理学
人工智能
统计
计算机科学
病理
数学
政治学
法学
化学
色谱法
作者
Tyler J. Loftus,Maria S. Altieri,Jeremy A. Balch,Kenneth L. Abbott,Jeff Choi,Jayson S. Marwaha,Daniel A. Hashimoto,Gabriel A. Brat,Yannis Raftopoulos,Heather L. Evans,Gretchen Purcell Jackson,Danielle S. Walsh,Christopher J. Tignanelli
出处
期刊:Annals of Surgery
[Ovid Technologies (Wolters Kluwer)]
日期:2023-03-21
卷期号:278 (1): 51-58
被引量:19
标识
DOI:10.1097/sla.0000000000005853
摘要
Objective: To summarize state-of-the-art artificial intelligence–enabled decision support in surgery and to quantify deficiencies in scientific rigor and reporting. Background: To positively affect surgical care, decision-support models must exceed current reporting guideline requirements by performing external and real-time validation, enrolling adequate sample sizes, reporting model precision, assessing performance across vulnerable populations, and achieving clinical implementation; the degree to which published models meet these criteria is unknown. Methods: Embase, PubMed, and MEDLINE databases were searched from their inception to September 21, 2022 for articles describing artificial intelligence–enabled decision support in surgery that uses preoperative or intraoperative data elements to predict complications within 90 days of surgery. Scientific rigor and reporting criteria were assessed and reported according to Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews guidelines. Results: Sample size ranged from 163–2,882,526, with 8/36 articles (22.2%) featuring sample sizes of less than 2000; 7 of these 8 articles (87.5%) had below-average (<0.83) area under the receiver operating characteristic or accuracy. Overall, 29 articles (80.6%) performed internal validation only, 5 (13.8%) performed external validation, and 2 (5.6%) performed real-time validation. Twenty-three articles (63.9%) reported precision. No articles reported performance across sociodemographic categories. Thirteen articles (36.1%) presented a framework that could be used for clinical implementation; none assessed clinical implementation efficacy. Conclusions: Artificial intelligence–enabled decision support in surgery is limited by reliance on internal validation, small sample sizes that risk overfitting and sacrifice predictive performance, and failure to report confidence intervals, precision, equity analyses, and clinical implementation. Researchers should strive to improve scientific quality.
科研通智能强力驱动
Strongly Powered by AbleSci AI