亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

AI in the ED: Assessing the efficacy of GPT models vs. physicians in medical score calculation

一致性 卡帕 科恩卡帕 医学 急诊科 急诊医学 急诊分诊台 内科学 机器学习 精神科 计算机科学 哲学 语言学
作者
Gal Ben Haim,Adi Braun,Haggai Eden,Livnat Burshtein,Yiftach Barash,Avinoah Irony,Eyal Klang
出处
期刊:American Journal of Emergency Medicine [Elsevier]
被引量:2
标识
DOI:10.1016/j.ajem.2024.02.016
摘要

Artificial Intelligence (AI) models like GPT-3.5 and GPT-4 have shown promise across various domains but remain underexplored in healthcare. Emergency Departments (ED) rely on established scoring systems, such as NIHSS and HEART score, to guide clinical decision-making. This study aims to evaluate the proficiency of GPT-3.5 and GPT-4 against experienced ED physicians in calculating five commonly used medical scores. This retrospective study analyzed data from 150 patients who visited the ED over one week. Both AI models and two human physicians were tasked with calculating scores for NIH Stroke Scale, Canadian Syncope Risk Score, Alvarado Score for Acute Appendicitis, Canadian CT Head Rule, and HEART Score. Cohen's Kappa statistic and AUC values were used to assess inter-rater agreement and predictive performance, respectively. The highest level of agreement was observed between the human physicians (Kappa = 0.681), while GPT-4 also showed moderate to substantial agreement with them (Kappa values of 0.473 and 0.576). GPT-3.5 had the lowest agreement with human scorers. These results highlight the superior predictive performance of human expertise over the currently available automated systems for this specific medical outcome. Human physicians achieved a higher ROC-AUC on 3 of the 5 scores, but none of the differences were statistically significant. While AI models demonstrated some level of concordance with human expertise, they fell short in emulating the complex clinical judgments that physicians make. The study suggests that current AI models may serve as supplementary tools but are not ready to replace human expertise in high-stakes settings like the ED. Further research is needed to explore the capabilities and limitations of AI in emergency medicine.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1分钟前
orixero应助sunny采纳,获得10
1分钟前
hongxuezhi完成签到,获得积分10
1分钟前
2分钟前
2分钟前
简宁发布了新的文献求助10
2分钟前
李剑鸿完成签到,获得积分10
2分钟前
spark810应助科研通管家采纳,获得10
3分钟前
4分钟前
扣子完成签到,获得积分20
5分钟前
spark810应助科研通管家采纳,获得10
5分钟前
5分钟前
5分钟前
sunny发布了新的文献求助10
5分钟前
5分钟前
5分钟前
liuyamei发布了新的文献求助10
5分钟前
长情半邪发布了新的文献求助10
5分钟前
CC完成签到 ,获得积分10
6分钟前
烟花应助长情半邪采纳,获得10
6分钟前
6分钟前
深情安青应助长情半邪采纳,获得10
6分钟前
Hello应助长情半邪采纳,获得10
6分钟前
毛豆应助长情半邪采纳,获得10
6分钟前
李爱国应助长情半邪采纳,获得10
6分钟前
NexusExplorer应助长情半邪采纳,获得10
6分钟前
所所应助扣子采纳,获得10
6分钟前
三人水明完成签到 ,获得积分10
7分钟前
不想读书发布了新的文献求助10
7分钟前
spark810应助科研通管家采纳,获得10
7分钟前
FashionBoy应助饭饭采纳,获得10
8分钟前
饺子生面包完成签到 ,获得积分10
8分钟前
大然完成签到,获得积分10
8分钟前
赘婿应助研友_nEoDm8采纳,获得10
8分钟前
8分钟前
饭饭发布了新的文献求助10
9分钟前
9分钟前
扣子发布了新的文献求助10
9分钟前
spark810应助科研通管家采纳,获得10
9分钟前
spark810应助科研通管家采纳,获得10
9分钟前
高分求助中
Evolution 2024
中国国际图书贸易总公司40周年纪念文集: 回忆录 2000
Impact of Mitophagy-Related Genes on the Diagnosis and Development of Esophageal Squamous Cell Carcinoma via Single-Cell RNA-seq Analysis and Machine Learning Algorithms 2000
Experimental investigation of the mechanics of explosive welding by means of a liquid analogue 1060
Die Elektra-Partitur von Richard Strauss : ein Lehrbuch für die Technik der dramatischen Komposition 1000
How to Create Beauty: De Lairesse on the Theory and Practice of Making Art 1000
Gerard de Lairesse : an artist between stage and studio 670
热门求助领域 (近24小时)
化学 医学 生物 材料科学 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 催化作用 物理化学 免疫学 量子力学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 3004732
求助须知:如何正确求助?哪些是违规求助? 2664069
关于积分的说明 7219897
捐赠科研通 2300569
什么是DOI,文献DOI怎么找? 1220104
科研通“疑难数据库(出版商)”最低求助积分说明 594570
版权声明 593197