“Dr. AI Will See You Now”: How Do ChatGPT-4 Treatment Recommendations Align With Orthopaedic Clinical Practice Guidelines?

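Medicine, Physical therapy, Osteoarthritis, Concordance, Clinical practice, Orthopaedic surgery, Sports medicine, MEDLINE, Clarity, Surgery, Alternative medicine, Pathology, Internal medicine, Biochemistry, Chemistry, Political science, Law
Authors
Tanios Dagher, Emma Dwyer, Hayden P. Baker, Senthooran Kalidoss, Jason Strelzow
Source
Journal: Clinical Orthopaedics and Related Research [Ovid Technologies (Wolters Kluwer)]
Identifier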
DOI:10.1097/corr.0000000000003234
Abstract

Background: Artificial intelligence (AI) is engineered to emulate tasks that have historically required human interaction and intellect, including learning, pattern recognition, decision-making, and problem-solving. Although AI models like ChatGPT-4 have demonstrated satisfactory performance on medical licensing exams, suggesting a potential for supporting medical diagnostics and decision-making, no study of which we are aware has evaluated the ability of these tools to make treatment recommendations when given clinical vignettes and representative medical imaging of common orthopaedic conditions. As AI continues to advance, a thorough understanding of its strengths and limitations is necessary to inform safe and helpful integration into medical practice.

Questions/purposes: (1) What is the concordance of ChatGPT-4-generated treatment recommendations for common orthopaedic conditions with both the American Academy of Orthopaedic Surgeons (AAOS) clinical practice guidelines (CPGs) and an orthopaedic attending physician’s treatment plan? (2) In what specific areas do the ChatGPT-4-generated treatment recommendations diverge from the AAOS CPGs?

Methods: Ten common orthopaedic conditions with associated AAOS CPGs were identified: carpal tunnel syndrome, distal radius fracture, glenohumeral joint osteoarthritis, rotator cuff injury, clavicle fracture, hip fracture, hip osteoarthritis, knee osteoarthritis, ACL injury, and acute Achilles rupture. For each condition, the medical records of 10 deidentified patients managed at our facility were used to construct clinical vignettes that each had an isolated, single diagnosis with adequate clarity. The vignettes also encompassed a range of diagnostic severity so that adherence to the treatment guidelines outlined by the AAOS could be evaluated more thoroughly. These clinical vignettes were presented alongside representative radiographic imaging. The model was prompted to provide a single treatment plan recommendation. Each treatment plan was compared with established AAOS CPGs and with the treatment plan documented by the attending orthopaedic surgeon treating the specific patient. Vignettes in which ChatGPT-4 recommendations diverged from CPGs were reviewed to identify and summarize patterns of error.

Results: ChatGPT-4 provided treatment recommendations in accordance with the AAOS CPGs in 90% (90 of 100) of clinical vignettes. Concordance between ChatGPT-4-generated plans and the plan recommended by the treating orthopaedic attending physician was 78% (78 of 100). One hundred percent (30 of 30) of ChatGPT-4 recommendations for fracture vignettes and hip and knee arthritis vignettes matched with CPG recommendations, whereas the model struggled most with recommendations for carpal tunnel syndrome (3 of 10 instances demonstrated discordance). ChatGPT-4 recommendations diverged from AAOS CPGs for three carpal tunnel syndrome vignettes; two vignettes each for ACL injury, rotator cuff injury, and glenohumeral joint osteoarthritis; and one acute Achilles rupture vignette. In these situations, ChatGPT-4 most often struggled to correctly interpret injury severity and progression, incorporate patient factors (such as lifestyle or comorbidities) into decision-making, and recognize a contraindication to surgery.

Conclusion: ChatGPT-4 can generate accurate treatment plans aligned with CPGs but can also make mistakes when it is required to integrate multiple patient factors into decision-making and understand disease severity and progression. Physicians must critically assess the full clinical picture when using AI tools to support their decision-making.

Clinical Relevance: ChatGPT-4 may be used as an on-demand diagnostic companion, but patient-centered decision-making should remain in the hands of the physician.
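
The CPG concordance figures in the Results can be re-tallied from the per-condition counts given in the abstract. The Python sketch below is illustrative only and is not the authors' analysis code. It assumes 10 vignettes per condition (as stated in the Methods) and takes the discordance counts from the Results; the zero entries rest on the statement that all fracture and hip and knee arthritis recommendations matched the CPGs. The variable and function names are hypothetical.

```python
# Vignettes per condition, as stated in the Methods.
VIGNETTES_PER_CONDITION = 10

# Recommendations that diverged from the AAOS CPGs, per condition,
# as reported in the Results (0 = all recommendations matched).
discordant = {
    "carpal tunnel syndrome": 3,
    "distal radius fracture": 0,
    "glenohumeral joint osteoarthritis": 2,
    "rotator cuff injury": 2,
    "clavicle fracture": 0,
    "hip fracture": 0,
    "hip osteoarthritis": 0,
    "knee osteoarthritis": 0,
    "ACL injury": 2,
    "acute Achilles rupture": 1,
}


def concordance(discordant_count, total=VIGNETTES_PER_CONDITION):
    """Fraction of vignettes whose recommendation matched the guideline."""
    return (total - discordant_count) / total


per_condition = {name: concordance(n) for name, n in discordant.items()}

total_vignettes = VIGNETTES_PER_CONDITION * len(discordant)
total_concordant = total_vignettes - sum(discordant.values())
overall = total_concordant / total_vignettes

for name, rate in per_condition.items():
    print(f"{name}: {rate:.0%}")
print(f"overall: {total_concordant} of {total_vignettes} ({overall:.0%})")
```

The overall tally reproduces the reported 90 of 100. The 78% concordance with the attending physician's plan cannot be broken down the same way, because the abstract does not report it per condition.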
