已入深夜,您辛苦了!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!祝你早点完成任务,早点休息,好梦!

Evaluating the Potential of Large Language Models for Vestibular Rehabilitation Education: A Comparison of ChatGPT, Google Gemini, and Clinicians

认证 考试(生物学) 康复 多项选择 临床实习 心理学 前庭康复 医学教育 应用心理学 物理医学与康复 医学 物理疗法 显著性差异 法学 古生物学 内科学 生物 政治学
作者
Yael Arbel,Yoav Gimmon,Liora Shmueli
出处
期刊:Physical therapy [Oxford University Press]
标识
DOI:10.1093/ptj/pzaf010
摘要

Abstract Objective This study aimed to compare the performance of 2 large language models, ChatGPT and Google Gemini, against experienced physical therapists and students in responding to multiple-choice questions related to vestibular rehabilitation. The study further aimed to assess the accuracy of ChatGPT’s responses by board-certified otoneurologists. Methods This study was conducted among 30 physical therapist professionals experienced with vestibular rehabilitation and 30 physical therapist students. They were asked to complete a vestibular knowledge test (VKT) consisting of 20 multiple-choice questions that were divided into 3 categories: (1) Clinical Knowledge, (2) Basic Clinical Practice, and (3) Clinical Reasoning. ChatGPT and Google Gemini were tasked with answering the same 20 VKT questions. Three board-certified otoneurologists independently evaluated the accuracy of each response using a 4-level scale, ranging from comprehensive to completely incorrect. Results ChatGPT outperformed Google Gemini with a 70% score on the VKT test, while Gemini scored 60%. Both excelled in Clinical Knowledge scoring 100% but struggled in Clinical Reasoning with ChatGPT scoring 50% and Gemini scoring 25%. According to 3 otoneurologic experts, ChatGPT’s accuracy was considered “comprehensive” in 45% of the 20 questions, while 25% were found to be completely incorrect. ChatGPT provided “comprehensive” responses in 50% of Clinical Knowledge and Basic Clinical Practice questions, but only 25% in Clinical Reasoning. Conclusion Caution is advised when using ChatGPT and Google Gemini due to their limited accuracy in clinical reasoning. While they provide accurate responses concerning Clinical Knowledge, their reliance on web information may lead to inconsistencies. ChatGPT performed better than Gemini. Health care professionals should carefully formulate questions and be aware of the potential influence of the online prevalence of information on ChatGPT’s and Google Gemini’s responses. Combining clinical expertise and clinical guidelines with ChatGPT and Google Gemini can maximize benefits while mitigating limitations. The results are based on current models of ChatGPT3.5 and Google Gemini. Future iterations of these models are expected to offer improved accuracy as the underlying modeling and algorithms are further refined. Impact This study highlights the potential utility of large language models like ChatGPT in supplementing clinical knowledge for physical therapists, while underscoring the need for caution in domains requiring complex clinical reasoning. The findings emphasize the importance of integrating technological tools carefully with human expertise to enhance patient care and rehabilitation outcomes.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
的微博发布了新的文献求助30
1秒前
ssau发布了新的文献求助10
1秒前
1秒前
2秒前
KUYAA完成签到 ,获得积分10
4秒前
薛微有点甜完成签到 ,获得积分10
4秒前
可爱的函函应助细胞色素采纳,获得10
6秒前
在水一方应助Stella采纳,获得10
8秒前
善学以致用应助南吕采纳,获得50
9秒前
瘦瘦牛排完成签到 ,获得积分10
13秒前
艺二叁完成签到,获得积分10
13秒前
ding应助ssau采纳,获得10
13秒前
15秒前
wanci应助wxy采纳,获得10
15秒前
Qifan完成签到 ,获得积分10
15秒前
FashionBoy应助周哥来学习采纳,获得10
17秒前
李健应助鱼儿游采纳,获得10
19秒前
yk发布了新的文献求助10
20秒前
天天快乐应助文刀大可采纳,获得10
20秒前
领导范儿应助科研通管家采纳,获得10
21秒前
小蘑菇应助科研通管家采纳,获得10
21秒前
21秒前
今后应助科研通管家采纳,获得10
21秒前
大模型应助科研通管家采纳,获得10
21秒前
Lucas应助科研通管家采纳,获得10
21秒前
慕青应助自然的亦巧采纳,获得10
24秒前
24秒前
26秒前
29秒前
31秒前
31秒前
33秒前
若月画萤完成签到,获得积分10
34秒前
34秒前
波鲁鲁爱喝酸奶完成签到 ,获得积分10
36秒前
37秒前
37秒前
罐装发布了新的文献求助10
37秒前
奋斗机器猫完成签到 ,获得积分10
41秒前
小小咸鱼发布了新的文献求助10
42秒前
高分求助中
All the Birds of the World 3000
Weirder than Sci-fi: Speculative Practice in Art and Finance 960
IZELTABART TAPATANSINE 500
Introduction to Comparative Public Administration: Administrative Systems and Reforms in Europe: Second Edition 2nd Edition 300
Spontaneous closure of a dural arteriovenous malformation 300
GNSS Applications in Earth and Space Observations 300
Not Equal : Towards an International Law of Finance 260
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3725103
求助须知:如何正确求助?哪些是违规求助? 3270217
关于积分的说明 9964981
捐赠科研通 2985104
什么是DOI,文献DOI怎么找? 1637795
邀请新用户注册赠送积分活动 777716
科研通“疑难数据库(出版商)”最低求助积分说明 747164