Usefulness and Accuracy of Artificial Intelligence Chatbot Responses to Patient Questions for Neurosurgical Procedures

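Usefulness; Medicine; Readability; Chatbot; Medical physics; Frequently asked questions; Artificial intelligence; Psychology; Medical education; Comprehension; Social psychology; Computer science; Programming language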
Authors
Avi A. Gajjar, Rohit Prem Kumar, Ethan Paliwoda, Cathleen C. Kuo, Samuel Adida, Andrew D. Legarreta, Hansen Deng, Sharath Kumar Anand, D. Kojo Hamilton, Thomas J. Buell, Nitin Agarwal, Peter C. Gerszten, Joseph S. Hudson
Source
Journal: Neurosurgery [Oxford University Press]
Citations: 13
Identifier
DOI:10.1227/neu.0000000000002856
Abstract

BACKGROUND AND OBJECTIVES: The Internet has become a primary source of health information, leading patients to seek answers online before consulting health care providers. This study aims to evaluate the implementation of Chat Generative Pre-Trained Transformer (ChatGPT) in neurosurgery by assessing the accuracy and helpfulness of artificial intelligence (AI)–generated responses to common postsurgical questions.

METHODS: A list of 60 commonly asked questions regarding neurosurgical procedures was developed. ChatGPT-3.0, ChatGPT-3.5, and ChatGPT-4.0 responses to these questions were recorded and graded by numerous practitioners for accuracy and helpfulness. The understandability and actionability of the answers were assessed using the Patient Education Materials Assessment Tool. Readability analysis was conducted using established scales.

RESULTS: A total of 1080 responses were evaluated, equally divided among ChatGPT-3.0, 3.5, and 4.0, each contributing 360 responses. The mean helpfulness score across the 3 subsections was 3.511 ± 0.647 while the accuracy score was 4.165 ± 0.567. The Patient Education Materials Assessment Tool analysis revealed that the AI-generated responses had higher actionability scores than understandability. This indicates that the answers provided practical guidance and recommendations that patients could apply effectively. On the other hand, the mean Flesch Reading Ease score was 33.5, suggesting that the readability level of the responses was relatively complex. The Raygor Readability Estimate scores ranged within the graduate level, with an average score of the 15th grade.

CONCLUSION: The artificial intelligence chatbot's responses, although factually accurate, were not rated highly beneficial, with only marginal differences in perceived helpfulness and accuracy between ChatGPT-3.0 and ChatGPT-3.5 versions. Despite this, the responses from ChatGPT-4.0 showed a notable improvement in understandability, indicating enhanced readability over earlier versions.
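
The abstract reports a mean Flesch Reading Ease score of 33.5 but does not say how the score was computed or which tool was used. As a minimal sketch, the standard published Flesch Reading Ease formula can be written as below, assuming the word, sentence, and syllable counts are already available. The function name and the sample counts are hypothetical and are not taken from the study.

```python
def flesch_reading_ease(total_words: int, total_sentences: int, total_syllables: int) -> float:
    """Standard Flesch Reading Ease formula; higher scores mean easier text."""
    if total_words <= 0 or total_sentences <= 0:
        raise ValueError("word and sentence counts must be positive")
    words_per_sentence = total_words / total_sentences
    syllables_per_word = total_syllables / total_words
    return 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word


# Hypothetical counts for a single response (illustrative only, not study data).
score = flesch_reading_ease(total_words=100, total_sentences=5, total_syllables=150)
print(round(score, 3))
```

On the conventional Flesch scale, scores between 30 and 50 are described as difficult, college-level text, which is consistent with the abstract's reading of 33.5 as relatively complex.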