Generative artificial intelligence chatbots may provide appropriate informational responses to common vascular surgery questions by patients

医学 可读性 血管外科 外科 心脏外科 语言学 哲学
作者
Ethan Chervonski,Keerthi Harish,Caron Rockman,Mikel Sadek,Katherine Teter,Glenn R. Jacobowitz,Todd Berland,Joann M. Lohr,Colleen M. Moore,Thomas S. Maldonado
出处
期刊:Vascular [SAGE]
被引量:8
标识
DOI:10.1177/17085381241240550
摘要

Objectives Generative artificial intelligence (AI) has emerged as a promising tool to engage with patients. The objective of this study was to assess the quality of AI responses to common patient questions regarding vascular surgery disease processes. Methods OpenAI’s ChatGPT-3.5 and Google Bard were queried with 24 mock patient questions spanning seven vascular surgery disease domains. Six experienced vascular surgery faculty at a tertiary academic center independently graded AI responses on their accuracy (rated 1–4 from completely inaccurate to completely accurate), completeness (rated 1–4 from totally incomplete to totally complete), and appropriateness (binary). Responses were also evaluated with three readability scales. Results ChatGPT responses were rated, on average, more accurate than Bard responses (3.08 ± 0.33 vs 2.82 ± 0.40, p < .01). ChatGPT responses were scored, on average, more complete than Bard responses (2.98 ± 0.34 vs 2.62 ± 0.36, p < .01). Most ChatGPT responses (75.0%, n = 18) and almost half of Bard responses (45.8%, n = 11) were unanimously deemed appropriate. Almost one-third of Bard responses (29.2%, n = 7) were deemed inappropriate by at least two reviewers (29.2%), and two Bard responses (8.4%) were considered inappropriate by the majority. The mean Flesch Reading Ease, Flesch–Kincaid Grade Level, and Gunning Fog Index of ChatGPT responses were 29.4 ± 10.8, 14.5 ± 2.2, and 17.7 ± 3.1, respectively, indicating that responses were readable with a post-secondary education. Bard’s mean readability scores were 58.9 ± 10.5, 8.2 ± 1.7, and 11.0 ± 2.0, respectively, indicating that responses were readable with a high-school education ( p < .0001 for three metrics). ChatGPT’s mean response length (332 ± 79 words) was higher than Bard’s mean response length (183 ± 53 words, p < .001). There was no difference in the accuracy, completeness, readability, or response length of ChatGPT or Bard between disease domains ( p > .05 for all analyses). Conclusions AI offers a novel means of educating patients that avoids the inundation of information from “Dr Google” and the time barriers of physician-patient encounters. ChatGPT provides largely valid, though imperfect, responses to myriad patient questions at the expense of readability. While Bard responses are more readable and concise, their quality is poorer. Further research is warranted to better understand failure points for large language models in vascular surgery patient education.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
奋斗小真发布了新的文献求助10
刚刚
刚刚
1秒前
1秒前
大个应助CHSLN采纳,获得10
1秒前
1秒前
166完成签到 ,获得积分10
2秒前
2秒前
zwying完成签到,获得积分10
2秒前
Xantareas完成签到,获得积分10
3秒前
Alex发布了新的文献求助10
3秒前
CodeCraft应助再见不难采纳,获得10
3秒前
3秒前
4秒前
4秒前
5秒前
Geisha发布了新的文献求助10
5秒前
Hello应助dudu采纳,获得10
6秒前
6秒前
hk1900发布了新的文献求助10
6秒前
6秒前
wanci应助More采纳,获得10
6秒前
7秒前
orixero应助高挑的板凳采纳,获得30
7秒前
wendy_1006完成签到 ,获得积分10
7秒前
简单完成签到,获得积分10
8秒前
谢奕完成签到,获得积分10
8秒前
8秒前
阿洁发布了新的文献求助10
8秒前
9秒前
pluto应助jingwei72采纳,获得10
9秒前
9秒前
开放明雪发布了新的文献求助10
10秒前
10秒前
ding应助yuyuyu采纳,获得10
10秒前
zho应助lichaolun采纳,获得10
10秒前
11秒前
11秒前
11秒前
11秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Kinesiophobia : a new view of chronic pain behavior 3000
Molecular Biology of Cancer: Mechanisms, Targets, and Therapeutics 1100
Signals, Systems, and Signal Processing 510
Discrete-Time Signals and Systems 510
Proceedings of the Fourth International Congress of Nematology, 8-13 June 2002, Tenerife, Spain 500
Le genre Cuphophyllus (Donk) st. nov 500
热门求助领域 (近24小时)
化学 材料科学 生物 医学 工程类 计算机科学 有机化学 物理 生物化学 纳米技术 复合材料 内科学 化学工程 人工智能 催化作用 遗传学 数学 基因 量子力学 物理化学
热门帖子
关注 科研通微信公众号,转发送积分 5939433
求助须知:如何正确求助?哪些是违规求助? 7049277
关于积分的说明 15878621
捐赠科研通 5069404
什么是DOI,文献DOI怎么找? 2726650
邀请新用户注册赠送积分活动 1685171
关于科研通互助平台的介绍 1612654