
How AI Responds to Common Lung Cancer Questions: ChatGPT versus Google Bard

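Medicine, Consistency (knowledge base), Terminology, Lung cancer, Information retrieval, Medical physics, Pathology, Artificial intelligence, Computer science, Linguistics, Philosophy
Authors
Amir Ali Rahsepar, Neda Tavakoli, Grace Hyun J. Kim, Cameron Hassani, Fereidoun Abtin, Arash Bedayat
Source
Journal: Radiology [Radiological Society of North America]
Volume (Issue): 307 (5) Citations: 175
Identifier
DOI: 10.1148/radiol.230922
Abstract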

Background The recent release of large language models (LLMs) for public use, such as ChatGPT and Google Bard, has opened up a multitude of potential benefits as well as challenges.

Purpose To evaluate and compare the accuracy and consistency of responses generated by the publicly available ChatGPT-3.5 and Google Bard to non-expert questions related to lung cancer prevention, screening, and terminology commonly used in radiology reports, based on the recommendations of the Lung Imaging Reporting and Data System (Lung-RADS) v2022 from the American College of Radiology and the Fleischner Society.

Materials and Methods Forty identical questions were created and presented to ChatGPT-3.5 and the Google Bard experimental version, as well as to the Bing and Google search engines, by three different authors of this paper. Each answer was reviewed by two radiologists for accuracy. Responses were scored as correct, partially correct, incorrect, or unanswered. Consistency was also evaluated among the answers. Here, consistency was defined as agreement among the three answers provided by each tool (ChatGPT-3.5, the Google Bard experimental version, Bing, and the Google search engine), regardless of whether the concept conveyed was correct or incorrect. Accuracy across the different tools was evaluated using Stata.

Results ChatGPT-3.5 answered 120 questions with 85 (70.8%) correct, 14 (11.7%) partially correct, and 21 (17.5%) incorrect. Google Bard did not answer 23 (19.1%) questions. Among the 97 questions answered by Google Bard, 62 (51.7%) were correct, 11 (9.2%) were partially correct, and 24 (20%) were incorrect, with percentages calculated out of all 120 questions. Bing answered 120 questions with 74 (61.7%) correct, 13 (10.8%) partially correct, and 33 (27.5%) incorrect. The Google search engine answered 120 questions with 66 (55%) correct, 27 (22.5%) partially correct, and 27 (22.5%) incorrect. ChatGPT-3.5 was more likely than Google Bard to provide a correct or partially correct answer, by approximately 1.5-fold (OR = 1.55, P = 0.004). ChatGPT-3.5 and the Google search engine were more likely to be consistent than Google Bard, by approximately 7-fold and 29-fold, respectively (OR = 6.65, P = 0.002 for ChatGPT-3.5; OR = 28.83, P = 0.002 for the Google search engine).

Conclusion Although ChatGPT-3.5 had higher accuracy than the other tools, none of the tools (ChatGPT, Google Bard, Bing, or the Google search engine) answered all questions correctly and with 100% consistency.
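The percentages in the Results can be rechecked from the reported counts. The sketch below is only an illustrative recalculation, not the authors' Stata analysis. The names `TALLIES` and `percentages` are hypothetical, and the zero "unanswered" counts for ChatGPT-3.5, Bing, and Google are inferred from the statement that each answered all 120 questions.

```python
# Counts as reported in the abstract. Each tool received 40 questions
# from 3 authors, i.e. 120 question instances per tool.
TALLIES = {
    "ChatGPT-3.5": {"correct": 85, "partially correct": 14, "incorrect": 21, "unanswered": 0},
    "Google Bard": {"correct": 62, "partially correct": 11, "incorrect": 24, "unanswered": 23},
    "Bing": {"correct": 74, "partially correct": 13, "incorrect": 33, "unanswered": 0},
    "Google": {"correct": 66, "partially correct": 27, "incorrect": 27, "unanswered": 0},
}


def percentages(counts):
    """Return each category as a percentage of all questions, to one decimal."""
    total = sum(counts.values())
    return {label: round(100 * n / total, 1) for label, n in counts.items()}


for tool, counts in TALLIES.items():
    # Every tool's categories should add up to the 120 questions asked.
    assert sum(counts.values()) == 120
    print(tool, percentages(counts))
```

Rounded to one decimal, 23 of 120 unanswered Google Bard questions comes to 19.2%. The abstract's 19.1% matches the same ratio truncated rather than rounded.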