Assessing chatbots ability to produce leaflets on cataract surgery: Bing AI, chatGPT 3.5, chatGPT 4o, ChatSonic, Google Bard, Perplexity and Pi

困惑 可读性 误传 医学 传单(植物学) 白内障手术 可靠性(半导体) 人工智能 眼科 计算机科学 语言模型 古生物学 功率(物理) 物理 计算机安全 量子力学 生物 程序设计语言
作者
Dean Thompson,David Thornton,Conor Ramsden
出处
期刊:Journal of Cataract and Refractive Surgery [Ovid Technologies (Wolters Kluwer)]
标识
DOI:10.1097/j.jcrs.0000000000001622
摘要

Purpose: This study aimed to evaluate leaflets on cataract surgery produced by seven common free chatbots. Setting: Usage of conversational artificial intelligence services (chatbots) is becoming more prevalent in all aspects of life, including healthcare. Cataract surgery is the most commonly performed operation in the world, with numbers set to increase. Possible applications for chatbots include information giving and education, allowing clinicians to allocate their time more efficiently. Design: Analysis of answers given by seven chatbots (Bing AI, chatGPT 3.5, chatGPT 4o, ChatSonic, Google Bard, Perplexity and Pi) were prompted to “make a patient information leaflet on cataract surgery”. Methods: Answers were evaluated using the DISCERN instrument, Patient Education Materials Assessment Tool (PEMAT), presence of misinformation, the Flesch-Kincaid Grade Level readability score and material reliability. Results: The highest overall scored response was from ChatSonic, followed by Bing AI and then Perplexity. The lowest scoring was ChatGPT 3.5. ChatSonic achieved the highest DISCERN and PEMAT scores, and had the highest Flesch-Kincaid Grade level. The lowest DISCERN and PEMAT scores were for Pi. Only ChatGPT 3.5 included some misinformation in its response. Bing AI, ChatSonic and Perplexity included reliable references; the other chatbots provided no references. Conclusions: This study demonstrates a range of answers given by chatbots creating a cataract surgery leaflet, suggesting variation in their development and reliability. ChatGPT 3.5 scored the most poorly. However, ChatSonic indicated promise in how technology may be used to assist information giving in ophthalmology.

科研通智能强力驱动
Strongly Powered by AbleSci AI

祝大家在新的一年里科研腾飞
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
任三颜发布了新的文献求助10
刚刚
细腻雨莲发布了新的文献求助10
2秒前
2秒前
妮儿发布了新的文献求助10
3秒前
夏侯德东发布了新的文献求助10
4秒前
胖胖完成签到 ,获得积分0
4秒前
领导范儿应助必胜客采纳,获得30
4秒前
田様应助秦艽采纳,获得10
4秒前
脑洞疼应助深秋远塞采纳,获得10
5秒前
木木杨完成签到,获得积分10
7秒前
8秒前
任三颜完成签到,获得积分20
9秒前
11秒前
充电宝应助李明采纳,获得10
11秒前
12秒前
雏菊完成签到,获得积分10
12秒前
领导范儿应助Clash采纳,获得10
13秒前
13秒前
pp发布了新的文献求助30
14秒前
ouyangshi发布了新的文献求助10
15秒前
布坎南完成签到,获得积分20
16秒前
17秒前
mavissss发布了新的文献求助10
18秒前
18秒前
科目三应助放手一搏采纳,获得10
19秒前
xiuxue424应助1874采纳,获得30
21秒前
21秒前
汉堡包应助安详凡采纳,获得10
23秒前
mountawind完成签到,获得积分10
24秒前
英勇香氛发布了新的文献求助10
25秒前
陈昇发布了新的文献求助10
25秒前
MchemG应助snowskating采纳,获得10
26秒前
细腻雨莲完成签到,获得积分20
26秒前
王萱发布了新的文献求助10
26秒前
27秒前
思源应助动人的科研采纳,获得10
29秒前
weinaonao发布了新的文献求助10
30秒前
miaomiao_ma发布了新的文献求助10
30秒前
布坎南关注了科研通微信公众号
31秒前
江峰发布了新的文献求助10
31秒前
高分求助中
Востребованный временем 2500
The Three Stars Each: The Astrolabes and Related Texts 1500
Very-high-order BVD Schemes Using β-variable THINC Method 990
Les Mantodea de Guyane 800
Mantids of the euro-mediterranean area 700
Field Guide to Insects of South Africa 660
Mantodea of the World: Species Catalog 500
热门求助领域 (近24小时)
化学 医学 生物 材料科学 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 物理化学 催化作用 细胞生物学 免疫学 冶金
热门帖子
关注 科研通微信公众号,转发送积分 3396849
求助须知:如何正确求助?哪些是违规求助? 3006346
关于积分的说明 8820631
捐赠科研通 2693370
什么是DOI,文献DOI怎么找? 1475345
科研通“疑难数据库(出版商)”最低求助积分说明 682396
邀请新用户注册赠送积分活动 675680