Accuracy of ChatGPT‐Generated Information on Head and Neck and Oromaxillofacial Surgery: A Multicenter Collaborative Analysis

四分位间距 观察研究 医学物理学 完备性(序理论) 头颈部 医学 外科 计算机科学 人工智能 内科学 数学 数学分析
作者
Luigi Angelo Vaira,Jérôme R. Lechien,Vincenzo Abbate,Fabiana Allevi,Giovanni Audino,Giada Anna Beltramini,Michela Bergonzani,Alessandro Bolzoni,Umberto Committeri,Salvatore Crimi,Guido Gabriele,F. Lonardi,Fabio Maglitto,Marzia Petrocelli,Resi Pucci,Gianmarco Saponaro,Alessandro Tel,Valentino Vellone,Carlos M. Chiesa‐Estomba,Paolo Boscolo‐Rizzo
出处
期刊:Otolaryngology-Head and Neck Surgery [Wiley]
卷期号:170 (6): 1492-1503 被引量:63
标识
DOI:10.1002/ohn.489
摘要

Abstract Objective To investigate the accuracy of Chat‐Based Generative Pre‐trained Transformer (ChatGPT) in answering questions and solving clinical scenarios of head and neck surgery. Study Design Observational and valuative study. Setting Eighteen surgeons from 14 Italian head and neck surgery units. Methods A total of 144 clinical questions encompassing different subspecialities of head and neck surgery and 15 comprehensive clinical scenarios were developed. Questions and scenarios were inputted into ChatGPT4, and the resulting answers were evaluated by the researchers using accuracy (range 1‐6), completeness (range 1‐3), and references' quality Likert scales. Results The overall median score of open‐ended questions was 6 (interquartile range[IQR]: 5‐6) for accuracy and 3 (IQR: 2‐3) for completeness. Overall, the reviewers rated the answer as entirely or nearly entirely correct in 87.2% of cases and as comprehensive and covering all aspects of the question in 73% of cases. The artificial intelligence (AI) model achieved a correct response in 84.7% of the closed‐ended questions (11 wrong answers). As for the clinical scenarios, ChatGPT provided a fully or nearly fully correct diagnosis in 81.7% of cases. The proposed diagnostic or therapeutic procedure was judged to be complete in 56.7% of cases. The overall quality of the bibliographic references was poor, and sources were nonexistent in 46.4% of the cases. Conclusion The results generally demonstrate a good level of accuracy in the AI's answers. The AI's ability to resolve complex clinical scenarios is promising, but it still falls short of being considered a reliable support for the decision‐making process of specialists in head‐neck surgery.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
hiliang发布了新的文献求助10
1秒前
孟一完成签到,获得积分10
3秒前
英姑应助一椰包富采纳,获得10
3秒前
英姑应助小马采纳,获得10
4秒前
4秒前
TTRO完成签到,获得积分10
5秒前
5秒前
hushiyu发布了新的文献求助10
5秒前
张杰完成签到,获得积分10
6秒前
西西完成签到 ,获得积分10
7秒前
8秒前
8秒前
王致远发布了新的文献求助10
8秒前
开朗书本完成签到,获得积分10
9秒前
9秒前
10秒前
abou完成签到 ,获得积分10
13秒前
13秒前
华仔应助jjsun采纳,获得30
15秒前
Ricky完成签到,获得积分10
15秒前
16秒前
曾礽发布了新的文献求助10
17秒前
生动的小白菜完成签到,获得积分10
17秒前
18秒前
tree完成签到,获得积分10
18秒前
paxjustitia完成签到,获得积分10
19秒前
20秒前
20秒前
阿狸完成签到,获得积分10
20秒前
hu发布了新的文献求助10
20秒前
zzk完成签到,获得积分10
21秒前
Alicexpp发布了新的文献求助30
23秒前
小马发布了新的文献求助10
23秒前
打打应助曾礽采纳,获得10
23秒前
24秒前
24秒前
左丘以云完成签到,获得积分10
25秒前
风趣的元气完成签到,获得积分10
26秒前
阿找找发布了新的文献求助10
26秒前
研友_LJGpan完成签到,获得积分10
27秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Salmon nasal cartilage-derived proteoglycan complexes influence the gut microbiota and bacterial metabolites in mice 2000
The Composition and Relative Chronology of Dynasties 16 and 17 in Egypt 1500
Picture this! Including first nations fiction picture books in school library collections 1500
ON THE THEORY OF BIRATIONAL BLOWING-UP 666
Signals, Systems, and Signal Processing 610
The Impostor Phenomenon: When Success Makes You Feel Like a Fake 600
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6377894
求助须知:如何正确求助?哪些是违规求助? 8190899
关于积分的说明 17303573
捐赠科研通 5431423
什么是DOI,文献DOI怎么找? 2873458
邀请新用户注册赠送积分活动 1850143
关于科研通互助平台的介绍 1695451