清晨好,您是今天最早来到科研通的研友!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您科研之路漫漫前行!

Accuracy of ChatGPT‐Generated Information on Head and Neck and Oromaxillofacial Surgery: A Multicenter Collaborative Analysis

四分位间距 观察研究 医学物理学 完备性(序理论) 头颈部 医学 外科 计算机科学 人工智能 内科学 数学 数学分析
作者
Luigi Angelo Vaira,Jérôme R. Lechien,Vincenzo Abbate,Fabiana Allevi,Giovanni Audino,Giada Anna Beltramini,Michela Bergonzani,Alessandro Bolzoni,Umberto Committeri,Salvatore Crimi,Guido Gabriele,F. Lonardi,Fabio Maglitto,Marzia Petrocelli,Resi Pucci,Gianmarco Saponaro,Alessandro Tel,Valentino Vellone,Carlos M. Chiesa‐Estomba,Paolo Boscolo‐Rizzo
出处
期刊:Otolaryngology-Head and Neck Surgery [SAGE]
卷期号:170 (6): 1492-1503 被引量:63
标识
DOI:10.1002/ohn.489
摘要

Abstract Objective To investigate the accuracy of Chat‐Based Generative Pre‐trained Transformer (ChatGPT) in answering questions and solving clinical scenarios of head and neck surgery. Study Design Observational and valuative study. Setting Eighteen surgeons from 14 Italian head and neck surgery units. Methods A total of 144 clinical questions encompassing different subspecialities of head and neck surgery and 15 comprehensive clinical scenarios were developed. Questions and scenarios were inputted into ChatGPT4, and the resulting answers were evaluated by the researchers using accuracy (range 1‐6), completeness (range 1‐3), and references' quality Likert scales. Results The overall median score of open‐ended questions was 6 (interquartile range[IQR]: 5‐6) for accuracy and 3 (IQR: 2‐3) for completeness. Overall, the reviewers rated the answer as entirely or nearly entirely correct in 87.2% of cases and as comprehensive and covering all aspects of the question in 73% of cases. The artificial intelligence (AI) model achieved a correct response in 84.7% of the closed‐ended questions (11 wrong answers). As for the clinical scenarios, ChatGPT provided a fully or nearly fully correct diagnosis in 81.7% of cases. The proposed diagnostic or therapeutic procedure was judged to be complete in 56.7% of cases. The overall quality of the bibliographic references was poor, and sources were nonexistent in 46.4% of the cases. Conclusion The results generally demonstrate a good level of accuracy in the AI's answers. The AI's ability to resolve complex clinical scenarios is promising, but it still falls short of being considered a reliable support for the decision‐making process of specialists in head‐neck surgery.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Nan驳回了李爱国应助
26秒前
ChenYX完成签到 ,获得积分10
26秒前
zhang完成签到,获得积分20
50秒前
樱桃猴子应助白华苍松采纳,获得10
53秒前
顺利的小蚂蚁完成签到,获得积分10
1分钟前
1分钟前
1分钟前
鱼太闲发布了新的文献求助10
1分钟前
Guo完成签到 ,获得积分10
1分钟前
小马甲应助鱼太闲采纳,获得10
2分钟前
2分钟前
单薄绮露完成签到,获得积分10
2分钟前
2分钟前
2分钟前
文艺猫咪发布了新的文献求助10
2分钟前
2分钟前
樱桃猴子应助白华苍松采纳,获得10
2分钟前
2分钟前
Nan发布了新的文献求助10
2分钟前
2分钟前
3分钟前
行走完成签到,获得积分10
3分钟前
马马马完成签到 ,获得积分10
3分钟前
3分钟前
小蘑菇应助文艺猫咪采纳,获得10
3分钟前
3分钟前
4分钟前
ChenYX发布了新的文献求助10
4分钟前
Lucas应助白华苍松采纳,获得10
4分钟前
4分钟前
雷九万班完成签到 ,获得积分0
4分钟前
4分钟前
青出于蓝蔡完成签到,获得积分10
4分钟前
4分钟前
5分钟前
Aliceq发布了新的文献求助10
5分钟前
英俊的铭应助Aliceq采纳,获得10
5分钟前
5分钟前
科研通AI5应助ping采纳,获得30
5分钟前
二行完成签到 ,获得积分10
5分钟前
高分求助中
Production Logging: Theoretical and Interpretive Elements 2700
Social media impact on athlete mental health: #RealityCheck 1020
1.3μm GaAs基InAs量子点材料生长及器件应用 1000
Ensartinib (Ensacove) for Non-Small Cell Lung Cancer 1000
Unseen Mendieta: The Unpublished Works of Ana Mendieta 1000
Bacterial collagenases and their clinical applications 800
El viaje de una vida: Memorias de María Lecea 800
热门求助领域 (近24小时)
化学 材料科学 生物 医学 工程类 有机化学 生物化学 物理 纳米技术 计算机科学 内科学 化学工程 复合材料 基因 遗传学 物理化学 催化作用 量子力学 光电子学 冶金
热门帖子
关注 科研通微信公众号,转发送积分 3526577
求助须知:如何正确求助?哪些是违规求助? 3107022
关于积分的说明 9282092
捐赠科研通 2804617
什么是DOI,文献DOI怎么找? 1539534
邀请新用户注册赠送积分活动 716583
科研通“疑难数据库(出版商)”最低求助积分说明 709581