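Chatbot
Guideline
Context (archaeology)
Computer science
Wilcoxon signed-rank test
Test (biology)
Cone-beam CT
Medical physics
Medicine
Artificial intelligence
Radiology
Computed tomography
Pathology
Internal medicine
Paleontology
Biology
Mann-Whitney U test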
Authors
Maximilian Russe, Alexander Rau, Michael Andreas Ermer, René Rothweiler, Sina Wenger, Klara Klöble, Ralf Schulze, Fabian Bamberg, Rainer Schmelzeisen, Marco Reisert, Wiebke Semper-Hogg
Source
Journal: Dentomaxillofacial Radiology
[British Institute of Radiology]
Date: 2024-01-05
Volume (issue), pages: 53 (2): 109-114
Citations: 3
Abstract
Objectives: To develop a content-aware chatbot based on GPT-3.5-Turbo and GPT-4 with specialized knowledge on the German S2 Cone-Beam CT (CBCT) dental imaging guideline and to compare the performance against humans.
Methods: The LlamaIndex software library was used to integrate the guideline context into the chatbots. Based on the CBCT S2 guideline, 40 questions were posed to the content-aware chatbots, and early career and senior practitioners with different levels of experience served as reference. The chatbots' performance was compared in terms of recommendation accuracy and explanation quality. The chi-square test and the one-tailed Wilcoxon signed-rank test were used to evaluate accuracy and explanation quality, respectively.
Results: The GPT-4 based chatbot provided 100% correct recommendations and superior explanation quality compared to the one based on GPT-3.5-Turbo (87.5% vs. 57.5% for GPT-3.5-Turbo; P = .003). Moreover, it outperformed early career practitioners in correct answers (P = .002 and P = .032) and earned higher trust than the chatbot using GPT-3.5-Turbo (P = .006).
Conclusions: A content-aware chatbot using GPT-4 reliably provided recommendations according to current consensus guidelines. The responses were deemed trustworthy and transparent, and therefore facilitate the integration of artificial intelligence into clinical decision-making.