Medicine
Readability
Kidney stones
Medical physics
Urology
Linguistics
Internal medicine
Urinary system
Philosophy
Authors
Abdulghafour Halawani,Alec Mitchell,Mohammadali Saffarzadeh,Victor K. Wong,Ben H. Chew,Connor M. Forbes
Source
Journal: Urology
[Elsevier]
Date: 2024-02-01
Citations: 3
Identifier
DOI:10.1016/j.urology.2023.11.042
Abstract
Objective
To compare the readability and accuracy of large language model generated patient information materials (PIMs) to those supplied by the American Urological Association (AUA), Canadian Urological Association (CUA), and European Association of Urology (EAU) for kidney stones.
Methods
PIMs from AUA, CUA, and EAU related to nephrolithiasis were obtained and categorized. The most frequent patient questions related to kidney stones were identified from an internet query and input into GPT-3.5 and GPT-4. PIMs and ChatGPT outputs were assessed for accuracy and readability using previously published indexes. We also assessed changes in ChatGPT outputs when a reading level was specified (grade 6).
Results
Readability scores were better for PIMs from the CUA (Grade level 10-12), AUA (8-10), or EAU (9-11) compared to the chatbot. GPT-3.5 had the worst readability scores at Grade 13-14 and GPT-4 was likewise less readable than urologic organization PIMs with scores of 11-13. While organizational PIMs were deemed to be accurate, the chatbot had high accuracy with minor details omitted. GPT-4 was more accurate in general stone information, dietary and medical management of kidney stones topics in comparison to GPT-3.5, while both models had the same accuracy in the surgical management of nephrolithiasis topics.
Conclusions
Current PIMs from major urologic organizations for kidney stones remain more readable than publicly available GPT outputs, but they are still higher than the reading ability of the general population. Of the available PIMs for kidney stones, those from the AUA are the most readable. Although Chatbot outputs for common kidney stone patient queries have a high degree of accuracy with minor omitted details, it is important for clinicians to understand their strengths and limitations.