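Readability
Guideline
Comprehension
Optometry
Medicine
Retina
Disease
Medical physics
Family medicine
Ophthalmology
Computer science
Pathology
Programming language
Authors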
Michael Balas, Efrem D. Mandelcorn, Peng Yan, Edsel Ing, Sean A. Crawford, Parnian Arjmand
Identifier
DOI:10.1016/j.jcjo.2024.06.001
Abstract
Objective: To evaluate how well an artificial intelligence (AI) large language model, ChatGPT (version 4.0), answers questions about common retinal diseases in accordance with the American Academy of Ophthalmology (AAO) Preferred Practice Pattern (PPP) guidelines.
Design: A cross-sectional survey study design was employed to compare ChatGPT's responses with established clinical guidelines.
Participants: The AI's responses were reviewed by a panel of three vitreoretinal specialists.
Methods: To investigate ChatGPT's comprehension of clinical guidelines, we designed 130 questions covering a broad spectrum of topics within 12 AAO PPP domains of retinal disease. These questions were crafted to encompass diagnostic criteria, treatment guidelines, and management strategies, including both medical and surgical aspects of retinal care. A panel of 3 retinal specialists independently evaluated responses on a Likert scale from 1 to 5 based on their relevance, accuracy, and adherence to AAO PPP guidelines. Response readability was evaluated using Flesch Reading Ease and Flesch-Kincaid grade level scores.
Results: ChatGPT achieved an overall average score of 4.9/5.0, suggesting high alignment with the AAO PPP guidelines. Scores varied across domains, with the lowest in the surgical management of disease. The responses had a low reading ease score and required a college-to-graduate level of comprehension. Identified errors were related to diagnostic criteria, treatment options, and methodological procedures.
Conclusion: ChatGPT 4.0 demonstrated significant potential in generating guideline-concordant responses, particularly for common medical retinal diseases. However, its performance slightly decreased in surgical retina, highlighting the ongoing need for clinician input, further model refinement, and improved comprehensibility.
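The abstract does not say which tool was used to compute the readability scores. As a minimal sketch, the two metrics it names can be computed from word, sentence, and syllable counts using the standard published formulas. The function names and the vowel-group syllable counter below are illustrative assumptions, not the authors' method; the syllable heuristic in particular is only approximate.

```python
import re


def flesch_reading_ease(words: int, sentences: int, syllables: int) -> float:
    """Standard Flesch Reading Ease formula; lower scores mean harder text."""
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_kincaid_grade(words: int, sentences: int, syllables: int) -> float:
    """Standard Flesch-Kincaid grade level formula (approximate US school grade)."""
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


def count_syllables(word: str) -> int:
    """Assumed heuristic: one syllable per run of vowels, with a minimum of one."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def text_counts(text: str) -> tuple[int, int, int]:
    """Return (words, sentences, syllables) using simple regex tokenization."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(count_syllables(w) for w in words)
    return len(words), len(sentences), syllables


if __name__ == "__main__":
    sample = "The retina converts light into neural signals. Detachment requires urgent care."
    w, s, syl = text_counts(sample)
    print(f"Reading ease: {flesch_reading_ease(w, s, syl):.1f}")
    print(f"Grade level:  {flesch_kincaid_grade(w, s, syl):.1f}")
```

Both formulas penalize long sentences and polysyllabic words, which is why dense clinical vocabulary tends to push text toward the college-to-graduate range reported above.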