Performance of artificial intelligence on a simulated Canadian urology board exam

董事会认证 泌尿科 多项选择 百分位 医学 认证 专业 教育测量 内科学 医学教育 心理学 课程 家庭医学 住院医师培训 数学 继续教育 统计 显著性差异 教育学 政治学 法学
作者
Naji J. Touma,Jessica E. Caterini,Naji J. Touma
出处
期刊:Canadian Urological Association journal [Canadian Urological Association Journal]
卷期号:18 (10) 被引量:1
标识
DOI:10.5489/cuaj.8800
摘要

Introduction: Generative artificial intelligence (AI) has proven to be a powerful tool with increasing applications in clinical care and medical education. CHATGPT has performed adequately on many specialty certification and knowledge assessment exams. The objective of this study was to assess the performance of CHATGPT 4 on a multiple-choice exam meant to simulate the Canadian urology board exam. Methods: Graduating urology residents representing all Canadian training programs gather yearly for a mock exam that simulates their upcoming board-certifying exam. The exam consists of written multiple-choice questions (MCQs) and an oral objective structured clinical examination (OSCE). The 2022 exam was taken by 29 graduating residents and was administered to CHATGPT 4. Results: CHATGPT 4 scored 46% on the MCQ exam, whereas the mean and median scores of graduating urology residents were 62.6%, and 62.7%, respectively. This would place CHATGPT's score 1.8 standard deviations from the median. The percentile rank of CHATGPT would be in the sixth percentile. CHATGPT scores on different topics of the exam were as follows: oncology 35%, andrology/benign prostatic hyperplasia 62%, physiology/anatomy 67%, incontinence/female urology 23%, infections 71%, urolithiasis 57%, and trauma/reconstruction 17%, with ChatGPT 4’s oncology performance being significantly below that of postgraduate year 5 residents. Conclusions: CHATGPT 4 underperforms on an MCQ exam meant to simulate the Canadian board exam. Ongoing assessments of the capability of generative AI is needed as these models evolve and are trained on additional urology content.

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刘洋完成签到,获得积分10
刚刚
翠T完成签到 ,获得积分10
1秒前
彭于晏应助Lion采纳,获得10
3秒前
传奇3应助晚安采纳,获得10
5秒前
徒玦完成签到 ,获得积分10
6秒前
pebble完成签到,获得积分20
7秒前
乐易李关注了科研通微信公众号
9秒前
妩媚的半雪完成签到,获得积分10
10秒前
默默向雪完成签到,获得积分10
13秒前
15秒前
拼搏的向雁完成签到 ,获得积分10
15秒前
彤航完成签到,获得积分10
17秒前
英姑应助强健的黑猫采纳,获得10
19秒前
Lion发布了新的文献求助10
22秒前
领导范儿应助zcg采纳,获得10
26秒前
乐易李发布了新的文献求助10
26秒前
wyg117完成签到,获得积分10
27秒前
核桃完成签到,获得积分10
30秒前
34秒前
赘婿应助Gaopkid采纳,获得10
38秒前
39秒前
regina完成签到,获得积分10
40秒前
平淡雪糕发布了新的文献求助10
41秒前
了了完成签到,获得积分10
43秒前
满眼喜欢遍布星河完成签到,获得积分10
44秒前
细心的梦芝完成签到 ,获得积分10
45秒前
刻苦从阳完成签到,获得积分10
46秒前
参禅不说话完成签到 ,获得积分10
47秒前
48秒前
小丸子完成签到 ,获得积分10
49秒前
共享精神应助勤劳的绿竹采纳,获得10
51秒前
Gaopkid完成签到,获得积分10
51秒前
51秒前
51秒前
51秒前
jzhou65关注了科研通微信公众号
51秒前
罗博超完成签到,获得积分10
52秒前
53秒前
WFLLL完成签到,获得积分10
53秒前
wang发布了新的社区帖子
54秒前
高分求助中
LNG地下式貯槽指針(JGA指-107) 1000
LNG地上式貯槽指針 (JGA指 ; 108) 1000
QMS18Ed2 | process management. 2nd ed 600
LNG as a marine fuel—Safety and Operational Guidelines - Bunkering 560
How Stories Change Us A Developmental Science of Stories from Fiction and Real Life 500
九经直音韵母研究 500
Full waveform acoustic data processing 500
热门求助领域 (近24小时)
化学 医学 材料科学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 物理化学 催化作用 免疫学 细胞生物学 电极
热门帖子
关注 科研通微信公众号,转发送积分 2934348
求助须知:如何正确求助?哪些是违规求助? 2589366
关于积分的说明 6975952
捐赠科研通 2234932
什么是DOI,文献DOI怎么找? 1186899
版权声明 589834
科研通“疑难数据库(出版商)”最低求助积分说明 580913