Quality and Dependability of ChatGPT and DingXiangYuan Forums for Remote Orthopedic Consultations: Comparative Analysis

质量(理念) 病历 控制(管理) 骨科手术 认证 医疗保健 家庭医学 计算机科学 医学 内科学 人工智能 外科 哲学 经济 认识论 法学 经济增长 政治学
作者
Zhaowen Xue,Yiming Zhang,Wenyi Gan,Huajun Wang,Guorong She,Xiaofei Zheng
出处
期刊:Journal of Medical Internet Research [JMIR Publications]
卷期号:26: e50882-e50882 被引量:7
标识
DOI:10.2196/50882
摘要

Background The widespread use of artificial intelligence, such as ChatGPT (OpenAI), is transforming sectors, including health care, while separate advancements of the internet have enabled platforms such as China’s DingXiangYuan to offer remote medical services. Objective This study evaluates ChatGPT-4’s responses against those of professional health care providers in telemedicine, assessing artificial intelligence’s capability to support the surge in remote medical consultations and its impact on health care delivery. Methods We sourced remote orthopedic consultations from “Doctor DingXiang,” with responses from its certified physicians as the control and ChatGPT’s responses as the experimental group. In all, 3 blindfolded, experienced orthopedic surgeons assessed responses against 7 criteria: “logical reasoning,” “internal information,” “external information,” “guiding function,” “therapeutic effect,” “medical knowledge popularization education,” and “overall satisfaction.” We used Fleiss κ to measure agreement among multiple raters. Results Initially, consultation records for a cumulative count of 8 maladies (equivalent to 800 cases) were gathered. We ultimately included 73 consultation records by May 2023, following primary and rescreening, in which no communication records containing private information, images, or voice messages were transmitted. After statistical scoring, we discovered that ChatGPT’s “internal information” score (mean 4.61, SD 0.52 points vs mean 4.66, SD 0.49 points; P=.43) and “therapeutic effect” score (mean 4.43, SD 0.75 points vs mean 4.55, SD 0.62 points; P=.32) were lower than those of the control group, but the differences were not statistically significant. ChatGPT showed better performance with a higher “logical reasoning” score (mean 4.81, SD 0.36 points vs mean 4.75, SD 0.39 points; P=.38), “external information” score (mean 4.06, SD 0.72 points vs mean 3.92, SD 0.77 points; P=.25), and “guiding function” score (mean 4.73, SD 0.51 points vs mean 4.72, SD 0.54 points; P=.96), although the differences were not statistically significant. Meanwhile, the “medical knowledge popularization education” score of ChatGPT was better than that of the control group (mean 4.49, SD 0.67 points vs mean 3.87, SD 1.01 points; P<.001), and the difference was statistically significant. In terms of “overall satisfaction,” the difference was not statistically significant between the groups (mean 8.35, SD 1.38 points vs mean 8.37, SD 1.24 points; P=.92). According to how Fleiss κ values were interpreted, 6 of the control group’s score points were classified as displaying “fair agreement” (P<.001), and 1 was classified as showing “substantial agreement” (P<.001). In the experimental group, 3 points were classified as indicating “fair agreement,” while 4 suggested “moderate agreement” (P<.001). Conclusions ChatGPT-4 matches the expertise found in DingXiangYuan forums’ paid consultations, excelling particularly in scientific education. It presents a promising alternative for remote health advice. For health care professionals, it could act as an aid in patient education, while patients may use it as a convenient tool for health inquiries.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
HYF完成签到,获得积分10
刚刚
刚刚
小兵应助科研通管家采纳,获得10
刚刚
晚来客应助科研通管家采纳,获得20
刚刚
刚刚
刚刚
Lucas应助科研通管家采纳,获得10
1秒前
1秒前
CodeCraft应助科研通管家采纳,获得10
1秒前
FashionBoy应助科研通管家采纳,获得10
1秒前
1秒前
周涛发布了新的文献求助30
3秒前
柴桑青木完成签到,获得积分0
5秒前
小柠檬完成签到,获得积分10
6秒前
少盐完成签到,获得积分10
7秒前
刻苦牛马完成签到 ,获得积分10
8秒前
8秒前
Cold发布了新的文献求助10
9秒前
好奇小怪发布了新的文献求助10
10秒前
11秒前
12秒前
SciGPT应助四糸乃采纳,获得10
13秒前
愉快日记本完成签到,获得积分10
13秒前
执着绿草发布了新的文献求助10
13秒前
14秒前
16秒前
完美世界应助lianliyou采纳,获得10
16秒前
深情安青应助xieyuanxing采纳,获得10
16秒前
17秒前
17秒前
hhan完成签到,获得积分10
17秒前
NexusExplorer应助星河在眼里采纳,获得10
18秒前
李健的小迷弟应助xiaomi采纳,获得10
18秒前
18秒前
18秒前
木子完成签到,获得积分10
19秒前
邺yu完成签到,获得积分10
20秒前
li发布了新的文献求助10
20秒前
Cecilia完成签到,获得积分10
20秒前
21秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Inherited Metabolic Disease in Adults: A Clinical Guide 500
计划经济时代的工厂管理与工人状况(1949-1966)——以郑州市国营工厂为例 500
Sociologies et cosmopolitisme méthodologique 400
Why America Can't Retrench (And How it Might) 400
Another look at Archaeopteryx as the oldest bird 390
Partial Least Squares Structural Equation Modeling (PLS-SEM) using SmartPLS 3.0 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 生物化学 物理 纳米技术 计算机科学 内科学 化学工程 复合材料 物理化学 基因 催化作用 遗传学 冶金 电极 光电子学
热门帖子
关注 科研通微信公众号,转发送积分 4633293
求助须知:如何正确求助?哪些是违规求助? 4029304
关于积分的说明 12466863
捐赠科研通 3715514
什么是DOI,文献DOI怎么找? 2050190
邀请新用户注册赠送积分活动 1081753
科研通“疑难数据库(出版商)”最低求助积分说明 964055