Evolution in Development of a Predictive Deep-Learning Model for Total Hip Replacement Based on Radiographs

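Receiver operating characteristic, Deep learning, Radiography, Medicine, Artificial intelligence, Osteoarthritis, Observational study, Grading (engineering), Convolutional neural network, Machine learning, Medical physics, Radiology, Computer science, Internal medicine, Pathology, Engineering, Civil engineering, Alternative medicine
Author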
K S R K Prasad
Source
Journal: Journal of Bone and Joint Surgery, American Volume [Journal of Bone and Joint Surgery]
Volume (Issue): 106 (5): e12-e12
Identifier
DOI:10.2106/jbjs.23.01317
Abstract

Commentary

Although a multitude of predictive factors for total hip replacement (THR) in osteoarthritis (OA) patients have been identified for use in traditional predictive statistical models [1], their use in emerging deep-learning models has been limited, with deep learning predominantly being utilized for the prediction of established radiographic grades [2-4]. von Schacky et al. [2] developed a multitask deep convolutional neural network (DCNN) model to grade OA features in hip radiographs obtained from the Osteoarthritis Initiative (OAI) study. The DCNN model demonstrated diagnostic accuracy similar to that of an expert musculoskeletal radiologist. Leung et al. [3] developed a deep-learning model that utilized radiographs from the OAI study to predict the Kellgren-Lawrence grade and probability of total knee replacement within 9 years, and the deep-learning model outperformed human binary outcome models based on standard grading systems. However, prior to the study by Xu et al., no study is believed to have constructed a DCNN model to assess the risk of THR. This retrospective, multicenter, case-control study thus represents the first to utilize a DCNN model to assess the risk of THR with use of baseline radiographs and basic clinical symptoms. The authors applied robust data from the OAI, a National Institutes of Health-initiated longitudinal, multicenter, observational study. Within limitations, the DCNN-based study model achieved an overall sensitivity and specificity of 92.59% and 86.96%, respectively, and a high area under the receiver operating characteristic curve (AUC) of 0.944 to predict THR within 9 years. The AUC for the most likely time frame was 0.907 for 0 to 2 years, 0.916 for 3 to 5 years, and 0.841 (95% confidence interval, 0.697 to 0.985) for 6 to 9 years. These high values for the DCNN deep-learning model developed in this study indicate the feasibility of using the model for predicting the risk of THR from baseline radiographs and clinical symptoms.
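For readers less familiar with the metrics quoted above, the following is a minimal Python sketch of how sensitivity, specificity, and AUC are computed in general. It is not the code used by Xu et al.; the function names, the confusion-matrix counts, and the risk scores are all hypothetical and were chosen only to make the arithmetic easy to follow.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts.

    sensitivity = share of true THR cases the model flags
    specificity = share of controls the model correctly clears
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity


def auc_from_scores(case_scores, control_scores):
    """AUC as the probability that a randomly chosen case receives a
    higher risk score than a randomly chosen control (ties count 0.5)."""
    wins = 0.0
    for case in case_scores:
        for control in control_scores:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


# Hypothetical counts, NOT taken from the study.
sens, spec = sensitivity_specificity(tp=24, fn=2, tn=70, fp=8)
print(f"sensitivity = {sens:.2%}, specificity = {spec:.2%}")

# Hypothetical predicted risk scores, NOT taken from the study.
cases = [0.91, 0.78, 0.66, 0.40]
controls = [0.72, 0.35, 0.22, 0.10, 0.05]
print(f"AUC = {auc_from_scores(cases, controls):.3f}")
```

The pairwise definition of AUC used here is equivalent to the area under the ROC curve, which is why an AUC near 1 indicates that cases are almost always ranked above controls.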
The model not only resulted in a high AUC for the 9-year risk estimate, but also displayed good discrimination between patients who would and would not undergo THR during the three 3-year time intervals within the 9 years. Thus, the model would enable the identification of patients with an imminent risk of osteoarthritis progression resulting in arthroplasty within 3 years as well as aid in monitoring of the patients predicted to be at risk for THR in the 2 later time periods and arranging appropriately timed interventions. A total of 736 participants from the OAI data set were analyzed, including 184 with OA who subsequently underwent THR and 552 controls. Over 4,000 individuals were excluded from the analysis of the OAI data set for not meeting the previously defined selection criteria or not having a propensity-score-based match. Cases and controls were each split at 72% (n = 528), 14% (n = 104), and 14% (n = 104) into training, validation, and testing cohorts. This split implies a cohort of just 26 patients each for validation and testing in the case group. Most participants were White and most had relatively high levels of education, income, and medical insurance, which may have impacted patient decision-making in favor of THR and the generalizability of the results. A high rate of loss to follow-up is to be expected for a study with this design and this duration (108 months in the OAI data set). The study defined the outcome as the performance of THR during various time periods, which enabled training of the DCNN model to classify patients regarding whether or not they were expected to undergo THR at any time during one of the time periods. Predicting a particular likely time to THR would have required a very different approach, using regression rather than classification. 
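The cohort arithmetic and the classification framing described above can be sketched in a few lines of Python. This is an illustration only: the helper names are hypothetical, the sketch assumes each cohort keeps the overall case-to-control ratio (as "cases and controls were each split" suggests), and the whole-year interval boundaries are an assumption based on the 0 to 2, 3 to 5, and 6 to 9 year windows quoted in the commentary.

```python
from typing import Dict, Optional

TOTAL_CASES = 184      # OA patients who later underwent THR
TOTAL_CONTROLS = 552   # matched controls
COHORT_SIZES = {"training": 528, "validation": 104, "testing": 104}


def cases_per_cohort(cohort_sizes: Dict[str, int],
                     total_cases: int,
                     total_controls: int) -> Dict[str, int]:
    """Number of cases in each cohort, assuming every cohort preserves
    the overall case:control ratio (an assumption of this sketch)."""
    total = total_cases + total_controls
    return {name: n * total_cases // total for name, n in cohort_sizes.items()}


def interval_label(years_to_thr: Optional[int]) -> str:
    """Map whole years from baseline to a class label.

    None means no THR was observed. The model is trained to predict
    this label (classification), not the exact time itself (regression).
    """
    if years_to_thr is None or years_to_thr > 9:
        return "no THR within 9 years"
    if years_to_thr <= 2:
        return "0-2 years"
    if years_to_thr <= 5:
        return "3-5 years"
    return "6-9 years"


print(cases_per_cohort(COHORT_SIZES, TOTAL_CASES, TOTAL_CONTROLS))
print(interval_label(4))
```

With 184 cases among 736 participants, one quarter of each 104-participant cohort is 26 cases, which matches the figure given in the commentary for validation and testing.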
Although pure researchers and data scientists may prefer precise estimates of timing to THR, in clinical practice the choice and determination of THR timing are multifactorial; furthermore, the timing involves shared decision-making between the patient and surgeon. The radiographs utilized in this DCNN model were entirely anteroposterior pelvic radiographs, which could be considered a limitation as other models have selectively utilized other views, allowing for greater transfer learning. An additional limitation of the study is that the methodological steps involved in the learning process and the parsing of the input data are inherently indiscernible with the use of artificial intelligence (AI); however, the precision and accuracy of the model are adequate indirect corroborators that the model was appropriately developed. Such deep-learning models to predict THR in patients with OA need to be refined and validated in a large, diverse, prospective cohort study before being adopted into routine clinical practice; however, such superior prospective studies would be resource-intensive, and their feasibility is uncertain. Realistically, the incorporation of other statistical models, especially in countries with robust clinical data, may complement and improve the accuracy of deep-learning models. In theory, the use of AI eliminates the potential for human error of interpretation, ensures diagnostic accuracy comparable with that of expert interpretation, prognosticates the potential risk and timing of THR, and informs shared clinical decision-making. The model described by Xu et al. provides an estimate of the risk of arthroplasty, and of its timing (in 3-year intervals), within 9 years with use of basic anteroposterior radiographs and clinical data. These results represent a fascinating prospect for patient counseling and operative planning, and could factor into the formulation of institutional, regional, and national policy and into health-care delivery.
The use of deep-learning AI to predict operative risk, both generally and within specific time intervals, represents an interesting, imaginative, and innovative field with immense potential for evolution, and it may well prove to be a useful addition to the clinical frontier.
