Evolution in Development of a Predictive Deep-Learning Model for Total Hip Replacement Based on Radiographs

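Receiver operating characteristic; Deep learning; Radiography; Medicine; Artificial intelligence; Osteoarthritis; Observational study; Grading (engineering); Convolutional neural network; Machine learning; Medical physics; Radiology; Computer science; Internal medicine; Pathology; Engineering; Civil engineering; Alternative medicine
Author
K S R K Prasad
Source
Journal: Journal of Bone and Joint Surgery, American Volume [Journal of Bone and Joint Surgery]
Volume (Issue): 106 (5): e12-e12
Identifier
DOI: 10.2106/jbjs.23.01317
Abstract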

Commentary

Although a multitude of predictive factors for total hip replacement (THR) in osteoarthritis (OA) patients have been identified for use in traditional predictive statistical models [1], their use in emerging deep-learning models has been limited, with deep learning predominantly being utilized for the prediction of established radiographic grades [2-4]. von Schacky et al. [2] developed a multitask deep convolutional neural network (DCNN) model to grade OA features in hip radiographs obtained from the Osteoarthritis Initiative (OAI) study. The DCNN model demonstrated diagnostic accuracy similar to that of an expert musculoskeletal radiologist. Leung et al. [3] developed a deep-learning model that utilized radiographs from the OAI study to predict the Kellgren-Lawrence grade and the probability of total knee replacement within 9 years, and the deep-learning model outperformed human binary outcome models based on standard grading systems. However, prior to the study by Xu et al., no study is believed to have constructed a DCNN model to assess the risk of THR. This retrospective, multicenter, case-control study thus represents the first to utilize a DCNN model to assess the risk of THR with use of baseline radiographs and basic clinical symptoms. The authors applied robust data from the OAI, a National Institutes of Health-initiated longitudinal, multicenter, observational study. Within limitations, the DCNN-based study model achieved an overall sensitivity and specificity of 92.59% and 86.96%, respectively, and a high area under the receiver operating characteristic curve (AUC) of 0.944 to predict THR within 9 years. The AUC for the most likely time frame was 0.907 for 0 to 2 years, 0.916 for 3 to 5 years, and 0.841 (95% confidence interval, 0.697 to 0.985) for 6 to 9 years. These high values for the DCNN deep-learning model developed in this study indicate the feasibility of using the model for predicting the risk of THR from baseline radiographs and clinical symptoms.
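For readers less familiar with the metrics quoted above, the following is a minimal sketch of how sensitivity, specificity, and AUC are computed for a binary classifier. The labels, scores, and the 0.5 decision threshold are invented toy values for illustration only; they are not data from Xu et al., and the function names are hypothetical.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels (1 = THR, 0 = no THR)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)


def auc(y_true, scores):
    """AUC as the probability that a random case outscores a random control.

    Ties count as half a win (the Mann-Whitney formulation of the AUC).
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy example (NOT study data): 3 cases and 4 controls with model scores.
y_true = [1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2, 0.1]
threshold = 0.5  # assumed cut-off; the commentary does not state one
y_pred = [1 if s >= threshold else 0 for s in scores]

sens, spec = sensitivity_specificity(y_true, y_pred)
toy_auc = auc(y_true, scores)
print(f"sensitivity={sens:.3f} specificity={spec:.3f} AUC={toy_auc:.3f}")
```

Sensitivity and specificity depend on the chosen threshold, whereas the AUC summarizes discrimination across all thresholds, which is why a commentary such as this one reports both.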
The model not only resulted in a high AUC for the 9-year risk estimate, but also displayed good discrimination between patients who would and would not undergo THR during the three 3-year time intervals within the 9 years. Thus, the model would enable the identification of patients with an imminent risk of osteoarthritis progression resulting in arthroplasty within 3 years, as well as aid in monitoring the patients predicted to be at risk for THR in the 2 later time periods and in arranging appropriately timed interventions. A total of 736 participants from the OAI data set were analyzed, including 184 with OA who subsequently underwent THR and 552 controls. Over 4,000 individuals were excluded from the analysis of the OAI data set for not meeting the previously defined selection criteria or not having a propensity-score-based match. Cases and controls were each split at 72% (n = 528), 14% (n = 104), and 14% (n = 104) into training, validation, and testing cohorts. Because cases make up one-quarter of the participants, this split implies just 26 case patients in each of the validation and testing cohorts. Most participants were White and most had relatively high levels of education, income, and medical insurance, which may have influenced patient decision-making in favor of THR and may limit the generalizability of the results. A high rate of loss to follow-up is to be expected for a study with this design and this duration (108 months in the OAI data set). The study defined the outcome as the performance of THR during various time periods, which enabled training of the DCNN model to classify patients regarding whether or not they were expected to undergo THR at any time during one of the time periods. Predicting a particular likely time to THR would have required a very different approach, using regression rather than classification.
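The cohort arithmetic above can be checked with a short sketch. It assumes, as the phrase "cases and controls were each split" suggests, that the 1:3 case-to-control ratio is preserved in every cohort; the variable names are illustrative and are not taken from the study.

```python
total = 736
cases = 184
controls = 552
assert cases + controls == total

# Cohort sizes as reported in the commentary.
split_sizes = {"training": 528, "validation": 104, "testing": 104}
assert sum(split_sizes.values()) == total

cases_per_split = {}
controls_per_split = {}
for name, n in split_sizes.items():
    # Assumption: each cohort keeps the overall case fraction (184 / 736 = 25%).
    n_cases = n * cases // total
    cases_per_split[name] = n_cases
    controls_per_split[name] = n - n_cases
    share = 100 * n / total
    print(f"{name}: n={n} ({share:.1f}% of total), "
          f"cases={n_cases}, controls={n - n_cases}")
```

Under that assumption, the validation and testing cohorts each contain 26 cases and 78 controls, so each reported test-set metric rests on a small number of THR events. This is consistent with the wide confidence interval quoted for the 6-to-9-year AUC.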
Although pure researchers and data scientists may prefer precise estimates of timing to THR, in clinical practice the choice and determination of THR timing are multifactorial; furthermore, the timing involves shared decision-making between the patient and surgeon. The radiographs utilized in this DCNN model were entirely anteroposterior pelvic radiographs, which could be considered a limitation, as other models have selectively utilized other views, allowing for greater transfer learning. An additional limitation of the study is that the methodological steps involved in the learning process and the parsing of the input data are inherently indiscernible with the use of artificial intelligence (AI); however, the precision and accuracy of the model are adequate indirect corroborators that the model was appropriately developed. Such deep-learning models to predict THR in patients with OA need to be refined and validated in a large, diverse, prospective cohort study before being adopted into routine clinical practice; however, such superior prospective studies would be resource-intensive, and their feasibility is uncertain. Realistically, the incorporation of other statistical models, especially in countries with robust clinical data, may complement and improve the accuracy of deep-learning models. In theory, the use of AI eliminates the potential for human error of interpretation, ensures diagnostic accuracy comparable with that of expert interpretation, prognosticates the potential risk and timing of THR, and informs shared clinical decision-making. The model described by Xu et al. provides an estimate of the risk of arthroplasty, and of its timing (in 3-year intervals), within 9 years with use of basic anteroposterior radiographs and clinical data. These results represent a fascinating prospect for patient counseling and operative planning, and could factor into the formulation of institutional, regional, and national policy and into health-care delivery.
The use of deep-learning AI to predict operative risk, both generally and within specific time intervals, represents an interesting, imaginative, and innovative field with immense potential for evolution, and it may well prove to be a useful addition to the clinical frontier.
