Evolution in Development of a Predictive Deep-Learning Model for Total Hip Replacement Based on Radiographs

接收机工作特性 深度学习 射线照相术 医学 人工智能 骨关节炎 观察研究 分级(工程) 卷积神经网络 机器学习 医学物理学 放射科 计算机科学 内科学 病理 工程类 土木工程 替代医学
作者
K S R K Prasad
出处
期刊:Journal of Bone and Joint Surgery, American Volume [Wolters Kluwer]
卷期号:106 (5): e12-e12
标识
DOI:10.2106/jbjs.23.01317
摘要

Commentary Although a multitude of predictive factors for total hip replacement (THR) in osteoarthritis (OA) patients have been identified for use in traditional predictive statistical models1, their use in emerging deep-learning models has been limited, with deep-learning predominantly being utilized for the prediction of established radiographic grades2-4. von Schacky et al.2 developed a multitask deep convoluted neural network (DCNN) model to grade OA features in hip radiographs obtained from the Osteoarthritis Initiative (OAI) study. The DCNN model demonstrated similar diagnostic accuracy compared with an expert musculoskeletal radiologist. Leung et al.3 developed a deep-learning model that utilized radiographs from the OAI study to predict the Kellgren-Lawrence grade and probability of total knee replacement within 9 years, and the deep-learning model outperformed human binary outcome models based on standard grading systems. However, prior to the study by Xu et al., no study is believed to have constructed a DCNN model to assess the risk of THR. This retrospective, multicenter, case-control study thus represents the first to utilize a DCNN model to assess the risk of THR with use of baseline radiographs and basic clinical symptoms. The authors applied robust data from the OAI, a National Institutes of Health-initiated longitudinal, multicenter, observational study. Within limitations, the DCNN-based study model achieved an overall sensitivity and specificity of 92.59% and 86.96%, respectively, and a high area under the receiver operating characteristic curve (AUC) of 0.944 to predict THR within 9 years. The AUC for the most likely time frame was 0.907 for 0 to 2 years, 0.916 for 3 to 5 years, and 0.841 (95% confidence interval, 0.697 to 0.985) for 6 to 9 years. These high values for the DCNN deep-learning model developed in this study indicate the feasibility of using the model for predicting the risk of THR from baseline radiographs and clinical symptoms. The model not only resulted in a high AUC for the 9-year risk estimate, but also displayed good discrimination between patients who would and would not undergo THR during the three 3-year time intervals within the 9 years. Thus, the model would enable the identification of patients with an imminent risk of osteoarthritis progression resulting in arthroplasty within 3 years as well as aid in monitoring of the patients predicted to be at risk for THR in the 2 later time periods and arranging appropriately timed interventions. A total of 736 participants from the OAI data set were analyzed, including 184 with OA who subsequently underwent THR and 552 controls. Over 4,000 individuals were excluded from the analysis of the OAI data set for not meeting the previously defined selection criteria or not having a propensity-score-based match. Cases and controls were each split at 72% (n = 528), 14% (n = 104), and 14% (n = 104) into training, validation, and testing cohorts. This split implies a cohort of just 26 patients each for validation and testing in the case group. Most participants were White and most had relatively high levels of education, income, and medical insurance, which may have impacted patient decision-making in favor of THR and the generalizability of the results. A high rate of loss to follow-up is to be expected for a study with this design and this duration (108 months in the OAI data set). The study defined the outcome as the performance of THR during various time periods, which enabled training of the DCNN model to classify patients regarding whether or not they were expected to undergo THR at any time during one of the time periods. Predicting a particular likely time to THR would have required a very different approach, using regression rather than classification. Although pure researchers and data scientists may prefer precise estimates of timing to THR, in clinical practice the choice and determination of THR timing are multifactorial; furthermore, the timing involves shared-decision-making between the patient and surgeon. The radiographs utilized in this DCNN model were entirely anteroposterior pelvic radiographs, which could be considered a limitation as other models have selectively utilized other views, allowing for greater transfer learning. An additional limitation of the study is that the methodological steps involved in the learning process and the parsing of the input data are inherently indiscernible with the use of artificial intelligence (AI); however, the precision and accuracy of the model are adequate indirect corroborators that the model was appropriately developed. Such deep-learning models to predict THR in patients with OA need to be refined and validated in a large, diverse, prospective cohort study before being adopted into routine clinical practice; however, such superior prospective studies would be resource-intensive, and their feasibility is uncertain. Realistically, the incorporation of other statistical models, especially in countries with robust clinical data, may complement and improve the accuracy of deep-learning models. In theory, the use of AI eliminates the potential for human error of interpretation, ensures diagnostic accuracy comparable with that of expert interpretation, prognosticates the potential risk and timing of THR, and informs shared clinical decision-making. The model described by Xu et al. provides an estimate of the risk of arthroplasty, and of its timing (in 3-year intervals), within 9 years with use of basic anteroposterior radiographs and clinical data. These results represent a fascinating prospect for patient counseling and operative planning, and could factor into the formulation of institutional, regional, and national policy and into health-care delivery. The use of deep-learning AI to predict operative risk, both generally and within specific time intervals, represents an interesting, imaginative, and innovative field with immense potential for evolution, and it may well prove to be a useful addition to the clinical frontier.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
WYMD应助seven采纳,获得60
刚刚
蓝色的梦完成签到,获得积分10
刚刚
hjkk发布了新的文献求助10
刚刚
邵钰博应助HYD采纳,获得10
1秒前
tt发布了新的文献求助30
1秒前
迷路谷菱完成签到,获得积分10
1秒前
1秒前
木又完成签到,获得积分10
1秒前
从容面包发布了新的文献求助10
2秒前
YML发布了新的文献求助10
2秒前
2秒前
馒头完成签到,获得积分10
2秒前
我滴个完成签到,获得积分10
2秒前
2秒前
Kestis.发布了新的文献求助10
2秒前
内向东蒽完成签到,获得积分10
3秒前
3秒前
zhang完成签到,获得积分10
3秒前
huofuman完成签到,获得积分10
3秒前
烟花应助犹豫晓啸采纳,获得10
3秒前
369ninja应助jiang1998采纳,获得10
4秒前
4秒前
友好的小虾米完成签到,获得积分10
4秒前
4秒前
AHYSGUGSY给AHYSGUGSY的求助进行了留言
5秒前
李浩然完成签到,获得积分10
5秒前
枣核完成签到 ,获得积分10
5秒前
合适惋清发布了新的文献求助10
6秒前
沉静的煎蛋完成签到,获得积分10
6秒前
万能图书馆应助Andrew123采纳,获得10
6秒前
小懒发布了新的文献求助10
6秒前
6秒前
7秒前
Lyy发布了新的文献求助10
7秒前
棉花糖完成签到,获得积分10
7秒前
乔垣结衣发布了新的文献求助10
7秒前
盒子先生完成签到,获得积分10
7秒前
石头完成签到,获得积分10
7秒前
机智的香菇完成签到,获得积分10
7秒前
鲤跃完成签到,获得积分10
7秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Cronologia da história de Macau 5000
Merrill's Atlas of Radiographic Positioning and Procedures - 3-Volume Set, 16th Edition 2000
Matrix Methods in Data Mining and Pattern Recognition 540
Interactions of Vowel Quality and Prosody in East Slavic 500
Vander's Renal Physiology第10版 500
Materials Informatics Molecules, Crystals and Beyond A volume in Acta Materialia Book Series 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 内科学 物理 复合材料 催化作用 细胞生物学 无机化学 光电子学 物理化学 电极 基因
热门帖子
关注 科研通微信公众号,转发送积分 7066380
求助须知:如何正确求助?哪些是违规求助? 8727682
关于积分的说明 18469429
捐赠科研通 6596858
什么是DOI,文献DOI怎么找? 3125920
关于科研通互助平台的介绍 2221800
邀请新用户注册赠送积分活动 2101513