清晨好,您是今天最早来到科研通的研友!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您科研之路漫漫前行!

Development and internal-external validation of statistical and machine learning models for breast cancer prognostication: cohort study

乳腺癌 医学 四分位间距 癌症登记处 置信区间 队列 人口 比例危险模型 癌症 队列研究 机器学习 内科学 计算机科学 环境卫生
作者
Ash Kieran Clift,David Dodwell,Simon Lord,Stavros Petrou,Michael Brady,Gary S. Collins,Julia Hippisley‐Cox
标识
DOI:10.1136/bmj-2022-073800
摘要

Abstract Objective To develop a clinically useful model that estimates the 10 year risk of breast cancer related mortality in women (self-reported female sex) with breast cancer of any stage, comparing results from regression and machine learning approaches. Design Population based cohort study. Setting QResearch primary care database in England, with individual level linkage to the national cancer registry, Hospital Episodes Statistics, and national mortality registers. Participants 141 765 women aged 20 years and older with a diagnosis of invasive breast cancer between 1 January 2000 and 31 December 2020. Main outcome measures Four model building strategies comprising two regression (Cox proportional hazards and competing risks regression) and two machine learning (XGBoost and an artificial neural network) approaches. Internal-external cross validation was used for model evaluation. Random effects meta-analysis that pooled estimates of discrimination and calibration metrics, calibration plots, and decision curve analysis were used to assess model performance, transportability, and clinical utility. Results During a median 4.16 years (interquartile range 1.76-8.26) of follow-up, 21 688 breast cancer related deaths and 11 454 deaths from other causes occurred. Restricting to 10 years maximum follow-up from breast cancer diagnosis, 20 367 breast cancer related deaths occurred during a total of 688 564.81 person years. The crude breast cancer mortality rate was 295.79 per 10 000 person years (95% confidence interval 291.75 to 299.88). Predictors varied for each regression model, but both Cox and competing risks models included age at diagnosis, body mass index, smoking status, route to diagnosis, hormone receptor status, cancer stage, and grade of breast cancer. The Cox model’s random effects meta-analysis pooled estimate for Harrell’s C index was the highest of any model at 0.858 (95% confidence interval 0.853 to 0.864, and 95% prediction interval 0.843 to 0.873). It appeared acceptably calibrated on calibration plots. The competing risks regression model had good discrimination: pooled Harrell’s C index 0.849 (0.839 to 0.859, and 0.821 to 0.876, and evidence of systematic miscalibration on summary metrics was lacking. The machine learning models had acceptable discrimination overall (Harrell’s C index: XGBoost 0.821 (0.813 to 0.828, and 0.805 to 0.837); neural network 0.847 (0.835 to 0.858, and 0.816 to 0.878)), but had more complex patterns of miscalibration and more variable regional and stage specific performance. Decision curve analysis suggested that the Cox and competing risks regression models tested may have higher clinical utility than the two machine learning approaches. Conclusion In women with breast cancer of any stage, using the predictors available in this dataset, regression based methods had better and more consistent performance compared with machine learning approaches and may be worthy of further evaluation for potential clinical use, such as for stratified follow-up.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
宁幼萱完成签到,获得积分10
8秒前
香丿完成签到 ,获得积分10
9秒前
lilylwy完成签到 ,获得积分0
14秒前
南宫若翠完成签到 ,获得积分10
18秒前
儒雅的如松完成签到 ,获得积分10
19秒前
专注的觅云完成签到 ,获得积分10
19秒前
Rui完成签到,获得积分10
22秒前
Shandongdaxiu完成签到 ,获得积分10
23秒前
jxjsdlh完成签到 ,获得积分10
26秒前
小文殊完成签到 ,获得积分10
26秒前
35秒前
zpf发布了新的文献求助10
40秒前
明天吖在吗完成签到,获得积分10
47秒前
zpf完成签到,获得积分20
48秒前
文6完成签到 ,获得积分10
48秒前
小山己几完成签到,获得积分10
49秒前
kevin完成签到 ,获得积分10
56秒前
Aimee完成签到 ,获得积分10
1分钟前
Hindiii完成签到,获得积分10
1分钟前
George完成签到,获得积分10
1分钟前
科研通AI2S应助科研通管家采纳,获得10
1分钟前
虞无声完成签到,获得积分10
1分钟前
虚心青梦完成签到 ,获得积分10
1分钟前
1分钟前
呆橘完成签到 ,获得积分10
1分钟前
海英完成签到,获得积分10
1分钟前
xuxu发布了新的文献求助10
1分钟前
加贝完成签到 ,获得积分10
1分钟前
赖氨酸完成签到,获得积分10
1分钟前
轻语完成签到 ,获得积分10
1分钟前
朴素海亦完成签到 ,获得积分10
1分钟前
周周周完成签到 ,获得积分10
2分钟前
2分钟前
玉米之路发布了新的文献求助10
2分钟前
yong完成签到 ,获得积分10
2分钟前
美满的皮卡丘完成签到 ,获得积分10
2分钟前
哈哈完成签到,获得积分10
2分钟前
玉米之路完成签到,获得积分20
2分钟前
cugwzr完成签到,获得积分10
2分钟前
语恒完成签到,获得积分10
2分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Handbook of pharmaceutical excipients, Ninth edition 5000
Aerospace Standards Index - 2026 ASIN2026 3000
Signals, Systems, and Signal Processing 610
Discrete-Time Signals and Systems 610
Principles of town planning : translating concepts to applications 500
Social Work and Social Welfare: An Invitation(7th Edition) 410
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 纳米技术 有机化学 物理 生物化学 化学工程 计算机科学 复合材料 内科学 催化作用 光电子学 物理化学 电极 冶金 遗传学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 6059013
求助须知:如何正确求助?哪些是违规求助? 7891570
关于积分的说明 16297060
捐赠科研通 5203346
什么是DOI,文献DOI怎么找? 2783932
邀请新用户注册赠送积分活动 1766619
关于科研通互助平台的介绍 1647154