Deep Learning-based Propensity Scores for Confounding Control in Comparative Effectiveness Research: A Large-scale, Real-world Data Study.

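Propensity score matching; Computer science; Causal inference; Observational study; Confounding; Covariate; Machine learning; Medicine; Statistics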
Authors
Janick Weberpals, Tim Becker, Jessica Davies, Fabian Schmich, Dominik Rüttinger, Fabian J. Theis, Anna Bauer-Mehren
Source
Journal: Epidemiology [Lippincott Williams & Wilkins]
Volume (issue): 32 (3): 378-388
Identifier
DOI:10.1097/ede.0000000000001338
Abstract

BACKGROUND: Due to the non-randomized nature of real-world data, prognostic factors need to be balanced, which is often done by propensity scores (PSs). This study aimed to investigate whether autoencoders, which are unsupervised deep learning architectures, might be leveraged to compute PS.

METHODS: We selected patient-level data of 128,368 first-line treated cancer patients from the Flatiron Health EHR-derived de-identified database. We trained an autoencoder architecture to learn a lower-dimensional patient representation, which we used to compute PS. To compare the performance of an autoencoder-based PS with established methods, we performed a simulation study. We assessed the balancing and adjustment performance using standardized mean differences, root mean square errors (RMSE), percent bias, and confidence interval coverage. To illustrate the application of the autoencoder-based PS, we emulated the PRONOUNCE trial by applying the trial's protocol elements within an observational database setting, comparing two chemotherapy regimens.

RESULTS: All methods but the manual variable selection approach led to well-balanced cohorts with average standardized mean differences <0.1. LASSO yielded on average the lowest deviation of resulting estimates (RMSE 0.0205) followed by the autoencoder approach (RMSE 0.0248). Altering the hyperparameter setup in sensitivity analysis, the autoencoder approach led to similar results as LASSO (RMSE 0.0203 and 0.0205, respectively). In the case study, all methods provided a similar conclusion with point estimates clustered around the null (e.g., HR_autoencoder 1.01 [95% confidence interval = 0.80, 1.27] vs. HR_PRONOUNCE 1.07 [0.83, 1.36]).

CONCLUSIONS: Autoencoder-based PS computation was a feasible approach to control for confounding but did not perform better than some established approaches like LASSO.
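
The abstract names four evaluation metrics (standardized mean differences, RMSE, percent bias, and confidence interval coverage) without stating their formulas. The following is a minimal Python sketch of the commonly used textbook definitions, not the authors' code: the function names, the pooled-standard-deviation form of the standardized mean difference, and the toy numbers are all assumptions made for illustration.

```python
import math
from statistics import mean, variance


def standardized_mean_difference(treated, control):
    """Absolute standardized mean difference for one covariate.

    Assumed definition: difference in group means divided by the pooled
    standard deviation sqrt((var_treated + var_control) / 2).
    """
    diff = abs(mean(treated) - mean(control))
    pooled_sd = math.sqrt((variance(treated) + variance(control)) / 2)
    if pooled_sd == 0:
        # No spread in either group: balanced only if the means coincide.
        return 0.0 if diff == 0 else math.inf
    return diff / pooled_sd


def simulation_metrics(estimates, intervals, true_effect):
    """RMSE, percent bias, and CI coverage across simulation replicates.

    estimates   -- one point estimate per replicate
    intervals   -- one (lower, upper) confidence interval per replicate
    true_effect -- the known simulated effect (must be nonzero for percent bias)
    """
    rmse = math.sqrt(mean([(e - true_effect) ** 2 for e in estimates]))
    percent_bias = 100 * (mean(estimates) - true_effect) / true_effect
    coverage = mean([1.0 if lo <= true_effect <= hi else 0.0
                     for lo, hi in intervals])
    return {"rmse": rmse, "percent_bias": percent_bias, "coverage": coverage}


if __name__ == "__main__":
    # Hypothetical toy data, not taken from the study.
    smd = standardized_mean_difference([1.0, 2.0, 3.0], [2.0, 3.0, 4.0])
    print("SMD:", smd)
    print(simulation_metrics([0.9, 1.1], [(0.8, 1.0), (0.8, 0.95)], 1.0))
```

Under these definitions, a covariate is usually considered balanced when its standardized mean difference falls below 0.1, which is the threshold the abstract reports for the average across covariates.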
