计算机科学
比例危险模型
杠杆(统计)
随机梯度下降算法
数据集
回归
人工神经网络
算法
人工智能
数据挖掘
统计
数学
作者
Aliasghar Tarkhan,Noah Simon
出处
期刊:Biostatistics
[Oxford University Press]
日期:2022-10-26
卷期号:25 (1): 134-153
被引量:2
标识
DOI:10.1093/biostatistics/kxac039
摘要
Abstract In many biomedical applications, outcome is measured as a “time-to-event” (e.g., disease progression or death). To assess the connection between features of a patient and this outcome, it is common to assume a proportional hazards model and fit a proportional hazards regression (or Cox regression). To fit this model, a log-concave objective function known as the “partial likelihood” is maximized. For moderate-sized data sets, an efficient Newton–Raphson algorithm that leverages the structure of the objective function can be employed. However, in large data sets this approach has two issues: (i) The computational tricks that leverage structure can also lead to computational instability; (ii) The objective function does not naturally decouple: Thus, if the data set does not fit in memory, the model can be computationally expensive to fit. This additionally means that the objective is not directly amenable to stochastic gradient-based optimization methods. To overcome these issues, we propose a simple, new framing of proportional hazards regression: This results in an objective function that is amenable to stochastic gradient descent. We show that this simple modification allows us to efficiently fit survival models with very large data sets. This also facilitates training complex, for example, neural-network-based, models with survival data.
科研通智能强力驱动
Strongly Powered by AbleSci AI