Scalable Algorithms for Large Competing Risks Data
收藏DataCite Commons2024-02-07 更新2024-07-28 收录
下载链接:
https://tandf.figshare.com/articles/dataset/Scalable_Algorithms_for_Large_Competing_Risks_Data/13166579
下载链接
链接失效反馈官方服务:
资源简介:
This paper develops two orthogonal contributions to scalable sparse regression for competing risks time-to-event data. First, we study and accelerate the broken adaptive ridge method (BAR), a surrogate <i>ℓ</i>
<sub>0</sub>-based iteratively reweighted <i>ℓ</i>
<sub>2</sub>-penalization algorithm that achieves sparsity in its limit, in the context of the Fine-Gray (1999) proportional subdistributional hazards (PSH) model. In particular, we derive a new algorithm for BAR regression, named cycBAR, that performs cyclic update of each coordinate using an explicit thresholding formula. The new cycBAR algorithm effectively avoids fitting multiple reweighted <i>ℓ</i>
<sub>2</sub>-penalizations and thus yields impressive speedups over the original BAR algorithm. Second, we address a pivotal computational issue related to fitting the PSH model. Specifically, the computation costs of the log-pseudo likelihood and its derivatives for PSH model grow at the rate of <i>O</i>(<i>n</i>
<sup>2</sup>) with the sample size <i>n</i> in current implementations. We propose a novel forward-backward scan algorithm that reduces the computation costs to <i>O</i>(<i>n</i>). The proposed method applies to both unpenalized and penalized estimation for the PSH model and has exhibited drastic speedups over current implementations. Finally, combining the two algorithms can yields >1,000 fold speedups over the original BAR algorithm. Illustrations of the impressive scalability of our proposed algorithm for large competing risks data are given using both simulations and a United States Renal Data System data. Supplementary materials for this article are available online.
提供机构:
Taylor & Francis
创建时间:
2021-09-16



