Extrapolation Before Imputation Reduces Bias When Imputing Censored Covariates
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://figshare.com/articles/dataset/Extrapolation_before_imputation_reduces_bias_when_imputing_censored_covariates/28143444
下载链接
链接失效反馈官方服务:
资源简介:
Modeling symptom progression to identify ideal subjects for a Huntington’s disease clinical trial is problematic since time to diagnosis, a key covariate, can be heavily censored. Imputation is an appealing strategy that replaces the censored covariate with its conditional mean, but existing methods saw over 200% bias under heavy censoring. Calculating conditional means well requires estimating and then integrating over the survival function of the censored covariate from the censored value to infinity. To estimate the survival function flexibly, existing methods use the semiparametric Cox model with Breslow’s estimator, leaving the integrand for the conditional means (the survival function) undefined beyond the observed data. The integral is then estimated up to the largest observed covariate value, and this approximation can cut off the tail of the survival function and lead to severe bias. We combine the semiparametric survival estimator with a parametric extension to approximate the integral up to infinity. In simulations, our proposed extrapolation-before-imputation approach substantially reduces the bias seen with existing imputation methods, sometimes even when the parametric extension was misspecified. We further demonstrate how imputing with corrected conditional means can prioritize subjects for clinical trials. The R code to reproduce results is available in the supplementary material. Supplementary materials for this article are available online.
创建时间:
2025-01-06



