five

Analysis of bootstrap and subsampling in high-dimensional regularized regression (code)

收藏
DataCite Commons2026-03-12 更新2026-05-04 收录
下载链接:
https://archive.materialscloud.org/doi/10.24435/materialscloud:dv-7d
下载链接
链接失效反馈
官方服务:
资源简介:
We investigate popular resampling methods for estimating the uncertainty of statistical models, such as subsampling, bootstrap and the jackknife, and their performance in high-dimensional supervised regression tasks. We provide a tight asymptotic description of the biases and variances estimated by these methods in the context of generalized linear models, such as ridge and logistic regression, taking the limit where the number of samples n and dimension d of the covariates grow at a comparable fixed rate α = n/d. Our findings are three-fold: i) resampling methods are fraught with problems in high dimensions and exhibit the double-descent-like behavior typical of these situations; ii) only when α is large enough do they provide consistent and reliable error estimations (we give convergence rates); iii) in the over-parametrized regime α < 1 relevant to modern machine learning practice, their predictions are not consistent, even with optimal regularization. This record provides the code to reproduce the numerical experiments of the related paper "Analysis of bootstrap and subsampling in high-dimensional regularized regression".
提供机构:
Materials Cloud
创建时间:
2025-06-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作