five

Information Sharing for Robust and Stable Cross-Validation

收藏
Figshare2025-08-04 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Information_Sharing_for_Robust_and_Stable_Cross-Validation/29821430
下载链接
链接失效反馈
官方服务:
资源简介:
Robust estimators for linear regression require non-convex objective functions to shield against adverse effects of contamination, including outliers. This non-convexity brings challenges, particularly when combined with penalization in high-dimensional settings. A crucial challenge is selecting hyperparameters for the penalty based on a finite sample. In practice, cross-validation (CV) is the prevalent strategy with good performance for convex estimators. Applied with robust estimators, however, CV often gives subpar results due to the interplay between multiple local minima and the penalty. The best local minimum attained on the full training data may not be the minimum with the desired statistical properties. Furthermore, there may be a mismatch between this minimum and the minima attained in the CV folds which are used for evaluating the prediction error. This article introduces a novel adaptive CV strategy that tracks multiple minima for each combination of hyperparameters and subsets of the data. A matching scheme is presented for correctly evaluating minima computed on the full training data using the best-matching minima from the CV folds. We show that the proposed strategy reduces the variability of the estimated performance metric, leads to smoother CV curves, and therefore substantially increases the reliability and utility of robust penalized estimators. Supplementary materials for this article are available online.
创建时间:
2025-08-04
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作