Fractional Cross-Validation for Optimizing Hyperparameters of Supervised Learning Algorithms
收藏DataCite Commons2025-07-29 更新2025-09-08 收录
下载链接:
https://tandf.figshare.com/articles/dataset/Fractional_cross-validation_for_optimizing_hyperparameters_of_supervised_learning_algorithms/29269838/2
下载链接
链接失效反馈官方服务:
资源简介:
K-fold cross-validation (CV) is a robust method for estimating generalization performance of supervised learning models. Although CV is more reliable than using a single hold-out test set, it is also more computationally expensive since the model must be fit K times. This can be prohibitive when optimizing the hyperparameters, since this involves conducting K-fold CV repeatedly at many hyperparameter configurations. In this work, we propose a highly-efficient Bayesian optimization algorithm for optimizing the hyperparameters of supervised learning algorithms with K-fold CV error as the evaluation criterion. Our approach exploits the fact that the single-fold out-of-sample error is pairwise correlated across different hyperparameter configurations. We introduce a hierarchical Gaussian process model that is well-suited to accommodate this inherent correlation structure across folds and across the hyperparameter space. Our resulting algorithm requires evaluating only a single fold for many hyperparameter configurations, enabling us to efficiently find the optimal hyperparameters. We refer to this as “fractional CV”, since it requires only a small fraction of the folds to be evaluated, relative to what is required for full K-fold CV. We demonstrate the efficacy of our method on a number of models and real datasets.
提供机构:
Taylor & Francis
创建时间:
2025-07-29



