
Cross-validation to select the optimum rank for a reduced-rank approximation to multivariate data

Taylor & Francis Group. Updated: 2024-06-20. Indexed: 2026-04-16
Download link:
https://tandf.figshare.com/articles/dataset/Cross-validation_to_select_the_optimum_rank_for_a_reduced-rank_approximation_to_multivariate_data/25859602/1
Resource description:
In this paper we consider the Gabriel form of cross-validation (CV) and we investigate how to estimate the optimum rank for lower rank approximations of any dataset that can be written in matrix form, with particular application in multivariate analysis and in the analysis of multienvironment trials. The literature related to the method suggests that it can produce overfitting and poor-quality predictions, characteristics that result in overestimation of the rank. Because of this, it is proposed to change the rank selection criterion, testing thirteen statistics both in the original method and in four proposed extensions that seek to solve the above problems. A comparison is made with two gold standard methods for CV through a simulation study and through the analysis of seventeen real datasets, two of which are general multivariate and fifteen are from experiments with genotype-by-environment interaction. It is concluded that from a predictive point of view, the highest accuracy in estimating the rank is obtained by using a regularized singular value decomposition.
Provided by:
Arciniegas-Alarcón, Sergio; García-Peña, Marisol; Rengifo, Camilo; Krzanowski, Wojtek J.
Created:
2024-05-20
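
The resource description refers to the Gabriel form of cross-validation for choosing the rank of a low-rank approximation. As a rough illustration only, the sketch below implements the classic leave-one-element-out scheme usually attributed to Gabriel: for each cell (i, j), row i and column j are deleted, the remaining submatrix is decomposed by SVD, and the held-out cell is predicted from the deleted row and column. The function name `gabriel_press`, the `max_rank` parameter, and the choice of mean squared prediction error as the selection statistic are assumptions made for this example. It does not reproduce the thirteen statistics, the four extensions, or the regularized SVD examined in the paper.

```python
import numpy as np


def gabriel_press(X, max_rank):
    """Mean squared prediction error for ranks 1..max_rank.

    Hypothetical helper: a plain Gabriel-style holdout, not the
    modified criteria studied in the paper.
    """
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    if not 1 <= max_rank <= min(n - 1, p - 1):
        raise ValueError("max_rank must lie in 1..min(n, p) - 1")

    press = np.zeros(max_rank)
    for i in range(n):
        rows = np.arange(n) != i
        for j in range(p):
            cols = np.arange(p) != j
            # SVD of the matrix with row i and column j removed.
            U, d, Vt = np.linalg.svd(X[np.ix_(rows, cols)],
                                     full_matrices=False)
            # Project the deleted row (without cell j) onto V and the
            # deleted column (without cell i) onto U.
            row_part = X[i, cols] @ Vt[:max_rank].T
            col_part = U[:, :max_rank].T @ X[rows, j]
            # Contribution of each component to the prediction of
            # x_ij; the cumulative sum gives the rank-m prediction.
            terms = row_part * col_part / d[:max_rank]
            preds = np.cumsum(terms)
            press += (X[i, j] - preds) ** 2
    return press / (n * p)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic data: rank-2 signal plus small noise (illustrative only).
    signal = rng.normal(size=(12, 2)) @ rng.normal(size=(2, 8))
    data = signal + 0.01 * rng.normal(size=signal.shape)
    errors = gabriel_press(data, max_rank=4)
    print("Prediction error by rank:", errors)
    print("Rank with smallest error:", int(np.argmin(errors)) + 1)
```

Picking the rank with the smallest error is the simplest possible rule. The description notes that this form of cross-validation tends to overestimate the rank, which is the motivation for the alternative criteria the authors compare.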