five

Variable selection in linear regressions with possibly all strongly correlated covariates

收藏
Figshare2025-02-02 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Variable_selection_in_linear_regressions_with_possibly_all_strongly_correlated_covariates/28329411
下载链接
链接失效反馈
官方服务:
资源简介:
Penalized regression methods, particularly LASSO, require the irrepresentable condition for variable selection consistency. This condition places an upper bound on the magnitudes of correlation between the signals and the rest of the covariates under consideration. Chudik, Kapetanios, and Pesaran (2018) proposed an alternative procedure called the one covariate at a time multiple testing (OCMT) that does not impose restrictions on the magnitude of these correlations. However, it requires that the number of covariates correlated with the signals grows at a rate less than the square root of the number of observations, denoted by T. Notably, the required conditions for both LASSO and OCMT can be challenged when possibly all the covariates under consideration are correlated with the signals. In this article, we follow the ideas from latent factor literature to adapt OCMT to allow for such scenarios. We refer to our proposed method as generalized one covariate at a time multiple testing (GOCMT). We establish that GOCMT selects a model that contains all the signals and none of the noise variables asymptotically. We also show that the least squares estimator of the post GOCMT selected model is T-consistent. The proposed method demonstrates promising finite-sample performance in our Monte Carlo experiments. An empirical application in risk factor selection within asset pricing further underscores the utility of our method.
创建时间:
2025-02-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作