five

Estimating Number of Factors by Adjusted Eigenvalues Thresholding

收藏
DataCite Commons2022-06-08 更新2024-07-28 收录
下载链接:
https://tandf.figshare.com/articles/dataset/Estimating_Number_of_Factors_by_Adjusted_Eigenvalues_Thresholding/12984285/1
下载链接
链接失效反馈
官方服务:
资源简介:
Determining the number of common factors is an important and practical topic in high-dimensional factor models. The existing literature is mainly based on the eigenvalues of the covariance matrix. Owing to the incomparability of the eigenvalues of the covariance matrix caused by the heterogeneous scales of the observed variables, it is not easy to find an accurate relationship between these eigenvalues and the number of common factors. To overcome this limitation, we appeal to the correlation matrix and demonstrate, surprisingly, that the number of eigenvalues greater than 1 of the population correlation matrix is the same as the number of common factors under certain mild conditions. To utilize such a relationship, we study random matrix theory based on the sample correlation matrix in order to correct biases in estimating the top eigenvalues and to take into account of estimation errors in eigenvalue estimation. Thus, we propose a tuning-free scale-invariant adjusted correlation thresholding (ACT) method for determining the number of common factors in high-dimensional factor models, taking into account the sampling variabilities and biases of top sample eigenvalues. We also establish the optimality of the proposed ACT method in terms of minimal signal strength and the optimal threshold. Simulation studies lend further support to our proposed method and show that our estimator outperforms competing methods in most test cases.

高维因子模型中,公共因子个数的确定是一项兼具重要性与实用性的研究课题。现有相关研究主要基于协方差矩阵的特征值,但由于观测变量的量纲异质性,导致协方差矩阵的特征值不具备可比性,因此难以建立这些特征值与公共因子个数之间的准确关联。为克服这一局限,本文转而采用相关矩阵展开研究,并意外发现:在若干温和正则条件下,总体相关矩阵中大于1的特征值个数与公共因子个数完全一致。为利用这一关联,本文基于样本相关矩阵展开随机矩阵理论研究,以校正顶层特征值估计中的偏差,并充分考虑特征值估计过程中的误差。据此,本文提出一种无需调参的尺度不变校正相关阈值法(adjusted correlation thresholding, ACT),用于高维因子模型的公共因子个数确定,该方法充分考虑了样本顶层特征值的抽样变异性与估计偏差。此外,本文还从最小信号强度与最优阈值两个维度,证明了所提ACT方法的最优性。仿真实验进一步验证了所提方法的有效性,并表明在绝大多数测试场景下,本文提出的估计器性能优于现有同类竞争方法。
提供机构:
Taylor & Francis
创建时间:
2020-09-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作