A Normality Test for High-dimensional Data based on a Nearest Neighbor Approach
收藏Taylor & Francis Group2024-02-15 更新2026-04-16 收录
下载链接:
https://tandf.figshare.com/articles/dataset/A_Normality_Test_for_High-dimensional_Data_based_on_a_Nearest_Neighbor_Approach/14963845/1
下载链接
链接失效反馈官方服务:
资源简介:
Many statistical methodologies for high-dimensional data assume the population is normal. Although a few multivariate normality tests have been proposed, to the best of our knowledge, none of them can properly control the type I error when the dimension is larger than the number of observations. In this work, we propose a novel nonparametric test that utilizes the nearest neighbor information. The proposed method guarantees the asymptotic type I error control under the high-dimensional setting. Simulation studies verify the empirical size performance of the proposed test when the dimension grows with the sample size and at the same time exhibit a superior power performance of the new test compared with alternative methods. We also illustrate our approach through two popularly used data sets in high-dimensional classification and clustering literatures where deviation from the normality assumption may lead to invalid conclusions.
提供机构:
Chen, Hao; Xia, Yin
创建时间:
2021-07-12



