Nonparametric high-dimensional multi-sample tests based on graph theory
收藏DataCite Commons2026-01-26 更新2024-08-19 收录
下载链接:
https://tandf.figshare.com/articles/dataset/Nonparametric_high-dimensional_multi-sample_tests_based_on_graph_theory/25871326/1
下载链接
链接失效反馈官方服务:
资源简介:
High-dimensional data pose unique challenges for data processing in an era of ever-increasing amounts of data availability. Graph theory can provide a structure of high-dimensional data. We introduce two key properties desirable for graphs in testing homogeneity. Roughly speaking, these properties may be described as: unboundedness of edge counts under the same distribution and boundedness of edge counts under different distributions. It turns out that the minimum spanning tree violates these properties but the shortest Hamiltonian path posses them. Based on the shortest Hamiltonian path, we propose two combinations of edge counts in multiple samples to test for homogeneity. We give the permutation null distributions of proposed statistics when sample sizes go to infinity. The power is analyzed by assuming both sample sizes and dimensionality tend to infinity. Simulations show that our new tests behave very well overall in comparison with various competitors. Real data analysis of tumors and images further convince the value of our proposed tests. Software implementing the test is available in the R package GRelevance. Supplemental materials are available online.
提供机构:
Taylor & Francis
创建时间:
2024-05-21



