five

Feature screening for ultrahigh-dimensional data via the adapted sliced Wasserstein correlation coefficient

收藏
中国科学数据2026-04-10 更新2026-04-25 收录
下载链接:
https://www.sciengine.com/AA/doi/10.1007/s11425-024-2410-6
下载链接
链接失效反馈
官方服务:
资源简介:
Various methods including model-based and model-free feature screening procedures have beendeveloped to detect active predictors for ultrahigh-dimensional data. The existing feature screening procedures mainly focus on the association between the scalar predictor and the response variable, and are not available for detecting the association between two probability measures, which are commonly encountered in various fields. To address this issue, we propose a novel model-free feature screening procedure for ultrahigh-dimensional data based on the Wasserstein correlation coefficient. Its key merits include that (i) it can be applied to various types of covariates and response variables including univariate and multivariate discrete, categorical and continuous variables; (ii) it is robust to outliers or heavy-tailed data because of only distribution functions involved in evaluating the Wasserstein distance; (iii) it works well for the linear or nonlinear relationship between response variables and predictors. Under some mild conditions, we establish sure screening and rank consistency properties for the proposed screening procedure without imposing any moment conditions on the covariates. Simulation studies are implemented to assess the finite sample performance of the proposed method.A real-world example is used to illustrate the proposed procedure.
创建时间:
2025-04-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作