five

Category-Adaptive Variable Screening for Ultra-High Dimensional Heterogeneous Categorical Data

收藏
DataCite Commons2023-08-16 更新2024-07-27 收录
下载链接:
https://tandf.figshare.com/articles/dataset/Category-Adaptive_Variable_Screening_for_Ultra-high_Dimensional_Heterogeneous_Categorical_Data/7819544
下载链接
链接失效反馈
官方服务:
资源简介:
The populations of interest in modern studies are very often heterogeneous. The population heterogeneity, the qualitative nature of the outcome variable and the high dimensionality of the predictors pose significant challenge in statistical analysis. In this article, we introduce a category-adaptive screening procedure with high-dimensional heterogeneous data, which is to detect category-specific important covariates. The proposal is a model-free approach without any specification of a regression model and an adaptive procedure in the sense that the set of active variables is allowed to vary across different categories, thus making it more flexible to accommodate heterogeneity. For response-selective sampling data, another main discovery of this article is that the proposed method works directly without any modification. Under mild regularity conditions, the newly procedure is shown to possess the sure screening and ranking consistency properties. Simulation studies contain supportive evidence that the proposed method performs well under various settings and it is effective to extract category-specific information. Applications are illustrated with two real datasets. Supplementary materials for this article are available online.
提供机构:
Taylor & Francis
创建时间:
2019-03-08
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作