Distributed Estimation of Principal Support Vector Machines for Sufficient Dimension Reduction

Source: DataCite Commons. Updated: 2024-12-23. Indexed: 2024-11-05.
Download link:
https://tandf.figshare.com/articles/dataset/Distributed_Estimation_of_Principal_Support_Vector_Machines_for_Sufficient_Dimension_Reduction/27607014/1
Resource description:
The principal support vector machines method is a powerful tool for sufficient dimension reduction: it replaces the original predictors with low-dimensional linear combinations while preserving the information needed for regression and classification. However, its computational burden limits its use on massive data. To address this issue, we propose two distributed estimation algorithms, one naive and one refined, for fast implementation when the sample size is large. Both distributed sufficient dimension reduction estimators attain the same statistical efficiency as the estimator computed on the full pooled data, which provides rigorous statistical guarantees for their application to large-scale datasets. The refined method requires smaller batch sample sizes and is therefore more advantageous when memory on the distributed machines is limited. The two distributed algorithms are further adapted to principal weighted support vector machines for sufficient dimension reduction in binary classification. The statistical accuracy and computational complexity of the proposed methods are examined through comprehensive simulation studies and a real data application with more than 600,000 samples.
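The divide-and-combine idea in the description can be illustrated with a minimal sketch. It is not the authors' algorithm. It assumes that "naive" means averaging per-batch estimates, which the abstract does not spell out, and it swaps the hinge loss for a squared loss so that each batch has a closed-form solution. The names `local_directions`, `naive_distributed_sdr`, `lam`, and `cuts` are hypothetical, and the refined method is not shown.

```python
import numpy as np

def local_directions(X, y, cuts, lam=1.0):
    """Per-batch directions, one per cut point.

    Assumption: squared loss stands in for the hinge loss, which gives
    psi = lam / (1 + lam) * Sigma^{-1} cov(X, y_tilde) in closed form.
    """
    n = X.shape[0]
    Xc = X - X.mean(axis=0)              # centre predictors within the batch
    Sigma = Xc.T @ Xc / n                # batch sample covariance
    cols = []
    for q in cuts:
        y_tilde = np.where(y <= q, -1.0, 1.0)        # dichotomise the response
        cov_xy = Xc.T @ (y_tilde - y_tilde.mean()) / n
        cols.append(lam / (1.0 + lam) * np.linalg.solve(Sigma, cov_xy))
    return np.column_stack(cols)         # shape (p, number of cuts)

def naive_distributed_sdr(batches, cuts, d, lam=1.0):
    """Average per-batch directions, then keep the top-d eigenvectors."""
    psi_bar = np.mean([local_directions(X, y, cuts, lam) for X, y in batches], axis=0)
    M = psi_bar @ psi_bar.T              # candidate matrix
    _, eigvecs = np.linalg.eigh(M)       # eigenvalues come back in ascending order
    return eigvecs[:, ::-1][:, :d]       # leading d eigenvectors

# Toy data: the response depends on the first predictor only.
rng = np.random.default_rng(0)
X = rng.standard_normal((4000, 5))
y = X[:, 0] + 0.5 * rng.standard_normal(4000)
cuts = np.quantile(y, [0.25, 0.5, 0.75])             # cut points shared by all batches
batches = [(X[i::4], y[i::4]) for i in range(4)]     # four simulated "machines"
B = naive_distributed_sdr(batches, cuts, d=1)
```

On this toy data the estimated direction `B` should load almost entirely on the first predictor, up to sign. Sharing the cut points across batches keeps the per-batch directions comparable before they are averaged.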
Provider:
Taylor & Francis
Created:
2024-11-04