five

stepp1/is_sparse_15d

收藏
Hugging Face2025-11-18 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/stepp1/is_sparse_15d
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit task_categories: - tabular-classification language: - en tags: - synthetic - sparse-learning - classification size_categories: - 100K<n<1M --- # is_sparse/sparse15d ## Dataset Description This is a synthetic 15-dimensional classification dataset designed for sparse learning research. The dataset contains 3 classes and is specifically designed to have sparse optimal representations, where only a subset of features are informative for the classification task. ### Dataset Summary - **Variant**: sparse15d - **Features**: 15 continuous features - **Classes**: 3 - **Entropy(Y)**: 1.4855 - **Mutual Information (joint)**: 1.1819 - **Maximum Achievable Accuracy**: 0.8967 ## Dataset Structure ### Data Instances Each instance consists of: - `data`: A 15-dimensional feature vector (float32) - `label`: An integer class label (0, 1, or 2) ### Data Splits | Split | Number of Instances | |-------|---------------------| | Train | Variable (see below) | | Test | Variable (see below) | ## Dataset Creation This dataset was synthetically generated for research on sparse learning and optimal feature selection. The mutual information values between feature subsets and labels are provided in the metadata. ### Mutual Information Structure The dataset includes ground-truth mutual information values for various feature subsets, enabling: - Feature importance analysis - Information-theoretic learning algorithms - Benchmarking of MI estimation methods Key MI values: - joint: 1.1819 - 1: 0.3273 - 1-2: 0.3273 - 1-2-3: 0.6634 - 1-2-3-4: 0.6634 - 1-2-3-4-5: 1.1819 - 1-2-3-5: 1.1819 - 1-2-4: 0.3273 - 1-2-4-5: 1.0492 - 1-2-5: 1.0492 ## Citation If you use this dataset, please cite the associated research paper (to be added). ## License MIT License
提供机构:
stepp1
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作