ivanjaenm/ot-dataset_bins50_size1M_prec6
收藏Hugging Face2025-09-30 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/ivanjaenm/ot-dataset_bins50_size1M_prec6
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含五个特征字段:源距离(source_dist)、目标距离(target_dist)、输入索引(input_idx)、总输入(total_input)和输出映射稀疏表示(output_map_sparse)。数据集分为训练集和测试集,其中训练集包含4000万个示例,大小为66.4GB;测试集包含1000万个示例,大小为16.6GB。数据集总共大小约为83.0GB。
The dataset includes five feature fields: source distance (source_dist), target distance (target_dist), input index (input_idx), total input (total_input), and output map sparse representation (output_map_sparse). The dataset is split into a training set and a test set, with the training set containing 40 million examples and is 66.4GB in size, while the test set contains 10 million examples and is 16.6GB in size. The total size of the dataset is approximately 83.0GB.
提供机构:
ivanjaenm



