kothasuhas/dclm_tokenized_n3681260_ctx64_commbin5
收藏Hugging Face2025-10-14 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/kothasuhas/dclm_tokenized_n3681260_ctx64_commbin5
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含整数序列特征,适用于训练机器学习模型。它包括一个训练集,共有3681260个样本,数据大小为957127600字节。提供了默认配置,方便用户获取训练数据。
The dataset includes integer sequence features suitable for training machine learning models. It consists of a training set with 3,681,260 samples and a data size of 957,127,600 bytes. A default configuration is provided for easy access to the training data.
提供机构:
kothasuhas



