Onuii/DAMI-Pretrain-Dataset-Tokenize8192
Source: Hugging Face | Updated: 2025-04-02 | Indexed: 2025-04-12
Download link:
https://hf-mirror.com/datasets/Onuii/DAMI-Pretrain-Dataset-Tokenize8192
Description:
The dataset contains text data intended for training machine learning models. It has a single training split of 51,522 text examples, and the total dataset size is approximately 1.1 GB. The default configuration specifies the file path of the training data.
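Since the listing describes a single training split served through the default configuration, loading it might look like the sketch below. This is a minimal illustration, not the provider's documented usage: it assumes the Hugging Face `datasets` library is installed, and the names `REPO_ID`, `EXPECTED_TRAIN_ROWS`, `load_train_split`, and `matches_listing` are hypothetical helpers introduced here. The column names are not given in the listing, so the code does not assume any.

```python
# Repository ID taken from the listing; the row count is the figure the listing reports.
REPO_ID = "Onuii/DAMI-Pretrain-Dataset-Tokenize8192"
EXPECTED_TRAIN_ROWS = 51_522


def load_train_split(repo_id: str = REPO_ID):
    """Load the training split using the dataset's default configuration.

    The import is local so that this module can be imported without the
    `datasets` package; calling the function requires it and network access.
    """
    from datasets import load_dataset

    # Omitting the config name selects the default configuration.
    return load_dataset(repo_id, split="train")


def matches_listing(num_rows: int) -> bool:
    """Check a loaded row count against the number reported in the listing."""
    return num_rows == EXPECTED_TRAIN_ROWS
```

Calling `ds = load_train_split()` downloads roughly 1.1 GB; afterwards `ds.column_names` shows the available fields and `matches_listing(len(ds))` compares the row count with the listing. To fetch through the mirror shown in the download link, setting the `HF_ENDPOINT` environment variable to `https://hf-mirror.com` before importing the library is the usual approach, though that is an assumption about the reader's setup rather than something the listing states.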
Provider:
Onuii



