aklein4/math-tokenized
Source: Hugging Face. Updated: 2025-07-21. Indexed: 2025-10-25.
Download link:
https://hf-mirror.com/datasets/aklein4/math-tokenized
Description:
The dataset contains tokenized examples for text processing tasks: input and output token ID sequences, along with the number of input and output tokens. The training set contains over 4.22 million examples, with a file size of approximately 4.15 GB. It is suited to training machine learning models, particularly for natural language processing tasks.
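As a minimal sketch of how such records might be read and sanity-checked, the Python below streams the training set with the Hugging Face `datasets` library and checks that stored token counts match the sequence lengths. The column names (`input_ids`, `output_ids`, `num_input_tokens`, `num_output_tokens`), the split name `train`, and the idea that each count equals the length of its sequence are all assumptions made for illustration; the listing does not give the actual schema, so check the dataset page before relying on them.

```python
from typing import Iterator

# Hypothetical column names: the listing only says each example carries
# input/output ID sequences and input/output token counts.
INPUT_IDS = "input_ids"
OUTPUT_IDS = "output_ids"
NUM_INPUT = "num_input_tokens"
NUM_OUTPUT = "num_output_tokens"


def counts_match(example: dict) -> bool:
    """Return True if the stored token counts equal the sequence lengths."""
    return (
        len(example[INPUT_IDS]) == example[NUM_INPUT]
        and len(example[OUTPUT_IDS]) == example[NUM_OUTPUT]
    )


def stream_train_examples(repo_id: str = "aklein4/math-tokenized") -> Iterator[dict]:
    """Lazily iterate over the (assumed) 'train' split instead of downloading ~4.15 GB."""
    # Imported here so counts_match works without the `datasets` package installed.
    from datasets import load_dataset

    return iter(load_dataset(repo_id, split="train", streaming=True))


# Toy record with made-up IDs, only to show the assumed shape.
toy = {INPUT_IDS: [101, 7, 42], OUTPUT_IDS: [9, 3], NUM_INPUT: 3, NUM_OUTPUT: 2}
print(counts_match(toy))
```

Streaming is used here because of the stated file size; calling `next(stream_train_examples())` would fetch a single example, whose keys can then be compared against the assumed column names.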
Provided by:
aklein4



