JoeyLLM/Gutenberg-Australia-Chunks-1024
Source: Hugging Face. Updated: 2025-04-24. Indexed: 2025-10-18.
Download link:
https://hf-mirror.com/datasets/JoeyLLM/Gutenberg-Australia-Chunks-1024
Description:
The dataset consists of three splits: train, validation, and test. Each split stores its examples as sequences of int32 values. The train split contains 61332 examples (251461200 bytes), the validation split contains 11435 examples (46883500 bytes), and the test split contains 3832 examples (15711200 bytes). The full dataset is 314055900 bytes, and the download size is 156021331 bytes.
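The reported figures can be cross-checked with a little arithmetic. The sketch below is only an illustration: the numbers are copied from the description above, while the names `SPLITS`, `bytes_per_example`, and `int32_values_per_example` are hypothetical helpers and not part of the dataset or of any library. It also assumes the reported byte counts cover only the int32 payload with no per-example overhead, which the listing does not confirm.

```python
# Split statistics copied from the dataset description.
SPLITS = {
    "train": {"examples": 61332, "bytes": 251461200},
    "validation": {"examples": 11435, "bytes": 46883500},
    "test": {"examples": 3832, "bytes": 15711200},
}
TOTAL_BYTES = 314055900      # reported size of the full dataset
DOWNLOAD_BYTES = 156021331   # reported download size
INT32_BYTES = 4              # an int32 value occupies 4 bytes


def bytes_per_example(split: str) -> float:
    """Average number of bytes per example in the given split."""
    stats = SPLITS[split]
    return stats["bytes"] / stats["examples"]


def int32_values_per_example(split: str) -> float:
    """Implied int32 values per example.

    Assumption: the reported bytes are pure int32 payload with no overhead.
    """
    return bytes_per_example(split) / INT32_BYTES


if __name__ == "__main__":
    for name in SPLITS:
        print(name, bytes_per_example(name), int32_values_per_example(name))

    # The split sizes should add up to the reported total.
    print("sum of splits:", sum(s["bytes"] for s in SPLITS.values()))
    print("reported total:", TOTAL_BYTES)

    # Ratio of download size to in-memory size.
    print("download / total:", DOWNLOAD_BYTES / TOTAL_BYTES)
```

Under these assumptions every split works out to the same number of bytes per example, and the three split sizes add up to the reported total, so the figures in the description are internally consistent.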
Provider:
JoeyLLM



