cjiao/wikitext-tokenized
收藏Hugging Face2024-10-30 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/cjiao/wikitext-tokenized
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个分割,包括train_512、train_2048、validation_512和validation_2048,每个分割都有特定的字节大小和示例数量。数据集的特征包括input_ids和labels,分别使用int32和int64数据类型。总下载大小为470561711字节,数据集总大小为1414956616字节。
The dataset includes multiple splits such as train_512, train_2048, validation_512, and validation_2048, each with specific byte sizes and number of examples. The features of the dataset include input_ids and labels, using int32 and int64 data types respectively. The total download size is 470561711 bytes, and the total dataset size is 1414956616 bytes.
提供机构:
cjiao



