CausalNLP/multilingual_tinystories_data_1
收藏Hugging Face2025-06-26 更新2025-11-01 收录
下载链接:
https://hf-mirror.com/datasets/CausalNLP/multilingual_tinystories_data_1
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了多种语言的文本数据,具体包括德语(deu)、法语(fra及其分片fra_shard_00030和fra_shard_00031)、阿拉伯语(arb)和中文(cmn)。每个语言的数据集都包含了若干示例,示例数量为2048,除德语数据集包含53248个示例。数据集的下载大小为1057920673字节,总大小为56251569字节。
The dataset consists of text data in various languages, including German (deu), French (fra and its shards fra_shard_00030 and fra_shard_00031), Arabic (arb), and Chinese (cmn). Each language dataset contains a certain number of examples, with 2048 examples per dataset except for the German dataset which contains 53248 examples. The download size of the dataset is 1057920673 bytes, and the total size is 56251569 bytes.
提供机构:
CausalNLP



