ptsv/tinystories_upsampled_tom_250k
收藏Hugging Face2025-03-22 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/ptsv/tinystories_upsampled_tom_250k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个字段:句子(sentences)、句子解析(tom_sentence_parses)和文本(text),均为字符串类型。数据集被划分为训练集,包含250,000个示例,总字节数为1,334,644,762字节。数据集的下载大小为548,852,137字节。尽管README未提供详细描述,但根据字段名称和配置信息,可以推断这是一个用于自然语言处理任务的数据集。
The dataset includes three fields: sentences, sentence parses, and text, all of which are string sequences. The dataset is split into a training set containing 250,000 examples, with a total size of 1,334,644,762 bytes. The download size of the dataset is 548,852,137 bytes. Although the README does not provide a detailed description, based on the field names and configuration information, it can be inferred that this is a dataset for natural language processing tasks.
提供机构:
ptsv



