HayatoHongo/TinyStories
Source: Hugging Face. Updated: 2025-12-14. Indexed: 2025-11-15.
Download link:
https://hf-mirror.com/datasets/HayatoHongo/TinyStories
Description:
The dataset consists of short stories synthetically generated by GPT-3.5 and GPT-4 that use only a small vocabulary. The stories are described in the paper https://arxiv.org/abs/2305.07759. The models associated with this dataset can be found on Hugging Face at roneneldan/TinyStories-1M/3M/8M/28M/33M/1Layer-21M. Additional resources include a tar file containing all stories together with their metadata and the prompts used to create each story, as well as TinyStoriesV2, a newer version of the dataset based on GPT-4 generations.
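The model reference in the description is a compact shorthand for several repositories. A minimal sketch of how it can be expanded into individual repository names is shown below; the helper name `expand_model_ids` is hypothetical, and the code assumes the shorthand means one shared owner and prefix followed by slash-separated size suffixes.

```python
# Shorthand copied from the dataset description.
SHORTHAND = "roneneldan/TinyStories-1M/3M/8M/28M/33M/1Layer-21M"


def expand_model_ids(shorthand):
    """Expand 'owner/Prefix-A/B/C' into ['owner/Prefix-A', 'owner/Prefix-B', ...].

    Hypothetical helper: assumes the first '/' separates the owner from the
    rest, and the first '-' in the first name separates the shared prefix
    from its size suffix.
    """
    owner, rest = shorthand.split("/", 1)
    parts = rest.split("/")
    # The first part carries the shared prefix, e.g. 'TinyStories-1M'.
    prefix, first_suffix = parts[0].split("-", 1)
    suffixes = [first_suffix] + parts[1:]
    return [f"{owner}/{prefix}-{suffix}" for suffix in suffixes]


if __name__ == "__main__":
    for model_id in expand_model_ids(SHORTHAND):
        print(model_id)
```

Each resulting name, such as roneneldan/TinyStories-1M, is intended to be read as a separate model repository on Hugging Face.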
Provider:
HayatoHongo



