schuler/TinyStories4Pascal-Tokenized-v2
Source: Hugging Face. Updated 2024-09-16. Indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/schuler/TinyStories4Pascal-Tokenized-v2
Description:
The Tiny Stories Dataset Reprocessed for Pascal Developers contains short stories synthetically generated by GPT-3.5 and GPT-4 using only a small vocabulary. It has been reprocessed so that Pascal developers can use it, and consists of two CSV files: a vocabulary and the tokenized dataset.
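As a rough illustration of how a vocabulary file and a tokenized file fit together, the sketch below decodes token IDs back into text. The CSV layout is an assumption made for this example only (one token per vocabulary row with the row index as its ID, and one story per tokenized row as comma-separated IDs); the real files may use different names, columns, or separators, so check them before adapting this. The inline strings `VOCAB_CSV` and `TOKENIZED_CSV` are hypothetical stand-ins for the two files.

```python
import csv
import io

# Hypothetical stand-ins for the two CSV files. The real file names and
# column layout are not stated in this listing and may differ.
VOCAB_CSV = "the\ncat\nsat\n"    # assumed: one token per row, row index = token ID
TOKENIZED_CSV = "0,1,2\n1,2\n"   # assumed: one story per row, comma-separated IDs


def load_vocab(text):
    """Return a list where position i holds the token string for ID i."""
    return [row[0] for row in csv.reader(io.StringIO(text)) if row]


def decode_stories(text, vocab):
    """Map each row of integer token IDs back to a space-joined string."""
    stories = []
    for row in csv.reader(io.StringIO(text)):
        if not row:
            continue  # skip blank lines
        stories.append(" ".join(vocab[int(tok)] for tok in row))
    return stories


vocab = load_vocab(VOCAB_CSV)
stories = decode_stories(TOKENIZED_CSV, vocab)
```

With real files, the same two functions could be fed the contents of `open(path, newline="").read()` once the actual layout has been confirmed.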
Provider:
schuler



