tiny_shakespeare
Source: OpenCSG | Updated: 2024-07-19 | Indexed: 2026-01-19
Download link:
https://opencsg.com/datasets/AIWizards/tiny_shakespeare?tab=summary
Description:
TinyShakespeare contains approximately 40,000 lines of text from Shakespeare's plays and is intended mainly for character-level language modeling and similar tasks. The data comes from a blog post by Andrej Karpathy and is split into training, validation, and test sets. Each sample consists of a single text field. The repository provides standardized data operations to support text processing and model training.
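As a rough illustration of the character-level modeling use case, the sketch below builds a character vocabulary from a sample's text field, encodes it as integer ids, and cuts the ids into three contiguous splits. It is a minimal sketch, not the repository's own tooling: the short inline string stands in for the real corpus, the helper names (`build_vocab`, `encode`, `decode`, `split_ids`) are hypothetical, and the 90/5/5 split fractions are an assumption, since the listing does not state the actual split sizes.

```python
# Stand-in for one dataset sample; the field name "text" follows the
# description above, but the content here is only a short excerpt.
sample = {"text": "First Citizen:\nBefore we proceed any further, hear me speak.\n"}


def build_vocab(text):
    """Map each distinct character to an integer id (sorted for determinism)."""
    chars = sorted(set(text))
    stoi = {ch: i for i, ch in enumerate(chars)}
    itos = {i: ch for ch, i in stoi.items()}
    return stoi, itos


def encode(text, stoi):
    """Turn a string into a list of character ids."""
    return [stoi[ch] for ch in text]


def decode(ids, itos):
    """Turn a list of character ids back into a string."""
    return "".join(itos[i] for i in ids)


def split_ids(ids, train_frac=0.9, val_frac=0.05):
    """Contiguous train/validation/test split; the fractions are assumed."""
    n = len(ids)
    train_end = int(n * train_frac)
    val_end = train_end + int(n * val_frac)
    return ids[:train_end], ids[train_end:val_end], ids[val_end:]


text = sample["text"]
stoi, itos = build_vocab(text)
ids = encode(text, stoi)
train_ids, val_ids, test_ids = split_ids(ids)
```

Because the dataset already ships with its own training, validation, and test sets, `split_ids` would only be needed when working from a single raw text file.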
Provider:
AIWizards
Created:
2024-07-19



