kurumikz/animepedia

Source: Hugging Face. Updated 2026-04-28; indexed 2026-05-03.
Download link:
https://hf-mirror.com/datasets/kurumikz/animepedia
Description:
Animepedia is a large-scale, multi-format English text dataset covering 22,099 unique anime titles, designed for language model training, instruction tuning, and encyclopedic knowledge injection. Every title is represented in at least three distinct writing styles (textbook, qa, narrative), so the corpus is structurally complete, with zero corrupt records and zero entries below the minimum quality threshold. The dataset spans the full history of Japanese animation, from golden-age classics to modern seasonal releases. AI was used only for text generation; all other aspects (architecture, filtering, verification, etc.) were built and controlled externally. The README also includes detailed statistics on word counts, genre distribution, and other metrics, along with intended uses and limitations.
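The "at least three styles per title" property can be checked mechanically. The following is a minimal sketch, not the dataset author's tooling: the field names `title` and `style`, the helper `titles_missing_styles`, and the sample rows are all assumptions made for illustration, since the listing does not document the actual schema.

```python
from collections import defaultdict

# Style labels named in the description above.
REQUIRED_STYLES = {"textbook", "qa", "narrative"}


def titles_missing_styles(records, title_key="title", style_key="style"):
    """Return {title: missing styles} for titles lacking a required style.

    `title_key` and `style_key` are hypothetical field names; adjust them
    to whatever the real records use.
    """
    seen = defaultdict(set)
    for rec in records:
        seen[rec[title_key]].add(rec[style_key])
    return {
        title: REQUIRED_STYLES - styles
        for title, styles in seen.items()
        if not REQUIRED_STYLES <= styles
    }


# Made-up placeholder rows, not taken from the dataset.
sample = [
    {"title": "Example Title A", "style": "textbook"},
    {"title": "Example Title A", "style": "qa"},
    {"title": "Example Title A", "style": "narrative"},
    {"title": "Example Title B", "style": "textbook"},
    {"title": "Example Title B", "style": "qa"},
]

missing = titles_missing_styles(sample)
print(missing)  # {'Example Title B': {'narrative'}}
```

An empty result would indicate that every title in the checked records carries all three styles, which is what the description claims for the full corpus.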
Provider:
kurumikz