qundao/data-zh-poetry
收藏Hugging Face2026-04-06 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/qundao/data-zh-poetry
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-sa-4.0
task_categories:
- text-classification
- text-generation
language:
- zh
tags:
- poetry
- text
- literature
size_categories:
- 1M<n<10M
---
# Classical Chinese Poetry 中国古诗词
中国古典诗歌(包括少量文言文作品,以及现当代创作的古体诗词)。
资料汇聚自网络,目前约170万篇诗文,作者超过4万。
数据虽经进行粗略清洗,但仍错谬较多,相似文本重复、繁简混杂、PUA和乱码字符、内容残缺、作者错误等。
本资料仅供参考,古诗词学习参考正规出版社书籍或原始古籍。
主要参考资料:
- https://github.com/chinese-poetry/chinese-poetry
- https://github.com/javayhu/haitang/
- https://github.com/liuhuanyong/PoemMining
- https://github.com/open-chinese/poetry-collection
- https://github.com/Werneror/Poetry
- https://github.com/xiu-ze/Poetry/
- https://huggingface.co/datasets/erhwenkuo/poetry-chinese-zhtw
- https://huggingface.co/datasets/larryvrh/Chinese-Poems
- https://huggingface.co/datasets/zhangqiaobit/chinese_poetrys/tree/main
提供机构:
qundao



