five

wordgrammer/The_Entire_Western_Canon

收藏
Hugging Face2024-08-31 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/wordgrammer/The_Entire_Western_Canon
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含了西方经典著作的一部分,包括《圣经》的多个译本和各种电子书。这些电子书是通过筛选包含特定关键词的书籍列表得到的,关键词包括《伊利亚特》、《奥德赛》、欧几里得、柏拉图、亚里士多德等。数据集经过进一步处理以去除噪音,但仍可能包含一些不相关的书籍、重复内容、格式错误以及贡献者的个人信息。数据集的目标是逐步完善成一个更好的开源西方经典著作数据集。

A dataset containing the entire Western Canon, or at least a part of it. The dataset includes several translations of the Bible and various ebooks. The author constructed this dataset by filtering a large list of ebooks to only include those that contained one of the specified keywords, which are names and titles of famous Western literary and philosophical works. The dataset contains approximately 500 ebooks, but it is not a complete list and still contains some noise data such as emails and phone numbers. The books in the dataset may contain a mix of the authors native language and English translations, and there may be duplicates and formatting errors. The author hopes to cultivate a better open-source dataset of the entire Western Canon over time.
提供机构:
wordgrammer
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作