five

glossAPI/1000_prwta_xronia_ellhnikhs

收藏
Hugging Face2025-01-16 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/glossAPI/1000_prwta_xronia_ellhnikhs
下载链接
链接失效反馈
官方服务:
资源简介:
本数据集包含古希腊语文本,时间跨度从古代到公元250年。它包括各种文本类型的片段,如悲剧、喜剧、史诗/抒情诗、科学和哲学文本以及宗教文本。此外,还包括词汇注释和解释。在某些情况下,保留了拼写,但删除了脚注和阿拉伯数字,以保持文本的连贯性。数据集从XML格式转换为parquet格式。

In this dataset, the user will find Greek texts of the Ancient Greek era till 250 CE. It includes fragments from less well-known texts (tragedy, comedy, epic/lyric poetry, scientific and philosophical texts) along with religious ones. There are also commentary archives. It is worth mentioning that, in some cases, the spelling is maintained but the footnotes and the Arabic numbers have been erased, in order for us to maintain the adequacy of the texts. The archives can be found at https://github.com/OpenGreekAndLatin/First1KGreek as XML, and here are uploaded as parquet.
提供机构:
glossAPI
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作