five

HeliPaD: the Heliand Parsed Database

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/4395039
下载链接
链接失效反馈
官方服务:
资源简介:
This corpus contains all 5,968 lines of the C manuscript of the Old Saxon Heliand, a gospel harmony written in alliterative verse, using the Sievers (1878) edition. Compared to the standard Behaghel critical edition, this one has the advantages for linguistic research that a) it does not conflate the different forms found in different manuscripts, b) it is not as heavily emended, and c) it is now in the public domain. The corpus is a UTF-8 plain text file designed to be searched using the program CorpusSearch 2, with the standard extension .psd, broadly following the format of the Penn Corpora of Historical English and related projects (IcePaHC, Early New High German Parsed Corpus, MCVF). It is annotated on a number of levels: Textual and metrical (page in manuscript, page in edition, line number, caesura) Lemmatization Parts of speech and morphology Syntactic parsing The total size of the corpus is 46,067 words (not including punctuation and code).
创建时间:
2024-07-19
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作