Nerthus-Project/Old_English-OEDT-NER
Source: Hugging Face. Updated 2025-09-26; indexed 2025-11-01.
Download link:
https://hf-mirror.com/datasets/Nerthus-Project/Old_English-OEDT-NER
Description:
This is a synthetically annotated named entity recognition (NER) corpus built from a raw 120,000-word corpus of Old English. It contains 6,401 tokens recognized as named entities. The texts include both prose and translations from Latin, such as Orosius, St. Mark's Gospel, Ælfric's Catholic Homilies I, The Anglo-Saxon Chronicle A, and legal texts. The treebank is a revised and expanded version of a manual annotation that was carried out to assess the performance of a computational model based on a Natural Language Processing library.
Provider:
Nerthus-Project



