Named Entity Recognition in the Regesta of Emperor Frederick III. of the Holy Roman Empire
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/8319313
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains Named Entity Recognition and Entity Linking across 8883 Regesta of Emperor Frederick III. of the Holy Roman Empire. The Regesta are from the Regesta Imperii Edition project: http://www.regesta-imperii.de/en/the-project.html.
There are 4554 different names normalized to 2849 distinct entities, identified by the index entry identifier of the yet to be fully published index of the Regesta of Frederick III. All in all, 17258 instances of names are identified.
Not all named entities are identified!
This dataset is intended as a step stone for recording already identified entities and training NER classifiers for the Regesta Imperii. Some named entities were deliberately omitted.
Only those that could be identified with reasonable certainty via the index were included, for example: "Mgf. Albrecht von Brandenburg" is included, while "Albrecht", even when refering to the same entity as the former name, is not. Even of this subset, some entities may have been missed. A newer version may rectify that in the future.
The JSONL format is modeled after the Doccano format for entity linking.
创建时间:
2023-09-05



