medieval-data/medieval-latin-ner-HOME-Alcar
收藏数据集概述
数据集信息
- 名称: HOME-Alcar: Aligned and Annotated Cartularies
- 作者:
- Stutzmann, Dominique
- Torres Aguilar, Sergio
- Chaffenet, Paul
- 发布日期: 2021年11月
- 发布者: Zenodo
- DOI: 10.5281/zenodo.5600884
- 许可证: CC BY 4.0
数据集描述
- 内容: 该数据集包含嵌套的实体标注,主要用于命名实体识别(NER)任务。数据集中的实体类型包括地点(LOC)和人物(PERS)。
- 格式转换: 数据集从原始格式转换为spaCy格式,使用
convert.ipynb笔记本进行转换。 - 使用建议: 由于数据集包含嵌套的实体标注,建议使用spaCy的SpanCat管道进行处理。
示例数据
json { "text": "In nomine Domini , amen . Ego Mauricius , Dei gracia Parisiensis episcopus , universitati presencium ac futurorum hujus pagine attestatione notificare curamus quod dominus Guido de Levies , pia et honesta consideratione ductus , ad edificandam quandam novellam plantationem , amore Dei et remedio anime sue et animarum parentum predecessorum suorum , fratribus ibi Deo servituris in perpetuam elemosinam donavit unam carrucam de terra quam emit des Fers Dasnois , et de decima duas partes quas ab hiisdem emit , et unam partem nemoris quantum semita dividit versus terram datam ; hanc elemosinam in manu nostra resignatam benigne tribuit . Sciendum autem quod de hac elemosina investivimus Guidonem , quondam presbiterum de Meencort , pro se et pro aliis ibi Deo se reddituris . Actum apud Sanctum Victorem , astantibus Petro , precentore Parisiensi ; Nicholao , presbitero ; Philippo , canonico ; Haimerico , capellano nostro ; Enardo , presbitero de Balneolis ; fratre Stephano de Monte-Fermeolo ; incarnationis dominice anno millesimo centesimo XCu00ba , episcopatus nostri tricesimo sexto . ", "spans": [ {"text": "Parisiensis", "label": "LOC", "start": 11, "end": 12}, {"text": "Levies", "label": "LOC", "start": 27, "end": 28}, {"text": "Fers Dasnois", "label": "LOC", "start": 68, "end": 70}, {"text": "Meencort", "label": "LOC", "start": 113, "end": 114}, {"text": "Sanctum Victorem", "label": "LOC", "start": 127, "end": 129}, {"text": "Parisiensi", "label": "LOC", "start": 134, "end": 135}, {"text": "Balneolis", "label": "LOC", "start": 153, "end": 154}, {"text": "Monte-Fermeolo", "label": "LOC", "start": 158, "end": 159}, {"text": "Mauricius", "label": "PERS", "start": 7, "end": 8}, {"text": "Guido de Levies", "label": "PERS", "start": 25, "end": 28}, {"text": "Guidonem", "label": "PERS", "start": 108, "end": 109}, {"text": "Petro", "label": "PERS", "start": 131, "end": 132}, {"text": "Nicholao", "label": "PERS", "start": 136, "end": 137}, {"text": "Philippo", "label": "PERS", "start": 140, "end": 141}, {"text": "Haimerico", "label": "PERS", "start": 144, "end": 145}, {"text": "Enardo", "label": "PERS", "start": 149, "end": 150}, {"text": "Stephano de Monte-Fermeolo", "label": "PERS", "start": 156, "end": 159} ], "ms": "Notre_Dame_Roche_Paris_BnF_10996" }
引用
plaintext @dataset{stutzmann_2021_5600884, author = {Stutzmann, Dominique and Torres Aguilar, Sergio and Chaffenet, Paul}, title = {HOME-Alcar: Aligned and Annotated Cartularies}, month = nov, year = 2021, publisher = {Zenodo}, doi = {10.5281/zenodo.5600884}, url = {https://doi.org/10.5281/zenodo.5600884} }



