SrpELTeC-gold - Named Entity Recognition Training corpus for Serbian
收藏DataCite Commons2022-06-01 更新2024-07-13 收录
下载链接:
https://live.european-language-grid.eu/catalogue/corpus/9485
下载链接
链接失效反馈官方服务:
资源简介:
<p>The selection of 11 full novels and excerpts from 15 novels from Serbian literary corpus of novels written more than a century ago, have been automatically labelled with SrpNER system for Serbian in the first stage of the gold standard preparation. Based on the specifically tailored guidelines, different evaluators performed careful checks and corrections, yielding a gold standard (SrpELTeC-gold). Corpus is annotated with 7 different named entity types: PERS, ROLE, LOC, DEMO, ORG, WORK, EVENT, as specified by Distant Reading for European Literary History (COST Action CA16204). Total number of text files is 242 with stend-off annotation in 242 .ann files. Total number of annotations is 330119, where PERS has 14788, ROLE has 10405, LOC has 1979, DEMO 1568, ORG 323, WORK 198, EVENT 149.</p>
提供机构:
ELG
创建时间:
2022-06-01



