HadjerHaninebgt7878/ELNER-DZ
收藏Hugging Face2025-07-12 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/HadjerHaninebgt7878/ELNER-DZ
下载链接
链接失效反馈官方服务:
资源简介:
ELNER-DZ是一个阿尔及利亚阿拉伯方言(Darija)的大型数据集,用于命名实体识别和实体链接,包含了超过200万句方言句子,标记了超过190万个命名实体,并且链接到了Wikidata QIDs。数据集支持阿拉伯语、阿拉伯语拉丁字母、法语和英语,格式为JSON。
ELNER-DZ is a large-scale dataset for Named Entity Recognition and Entity Linking in Algerian Arabic Dialect (Darija), containing over 2 million dialectal sentences annotated with more than 1.9 million named entities linked to Wikidata QIDs. It supports Arabic, Arabizi, French, and English, and is formatted in JSON.
提供机构:
HadjerHaninebgt7878



