Subhash219/naamapadam
收藏Hugging Face2026-04-24 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/Subhash219/naamapadam
下载链接
链接失效反馈官方服务:
资源简介:
Naamapadam是11种印度语言中最大的公开可用的命名实体标注数据集。该语料库是通过将英语-印度语言平行语料库中英语部分的命名实体投影到印度语言部分创建的。数据集还包含8种印度语言的手动标注测试集,每语言包含500-1000个句子。
Naamapadam is the largest publicly available Named Entity Annotated dataset for 11 Indic languages. This corpora was created by projecting named entities from English side to the Indic language side of the English-Indic languages parallel corpus. The dataset additionally contains manually labelled test set for 8 Indic languages containing 500-1000 sentences.
提供机构:
Subhash219



