Khasi Named Entity Recognition (NER) Datasets
收藏India Data2025-05-12 更新2026-05-16 收录
下载链接:
https://india-data.org/dataset-details/87aaac39-1e0b-4af8-b9de-674c644236d9
下载链接
链接失效反馈官方服务:
资源简介:
This dataset provides annotated Named Entity Recognition (NER) data for the Khasi language. It contains: Gold Data: Manually annotated with named entity labels and keyword identification tags. Synthetic Data: Automatically generated data to supplement the gold dataset. The data follows the CoNLL format with token-wise annotation. The gold dataset is split into training, validation, and test sets, while the synthetic dataset provides additional data for pretraining models.
提供机构:
Natural Language Processing (NLP)
创建时间:
2025-05-12



