AlbNER
收藏arXiv2023-09-16 更新2024-06-21 收录
下载链接:
http://hdl.handle.net/11234/1-5214
下载链接
链接失效反馈官方服务:
资源简介:
AlbNER是一个针对阿尔巴尼亚语的命名实体识别数据集,由维也纳大学的Erion Çano创建。该数据集包含900个从阿尔巴尼亚语维基百科文章中提取并手动标注的句子,旨在促进阿尔巴尼亚语的命名实体识别研究。数据集内容涵盖阿尔巴尼亚历史、地理及历史人物等多个领域,每个句子都经过分词和手动标注。AlbNER数据集的应用领域主要集中在自然语言处理和计算语言学,特别是命名实体识别任务,旨在解决资源匮乏语言的研究难题。
AlbNER is a named entity recognition (NER) dataset tailored for the Albanian language, developed by Erion Çano from the University of Vienna. This dataset contains 900 manually annotated sentences extracted from Albanian Wikipedia articles, with the goal of facilitating research on Albanian language NER. The content of the dataset spans multiple domains including Albanian history, geography, and historical figures, and each sentence has undergone word segmentation and manual annotation. The AlbNER dataset is primarily applied in the fields of natural language processing (NLP) and computational linguistics, particularly for NER tasks, and aims to address the research challenges posed by low-resource languages.
提供机构:
维也纳大学
创建时间:
2023-09-16



