NaiveNeuron/wikigoldsk
收藏Hugging Face2023-04-11 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/NaiveNeuron/wikigoldsk
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-sa-3.0
---
# Dataset Card for WikiGoldSK
- **Repository:** [https://github.com/NaiveNeuron/WikiGoldSK](https://github.com/NaiveNeuron/WikiGoldSK)
- **Paper:** [https://arxiv.org/abs/2304.04026](https://arxiv.org/abs/2304.04026)
### Dataset Summary
WikiGoldSK is manually annotated slovak NER dataset created from Wikipedia.
It contains more than 10k named entities from categories PER, LOC, ORG and MISC in IOB2 format.
### Citation Information
```
@inproceedings{}
```
提供机构:
NaiveNeuron
原始信息汇总
数据集概述
数据集名称
WikiGoldSK
数据集描述
WikiGoldSK是一个手动标注的斯洛伐克命名实体识别(NER)数据集,源自Wikipedia。该数据集包含超过10,000个来自PER、LOC、ORG和MISC类别的命名实体,采用IOB2格式。
数据集内容
- 实体类型:PER(人名)、LOC(地点)、ORG(组织)、MISC(杂项)
- 格式:IOB2
- 实体数量:超过10,000个
许可证
CC-BY-SA-3.0



