Jean-Baptiste/wikiner_fr
收藏Hugging Face2023-06-26 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Jean-Baptiste/wikiner_fr
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- fr
dataset_info:
features:
- name: id
dtype: int64
- name: tokens
sequence: string
- name: ner_tags
sequence:
class_label:
names:
'0': O
'1': LOC
'2': PER
'3': MISC
'4': ORG
splits:
- name: test
num_bytes: 5954708
num_examples: 13410
- name: train
num_bytes: 54305659
num_examples: 120682
download_size: 12147768
dataset_size: 60260367
train-eval-index:
- config: Jean-Baptiste--wikiner_fr
task: token-classification
task_id: entity_extraction
splits:
eval_split: test
col_mapping:
tokens: tokens
ner_tags: tags
task_categories:
- token-classification
---
# Dataset Card for "wikiner_fr"
Dataset Description:
- **Homepage:** https://metatext.io/datasets/wikiner
- **Repository:**
- **Paper:** https://www.sciencedirect.com/science/article/pii/S0004370212000276?via%3Dihub
- **Leaderboard:**
- **Point of Contact:**
提供机构:
Jean-Baptiste
原始信息汇总
数据集概述
数据集名称
- wikiner_fr
数据集特征
- id: 整数类型 (int64)
- tokens: 字符串序列
- ner_tags: 标签序列
- 类别标签:
- 0: O
- 1: LOC
- 2: PER
- 3: MISC
- 4: ORG
- 类别标签:
数据集分割
- train: 120682个样本,占用54305659字节
- test: 13410个样本,占用5954708字节
数据集大小
- 下载大小: 12147768字节
- 数据集总大小: 60260367字节
任务与配置
- 任务: 词元分类 (token-classification)
- 任务ID: entity_extraction
- 评估分割: test
- 列映射:
- tokens: tokens
- ner_tags: tags
语言
- 法语 (fr)



