arubenruben/portuguese-mapa
收藏Hugging Face2023-05-01 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/arubenruben/portuguese-mapa
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: tokens
sequence: string
- name: ner_tags
sequence:
class_label:
names:
'0': O
'1': B-PESSOA
'2': I-PESSOA
'3': B-ORGANIZACAO
'4': I-ORGANIZACAO
'5': B-LOCAL
'6': I-LOCAL
'7': B-TEMPO
'8': I-TEMPO
'9': B-VALOR
'10': I-VALOR
splits:
- name: train
num_bytes: 970478
num_examples: 1086
- name: validation
num_bytes: 119282
num_examples: 105
- name: test
num_bytes: 335581
num_examples: 390
download_size: 218401
dataset_size: 1425341
---
# Dataset Card for "portuguese-mapa"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
arubenruben
原始信息汇总
数据集概述
特征信息
- tokens: 序列类型为字符串。
- ner_tags: 序列类型为类别标签,具体标签包括:
- 0: O
- 1: B-PESSOA
- 2: I-PESSOA
- 3: B-ORGANIZACAO
- 4: I-ORGANIZACAO
- 5: B-LOCAL
- 6: I-LOCAL
- 7: B-TEMPO
- 8: I-TEMPO
- 9: B-VALOR
- 10: I-VALOR
数据分割
- train:
- 字节数: 970478
- 样本数: 1086
- validation:
- 字节数: 119282
- 样本数: 105
- test:
- 字节数: 335581
- 样本数: 390
数据集大小
- 下载大小: 218401 字节
- 数据集大小: 1425341 字节



