arubenruben/ontonotes5.0-pt-harem-selective
收藏Hugging Face2023-05-12 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/arubenruben/ontonotes5.0-pt-harem-selective
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: tokens
sequence: string
- name: ner_tags
sequence:
class_label:
names:
'0': O
'1': B-PESSOA
'2': I-PESSOA
'3': B-ORGANIZACAO
'4': I-ORGANIZACAO
'5': B-LOCAL
'6': I-LOCAL
'7': B-TEMPO
'8': I-TEMPO
'9': B-VALOR
'10': I-VALOR
splits:
- name: train
num_bytes: 16511400
num_examples: 1898
- name: validation
num_bytes: 2417378
num_examples: 279
- name: test
num_bytes: 1564609
num_examples: 163
download_size: 3181837
dataset_size: 20493387
---
# Dataset Card for "ontonotes5.0-pt-harem-selective"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
arubenruben
原始信息汇总
数据集概述
数据集名称
- 名称: ontonotes5.0-pt-harem-selective
数据集特征
- 特征1: tokens
- 类型: 字符串序列
- 特征2: ner_tags
- 类型: 序列
- 类别标签:
- 0: O
- 1: B-PESSOA
- 2: I-PESSOA
- 3: B-ORGANIZACAO
- 4: I-ORGANIZACAO
- 5: B-LOCAL
- 6: I-LOCAL
- 7: B-TEMPO
- 8: I-TEMPO
- 9: B-VALOR
- 10: I-VALOR
数据集分割
- 训练集:
- 大小: 16511400 字节
- 示例数量: 1898
- 验证集:
- 大小: 2417378 字节
- 示例数量: 279
- 测试集:
- 大小: 1564609 字节
- 示例数量: 163
数据集大小
- 下载大小: 3181837 字节
- 数据集总大小: 20493387 字节



