ashwathjadhav23/wnut17_filtered_entities
Source: Hugging Face. Updated 2023-03-04; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/ashwathjadhav23/wnut17_filtered_entities
Resource description:
---
dataset_info:
  features:
  - name: id
    dtype: string
  - name: tokens
    sequence: string
  - name: ner_tags
    sequence:
      class_label:
        names:
          '0': O
          '1': B-corporation
          '2': I-corporation
          '3': B-creative-work
          '4': I-creative-work
          '5': B-group
          '6': I-group
          '7': B-location
          '8': I-location
          '9': B-person
          '10': I-person
          '11': B-product
          '12': I-product
  splits:
  - name: train
    num_bytes: 390167.873895109
    num_examples: 1228
  - name: validation
    num_bytes: 259367
    num_examples: 1009
  - name: test
    num_bytes: 405520
    num_examples: 1287
  download_size: 316469
  dataset_size: 1055054.873895109
---
# Dataset Card for "wnut17_filtered_entities"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
Provided by:
ashwathjadhav23
Summary of original information
Dataset overview
Dataset name
wnut17_filtered_entities
Dataset features
- id: a string.
- tokens: a sequence of strings (sequence: string).
- ner_tags: a sequence of class labels (sequence), one per token, drawn from the following classes:
  - O
  - B-corporation
  - I-corporation
  - B-creative-work
  - I-creative-work
  - B-group
  - I-group
  - B-location
  - I-location
  - B-person
  - I-person
  - B-product
  - I-product
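
The ner_tags values are stored as integer class indices, so a reader usually has to map them back to label names before using them. The following is a minimal sketch of that mapping, plus one possible way to group B-/I- tags into entity spans. The label order is copied from the card's metadata; the helper names `decode_tags` and `extract_spans` and the sample sentence are hypothetical, chosen only for illustration, and treating a stray I- tag as the start of a new span is an assumption rather than something the card specifies.

```python
from typing import List, Tuple

# Label names in index order, copied from the card's class_label metadata.
LABEL_NAMES = [
    "O",
    "B-corporation", "I-corporation",
    "B-creative-work", "I-creative-work",
    "B-group", "I-group",
    "B-location", "I-location",
    "B-person", "I-person",
    "B-product", "I-product",
]


def decode_tags(tag_ids: List[int]) -> List[str]:
    """Map integer class indices to their label names."""
    return [LABEL_NAMES[i] for i in tag_ids]


def extract_spans(tokens: List[str], tag_ids: List[int]) -> List[Tuple[str, str]]:
    """Group BIO-tagged tokens into (entity_type, text) pairs.

    Assumption: an I- tag that does not continue the current entity
    type is treated as the beginning of a new span.
    """
    spans: List[Tuple[str, str]] = []
    current_type = None
    current_tokens: List[str] = []

    def flush() -> None:
        # Emit the span collected so far, if any.
        if current_tokens:
            spans.append((current_type, " ".join(current_tokens)))

    for token, label in zip(tokens, decode_tags(tag_ids)):
        if label == "O":
            flush()
            current_type, current_tokens = None, []
        elif label.startswith("B-") or label[2:] != current_type:
            flush()
            current_type, current_tokens = label[2:], [token]
        else:
            # I- tag continuing the current entity.
            current_tokens.append(token)
    flush()
    return spans


# Hypothetical example sentence, not taken from the dataset.
tokens = ["Empire", "State", "Building", "is", "in", "New", "York"]
tags = [7, 8, 8, 0, 0, 7, 8]
print(extract_spans(tokens, tags))
```

The index-to-name list also gives the reverse mapping when needed, for example `{name: i for i, name in enumerate(LABEL_NAMES)}`.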
Dataset splits
- train: 1228 examples, 390167.873895109 bytes.
- validation: 1009 examples, 259367 bytes.
- test: 1287 examples, 405520 bytes.
Dataset size
- Download size: 316469 bytes
- Total dataset size: 1055054.873895109 bytes
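
As a quick sanity check on the figures above, the per-split byte counts can be summed and compared with the reported total dataset size. The sketch below hard-codes the numbers from the card (including the fractional train byte count exactly as published) and also prints each split's share of examples and a rough average size per example. The variable names are illustrative only.

```python
import math

# Figures copied from the card's splits metadata.
SPLITS = {
    "train": {"num_bytes": 390167.873895109, "num_examples": 1228},
    "validation": {"num_bytes": 259367, "num_examples": 1009},
    "test": {"num_bytes": 405520, "num_examples": 1287},
}
DATASET_SIZE = 1055054.873895109  # reported total dataset size, in bytes

total_bytes = sum(s["num_bytes"] for s in SPLITS.values())
total_examples = sum(s["num_examples"] for s in SPLITS.values())

# The reported total should equal the sum of the split sizes
# (compared with a tolerance because the train figure is a float).
sizes_match = math.isclose(total_bytes, DATASET_SIZE, rel_tol=1e-9)
print(f"total examples: {total_examples}, sizes consistent: {sizes_match}")

for name, s in SPLITS.items():
    share = s["num_examples"] / total_examples
    avg_bytes = s["num_bytes"] / s["num_examples"]
    print(f"{name:<10} {s['num_examples']:>5} examples  {share:6.1%}  ~{avg_bytes:.0f} bytes/example")
```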



