anonymoussubmissions/switchboard-ner-normalized
收藏Hugging Face2023-02-26 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/anonymoussubmissions/switchboard-ner-normalized
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: tokens
sequence: string
- name: labels
sequence: string
- name: labels_orig
sequence: string
- name: tags
sequence:
class_label:
names:
'0': O
'1': B-CARDINAL
'2': B-DATE
'3': B-EVENT
'4': B-FAC
'5': B-GPE
'6': B-LANGUAGE
'7': B-LAW
'8': B-LOC
'9': B-MONEY
'10': B-NORP
'11': B-ORDINAL
'12': B-ORG
'13': B-PERCENT
'14': B-PERSON
'15': B-PRODUCT
'16': B-QUANTITY
'17': B-TIME
'18': B-WORK_OF_ART
'19': I-CARDINAL
'20': I-DATE
'21': I-EVENT
'22': I-FAC
'23': I-GPE
'24': I-LANGUAGE
'25': I-LAW
'26': I-LOC
'27': I-MONEY
'28': I-NORP
'29': I-ORDINAL
'30': I-ORG
'31': I-PERCENT
'32': I-PERSON
'33': I-PRODUCT
'34': I-QUANTITY
'35': I-TIME
'36': I-WORK_OF_ART
- name: swbd_id
dtype: string
- name: swne_sentence_no
dtype: string
- name: id
dtype: string
splits:
- name: train
num_bytes: 2651450
num_examples: 7721
- name: validation
num_bytes: 1778041
num_examples: 5148
- name: test
num_bytes: 2962454
num_examples: 8580
download_size: 1002621
dataset_size: 7391945
---
# Dataset Card for "switchboard-ner-normalized"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
anonymoussubmissions
原始信息汇总
数据集概述
数据集信息
- 特征:
- tokens: 序列类型,字符串
- labels: 序列类型,字符串
- labels_orig: 序列类型,字符串
- tags: 序列类型,包含类别标签,具体标签如下:
- 0: O
- 1: B-CARDINAL
- 2: B-DATE
- 3: B-EVENT
- 4: B-FAC
- 5: B-GPE
- 6: B-LANGUAGE
- 7: B-LAW
- 8: B-LOC
- 9: B-MONEY
- 10: B-NORP
- 11: B-ORDINAL
- 12: B-ORG
- 13: B-PERCENT
- 14: B-PERSON
- 15: B-PRODUCT
- 16: B-QUANTITY
- 17: B-TIME
- 18: B-WORK_OF_ART
- 19: I-CARDINAL
- 20: I-DATE
- 21: I-EVENT
- 22: I-FAC
- 23: I-GPE
- 24: I-LANGUAGE
- 25: I-LAW
- 26: I-LOC
- 27: I-MONEY
- 28: I-NORP
- 29: I-ORDINAL
- 30: I-ORG
- 31: I-PERCENT
- 32: I-PERSON
- 33: I-PRODUCT
- 34: I-QUANTITY
- 35: I-TIME
- 36: I-WORK_OF_ART
- swbd_id: 数据类型,字符串
- swne_sentence_no: 数据类型,字符串
- id: 数据类型,字符串
数据分割
- train:
- 字节数: 2651450
- 样本数: 7721
- validation:
- 字节数: 1778041
- 样本数: 5148
- test:
- 字节数: 2962454
- 样本数: 8580
数据集大小
- 下载大小: 1002621 字节
- 数据集大小: 7391945 字节



