qmeeus/slue-voxpopuli
Source: Hugging Face. Updated: 2023-01-10. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/qmeeus/slue-voxpopuli
Resource description:
---
dataset_info:
  features:
  - name: audio
    dtype:
      audio:
        sampling_rate: 16000
  - name: sentence
    dtype: string
  - name: entities
    sequence:
      class_label:
        names:
          '0': B-DATE
          '1': I-DATE
          '2': B-TIME
          '3': I-TIME
          '4': B-CARDINAL
          '5': I-CARDINAL
          '6': B-ORDINAL
          '7': I-ORDINAL
          '8': B-QUANTITY
          '9': I-QUANTITY
          '10': B-MONEY
          '11': I-MONEY
          '12': B-PERCENT
          '13': I-PERCENT
          '14': B-GPE
          '15': I-GPE
          '16': B-LOC
          '17': I-LOC
          '18': B-NORP
          '19': I-NORP
          '20': B-ORG
          '21': I-ORG
          '22': B-LAW
          '23': I-LAW
          '24': B-PERSON
          '25': I-PERSON
          '26': B-FAC
          '27': I-FAC
          '28': B-EVENT
          '29': I-EVENT
          '30': B-WORK_OF_ART
          '31': I-WORK_OF_ART
          '32': B-PRODUCT
          '33': I-PRODUCT
          '34': B-LANGUAGE
          '35': I-LANGUAGE
          '36': O
  - name: id
    dtype: int64
  - name: combined
    sequence:
      class_label:
        names:
          '0': B-WHEN
          '1': I-WHEN
          '2': B-QUANT
          '3': I-QUANT
          '4': B-PLACE
          '5': I-PLACE
          '6': B-NORP
          '7': I-NORP
          '8': B-ORG
          '9': I-ORG
          '10': B-LAW
          '11': I-LAW
          '12': B-PERSON
          '13': I-PERSON
          '14': O
  splits:
  - name: train
    num_bytes: 240457330.0
    num_examples: 5000
  - name: dev
    num_bytes: 83070289.972
    num_examples: 1753
  download_size: 319368269
  dataset_size: 323527619.972
---
# Dataset Card for "slue-voxpopuli"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
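
The card gives no usage instructions, so the following is only a minimal sketch of how the dataset might be loaded. It assumes the Hugging Face `datasets` library is installed and that the repository is still reachable on the Hub under the id `qmeeus/slue-voxpopuli`. The helper names `load_slue_voxpopuli`, `tag_names_for`, and `main` are hypothetical and not part of the card.

```python
def load_slue_voxpopuli(split="train"):
    """Load one split ("train" or "dev", per the card metadata) from the Hub."""
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    return load_dataset("qmeeus/slue-voxpopuli", split=split)


def tag_names_for(tag_ids, names):
    """Translate a sequence of integer class ids into their label strings."""
    return [names[i] for i in tag_ids]


def main():
    # Downloads the data on first use (roughly 319 MB according to the card).
    train = load_slue_voxpopuli("train")
    # `entities` is a sequence of ClassLabel values; the label strings live
    # on the inner feature.
    names = train.features["entities"].feature.names
    example = train[0]
    print(example["sentence"])
    print(tag_names_for(example["entities"], names))
```

Calling `main()` prints the transcript of the first training example together with its decoded `entities` tags. The same pattern should apply to the `combined` column.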
Provider:
qmeeus
Summary of original information
Dataset overview
Dataset features
- audio:
  - Data type: audio
  - Sampling rate: 16000 Hz
- sentence:
  - Data type: string
- entities:
  - Sequence type: entity annotations
  - Class label names:
    - 0: B-DATE
    - 1: I-DATE
    - 2: B-TIME
    - 3: I-TIME
    - 4: B-CARDINAL
    - 5: I-CARDINAL
    - 6: B-ORDINAL
    - 7: I-ORDINAL
    - 8: B-QUANTITY
    - 9: I-QUANTITY
    - 10: B-MONEY
    - 11: I-MONEY
    - 12: B-PERCENT
    - 13: I-PERCENT
    - 14: B-GPE
    - 15: I-GPE
    - 16: B-LOC
    - 17: I-LOC
    - 18: B-NORP
    - 19: I-NORP
    - 20: B-ORG
    - 21: I-ORG
    - 22: B-LAW
    - 23: I-LAW
    - 24: B-PERSON
    - 25: I-PERSON
    - 26: B-FAC
    - 27: I-FAC
    - 28: B-EVENT
    - 29: I-EVENT
    - 30: B-WORK_OF_ART
    - 31: I-WORK_OF_ART
    - 32: B-PRODUCT
    - 33: I-PRODUCT
    - 34: B-LANGUAGE
    - 35: I-LANGUAGE
    - 36: O
- id:
  - Data type: int64
- combined:
  - Sequence type: combined annotations
  - Class label names:
    - 0: B-WHEN
    - 1: I-WHEN
    - 2: B-QUANT
    - 3: I-QUANT
    - 4: B-PLACE
    - 5: I-PLACE
    - 6: B-NORP
    - 7: I-NORP
    - 8: B-ORG
    - 9: I-ORG
    - 10: B-LAW
    - 11: I-LAW
    - 12: B-PERSON
    - 13: I-PERSON
    - 14: O
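
Both label sets use B-/I-/O prefixes, which suggests the usual BIO tagging convention. The sketch below shows one way to turn a decoded tag sequence into entity spans and to collapse the fine-grained `entities` tags into the coarser `combined` tags. The grouping in `FINE_TO_COMBINED` is an assumption inferred from the label names (for example DATE and TIME into WHEN); the card does not state it. Fine-grained types with no obvious counterpart (FAC, EVENT, WORK_OF_ART, PRODUCT, LANGUAGE) are mapped to `O` here, which is also an assumption. All function names are hypothetical.

```python
# Assumed grouping of fine-grained entity types into combined types.
FINE_TO_COMBINED = {
    "DATE": "WHEN", "TIME": "WHEN",
    "CARDINAL": "QUANT", "ORDINAL": "QUANT", "QUANTITY": "QUANT",
    "MONEY": "QUANT", "PERCENT": "QUANT",
    "GPE": "PLACE", "LOC": "PLACE",
    "NORP": "NORP", "ORG": "ORG", "LAW": "LAW", "PERSON": "PERSON",
}


def to_combined(tag):
    """Map one fine-grained BIO tag to a combined BIO tag ("O" if unmapped)."""
    if tag == "O":
        return "O"
    prefix, _, label = tag.partition("-")
    combined = FINE_TO_COMBINED.get(label)
    return f"{prefix}-{combined}" if combined else "O"


def bio_spans(tags):
    """Return (label, start, end) spans from BIO tags; `end` is exclusive."""
    spans = []
    start = label = None
    # A trailing "O" sentinel flushes a span that runs to the end.
    for i, tag in enumerate(list(tags) + ["O"]):
        if tag.startswith("I-") and label == tag[2:]:
            continue  # still inside the current span
        if label is not None:
            spans.append((label, start, i))
            start = label = None
        if tag != "O":
            # A "B-" tag, or a stray "I-" tag, opens a new span (lenient).
            start, label = i, tag[2:]
    return spans


fine = ["B-DATE", "I-DATE", "O", "B-GPE", "B-FAC"]
coarse = [to_combined(t) for t in fine]
print(coarse)
print(bio_spans(coarse))
```

Spans are reported over tag positions only. How those positions align with the words of `sentence` is not documented on the card and should be checked against the data.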
Dataset splits
- train:
  - Size: 240457330.0 bytes
  - Number of examples: 5000
- dev:
  - Size: 83070289.972 bytes
  - Number of examples: 1753
Dataset size
- Download size: 319368269 bytes
- Total dataset size: 323527619.972 bytes
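
As a quick sanity check on the figures above, the short script below adds up the per-split numbers and compares them with the reported total. It only restates values from the card. The variable names and the MiB conversion are illustrative.

```python
# Figures copied from the card metadata.
SPLITS = {
    "train": {"num_bytes": 240457330.0, "num_examples": 5000},
    "dev": {"num_bytes": 83070289.972, "num_examples": 1753},
}
DATASET_SIZE = 323527619.972  # bytes
DOWNLOAD_SIZE = 319368269     # bytes


def mib(num_bytes):
    """Convert a byte count to mebibytes."""
    return num_bytes / 2**20


total_bytes = sum(s["num_bytes"] for s in SPLITS.values())
total_examples = sum(s["num_examples"] for s in SPLITS.values())

# The split sizes should add up to the reported dataset size
# (compared with a small tolerance because the values are floats).
sizes_match = abs(total_bytes - DATASET_SIZE) < 1e-3

print(f"total examples: {total_examples}")
print(f"split sizes sum to dataset_size: {sizes_match}")
print(f"download: {mib(DOWNLOAD_SIZE):.1f} MiB, on disk: {mib(DATASET_SIZE):.1f} MiB")
for name, s in SPLITS.items():
    print(f"{name}: {s['num_bytes'] / s['num_examples']:.0f} bytes per example")
```

The two splits together hold 6753 examples, and their byte counts sum to the reported `dataset_size`.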



