
qmeeus/slue-voxpopuli

Hugging Face | Updated 2023-01-10 | Indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/qmeeus/slue-voxpopuli
Resource description:
---
dataset_info:
  features:
  - name: audio
    dtype:
      audio:
        sampling_rate: 16000
  - name: sentence
    dtype: string
  - name: entities
    sequence:
      class_label:
        names:
          '0': B-DATE
          '1': I-DATE
          '2': B-TIME
          '3': I-TIME
          '4': B-CARDINAL
          '5': I-CARDINAL
          '6': B-ORDINAL
          '7': I-ORDINAL
          '8': B-QUANTITY
          '9': I-QUANTITY
          '10': B-MONEY
          '11': I-MONEY
          '12': B-PERCENT
          '13': I-PERCENT
          '14': B-GPE
          '15': I-GPE
          '16': B-LOC
          '17': I-LOC
          '18': B-NORP
          '19': I-NORP
          '20': B-ORG
          '21': I-ORG
          '22': B-LAW
          '23': I-LAW
          '24': B-PERSON
          '25': I-PERSON
          '26': B-FAC
          '27': I-FAC
          '28': B-EVENT
          '29': I-EVENT
          '30': B-WORK_OF_ART
          '31': I-WORK_OF_ART
          '32': B-PRODUCT
          '33': I-PRODUCT
          '34': B-LANGUAGE
          '35': I-LANGUAGE
          '36': O
  - name: id
    dtype: int64
  - name: combined
    sequence:
      class_label:
        names:
          '0': B-WHEN
          '1': I-WHEN
          '2': B-QUANT
          '3': I-QUANT
          '4': B-PLACE
          '5': I-PLACE
          '6': B-NORP
          '7': I-NORP
          '8': B-ORG
          '9': I-ORG
          '10': B-LAW
          '11': I-LAW
          '12': B-PERSON
          '13': I-PERSON
          '14': O
  splits:
  - name: train
    num_bytes: 240457330.0
    num_examples: 5000
  - name: dev
    num_bytes: 83070289.972
    num_examples: 1753
  download_size: 319368269
  dataset_size: 323527619.972
---
# Dataset Card for "slue-voxpopuli"

[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
Provider:
qmeeus
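Summary of original information

Dataset overview

Dataset features

  • audio:

    • Data type: audio
    • Sampling rate: 16000 Hz
  • sentence:

    • Data type: string
  • entities:

    • Sequence type: entity annotation
    • Class label names: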
      • 0: B-DATE
      • 1: I-DATE
      • 2: B-TIME
      • 3: I-TIME
      • 4: B-CARDINAL
      • 5: I-CARDINAL
      • 6: B-ORDINAL
      • 7: I-ORDINAL
      • 8: B-QUANTITY
      • 9: I-QUANTITY
      • 10: B-MONEY
      • 11: I-MONEY
      • 12: B-PERCENT
      • 13: I-PERCENT
      • 14: B-GPE
      • 15: I-GPE
      • 16: B-LOC
      • 17: I-LOC
      • 18: B-NORP
      • 19: I-NORP
      • 20: B-ORG
      • 21: I-ORG
      • 22: B-LAW
      • 23: I-LAW
      • 24: B-PERSON
      • 25: I-PERSON
      • 26: B-FAC
      • 27: I-FAC
      • 28: B-EVENT
      • 29: I-EVENT
      • 30: B-WORK_OF_ART
      • 31: I-WORK_OF_ART
      • 32: B-PRODUCT
      • 33: I-PRODUCT
      • 34: B-LANGUAGE
      • 35: I-LANGUAGE
      • 36: O
  • id:

    • Data type: int64
  • combined:

    • Sequence type: combined annotation
    • Class label names:
      • 0: B-WHEN
      • 1: I-WHEN
      • 2: B-QUANT
      • 3: I-QUANT
      • 4: B-PLACE
      • 5: I-PLACE
      • 6: B-NORP
      • 7: I-NORP
      • 8: B-ORG
      • 9: I-ORG
      • 10: B-LAW
      • 11: I-LAW
      • 12: B-PERSON
      • 13: I-PERSON
      • 14: O
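
Both label sets use BIO tagging (`B-` opens an entity, `I-` continues it, `O` is outside any entity). As a minimal sketch of how a sequence of `combined` label ids could be turned into entity spans, the helper below decodes ids using the label names listed above. The function name `decode_bio` and the example ids are hypothetical, and the sketch assumes one label per token; the card itself does not state how labels align with `sentence`.

```python
from typing import List, Tuple

# Label names for the `combined` feature, copied from the list above.
COMBINED_NAMES = [
    "B-WHEN", "I-WHEN", "B-QUANT", "I-QUANT", "B-PLACE", "I-PLACE",
    "B-NORP", "I-NORP", "B-ORG", "I-ORG", "B-LAW", "I-LAW",
    "B-PERSON", "I-PERSON", "O",
]


def decode_bio(
    label_ids: List[int], names: List[str] = COMBINED_NAMES
) -> List[Tuple[str, int, int]]:
    """Return (entity_type, start, end) spans, with `end` exclusive.

    Hypothetical helper: assumes one label id per token.
    """
    spans = []
    current_type, start = None, 0
    for i, label_id in enumerate(label_ids):
        prefix, _, entity_type = names[label_id].partition("-")
        # Close the open span unless this tag continues it.
        continues = prefix == "I" and entity_type == current_type
        if current_type is not None and not continues:
            spans.append((current_type, start, i))
            current_type = None
        # Open a span on B-, or on a stray I- with no open span (lenient choice).
        if prefix in ("B", "I") and current_type is None:
            current_type, start = entity_type, i
    if current_type is not None:
        spans.append((current_type, start, len(label_ids)))
    return spans


# Made-up ids: O, B-PLACE, I-PLACE, O, B-PERSON, I-PERSON
print(decode_bio([14, 4, 5, 14, 12, 13]))
# [('PLACE', 1, 3), ('PERSON', 4, 6)]
```

The same function should work for the `entities` feature if its 37 label names are passed as `names`, since both lists follow the same `B-`/`I-`/`O` convention.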

Dataset splits

  • train:

    • Data size: 240457330.0 bytes
    • Number of examples: 5000
  • dev:

    • Data size: 83070289.972 bytes
    • Number of examples: 1753

Dataset size

  • Download size: 319368269 bytes
  • Total dataset size: 323527619.972 bytes
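
As a quick consistency check on the figures above, the following sketch sums the per-split values and compares them with the reported total. The variable names are illustrative; the numbers are copied from the card, and nothing here reads the dataset itself.

```python
import math

# Figures copied from the split and size listings above.
splits = {
    "train": {"num_bytes": 240457330.0, "num_examples": 5000},
    "dev": {"num_bytes": 83070289.972, "num_examples": 1753},
}
dataset_size = 323527619.972
download_size = 319368269

total_bytes = sum(s["num_bytes"] for s in splits.values())
total_examples = sum(s["num_examples"] for s in splits.values())

# The split sizes should add up to the reported dataset size
# (compared with a tolerance because the values are floats).
sizes_match = math.isclose(total_bytes, dataset_size, rel_tol=1e-9)

print(total_examples)  # 6753
print(sizes_match)     # True
print(f"mean bytes per example: {total_bytes / total_examples:.0f}")
```

The mean is a storage figure only; the card gives no audio durations, so it says nothing about clip length.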