
distil-whisper/librispeech_asr-noise

Source: Hugging Face. Updated 2023-09-27. Indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/distil-whisper/librispeech_asr-noise
Resource description:
---
dataset_info:
- config_name: test-pub-noise
  features:
  - name: audio
    dtype: audio
  - name: text
    dtype: string
  - name: id
    dtype: string
  splits:
  - name: '40'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '35'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '30'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '25'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '20'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '15'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '10'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '5'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '0'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: minus5
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: minus10
    num_bytes: 2517727265.74
    num_examples: 2620
  download_size: 9029521258
  dataset_size: 27694999923.13999
- config_name: test-white-noise
  features:
  - name: audio
    dtype: audio
  - name: text
    dtype: string
  - name: id
    dtype: string
  splits:
  - name: '40'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '35'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '30'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '25'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '20'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '15'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '10'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '5'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: '0'
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: minus5
    num_bytes: 2517727265.74
    num_examples: 2620
  - name: minus10
    num_bytes: 2517727265.74
    num_examples: 2620
  download_size: 15639888311
  dataset_size: 27694999923.13999
- config_name: validation-pub-noise
  features:
  - name: audio
    dtype: audio
  - name: text
    dtype: string
  - name: id
    dtype: string
  splits:
  - name: '40'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '35'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '30'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '25'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '20'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '15'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '10'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '5'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '0'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: minus5
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: minus10
    num_bytes: 2313039107.07
    num_examples: 2703
  download_size: 15441254231
  dataset_size: 25443430177.77
- config_name: validation-white-noise
  features:
  - name: audio
    dtype: audio
  - name: text
    dtype: string
  - name: id
    dtype: string
  splits:
  - name: '40'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '35'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '30'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '25'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '20'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '15'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '10'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '5'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: '0'
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: minus5
    num_bytes: 2313039107.07
    num_examples: 2703
  - name: minus10
    num_bytes: 2313039107.07
    num_examples: 2703
  download_size: 15581612447
  dataset_size: 25443430177.77
configs:
- config_name: test-pub-noise
  data_files:
  - split: '40'
    path: test-pub-noise/40-*
  - split: '35'
    path: test-pub-noise/35-*
  - split: '30'
    path: test-pub-noise/30-*
  - split: '25'
    path: test-pub-noise/25-*
  - split: '20'
    path: test-pub-noise/20-*
  - split: '15'
    path: test-pub-noise/15-*
  - split: '10'
    path: test-pub-noise/10-*
  - split: '5'
    path: test-pub-noise/5-*
  - split: '0'
    path: test-pub-noise/0-*
  - split: minus5
    path: test-pub-noise/minus5-*
  - split: minus10
    path: test-pub-noise/minus10-*
- config_name: test-white-noise
  data_files:
  - split: '40'
    path: test-white-noise/40-*
  - split: '35'
    path: test-white-noise/35-*
  - split: '30'
    path: test-white-noise/30-*
  - split: '25'
    path: test-white-noise/25-*
  - split: '20'
    path: test-white-noise/20-*
  - split: '15'
    path: test-white-noise/15-*
  - split: '10'
    path: test-white-noise/10-*
  - split: '5'
    path: test-white-noise/5-*
  - split: '0'
    path: test-white-noise/0-*
  - split: minus5
    path: test-white-noise/minus5-*
  - split: minus10
    path: test-white-noise/minus10-*
- config_name: validation-pub-noise
  data_files:
  - split: '40'
    path: validation-pub-noise/40-*
  - split: '35'
    path: validation-pub-noise/35-*
  - split: '30'
    path: validation-pub-noise/30-*
  - split: '25'
    path: validation-pub-noise/25-*
  - split: '20'
    path: validation-pub-noise/20-*
  - split: '15'
    path: validation-pub-noise/15-*
  - split: '10'
    path: validation-pub-noise/10-*
  - split: '5'
    path: validation-pub-noise/5-*
  - split: '0'
    path: validation-pub-noise/0-*
  - split: minus5
    path: validation-pub-noise/minus5-*
  - split: minus10
    path: validation-pub-noise/minus10-*
- config_name: validation-white-noise
  data_files:
  - split: '40'
    path: validation-white-noise/40-*
  - split: '35'
    path: validation-white-noise/35-*
  - split: '30'
    path: validation-white-noise/30-*
  - split: '25'
    path: validation-white-noise/25-*
  - split: '20'
    path: validation-white-noise/20-*
  - split: '15'
    path: validation-white-noise/15-*
  - split: '10'
    path: validation-white-noise/10-*
  - split: '5'
    path: validation-white-noise/5-*
  - split: '0'
    path: validation-white-noise/0-*
  - split: minus5
    path: validation-white-noise/minus5-*
  - split: minus10
    path: validation-white-noise/minus10-*
---

# Dataset Card for "librispeech_asr-noise"

[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
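The card metadata declares four configurations, each with the same eleven split names. A minimal sketch of how one of them might be loaded with the Hugging Face `datasets` library follows. The helper names `split_to_level` and `iter_examples` are hypothetical, and reading the split names as signed numeric levels (for example `minus5` as -5) is an assumption, since the card itself does not say what the numbers mean.

```python
from typing import Any, Dict, Iterator

REPO_ID = "distil-whisper/librispeech_asr-noise"

# Configuration and split names copied from the card metadata.
CONFIGS = [
    "test-pub-noise",
    "test-white-noise",
    "validation-pub-noise",
    "validation-white-noise",
]
SPLITS = ["40", "35", "30", "25", "20", "15", "10", "5", "0", "minus5", "minus10"]


def split_to_level(split: str) -> int:
    """Convert a split name to a signed integer, e.g. 'minus5' -> -5.

    Assumption: the names encode a numeric level; the card does not define it.
    """
    if split.startswith("minus"):
        return -int(split[len("minus"):])
    return int(split)


def iter_examples(config: str, split: str) -> Iterator[Dict[str, Any]]:
    """Stream examples (fields: audio, text, id) for one config and split.

    Requires the `datasets` package and network access, so the import is
    deferred until the function is actually called.
    """
    from datasets import load_dataset

    if config not in CONFIGS or split not in SPLITS:
        raise ValueError(f"unknown config/split: {config!r}/{split!r}")
    # streaming=True avoids downloading the whole multi-gigabyte config up front.
    yield from load_dataset(REPO_ID, config, split=split, streaming=True)
```

With network access, `next(iter_examples("test-pub-noise", "40"))["text"]` would be expected to return the transcript of the first example; decoding the `audio` field may need additional audio dependencies.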
Provider:
distil-whisper
Summary of original information

Dataset overview

Dataset configurations

Configuration name: test-pub-noise

  • Features
    • audio: audio data
    • text: string
    • id: string
  • Splits
    • 40: 2620 examples, 2517727265.74 bytes
    • 35: 2620 examples, 2517727265.74 bytes
    • 30: 2620 examples, 2517727265.74 bytes
    • 25: 2620 examples, 2517727265.74 bytes
    • 20: 2620 examples, 2517727265.74 bytes
    • 15: 2620 examples, 2517727265.74 bytes
    • 10: 2620 examples, 2517727265.74 bytes
    • 5: 2620 examples, 2517727265.74 bytes
    • 0: 2620 examples, 2517727265.74 bytes
    • minus5: 2620 examples, 2517727265.74 bytes
    • minus10: 2620 examples, 2517727265.74 bytes
  • Download size: 9029521258 bytes
  • Dataset size: 27694999923.13999 bytes

Configuration name: test-white-noise

  • Features
    • audio: audio data
    • text: string
    • id: string
  • Splits
    • 40: 2620 examples, 2517727265.74 bytes
    • 35: 2620 examples, 2517727265.74 bytes
    • 30: 2620 examples, 2517727265.74 bytes
    • 25: 2620 examples, 2517727265.74 bytes
    • 20: 2620 examples, 2517727265.74 bytes
    • 15: 2620 examples, 2517727265.74 bytes
    • 10: 2620 examples, 2517727265.74 bytes
    • 5: 2620 examples, 2517727265.74 bytes
    • 0: 2620 examples, 2517727265.74 bytes
    • minus5: 2620 examples, 2517727265.74 bytes
    • minus10: 2620 examples, 2517727265.74 bytes
  • Download size: 15639888311 bytes
  • Dataset size: 27694999923.13999 bytes

Configuration name: validation-pub-noise

  • Features
    • audio: audio data
    • text: string
    • id: string
  • Splits
    • 40: 2703 examples, 2313039107.07 bytes
    • 35: 2703 examples, 2313039107.07 bytes
    • 30: 2703 examples, 2313039107.07 bytes
    • 25: 2703 examples, 2313039107.07 bytes
    • 20: 2703 examples, 2313039107.07 bytes
    • 15: 2703 examples, 2313039107.07 bytes
    • 10: 2703 examples, 2313039107.07 bytes
    • 5: 2703 examples, 2313039107.07 bytes
    • 0: 2703 examples, 2313039107.07 bytes
    • minus5: 2703 examples, 2313039107.07 bytes
    • minus10: 2703 examples, 2313039107.07 bytes
  • Download size: 15441254231 bytes
  • Dataset size: 25443430177.77 bytes

Configuration name: validation-white-noise

  • Features
    • audio: audio data
    • text: string
    • id: string
  • Splits
    • 40: 2703 examples, 2313039107.07 bytes
    • 35: 2703 examples, 2313039107.07 bytes
    • 30: 2703 examples, 2313039107.07 bytes
    • 25: 2703 examples, 2313039107.07 bytes
    • 20: 2703 examples, 2313039107.07 bytes
    • 15: 2703 examples, 2313039107.07 bytes
    • 10: 2703 examples, 2313039107.07 bytes
    • 5: 2703 examples, 2313039107.07 bytes
    • 0: 2703 examples, 2313039107.07 bytes
    • minus5: 2703 examples, 2313039107.07 bytes
    • minus10: 2703 examples, 2313039107.07 bytes
  • Download size: 15581612447 bytes
  • Dataset size: 25443430177.77 bytes
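
The size figures listed for the four configurations can be cross-checked with a little arithmetic: each reported dataset size should equal eleven times the per-split size, and the ratio of download size to dataset size shows how much smaller the stored files are. The sketch below uses only numbers copied from the listing; the names `CARD_SIZES` and `summarize` are hypothetical, and `Decimal` is used so the two-decimal byte counts are not blurred by floating-point rounding.

```python
from decimal import Decimal
from typing import Dict, Tuple

NUM_SPLITS = 11  # 40, 35, ..., 0, minus5, minus10

# config name -> (bytes per split, download size, reported dataset size)
CARD_SIZES: Dict[str, Tuple[Decimal, int, Decimal]] = {
    "test-pub-noise": (
        Decimal("2517727265.74"), 9029521258, Decimal("27694999923.13999")),
    "test-white-noise": (
        Decimal("2517727265.74"), 15639888311, Decimal("27694999923.13999")),
    "validation-pub-noise": (
        Decimal("2313039107.07"), 15441254231, Decimal("25443430177.77")),
    "validation-white-noise": (
        Decimal("2313039107.07"), 15581612447, Decimal("25443430177.77")),
}


def summarize(config: str) -> Tuple[Decimal, Decimal]:
    """Return (drift, ratio) for one configuration.

    drift: |11 * per-split bytes - reported dataset size|, in bytes.
    ratio: download size divided by reported dataset size.
    """
    split_bytes, download_size, dataset_size = CARD_SIZES[config]
    drift = abs(split_bytes * NUM_SPLITS - dataset_size)
    ratio = Decimal(download_size) / dataset_size
    return drift, ratio


if __name__ == "__main__":
    for name in CARD_SIZES:
        drift, ratio = summarize(name)
        print(f"{name}: drift={drift} bytes, download/dataset={ratio:.3f}")
```

The tiny drift in the two test configurations comes from the trailing `.13999` in the reported dataset size, which looks like a floating-point rounding artifact rather than a real discrepancy.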