JST-SUPERB/noisy-alpaca-test-MUSAN-music

Name: JST-SUPERB/noisy-alpaca-test-MUSAN-music
Creator: JST-SUPERB
Published: 2024-05-18 07:04:35
License: No description available

Source: Hugging Face. Updated 2024-05-18. Indexed 2025-04-26.

Download link:

https://hf-mirror.com/datasets/JST-SUPERB/noisy-alpaca-test-MUSAN-music

Resource overview:

---
dataset_info:
  features:
  - name: speech_input
    dtype: string
  - name: clean_audio
    dtype:
      audio:
        sampling_rate: 16000
  - name: noisy_10dB
    dtype:
      audio:
        sampling_rate: 16000
  - name: noisy_5dB
    dtype:
      audio:
        sampling_rate: 16000
  - name: noisy_0dB
    dtype:
      audio:
        sampling_rate: 16000
  - name: noisy_-5dB
    dtype:
      audio:
        sampling_rate: 16000
  - name: noisy_-10dB
    dtype:
      audio:
        sampling_rate: 16000
  - name: noisy_-20dB
    dtype: audio
  - name: noisy_10dB_transcription_whisper-small.en
    dtype: string
  - name: noisy_5dB_transcription_whisper-small.en
    dtype: string
  - name: noisy_0dB_transcription_whisper-small.en
    dtype: string
  - name: noisy_-5dB_transcription_whisper-small.en
    dtype: string
  - name: noisy_-10dB_transcription_whisper-small.en
    dtype: string
  - name: noisy_10dB_transcription_whisper-medium.en
    dtype: string
  - name: noisy_5dB_transcription_whisper-medium.en
    dtype: string
  - name: noisy_0dB_transcription_whisper-medium.en
    dtype: string
  - name: noisy_-5dB_transcription_whisper-medium.en
    dtype: string
  - name: noisy_-10dB_transcription_whisper-medium.en
    dtype: string
  - name: noisy_10dB_transcription_whisper-large-v3
    dtype: string
  - name: noisy_5dB_transcription_whisper-large-v3
    dtype: string
  - name: noisy_0dB_transcription_whisper-large-v3
    dtype: string
  - name: noisy_-5dB_transcription_whisper-large-v3
    dtype: string
  - name: noisy_-10dB_transcription_whisper-large-v3
    dtype: string
  - name: output
    dtype: string
  - name: clean_audio_transcription_whisper-small.en
    dtype: string
  - name: clean_audio_transcription_whisper-medium.en
    dtype: string
  - name: clean_audio_transcription_whisper-large-v3
    dtype: string
  splits:
  - name: test
    num_bytes: 6807410930.1
    num_examples: 5135
  download_size: 6676685031
  dataset_size: 6807410930.1
configs:
- config_name: default
  data_files:
  - split: test
    path: data/test-*
---

# Dataset Card for "music"

[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
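The column names in the card metadata follow a regular pattern, so they can be built from a noise level and a Whisper variant rather than typed by hand. The sketch below is illustrative only: the helper names (`audio_column`, `transcription_column`, `load_test_split`) are hypothetical and not part of the dataset, and the loader assumes the `datasets` library is installed and the repository is publicly accessible. The card does not say what the dB values measure, so the code treats them as opaque labels.

```python
from typing import Optional

DATASET_ID = "JST-SUPERB/noisy-alpaca-test-MUSAN-music"

# Noise levels (dB) and Whisper variants that have transcription columns in the card.
TRANSCRIBED_LEVELS_DB = (10, 5, 0, -5, -10)
WHISPER_MODELS = ("whisper-small.en", "whisper-medium.en", "whisper-large-v3")


def audio_column(level_db: Optional[int]) -> str:
    """Return the audio column for a noise level, or the clean column for None."""
    return "clean_audio" if level_db is None else f"noisy_{level_db}dB"


def transcription_column(level_db: Optional[int], model: str) -> str:
    """Return the transcription column for a noise level and Whisper variant."""
    if model not in WHISPER_MODELS:
        raise ValueError(f"unknown model: {model!r}")
    if level_db is None:
        return f"clean_audio_transcription_{model}"
    if level_db not in TRANSCRIBED_LEVELS_DB:
        # The card lists a noisy_-20dB audio column but no transcriptions for it.
        raise ValueError(f"no transcription column listed for {level_db} dB")
    return f"noisy_{level_db}dB_transcription_{model}"


def load_test_split(streaming: bool = True):
    """Load the only split (test); imported lazily so the helpers work without `datasets`."""
    from datasets import load_dataset  # pip install datasets

    return load_dataset(DATASET_ID, split="test", streaming=streaming)
```

Streaming is the default here because the card reports a download size of about 6.7 GB; passing `streaming=False` would download the full split first.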

Dataset information

Features:
- speech_input: string
- clean_audio: clean audio, sampling rate 16000 Hz
- noisy_10dB, noisy_5dB, noisy_0dB, noisy_-5dB, noisy_-10dB: noisy audio at 10, 5, 0, -5, and -10 dB, each with sampling rate 16000 Hz
- noisy_-20dB: noisy audio at -20 dB (no sampling rate listed)
- noisy_10dB_transcription_whisper-small.en through noisy_-10dB_transcription_whisper-small.en: string transcriptions of the 10, 5, 0, -5, and -10 dB noisy audio by Whisper-small.en
- noisy_10dB_transcription_whisper-medium.en through noisy_-10dB_transcription_whisper-medium.en: string transcriptions of the same five noisy audio columns by Whisper-medium.en
- noisy_10dB_transcription_whisper-large-v3 through noisy_-10dB_transcription_whisper-large-v3: string transcriptions of the same five noisy audio columns by Whisper-large-v3
- output: string
- clean_audio_transcription_whisper-small.en, clean_audio_transcription_whisper-medium.en, clean_audio_transcription_whisper-large-v3: string transcriptions of the clean audio by each Whisper model

Splits:
- test: 6807410930.1 bytes, 5135 examples

Download size: 6676685031 bytes
Dataset size: 6807410930.1 bytes

Configs:
- default: data files for the test split at data/test-*
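Since the dataset stores Whisper transcriptions for several noise levels, one natural use is to score them against a reference text. The following is a minimal sketch of word error rate (WER) in plain Python. It rests on an assumption the card does not confirm: that `speech_input` holds the text that was spoken and can serve as the reference. The function name `word_error_rate` is illustrative, and the normalization (lowercasing and whitespace splitting only, no punctuation handling) is deliberately simple.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,         # deletion of a reference word
                    curr[j - 1] + 1,     # insertion of a hypothesis word
                    prev[j - 1] + cost,  # substitution or exact match
                )
            )
        prev = curr
    return prev[-1] / len(ref)
```

Under the stated assumption, a row could be scored with `word_error_rate(row["speech_input"], row["noisy_0dB_transcription_whisper-small.en"])`. Because punctuation and casing conventions may differ between the reference and Whisper output, a real evaluation would likely need stronger text normalization than this sketch applies.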

Provider:

JST-SUPERB
