JST-SUPERB/noisy-alpaca-test-MUSAN-music
收藏Hugging Face2024-05-18 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/JST-SUPERB/noisy-alpaca-test-MUSAN-music
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: speech_input
dtype: string
- name: clean_audio
dtype:
audio:
sampling_rate: 16000
- name: noisy_10dB
dtype:
audio:
sampling_rate: 16000
- name: noisy_5dB
dtype:
audio:
sampling_rate: 16000
- name: noisy_0dB
dtype:
audio:
sampling_rate: 16000
- name: noisy_-5dB
dtype:
audio:
sampling_rate: 16000
- name: noisy_-10dB
dtype:
audio:
sampling_rate: 16000
- name: noisy_-20dB
dtype: audio
- name: noisy_10dB_transcription_whisper-small.en
dtype: string
- name: noisy_5dB_transcription_whisper-small.en
dtype: string
- name: noisy_0dB_transcription_whisper-small.en
dtype: string
- name: noisy_-5dB_transcription_whisper-small.en
dtype: string
- name: noisy_-10dB_transcription_whisper-small.en
dtype: string
- name: noisy_10dB_transcription_whisper-medium.en
dtype: string
- name: noisy_5dB_transcription_whisper-medium.en
dtype: string
- name: noisy_0dB_transcription_whisper-medium.en
dtype: string
- name: noisy_-5dB_transcription_whisper-medium.en
dtype: string
- name: noisy_-10dB_transcription_whisper-medium.en
dtype: string
- name: noisy_10dB_transcription_whisper-large-v3
dtype: string
- name: noisy_5dB_transcription_whisper-large-v3
dtype: string
- name: noisy_0dB_transcription_whisper-large-v3
dtype: string
- name: noisy_-5dB_transcription_whisper-large-v3
dtype: string
- name: noisy_-10dB_transcription_whisper-large-v3
dtype: string
- name: output
dtype: string
- name: clean_audio_transcription_whisper-small.en
dtype: string
- name: clean_audio_transcription_whisper-medium.en
dtype: string
- name: clean_audio_transcription_whisper-large-v3
dtype: string
splits:
- name: test
num_bytes: 6807410930.1
num_examples: 5135
download_size: 6676685031
dataset_size: 6807410930.1
configs:
- config_name: default
data_files:
- split: test
path: data/test-*
---
# Dataset Card for "music"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
数据集信息:
特征字段:
- 名称:语音输入(speech_input),数据类型:字符串
- 名称:纯净音频(clean_audio),数据类型为音频(audio),采样率(sampling_rate)为16000Hz
- 名称:10分贝带噪音频(noisy_10dB),数据类型为音频(audio),采样率(sampling_rate)为16000Hz
- 名称:5分贝带噪音频(noisy_5dB),数据类型为音频(audio),采样率(sampling_rate)为16000Hz
- 名称:0分贝带噪音频(noisy_0dB),数据类型为音频(audio),采样率(sampling_rate)为16000Hz
- 名称:负5分贝带噪音频(noisy_-5dB),数据类型为音频(audio),采样率(sampling_rate)为16000Hz
- 名称:负10分贝带噪音频(noisy_-10dB),数据类型为音频(audio),采样率(sampling_rate)为16000Hz
- 名称:负20分贝带噪音频(noisy_-20dB),数据类型为音频(audio)
- 名称:10分贝带噪音频的Whisper-small.en模型转录结果(noisy_10dB_transcription_whisper-small.en),数据类型:字符串
- 名称:5分贝带噪音频的Whisper-small.en模型转录结果(noisy_5dB_transcription_whisper-small.en),数据类型:字符串
- 名称:0分贝带噪音频的Whisper-small.en模型转录结果(noisy_0dB_transcription_whisper-small.en),数据类型:字符串
- 名称:负5分贝带噪音频的Whisper-small.en模型转录结果(noisy_-5dB_transcription_whisper-small.en),数据类型:字符串
- 名称:负10分贝带噪音频的Whisper-small.en模型转录结果(noisy_-10dB_transcription_whisper-small.en),数据类型:字符串
- 名称:10分贝带噪音频的Whisper-medium.en模型转录结果(noisy_10dB_transcription_whisper-medium.en),数据类型:字符串
- 名称:5分贝带噪音频的Whisper-medium.en模型转录结果(noisy_5dB_transcription_whisper-medium.en),数据类型:字符串
- 名称:0分贝带噪音频的Whisper-medium.en模型转录结果(noisy_0dB_transcription_whisper-medium.en),数据类型:字符串
- 名称:负5分贝带噪音频的Whisper-medium.en模型转录结果(noisy_-5dB_transcription_whisper-medium.en),数据类型:字符串
- 名称:负10分贝带噪音频的Whisper-medium.en模型转录结果(noisy_-10dB_transcription_whisper-medium.en),数据类型:字符串
- 名称:10分贝带噪音频的Whisper-large-v3模型转录结果(noisy_10dB_transcription_whisper-large-v3),数据类型:字符串
- 名称:5分贝带噪音频的Whisper-large-v3模型转录结果(noisy_5dB_transcription_whisper-large-v3),数据类型:字符串
- 名称:0分贝带噪音频的Whisper-large-v3模型转录结果(noisy_0dB_transcription_whisper-large-v3),数据类型:字符串
- 名称:负5分贝带噪音频的Whisper-large-v3模型转录结果(noisy_-5dB_transcription_whisper-large-v3),数据类型:字符串
- 名称:负10分贝带噪音频的Whisper-large-v3模型转录结果(noisy_-10dB_transcription_whisper-large-v3),数据类型:字符串
- 名称:输出(output),数据类型:字符串
- 名称:纯净音频的Whisper-small.en模型转录结果(clean_audio_transcription_whisper-small.en),数据类型:字符串
- 名称:纯净音频的Whisper-medium.en模型转录结果(clean_audio_transcription_whisper-medium.en),数据类型:字符串
- 名称:纯净音频的Whisper-large-v3模型转录结果(clean_audio_transcription_whisper-large-v3),数据类型:字符串
划分集:
- 名称:测试集(test),字节数:6807410930.1,样本数量:5135
下载大小:6676685031
数据集总大小:6807410930.1
配置项:
- 配置名称:默认(default),数据文件:
- 划分集:测试集(test),路径:data/test-*
# 「音乐(music)」数据集卡片
[需补充更多信息](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
##
提供机构:
JST-SUPERB



