CodecSR/librispeech_asr_test_24k_synth
Source: Hugging Face. Updated 2024-03-24. Indexed 2024-06-11.
Download link:
https://hf-mirror.com/datasets/CodecSR/librispeech_asr_test_24k_synth
Resource description:
---
dataset_info:
  features:
  - name: audio
    dtype:
      audio:
        sampling_rate: 24000
  - name: text
    dtype: string
  - name: id
    dtype: string
  splits:
  - name: original
    num_bytes: 1238771045.0
    num_examples: 5559
  - name: academicodec_hifi_16k_320d_large_uni
    num_bytes: 1856162446.125
    num_examples: 5559
  - name: academicodec_hifi_24k_320d
    num_bytes: 1856162446.125
    num_examples: 5559
  - name: audiodec_24k_300d
    num_bytes: 1859036285.125
    num_examples: 5559
  - name: audiodec_48k_300d_uni
    num_bytes: 1859036285.125
    num_examples: 5559
  - name: dac_16k
    num_bytes: 1857688815.125
    num_examples: 5559
  - name: dac_24k
    num_bytes: 1857688810.566
    num_examples: 5559
  - name: dac_44k
    num_bytes: 1857688815.125
    num_examples: 5559
  - name: encodec_24k_12bps
    num_bytes: 1857688815.125
    num_examples: 5559
  - name: encodec_24k_1_5bps
    num_bytes: 1857688816.125
    num_examples: 5559
  - name: encodec_24k_24bps
    num_bytes: 1857688810.566
    num_examples: 5559
  - name: encodec_24k_3bps
    num_bytes: 1857688816.125
    num_examples: 5559
  - name: encodec_24k_6bps
    num_bytes: 1857688815.125
    num_examples: 5559
  - name: facodec_16k
    num_bytes: 1857256285.125
    num_examples: 5559
  - name: funcodec_en_libritts_16k_nq32ds320
    num_bytes: 1857688810.566
    num_examples: 5559
  - name: funcodec_en_libritts_16k_nq32ds640
    num_bytes: 1857688815.125
    num_examples: 5559
  - name: funcodec_zh_en_16k_nq32ds320
    num_bytes: 1857688810.566
    num_examples: 5559
  - name: funcodec_zh_en_16k_nq32ds640
    num_bytes: 1857688815.125
    num_examples: 5559
  - name: language_codec_chinese_24k_nq8_12kbps
    num_bytes: 1859217805.125
    num_examples: 5559
  - name: language_codec_paper_24k_nq8_12kbps
    num_bytes: 1859217804.007
    num_examples: 5559
  - name: speech_tokenizer_16k
    num_bytes: 1859217804.007
    num_examples: 5559
  download_size: 37183397106
  dataset_size: 38396343971.028015
configs:
- config_name: default
  data_files:
  - split: original
    path: data/original-*
  - split: academicodec_hifi_16k_320d_large_uni
    path: data/academicodec_hifi_16k_320d_large_uni-*
  - split: academicodec_hifi_24k_320d
    path: data/academicodec_hifi_24k_320d-*
  - split: audiodec_24k_300d
    path: data/audiodec_24k_300d-*
  - split: audiodec_48k_300d_uni
    path: data/audiodec_48k_300d_uni-*
  - split: dac_16k
    path: data/dac_16k-*
  - split: dac_24k
    path: data/dac_24k-*
  - split: dac_44k
    path: data/dac_44k-*
  - split: encodec_24k_12bps
    path: data/encodec_24k_12bps-*
  - split: encodec_24k_1_5bps
    path: data/encodec_24k_1_5bps-*
  - split: encodec_24k_24bps
    path: data/encodec_24k_24bps-*
  - split: encodec_24k_3bps
    path: data/encodec_24k_3bps-*
  - split: encodec_24k_6bps
    path: data/encodec_24k_6bps-*
  - split: facodec_16k
    path: data/facodec_16k-*
  - split: funcodec_en_libritts_16k_nq32ds320
    path: data/funcodec_en_libritts_16k_nq32ds320-*
  - split: funcodec_en_libritts_16k_nq32ds640
    path: data/funcodec_en_libritts_16k_nq32ds640-*
  - split: funcodec_zh_en_16k_nq32ds320
    path: data/funcodec_zh_en_16k_nq32ds320-*
  - split: funcodec_zh_en_16k_nq32ds640
    path: data/funcodec_zh_en_16k_nq32ds640-*
  - split: language_codec_chinese_24k_nq8_12kbps
    path: data/language_codec_chinese_24k_nq8_12kbps-*
  - split: language_codec_paper_24k_nq8_12kbps
    path: data/language_codec_paper_24k_nq8_12kbps-*
  - split: speech_tokenizer_16k
    path: data/speech_tokenizer_16k-*
---
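The metadata above describes a single `default` config whose splits are named after codecs, so one split can be requested at a time. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and the repository is reachable; the helper `load_codec_split` and the constants `REPO_ID` and `SPLITS` are hypothetical names introduced for illustration only.

```python
from typing import Any

# Repository ID taken from the page title.
REPO_ID = "CodecSR/librispeech_asr_test_24k_synth"

# Split names copied from the metadata above.
SPLITS = (
    "original",
    "academicodec_hifi_16k_320d_large_uni",
    "academicodec_hifi_24k_320d",
    "audiodec_24k_300d",
    "audiodec_48k_300d_uni",
    "dac_16k",
    "dac_24k",
    "dac_44k",
    "encodec_24k_12bps",
    "encodec_24k_1_5bps",
    "encodec_24k_24bps",
    "encodec_24k_3bps",
    "encodec_24k_6bps",
    "facodec_16k",
    "funcodec_en_libritts_16k_nq32ds320",
    "funcodec_en_libritts_16k_nq32ds640",
    "funcodec_zh_en_16k_nq32ds320",
    "funcodec_zh_en_16k_nq32ds640",
    "language_codec_chinese_24k_nq8_12kbps",
    "language_codec_paper_24k_nq8_12kbps",
    "speech_tokenizer_16k",
)


def load_codec_split(split: str, streaming: bool = True) -> Any:
    """Load one split by name (hypothetical helper, not shipped with the dataset)."""
    if split not in SPLITS:
        raise ValueError(f"unknown split: {split!r}")
    # Imported lazily so the name check works even without `datasets` installed.
    from datasets import load_dataset

    # streaming=True avoids downloading a full split before reading any example.
    return load_dataset(REPO_ID, split=split, streaming=streaming)
```

Calling `load_codec_split("dac_24k")` would then return an iterable over that split's examples, each carrying the `audio`, `text`, and `id` fields listed in the metadata. Streaming is the default in this sketch because each split is close to 2 GB.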
Provider:
CodecSR
Summary of original information
Dataset overview
Features
- Audio (audio)
  - Sampling rate: 24000 Hz
- Text (text)
  - Data type: string
- ID (id)
  - Data type: string
Splits
- original
  - Bytes: 1238771045.0
  - Examples: 5559
- academicodec_hifi_16k_320d_large_uni
  - Bytes: 1856162446.125
  - Examples: 5559
- academicodec_hifi_24k_320d
  - Bytes: 1856162446.125
  - Examples: 5559
- audiodec_24k_300d
  - Bytes: 1859036285.125
  - Examples: 5559
- audiodec_48k_300d_uni
  - Bytes: 1859036285.125
  - Examples: 5559
- dac_16k
  - Bytes: 1857688815.125
  - Examples: 5559
- dac_24k
  - Bytes: 1857688810.566
  - Examples: 5559
- dac_44k
  - Bytes: 1857688815.125
  - Examples: 5559
- encodec_24k_12bps
  - Bytes: 1857688815.125
  - Examples: 5559
- encodec_24k_1_5bps
  - Bytes: 1857688816.125
  - Examples: 5559
- encodec_24k_24bps
  - Bytes: 1857688810.566
  - Examples: 5559
- encodec_24k_3bps
  - Bytes: 1857688816.125
  - Examples: 5559
- encodec_24k_6bps
  - Bytes: 1857688815.125
  - Examples: 5559
- facodec_16k
  - Bytes: 1857256285.125
  - Examples: 5559
- funcodec_en_libritts_16k_nq32ds320
  - Bytes: 1857688810.566
  - Examples: 5559
- funcodec_en_libritts_16k_nq32ds640
  - Bytes: 1857688815.125
  - Examples: 5559
- funcodec_zh_en_16k_nq32ds320
  - Bytes: 1857688810.566
  - Examples: 5559
- funcodec_zh_en_16k_nq32ds640
  - Bytes: 1857688815.125
  - Examples: 5559
- language_codec_chinese_24k_nq8_12kbps
  - Bytes: 1859217805.125
  - Examples: 5559
- language_codec_paper_24k_nq8_12kbps
  - Bytes: 1859217804.007
  - Examples: 5559
- speech_tokenizer_16k
  - Bytes: 1859217804.007
  - Examples: 5559
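Every split name except `original` contains a `<N>k` token (for example `16k`, `24k`, `44k`). The sketch below groups splits by the prefix before that token. It rests on an assumption the page does not confirm: that the token denotes the codec model's nominal sample rate in kHz, while the stored audio is 24000 Hz throughout, as the feature list states. `parse_split_name` is a hypothetical helper.

```python
import re
from typing import Optional, Tuple

# Matches "_<digits>k" only when followed by "_" or the end of the name,
# so "_24k" matches but the "_12k" inside "_12kbps" does not.
_RATE_TOKEN = re.compile(r"_(\d+)k(?=_|$)")


def parse_split_name(name: str) -> Tuple[str, Optional[int]]:
    """Return (prefix, rate_khz); rate_khz is None when no rate token exists.

    Assumption: the first "<N>k" token is the codec's nominal rate in kHz.
    """
    match = _RATE_TOKEN.search(name)
    if match is None:
        return name, None
    return name[: match.start()], int(match.group(1))


family, rate_khz = parse_split_name("language_codec_chinese_24k_nq8_12kbps")
```

Names without a rate token, such as `original`, come back unchanged with `None` as the rate, which makes the uncompressed reference split easy to separate from the codec outputs.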
Dataset size
- Download size: 37183397106 bytes
- Dataset size: 38396343971.028015 bytes
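The reported dataset size can be cross-checked against the per-split byte counts listed earlier. The following is a small sketch of that arithmetic; the numbers are copied from the metadata, and the variable names are illustrative only.

```python
import math

# Per-split byte counts copied from the split list.
NUM_BYTES = {
    "original": 1238771045.0,
    "academicodec_hifi_16k_320d_large_uni": 1856162446.125,
    "academicodec_hifi_24k_320d": 1856162446.125,
    "audiodec_24k_300d": 1859036285.125,
    "audiodec_48k_300d_uni": 1859036285.125,
    "dac_16k": 1857688815.125,
    "dac_24k": 1857688810.566,
    "dac_44k": 1857688815.125,
    "encodec_24k_12bps": 1857688815.125,
    "encodec_24k_1_5bps": 1857688816.125,
    "encodec_24k_24bps": 1857688810.566,
    "encodec_24k_3bps": 1857688816.125,
    "encodec_24k_6bps": 1857688815.125,
    "facodec_16k": 1857256285.125,
    "funcodec_en_libritts_16k_nq32ds320": 1857688810.566,
    "funcodec_en_libritts_16k_nq32ds640": 1857688815.125,
    "funcodec_zh_en_16k_nq32ds320": 1857688810.566,
    "funcodec_zh_en_16k_nq32ds640": 1857688815.125,
    "language_codec_chinese_24k_nq8_12kbps": 1859217805.125,
    "language_codec_paper_24k_nq8_12kbps": 1859217804.007,
    "speech_tokenizer_16k": 1859217804.007,
}
DATASET_SIZE = 38396343971.028015  # reported dataset_size
DOWNLOAD_SIZE = 37183397106        # reported download_size
EXAMPLES_PER_SPLIT = 5559          # identical for every split

# fsum limits accumulated floating-point error across the 21 terms.
total_bytes = math.fsum(NUM_BYTES.values())
total_examples = EXAMPLES_PER_SPLIT * len(NUM_BYTES)
sizes_agree = math.isclose(total_bytes, DATASET_SIZE, abs_tol=1e-2)

print(f"splits: {len(NUM_BYTES)}, examples: {total_examples}")
print(f"sum of split bytes matches dataset_size: {sizes_agree}")
print(f"download / dataset size ratio: {DOWNLOAD_SIZE / DATASET_SIZE:.3f}")
```

The download size is smaller than the dataset size, which is what one would expect if the hosted files are stored in a compressed form; the page itself does not state the storage format.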
Configurations
- default
  - Data file paths:
    - original: data/original-*
    - academicodec_hifi_16k_320d_large_uni: data/academicodec_hifi_16k_320d_large_uni-*
    - academicodec_hifi_24k_320d: data/academicodec_hifi_24k_320d-*
    - audiodec_24k_300d: data/audiodec_24k_300d-*
    - audiodec_48k_300d_uni: data/audiodec_48k_300d_uni-*
    - dac_16k: data/dac_16k-*
    - dac_24k: data/dac_24k-*
    - dac_44k: data/dac_44k-*
    - encodec_24k_12bps: data/encodec_24k_12bps-*
    - encodec_24k_1_5bps: data/encodec_24k_1_5bps-*
    - encodec_24k_24bps: data/encodec_24k_24bps-*
    - encodec_24k_3bps: data/encodec_24k_3bps-*
    - encodec_24k_6bps: data/encodec_24k_6bps-*
    - facodec_16k: data/facodec_16k-*
    - funcodec_en_libritts_16k_nq32ds320: data/funcodec_en_libritts_16k_nq32ds320-*
    - funcodec_en_libritts_16k_nq32ds640: data/funcodec_en_libritts_16k_nq32ds640-*
    - funcodec_zh_en_16k_nq32ds320: data/funcodec_zh_en_16k_nq32ds320-*
    - funcodec_zh_en_16k_nq32ds640: data/funcodec_zh_en_16k_nq32ds640-*
    - language_codec_chinese_24k_nq8_12kbps: data/language_codec_chinese_24k_nq8_12kbps-*
    - language_codec_paper_24k_nq8_12kbps: data/language_codec_paper_24k_nq8_12kbps-*
    - speech_tokenizer_16k: data/speech_tokenizer_16k-*
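Each split is bound to its files by a glob pattern of the form `data/<split>-*`. The sketch below illustrates how such patterns assign a file to a split, using Python's standard `fnmatch` module as a stand-in for whatever pattern resolution the hosting library actually performs. The shard file names in the usage lines are hypothetical; the page does not list the real file names. Only a subset of the patterns is included to keep the example short.

```python
from fnmatch import fnmatchcase
from typing import Optional

# A subset of the split-to-pattern mapping from the default config.
SPLIT_PATTERNS = {
    "original": "data/original-*",
    "dac_16k": "data/dac_16k-*",
    "dac_24k": "data/dac_24k-*",
    "dac_44k": "data/dac_44k-*",
    "encodec_24k_12bps": "data/encodec_24k_12bps-*",
    "encodec_24k_1_5bps": "data/encodec_24k_1_5bps-*",
    "speech_tokenizer_16k": "data/speech_tokenizer_16k-*",
}


def split_for_path(path: str) -> Optional[str]:
    """Return the split whose pattern matches `path`, or None if none does."""
    for split, pattern in SPLIT_PATTERNS.items():
        # fnmatchcase is case-sensitive on every platform.
        if fnmatchcase(path, pattern):
            return split
    return None


# Hypothetical shard names, used only to exercise the patterns.
print(split_for_path("data/dac_24k-00000-of-00004.parquet"))
print(split_for_path("data/encodec_24k_12bps-00001-of-00004.parquet"))
print(split_for_path("data/unlisted-00000-of-00001.parquet"))
```

Because every pattern ends the split name with a literal `-` before the wildcard, a name such as `dac_24k` cannot accidentally capture files belonging to a longer split name that merely starts with the same characters.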