asahi417/experiment-audio-tokenizer
收藏Hugging Face2024-05-30 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/asahi417/experiment-audio-tokenizer
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: ja_asr.jsut_basic5000
features:
- name: original
dtype: audio
- name: reconstructed_2codes
dtype: audio
- name: reconstructed_3codes
dtype: audio
- name: reconstructed_3codes.150chunks.120strides.crossfade
dtype: audio
- name: reconstructed_3codes.150chunks.120strides.crossfade.enhancer
dtype: audio
- name: reconstructed_3codes.150chunks.120strides.first
dtype: audio
- name: reconstructed_3codes.150chunks.120strides.first.enhancer
dtype: audio
- name: reconstructed_3codes.150chunks.120strides.last
dtype: audio
- name: reconstructed_3codes.150chunks.120strides.last.enhancer
dtype: audio
- name: reconstructed_3codes.150chunks.140strides.crossfade
dtype: audio
- name: reconstructed_3codes.150chunks.140strides.crossfade.enhancer
dtype: audio
- name: reconstructed_3codes.150chunks.140strides.first
dtype: audio
- name: reconstructed_3codes.150chunks.140strides.first.enhancer
dtype: audio
- name: reconstructed_3codes.150chunks.140strides.last
dtype: audio
- name: reconstructed_3codes.150chunks.140strides.last.enhancer
dtype: audio
- name: reconstructed_3codes.75chunks.55strides.crossfade
dtype: audio
- name: reconstructed_3codes.75chunks.55strides.crossfade.enhancer
dtype: audio
- name: reconstructed_3codes.75chunks.55strides.first
dtype: audio
- name: reconstructed_3codes.75chunks.55strides.first.enhancer
dtype: audio
- name: reconstructed_3codes.75chunks.55strides.last
dtype: audio
- name: reconstructed_3codes.75chunks.55strides.last.enhancer
dtype: audio
- name: reconstructed_3codes.75chunks.65strides.crossfade
dtype: audio
- name: reconstructed_3codes.75chunks.65strides.crossfade.enhancer
dtype: audio
- name: reconstructed_3codes.75chunks.65strides.first
dtype: audio
- name: reconstructed_3codes.75chunks.65strides.first.enhancer
dtype: audio
- name: reconstructed_3codes.75chunks.65strides.last
dtype: audio
- name: reconstructed_3codes.75chunks.65strides.last.enhancer
dtype: audio
- name: reconstructed_3codes.enhancer
dtype: audio
- name: reconstructed_4codes
dtype: audio
- name: reconstructed_5codes
dtype: audio
- name: reconstructed_6codes
dtype: audio
splits:
- name: test
num_bytes: 210150174.0
num_examples: 16
download_size: 181357972
dataset_size: 210150174.0
- config_name: ja_asr.reazonspeech_test
features:
- name: original
dtype: audio
- name: reconstructed_2codes
dtype: audio
- name: reconstructed_3codes
dtype: audio
- name: reconstructed_3codes.150chunks.120strides.crossfade
dtype: audio
- name: reconstructed_3codes.150chunks.120strides.crossfade.enhancer
dtype: audio
- name: reconstructed_3codes.150chunks.120strides.first
dtype: audio
- name: reconstructed_3codes.150chunks.120strides.first.enhancer
dtype: audio
- name: reconstructed_3codes.150chunks.120strides.last
dtype: audio
- name: reconstructed_3codes.150chunks.120strides.last.enhancer
dtype: audio
- name: reconstructed_3codes.150chunks.140strides.crossfade
dtype: audio
- name: reconstructed_3codes.150chunks.140strides.crossfade.enhancer
dtype: audio
- name: reconstructed_3codes.150chunks.140strides.first
dtype: audio
- name: reconstructed_3codes.150chunks.140strides.first.enhancer
dtype: audio
- name: reconstructed_3codes.150chunks.140strides.last
dtype: audio
- name: reconstructed_3codes.150chunks.140strides.last.enhancer
dtype: audio
- name: reconstructed_3codes.75chunks.55strides.crossfade
dtype: audio
- name: reconstructed_3codes.75chunks.55strides.crossfade.enhancer
dtype: audio
- name: reconstructed_3codes.75chunks.55strides.first
dtype: audio
- name: reconstructed_3codes.75chunks.55strides.first.enhancer
dtype: audio
- name: reconstructed_3codes.75chunks.55strides.last
dtype: audio
- name: reconstructed_3codes.75chunks.55strides.last.enhancer
dtype: audio
- name: reconstructed_3codes.75chunks.65strides.crossfade
dtype: audio
- name: reconstructed_3codes.75chunks.65strides.crossfade.enhancer
dtype: audio
- name: reconstructed_3codes.75chunks.65strides.first
dtype: audio
- name: reconstructed_3codes.75chunks.65strides.first.enhancer
dtype: audio
- name: reconstructed_3codes.75chunks.65strides.last
dtype: audio
- name: reconstructed_3codes.75chunks.65strides.last.enhancer
dtype: audio
- name: reconstructed_3codes.enhancer
dtype: audio
- name: reconstructed_4codes
dtype: audio
- name: reconstructed_5codes
dtype: audio
- name: reconstructed_6codes
dtype: audio
splits:
- name: test
num_bytes: 361331646.0
num_examples: 16
download_size: 274007178
dataset_size: 361331646.0
- config_name: sample_audio
features:
- name: reconstructed_2codes
dtype: audio
- name: reconstructed_3codes
dtype: audio
- name: reconstructed_3codes.enhancer
dtype: audio
- name: reconstructed_4codes
dtype: audio
- name: reconstructed_5codes
dtype: audio
- name: reconstructed_6codes
dtype: audio
splits:
- name: test
num_bytes: 4472522.0
num_examples: 3
download_size: 4344170
dataset_size: 4472522.0
configs:
- config_name: ja_asr.jsut_basic5000
data_files:
- split: test
path: ja_asr.jsut_basic5000/test-*
- config_name: ja_asr.reazonspeech_test
data_files:
- split: test
path: ja_asr.reazonspeech_test/test-*
- config_name: sample_audio
data_files:
- split: test
path: sample_audio/test-*
---
提供机构:
asahi417
原始信息汇总
数据集概述
数据集配置
配置名称:ja_asr.jsut_basic5000
- 特征:
original: 音频reconstructed_2codes: 音频reconstructed_3codes: 音频reconstructed_3codes.150chunks.120strides.crossfade: 音频reconstructed_3codes.150chunks.120strides.crossfade.enhancer: 音频reconstructed_3codes.150chunks.120strides.first: 音频reconstructed_3codes.150chunks.120strides.first.enhancer: 音频reconstructed_3codes.150chunks.120strides.last: 音频reconstructed_3codes.150chunks.120strides.last.enhancer: 音频reconstructed_3codes.150chunks.140strides.crossfade: 音频reconstructed_3codes.150chunks.140strides.crossfade.enhancer: 音频reconstructed_3codes.150chunks.140strides.first: 音频reconstructed_3codes.150chunks.140strides.first.enhancer: 音频reconstructed_3codes.150chunks.140strides.last: 音频reconstructed_3codes.150chunks.140strides.last.enhancer: 音频reconstructed_3codes.75chunks.55strides.crossfade: 音频reconstructed_3codes.75chunks.55strides.crossfade.enhancer: 音频reconstructed_3codes.75chunks.55strides.first: 音频reconstructed_3codes.75chunks.55strides.first.enhancer: 音频reconstructed_3codes.75chunks.55strides.last: 音频reconstructed_3codes.75chunks.55strides.last.enhancer: 音频reconstructed_3codes.75chunks.65strides.crossfade: 音频reconstructed_3codes.75chunks.65strides.crossfade.enhancer: 音频reconstructed_3codes.75chunks.65strides.first: 音频reconstructed_3codes.75chunks.65strides.first.enhancer: 音频reconstructed_3codes.75chunks.65strides.last: 音频reconstructed_3codes.75chunks.65strides.last.enhancer: 音频reconstructed_3codes.enhancer: 音频reconstructed_4codes: 音频reconstructed_5codes: 音频reconstructed_6codes: 音频
- 分割:
test: 16个样本,210150174.0字节
- 下载大小: 181357972字节
- 数据集大小: 210150174.0字节
配置名称:ja_asr.reazonspeech_test
- 特征:
original: 音频reconstructed_2codes: 音频reconstructed_3codes: 音频reconstructed_3codes.150chunks.120strides.crossfade: 音频reconstructed_3codes.150chunks.120strides.crossfade.enhancer: 音频reconstructed_3codes.150chunks.120strides.first: 音频reconstructed_3codes.150chunks.120strides.first.enhancer: 音频reconstructed_3codes.150chunks.120strides.last: 音频reconstructed_3codes.150chunks.120strides.last.enhancer: 音频reconstructed_3codes.150chunks.140strides.crossfade: 音频reconstructed_3codes.150chunks.140strides.crossfade.enhancer: 音频reconstructed_3codes.150chunks.140strides.first: 音频reconstructed_3codes.150chunks.140strides.first.enhancer: 音频reconstructed_3codes.150chunks.140strides.last: 音频reconstructed_3codes.150chunks.140strides.last.enhancer: 音频reconstructed_3codes.75chunks.55strides.crossfade: 音频reconstructed_3codes.75chunks.55strides.crossfade.enhancer: 音频reconstructed_3codes.75chunks.55strides.first: 音频reconstructed_3codes.75chunks.55strides.first.enhancer: 音频reconstructed_3codes.75chunks.55strides.last: 音频reconstructed_3codes.75chunks.55strides.last.enhancer: 音频reconstructed_3codes.75chunks.65strides.crossfade: 音频reconstructed_3codes.75chunks.65strides.crossfade.enhancer: 音频reconstructed_3codes.75chunks.65strides.first: 音频reconstructed_3codes.75chunks.65strides.first.enhancer: 音频reconstructed_3codes.75chunks.65strides.last: 音频reconstructed_3codes.75chunks.65strides.last.enhancer: 音频reconstructed_3codes.enhancer: 音频reconstructed_4codes: 音频reconstructed_5codes: 音频reconstructed_6codes: 音频
- 分割:
test: 16个样本,361331646.0字节
- 下载大小: 274007178字节
- 数据集大小: 361331646.0字节
配置名称:sample_audio
- 特征:
reconstructed_2codes: 音频reconstructed_3codes: 音频reconstructed_3codes.enhancer: 音频reconstructed_4codes: 音频reconstructed_5codes: 音频reconstructed_6codes: 音频
- 分割:
test: 3个样本,4472522.0字节
- 下载大小: 4344170字节
- 数据集大小: 4472522.0字节
数据文件路径
- ja_asr.jsut_basic5000:
test:ja_asr.jsut_basic5000/test-*
- ja_asr.reazonspeech_test:
test:ja_asr.reazonspeech_test/test-*
- sample_audio:
test:sample_audio/test-*



