five

asahi417/experiment-audio-tokenizer

收藏
Hugging Face2024-05-30 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/asahi417/experiment-audio-tokenizer
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: ja_asr.jsut_basic5000 features: - name: original dtype: audio - name: reconstructed_2codes dtype: audio - name: reconstructed_3codes dtype: audio - name: reconstructed_3codes.150chunks.120strides.crossfade dtype: audio - name: reconstructed_3codes.150chunks.120strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.150chunks.120strides.first dtype: audio - name: reconstructed_3codes.150chunks.120strides.first.enhancer dtype: audio - name: reconstructed_3codes.150chunks.120strides.last dtype: audio - name: reconstructed_3codes.150chunks.120strides.last.enhancer dtype: audio - name: reconstructed_3codes.150chunks.140strides.crossfade dtype: audio - name: reconstructed_3codes.150chunks.140strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.150chunks.140strides.first dtype: audio - name: reconstructed_3codes.150chunks.140strides.first.enhancer dtype: audio - name: reconstructed_3codes.150chunks.140strides.last dtype: audio - name: reconstructed_3codes.150chunks.140strides.last.enhancer dtype: audio - name: reconstructed_3codes.75chunks.55strides.crossfade dtype: audio - name: reconstructed_3codes.75chunks.55strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.75chunks.55strides.first dtype: audio - name: reconstructed_3codes.75chunks.55strides.first.enhancer dtype: audio - name: reconstructed_3codes.75chunks.55strides.last dtype: audio - name: reconstructed_3codes.75chunks.55strides.last.enhancer dtype: audio - name: reconstructed_3codes.75chunks.65strides.crossfade dtype: audio - name: reconstructed_3codes.75chunks.65strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.75chunks.65strides.first dtype: audio - name: reconstructed_3codes.75chunks.65strides.first.enhancer dtype: audio - name: reconstructed_3codes.75chunks.65strides.last dtype: audio - name: reconstructed_3codes.75chunks.65strides.last.enhancer dtype: audio - name: reconstructed_3codes.enhancer dtype: audio - name: reconstructed_4codes dtype: audio - name: reconstructed_5codes dtype: audio - name: reconstructed_6codes dtype: audio splits: - name: test num_bytes: 210150174.0 num_examples: 16 download_size: 181357972 dataset_size: 210150174.0 - config_name: ja_asr.reazonspeech_test features: - name: original dtype: audio - name: reconstructed_2codes dtype: audio - name: reconstructed_3codes dtype: audio - name: reconstructed_3codes.150chunks.120strides.crossfade dtype: audio - name: reconstructed_3codes.150chunks.120strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.150chunks.120strides.first dtype: audio - name: reconstructed_3codes.150chunks.120strides.first.enhancer dtype: audio - name: reconstructed_3codes.150chunks.120strides.last dtype: audio - name: reconstructed_3codes.150chunks.120strides.last.enhancer dtype: audio - name: reconstructed_3codes.150chunks.140strides.crossfade dtype: audio - name: reconstructed_3codes.150chunks.140strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.150chunks.140strides.first dtype: audio - name: reconstructed_3codes.150chunks.140strides.first.enhancer dtype: audio - name: reconstructed_3codes.150chunks.140strides.last dtype: audio - name: reconstructed_3codes.150chunks.140strides.last.enhancer dtype: audio - name: reconstructed_3codes.75chunks.55strides.crossfade dtype: audio - name: reconstructed_3codes.75chunks.55strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.75chunks.55strides.first dtype: audio - name: reconstructed_3codes.75chunks.55strides.first.enhancer dtype: audio - name: reconstructed_3codes.75chunks.55strides.last dtype: audio - name: reconstructed_3codes.75chunks.55strides.last.enhancer dtype: audio - name: reconstructed_3codes.75chunks.65strides.crossfade dtype: audio - name: reconstructed_3codes.75chunks.65strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.75chunks.65strides.first dtype: audio - name: reconstructed_3codes.75chunks.65strides.first.enhancer dtype: audio - name: reconstructed_3codes.75chunks.65strides.last dtype: audio - name: reconstructed_3codes.75chunks.65strides.last.enhancer dtype: audio - name: reconstructed_3codes.enhancer dtype: audio - name: reconstructed_4codes dtype: audio - name: reconstructed_5codes dtype: audio - name: reconstructed_6codes dtype: audio splits: - name: test num_bytes: 361331646.0 num_examples: 16 download_size: 274007178 dataset_size: 361331646.0 - config_name: sample_audio features: - name: reconstructed_2codes dtype: audio - name: reconstructed_3codes dtype: audio - name: reconstructed_3codes.enhancer dtype: audio - name: reconstructed_4codes dtype: audio - name: reconstructed_5codes dtype: audio - name: reconstructed_6codes dtype: audio splits: - name: test num_bytes: 4472522.0 num_examples: 3 download_size: 4344170 dataset_size: 4472522.0 configs: - config_name: ja_asr.jsut_basic5000 data_files: - split: test path: ja_asr.jsut_basic5000/test-* - config_name: ja_asr.reazonspeech_test data_files: - split: test path: ja_asr.reazonspeech_test/test-* - config_name: sample_audio data_files: - split: test path: sample_audio/test-* ---
提供机构:
asahi417
原始信息汇总

数据集概述

数据集配置

配置名称:ja_asr.jsut_basic5000

  • 特征:
    • original: 音频
    • reconstructed_2codes: 音频
    • reconstructed_3codes: 音频
    • reconstructed_3codes.150chunks.120strides.crossfade: 音频
    • reconstructed_3codes.150chunks.120strides.crossfade.enhancer: 音频
    • reconstructed_3codes.150chunks.120strides.first: 音频
    • reconstructed_3codes.150chunks.120strides.first.enhancer: 音频
    • reconstructed_3codes.150chunks.120strides.last: 音频
    • reconstructed_3codes.150chunks.120strides.last.enhancer: 音频
    • reconstructed_3codes.150chunks.140strides.crossfade: 音频
    • reconstructed_3codes.150chunks.140strides.crossfade.enhancer: 音频
    • reconstructed_3codes.150chunks.140strides.first: 音频
    • reconstructed_3codes.150chunks.140strides.first.enhancer: 音频
    • reconstructed_3codes.150chunks.140strides.last: 音频
    • reconstructed_3codes.150chunks.140strides.last.enhancer: 音频
    • reconstructed_3codes.75chunks.55strides.crossfade: 音频
    • reconstructed_3codes.75chunks.55strides.crossfade.enhancer: 音频
    • reconstructed_3codes.75chunks.55strides.first: 音频
    • reconstructed_3codes.75chunks.55strides.first.enhancer: 音频
    • reconstructed_3codes.75chunks.55strides.last: 音频
    • reconstructed_3codes.75chunks.55strides.last.enhancer: 音频
    • reconstructed_3codes.75chunks.65strides.crossfade: 音频
    • reconstructed_3codes.75chunks.65strides.crossfade.enhancer: 音频
    • reconstructed_3codes.75chunks.65strides.first: 音频
    • reconstructed_3codes.75chunks.65strides.first.enhancer: 音频
    • reconstructed_3codes.75chunks.65strides.last: 音频
    • reconstructed_3codes.75chunks.65strides.last.enhancer: 音频
    • reconstructed_3codes.enhancer: 音频
    • reconstructed_4codes: 音频
    • reconstructed_5codes: 音频
    • reconstructed_6codes: 音频
  • 分割:
    • test: 16个样本,210150174.0字节
  • 下载大小: 181357972字节
  • 数据集大小: 210150174.0字节

配置名称:ja_asr.reazonspeech_test

  • 特征:
    • original: 音频
    • reconstructed_2codes: 音频
    • reconstructed_3codes: 音频
    • reconstructed_3codes.150chunks.120strides.crossfade: 音频
    • reconstructed_3codes.150chunks.120strides.crossfade.enhancer: 音频
    • reconstructed_3codes.150chunks.120strides.first: 音频
    • reconstructed_3codes.150chunks.120strides.first.enhancer: 音频
    • reconstructed_3codes.150chunks.120strides.last: 音频
    • reconstructed_3codes.150chunks.120strides.last.enhancer: 音频
    • reconstructed_3codes.150chunks.140strides.crossfade: 音频
    • reconstructed_3codes.150chunks.140strides.crossfade.enhancer: 音频
    • reconstructed_3codes.150chunks.140strides.first: 音频
    • reconstructed_3codes.150chunks.140strides.first.enhancer: 音频
    • reconstructed_3codes.150chunks.140strides.last: 音频
    • reconstructed_3codes.150chunks.140strides.last.enhancer: 音频
    • reconstructed_3codes.75chunks.55strides.crossfade: 音频
    • reconstructed_3codes.75chunks.55strides.crossfade.enhancer: 音频
    • reconstructed_3codes.75chunks.55strides.first: 音频
    • reconstructed_3codes.75chunks.55strides.first.enhancer: 音频
    • reconstructed_3codes.75chunks.55strides.last: 音频
    • reconstructed_3codes.75chunks.55strides.last.enhancer: 音频
    • reconstructed_3codes.75chunks.65strides.crossfade: 音频
    • reconstructed_3codes.75chunks.65strides.crossfade.enhancer: 音频
    • reconstructed_3codes.75chunks.65strides.first: 音频
    • reconstructed_3codes.75chunks.65strides.first.enhancer: 音频
    • reconstructed_3codes.75chunks.65strides.last: 音频
    • reconstructed_3codes.75chunks.65strides.last.enhancer: 音频
    • reconstructed_3codes.enhancer: 音频
    • reconstructed_4codes: 音频
    • reconstructed_5codes: 音频
    • reconstructed_6codes: 音频
  • 分割:
    • test: 16个样本,361331646.0字节
  • 下载大小: 274007178字节
  • 数据集大小: 361331646.0字节

配置名称:sample_audio

  • 特征:
    • reconstructed_2codes: 音频
    • reconstructed_3codes: 音频
    • reconstructed_3codes.enhancer: 音频
    • reconstructed_4codes: 音频
    • reconstructed_5codes: 音频
    • reconstructed_6codes: 音频
  • 分割:
    • test: 3个样本,4472522.0字节
  • 下载大小: 4344170字节
  • 数据集大小: 4472522.0字节

数据文件路径

  • ja_asr.jsut_basic5000:
    • test: ja_asr.jsut_basic5000/test-*
  • ja_asr.reazonspeech_test:
    • test: ja_asr.reazonspeech_test/test-*
  • sample_audio:
    • test: sample_audio/test-*
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作